<html>
<body>
I think the honest answer is that it is a question with no clear
answer.<br><br>
I know that legal concerns have prevented US government funded projects
such as TREC
(<a href="http://trec.nist.gov/" eudora="autourl">http://trec.nist.gov</a>)
from building Web collections and they have got other organisations to
build and distribute such collections. I also know that Web search
engines have been ordered to take off image and sound collections from
their Web sites, but I don't think this has happened with HTML. Maybe
text is viewed as being generally less valuable than other media
types.<br><br>
<br><br>
At 09:49 13/06/2003 -0300, delucca@nilc.icmc.usp.br wrote:<br><br>
<blockquote type=cite class=cite cite>Dear Linguists and
Lawyers,<br><br>
I am troubled with Legal aspects of corpora compiling. I am in <br>
doubt if is an illegal procedure storage webpages (or part of them)
<br>
in a database (see at http://www.dictionarium.com/project.htm), <br>
not available to public, and display its contents as short
collocations<br>
less than 100 characters by time by search method. <br><br>
On the other hand, the Internet search engines uses cached (temporary ?)
<br>
copies of the sites and display a short of the web pages.<br><br>
My procedure is wrong? Which the Legal difference? I need ask
permission<br>
for each website to storage its pages? If I mention the source and the
author<br>
I will be protecting the copyrights? <br>
<br><br>
I look forward to hearing from you.<br><br>
<br>
Yours Sincerely,<br><br>
<br>
J. L. De Lucca<br><br>
-------------------------------------------------<br>
This mail sent through IMP:
<a href="http://horde.org/imp/" eudora="autourl">http://horde.org/imp/</a></blockquote>
<x-sigsep><p></x-sigsep>
<font face="Courier, Courier">_________________________________________________________________________<br>
Mark Sanderson, Room
303
Tel: +44 (0) 114 22 22648<br>
Department of Information
Studies Fax: +44
(0) 114 27 80300<br>
University of Sheffield, Regent Court,
<a href="mailto:m.sanderson@shef.ac.uk" eudora="autourl">mailto:m.sanderson@shef.ac.uk</a><br>
211 Portobello St., Sheffield, S1 4DP, UK
<a href="http://dis.shef.ac.uk/mark/" eudora="autourl">http://dis.shef.ac.uk/mark/</a><br>
_________________________________________________________________________<br>
Good judgement comes from experience, experience comes from bad
judgement<br>
</font></body>
</html>