[Corpora-List] Legal aspects of compiling corpora

Adam Kilgarriff Adam.Kilgarriff at itri.brighton.ac.uk
Fri Jun 13 13:36:14 UTC 2003


On the one hand, if your enemies are rich enough you'll lose.

On the other you're probably less worth sueing than Google and they are
still going strong (anyone out there from Google?  Your contribution
most welcome), and it doesn't sound like you are doing anything with any
salient legal difference.   (Getting authors' agreements takes huge
amounts of resources and isn't feasible; listing references doesn't
help.)

People do get unhappy about their pictures and audio being grabbed from
the web for use in other people's databases, and I have heard of cases
of web developers having to rein in their ambitions because objections
have been made.  As yet, mercifully, that hasn't happened with text -
people don't seem alarmed at the idea that the text they publish on the
web gets re-used.  Let's all pray it stays that way (though sooner or
later we're bound to get chancers trying it on - can't help fearing the
web is in its honeymoon phase, and the racketeers will mess it all up
before too long).

In the meantime - take courage! Do it!


	Adam


=======================
Adam Kilgarriff
Lexicography MasterClass Ltd:   http://www.lexmasterclass.com
adam at lexmasterclass.com
+44 (0)1273 705773
     --and--
ITRI, University of Brighton
Lewes Road, Brighton BN2 0BL, UK
http://www.itri.brighton.ac.uk/~Adam.Kilgarriff
adam at itri.brighton.ac.uk
+44 (0)1273 642919
==============================
World is crazier and more of it than we think,
Incorrigibly plural
                         ---'Snow', Louis MacNeice
==============================


> -----Original Message-----
> From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no]
On
> Behalf Of delucca at nilc.icmc.usp.br
> Sent: 13 June 2003 13:49
> To: corpora at hd.uib.no
> Subject: [Corpora-List] Legal aspects of compiling corpora
>
>
> Dear Linguists and Lawyers,
>
> I am troubled with Legal aspects of corpora compiling. I am in
> doubt if is an illegal procedure storage webpages (or part of them)
> in a database (see at http://www.dictionarium.com/project.htm),
> not available to public, and display its contents as short
collocations
> less than 100 characters by time by search method.
>
> On the other hand, the Internet search engines uses cached (temporary
?)
> copies of the sites and display a short of the web pages.
>
> My procedure is wrong? Which the Legal difference? I need ask
permission
> for each website to storage its pages? If I mention the source and the
> author
> I will be protecting the copyrights?
>
>
> I look forward to hearing from you.
>
>
> Yours Sincerely,
>
>
> J. L. De Lucca
>
> -------------------------------------------------
> This mail sent through IMP: http://horde.org/imp/



More information about the Corpora mailing list