7.1666, Sum: Lancaster Corpus

The Linguist List linguist at unix.tamu.edu
Mon Nov 25 02:42:37 UTC 1996


---------------------------------------------------------------------------
LINGUIST List:  Vol-7-1666. Sun Nov 24 1996. ISSN: 1068-4875. Lines:  90
 
Subject: 7.1666, Sum: Lancaster Corpus
 
Moderators: Anthony Rodrigues Aristar: Texas A&M U. <aristar at unix.tamu.edu>
            Helen Dry: Eastern Michigan U. <hdry at emunix.emich.edu> (On Leave)
            T. Daniel Seely: Eastern Michigan U. <dseely at emunix.emich.edu>
 
Associate Editors: Ljuba Veselinova <lveselin at emunix.emich.edu>
                   Ann Dizdar <dizdar at unix.tamu.edu>
Assistant Editor:  Sue Robinson <robinson at emunix.emich.edu>
Technical Editor:  Ron Reck <rreck at emunix.emich.edu>
 
Software development: John H. Remmers <remmers at emunix.emich.edu>
 
Editor for this issue: robinson at emunix.emich.edu (Susan Robinson)
 
---------------------------------Directory-----------------------------------
1)
Date:  Fri, 22 Nov 1996 17:14:00 +0100
From:  0431659800-0001 at t-online.de (Guenter Schubert)
Subject:  Sum: Lancaster Corpus
 
---------------------------------Messages------------------------------------
1)
Date:  Fri, 22 Nov 1996 17:14:00 +0100
From:  0431659800-0001 at t-online.de (Guenter Schubert)
Subject:  Sum: Lancaster Corpus
 
 
I recently posted a query about the existence and nature of the
Lancaster Corpus, and about how to access it and whether it is
available on CD-ROM.  I am indebted to the following people all over
the world for their helpful hints:
 
1) Tracy Cameron Mansfield (mansfieldmail at worldnet.att.net)
2) Yael Maschler (yaelm at vms.huji.ac.il)
3) Donald C. Freeman (dfreeman at bcf.usc.edu)
4) M. Hundt (hundt at rcs.urz.tu-dresden.de)
5) Jonathan Swift (jons at ais.co.uk)
6) Sung-Ho Ahn (shahn at email.hanyang.ac.kr)
7) Ingo Plag (plag at mailer.uni-marburg.de)
8) Jane Setter (egjanes at polyu.edu.hk)
9) Martha Jo McGinnis (marthajo at MIT.edu)
10) Suzanne E Kemmer (kemmer at ruf.rice.edu)
11) Dennis Newson (dnewson at dosunil.rz.uni-osnabrueck.de)
12) Lex Olorenshaw (lexo at lsi.sel.sony.com)
 
These people pointed me to specialists involved in the completion of
the corpus and to others who could help for other reasons. I also
received relevant WWW and e-mail addresses, as well as pointers to
other existing corpora.
 
The corpus in question is a joint enterprise of Lancaster, Oslo and
Bergen (the LOB Corpus). It was compiled at the Norwegian Computing
Centre for the Humanities in Bergen; it is a 1-million-word archive of
English texts published in 1961. There is a (slightly expensive)
CD-ROM available that also includes LOB's American equivalent, the
BROWN Corpus (Brown Univ., Providence, Rhode Island, USA; also known
as the FRANCIS/KUCERA Corpus), and the LONDON-LUND Corpus.
 
Information about the CD-ROM can be obtained from the International
Computer Archive of Modern English (ICAME) in Bergen (e-mail:
icame at hd.uib.no).
 
Other corpora mentioned in the correspondence were MARSEC (Leeds
Univ.) and the PENN TREEBANK (Univ. of Pennsylvania).
 
Relevant internet addresses:
 
http://www.awl-elt.com/dictionaries/lasde.html (= the Longman
     dictionary web site)
http://www.ruf.rice.edu/~barlow/corpus.html (= available corpora)
http://www.cis.upenn.edu/~treebank/home.html (= Treebank Corpus)
http://www.ldc.upenn.edu/ (= Linguistic Data Consortium: American
     English data)
http://www.dcs.shef.ac.uk/research/ilash/info/papers/ShATR (= for
     British English)
 
The Univ. of Freiburg, Germany, is currently working on a corpus that
will be a 1990s update of LOB and BROWN.
 
Thank you all very much; I appreciate your help.
 
Guenter Schubert
Kiel/Germany
e-mail: guenter.schubert at t-online.de
------------------------------------------------------------------------
LINGUIST List: Vol-7-1666.


