Corpora: word use-frequency corpora

LDC Office ldc at unagi.cis.upenn.edu
Wed Apr 5 19:45:36 UTC 2000


Dear Andrew,

The Linguistic Data Consortium's CallHome Lexicons offer
information on word frequency, as well as orthography, morphology,
pronunciation, and stress.  The languages covered by the CallHome
Lexicons are American English, Egyptian Arabic, German, Japanese,
Mandarin, and Spanish.

A search by corpus type in our Catalog will point you to further
details regarding these lexicons, as well as all other data
available through LDC.

http://morph.ldc.upenn.edu/Catalog/search.html

Please let me know if you have further questions.

Best,

Shannon Sears
Manager, Intellectual Property Rights and Membership
----------------------------------------------------------------------
Linguistic Data Consortium          Phone: (215) 898-0464
3615 Market Street                  Fax:   (215) 573-2175
Suite 200                           email: ssears at ldc.upenn.edu
Philadelphia, PA 19104-2608         www: http://www.ldc.upenn.edu


> Date: Sat, 1 Apr 2000 10:05:35 +0100
> From: andrew mccrum <andrewm at fsbdial.co.uk>
> To: corpora at hd.uib.no
> Subject: Corpora: word use-frequency corpora
> Precedence: bulk
>
> I am researching motivation in word-initial consonant onsets
> at Sussex University, England.
>
> 1 Are there any word use-frequency corpora available for languages
> other than English, readable on an ordinary Windows PC?
>
> 2 Are there any machine-readable phonetic dictionaries available
> for the same languages with the same accessibility?
>
> Andrew McCrum
> Postgraduate Research degree student
> Department of Cognitive and Computing Sciences
> Sussex University
> England
>
>


