18.918, Software: ELRA Language Resources Catalogue Update 03/07

LINGUIST Network linguist at LINGUISTLIST.ORG
Tue Mar 27 19:09:01 UTC 2007


LINGUIST List: Vol-18-918. Tue Mar 27 2007. ISSN: 1068 - 4875.

Subject: 18.918, Software: ELRA Language Resources Catalogue Update 03/07

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
 
Reviews: Laura Welcher, Rosetta Project  
       <reviews at linguistlist.org> 

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, 
and donations from subscribers and publishers.

Editor for this issue: Hannah Morales <hannah at linguistlist.org>
================================================================  

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.

===========================Directory==============================  

1)
Date: 20-Mar-2007
From: Helene Mazo < mazo at elda.org >
Subject: ELRA Language Resources Catalogue Update 03/07

*******************************************************************************
          Fund Drive FLASH: We still need $40,685 to end Fund Drive.
   If you have not donated, please visit http://linguistlist.org/donate.html
******************************************************************************* 

	
-------------------------Message 1 ---------------------------------- 
Date: Tue, 27 Mar 2007 15:06:52
From: Helene Mazo < mazo at elda.org >
Subject: ELRA Language Resources Catalogue Update 03/07 
 


ELRA is happy to announce that 3 new Speech Related Resources are now
available in its catalogue. Moreover, we are pleased to announce that years
2005 and 2006 from the Text Corpus of 'Le Monde'  (ELRA-W0015) are now
available.

ELRA-S0235 LC-STAR Hebrew (Israel) phonetic lexicon 
The LC-STAR Hebrew (Israel) phonetic lexicon comprises 109,580 words,
including a set of 62,431 common words, a set of 47,149 proper names
(including person names, family names, cities, streets, companies and brand
names) and a list of 8,677 special application words. The lexicon is
provided in XML format and includes phonetic transcriptions in SAMPA.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=984&language=en 

ELRA-S0236 LC-STAR English-Hebrew (Israel) Bilingual Aligned Phrasal lexicon 
The LC-STAR English-Hebrew (Israel) Bilingual Aligned Phrasal lexicon
comprises 10,520 phrases from the tourist domain. It is based on a list of
short sentences obtained by translation from US-English 10,449 phrasal
corpus. The lexicon is provided in XML format.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=985&language=en 

ELRA-S0237 LC-STAR US English phonetic lexicon 
The LC-STAR US English phonetic lexicon comprises 102,310 words, including
a set of 51,119 common words, a set of 51,111 proper names (including
person names, family names, cities, streets, companies and brand names) and
a list of 6,807 special application words. The lexicon is provided in XML
format and includes phonetic transcriptions in SAMPA.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=986&language=en 

ELRA-W0015 Text corpus of 'Le Monde'
Corpus from 'Le Monde' newspaper. Years 1987 to 2002 are available in an
ASCII text format. Years 2003 to 2006 are available in .XML format. Each
month consists of some 10 MB of data (circa 120 MB per year).
For more information, see:
http://catalog.elra.info/product_info.php?products_id=438&language=en   


For more information on the catalogue, please contact Valérie Mapelli
mailto:mapelli at elda.org

Our on-line catalogue has moved to the following address:
http://catalog.elra.info. Please update your bookmarks. 
Linguistic Field(s): Computational Linguistics




-----------------------------------------------------------

This Year the LINGUIST List hopes to raise $55,000. This money will go to help keep the 
List running by supporting all of our Student Editors for the coming year.

See below for donation instructions, and don't forget to check out our Fund Drive 2007 
LINGUIST List Superhero Adventure for some Fund Drive fun!

http://linguistlist.org/donation/fund-drive2007/ 

There are many ways to donate to LINGUIST!

You can donate right now using our secure credit card form.

Alternatively you can also pledge right now and pay later.

For all information on donating and pledging, including information on how to donate by 
check, money order, or wire transfer, please visit:

http://linguistlist.org/donate.html

The LINGUIST List is under the umbrella of Eastern Michigan University and as such can 
receive donations through the EMU Foundation, which is a registered 501(c) Non Profit 
organization. Our Federal Tax number is 38-6005986. These donations can be offset against 
your federal and sometimes your state tax return (U.S. tax payers only). For more 
information visit the IRS Web-Site, or contact your financial advisor.

Many companies also offer a gift matching program, such that they will match any gift 
you make to a non-profit organization. Normally this entails your contacting your human 
resources department and sending us a form that the EMU Foundation fills in and returns 
to your employer. This is generally a simple administrative procedure that doubles the 
value of your gift to LINGUIST, without costing you an extra penny. Please take a moment 
to check if your company operates such a program.

Thank you very much for your support of LINGUIST!


 

-----------------------------------------------------------
LINGUIST List: Vol-18-918	

	



More information about the LINGUIST mailing list