[Corpora-List] Manually annotated alignments

Valerie Mapelli mapelli at elda.org
Tue May 22 08:02:05 UTC 2007


Dear Lexi,

You might be interested in the following corpora as well, available in the 
ELRA catalogue (http://catalog.elra.info):

W0017 MULTEXT JOC Corpus
<http://catalog.elra.info/product_info.php?products_id=534&osCsid=928f15b746f092e824c577f00b2c2ccc>
English, French, Italian

W0023 MLCC Multilingual and Parallel Corpora (MLCC)
<http://catalog.elra.info/product_info.php?products_id=764&osCsid=928f15b746f092e824c577f00b2c2ccc>
Dutch, English, German, French, Italian, Spanish, Danish, Greek

W0031 GeFRePaC - German French Reciprocal Parallel Corpus
<http://catalog.elra.info/product_info.php?products_id=633&osCsid=928f15b746f092e824c577f00b2c2ccc>
German, French

W0033 CRATER 2 Corpus
<http://catalog.elra.info/product_info.php?products_id=636&osCsid=928f15b746f092e824c577f00b2c2ccc>
French, English, Spanish

I will be happy to provide you with further information.

Best,

Valerie Mapelli



At 01:52 18/05/2007, Alexandra Birch wrote:
>Hi there,
>
>I am searching for manually annotated word/phrase alignments from
>parallel corpora. So far I have discovered:
>
>ACL2003 shared task
>http://www.cs.unt.edu/~rada/wpt/
>Romanian - English (Mihalcea & Pedersen 2003)
>English - French (Och & Ney 2000)
>
>ACL2005 shared task
>http://www.cse.unt.edu/~rada/wpt05/
>English - Inuktitut
>English - Hindi
>
>EPPS Word Alignment Trial and Test Set
>Spanish - English (500 sentences)
>http://gps-tsc.upc.es/veu/LR/epps_ensp_alignref.php3
>
>I will keep looking, but I would appreciate it if anyone could
>inform me of other resources they know about.
>
>Thank you
>
>Lexi
>

