[Corpora-List] sentence aligner script

Tony Berber Sardinha tony4 at uol.com.br
Sat Dec 14 10:25:24 UTC 2002


Dear list members,

Does anyone have a script (Perl, Unix text utilities, etc.) for aligning the
sentences of a bilingual corpus? The input is two files, one in language A, the
other in language B, and the results don't need to be neat and polished, or
perfect.

I found references to a 'Vanilla aligner' at
http://tractor.bham.ac.uk/tractor/tools.html but access is password protected.
This aligner was also mentioned in a previous thread on this list
(http://www.hit.uib.no/corpora/2002-3/0145.html), which focused on aligners for
ParaConc.

Any ideas would be appreciated.

Thank you very much.

cheers
tony.
-------------------------------------
Dr Tony Berber Sardinha
LAEL, PUC/SP
(Catholic University of Sao Paulo, Brazil)
tony4 at uol.com.br
http://lael.pucsp.br/~tony
[New website]


