<html>
<head>
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<div class="moz-cite-prefix">Dear All,<br>
<br>
Those working with comparable corpora might be interested in our
Aranea web corpora project:<br>
<br>
<a class="moz-txt-link-freetext" href="http://ucts.uniba.sk/aranea_about/">http://ucts.uniba.sk/aranea_about/</a><br>
<br>
Our corpora are "comparable" in the sense that they are of the same
size, their texts were downloaded from the web at (approximately)
the same time, they contain a (hopefully ;-) similar mixture of
web-specific domains, genres and registers, they were processed and
annotated by (where possible) the same tools, and they are available
under the same corpus manager. The smaller (100 Mword) versions of
our corpora are freely accessible (without registration) under
NoSketch Engine here:<br>
<br>
<a class="moz-txt-link-freetext" href="http://ucts.uniba.sk/guest/index.html">http://ucts.uniba.sk/guest/index.html</a><br>
<br>
Any feedback is welcome :-)<br>
<br>
Best regards,<br>
<br>
V Benko, 17:05 <br>
<br>
</div>
<blockquote
cite="mid:CAKaOVPAYXymPLB6pwsF57RFGqFOdDw1=-Wn7s6bytYwBav5BDQ@mail.gmail.com"
type="cite">
<div dir="ltr">
<div class="gmail_default" style="font-family:trebuchet
ms,sans-serif">Hi,<br>
</div>
<div class="gmail_default" style="font-family:trebuchet
ms,sans-serif">Is there any tool for extracting a probabilistic
bilingual dictionary from a bilingual comparable corpus? Does
Moses support such a task?<br>
</div>
<div class="gmail_default" style="font-family:trebuchet
ms,sans-serif">Best,<br>
</div>
<div class="gmail_default" style="font-family:trebuchet
ms,sans-serif">Javid<br>
</div>
</div>
<br>
<pre wrap="">_______________________________________________
UNSUBSCRIBE from this page: <a class="moz-txt-link-freetext" href="http://mailman.uib.no/options/corpora">http://mailman.uib.no/options/corpora</a>
Corpora mailing list
<a class="moz-txt-link-abbreviated" href="mailto:Corpora@uib.no">Corpora@uib.no</a>
<a class="moz-txt-link-freetext" href="http://mailman.uib.no/listinfo/corpora">http://mailman.uib.no/listinfo/corpora</a>
</pre>
</blockquote>
<br>
<br>
<pre class="moz-signature" cols="72">--
Vladimír Benko
Slovak Academy of Sciences
Ľ. Štúr Institute of Linguistics
Panská 26, SK-81101 Bratislava
Tel +421-2-54431762 Fax -54431756</pre>
</body>
</html>