<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv=Content-Type content="text/html; charset=us-ascii">
<meta name=Generator content="Microsoft Word 12 (filtered medium)">
<style>
<!--
/* Font Definitions */
@font-face
{font-family:Vrinda;
panose-1:1 1 6 0 1 1 1 1 1 1;}
@font-face
{font-family:Vrinda;
panose-1:1 1 6 0 1 1 1 1 1 1;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Verdana;
panose-1:2 11 6 4 3 5 4 4 2 4;}
@font-face
{font-family:Papyrus;
panose-1:3 7 5 2 6 5 2 3 2 5;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri","sans-serif";}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:"Calibri","sans-serif";
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;}
@page Section1
{size:8.5in 11.0in;
margin:1.0in 1.25in 1.0in 1.25in;}
div.Section1
{page:Section1;}
-->
</style>
<!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang=EN-US link=blue vlink=purple>
<div class=Section1>
<p class=MsoNormal>Dear all,<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal> I
have an annotated corpus and a Morphological Analyzer.<o:p></o:p></p>
<p class=MsoNormal>My task is to use the nbest of the SRILM to choose the best
solution from the Morphological Analyzer solutions for every new unanalyzed word.<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>The Question:<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Should I use the srilm to make ONE language model file with
all the annotated data features (concatenated with ‘+’ for every
word) like this: (PREP+DET+#+NOUN+NSUFF+#+i/GEN DET+#+#+ADJ+NSUFF+#+i/GEN)<o:p></o:p></p>
<p class=MsoNormal>Or should I make for every feature, or some group of
features, a separate model file?<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>I need to know the best way to automatically annotate new
data according to my manually annotated data.<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Thanks and best regards.<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal><span style='font-size:10.0pt;font-family:"Verdana","sans-serif";
color:#000099'>Eslam Amgad Abdel Salam<o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:10.0pt;font-family:"Verdana","sans-serif";
color:#000099'>Computational Linguist,<o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:10.0pt;font-family:"Verdana","sans-serif"'><a
href="http://www.bibalex.org/"><span style='color:blue'>Bibliotheca Alexandrina</span></a><span
style='color:#000099'>, ICT Sector, <o:p></o:p></span></span></p>
<p class=MsoNormal><span style='font-size:10.0pt;font-family:"Verdana","sans-serif"'><a
href="http://www.bibalex.org/isis/"><span style='color:blue'>ISIS</span></a><span
style='color:#000099'>, ISAUC .<o:p></o:p></span></span></p>
<p class=MsoNormal><span style='font-size:10.0pt;font-family:"Verdana","sans-serif";
color:#000099'>P.O. Box 138, Ashshatby, <o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:10.0pt;font-family:"Verdana","sans-serif";
color:#000099'>Alexandria 21526, ARE.<o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:10.0pt;font-family:"Verdana","sans-serif";
color:#000099'>Tel. : +2034839999, Ext.: 2726<o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:10.0pt;font-family:"Verdana","sans-serif";
color:#000099'>Fax: +2034820405<o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:10.0pt;font-family:"Verdana","sans-serif";
color:#000099'>Cellular: +20101000725 <o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:10.0pt;font-family:"Verdana","sans-serif";
color:#000099'>E-mail: </span><span style='font-size:10.0pt;font-family:"Verdana","sans-serif";
color:blue'><a href="mailto:eslam.amgad@bibalex.org"><span style='color:blue'>Eslam.Amgad@bibalex.org</span></a><o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:10.0pt;font-family:"Verdana","sans-serif";
color:#000099'>Web site:</span><span style='font-size:10.0pt;font-family:"Verdana","sans-serif";
color:blue'> <a href="http://www.bibalex.org/"><span style='color:blue'>http://www.bibalex.org</span></a></span><span
style='font-size:10.0pt;font-family:"Verdana","sans-serif";color:#1F497D'><o:p></o:p></span></p>
<p class=MsoNormal><span style='font-family:Papyrus'><o:p> </o:p></span></p>
<p class=MsoNormal><i><span style='font-family:Papyrus;color:#0070C0'>"A
language is a dialect with an army and a navy. "<o:p></o:p></span></i></p>
<p class=MsoNormal><i><span style='font-family:Papyrus;color:#0070C0'>Max
Weinreich<o:p></o:p></span></i></p>
<p class=MsoNormal><o:p> </o:p></p>
</div>
</body>
</html>