<html><body><div style="color:#000; background-color:#fff; font-family:Courier New, courier, monaco, monospace, sans-serif;font-size:12pt"><div><span>I mean that I have two different scripts for the same word (e.g. two scripts for "cat") </span><span style="font-size: 12pt;">written</span><span style="font-size: 12pt;"> </span><span style="font-size: 12pt;">by different people. The first script generates 358 words (and only 107 words are correct), and the second script generates 497 words (and 471 words are correct). Can I say that the result of the first script is worse or not?</span></div><div style="color: rgb(0, 0, 0); font-size: 16px; font-family: 'Courier New', courier, monaco, monospace, sans-serif; background-color: transparent; font-style: normal;"><span>Once again sorry for bothering.</span></div><div></div><div> </div><div><b>Irina L</b></div><div><br></div> <div style="font-family: 'Courier New', courier, monaco, monospace,
sans-serif; font-size: 12pt;"> <div style="font-family: 'times new roman', 'new york', times, serif; font-size: 12pt;"> <div dir="ltr"> <font size="2" face="Arial"> <hr size="1"> <b><span style="font-weight:bold;">From:</span></b> Mike Maxwell <maxwell@umiacs.umd.edu><br> <b><span style="font-weight: bold;">To:</span></b> Eirini LS <eirini_ls@yahoo.com> <br><b><span style="font-weight: bold;">Cc:</span></b> "corpora@uib.no" <corpora@uib.no> <br> <b><span style="font-weight: bold;">Sent:</span></b> Thursday, January 17, 2013 5:11 PM<br> <b><span style="font-weight: bold;">Subject:</span></b> Re: [Corpora-List] (no subject)<br> </font> </div> <br>
On 1/17/2013 3:09 AM, Eirini LS wrote:<br>> Thank you very much for your answer. But if I have two scripts for a word, and the first script<br>> generates 358 units (107 units - correct) and the second script - 497 units (471 units - correct)<br>> after my hand-validation of the list, which I get using "print lower-words" (this command helps<br>> me to provide output in .txt file, because of utf8 code, which isn't visible in xfst), does it<br>> mean that the first script is not a correct one? Which of this two scripts is better? Thank you<br>> in advance, *Irina L*<br><br>Sorry, I don't understand the question; I'm not sure what it means to have two scripts for a word, nor what the units are.<br><br>As for UTF8, whether it appears in xfst depends on the settings in whatever command-line processor you're using (Linux bash, Windows' cmd, etc.). That said, for testing purposes (as opposed to, say, debugging a new rule), you
generally want to send your output to a file, so you can compare it with previous results.<br>-- Mike Maxwell<br> <a ymailto="mailto:maxwell@umiacs.umd.edu" href="mailto:maxwell@umiacs.umd.edu">maxwell@umiacs.umd.edu</a><br> "My definition of an interesting universe is<br> one that has the capacity to study itself."<br> --Stephen Eastmond<br><br><br> </div> </div> </div></body></html>