You should definitely look at this page: <a href="http://norvig.com/spell-correct.html">http://norvig.com/spell-correct.html</a><div><br></div><div><a href="http://norvig.com/spell-correct.html"></a>I have worked for many years with spell-checkers that have been or are OEM-ed by all the biggest software providers. However, I have found the information in this site quite complete and up-to-date. Beside the 21 lines of Python code, it has links to most applications and theories on spell-checkers.</div>
<div><br></div><div>I would only add the use of FST/FSM based dictionaries and morphological engines (Xerox PARC/Inxight and Teragram) in spell-checkers.</div><div><br></div><div>Hope this helps,</div><div><br></div><div>
Alex<br><br><div class="gmail_quote">On Thu, Dec 3, 2009 at 10:04 AM, Nicola Bertoldi <span dir="ltr"><<a href="mailto:bertoldi@fbk.eu">bertoldi@fbk.eu</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
I send again this message with a more appropriate heading.<br>
Sorry for the inconvenience.<br>
<div class="im"><br>
<br>
<br>
I am going to do some investigation to improve machine translation<br>
when it is applied to texts corrupted by misspellings of any sort (non-word, real-word errors).<br>
<br>
In this preliminary phase I am collecting information about the spelling correction task<br>
and other applications and tasks which involves spelling correction.<br>
<br>
In particular, I am interested in<br>
- surveys about the task<br>
- statistics about the most common misspellings in texts of different languages and different genres<br>
- public available software for spelling correction<br>
- available corpora of noisy texts<br>
- any further resources which is possibly useful for my topic<br>
<br>
<br>
<br>
Thanks!<br>
<br>
Nicola<br>
<br>
</div>------ End of Forwarded Message<br>
<div><div></div><div class="h5"><br>
_______________________________________________<br>
Corpora mailing list<br>
<a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>
<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
</div></div></blockquote></div><br></div>