[Corpora-List] American and British English spelling converter

Hamish Cunningham hamish at dcs.shef.ac.uk
Fri Nov 3 12:04:33 UTC 2006


Martin

> I want to use a spelling converter ONLY as a form to 'normalize' the a 
> large collection of biomedical text for subsequent IE, IR, document 

You might be better off using some form of stand-off markup and just
annotating your texts with the "normal form" instead of actually transforming
your documents. This is considerably more flexible, and is pretty much the
orthodoxy amongst IE architectures these days - see e.g. GATE, UIMA, others.

Best

Hamish
--
Dr. Hamish Cunningham
Senior Research Scientist
Department of Computer Science
University of Sheffield
Regent Court
211 Portobello St.
Sheffield  S1 4DP
United Kingdom
http://www.dcs.shef.ac.uk/~hamish/



More information about the Corpora mailing list