[Corpora-List] American and British English spelling converter
Hamish Cunningham
hamish at dcs.shef.ac.uk
Fri Nov 3 12:04:33 UTC 2006
Martin
> I want to use a spelling converter ONLY as a form to 'normalize' the a
> large collection of biomedical text for subsequent IE, IR, document
You might be better off using some form of stand-off markup and just
annotating your texts with the "normal form" instead of actually transforming
your documents. This is considerably more flexible, and is pretty much the
orthodoxy amongst IE architectures these days - see e.g. GATE, UIMA, others.
Best
Hamish
--
Dr. Hamish Cunningham
Senior Research Scientist
Department of Computer Science
University of Sheffield
Regent Court
211 Portobello St.
Sheffield S1 4DP
United Kingdom
http://www.dcs.shef.ac.uk/~hamish/
More information about the Corpora
mailing list