[Corpora-List] American and British English spelling converter

John Milton lcjohn at ust.hk
Fri Nov 3 11:05:20 UTC 2006


I once had a student write: "Ahmed is a poor man. He lives in a puncture
in Isa town." You have to speak both dialects to figure this one out!

John Milton
Hong Kong University of Science and Technology


On Fri, 3 Nov 2006, Harold Somers wrote:

> It would be a grave mistake to think that the only difference between
> British and American English is a few wayward spellings. There are
> considerable and extensive lexical, grammatical and idiomatic
> differences. The 1st and 3rd of those are more or less well known, but
> the grammatical differences never cease to surprise me. I'd be
> moderately interested to see what other examples corpora listers come up
> with (though no doubt they will also remind me that there are
> significant differences in usage between American dialects, not to
> mention Canadian etc)
> 
> To give just one example of each:
> 
> Lift vs elevator
> Have you got vs do you have
> Half four vs 4:30
> 
> Harold Somers  
> 
> > -----Original Message-----
> > >
> > > Martin Krallinger wrote:
> > >
> > >> Dear all,
> > >> 
> > >> I was looking for some simple tool (preferable in Python) which is 
> > >> able to do automatic conversion of texts (or words) from British 
> > >> English (UK) to American (US)  English and vice versa.
> > >> (Example:  realize <-> realise)
> > >> 
> > >> This seems to be an easy task, but I could not find any 
> > ready to use 
> > >> stand alone tool capable of performing this task.
> > >> 
> > >> I want to integrate this application into an Information 
> > extraction 
> > >> system which handles scientific literature.
> > >> 
> > >> I am also interested in references where aspects related to US/UK 
> > >> English spelling has been analyzed in the context of information 
> > >> extraction, text mining and terminology extraction.
> > >> 
> > >> Best regards,
> > >> 
> > >> 
> > >> Martin
> > >> 
> > >> 
> > >
> > >
> 
> 



More information about the Corpora mailing list