[Corpora-List] Codings for corpus files to be used in ParaConc

Mon Jun 19 15:44:12 UTC 2006

Dear colleagues,

I'm compiling a corpus on European Parlamentary Speeches and I have 
found out that names of MEPs from Eastern Europe countries are displayed 
with errors when we use them with ParaConc. The same happens with 
accents in Spanish texts. We have saved our files as .txt using the 
coding Unicode (UTF-8). When we use the texts saved using the coding 
Western Europe (ISO) Spanish problems disappear but not those given with 
Eastern Europe languages.
Does anybody know a single coding we can use for any language spoken in 
the European Parliament?
Thank very much.
Best regards,

José Manuel

______________________________________________ 
LLama Gratis a cualquier PC del Mundo. 
Llamadas a fijos y móviles desde 1 céntimo por minuto. 
http://es.voice.yahoo.com