[Corpora-List] tagger for Ukranian

Serge Sharoff S.Sharoff at leeds.ac.uk
Mon Feb 7 17:40:35 UTC 2011


Dear all,

getting a reliable HMM tagger done is a matter of a weekend job (or one week if you want to debug and document it) provided that you have a lexicon and a tagger for a closely related language.  This follows research done by Anna Feldman and Chris Brew at Ohio State, see
@INPROCEEDINGS{feldman06,
  author = {Anna Feldman and Jirka Hana and Chris Brew},
  title = {A cross-language approach to rapid creation of new morpho-syntactically
	annotated resources},
  booktitle = {Proceedings of the Fifth Language Resources and Evaluation Conference LREC 2006},
location = {Genoa, Italy},
  year = {2006}
}

A reliable Russian tagger exists
@InProceedings{sharoff08lrec-mocky,
  author = 	 {Serge Sharoff and Mikhail Kopotev and Toma\v{z} Erjavec and Anna Feldman and Dagmar Divjak},
  title = 	 {Designing and Evaluating a {Russian} Tagset},
  booktitle =	 {Proceedings of the Sixth Language Resources and Evaluation Conference, {LREC} 2008},
  year =	 2008,
  address =	 {Marrakech}
}

The Ukrainian lexicon is available from Multext-East:
@InProceedings{erjavec10,
  author = {Tomaž Erjavec},
  title = {MULTEXT-East Version 4: Multilingual Morphosyntactic Specifications, Lexicons and Corpora},
  booktitle = {Proceedings of the Seventh conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  address = {Valletta, Malta}
 } 

so getting the Ukrainian tagger done is just the matter of getting someone to use a simple set of tools.

Best,
Serge


________________________________________
From: corpora-bounces at uib.no [corpora-bounces at uib.no] On Behalf Of Mcenery, Tony [eiaamme at exchange.lancs.ac.uk]
Sent: 07 February 2011 14:48
To: corpora at uib.no
Subject: Re: [Corpora-List] tagger for Ukranian

Yes, I too have been looking for a Ukrainian tagger, but to no avail. I made a real effort to get a response from the UGTag people but failed - I fear it is vapourware. I think serge Sharoff at Leeds may be planning to produce a Ukrainian tagger, but it is some way off. Best,

Tony

-----Original Message-----
From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of Noah Bubenhofer
Sent: 07 February 2011 14:25
To: corpora at uib.no
Subject: Re: [Corpora-List] tagger for Ukranian

Hi,

this is indeed difficult: I discovered UGTag:
http://www.domeczek.pl/~polukr/parcor/

But the development seems to be suspended, I didn't succeed to get a
copy of the tagger.

ABBYY has a commercial solution for at least the morphological analysis
of Ukrainian. Its name is ABBYY Morphology Engine. Perhaps you can get
an evaluation copy of the tool.

I'd also be interested in an Ukrainian tagger.

Noah

Am 07.02.2011 12:19, schrieb Montserrat Civit:
> Hi,
>
> Does anyone know about a tagger for Ukranian?
> I greatly appreciate any idea and suggestion. Thanks so much in advance!
>
>
> --
> Montserrat Civit Torruella
>
>
>
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora

--
Dr. Noah Bubenhofer
Institut für Deutsche Sprache, R5 6-13, D-68161 Mannheim
Postadresse: Postfach 10 16 21, D-68016 Mannheim
Tel: +49(621) 1581-217
Fax: +49(621) 1581-200
E-Mail: bubenhofer at ids-mannheim.de

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list