[Corpora-List] Help
Diana Maynard
d.maynard at dcs.shef.ac.uk
Mon Jun 27 13:06:02 UTC 2011
It's a few years old now, but it's probably worth having a look at the
TIDES Surprise Language Exercise. In particular, the dry run was on
Cebuano, a language for which few resources were available (at least at
the time).
For a general description of the exercise, see
http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.151.3647&rep=rep1&type=pdf
Google "TIDES Surprise Language Exercise" and you'll get a bunch of
relevant papers about work done as part of the exercise, e.g.
"NE recognition without training data on a language you don't speak"
http://gate.ac.uk/sale/acl03/surprise.pdf
Regards
Diana
On 27/06/11 13:51, Inès Zribi wrote:
> Dear Francis
>
> Thank you for your response.
>
> I'm not interested in a particular group of languages. I am looking to
> study methods that deal with languages characterized by the lack, or
> even the absence, of resources.
>
> Best wishes.
>
> Inès.
>
> 2011/6/27 Francis Tyers <ftyers at prompsit.com>
>
> On Mon, 27 Jun 2011 at 11:05 +0100, Inès Zribi wrote:
> > Dear Corpora list,
> >
> > Does anyone know of any work (morphological analysis, parsing,
> > tokenization, POS tagging, etc.) dealing with under-resourced languages?
>
> Any language or group of languages in particular?
>
> Fran