[Corpora-List] Help

Vlado Keselj vlado at cs.dal.ca
Mon Jun 27 13:29:00 UTC 2011


Hi,

The following paper is relevant.  A PDF version can be found by searching 
on Google with the title as a query.


A Suffix Subsumption-based Approach to Building Stemmers and Lemmatizers 
for Highly Inflectional Languages with Sparse Resources
Vlado Keselj and Danko Sipka. INFOTHECA, Journal of Informatics and 
Librarianship, vol. IX, no. 1--2, pp. 23a-33a, 21--31, May 2008.


Regards,
Vlado

On Mon, 27 Jun 2011, Diana Maynard wrote:

> It's a few years old now, but it's probably worth having a look at the TIDES
> Surprise Language Exercise.  In particular, the dry run was on Cebuano, a
> language for which few resources were available (at least at the time).
> 
> For a general description of the exercise, see
> http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.151.3647&rep=rep1&type=pdf
> 
> Google "TIDES Surprise Language Exercise" and you'll get a bunch of relevant
> papers about work done as part of the exercise, e.g.
>  "NE recognition without training data on a language you don't speak"
> http://gate.ac.uk/sale/acl03/surprise.pdf
> 
> Regards
> Diana
> 
> 
> On 27/06/11 13:51, Inès Zribi wrote:
> > Dear Francis
> > 
> > Thank you for your response.
> > 
> > I'm not interested in a particular group of languages. I am looking for
> > studying methods that deal with languages that are characterized by the
> > lack or even the absence of resources.
> > 
> > Best wishes.
> > 
> > Inès.
> > 
> > 2011/6/27 Francis Tyers <ftyers at prompsit.com <mailto:ftyers at prompsit.com>>
> > 
> >     El dl 27 de 06 de 2011 a les 11:05 +0100, en/na Inès Zribi va escriure:
> >      > Dear Corpora list,
> >      >
> >      > Does anyone know any works (morphological analysis, parsing,
> >      > tokenization, POS tagging, etc.) deal with under-resourced languages?
> > 
> >     Any language or group of languages in particular ?
> > 
> >     Fran
> 
> 
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
> 
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list