Corpora: Parsing morphologically rich languages

Alexander Mikhailian <mikhailian at altern.org>
Fri Jan 12 14:18:34 UTC 2001


Hello,

I am looking for references to syntactic parsers
that deal with morphologically rich, inflecting languages.

In particular, I am interested in:

1. Approaches to dealing with the number of POS tags
(terminals), which would presumably be much larger
than for English or French. If one tries to build
a list of POS tags for a morphologically rich
language in order to follow approaches developed
for English, the list may easily grow to thousands
of entries, and grammars using such a huge set of
terminals would be quite complicated.

2. Approaches to dealing with the free or loosely
restricted word order that is often characteristic of
morphologically rich languages and that requires
different parsing techniques from those used for English,
where a common shift/reduce parser is often sufficient.
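
As a rough illustration of point 1, here is a small Python sketch.
The feature inventory in it is invented for the example and does not
describe any particular language or tagset. It shows how a flat tag
list grows as the product of the feature values, and how the same
information could instead be kept as feature bundles that a grammar
rule checks for agreement.

```python
from itertools import product

# Hypothetical feature inventory for nominal word classes in an
# inflecting language; categories and values are assumptions made
# for illustration only.
FEATURES = {
    "pos": ["noun", "adj", "pron", "num"],
    "case": ["nom", "gen", "dat", "acc", "ins", "loc"],
    "number": ["sg", "pl"],
    "gender": ["masc", "fem", "neut"],
    "animacy": ["anim", "inan"],
}


def atomic_tags(features):
    """Enumerate every combination of feature values as one flat tag."""
    value_lists = [features[name] for name in features]
    return ["-".join(values) for values in product(*value_lists)]


def agree(a, b, on=("case", "number", "gender")):
    """Check that two feature bundles carry equal values for `on`."""
    return all(a[f] == b[f] for f in on)


tags = atomic_tags(FEATURES)
print(len(tags))  # 4 * 6 * 2 * 3 * 2 = 288 flat tags for nominals alone

# The same information kept as feature bundles: one agreement check
# stands in for a separate rule per pair of atomic tags.
adj = {"pos": "adj", "case": "gen", "number": "sg", "gender": "fem"}
noun = {"pos": "noun", "case": "gen", "number": "sg", "gender": "fem"}
print(agree(adj, noun))  # True
```

Adding verbal categories such as tense, person, aspect and mood to
an inventory like this one is what pushes a flat list into the
thousands.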

Thanks in advance,

--
Alexander Mikhailian


