[Corpora-List] Chunking of Slavic languages

Adam Radziszewski kocikikut at gmail.com
Sun May 22 10:09:09 UTC 2011


Dear corpora users,
I'm doing research on chunking of Slavic languages, including both
chunking algorithms and the problem of chunk definitions for Slavic
languages.

I'm aware of the following works:
• *Bulgarian*: P. Osenova's works on BaseNP chunking and transformation into
HPSG (hand-written grammars)
• *Croatian*: works of K. Vučković (including her thesis), M. Tadić and Z.
Dovedan (hand-written grammars, NooJ platform)
• *Polish*: works by A. Przepiórkowski (related to the Spejd formalism and
the National Corpus of Polish) as well as my own work with M. Piasecki (a
base NP chunker based on decision trees).
• *Serbo-Croatian*: works of G. Nenadić and D. Vitas (hand-written local
grammars).

The above list is probably incomplete, so I would be grateful if someone
could point me to more references. It is especially hard to find
discussions of chunk definitions for Slavic languages.

Best,
Adam Radziszewski