[Corpora-List] corpus ------>>>>> thesaurus

Viktor Pekar v.pekar at wlv.ac.uk
Tue Nov 9 11:12:06 UTC 2004


Hi Vladimir,

You can find a good introduction to lexical acquisition methods based on
co-occurrence statistics in Manning and Schuetze's "Foundations of
Statistical Natural Language Processing". For an overview of work on
semantic clustering of words based on their spelling, see M. Oakes's
"Statistics for Corpus Linguistics".

Best wishes,

Viktor.


----- Original Message -----
From: "P bI K O B___ B.B. (MOCKBA)" <rykov at narod.ru>
To: <corpora at uib.no>
Sent: Tuesday, November 09, 2004 7:50 AM
Subject: [Corpora-List] corpus ------>>>>> thesaurus


>
> I would be very grateful to anyone for any information on compiling a
> thesaurus from a corpus (especially from a corpus of domain-specific
> documents).
>
> As an example: a thesaurus of financial terms compiled from a corpus of
> financial documents.
>
> Best wishes to all our corpus society!
>
> --
>   Regards Vladimir Rykov
>
> PhD in Computational Linguistics
> Personal web-site: rykov.narod.ru
> mailto: rykov2000 at mail.ru
> Si etiam omnes - ego non
> English version:   www.blkbox.com/~gigawatt/rykov.html
>


