[Corpora-List] How to measure n-grams (n>2)

Mihail Kopotev Mihail.Kopotev at helsinki.fi
Wed Oct 24 13:22:24 UTC 2012


Dear Corpora-listers,

As far as I can see, the commonly used approach to extracting collocations
from n-grams (n>2) is to treat them, in one way or another, as
pseudo-bigrams. I'm wondering whether there are more direct techniques
that are not derived from bigram measures.
Could you please point me to any articles/resources, especially those where
these techniques are evaluated against data?

Thank you,
Mikhail Kopotev

-- 
Mikhail Kopotev, PhD, Adj.Prof.
University Lecturer
Department of Modern Languages
University of Helsinki
http://www.helsinki.fi/~kopotev
