[Corpora-List] Uses of N-grams?

Imene Bensalem bens.imene at gmail.com
Thu Jul 18 14:14:47 UTC 2013


Hi Cedirc ;

n-gram is a technique commonly used in natural language processing (NLP)
Instead of representing a document as a bag-of-word, it is sometimes useful
to represent it using n-grams,
especially if the application needs to capture contextual information, or
needs to be robust against spelling mistakes.
Examples of this technique use in NLP applicaitons includes: automatic
plagiarism
detection , authorship attribution , topic identification.

Kind regards

Imene
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20130718/25cc6d78/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list