[Corpora-List] Questions for Google syntactic N-grams corpus

Don Tuggener tuggener at cl.uzh.ch
Mon Nov 18 08:35:41 UTC 2013


Dear Gang Tian,

There are information extraction tools that aim at what you describe under Q2.
I suspect a major problem with using the Google n-grams corpus is that it is hard, or even impossible, to parse, because the n-grams do not form full sentences.
However, you could have a look at the following tools (I haven't used them myself):

ReVerb (doesn't require parsing):
http://reverb.cs.washington.edu/

Ollie (requires parsing):
http://knowitall.github.io/ollie/

Best
Don
