[Corpora-List] Fangorn: a system for querying very large treebanks

Steven Bird sb at csse.unimelb.edu.au
Thu Aug 30 06:21:47 UTC 2012


Fangorn is an open source tool for querying very large treebanks,
built on top of Apache Lucene.  Fangorn implements the LPath
linguistic path language, which has an XPath-like syntax along with
linguistically motivated extensions.  Result trees are annotated with
the query in order to show how the query matched the tree, and these
annotations can themselves be modified and submitted as further
queries.

Demonstration site:
    http://nltk.ldc.upenn.edu:9090/

Query language tutorial:
    https://code.google.com/p/fangorn/wiki/Query_Language

Source code:
    http://code.google.com/p/fangorn/

Steven Bird and Sumukh Ghodke

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list