[Corpora-List] adaptable FSM (finite state machine) for NLP

Yannick Versley versley at sfs.uni-tuebingen.de
Fri Jul 4 08:07:11 UTC 2008


Albrecht,

many of the people who develop HPSG grammars actually made their grammars 
public in the last years, which would be pretty close to what you suggest:
http://www.delph-in.net/index.php?page=3
http://www-tsujii.is.s.u-tokyo.ac.jp/downloads/pack.html
Otherwise, Joan Bresnan's book on Lexical Functional Grammar has a couple 
examples in other languages.

If you are looking for what people nowadays use for Natural Language 
Understanding (even if people are shy to call it so since the goals are less 
ambitious and the evaluation more rigorous), you won't find many hand-crafted 
general grammars anymore. In most cases people use a parsing model
derived from a treebank, such as the 
Charniak/Stanford/Berkeley/Collins/Bikel/Malt/MST parsers.

Oh, and none of these is finite-state-based.

Best,
Yannick
>  As we well know, there are many languages that are very similar. In
> many cases, to a large extent if not fully, they share the same
> alphabet, syntax rules and even phonemes
>
>  So, since parsers are essentially fed text sequentially (and
> naturally so) I wonder what are the strategies developed out there for
> pluggable parsing strategies (depth or breath first), some lexicon
> (which does not have to be totally complete) and rules describing the
> generative possibilities of this lexicon
>
>  As you could tell I am not a linguist myself, but after reading James
> Allen's Natural Lang Understanding, in which he, even if the theory is
> general, exclusively uses plenty of examples of English, I think such
> an "English Grammar file" may not be that difficult to device and if
> you do it for English I could easily imagine that there are such files
> for other NL which definitely are less fractured/more homogeneous
>
>  Where can you find actual well-formed, declarative description of
> some NL grammar including language features, constrains and
> everything( possible ;-)) in XML format or Backus-Naur form or such
> in-depth theoretical studies?
>
>  thanks
>  lbrtchx
>
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora

-- 
Yannick Versley
Seminar für Sprachwissenschaft, Abt. Computerlinguistik
Wilhelmstr. 19, 72074 Tübingen
Tel.: (07071) 29 77352

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list