Soft: morphological-semantical parser for English compound participles

alexis nasr alexis.nasr at
Wed Mar 21 18:00:13 UTC 2001

A prototype morphological-semantical parser for English compound participles
is now available at

Any comments, criticisms, suggestions for improvements, etc. are most
welcome, esp. from native English speakers.


CP-Parser 2 (CP2) parses English NP constructions of the following syntax:

NP ::= <prefix> + '-' + <participle> + ' ' + <head>
Prefix ::= <noun> | <adj> | <adv> | <prep> | <LBM>
Participle ::= <present participle> | <past participle>
Head ::= <noun>

where LBM = lexical, bound morpheme.
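
The surface pattern above can be illustrated with a minimal Python sketch.
This is not CP2's algorithm: it only splits a string of the shape
prefix, hyphen, participle, space, head, and guesses the participle type
from the "-ing" suffix. The names split_compound and CompoundParticiple and
the example phrases are hypothetical, chosen for illustration; a real parser
would consult a lexicon rather than rely on a suffix heuristic.

```python
import re
from typing import NamedTuple, Optional


class CompoundParticiple(NamedTuple):
    """One analysed NP of the form <prefix>-<participle> <head> (hypothetical type)."""
    prefix: str
    participle: str
    participle_type: str  # "present" or "past", guessed from the suffix only
    head: str


# Surface pattern from the grammar: <prefix> '-' <participle> ' ' <head>.
# Assumption: each slot is a single alphabetic token.
_PATTERN = re.compile(r"^([A-Za-z]+)-([A-Za-z]+) ([A-Za-z]+)$")


def split_compound(phrase: str) -> Optional[CompoundParticiple]:
    """Split a phrase into prefix, participle and head, or return None."""
    match = _PATTERN.match(phrase.strip())
    if match is None:
        return None
    prefix, participle, head = match.groups()
    # Crude heuristic: "-ing" forms count as present participles and
    # everything else as past participles. Irregular forms and
    # non-participles are not checked here.
    kind = "present" if participle.lower().endswith("ing") else "past"
    return CompoundParticiple(prefix, participle, kind, head)


if __name__ == "__main__":
    for example in ("man-eating shark", "home-made bread", "shark"):
        print(example, "->", split_compound(example))
```

The sketch covers only the syntactic shape of the NP; valency and semantic
selectional restrictions are outside its scope.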

CP2 includes approx. 100 present and 100 past participle examples, which
were extracted from the British National Corpus (BNC), using the Corpus
Query Processor (CQP) tool © IMS, Stuttgart University.

The BNC data provides the core of CP2's lexicon, which was formatted by
means of the WordSmith application. The word lists were then tagged and
lemmatized by Conexor's web tagger at Finally, information
concerning valency, semantic selectional restrictions and semantic
categorization was added manually.

The morphological-semantical parsing algorithm builds on the principles set
out at - further documentation is forthcoming.

Best regards

Jens Ahlmann Hansen

Karlsbjergvej 39 C
DK-5672 Broby

mailto:jahlmann at
TEL: 00 45 65 50 33 10 (office)
     00 45 62 63 39 39 (home)
     00 45 29 46 43 22 (mobile)
Message distributed via the Langage Naturel list <LN at>

The LN list is sponsored by ATALA (Association pour le Traitement
Automatique des Langues)