Extract word list from a mor tagged corpus, restore the original form of the tagged words

Paul Feitzinger paul.feitzinger at gmail.com
Thu Dec 20 16:35:10 UTC 2012


The Python Natural Language Toolkit has a corpus reader for XML
Childes corpora. If you know any python, you could write a quick
script to output only the main-tier form and the parts of the
tag-cluster you care about, and then match against that.
https://github.com/nltk/nltk/blob/master/nltk/corpus/reader/childes.py

Here's a tutorial by the author, though I'm not sure how up to date it
is.
http://faculty.laguardia.edu/tnagano/research/CHILDES/

paul

-- 
You received this message because you are subscribed to the Google Groups "chibolts" group.
To post to this group, send email to chibolts at googlegroups.com.
To unsubscribe from this group, send email to chibolts+unsubscribe at googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.



More information about the Chibolts mailing list