Arabic-L:LING:Another arabiCorpus feature

Dilworth Parkinson dil at BYU.EDU
Tue Jul 5 17:56:45 UTC 2011


------------------------------------------------------------------------
Arabic-L: Tue 05 Jul 2011
Moderator: Dilworth Parkinson <dil at byu.edu>
[To post messages to the list, send them to arabic-l at byu.edu]
[To unsubscribe, send message from same address you subscribed from to
listserv at byu.edu with first line reading:
            unsubscribe arabic-l                                      ]

-------------------------Directory------------------------------------

1) Subject: Another arabiCorpus feature

-------------------------Messages-----------------------------------
1)
Date: 05 Jul 2011
From: Dil Parkinson <dil at byu.edu>
Subject: Another arabiCorpus feature

I have added a 'collocates' feature to the results on arabiCorpus.byu.edu. It gives you an ordered list of the most common word forms found within four words before or four words after the search term. Note that this is not comparable to the collocates feature on tagged corpora: since this is an untagged and unlemmatized corpus, the 'collocates' list deals with word forms, not 'words' or 'lemmas' (e.g. kitaab would be listed separately from Alkitaab, and separately again from kitaabuhu). It could still prove useful for checking out the 'neighborhood' of words you are interested in.
dil
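
[Editorial illustration, not part of the original message.] The window-based counting described above can be sketched roughly as follows. This is only a minimal sketch, not arabiCorpus's actual implementation: it assumes whitespace tokenization and exact word-form matching, and the function name `window_collocates`, its parameters, and the toy transliterated sample are all hypothetical.

```python
from collections import Counter


def window_collocates(tokens, term, span=4, top=10):
    """Count word forms within `span` tokens before/after each hit of `term`.

    Assumption: `tokens` is an already-tokenized list of word forms and
    matching is exact, so 'kitaab' does not match 'Alkitaab' or 'kitaabuhu'.
    """
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok != term:
            continue
        # Up to `span` tokens on each side, excluding the hit itself.
        start = max(0, i - span)
        window = tokens[start:i] + tokens[i + 1:i + 1 + span]
        counts.update(window)
    # Most frequent word forms first.
    return counts.most_common(top)


# Made-up toy input; only the bare form 'kitaab' counts as a hit here.
sample = "wa kitaab jadiid wa Alkitaab qadiim wa kitaabuhu jadiid".split()
print(window_collocates(sample, "kitaab"))
```

Because nothing is lemmatized, prefixed and suffixed forms show up in the list as separate entries, which is the limitation noted in the message.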

--------------------------------------------------------------------------
End of Arabic-L: 05 Jul 2011