18.267, Qs: Categorisation of Semantic Relations/American English Corpus

LINGUIST Network linguist at LINGUISTLIST.ORG
Thu Jan 25 21:47:19 UTC 2007


LINGUIST List: Vol-18-267. Thu Jan 25 2007. ISSN: 1068 - 4875.

Subject: 18.267, Qs: Categorisation of Semantic Relations/American English Corpus

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
 
Reviews: Laura Welcher, Rosetta Project / Long Now Foundation  
         <reviews at linguistlist.org> 

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.

Editor for this issue: Kevin Burrows <kevin at linguistlist.org>
================================================================  

We'd like to remind readers that the responses to queries are usually
best posted to the individual asking the question. That individual is
then strongly encouraged to post a summary to the list. This policy was
instituted to help control the huge volume of mail on LINGUIST; so we
would appreciate your cooperating with it whenever it seems appropriate.

In addition to posting a summary, we'd like to remind people that it
is usually a good idea to personally thank those individuals who have
taken the trouble to respond to the query.

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.


===========================Directory==============================  

1)
Date: 22-Jan-2007
From: Jo Wiltshire < jo.wiltshire at gmail.com >
Subject: Categorisation of Semantic Relations 

2)
Date: 22-Jan-2007
From: Christopher Maloof < cjm62 at georgetown.edu >
Subject: Large Amer English Corpus with Parts of Speech 

	
-------------------------Message 1 ---------------------------------- 
Date: Thu, 25 Jan 2007 16:45:37
From: Jo Wiltshire < jo.wiltshire at gmail.com >
Subject: Categorisation of Semantic Relations 
 


I am just commencing a PhD in Psychology looking at children's 
understanding of semantic relations and how this relates to their 
understanding of analogy. My first problem is how to choose the relations 
which we should look at. For example, previous research has looked at 
relations such as 'part of', 'lives in', 'is made of', etc. But I don't simply want 
to pick the first relations that come in to my head. 

Can you recommend any books, papers or other resources which attempt 
to list and categorise relations in a meaningful way? Several authors appear 
to have demonstrated how most relations can be put in to five or six basic 
categories, however, this tends to end up with quite abstract categories 
which aren't suitable for tasks with children. What I really need is something 
which categorises and organises relations in a more fine grained way. 

Linguistic Field(s): Semantics


	
-------------------------Message 2 ---------------------------------- 
Date: Thu, 25 Jan 2007 16:45:43
From: Christopher Maloof < cjm62 at georgetown.edu >
Subject: Large Amer English Corpus with Parts of Speech 

	

We are looking for a large word frequency list for English which
distinguishes parts of speech.  We're currently using counts from an
automatically-tagged corpus of 50 million words of AP newswire text, but
this isn't enough to reliably distinguish counts of moderately infrequent
words, especially for inflected forms like past participles.  

Does anyone know of any large, POS-tagged corpus of American English, or a
list of word frequency counts from such a corpus?  We're more concerned
about size than balance; a very large list from a single source (such as
newswire text) would be fine.

Please e-mail any suggestions to Chris Maloof at cjm62 at georgetown.edu.

Thanks very much,

Christopher Maloof
Michael Ullman

Brain and Language Lab, Georgetown University 

Linguistic Field(s): Text/Corpus Linguistics
 



-----------------------------------------------------------
LINGUIST List: Vol-18-267	

	



More information about the LINGUIST mailing list