30.3545, Disc: Hard to Categorize Phrases

The LINGUIST List linguist at listserv.linguistlist.org
Fri Sep 20 09:32:54 UTC 2019


LINGUIST List: Vol-30-3545. Fri Sep 20 2019. ISSN: 1069 - 4875.

Moderator: Malgorzata E. Cavar (linguist at linguistlist.org)
Student Moderator: Jeremy Coburn
Managing Editor: Becca Morris
Team: Helen Aristar-Dry, Everett Green, Sarah Robinson, Peace Han, Nils Hjortnaes, Yiwen Zhang, Julian Dietrich
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Editor for this issue: Everett Green <everett at linguistlist.org>
================================================================


Date: Fri, 20 Sep 2019 05:32:21
From: Raymond Kao [rkao at ucsd.edu]
Subject: Hard to Categorize Phrases

 
Hi all! I am currently working on a string classification project using
Wikidata entries, and I am wondering whether there is an established
linguistic term for a compound phrase, i.e. a string that combines more than
one category. Here are a few examples from the project in its current state.

Simple Examples: 
1. "George Washington" - this string is a personal name.
2. "California" - this string is a toponym.

More Complex Examples: 
1. "The Vatican" - the entity that this refers to is a place of worship, but
the string itself cannot be a place of worship. Therefore, it is categorized
as a toponym.
2. "NATO" - the entity NATO is an international organization, but the string
"NATO" is categorized as an abbreviation.
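
The distinction drawn in these examples (label the string, not the entity it
refers to) could be sketched roughly as follows. This is only an illustration
under stated assumptions: the lookup sets, the all-capitals rule for
abbreviations, and the function name classify_string are hypothetical and are
not taken from the project itself.

```python
import re

# Hypothetical lookup sets, for illustration only. A real project would
# presumably derive such lists from Wikidata rather than hard-code them.
PERSONAL_NAMES = {"George Washington"}
TOPONYMS = {"California", "The Vatican"}


def classify_string(s: str) -> str:
    """Label the surface string itself, not the entity it refers to."""
    # Assumed rule: a run of two or more capital letters is an abbreviation.
    if re.fullmatch(r"[A-Z]{2,}", s):
        return "abbreviation"
    if s in PERSONAL_NAMES:
        return "personal name"
    if s in TOPONYMS:
        return "toponym"
    # Anything the simple rules cannot label is left open.
    return "unknown"


for example in ["George Washington", "California", "The Vatican", "NATO"]:
    print(example, "->", classify_string(example))
```

Under these assumptions "NATO" is labeled by its form (an abbreviation) even
though the entity is an organization, which mirrors the second example above.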

Unknown:
1. "David Bowie (1947-2016)" - it contains both a personal name and a time
interval, but what would we call the ENTIRE string itself?
2. "92 (88 men and 4 women)"
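
One possible way to handle such strings in code, independent of what the whole
string ends up being called, is to split off the trailing parenthetical and
label each part separately. The sketch below assumes that approach; the
function names split_composite and label_part, the label set, and the regular
expressions are all hypothetical, and it does not answer the terminological
question itself.

```python
import re
from typing import List, Tuple


def label_part(part: str) -> str:
    """Assign a coarse, hypothetical label to one component of a string."""
    # Assumed pattern: two four-digit years joined by a hyphen.
    if re.fullmatch(r"\d{4}\s*-\s*\d{4}", part):
        return "time interval"
    if re.fullmatch(r"\d+", part):
        return "number"
    return "other"


def split_composite(s: str) -> List[Tuple[str, str]]:
    """Split 'head (qualifier)' into labeled parts; leave other strings whole."""
    m = re.fullmatch(r"(.+?)\s*\((.+)\)", s)
    if m is None:
        return [(label_part(s), s)]
    head, qualifier = m.group(1), m.group(2)
    return [(label_part(head), head), (label_part(qualifier), qualifier)]


for example in ["David Bowie (1947-2016)", "92 (88 men and 4 women)"]:
    print(example, "->", split_composite(example))
```

The "other" label is a placeholder: in practice the head "David Bowie" would
be passed to whatever personal-name classifier the project already uses.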

Thank you so much in advance! If I can clarify anything further, please let me
know. 



Linguistic Field(s): Applied Linguistics
                     Cognitive Science
                     Computational Linguistics
                     General Linguistics
                     Syntax
                     Text/Corpus Linguistics

Subject Language(s): English (eng)



------------------------------------------------------------------------------
