25.2125, Software: Computational Linguistics, Syntax: SentiLecto

The LINGUIST List linguist at linguistlist.org
Tue May 13 15:18:50 UTC 2014


LINGUIST List: Vol-25-2125. Tue May 13 2014. ISSN: 1069 - 4875.

Subject: 25.2125, Software: Computational Linguistics, Syntax: SentiLecto

Moderators: Damir Cavar, Eastern Michigan U <damir at linguistlist.org>

Reviews: Monica Macaulay, U of Wisconsin Madison
Rajiv Rao, U of Wisconsin Madison
Joseph Salmons, U of Wisconsin Madison
Mateja Schuck, U of Wisconsin Madison
Anja Wanner, U of Wisconsin Madison
       <reviews at linguistlist.org>

Homepage: http://linguistlist.org

Do you want to donate to LINGUIST without spending an extra penny? Bookmark
the Amazon link for your country below; then use it whenever you buy from
Amazon!

USA: http://www.amazon.com/?_encoding=UTF8&tag=linguistlist-20
Britain: http://www.amazon.co.uk/?_encoding=UTF8&tag=linguistlist-21
Germany: http://www.amazon.de/?_encoding=UTF8&tag=linguistlistd-21
Japan: http://www.amazon.co.jp/?_encoding=UTF8&tag=linguistlist-22
Canada: http://www.amazon.ca/?_encoding=UTF8&tag=linguistlistc-20
France: http://www.amazon.fr/?_encoding=UTF8&tag=linguistlistf-21

For more information on the LINGUIST Amazon store please visit our
FAQ at http://linguistlist.org/amazon-faq.cfm.

Editor for this issue: Andrew Lamont <alamont at linguistlist.org>
================================================================  


Date: Tue, 13 May 2014 11:18:25
From: Fernando Balbachan [fernando_balbachan at yahoo.com.ar]
Subject: Computational Linguistics, Syntax: SentiLecto

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=25-2125.html&submissionid=32568797&topicid=13&msgnumber=1
 
Spanish is particularly challenging for syntax analysis, as it is a free-order constituency language. Full parsing in Spanish is very tough, as it is not obvious where to find high-level syntax functions as subject, direct object, etc. Moreover, Spanish deals with a morphologically rich system in pronouns, agreements, etc., which makes the task harder.

SentiLecto is our Spanish Sentiment Analysis solution at Tecnolecto http://tecnolecto.com/sentilecto 
This solution yields a highly fine-grained representation for the entities involved in each opinion. Unlike other approaches, this solution can deal with polarity shifting in the same sentence ('I like chocolate but I hate strawberry ice-cream') or even within embedded clauses ('Norwegians, who are an aggressive people, export the exquisite herring'). SentiLecto better represents the assumptions whereby the entities involved in the opinion are syntactically mapped onto SVO (subject-verb-object) slots for their sentiment assignments: 'Mary hates John' (2 entities but only the object has a negative presentation) vs. 'Mary harasses John' (the same 2 entities but only the subject has negative presentation).

SentiLecto leans on outstanding linguistic features such as: passive/active voice transformation, anaphora resolution and co-reference chains, modality treatment and accurate verb scripts for all verbs in Spanish, even with different pronominal cases (for example, 'destacar' 'to appraise something' vs. 'destacarSE' 'to highlight oneself from the rest')

Try it out and behold the results! http://tecnolecto.com/sentilecto

Fernando Balbachan
Grupo Tecnolecto
Senior Computational Linguist
fernando_balbachan at yahoo.com.ar

Disclaimer: So far, we have released the syntax analysis, including passive/active voice transformation, clause extraction, anaphora resolution but we are currently working on modality and sentiment analysis (proof of concept in http://tecnolecto.com/sentitext). Also, we will develop a fact extraction and checking module (through DBpedia) and adapt this approach for English in a near future.

Linguistic Field(s): Computational Linguistics
                     Syntax

Subject Language(s): Spanish (spa)






------------------------------------------------------------------------------
This Year the LINGUIST List hopes to raise $75,000. This money will go to help keep the List running by supporting all of our Student Editors for the coming year.

See below for donation instructions, and don't forget to check out Fund Drive 2014 site!

http://linguistlist.org/fund-drive/2014/

There are many ways to donate to LINGUIST!

You can donate right now using our secure credit card form at https://linguistlist.org/donation/donate/donate1.cfm

Alternatively you can also pledge right now and pay later. To do so, go to: https://linguistlist.org/donation/pledge/pledge1.cfm

For all information on donating and pledging, including information on how to donate by check, money order, PayPal or wire transfer, please visit: http://linguistlist.org/donation/

The LINGUIST List is under the umbrella of Eastern Michigan University and as such can receive donations through the EMU Foundation, which is a registered 501(c) Non Profit organization. Our Federal Tax number is 38-6005986. These donations can be offset against your federal and sometimes your state tax return (U.S. tax payers only). For more information visit the IRS Web-Site, or contact your financial advisor.

Many companies also offer a gift matching program, such that they will match any gift you make to a non-profit organization. Normally this entails your contacting your human resources department and sending us a form that the EMU Foundation fills in and returns to your employer. This is generally a simple administrative procedure that doubles the value of your gift to LINGUIST, without costing you an extra penny. Please take a moment to check if your company operates such a program.

Thank you very much for your support of LINGUIST!
 


----------------------------------------------------------
LINGUIST List: Vol-25-2125	
----------------------------------------------------------



More information about the LINGUIST mailing list