Thanks Diana for your response and your paper.<div><br></div><div>I explain you what I want to do. I have done several experiments with tweets in Spanish following a machine learning approach, but the problem is I don't have a corpus with a reliable labelling, so I want to build a corpus with a manual labelling. So I've downloaded a set of politic tweets during the last Spanish elections. For the manual labelling process, I want to automatically delete those tweets that aren't opinions. So I'm looking for a Spanish or English word list of opinion words. If a tweet doesn't contain any opinion word I consider that it isn't an opinion tweet. I know that a person can express a politic opinion without using any typical opinion word, but it is a simple heuristic to reduce the set of tweets to be manually labelling.</div>
<div><br></div><div>Regards.</div><div><br clear="all">Eugenio Martínez Cámara.<div>Grupo de Investigación SINAI.</div><div>Departamento de Informática.<br><div>Universidad de Jaén.</div><div>emcamara at ujaen dot es<br>
<br>
</div></div><br>
<br><br><div class="gmail_quote">El 17 de diciembre de 2011 19:40, Diana Maynard <span dir="ltr"><<a href="mailto:d.maynard@dcs.shef.ac.uk">d.maynard@dcs.shef.ac.uk</a>></span> escribió:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Hi Eugenio<br>
Are you asking for some gazetteer list of opinionated words to determine whether a tweet is opinionated or not? Or are you asking for some method which uses bag-of-words (matching against such a list) in order to compare your tools with.<br>
If the former, obviously you want to be very careful about using such an approach on its own, because there are lots of words which can convey an opinion or not, depending how they are used.<br>
<br>
I am also working on opinion mining from tweets, for English and German, on political tweets amongst other things. You can see my paper about this for English here:<br>
<br>
D. Maynard and A. Funk. Automatic detection of political opinions in tweets. In Proceedings of MSM 2011: Making Sense of Microposts. Workshop at 8th Extended Semantic Web Conference (ESWC 2011). Heraklion, Greece. June 2011.<br>
<a href="http://gate.ac.uk/sale/eswc11/opinion-mining.pdf" target="_blank">http://gate.ac.uk/sale/eswc11/<u></u>opinion-mining.pdf</a><br>
<br>
There is also an extended version currently in press.<br>
Regards<span class="HOEnZb"><font color="#888888"><br>
Diana</font></span><div><div class="h5"><br>
<br>
<br>
On 17/12/2011 16:05, Eugenio Martínez Cámara wrote:<br>
</div></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div><div class="h5">
Hi All,<br>
<br>
Currently I'm working in Sentiment Analysis on Twitter. I have done<br>
several experiments with Spanish Twitter corpus following the Go et al.<br>
(2009) noisy labels technique, but I want to build a gold standard. So,<br>
I downloaded a corpus of Spanish tweets in the politic domain. At first,<br>
I want to erase all non-opinion tweets, so I'm going to delete all<br>
tweets that not contain any opinion word. So, do you know any Spanish<br>
opinion bag-of-words (positive/negative)? do you know any English<br>
opinion bag-of-words (positive/negative)?<br>
<br>
Thanks.<br>
<br>
<br>
Eugenio Martínez Cámara.<br>
SINAI Research Group<br>
Computer Science Department<br>
University of Jaén<br>
emcamara at ujaen dot es<br>
<br>
<br>
<br>
<br></div></div><div class="im">
______________________________<u></u>_________________<br>
UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/<u></u>corpora</a><br>
Corpora mailing list<br>
<a href="mailto:Corpora@uib.no" target="_blank">Corpora@uib.no</a><br>
<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/<u></u>listinfo/corpora</a><br>
</div></blockquote>
<br>
</blockquote></div><br></div>