[Corpora-List] EmoText - Software for opinion mining and lexical affect sensing

Michal Ptaszynski ptaszynski at media.eng.hokudai.ac.jp
Tue Dec 20 01:40:08 UTC 2011


Hi Alexander,

A few comments on the statistical engine.

I tried a couple of reviews from Amazon. Among the feature sets from
1 to 6, one is always close to Amazon's rating, but unfortunately it's
never one feature set in particular, but rather a seemingly random one of
the six.

Besides the closest method, all the others are usually reversed (e.g., if
the closest method gives 5 stars, all the others give 1). However, this
might have just happened for the couple of examples I tried (reviews of the
Kindle on Amazon).

NaiveBayes seems to be hitting closer.

SVMs are very slow or freeze (or perhaps it's just your machine getting
busy with the traffic).

Best,

Michal


------------------
From: Alexander Osherenko <osherenko at gmx.de>
To: corpora at uib.no
Date: Mon, 19 Dec 2011 15:43:27 +0100
Subject: Re: [Corpora-List] EmoText - Software for opinion mining and
lexical affect sensing

I published a more comprehensive version of the statistical engine that
additionally considers the BNC frequency list. The reason I didn't do
this previously: BNC processing needs more computational power and is
therefore slower. In the current version, processing is indeed slower but
still acceptable.

Hence, I process three sources of lexical features: the corpus frequency
list, the BNC, and Whissell's DAL. In the PhD thesis, I additionally had
three lemmatized lists, but the performance was not much better, which is
why I don't consider them in the current demo version. The sources differ
in character: the corpus frequency list is possibly tailored to the corpus
and therefore only appropriate for opinion mining in this one corpus, the
BNC frequency list is general, and the DAL contains emotion words. In my
opinion, this demo version is useful for answering the question we
discussed previously on this mailing list about the types of features and
differences in votes.
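
For readers who want a concrete picture of the three feature sources, here
is a minimal sketch in Python. It is not the EmoText implementation: the
word lists are tiny made-up stand-ins for the corpus frequency list, the
BNC frequency list, and the DAL, and the function names are hypothetical.
It only shows the general idea of turning one text into one count vector
per lexical source.

```python
from collections import Counter

# Hypothetical toy lexicons (assumptions for illustration only); the real
# corpus frequency list, BNC frequency list, and DAL are far larger.
CORPUS_WORDS = ["kindle", "battery", "screen"]
BNC_WORDS = ["the", "is", "great"]
DAL_WORDS = ["great", "awful", "love"]


def count_features(text, vocabulary):
    """Return one occurrence count per vocabulary word, in vocabulary order."""
    counts = Counter(text.lower().split())
    return [counts[word] for word in vocabulary]


def extract_feature_sets(text):
    """Build a separate feature vector for each lexical source."""
    return {
        "corpus": count_features(text, CORPUS_WORDS),
        "bnc": count_features(text, BNC_WORDS),
        "dal": count_features(text, DAL_WORDS),
    }


features = extract_feature_sets("The Kindle screen is great great")
print(features)
```

Each vector could then be passed to a separate classifier, so that the
predictions obtained from the different sources can be compared.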

The link: www.socioware.de/EmoTextDemoWithBNC
