[Corpora-List] EmoText - Software for opinion mining and lexical affect sensing
Michal Ptaszynski
ptaszynski at media.eng.hokudai.ac.jp
Tue Dec 20 01:40:08 UTC 2011
Hi Alexander,
A few comments on the statistical engine.
I tried a couple of reviews from Amazon. Among different feature sets from
1 to 6, always one is close to the amazon's ranking, but unfortunately its
never one feature set in particular, but rather randomly one from the six.
Besides the closest method, all other are usually reversed (e.g., if the
closest method gives 5 star, all other give 1). However, this might have
just happen for those couple examples I tried (Reviews of Kindle on
Amazon).
NaiveBayes seems to be hitting closer.
SVMs are very slow or freeze (or perhaps its just your machine getting
busy with the traffic).
Best,
Michal
------------------
Od: Alexander Osherenko <osherenko at gmx.de>
Do: corpora at uib.no
Data: Mon, 19 Dec 2011 15:43:27 +0100
Temat: Re: [Corpora-List] EmoText - Software for opinion mining and
lexical affect sensing
I published a more comprehensive version of the statistical engine that
additionally considers the BNC frequency list. The reason why I didn't do
it previously : the BNC processing needs more computational power and is
therefore slower. However, in the current version processing is slower but
actually OK.
Hence, I process three sources of lexical features: the corpus frequency
list, BNC, Whissell's DAL. In the PhD thesis, I had additionally three
lemmatized lists but the performance was not much better that's why I
don't consider them in the current demo version. Stylistic difference: the
frequency list is possibly tailored to the corpus and therefore only
appropriate for opinion mining in this only corpus, the BNC frequency list
is general, DAL contains emotion words. In my opinion, this demo version
is beneficial for answering the question we discussed previously in this
mailing list about the types of features and differences in votes.
The link: www.socioware.de/EmoTextDemoWithBNC
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list