[Corpora-List] Corpora Digest, Vol 24, Issue 1 (fwd)

Hercules Dalianis hercules at dsv.su.se
Tue Jun 2 19:42:41 UTC 2009


Dear Vasiliki!

Have you looked at these movie reviews, labelled pos/neg?
http://www.cs.cornell.edu/people/pabo/movie-review-data/

Regards

Hrakhlhs

On Mon, 1 Jun 2009, corpora-request at uib.no wrote:

> Today's Topics:
>
>   1. Re:  Tagging with synsets? (Ciarán Ó Duibhín)
>   2.  Call for papers : JADT 2010 (Anne Dister)
>   3. Re:  Tagging with synsets? (Tony Berber Sardinha)
>   4.  Corpus in English tagged with opinion or sentiment
>      (vrentoumi at iit.demokritos.gr)
>   5. Re:  Corpus in English tagged with opinion or sentiment
>      (CRuehlemann at aol.com)
>   6.  Third CFPs: Workshop Adaptation of Language Resources and
>      Technology to New Domains (Kiril Simov)
> 
> 
> ----------------------------------------------------------------------
> 
> Message: 1
> Date: Mon, 1 Jun 2009 01:44:57 +0100
> From: Ciarán Ó Duibhín 	<ciaran at oduibhin.freeserve.co.uk>
> Subject: Re: [Corpora-List] Tagging with synsets?
> To: <corpora at uib.no>
> 
> Thank you for all replies on this, which I summarize:
> . FreeLing  http://garraf.epsevg.upc.es/freeling/  (Francis Tyers) (compiled 
> application)
> . SenseRelate http://senserelate.sourceforge.net; web interface 
> http://marimba.d.umn.edu/allwords/allwords.html  (Ted Pedersen) (Perl)
> . UKB http://ixa2.si.ehu.es/ukb (Eneko Agirre)
> . SenseLearner http://lit.csci.unt.edu/~senselearner/ incl web interface 
> (Rada Mihalcea) (Perl)
> . Graph-based WSD 
> http://lit.csci.unt.edu/index.php/Downloads#GWSD:_Unsupervised_Graph-based_Word_Sense_Disambiguation 
> (Rada Mihalcea) (Perl)
> . NLTK http://nltk.googlecode.com/svn/trunk/doc/howto/wordnet.html ; 
> http://nltk.googlecode.com/svn/trunk/doc/book/ch02.html (Claire Brierley) 
> (Python)
> 
> I'm pleasantly surprised to find that some of these include implementations 
> of algorithms to tag running English text with preferred synsets.
> 
> Any of them could involve a struggle to get it working on Windows, but I have 
> a few possibilities there.
> 
> Thanks again,
> Ciarán Ó Duibhín.
> 
> 
> 
> 
> 
> 
> ------------------------------
> 
> Message: 2
> Date: Mon, 01 Jun 2009 10:13:35 +0200
> From: Anne Dister <anne.dister at uclouvain.be>
> Subject: [Corpora-List] Call for papers : JADT 2010
> To: <corpora at uib.no>
> 
> A non-text attachment was scrubbed...
> Name: not available
> Type: text/html
> Size: 4492 bytes
> Desc: not available
> Url : 
> http://www.uib.no/mailman/public/corpora/attachments/20090601/7a8f999c/attachment.txt
> 
> ------------------------------
> 
> Message: 3
> Date: Mon, 1 Jun 2009 00:42:31 -0300
> From: Tony Berber Sardinha <tony at corpuslg.org>
> Subject: Re: [Corpora-List] Tagging with synsets?
> To: Ciarán Ó Duibhín <ciaran at oduibhin.freeserve.co.uk>,	CORPORA
> 	<corpora at uib.no>
> 
> Dear all
> 
> Thanks for the thread and for this summary.
> 
> I installed freeling, which looks great, but its tags are numerical
> sense codes, for which I can't find an explanation. For example, in
> the sentence 'General Motors is beginning its reinvention', the verb
> 'beginning' is tagged as 00239960. I'd like to know what this tag
> means. Thanks ahead for any pointers.
> 
> bye
> 
> tony
> 
> 
> 
> On May 31, 2009, at 9:44 PM, Ciarán Ó Duibhín wrote:
> 
>> Thank you for all replies on this, which I summarize:
>> . FreeLing  http://garraf.epsevg.upc.es/freeling/  (Francis Tyers)
>> (compiled application)
>> . SenseRelate http://senserelate.sourceforge.net; web interface
>> http://marimba.d.umn.edu/allwords/allwords.html
>>   (Ted Pedersen) (Perl)
>> . UKB http://ixa2.si.ehu.es/ukb (Eneko Agirre)
>> . SenseLearner http://lit.csci.unt.edu/~senselearner/ incl web
>> interface (Rada Mihalcea) (Perl)
>> . Graph-based WSD
>> http://lit.csci.unt.edu/index.php/Downloads#GWSD:_Unsupervised_Graph-based_Word_Sense_Disambiguation
>> (Rada Mihalcea) (Perl)
>> . NLTK http://nltk.googlecode.com/svn/trunk/doc/howto/wordnet.html ;
>> http://nltk.googlecode.com/svn/trunk/doc/book/ch02.html
>>  (Claire Brierley) (Python)
>> 
>> I'm pleasantly surprised to find that some of these include
>> implementations of algorithms to tag running English text with
>> preferred synsets.
>> 
>> Any of them could involve a struggle to get it working on Windows,
>> but I have a few possibilities there.
>> 
>> Thanks again,
>> Ciarán Ó Duibhín.
>> 
>> 
>> 
>> 
>> 
>> _______________________________________________
>> Corpora mailing list
>> Corpora at uib.no
>> http://mailman.uib.no/listinfo/corpora
> 
> 
> 
> 
> ------------------------------
> 
> Message: 4
> Date: Mon, 1 Jun 2009 12:36:41 +0300 (EEST)
> From: vrentoumi at iit.demokritos.gr
> Subject: [Corpora-List] Corpus in English tagged with opinion or
> 	sentiment
> To: CORPORA at UIB.NO
> 
> -------------------------------------------------------------------
> 
> 
> Hello, the lab I am working for is looking to buy a corpus in English that
> has been annotated with opinion or sentiment. Can someone please provide
> relevant information?
> 
> Thank you in advance,
> 
> Vassiliki Rentoumi
> Phd Student
> NCSR Demokritos,
> IIT (Institute of Informatics and Telecommunications)
> SKEL (Software and Knowledge Laboratory)
> Athens
> Greece
> 
> 
> 
> 
> 
> ------------------------------
> 
> Message: 5
> Date: Mon, 1 Jun 2009 07:31:44 EDT
> From: CRuehlemann at aol.com
> Subject: Re: [Corpora-List] Corpus in English tagged with opinion or
> 	sentiment
> To: CORPORA at UIB.NO
> 
> 
> Hi Vassiliki,
> 
> you might check out the MPQA Corpus freely available at:
> 
> http://nrrc.mitre.org/NRRC/publications.htm
> 
> described in:
> 
> Wiebe, Janyce, Theresa Wilson, and Claire Cardie. "Annotating
> expressions of opinions and emotions in language."
> (http://www.cs.pitt.edu/~wiebe/pubs/papers/lre05.pdf) Language Resources and
> Evaluation 39(2-3): 165-210
> (available at: http://nrrc.mitre.org/NRRC/publications.htm)
> 
> Hope this helps
> Chris
> ------------------------------------------------------------------
> Dr. Christoph Rühlemann,  Munich
> 
> 
> ------------------------------
> 
> Message: 6
> Date: Mon, 1 Jun 2009 15:04:17 +0300
> From: "Kiril Simov" <kivs at bultreebank.org>
> Subject: [Corpora-List] Third CFPs: Workshop Adaptation of
> 	Language Resources and Technology to New Domains
> To: <corpora at uib.no>
> 
> Adaptation of Language Resources and Technology to New Domains
> (AdaptLRTtoND)
> http://www.bultreebank.org/AdaptLRTtoND/
> 
> RANLP 2009 Workshop
> http://www.lml.bas.bg/ranlp2009/
> 
> Motivation
> 
> It is widely acknowledged that despite the great advances in
> Computational Linguistics nowadays, the creation of new
> Language Resources (LR) and Language Technology (LT) for a
> new domain or task is still quite expensive and
> time-consuming. At the same time there are already a lot of
> varieties of LR and LT, developed for various languages and
> purposes. What happens when new tasks come? Do we have to
> develop new resources and technology from the beginning, or
> can we re-use or adapt the existing ones? A last, but not
> least, alternative is to combine both strategies depending on
> the task. The first option seems reasonable when richer and
> larger data is needed for the new applications. The second
> option is justified only if such a resource or technology
> does not exist at all, or some new approach is applied. The
> third one is the ever "compromising", but also very
> realistic option.
> As the machine learning techniques have matured enough to
> successfully support real applications within various
> domains, a new bottleneck became the requirement for large
> and adequate training data for input. Thus, the NLP
> community faced the question of the relevant LR and LT
> adaptation. It concerns the operability between general
> domain NLP toolkits and specific domain tasks with respect
> to terminology, language, structure, steps of preprocessing
> etc.
> Thus, the Workshop is devoted to various methods for
> transferring the linguistic knowledge and supportive
> technology from the existing language resources in one
> domain into a different one.
> 
> Topics
> 
> - parameters of adaptivity and re-usability of LR and LT
> - methods for adaptation of existing NLP resources to specific tasks
> - domain specific requirements to the LR and LT
> - general domain vs. specific domain processing
> - profiling LR
> - extrapolation of richer annotations to large data
> - evaluation of adapted LR and LT
> 
> Organizers
> 
> Núria Bel, Pompeu Fabra University
> Erhard Hinrichs, Tuebingen University (co-chair)
> Petya Osenova, Bulgarian Academy of Sciences and Sofia University
> Kiril Simov, Bulgarian Academy of Sciences (co-chair)
> 
> Invited speaker
> 
> Jun'ichi Tsujii, University of Tokyo and University of Manchester - NacTeM
> 
> Submission details
> 
> Authors are invited to submit an extended abstract up to 800
> words. Abstracts should describe existing research connected
> to the topics of the workshop. The following formats are
> accepted: PDF, PS, MS Word, ASCII text. Each submission
> should provide the following information: title; author(s);
> affiliation(s); and contact author's e-mail address, postal
> address.
> 
> The abstracts should be sent electronically to:
> Petya Osenova
> Email: petya at bultreebank.org
> by the deadline listed below. The submissions will be
> reviewed by the workshop's programme committee.
> 
> The accepted papers will appear in the workshop proceedings.
> The final paper should not exceed 15 A4 pages formatted
> according to the RANLP09 guidelines
> (http://www.lml.bas.bg/ranlp2009/).
> 
> 
> Important Dates
> 
> Deadline for abstract submission:   7th June 2009
> Notification of acceptance:         7th July 2009
> Final version of the papers:        23rd August 2009
> 
> Program Committee
> 
> Núria Bel, Pompeu Fabra University
> Gosse Bouma, Groningen University
> António Branco, Lisbon University
> Walter Daelemans, Antwerp University
> Markus Dickinson, Indiana University
> Erhard Hinrichs, Tuebingen University
> Josef van Genabith, Dublin City University
> Iryna Gurevych, Technische Universität Darmstadt - UKP Lab
> Atanas Kiryakov, Ontotext OOD
> Vladislav Kubon, Charles University
> Sandra Kuebler, Indiana University
> Lothar Lemnitzer, DWDS, Berlin-Brandenburgische Akademie der Wissenschaften
> Bernardo Magnini, FBK
> Detmar Meurers, Tuebingen University
> Paola Monachesi, Utrecht University
> Preslav Nakov, National University of Singapore
> John Nerbonne, Groningen University
> Petya Osenova, Bulgarian Academy of Sciences and Sofia University
> Gabor Proszeky, MorphoLogic
> Adam Przepiorkowski, Polish Academy of Sciences
> Marta Sabou, Open University - UK
> Kiril Simov, Bulgarian Academy of Sciences
> Cristina Vertan, Hamburg University
> 
> 
> 
> 
> ----------------------------------------------------------------------
> 
> End of Corpora Digest, Vol 24, Issue 1
> **************************************
>

___________________________________________________________________________
Dr. Hercules Dalianis
Associate Professor (Docent)    Department of Computer and Systems Sciences
ph:        +46 8 674 75 47	DSV-KTH-Stockholm University
mobile ph: +46 70 568 13 59	Forum 100
fax:       +46 8 703 90 25	164 40 Kista
email:     hercules at dsv.su.se	Stockholm, Sweden
www:       http://www.dsv.su.se/~hercules/
___________________________________________________________________________