[Corpora-List] Phrase similarity/relatedness dataset

Eneko Agirre e.agirre at ehu.es
Wed Oct 12 16:26:26 UTC 2011


Muhammad,

If you can wait a few weeks, we are preparing a task on Semantic 
Textual Similarity for the next SemEval 
(http://www.cs.york.ac.uk/semeval/task17/). We plan to release trial 
data in the coming days, and some training data in early December. You 
can sign up at http://groups.google.com/group/sts-semeval to get all 
relevant announcements.

best

eneko

On 10/04/2011 01:21 PM, Muhammad Muhammad wrote:
> Hi
>
> I have been working on compiling, from books of Quran commentary, a dataset of around 8,000 pairs of Quranic verses that are related in some way. To evaluate this dataset, I want to compare it with similar datasets in which phrase pairs are tagged as related by human judges. From my investigation, most existing datasets are small and deal mostly with pairs of words rather than phrases or sentences.
>
> Any help?
>
> Abdul-Baquee M. Sharaf
> PhD Student
> Language Technologies Group
> School of Computing
> University of Leeds
> UK
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>    

