[Corpora-List] Arabic Gigaword Annotation

Abdusalam F Ahmad Nwesri a.nwesri at student.rmit.edu.au
Sat Dec 1 05:16:05 UTC 2007


Hi,

I have built a tool for collecting manual relevance judgements for the Arabic Gigaword (AGW) corpus. The corpus is by far the largest available from the LDC.

Most Arabic information retrieval systems have been evaluated using the AFP TREC2001 corpus and its 75 queries. That corpus is relatively small compared to English corpora; AGW is five times bigger.

So far, a group of 20 people has contributed about 20,000 judgements for around 80 queries.

If you are a native Arabic speaker, I would highly appreciate your help in building this ground truth. If you add one topic, find its relevant documents, and mark them, you will have contributed another topic to the judgement set. I am looking to collect as many judgements as possible.

I will make this ground truth available to the research community once I finish my evaluations.  

The annotation tool is available at:

http://goanna.cs.rmit.edu.au/~nwesri/agw/index.php

Once again, I am looking for your support and hope that this will benefit Arabic IR.


Thanks in advance,

Abdusalam Nwesri
PhD Student,
School of Computer Science and IT,
RMIT University,
Melbourne,
Australia.

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


