[Corpora-List] Corpus Development

fatima zuhra fateeshah at yahoo.com
Sat Apr 19 02:25:10 UTC 2008


Hi All,
   
  Thanks a lot to all, who paid attention to my message and provided me with their valuable suggestions.
   
  Dear Laxmi, my corpus is a general-purpose corpus of written Pashto. Dear Mr. Adam, the corpus currently contains 30,000 words and its size is increasing.  I haven't used Xiara, but am interested in using it. Dear Lou, I'll be too much thankful to you if you help me further by forwarding me some guidelines about Xiara. The web page http://www.xaira.net/  cannot be displayed in my browser. 
   
  Dear Gee Raza, I am also glad to see someone from Pakistan on the list. Well, I only know the three languages, you have mentioned, but am interested in learning Arabic and Persian. I hope I'll soon learn these two.
   
  Dear Oliver, I meant to ask that am I going in a right direction for a general-purpose Pashto corpus? By fully functional, I mean something that can be rightly called a corpus. I also wanted to investigate the appropriate statistical measures, which can be used for the evaluation of any newly developed software. In our country, there are statisticians, who know each and every statistical measure, but cannot guide us which one to use for which purpose. If there are some, who can guide, we do not have access to them.
   
  Thanks to Sir Ramesh for his encouragement and valuable suggestions.
   
  I have also developed a finite state morphological analyzer for Pashto. I will provide the details from time to time. 
   
  Regards.

       
---------------------------------
Be a better friend, newshound, and know-it-all with Yahoo! Mobile.  Try it now.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20080418/ece2d4c3/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list