31.3638, Software: KLPT - the Kurdish Language Processing Toolkit

The LINGUIST List linguist at listserv.linguistlist.org
Wed Nov 25 23:08:06 UTC 2020


LINGUIST List: Vol-31-3638. Wed Nov 25 2020. ISSN: 1069 - 4875.

Subject: 31.3638, Software: KLPT - the Kurdish Language Processing Toolkit

Moderator: Malgorzata E. Cavar (linguist at linguistlist.org)
Student Moderator: Jeremy Coburn
Managing Editor: Becca Morris
Team: Helen Aristar-Dry, Everett Green, Sarah Robinson, Lauren Perkins, Nils Hjortnaes, Yiwen Zhang, Joshua Sims
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Please support the LL editors and operation with a donation at:
           https://funddrive.linguistlist.org/donate/

Editor for this issue: Everett Green <everett at linguistlist.org>
================================================================


Date: Wed, 25 Nov 2020 18:07:23
From: Sina Ahmadi [ahmadi.sina at outlook.com]
Subject: KLPT - the Kurdish Language Processing Toolkit

 
[with apologies for cross-posting]

I am thrilled to be releasing the Kurdish Language Processing Toolkit
(https://github.com/sinaahmadi/klpt).

KLPT is a natural language processing (NLP) toolkit in Python for the Kurdish
language, a less-resourced Indo-European language which is spoken by 20-30
million speakers. This initial version comes with four core modules for the
Sorani and Kurmanji dialects of Kurdish, namely preprocess, stem,
transliterate and tokenize, and addresses basic language processing tasks such
as:

- text preprocessing
- stemming
- tokenziation
- spell error detection and correction
- morphological analysis

More importantly, it is an open-source project!

I hope that this toolkit will pave the way for further advances in Kurdish
language processing and that it receives more attention in the NLP field.

Best regards,
Sina Ahmadi
http://sinaahmadi.github.io/


Linguistic Field(s): Computational Linguistics

Subject Language(s): Kurdish, Central (ckb)
                     Kurdish, Northern (kmr)
                     Kurdish, Southern (sdh)



------------------------------------------------------------------------------

***************************    LINGUIST List Support    ***************************
 The 2020 Fund Drive is under way! Please visit https://funddrive.linguistlist.org
  to find out how to donate and check how your university, country or discipline
     ranks in the fund drive challenges. Or go directly to the donation site:
                   https://crowdfunding.iu.edu/the-linguist-list

                        Let's make this a short fund drive!
                Please feel free to share the link to our campaign:
                    https://funddrive.linguistlist.org/donate/
 


----------------------------------------------------------
LINGUIST List: Vol-31-3638	
----------------------------------------------------------






More information about the LINGUIST mailing list