[Corpora-List] arabic transliteration

Motaz SAAD motaz.saad at inria.fr
Tue May 29 10:57:32 UTC 2012


Hello, 

You can find implementations of Arabic transliteration in Perl at http://search.cpan.org/dist/Encode-Arabic/lib/Encode/Arabic/Buckwalter.pm and in Python at https://github.com/guysoft/3arabi 
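
For anyone who wants to see the idea before installing a module, here is a minimal Python sketch of a Buckwalter-style character mapping. It is not the code of either package above, only an illustration under the assumption that the input is plain Unicode Arabic. The names `BUCKWALTER`, `to_buckwalter` and `from_buckwalter` are made up for this example, and the table follows the mapping published at http://www.qamus.org/transliteration.htm as I understand it, so please check it against that page before relying on it.

```python
def _run(start, symbols):
    """Map consecutive Unicode code points, beginning at `start`, to symbols."""
    return {chr(start + i): s for i, s in enumerate(symbols)}


# Buckwalter table (illustrative; verify against the published chart).
BUCKWALTER = {
    **_run(0x0621, "'|>&<}AbptvjHxd*rzs$SDTZEg"),  # hamza .. ghain
    **_run(0x0640, "_fqklmnhwYyFNKaui~o"),         # tatweel, feh .. sukun
    "\u0670": "`",                                 # superscript (dagger) alif
    "\u0671": "{",                                 # alif wasla
}

_TO_ASCII = str.maketrans(BUCKWALTER)
_TO_ARABIC = str.maketrans({sym: ar for ar, sym in BUCKWALTER.items()})


def to_buckwalter(text):
    """Replace each Arabic character with its ASCII symbol; leave the rest."""
    return text.translate(_TO_ASCII)


def from_buckwalter(text):
    """Inverse mapping; only safe for text that is pure Buckwalter."""
    return text.translate(_TO_ARABIC)


if __name__ == "__main__":
    word = "\u0643\u062A\u0627\u0628"  # kaf, teh, alif, beh
    print(to_buckwalter(word))
```

Because the mapping is one character to one character, it can be reversed, but only if the transliterated text contains no ordinary Latin text that would be mistaken for Buckwalter symbols.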

all the best, 
Motaz 

----- Original Message -----

> From: "Imene Bensalem" <bens.imene at gmail.com>
> To: corpora at uib.no
> Sent: Saturday, May 26, 2012 2:53:21 PM
> Subject: Re: [Corpora-List] arabic transliteration

> Hi Amira,

> You could try the Buckwalter transliteration
> http://www.qamus.org/transliteration.htm , but it does not support
> punctuation. If your task needs to keep punctuation and you want
> to encode the transliterated text in ANSI or ASCII, you have to add
> transliterations of the Arabic punctuation marks to the Buckwalter
> table; for example, the Arabic comma should be transliterated to the
> Latin comma.
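
A minimal Python sketch of that suggestion, assuming the letter table is held as a plain dictionary from Arabic characters to ASCII symbols. The names `ARABIC_PUNCTUATION`, `extend_table`, `transliterate` and `SAMPLE_BASE` are hypothetical, and `SAMPLE_BASE` holds only three letters to keep the example short; a real table would have the full Buckwalter mapping.

```python
# Arabic punctuation marks and the Latin marks they would be mapped to.
ARABIC_PUNCTUATION = {
    "\u060C": ",",  # ARABIC COMMA
    "\u061B": ";",  # ARABIC SEMICOLON
    "\u061F": "?",  # ARABIC QUESTION MARK
}


def extend_table(base_table):
    """Return a copy of a letter table with the punctuation entries added."""
    merged = dict(base_table)
    merged.update(ARABIC_PUNCTUATION)
    return merged


def transliterate(text, table):
    """Apply a character-to-character table; unmapped characters pass through."""
    return text.translate(str.maketrans(table))


# Tiny illustrative base table: kaf, teh, beh only.
SAMPLE_BASE = {"\u0643": "k", "\u062A": "t", "\u0628": "b"}

if __name__ == "__main__":
    table = extend_table(SAMPLE_BASE)
    sentence = "\u0643\u062A\u0628\u060C \u0643\u062A\u0628\u061F"
    print(transliterate(sentence, table))
```

One thing to keep in mind: once Arabic and Latin punctuation share the same output symbols, converting back to Arabic script can no longer tell which one the original text used.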

> All the best !

> Imene

> > On Sat, May 26, 2012 at 11:00 AM, <corpora-request at uib.no> wrote:

> > > ------------------------------

> > > Message: 8
> > > Date: Fri, 25 May 2012 21:07:02 +0100 (BST)
> > > From: Amira Barhoumi <amirabarhoumi29 at yahoo.fr>
> > > Subject: [Corpora-List] arabic transliteration
> > > To: "Corpora at uib.no" <Corpora at uib.no>

> > > Hello,

> > > I want to know how to transliterate an Arabic sentence.
> > > Is there any norm?

> > > Thanks,
> > > Amira
> > > Master student in computer science

> > > ------------------------------

> > > Message: 9
> > > Date: Sat, 26 May 2012 00:31:51 +0000
> > > From: Yassine Benajiba <benajibayassine at gmail.com>
> > > Subject: Re: [Corpora-List] arabic transliteration
> > > To: Amira Barhoumi <amirabarhoumi29 at yahoo.fr>
> > > Cc: "Corpora at uib.no" <Corpora at uib.no>

> > > Did you look at Buckwalter transliteration? It's not a norm, but I
> > > think most of the people doing Arabic NLP use it. Here is a link:
> > > http://www.qamus.org/transliteration.htm

> > > Good luck!
> > > --Yassine.



> > > End of Corpora Digest, Vol 59, Issue 31
> > > ***************************************

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora

