[Corpora-List] What is best for text processing Perl of Python?

Alexandre Rafalovitch arafalov at gmail.com
Tue Mar 4 13:38:52 UTC 2008


I found Perl to be write-only language. Unless you use it a lot, the
syntax gets semi-forgotten soon and it becomes hard to read what you
wrote. Of course, with good documentation and practice, it is a great
language.

I heard good things about Python. I don't think PHP is suitable for
text processing; it is more for presentation (webpages).

Also, regular expressions (important requirement for text processing)
exist in other languages such as Java. If you use something like
Groovy, which is built on top of Java, you get flexibility of
scripting language, but at the same time access to libraries written
in Java (e.g. Stanford parsers). In the same way, JPython might be an
interesting consideration.

In summary, between Perl and Python, I would choose Python. But there
are more options too.

Regards,
   Alex
-- 
Personal blog: http://blog.outerthoughts.com/
Research group: http://www.clt.mq.edu.au/Research/


On Tue, Mar 4, 2008 at 3:25 AM, True Friend <true.friend2004 at gmail.com> wrote:
> I am wondering what language to learn for text processing. I've found
> scripting languages too easy for simple but teasing tasks in corpus text
> processing. Plz suggest me what language I should learn Perl or Python.
>  Regards

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list