[Corpora-List] python's NLTK vs R's TM

Michele Filannino michele.filannino at cs.manchester.ac.uk
Mon Aug 20 08:34:44 UTC 2012


Hi Matías,

I would suggest you NLTK for Python. You can start using the book published
by O'Reilly, it's very easy and effective. It fits your needs.

Bye,
michele.

On Mon, Aug 20, 2012 at 1:04 AM, Matías Guzmán <mortem.dei at gmail.com> wrote:

> Dear all,
>
> I'm not a very strong programmer but I know a bit of python and a bit of
> R, and I was wandering which is better for corpus work. I'm not interesting
> in creating any fancy language technology thingy, I just need to extract
> raw text from documents off and on-line, analyze them and perform some
> basic statistics on them. Which one would you recommend? should I use both?
>
> Thanks,
>
> Matías Guzmán
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
>


-- 
Michele Filannino

CDT PhD student in Computer Science
Room IT301 - IT Building
The University of Manchester
filannim at cs.manchester.ac.uk
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20120820/f8e731ed/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list