[Corpora-List] tool for extracting text from web forum and websites

Isabella Chiari isabella.chiari at uniroma1.it
Wed Oct 14 09:22:56 UTC 2009


Dear Linguists,

I need a tool for extracting all the text from pages and subpages of a Web
Forum. I do not need a cleaning tool at the moment.

Can you suggest a tool to perform this operation?

Thanks,

Isabella Chiari

 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20091014/d22d75e5/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list