Corpora: sgml detagger
Tine & Colleen
tine.lassen at tdcadsl.dk
Tue Apr 16 18:13:22 UTC 2002
Hi
I am compiling a corpus for research reasons and some of the texts are sgml-tagged.
Does anybody know an easy way to remove the tags and save the texts as 'raw' .txt files?
Maybe a PERL script?
Thanks in advance
Tine Lassen
Copenhagen
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20020416/76e62c42/attachment.htm>
More information about the Corpora
mailing list