Corpora: sgml detagger

Tine & Colleen tine.lassen at tdcadsl.dk
Tue Apr 16 18:13:22 UTC 2002


Hi,
I am compiling a corpus for research purposes, and some of the texts are SGML-tagged.
Does anybody know an easy way to remove the tags and save the texts as 'raw' .txt files?
Maybe a Perl script?

Thanks in advance

Tine Lassen
Copenhagen