<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=Content-Type content="text/html; charset=iso-8859-1">
<META content="MSHTML 6.00.2713.1100" name=GENERATOR>
<STYLE></STYLE>
</HEAD>
<BODY bgColor=#ffffff>
<DIV>Hi:</DIV>
<DIV>This Perl script should do the job:</DIV>
<DIV> </DIV>
<DIV>print "What is your input file
name:\n";<BR>chomp($infile=<STDIN>);<BR>open IN, $infile or die "No file,
no fun!";<BR>open OUT, ">$infile.out" or die "No file, no fun!";<BR>while
(<IN>) {<BR>
$_=~s/\<.+?\>//g;<BR> print OUT
"$_";<BR> }<BR>close (IN) or die "D'oh!";<BR>close (OUT) or
die "D'oh!";<BR></DIV>
<DIV>Best,</DIV>
<DIV> </DIV>
<DIV>Danko Sipka</DIV>
<DIV><A href="mailto:sipkadan@main.amu.edu.pl">sipkadan@main.amu.edu.pl</A> | <A
href="mailto:Danko.Sipka@asu.edu">Danko.Sipka@asu.edu</A></DIV>
<DIV><A
href="http://main.amu.edu.pl/~sipkadan">http://main.amu.edu.pl/~sipkadan</A> |
<A
href="http://www.public.asu.edu/~dsipka">http://www.public.asu.edu/~dsipka</A></DIV>
<DIV> </DIV>
<DIV> </DIV>
<BLOCKQUOTE dir=ltr
style="PADDING-RIGHT: 0px; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #000000 2px solid; MARGIN-RIGHT: 0px">
<DIV style="FONT: 10pt arial">----- Original Message ----- </DIV>
<DIV
style="BACKGROUND: #e4e4e4; FONT: 10pt arial; font-color: black"><B>From:</B>
<A title=tine.lassen@tdcadsl.dk href="mailto:tine.lassen@tdcadsl.dk">Tine
& Colleen</A> </DIV>
<DIV style="FONT: 10pt arial"><B>To:</B> <A title=CORPORA@HD.UIB.NO
href="mailto:CORPORA@HD.UIB.NO">CORPORA@HD.UIB.NO</A> </DIV>
<DIV style="FONT: 10pt arial"><B>Sent:</B> Tuesday, April 16, 2002 8:13
PM</DIV>
<DIV style="FONT: 10pt arial"><B>Subject:</B> Corpora: sgml detagger</DIV>
<DIV><BR></DIV>
<DIV><FONT face=Arial color=#800000 size=2>Hi</FONT></DIV>
<DIV><FONT face=Arial color=#800000 size=2>I am compiling a corpus for
research reasons and some of the texts are sgml-tagged.</FONT></DIV>
<DIV><FONT face=Arial color=#800000 size=2>Does anybody know an easy way to
remove the tags and save the texts as 'raw' .txt files?</FONT></DIV>
<DIV><FONT face=Arial color=#800000 size=2>Maybe a PERL script?</FONT></DIV>
<DIV><FONT face=Arial color=#800000 size=2></FONT> </DIV>
<DIV><FONT face=Arial color=#800000 size=2>Thanks in advance</FONT></DIV>
<DIV><FONT face=Arial color=#800000 size=2></FONT> </DIV>
<DIV><FONT face=Arial color=#800000 size=2>Tine Lassen</FONT></DIV>
<DIV><FONT face=Arial color=#800000
size=2>Copenhagen</FONT></DIV></BLOCKQUOTE></BODY></HTML>