[Corpora-List] Looking for a XML to TEXT convertor/editor

Oliver Mason O.Mason at bham.ac.uk
Mon Nov 27 16:54:16 UTC 2006


With sed it's even easier...

cat yourxmltext | sed 's/<[^>]*>//g' > yourplaintext

This removes everything in '<..>'; not as complete as Lou's earlier
suggestion regarding XSLT, but I guess it wins the prize for the
shortest solution...

Oliver

On 27/11/06, Daniel Zeman <zeman at ufal.mff.cuni.cz> wrote:
> If you have Perl on your machine (default on Linux), the attached Perl
> script could help you.



More information about the Corpora mailing list