Corpora: Tools to convert HTML files into plain text
htakashi at mse.biglobe.ne.jp
htakashi at mse.biglobe.ne.jp
Mon Mar 27 13:45:17 UTC 2000
Jean Veronis wrote:
>I have a related question. What tools do you use once you have downloaded
>the HTML files to (batch-)convert them in reasonably clean "plain" text?
I am using my tools, "DeHTML".
DOS/Win16/Win32/OS2 versions are now available at
http://www2d.biglobe.ne.jp/~htakashi/software/DEHTML_E.HTM or
http://www2d.biglobe.ne.jp/~htakashi/software/DEHTML_J.HTM (Japanese version)
--
= HAMAGUCHI Takashi(KOBE, Japan) htakashi at mse.biglobe.ne.jp =
=[ http://www2d.biglobe.ne.jp/~htakashi/ ] NBC03301 at nifty.ne.jp ==
More information about the Corpora
mailing list