Corpora: Tools to convert HTML files into plain text

htakashi at mse.biglobe.ne.jp
Mon Mar 27 13:45:17 UTC 2000


Jean Veronis wrote:
>I have a related question. What tools do you use once you have downloaded
>the HTML files to (batch-)convert them in reasonably clean "plain" text?

I use my own tool, "DeHTML".
DOS, Win16, Win32, and OS/2 versions are now available at
 http://www2d.biglobe.ne.jp/~htakashi/software/DEHTML_E.HTM or
 http://www2d.biglobe.ne.jp/~htakashi/software/DEHTML_J.HTM (Japanese version)

--
= HAMAGUCHI Takashi(KOBE, Japan)            htakashi at mse.biglobe.ne.jp =
=[ http://www2d.biglobe.ne.jp/~htakashi/ ]  NBC03301 at nifty.ne.jp ==


