[Corpora-List] Free n-gram Software Released

William H. Fletcher fletcher at usna.edu
Mon Oct 14 12:40:51 UTC 2002


Tony (and others with thhe same problem),

Several people from the list have given me useful feedback, which means they
have been able to access it.  My website can be slow (I'm looking for
another provider), so I've posted keep a current version at
http://www.chesapeake.net/~fletcher/kfNgramHelp.html as well.

Please don't hesitate to suggest additional features you might like.

Regards,
Bill

PS I've had problems accessing your site and sending you e-mail directly.


----- Original Message -----
From: "Tony Berber Sardinha" <tony4 at uol.com.br>
To: "William H. Fletcher" <fletcher at usna.edu>; "corpora list - messages to
list" <CORPORA at hd.uib.no>
Sent: Saturday, October 12, 2002 6:37 AM
Subject: Re: [Corpora-List] Free n-gram Software Released


> Dear list members
>
> Has anyone managed to download this software? I've been trying since the
message
> was posted, at different times of the day, without success. The page won't
> finish loading.
>
> cheers
> tony.
> -------------------------------------
> Dr Tony Berber Sardinha
> LAEL, PUC/SP
> (Catholic University of Sao Paulo, Brazil)
> tony4 at uol.com.br
> http://lael.pucsp.br/~tony
> [New website]
>
> ----- Original Message -----
> From: "William H. Fletcher" <fletcher at usna.edu>
> To: <    >
> Sent: segunda-feira, 30 de setembro de 2002 18:39
> Subject: [Corpora-List] Free n-gram Software Released
>
>
> > The recent flurry of discussion on n-gram software inspired me to
revisit a
> > project from last year.   I reprogrammed kfNgram using aspects of the
> > "suffix array" approach described by Mikio Yamamoto and Kenneth W.
Church
> > and further developed by Chunyu Kit and Yorick Wilks.  The result was a
> > quantum leap in performance which makes it useful even for large
corpora.
> > (It indexes the 25 million word CETENFolha corpus announced here last
week
> > in about 10 minutes on my Pentium III machine with  800 MHz processor
and
> > 256 MB RAM, then cranks out n-gram files in under a minute.)
> >
> > kfNgram supports user-defined character sets and sort orders, and its
GUI
> > (graphical user interface) makes it accessible even to casual users.
> >
> > This free Windows program is available at
> > http://miniappolis.com/KWiCFinder/kfNgramHelp.html
> > Suggestions and comments on its usability and performance will be
greatly
> > appreciated.
> >
> > Bill Fletcher
> >
> > - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
> >
> >   William H. Fletcher              410.293.6362 [voice]
> >   Associate Professor, German & Spanish   410.293.2729 [fax]
> >   Language Studies Department
> >   US Naval Academy
> >   589 McNair Road
> >   Annapolis, MD 21402 - 5030
> >
> >   fletcher at usna.edu
> >   http://www.usna.edu/LangStudy/
> >   http://kwicfinder.com/
> >   http://miniappolis.com/
> >
> > - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
> >
> >
> >
>
>



More information about the Corpora mailing list