[Corpora-List] Free n-gram Software Released

Tony Berber Sardinha tony4 at uol.com.br
Tue Oct 15 11:45:58 UTC 2002


Thanks to all who responded, and to Bill for putting up a new website and for
emailing me the software.

cheers
tony.
-------------------------------------
Dr Tony Berber Sardinha
LAEL, PUC/SP
(Catholic University of Sao Paulo, Brazil)
tony4 at uol.com.br
http://lael.pucsp.br/~tony
[New website]

----- Original Message -----
From: "William H. Fletcher" <fletcher at usna.edu>
To: "Tony Berber Sardinha" <tony4 at uol.com.br>; <CORPORA at hd.uib.no>
Sent: segunda-feira, 14 de outubro de 2002 09:40
Subject: Re: [Corpora-List] Free n-gram Software Released


> Tony (and others with thhe same problem),
>
> Several people from the list have given me useful feedback, which means they
> have been able to access it.  My website can be slow (I'm looking for
> another provider), so I've posted keep a current version at
> http://www.chesapeake.net/~fletcher/kfNgramHelp.html as well.
>
> Please don't hesitate to suggest additional features you might like.
>
> Regards,
> Bill
>
> PS I've had problems accessing your site and sending you e-mail directly.
>
>
> ----- Original Message -----
> From: "Tony Berber Sardinha" <tony4 at uol.com.br>
> To: "William H. Fletcher" <fletcher at usna.edu>; "corpora list - messages to
> list" <CORPORA at hd.uib.no>
> Sent: Saturday, October 12, 2002 6:37 AM
> Subject: Re: [Corpora-List] Free n-gram Software Released
>
>
> > Dear list members
> >
> > Has anyone managed to download this software? I've been trying since the
> message
> > was posted, at different times of the day, without success. The page won't
> > finish loading.
> >
> > cheers
> > tony.
> > -------------------------------------
> > Dr Tony Berber Sardinha
> > LAEL, PUC/SP
> > (Catholic University of Sao Paulo, Brazil)
> > tony4 at uol.com.br
> > http://lael.pucsp.br/~tony
> > [New website]
> >
> > ----- Original Message -----
> > From: "William H. Fletcher" <fletcher at usna.edu>
> > To: <    >
> > Sent: segunda-feira, 30 de setembro de 2002 18:39
> > Subject: [Corpora-List] Free n-gram Software Released
> >
> >
> > > The recent flurry of discussion on n-gram software inspired me to
> revisit a
> > > project from last year.   I reprogrammed kfNgram using aspects of the
> > > "suffix array" approach described by Mikio Yamamoto and Kenneth W.
> Church
> > > and further developed by Chunyu Kit and Yorick Wilks.  The result was a
> > > quantum leap in performance which makes it useful even for large
> corpora.
> > > (It indexes the 25 million word CETENFolha corpus announced here last
> week
> > > in about 10 minutes on my Pentium III machine with  800 MHz processor
> and
> > > 256 MB RAM, then cranks out n-gram files in under a minute.)
> > >
> > > kfNgram supports user-defined character sets and sort orders, and its
> GUI
> > > (graphical user interface) makes it accessible even to casual users.
> > >
> > > This free Windows program is available at
> > > http://miniappolis.com/KWiCFinder/kfNgramHelp.html
> > > Suggestions and comments on its usability and performance will be
> greatly
> > > appreciated.
> > >
> > > Bill Fletcher
> > >
> > > - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
> > >
> > >   William H. Fletcher              410.293.6362 [voice]
> > >   Associate Professor, German & Spanish   410.293.2729 [fax]
> > >   Language Studies Department
> > >   US Naval Academy
> > >   589 McNair Road
> > >   Annapolis, MD 21402 - 5030
> > >
> > >   fletcher at usna.edu
> > >   http://www.usna.edu/LangStudy/
> > >   http://kwicfinder.com/
> > >   http://miniappolis.com/
> > >
> > > - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
> > >
> > >
> > >
> >
> >
>
>



More information about the Corpora mailing list