Corpora: Re: multiple-category classification of text

Seth Russell seth at robustai.net
Fri May 12 01:46:20 UTC 2000


"J. Zavrel" wrote:

> > > I'm looking for references on multiple-category classification
> > > of text.
>
> Just couldn't resist putting in a small shameless plug for my
> favorite machine learning flavor:
>
> This type of text classification can be done very easily using
> memory-based learning. If the nearest neighbors of a phrase to be
> classified are multiple-category, you will also be able to assign
> multiple categories (i.e. a distribution over them). By controlling the
> number of nearest neighbors you can systematically increase the number
> of suggested categories. But of course this can be done using Naive
> Bayes as well...
>
> It's easy to try with our memory-based learning package TiMBL
> which can be obtained from http://ilk.kub.nl (follow the link to
> software).

Super!  I think it may be close to what I need for the eMouth project
http://robustai.net/ai/word_of_emouth.htm
Too bad it's under a 'research only' license, since eMouth is an
open source project.
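
For anyone who wants to try the idea from the quoted message without
the package, here is a rough Python sketch of nearest-neighbor
classification that returns a distribution over categories. It is not
TiMBL and does not use TiMBL's API or its similarity metrics: the
word-overlap (Jaccard) similarity, the function names, and the toy
MEMORY examples are all assumptions made up for illustration.

```python
from collections import Counter


def jaccard(a, b):
    """Set-overlap similarity between two token sets, from 0.0 to 1.0."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def knn_category_distribution(phrase, memory, k=3):
    """Return {category: share of votes} from the k most similar stored examples."""
    query = set(phrase.lower().split())
    # Rank stored examples by similarity to the query.
    # sorted() is stable, so ties keep their order in memory.
    ranked = sorted(
        memory,
        key=lambda ex: jaccard(query, set(ex[0].lower().split())),
        reverse=True,
    )
    votes = Counter()
    for _text, categories in ranked[:k]:
        # Each neighbor votes once for every category it carries.
        votes.update(categories)
    total = sum(votes.values())
    return {cat: n / total for cat, n in votes.items()} if total else {}


# Hypothetical stored examples; each one may carry several categories.
MEMORY = [
    ("stock prices fell sharply", {"finance"}),
    ("the bank raised interest rates", {"finance", "politics"}),
    ("parliament passed the budget bill", {"politics"}),
    ("the team won the final match", {"sports"}),
    ("club owners sell shares in the team", {"sports", "finance"}),
]

if __name__ == "__main__":
    for k in (1, 3):
        print(k, knn_category_distribution("the bank raised rates", MEMORY, k=k))
```

With this toy data, raising k from 1 to 3 pulls in more neighbors and
so more suggested categories, which is the effect described above.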

--
Seth Russell
Http://RobustAi.net/Ai/SymKnow.htm
Http://RobustAi.net/Ai/Conjecture.htm
