[Corpora-List] efficient decision tree tool?

Andy Roberts andyr at comp.leeds.ac.uk
Thu Jan 19 09:59:22 UTC 2006


I expect Ross Quinlan's C4.5 will be adequate then (which is what J4.8
is based on).

You can get it from http://www.rulequest.com/Personal/

Andy

On Thu, 19 Jan 2006, Caren Brinckmann wrote:

> Dear all,
>
> we are currently working on corpus-based models of duration, F0, intensity, 
> and segmental reductions in read and spontaneous speech. For the first part 
> of our study we will use decision trees.
>
> Since our database is fairly large, I am looking for an efficient decision 
> tree tool with the following features:
>
> * nominal and numeric input features and predictees (classification and 
> regression trees)
> * binary as well as multi-way splits
> * efficient handling of large datasets (200,000 cases/records/instances with 
> up to 100 attributes/features/variables)
> * nice to have: integrated feature selection algorithm
>
> In previous studies, I've worked with "wagon" from the Edinburgh Speech Tools 
> Library (http://www.cstr.ed.ac.uk/projects/speech_tools/) and "J48" from Weka 
> (http://www.cs.waikato.ac.nz/ml/weka/). While wagon is very fast and 
> memory-efficient, it only allows binary splits (as far as I know). Weka 
> allows multi-way splits, but is too slow and memory-consuming for our current 
> datasets.
>
> I'm looking forward to your suggestions!
>
> Kind regards,
>
> Caren.
>
> P.S.: If you know any other mailing list or forum where I could post my 
> question, please let me know.
>
> --
> Caren Brinckmann
> Saarland University, FR 4.7 Institute of Phonetics
> P.O.Box 151150, 66041 Saarbruecken, Germany
> Phone: +49-681-3024244, Fax: +49-681-3024684
>
>



More information about the Corpora mailing list