[Corpora-List] unsupervised with semi-supervised

Diarmuid O'Seaghdha do242 at cam.ac.uk
Mon Apr 21 15:35:01 UTC 2008


Hi there,

As Eric pointed out, the short Wikipedia entry on semi-supervised 
learning (http://en.wikipedia.org/wiki/Semi-supervised_learning) states 
the fundamental issues. It's worth mentioning that the references on 
that page are also useful, in particular:

Xiaojin Zhu maintains a comprehensive overview of semi-supervised
classification in all its guises at 
http://www.cs.wisc.edu/~jerryzhu/pub/ssl_survey.pdf; it may be overkill
for your purposes, and I'm not sure it even addresses your question
directly, but it gives a good idea of what's out there.

Olivier Chapelle, Bernhard Schölkopf and Alexander Zien have edited a 
book called "Semi-Supervised Learning", published by MIT Press. It's 
written for a machine learning audience and can be heavy going, but the
introductory chapter contains the formal definitions you're looking for;
conveniently, that chapter is available online at
http://mitpress.mit.edu/books/chapters/0262033585chapm1.pdf

Diarmuid

Taras Zagibalov wrote:
> Dear colleagues!
> I've been trying to find good definitions for supervised, 
> semi-supervised and unsupervised machine learning in Computational 
> Linguistics and NLP. I am especially interested in a good explanation of 
> the difference between  unsupervised and semi-supervised learning: to my 
> mind there must be some formally stated difference between a system that 
> uses only two seed words and a system that uses a manually created 
> word-list consisting of thousand items.
> I will be thankful for all ideas regarding the problem
> 
> Best regards,
> Taras Zagibalov
> University of Sussex
> 
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora



_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list