[Corpora-List] Texts with keywords for supervised learning

William Mann bill_mann at sil.org
Thu Jan 16 16:25:49 UTC 2003


My impression is that many library catalogs are really this sort of corpus,
except that the texts are on the shelves.

Perhaps catalogs of items that are available on line could be converted into
being this sort of corpus.

Bill Mann

----- Original Message -----
From: "Anette Hulth" <hulth at dsv.su.se>
To: <corpora at hit.uib.no>
Sent: Thursday, January 16, 2003 9:41 AM
Subject: [Corpora-List] Texts with keywords for supervised learning


> Dear list members,
>
> I'm currently doing experiments on keyword derivation,
> treating it as a supervised learning task. (By keywords
> a mean a set of say 3-15 words reflecting the content
> of the actual text.) I wonder if there is anybody who's
> aware of any freely available corpus of text documents
> in English, with manually assigned keywords that may
> be (automatically) extracted. Any pointers will be much
> appreciated!
>
> Kind regards
>     /Anette Hulth
>
> ---------------------------------------------------
>   Anette Hulth
>   Dept. of Computer and Systems Sciences
>   Stockholm University / KTH
>   Sweden
> ---------------------------------------------------
>
>
>
>
>



More information about the Corpora mailing list