[Corpora-List] Most frequent 5K words in Icelandic?

Tristan Miller miller at ukp.informatik.tu-darmstadt.de
Mon Nov 19 10:13:01 UTC 2012


Greetings.

On 19/11/12 10:58 AM, Kim Witten wrote:
> Hi Corpora Subscribers,
> I'm wondering if somebody might be able to point me in the direction to find a simple list of the 5,000 most frequent words in Icelandic, from any (relatively current, non-historical) Icelandic corpus? With English gloss would be even better, but it's not necessary. Thanks!

Wiktionary has a 5K list derived from movie and television subtitles:
http://en.wiktionary.org/wiki/Wiktionary:Frequency_lists/Icelandic_wordlist

It is most likely a truncated version of the lists at
http://invokeit.wordpress.com/frequency-word-lists/ which include 50K
and even longer versions.

Regards,
Tristan

-- 
Tristan Miller, Doctoral Researcher
Ubiquitous Knowledge Processing Lab (UKP-TUDA)
Department of Computer Science, Technische Universität Darmstadt
Tel: +49 6151 16 6166 | Web: http://www.ukp.tu-darmstadt.de/

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 259 bytes
Desc: OpenPGP digital signature
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20121119/c8947841/attachment-0001.sig>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list