[Corpora-List] Wiktionary ?

Joel Nothman jnothman at student.usyd.edu.au
Fri Aug 17 00:06:59 UTC 2012


Downloading it - one language at a time - is not hard. For most purposes  
the xxx-pages-articles.xml.bz2 dump is sufficient, but needs to then be  
processed.

Instead, a tool like http://www.ukp.tu-darmstadt.de/software/jwktl/ might  
help extract and access the content.

- Joel
PhD Candidate
School of IT
University of Sydney

On Fri, 17 Aug 2012 00:22:55 +0930, Sofia Zidane <sof.zidane at gmail.com>  
wrote:

> Hello dear members,
>
> I would like to know if it's possible to download all the lexical entries
> of Wiktionary ?
>
> I know there are dump files, but I see there are too many and I don't  
> know
> which one can be helpful.
>
> Best regards,

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list