[Corpora-List] Moving Lexical Semantics from Alchemy to Science
Diana McCarthy
diana at dianamccarthy.co.uk
Sat Jan 29 15:04:52 UTC 2011
Hi John
Adam may have more specific info as he was involved in the production
but in case it helps:
Help on making web searches is at:
http://www.webdante.com/getting_started.html (only words in M-R
available on this site)
some sample xml files:
http://www.webdante.com/sample_xml.html
(you can download the whole sample or individual headwords)
DTD is at http://www.webdante.com/the_dtd.html
Various publications at:
http://www.webdante.com/publications.html
note DANTE should be useful for many purposes (not just noun compounds).
If anyone wants to have access to the full xml I'll be adding a link to
a request form at:
http://www.webdante.com/licensing_options.html
just as soon as we have everything ready for release (as Adam says, we
hope by end of Feb)
hope that helps
Diana
John F. Sowa wrote, On 29/01/11 14:43:
> On 1/29/2011 9:28 AM, Adam Kilgarriff wrote:
>> Dante (http://webdante.com) has done something very like this in a v
>> large exercise on corpus-driven lexicography for English. It contains
>> 20,844 compounds, all identified automatically and confirmed and
>> classified by lexicographers. Of these, 2,989 have more than one
>> meaning.
>
> Are there any documents on the web site that describe the formats and
> show some sample entries?
>
> I tried to type some simple words and word compounds, but the only
> thing I get is "Search in progress, please wait."
>
> John
>
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
--
===========================================================================
Diana McCarthy, http://www.dianamccarthy.co.uk/
Lexical Computing Ltd. http://www.sketchengine.co.uk/
===========================================================================
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list