[Corpora-List] text XML representation for NLP

Lou Burnard lou.burnard at computing-services.oxford.ac.uk
Mon Feb 28 10:24:29 UTC 2005


The Text Encoding Initiative's Recommendations for encoding are also
very useful for NLP (not surprisingly, since the  ACL was one of the
first sponsors of the TEI, and most of those currently active in the
field of XML annotation "cut their teeth" on the TEI.

The TEI Recommendations were updated to use XML at the last major
revision (TEI P4, published in 2000); the next major revision, a
preliminary release of which is now available at the TEI's sourceforge
site, is a complete rewrite, aiming to include new materials and
standards. See http://www.tei-c.org/P5/ for details. I think the new ODD
system may be of particular interest to NLP practitioners.

Lou Burnard

Constantin Orasan wrote:

> Hi,
>
> Have a look at:
> XCES: http://www.xces.org/ and
> EAGLES/ISLE: http://www.mpi.nl/world/ISLE/
>
> Unfortunately these pages haven't been updated for a while. Maybe
> someone will be able to indicate more up-to-date pages.
>
> Regards,
>
> Constantin
>
>
>>Dear, CORPORA list people,
>>
>>         Right now, I am working on a text XML representation for
>>natural language processing.
>>
>>         The representation is used for representation of any text. It
>>will used for our natural language processing. It will include the
>>layers from base to top of NLP. The base layer may be about the
>>part-of-speech information. The top layer may be about the syntax
>>analysis result or shallow semantic information.
>>
>>As I known, there were so many conferences on XML for NLP. So I guess
>>there is some existed text XML representation for NLP. But I have not
>>found out.
>>
>>Could you give some information about it?
>>
>>Thank you very much!
>>
>>
>>
>>Best wishes;
>>
>>-Bill_Lang
>>
>
> ============================================
> Constantin Orasan
> Research Group in Computational Linguistics
> University of Wolverhampton
> http://www.wlv.ac.uk/~in6093/
>
>
>



More information about the Corpora mailing list