[Corpora-List] XML encoding database of tagged documents

Lou Burnard lou.burnard at computing-services.oxford.ac.uk
Mon Jun 5 15:08:49 UTC 2006


Why, TEI of course. Is there any other?

I would represent each user annotation ("comment") using a <note>
element, with the attributes resp to identify the coder's name, date for
the creation date, and type for the "tag name".
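
A minimal sketch of what that could look like in export code, assuming
Python's standard xml.etree.ElementTree module. The helper make_note and
the sample values are hypothetical, and the attribute names simply follow
the suggestion above; check them against the P5 schema before relying on
them.

```python
import xml.etree.ElementTree as ET


def make_note(tag_name, coder, created, comment):
    """Build one <note> element for a single user annotation.

    Attribute mapping (an assumption based on the suggestion above):
      type -> tag name, resp -> coder's name, date -> creation date.
    """
    note = ET.Element(
        "note",
        {"type": tag_name, "resp": coder, "date": created},
    )
    # The plain-text comment becomes the element content.
    note.text = comment
    return note


# Hypothetical sample annotation.
note = make_note(
    "metaphor",
    "coder1",
    "2006-06-05",
    "Possible metaphor; check context.",
)

# Serialize to a string; special characters in the comment are escaped.
xml_text = ET.tostring(note, encoding="unicode")
print(xml_text)
```

Each annotation becomes one self-contained element, so the exporter can
emit one <note> per tag and attach it wherever the tagged passage occurs.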

See http://www.tei-c.org for more on the TEI, in particular the section 
on the current update to TEI called P5.

Lou



Normand Peladeau wrote:
> I am currently writing an export procedure to store in XML format a 
> flat database containing numeric values, dates, short strings and 
> tagged documents.
> 
> The tagged documents include tags (or codes) manually attached by 
> different users.  Each tag (or code) includes the following information:
>    * Tag name
>    * Coder's name
>    * Creation date
>    * Comment (plain text)
> I would like to choose an XML standard that supports this type of data 
> and is supported by a wide range of applications in various domains.
> Any suggestions as to which standard I should consider?
> 
> Normand Péladeau
> Provalis Research
> www.provalisresearch.com


