[Corpora-List] converting non-embedded tags into embedded ones

Lou Burnard lou.burnard at computing-services.oxford.ac.uk
Sun Feb 24 12:32:26 UTC 2008


If your input is well-formed XML (which it appears to be, except that 
the attribute value NN should be in quotation marks), then the easiest 
way would be to use an XSLT script containing a template like this:

<xsl:template match="lex">
<xsl:value-of select="."/>_<xsl:value-of select="@pos"/>
</xsl:template>

You can process such a script with, for example, xsltproc, which is 
available for Windows and Unix.
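
Because the sample in the question has an unquoted attribute value, an 
XSLT processor would reject it until the quoting is fixed. The following 
is a minimal Python sketch, not part of the XSLT approach above, that 
first quotes bare attribute values and then produces the same word_POS 
output the template gives. The function names and the <s> wrapper element 
in the usage example are assumptions made for illustration; only the lex 
element and its pos attribute come from the question.

```python
import re
import xml.etree.ElementTree as ET

# Matches name=value inside a tag where the value has no quotes.
# Limitation of this sketch: a quoted value that itself contains
# text like ' a=b' would be altered as well.
BARE_ATTR = re.compile(r'(\s[\w:.-]+)=([^\s"\'<>=/]+)')


def quote_bare_attributes(text):
    """Turn <lex pos=NN> into <lex pos="NN">, touching only text inside tags."""
    return re.sub(r'<[^<>]+>',
                  lambda m: BARE_ATTR.sub(r'\1="\2"', m.group(0)),
                  text)


def render(elem):
    """Return the text of elem, writing each lex element as word_POS."""
    if elem.tag == "lex":
        return "".join(elem.itertext()) + "_" + elem.get("pos", "")
    parts = [elem.text or ""]
    for child in elem:
        parts.append(render(child))
        parts.append(child.tail or "")  # text that follows the child
    return "".join(parts)


def convert(text):
    """Repair attribute quoting, parse the result, and render it as plain text."""
    return render(ET.fromstring(quote_bare_attributes(text)))


if __name__ == "__main__":
    sample = '<s><lex pos=NN>time</lex> <lex pos=VBZ>flies</lex></s>'
    print(convert(sample))  # prints: time_NN flies_VBZ
```

The input must have a single root element for the parser to accept it, 
which is why the usage example wraps the two lex elements in <s>.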



Warren Tang wrote:
> Could someone help me with this problem:
>
> I have texts with non-embedded tags:
>
> eg: <lex pos=NN>time</lex>
>
> but I would like to convert them to embedded tags (if this is the 
> right term):
>
> e

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


