[Corpora-List] Wordsmith tag searches of CLAWS 7 Pseudo XML corpus
Marko, Georg (georg.marko@uni-graz.at)
georg.marko at uni-graz.at
Sun Oct 20 20:40:08 UTC 2013
Dear Peter,
I may be misunderstanding the question, but what happens if you delete the "<*>" in "Mark-up to ignore"? That will probably make estimating distances difficult, since all the pieces inside the tags are then included, but if you search for the core bit, e.g. "VV0", it should be there (at least it was when I ran a little test with the line you gave as a µ-corpus).
Simplistic solution, and probably not what you meant, but maybe...
Best
Georg
________________________________________
From: corpora-bounces at uib.no [corpora-bounces at uib.no] on behalf of Peter Saunders [peter.saunders at lang.ox.ac.uk]
Sent: Sunday, 20 October 2013 22:01
To: corpora at uib.no
Subject: [Corpora-List] Wordsmith tag searches of CLAWS 7 Pseudo XML corpus
Dear All
Does anyone know how I can configure Wordsmith settings so that it will do tag searches on a CLAWS 7 Pseudo XML tagged corpus? Here's a corpus line:
<w id="2.5" pos="VV0">give</w> <w id="2.6" pos="AT1">an</w>
I think the id="*" parameter causes problems and I don't know how to strip this part out of tag searches.
Best
Peter
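
One possible workaround outside Wordsmith is to preprocess the corpus so that the id attribute is gone before the files are loaded. The following is only a minimal sketch in Python, assuming every token looks like the sample line above (a <w> element with double-quoted id and pos attributes). The names strip_ids and words_with_tag are hypothetical helpers made up for this illustration and have nothing to do with Wordsmith itself.

```python
import re

# Matches an id="..." attribute (and the whitespace before it) inside a <w ...> tag.
# Assumption: attribute values are double-quoted and contain no '"', as in the sample line.
ID_ATTR = re.compile(r'(<w\b[^>]*?)\s+id="[^"]*"')


def strip_ids(line):
    """Return the line with the id attribute removed from every <w> tag."""
    return ID_ATTR.sub(r"\1", line)


def words_with_tag(line, tag):
    """Return the words whose pos attribute equals tag, whether or not an id is present."""
    pattern = re.compile(
        r'<w\b[^>]*\bpos="' + re.escape(tag) + r'"[^>]*>([^<]*)</w>'
    )
    return pattern.findall(line)


if __name__ == "__main__":
    sample = '<w id="2.5" pos="VV0">give</w> <w id="2.6" pos="AT1">an</w>'
    print(strip_ids(sample))              # <w pos="VV0">give</w> <w pos="AT1">an</w>
    print(words_with_tag(sample, "VV0"))  # ['give']
```

After stripping, each tag reduces to the form <w pos="VV0">, which contains only the part-of-speech code. Whether Wordsmith then treats that form as a searchable tag still depends on its own tag settings, which this sketch does not address.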