[Corpora-List] Wordsmith tag searches of CLAWS 7 Pseudo XML corpus

Marko, Georg (georg.marko@uni-graz.at) georg.marko at uni-graz.at
Sun Oct 20 20:40:08 UTC 2013


Dear Peter,

I probably misunderstand the question, but what happens if you delete the "<*>" in "Mark-up to ignore". It will probably make estimating distances difficult, with all the pieces included in the tags here, but if you look for the core bit - the "VV0", e.g. - this should be there (at least it was, when I did a little test with the line you've given as a µ-corpus).

Simplistic solution, and probably not what you meant, but maybe...

Best

Georg
________________________________________
Von: corpora-bounces at uib.no [corpora-bounces at uib.no] im Auftrag von Peter Saunders [peter.saunders at lang.ox.ac.uk]
Gesendet: Sonntag, 20. Oktober 2013 22:01
An: corpora at uib.no
Betreff: [Corpora-List] Wordsmith tag searches of CLAWS 7 Pseudo XML corpus

Dear All

Does anyone know how I can configure Wordsmith settings so that it will do tag searches on a CLAWS 7 Pseudo XML tagged corpus? Here's a corpus line:

<w id="2.5" pos="VV0">give</w> <w id="2.6" pos="AT1">an</w>

I think the id="*"  parameter causes problems and I don't know how to strip this part out of tag searches.

Best

Peter

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list