[Corpora-List] XML parsers vs regex

Phil Gooch philgooch at gmail.com
Mon Jun 30 19:33:39 UTC 2014


On Mon, Jun 30, 2014 at 7:08 PM, Matías Guzmán Naranjo <mortem.dei at gmail.com> wrote:

> wouldn't just writing <date>.*?</date> get me 'week after'?


I'd go for

<date>[^<]+</date>

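which will also consume line breaks: the negated class [^<] matches any
character other than '<', newlines included, whereas '.' stops at a
newline unless dot-all mode is switched on. Of course, this assumes that
the date element contains only text and no other markup.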
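
To make the difference concrete, here is a minimal sketch in Python's re
module. The sample strings are made up for illustration and are not taken
from the corpus under discussion; other regex flavours may treat '.' and
newlines differently.

```python
import re

# Hypothetical sample: the first date element spans a line break.
sample = "We met <date>the week\nafter</date> and again on <date>Friday</date>."

# Hypothetical sample with markup nested inside the date element.
nested = "See you <date>next <b>week</b></date>."

lazy_dot = re.compile(r"<date>.*?</date>")         # '.' skips newlines by default
negated_class = re.compile(r"<date>[^<]+</date>")  # [^<] matches newlines too

# Without re.DOTALL the lazy-dot pattern misses the element that wraps.
lazy_matches = lazy_dot.findall(sample)

# The negated class finds both elements, line break included.
class_matches = negated_class.findall(sample)

# The stated assumption matters: nested markup defeats the negated class.
nested_matches = negated_class.findall(nested)

print(lazy_matches)
print(class_matches)
print(nested_matches)
```

Passing re.DOTALL to the first pattern would also let it cross the line
break, so the choice is partly one of taste; the negated class just
avoids depending on the flag.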

Phil