[Corpora-List] Metrics for corpus "parseability"

Miles Osborne miles at inf.ed.ac.uk
Mon Feb 4 22:21:35 UTC 2008


Chris Brew suggested I actually explain what it is I meant:  here is a
sample paper on phase transitions in solving problems like 3-sat:

http://www.sciencemag.org/cgi/content/abstract/264/5163/1297

Props to Chris!

Miles

On 04/02/2008, Miles Osborne <miles at inf.ed.ac.uk> wrote:
>
> I must confess, the idea that a corpus can be described in terms of
> "parseability" sounds a little ill-founded to me.  The choice of particular
> parsing algorithm may dictate which examples are hard to process, as will
> the underlying grammar etc etc.
>
> What would be interesting (read:  hard) would be to look at the work on
> phase transitions in 3-sat problems and the like.  So, are there underlying
> graph-related characteristics of parsing which make certain sentences
> intrinsically hard to process and in particular can these characteristics be
> framed in a manner that was independent of the actual parser.
>
> Miles
>
> --
> The University of Edinburgh is a charitable body, registered in Scotland,
> with registration number SC005336.




-- 
The University of Edinburgh is a charitable body, registered in Scotland,
with registration number SC005336.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20080204/fecfd396/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list