<div dir="ltr">the more usual definition of "overgeneration" is that a grammar accepts (can parse) strings that are ungrammatical. this used to be seen as a problem before people used probabilities, since being well-formed was then defined as being accepted by some grammar.<br>
<br>nowadays this is less of a problem, since you can simply interpret a low probability as an indicator that a string is junk rather than a sentence. overgeneration can still become a problem, however, if the number of spurious parses becomes so large that the parameters become fragmented, or that you can no longer produce the parses in reasonable time / space, etc.<br>
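<br>to make the "low probability means junk" idea concrete, here is a minimal sketch, not anyone's actual system. it assumes a made-up toy PCFG in Chomsky normal form with one deliberately spurious rule (S -> VP NP), sums over all parses with a CKY-style inside computation, and uses an arbitrary threshold of 0.05. the names BINARY, LEXICAL, string_probability and is_junk are all hypothetical.<br>

```python
from collections import defaultdict

# Hypothetical toy PCFG in Chomsky normal form; every rule and number is invented.
# The second S rule is a spurious one that makes the grammar overgenerate.
BINARY = {
    ("S", "NP", "VP"): 0.99,
    ("S", "VP", "NP"): 0.01,
}
LEXICAL = {
    ("NP", "dogs"): 1.0,
    ("VP", "bark"): 1.0,
}


def string_probability(words, start="S"):
    """Inside probability of `words`: the sum over all parses rooted in `start`."""
    n = len(words)
    if n == 0:
        return 0.0
    # chart[(i, j, symbol)] = probability that `symbol` yields words[i:j]
    chart = defaultdict(float)
    for i, w in enumerate(words):
        for (symbol, word), p in LEXICAL.items():
            if word == w:
                chart[i, i + 1, symbol] += p
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span
            for k in range(i + 1, j):
                for (parent, left, right), p in BINARY.items():
                    chart[i, j, parent] += p * chart[i, k, left] * chart[k, j, right]
    return chart[0, n, start]


def is_junk(words, threshold=0.05):
    """Treat a string as junk if its probability falls below an assumed threshold."""
    return string_probability(words) < threshold


if __name__ == "__main__":
    for s in (["dogs", "bark"], ["bark", "dogs"], ["dogs", "dogs"]):
        print(" ".join(s), string_probability(s), is_junk(s))
```

<br>here "bark dogs" is still accepted by the grammar (it has a parse), but its probability is far below that of "dogs bark", so a threshold separates them. "dogs dogs" has no parse at all and gets probability zero. the cubic loop over spans also hints at why very large numbers of spurious rules and parses cost time and space.<br>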
<br>Miles<br clear="all">
</div>