<div dir="ltr">Hi Mike et All,<div><br></div><div>One supposes that ambiguity is a principle cause of contradictions. I forget who proved it (about a century or more ago), but *all* structured entities of any complexity whatsoever (e. g., theories) are mathematically *guaranteed* to produce contradictions! Also, one thing early (1960s) attempts at automatic semantic analysis of sentences (e.g., the Harvard project at that time) showed was that apparently inoffensive sentences turned out to be surprisingly multiply ambiguous--not quite the number of each word in a sentence's meanings multiplied by that number for each other word in the sentence, but still a rather large number for even short sentences. Unambiguous sentences are extremely difficult to produce, much less find. So if even designed languages ineluctably lead to contradictions, good luck with relational databases!</div>
<div><br></div><div>[Note: (I'm aware I skipped several steps in the above 'proof'', but just sayin' ...)</div><div><br></div><div>Jim</div></div><div class="gmail_extra"><br clear="all"><div>James L. Fidelholtz<br>
Posgrado en Ciencias del Lenguaje<br>Instituto de Ciencias Sociales y Humanidades<br>Benemérita Universidad Autónoma de Puebla, MÉXICO</div>
<br><br><div class="gmail_quote">On Fri, Dec 6, 2013 at 3:04 PM, maxwell <span dir="ltr"><<a href="mailto:maxwell@umiacs.umd.edu" target="_blank">maxwell@umiacs.umd.edu</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div class="im">On 2013-12-06 15:47, Otto Lassen wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
If texts are structured or unstructured data depends on their origin.<br>
</blockquote>
<br></div>
I think a cross-cutting problem, and perhaps a more easily quantified (but maybe still useful) one, is that of ambiguity. Structured data is often designed to avoid ambiguity. (Structured data may provide an explicit representation of ambiguities, but the explicit representation should not in itself be ambiguous.)<br>
<br>
I'm sure someone will come up with counter-examples, but relational databases and XML documents are both designed to be unambiguously parseable (given a database schema or an XML schema). So were blueprints, if anyone remembers those. Natural language, otoh, is inherently (and often exceedingly) ambiguous. So are Nekker cubes.<br>
<br>
So it might be helpful (if possible) to re-phrase the question to ask how much data is potentially ambiguous, and at what level (syntactically, morphologically, lexically, semantically, pragmatically). By "potentially" ambiguous, I mean in principle; a particular instance of a natural language sentence might be syntactically unambiguous, but natural language in general is syntactically ambiguous. I suppose anything is _pragmatically_ ambiguous.<br>
<br>
Mike Maxwell<div class="HOEnZb"><div class="h5"><br>
<br>
______________________________<u></u>_________________<br>
UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/<u></u>corpora</a><br>
Corpora mailing list<br>
<a href="mailto:Corpora@uib.no" target="_blank">Corpora@uib.no</a><br>
<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/<u></u>listinfo/corpora</a><br>
</div></div></blockquote></div><br></div>