[Corpora-List] WordNet vs Ontology

Eduard Barbu eduard_barbu at yahoo.com
Fri Aug 8 09:07:51 UTC 2014


Hi,
John said:
"For empirical analysis, however, the ontology must be derived or
discovered from the documents.  The primary reason for the failure
of so many projects in formal linguistics is that they start with
an a priori ontology of the subject matter."
I see two problems with this statement. The first: when a human being reads a text he is using the background knowledge to interpret the text. Therefore a system should also have coded background knowledge before extracting relevant facts from text. The second : there is no system capable of learning an ontology from text. The best systems are supervised ones. They are learning relations and rules from text with some level of precision and they are relying on annotated data. Moreover the output of such systems should be manually cleaned because it has a high number of errors.


On Friday, August 8, 2014 3:48 AM, John F Sowa <sowa at bestweb.net> wrote:
 


Mike,

I'm not arguing for prescriptive linguistics.  On the contrary,
note my strong endorsement of the point by Sue A. & Adam K.:
"I don't believe in word senses."

JFS
>> That is an important point:  Using the word 'ontology' as
>> a catchall term for any kind of list makes it meaningless.

MM
> it's amazing to me to see a bunch of *corpus* linguists arguing
> about what a word *should* mean...  But the discussion here has
> been bordering on prescriptive linguistics, if you ask me.

Thank you for raising that point, because it clarifies the issues.
In short, WordNet is *descriptive*, and the ontologies used in
AI and knowledge representation are most often *prescriptive*.
Unless you add an emulsifier, they mix like oil and vinegar.

Note the often quoted definition by Tom Gruber:  "An ontology
is a formalization of a conceptualization."  I don't like that
definition, and I don't recommend it -- because it is strongly
biased toward prescriptive ontologies.

For certain applications, such as software design, a prescriptive
approach is obligatory:  the chief engineer must choose the design
principles and enforce them consistently.

For empirical analysis, however, the ontology must be derived or
discovered from the documents.  The primary reason for the failure
of so many projects in formal linguistics is that they start with
an a priori ontology of the subject matter.  Then they take pride
in following Frege's principle of using the syntax to guide the way
their a priori ontology is combined to generate an interpretation.
That method is guaranteed to fail.

I keep telling them to read Wittgenstein or Adam K's article.
But they never get the point:  To understand a document in
any natural language, you must *discover* its underlying
ontology.  Nobody (human or computer) can understand a document
if they start by trying to impose an a priori ontology on it.

John

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20140808/ae8b0e41/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list