[Corpora-List] Patent application for "Referent tracking of portions of reality"

John F. Sowa sowa at bestweb.net
Fri Sep 3 20:26:31 UTC 2010


On 9/1/2010 4:27 PM, Adam Kilgarriff wrote:
> It seems like this thread is proceeding in ignorance of the horrors
> of software patents.

I agree, and I would like to add another horror that was brought to
my attention in an offline note.  In my previous note about the patent
by Werner Ceusters, et al., I didn't go into detail about the patent
description, which has a very lengthy description of formal ontologies
and how they are used.

But the offline note pointed out a serious danger:  the so-called
inventors submitted another patent application as a "continuation" of
the patent that had already been granted.  Although the continuation
has only one fairly general claim, the "inventors" can add any claims
they please until the time that the new patent is granted.

At the end of this note, I copied the presentation or "teaching"
from that patent, which describes how they use the formal ontology.
There is nothing new in it.  All of it has been published and
implemented many times over during the past half century.  But the
patent examiners didn't know that.  They granted the patent, and
it's quite likely that they will approve this "continuation".

For a patent application, the US patent law allows the "inventors"
to add any new claims they please to the application up until the
date the patent is granted.  Since this application has just one
claim, that indicates the "inventors" plan to stuff the application
with many more claims just before it is granted.

I suggest that readers of this list look at the description quoted
below with one thought in mind:  "Is this similar to anything I have
done or plan to do with an ontology."  If so, these "inventors"
(or any company they assign the patent to) could sue you.

For example, look at the last sentence quoted below:  "By linking
the concepts in the ontology to the appropriate codes, the system
is able to annotate free text documents with these codes as the
documents are being indexed."  If the "inventors" add this as a
claim, anybody who annotates documents with codes could be sued.

Of course, such a claim could be rejected in court, but the cost
of proving that each and every one of the claims is invalid could
be prohibitive.  That could have a chilling effect on funding,
even for academic research.

John Sowa 
________________________________________________________________________

Source: http://www.faqs.org/patents/app/20090259459

The Formal Ontology

[0038] The formal ontology according to the current invention comprises 
a plurality of concepts one part of them being independent of a specific 
language, the other part being those concepts that explain the 
relationships between language-independent concepts and language as a 
medium of communication. By independent of language it is meant that the 
concepts do not depend on a particular language to be given a definition 
within the system. For example, in English, the word "dog" is a label 
applied to the concept of a particular animal. In other languages the 
same concept may be labeled with a different word, such as "Hund" in 
German, "cane" in Italian, or "perro" in Spanish. In reality, regardless 
of the label used in a particular language, the concept of the animal 
remains reasonably constant. The concept is therefore said to be 
independent of a specific language. Similarly, in the domain ontology 
according to the current invention, the concept for this particular 
animal is not dependent on a particular language. By keeping concepts 
independent of a specific language, the system according to the current 
invention can link concepts contained in the formal ontology to terms in 
more than one language.

[0039] However, although the concepts are independent of any specific 
language (such as English, French, . . . ), in the present invention 
they are not represented as being independent of language as a medium of 
communication. The second part of the formal ontology, the linguistic 
ontology, contains concepts about how humans interpret language. For 
example, the linguistic ontology according to the current invention 
contains the concept labeled "dispositive doing", which as a real world 
object relates to instances of an actor doing something to an actee. The 
concept is independent of a specific language because the notion of 
actor and actee in the context of the real world object, an action, is 
common to all languages. However, the concept is not totally independent 
of language in that the concept governs how the relationship between the 
actor and actee is understood by human beings.

For example, in the sentence [0040] "The doctor treated the patient."
it is understood in language that the action "treated" has an actor 
"doctor" and an actee "patient". That is, in the real world human beings 
understand that doctors treat patients, and patients don't treat 
doctors. The linguistic ontology applies this understanding to the real 
world object "treatment".

[0041] Thus, the concepts that are contained in the formal ontology are 
of two types generally. The first type of concept relates to real world 
objects that are recognized by human beings as metaphysical instances. 
These concepts comprise physical entities, procedures, ideas, etc and 
are contained in the domain ontology. The second type of concept relates 
to how human beings understand language and allows the identification of 
real world instances. That is, how human beings understand the 
interactions of real world objects represented by the concepts in the 
domain ontology.


[0042] The concepts that are contained in the formal ontology will 
depend on the knowledge area that the ontology is to be applied to, as 
well as on the principles according to which human languages function 
independent of the knowledge area. The domain ontology may contain 
concepts comprising general knowledge about the world, or may be limited 
to a specific knowledge area of interest to a user. Similarly, the 
linguistic ontology may define very broad rules about how language 
functions, or it may define very narrow rules to limit the relationships 
that can exist between concepts in the domain ontology. In a preferred 
embodiment of the system according to the current invention, the 
concepts contained in the domain ontology are limited to the knowledge 
area of medical concepts complemented by a linguistic ontology 
containing the concepts required to understand how natural language 
functions, and how humans deal with natural language. However, 
ontologies built with concepts from other knowledge areas can be created 
with equal success.

[0043] By allowing the concepts in the formal ontology to remain 
independent of specific language, the system according to the current 
invention allows documents in a variety of languages to be indexed and 
searched independent of the language(s) known by the system user. 
According to a preferred embodiment of the invention, the concepts in 
the formal ontology are tagged with labels in English to allow easy 
maintenance of the formal ontology by a user. However, the labels in 
English are for ease of use in maintaining the formal ontology only and 
do not contribute to the functioning of the system in indexing or 
retrieval of documents. The concepts in the formal ontology can be 
alternatively labeled in Dutch, German, French, Italian or any other 
language desired by the user. Alternatively, the concepts may be labeled 
using a coding system that is completely independent of language, such 
as ICD-9 or ICD-10.

[0044] The basic architecture of the formal ontology of the current 
invention is a directed graph, i.e. a hierarchical structure that allows 
multiple parents. Referring to FIG. 2, an example of the hierarchical 
structure is shown. In the hierarchy shown in FIG. 2, a primary node 
comprises a single primary concept. In the example shown, the single 
primary concept is the concept "City". The primary concept has as direct 
children, narrower related concepts, such as "European City" and "North 
American City". Each of the child concepts further have one or more 
child concepts that further narrow the primary concept. For example, the 
concept of "European City" may be narrowed to "French City", "German 
City" and "Belgian City". The concept of "North American City" may be 
narrowed to "Canadian City" and "US. City".

[0045] The hierarchical structure of the formal ontology, creates the 
most basic relationships between concepts contained in the formal 
ontology, that of parent and child in a strict formal subsumption 
interpretation, and that of siblings. The formal subsumption 
interpretation guarantees that all characteristics described of a 
parent, apply to all of its children without any exception. Referring 
again to the example, the concept of "City", which occupies the highest 
level of the hierarchy is the parent concept to "European City" and 
"North American City". By reciprocal relationship, the concepts of 
"European City" and "North American City" are the children of the 
concept "City". Further, the concept of "European City" is the parent of 
the concept "German City", etc. Further, the concept of "City" is the 
grandparent concept to the concept "German City", etc. Still further, 
the concepts of "European City" and "North American City" have the 
relationship of siblings since they share a common parent.

[0046] Regardless of the knowledge area of the concepts contained in the 
formal ontology according to the current invention, a similar 
hierarchical structure with parent/child and sibling relationships 
exists. This is true of both the general world concepts in the domain 
ontology and the linguistic concepts in the linguistic ontology. In a 
preferred embodiment of the invention, the highest level of the 
hierarchy is occupied by a primary concept with a label such as "Domain 
Entity". According to the preferred embodiment of the invention, the 
primary concept of "Domain Entity" encompasses all real things whether 
they be physical entities, states, ideas, etc. The primary concept may 
then preferably be sub-divided into physical entities, states, ideas, 
linguistic concepts, etc. at the next lower level of the ontology.

[0047] It should be apparent that because the hierarchical structure of 
the formal ontology, that all concepts in the ontology can be traced 
back to a single related concept at the highest level of the ontology, 
such as "Domain Entity". On the most basic level therefore, the degree 
of relatedness between two concepts can be measured by how many steps in 
the hierarchy must be traversed to find a common ancestor for the two 
concepts. Again referring to the example, the concepts of "Brussels" and 
"Antwerp" are siblings since they share a common parent, and are 
therefore closely related to each other within the hierarchy. By 
contrast, one must traverse the hierarchy back to the primary concept of 
"City" to find a common ancestor for the concepts of "Brussels" and 
"Chicago". Since the concepts of "Brussels" and "Chicago" share only a 
great-grandparent concept in common, they are less closely related 
within the context of the hierarchy than are the concepts "Brussels" and 
"Antwerp".

[0048] It should further be recognized that a single concept can have 
more than one direct parent. For example, in addition to the child 
concepts shown in FIG. 2, the concept "City" may have a child concept 
"Capital City". In this case "Paris", "Berlin" and "Brussels" would be 
children of the concept "Capital City" in addition to being children of 
"French City", "German City" and "Belgian City" respectively. By 
allowing a concept to have multiple parent concepts, the degree of 
relatedness between two concepts within the hierarchy may vary based on 
the context of the relationship. As can be seen from the examples, 
"Paris", "Berlin" and "Brussels" are more closely related in the context 
of "Capital City" than in the context of "European City". The only 
limitation on the structure of the hierarchy is that a concept cannot 
have itself as an ancestor, which would lead to a circular reference of 
a concept to itself.

[0049] As stated above, the most basic relationship between concepts in 
the formal ontology according to the current invention is the link 
created by the parent/child relationship. However, the relationships 
that can exist between two concepts in the formal ontology according to 
the present invention is not limited to that of parent and child. By 
allowing other relationships to exist, the richness of the knowledge 
contained in the formal ontology is greatly enhanced, while limiting the 
overall size of the ontology. For example, in reality the medical 
concepts of "brain", "inflammation" and "meningitis" are quite closely 
related. However, the concept "brain" refers to a body part, whereas 
"inflammation" is a symptom and "meningitis" is a disease. If a formal 
ontology were limited to parent/child relationships as a measure of the 
relatedness of concepts it is likely that the degree of relatedness 
between these three concepts within the ontology would potentially be 
very low. This is because a large number of parent/child relationships 
would likely have to be traversed before a common ancestor was found for 
all three concepts. This would of course lead to an inaccurate 
reflection of reality. A potential solution to this problem would be to 
construct a formal ontology with sufficient detail to narrow the gap 
between these concepts in the hierarchy. For example, the concepts of 
the body part "brain" and the symptom "inflammation" could be made 
children of the concept of the disease "meningitis". However, in order 
to provide an accurate reflection of reality it would be necessary to 
construct similar relationships between "brain" and "inflammation" and 
every other concept that they are related to. Since the concepts of 
"brain" and "inflammation" would most likely be attached to a large 
number of concepts, this would result in a large number of such 
parent/child relationships. Further, similar parent child relationships 
would have to be built for every concept in the ontology. This would 
result in an unmanageably large ontology. In addition, such a solution 
would violate the formal subsumption nature of the parent/child 
relationships exploited in this invention.

[0050] The current system solves this problem by providing a large 
number of link types for linking concepts within the formal ontology. 
The link types within the formal ontology according to the current 
invention are used to define relationships between concepts. For 
example, in reality the concept of "brain" is linked to the concept 
"meningitis" in that the brain is the location for the disease 
meningitis. Using the link types available in the formal ontology, a 
user can create a link between the concepts "brain" and "meningitis" in 
the formal ontology so that this conceptual link is also recognized by 
the system. A user may further create a link between the concept 
"inflammation" and the concept "meningitis" in the formal ontology to 
indicate that inflammation is a symptom of meningitis. Again, this 
allows the system to recognize a conceptual link that exists in reality. 
Furthermore, by linking the concepts "brain" and "inflammation" to the 
concept "meningitis", a conceptual link between the brain and 
inflammation is created. That is, the link through the concept 
"meningitis" shortens the distance between "brain" and "inflammation" 
within the ontology. By shortening the distance between these two 
concepts, the conceptual linkage between the two concepts in the 
ontology is increased.

[0051] An advantage of this type of linking of concepts is that it 
allows for more accurate indexing of documents because the deep meaning 
of the text can be pulled out. For example, a text that contains a 
discussion of meningitis may contain very few instances of the exact 
term "meningitis". However, the document may contain a significant 
number of references to inflammation in the brain. A standard indexing 
technique that looks only for the specific concept "meningitis" may rank 
such a document of very low relevance, while in reality it may have a 
very high relevance to the subject. In contrast, the system according to 
the current invention will recognize the linkage between the concepts of 
"brain", "inflammation", "meningitis" and as a result rank the document 
with a more accurate relevance to the subject.

[0052] The number of link types that can be provided for an ontology is 
only limited by the number of such relationships that can exist in 
reality. According to a preferred embodiment of the invention, a user 
can use the available concepts and link types to build criteria and 
concept criteria. A criteria according to this embodiment is comprised 
of a concept with an associated link type. For example, the link type 
HAS-LOCATION can be associated with the concept BRAIN to produce the 
criteria [HAS-LOCATION] [BRAIN]. This criteria can further be used to 
define a property of another concept as part of a concept criteria. For 
example [MENINGITIS] [HAS-LOCATION BRAIN]. The association of the 
criteria [HAS_LOCATION] [BRAIN] to the concept MENINGITIS provides a 
partial definition of the concept meningitis.

[0053] In a preferred embodiment of the invention, each link type from a 
first concept to a second concept has a complimentary reciprocal or 
contra link type that can be established from the second concept to the 
first concept. For example in reality, when two objects "A" and "B" are 
close to each other, we say that "A" is close to "B" and "B" is close to 
"A". In such case where a relation operates bi-directionally, the 
ontology is constructed by placing the same link type twice, from "A" to 
"B" and from "B" to "A". E.g.: A IS-NEAR-OF B, B IS-NEAR-OF A.

[0054] A second case of paired link types according to this embodiment 
is used to describe an inverse relationship. For example, where concept 
"A" performs some action on "B", "A" is defined as acting on "B" whereas 
"B" is defined as being acted on by "A". E.g.: A HAS-ACTOR B<-> B 
IS-ACTOR-OF A; or A IS-SPATIAL-PART-OF B<-> B HAS-SPATIAL-PART A. The 
link types can be declared each other's inverse by use of either CONTRA 
or AUTOCONTRA attributes that can be assigned to them.

[0055] he operation of link types and reciprocation will now be 
explained by means of example. Prior to the explanation, it is necessary 
to define what is meant herein by the term "instance". As used herein, 
the term "instance" refers to an individual manifestation or embodiment 
of a concept in the real world (i.e. metaphysical instances). By 
example, for the concept of the disease meningitis, an individual 
diagnosed case of meningitis contracted by a specific person would be an 
occurrence or "instance" of the disease.

[0056] Now if we declare in the formal ontology "MENINGITIS" IS-CAUSE-OF 
"INFLAMMATION IN THE BRAIN", then it means that all metaphysical 
instances of meningitis cause inflammation in the brain. However, this 
does not provide any reciprocal information about metaphysical instances 
of inflammation in the brain.

[0057] By contrast, if we declared "INFLAMMATION IN THE BRAIN" HAS-CAUSE 
"MENINGITIS", then it means that all metaphysical instances of 
inflammation in the brain are caused by meningitis. Here again however, 
we are provided with no information about metaphysical instances of 
meningitis.

[0058] By declaring a CONTRA, such as "MENINGITIS" IS-CAUSE-OF CONTRA 
HAS-CAUSE "INFLAMMATION IN THE BRAIN", the system according to the 
current invention provides information about all instances of 
meningitis: all instances of meningitis cause inflammation in the brain. 
By declaring a CONTRA, the system also provides information about some 
instances of inflammation in the brain: some instances of inflammation 
in the brain are caused by meningitis.

[0059] By declaring an AUTOCONTRA, such as "MENINGITIS" IS-CAUSE-OF 
AUTOCONTRA HAS-CAUSE "INFLAMMATION IN THE BRAIN", the system according 
to the current invention provides information about all instances of 
meningitis and all instances of inflammation in the brain: all instances 
of meningitis cause inflammation in the brain AND all instances of 
inflammation in the brain are caused by meningitis.

[0060] By using the various link types, and CONTRA and AUTOCONTRA 
declarations to link concepts within the ontology, a user can build 
definitions of the concepts in the ontology, while giving it a precise 
semantics as to how these declarations are to be applied by interpreting 
events in the world, this however without the computational burdens 
related to full first order logic.

[0061] As stated above, creating a link between two concepts defines a 
relationship between the two concepts. It also defines something about 
at least one of the concepts itself, such as "brain" is the location of 
"meningitis", or "inflammation" is a symptom of "meningitis". By 
creating these two links, a user enriches the knowledge contained on the 
ontology by providing a definition for the concept "meningitis" based on 
its interactions with other concepts in the ontology. In a preferred 
embodiment of the invention, a full definition can be created for each 
concept in the formal ontology. The full definition as it is used here 
means the set of necessary and sufficient links that a concept has to 
identify occurrences in the real world as instances of the concept. In 
other words: the set of all links of a given concept in the ontology 
defines what is true for all occurrences in the real world that are 
instances of the concept. The full definitions assigned to a concept in 
the ontology allow occurrences in the real world to be recognized as 
instances of the particular concept.

[0062] A further feature of the formal ontology provided according to 
the invention is the subsumption of child concepts within parent 
concepts, which results in full inheritability of links from parent to 
child concepts. That is, a child concept will automatically be linked to 
all concepts that its parent is linked to. For example, the concept 
"meningitis" may have the child concepts of "viral meningitis" and 
"bacterial meningitis", both of which are more specific concepts 
subsumed within the concept "meningitis". Thus the link established 
between the concept of "meningitis" and "brain" will automatically be 
established between the concept of "viral meningitis" and "brain", and 
"bacterial meningitis" and "brain". Therefore, "viral meningitis" and 
"bacterial meningitis" will inherit the definition of the parent concept 
"meningitis", but will be further defined based on the further links 
that each has to other concepts. In this way, the system according to 
the current invention can recognize each instance of either "viral 
meningitis" or "bacterial meningitis" as an instance of "meningitis", 
but will not necessarily recognize each instance of "meningitis" as 
"viral meningitis" or "bacterial meningitis". This feature provides the 
advantage of allowing a user to propagate a link to the progeny of a 
concept by establishing a single link.

[0063] As stated above, the link types provided as part of the formal 
ontology can be used by a user to define relationships between two 
concepts. At the same time the link types can provide full definitions 
of the concepts in the formal ontology. However, it is recognized in 
reality that, some relationships between concepts do not make sense. For 
example, it is recognized in reality that the disease "meningitis" 
cannot not have "inflammation" as a location. In computerized systems 
however, such nonsensical relationships are not automatically recognized 
unless you make the system work under a "close world assumption" (i.e. 
what is not known, is not allowed), or if it is specified explicitly 
what is not allowed. It is necessary to teach a natural language 
understanding system what are and are not appropriate relationships 
between concepts.

[0064] The system according to the current invention solves this problem 
by providing the linguistic ontology as part of the formal ontology. The 
linguistic ontology contains the rules about how language works as well 
as the principles that the human mind adheres to when representing 
reality at the conscious level of a human being.

[0065] In the linguistic ontology provided according to the current 
invention, rules are established regarding what relationships can exist 
between concepts on the basis of how these relations are expressed in 
language in general (though independent of a specific language). For 
example, a rule may be established that the concept "disease" in the 
formal ontology cannot be linked to the concept "symptom" in the formal 
ontology as a location. Because "meningitis" and "inflammation" are 
children of "disease" and "symptom" respectively in the hierarchy, the 
rule prohibiting this link would be inherited by them. As a result, the 
definition of inflammation as a location for meningitis could not exist 
in the formal ontology.

[0066] In one embodiment, the linguistic ontology may be set up so that 
there is an absolute prohibition against using certain link types to 
link certain concepts. In the example above, a user would not be able to 
create a link indicating the concept "inflammation" as a location for 
the concept "meningitis". Alternatively, the linguistic ontology could 
be set up such that a verification by the user will be required when a 
prohibited link is proposed. In this embodiment, the user still has the 
option to create the link.

[0067] The rules established in the linguistic ontology may be as broad 
or restricting as required for a given application or knowledge area.

[0068] A second application of the linguistic ontology is that it 
restricts the possible representations of reality to those that are 
closest to the way reality is talked about by means of language. For 
example, in a shooting event, there are a number of participants such as 
the shooter, the deer, the bullet, the gun, etc. There is only that one 
specific event that happened (the shooting) in a precise way (the deer 
hit by the bullet shot from the gun by the shooter), but there are 
different ways to represent it formally: it can be represented from the 
viewpoint of the deer, the bullet, the shooter, etc. The present 
invention exploits the way humans usually talk about such an event, 
giving a central place to those aspects that are put central by the 
story teller.

[0069] A third application follows from the second in that sometimes 
single events are described as distinguishable entities by means of 
natural language. An example is the notion of baby brought on earth, 
wherein the view of "birth" (the baby's viewpoint) is equally preferred 
in medical language usage as that of "parturition" (the mother's point 
of view) or "delivery" (the physician's point of view).

[0070] The domain and linguistic ontologies have thus far been spoken of 
as being separate entities within the formal ontology. However, in the 
current invention they are connected within the formal ontology in that 
a concept may have both a domain and a linguistic concept as a direct 
parent. For example, the linguistic concept of "dispositive doing" may 
have as a child the concept of a "treatment", wherein a "treatment" as 
an action has a physician as actor and a patient or disease as actee. At 
the same time, "treatment" may descend from the parent concept 
"healthcare procedure" in the domain ontology. Within the domain 
ontology, the concept of a "treatment" is defined as a real world 
object, but this definition cannot be used to relate the object to other 
real world objects. The linguistic ontology defines how the real world 
object actively relates to other concepts and relates other concepts in 
language.

[0071} As indicated above, the formal ontology according to the current 
system is independent of any specific language, although not independent 
of language altogether. However, free text documents are written in 
specific languages. In order to be useful for indexing free text 
documents it is necessary to relate the language independent concepts to 
specific languages.

[0072] The system according to the current invention accomplishes this 
by providing a lexicon of terms that are linked to the formal ontology. 
The terms contained in the lexicon may comprise single words or 
multi-word units that correspond to concepts, criteria and concept 
criteria in the formal ontology. Further, each term in the lexicon may 
be linked to more than one concept, criteria or concept criteria in the 
formal ontology, which allows for the existence of homonyms. Likewise, 
each concept, criteria and concept criteria may be linked to more than 
one term in the lexicon, such as when terms in two or more languages are 
contained in the lexicon.

[0073] When indexing a free text document or interpreting a query to 
retrieve an indexed document, the system according to the current 
invention uses the lexicon of terms to segment the free text and to 
relate the free text to the concepts, criteria and concept criteria 
contained in the formal ontology. Thus, the current system makes use of 
both terms and independent concepts in the analysis of free text.

Managing the System, System Architecture

[0074] An additional feature of the present invention provides a 
management system for managing the formal ontology. As discussed, the 
formal ontology according to the current invention can be constructed 
using any available relational database system, such as ORACLE®, SYBASE® 
and SQLSERVER®. The ontology itself is abstracted away from the 
relational database system by wrapping access to the database into a 
management tool that exposes functionality to the user. The database 
functions as a physical storage medium for the ontology. According to 
the current invention a management tool is provided for giving a user 
access to the ontology for the purpose of adding to or manipulating the 
ontology. The tool allows the user to view the formal ontology using a 
variety of different criteria that together give a complete picture of 
the structure of the formal ontology. In a preferred embodiment of the 
invention a user can view several different views of the ontology at 
once as a layout, allowing the ontology to be viewed from several 
perspectives at once.

[0075]The management system for maintaining the formal ontology will be 
explained with reference to FIG. 3, which shows the architecture of the 
ontology management system according to the current invention. The 
formal ontology and lexicon of terms are stored on a database 20, which 
is in communication with a server 22, which houses the server based 
component of the ontology management tool 26. The server based component 
of the ontology management system comprises a relational database which 
controls access to the formal ontology, and contains the components for 
building the formal ontology, such as the hierarchical structure, link 
types, setting rules in the linguistic ontology, linking terms to 
concepts, etc, along with the tools for creating multiple views of the 
ontology. The ontology management system further comprises a client 
based component(s) 24 that allows a user to access and maintain the 
ontology via the server based component 22. The system can be 
implemented on a number of platforms, including but not limited to 
WINDOWS®, SOLARIS®, UNIX® and LINUX®. Preferably, the management tool 26 
is a set of business objects. A low layer is a thin wrapper on top of 
the database structure that implements the base functions to access a 
particular relational database. A middle layer also exposes a set of 
functions that manage multi-user access to any type of supported 
database, such as a relational database. As such the middle layer allows 
the creation of customized versions of the management tool within 
certain limited parameters. A top layer implements the high level 
interface. This interface surfaces functionality from a logical point of 
view to outside users (e.g. "getConceptTree" is a high level layer 
function that makes use of the underlying middle and low level layer 
functions to populate a tree object with information about the place of 
a concept in the formal ontology). Functionality implemented by the low 
and middle layers includes but is not limited to the linking of external 
databases, database manipulation and navigation, and text searching.

Linking External Databases

[0076] As described thus far, the formal ontology according to the 
current invention is constructed manually by a user by creating 
hierarchical levels, slots within those hierarchical levels and further 
filling those slots with concepts, thereby creating the basic hierarchy 
with its parent/child relationships between concepts. The user further 
enriches the knowledge base by using the link types provided to define 
relationships between the concepts entered into the hierarchy. In 
addition to being able to manually construct the formal ontology, an 
alternative embodiment of the system according to the current invention 
provides the ability to map data from an independent database onto the 
formal ontology.

[0077] In a number of knowledge areas, large databases of information 
are already in existence. In order to avoid the laborious work of 
manually re-entering this information into the formal ontology, the 
system according to the current invention provides the capability to 
link the formal ontology to an external independent database. Although 
the external data never becomes a physical part of the ontology, this 
feature allows a user to access and use data contained on an independent 
database as if it were part of the formal ontology.

[0078] Data in an external database is linked to the ontology by 
creating a parent/child relationship between at least one concept in the 
formal ontology and at least one item of data in the database. In the 
case of an external database in tabular format, such as an ACCESS® 
database, a user can link an entire column of data in the external 
database to one or more concepts in the formal ontology by creating a 
parent/child link between at least one concept in the ontology to the 
header for the column in the table. Normally, when data is provided in 
tabular format, each column of the table is given a header with a 
descriptive title for the data contained in that column. In creating the 
parent/child relationship between the concept in the ontology and the 
column of data, the system analyzes the title and associates it with 
appropriate concepts in the ontology. Alternatively, the system may 
provide the user with a list of potential concepts that the data can be 
mapped to. The system may make use of the terms contained in the lexicon 
when performing this function. In an alternative embodiment of the 
invention, a user can manually map an item or column of data to the 
desired concept.

[0079]Referring to FIG. 4, an example of how an external independent 
database may be mapped to the formal ontology is shown. The relational 
database 30, server 32 and client based component 34 are as described in 
FIG. 3. Databases 36 and 38 are external independent databases, such as 
ACCESS® databases containing data to be mapped onto the formal ontology. 
Database servers 40 and 42 associated with each database 36 and 38 allow 
access to their respective databases so that queries can be run. A 
database directory service 44, assigns keywords to the separate 
databases 36 and 38. According to the current system, the same keyword 
may be assigned to two or more databases containing similar data that 
can be accessed at the same time. The database directory service 
provides the location of all of the available databases to an ontology 
proxy module 46. The ontology proxy module 46 receives queries from a 
user via the client based component 34. The ontology proxy module then 
directs the queries to the server 32 and to a database-ontology mediator 
module 48. The database-ontology mediator module comprises an 
ontology-to-database translator 50 and a database-to-ontology translator 
52. The ontology-to-database translator 50 serves the function of 
translating the ontology concept based queries to database queries that 
can be used to search the databases 36 and 38 for data that is mapped to 
the particular concept or concepts embodied in the query. The 
database-to-ontology translator 52 serves the function of translating 
the information returned from the database to a form that can be viewed 
by the user via the client based component 34.

Coding Using Independent Coding Systems

[0080] In a preferred embodiment of the system according to the current 
invention, the formal ontology is comprised of a knowledge base of 
medical concepts. A preferred use for the system is in the indexing of 
medical documents. A further preferred application of the system 
according to the current invention is the coding of medical documents 
using a standard medical coding system. Standard medical coding systems 
that can be used in conjunction with the current invention include, but 
are not limited to ICD-9, ICD-10, MedDRA and SNOMED.

[0081] To accomplish this, the medical concepts contained in the formal 
ontology of the system can be mapped to the appropriate codes contained 
in the appropriate independent database (i.e. ICD-9, etc). 
Alternatively, the appropriate coding system may be included in the 
formal ontology as a separate and parallel hierarchy to the hierarchy of 
medical concepts. In this alternative embodiment, each medical concept 
is linked to the appropriate code via a "has code" link type. For 
example, the concept "meningitis" would be linked to the ICD-9 code 
322.9 or the MedDRA code 10027252.

[0082] By linking the concepts in the ontology to the appropriate codes, 
the system is able to annotate free text documents with these codes as 
the documents are being indexed.


_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list