<font face="verdana,sans-serif">Hey Rich,<br><br>An RDBMS is an industry standard that works well for some things, such as storing the extracted metadata, but it might not be optimal for reasoning over that metadata. That might be one reason some people use other representations, such as RDF/SPARQL, for higher-level tasks. In general, storing everything in the Common Analysis Structure (CAS) defined by UIMA's type system works for me, and where needed I can write its contents to a database. What is the optimal way to represent the metadata for reasoning tasks? How could I transfer my UIMA CAS into that "thing"?<br>
<br clear="all"></font><span style="font-family:verdana,sans-serif">Sincerely,</span><br style="font-family:verdana,sans-serif"><span style="font-family:verdana,sans-serif">Siddhartha Jonnalagadda, </span>Ph.D.<br style="font-family:verdana,sans-serif">
<a style="font-family:verdana,sans-serif" href="http://sjonnalagadda.wordpress.com" target="_blank">sjonnalagadda.wordpress.com</a><br style="font-family:verdana,sans-serif">
<br style="font-family:verdana,sans-serif"><br>
<br><br><div class="gmail_quote">On Fri, Dec 9, 2011 at 11:56 AM, Rich Cooper <span dir="ltr">&lt;<a href="mailto:rich@englishlogickernel.com">rich@englishlogickernel.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
<div link="blue" vlink="blue" lang="EN-US">
<div>
<p class="MsoNormal"><font color="blue" size="2" face="Arial"><span style="font-size:10.0pt;font-family:Arial;color:blue">Dear Siddhartha,<u></u><u></u></span></font></p>
<p class="MsoNormal"><font color="blue" size="2" face="Arial"><span style="font-size:10.0pt;font-family:Arial;color:blue"><u></u> <u></u></span></font></p>
<p class="MsoNormal"><font color="blue" size="2" face="Arial"><span style="font-size:10.0pt;font-family:Arial;color:blue">Could you please provide more detail about
what you need in the way of “more computer-interpretable than RDBMS”?
I use the RDBMS columns with unstructured text, analyze the text in software,
and populate new columns to store the analyzed NLP information. By iteratively
aggregating RDBMS columns, I am able to process NLP quite well using the RDBMS
capabilities plus software functionality for interpretation. <u></u><u></u></span></font></p>
<p class="MsoNormal"><font color="blue" size="2" face="Arial"><span style="font-size:10.0pt;font-family:Arial;color:blue"><u></u> <u></u></span></font></p>
<p class="MsoNormal"><font color="blue" size="2" face="Arial"><span style="font-size:10.0pt;font-family:Arial;color:blue">More information would be useful,<u></u><u></u></span></font></p>
<p class="MsoNormal"><font color="blue" size="2" face="Arial"><span style="font-size:10.0pt;font-family:Arial;color:blue">-Rich<u></u><u></u></span></font></p>
<p class="MsoNormal"><font color="blue" size="2" face="Arial"><span style="font-size:10.0pt;font-family:Arial;color:blue"><u></u> <u></u></span></font></p>
<div>
<p class="MsoNormal"><font color="black" size="3" face="Times New Roman"><span style="font-size:12.0pt;color:black">Sincerely,<u></u><u></u></span></font></p>
<p class="MsoNormal"><font color="black" size="3" face="Times New Roman"><span style="font-size:12.0pt;color:black">Rich Cooper<u></u><u></u></span></font></p>
<p class="MsoNormal"><font color="black" size="3" face="Times New Roman"><span style="font-size:12.0pt;color:black">EnglishLogicKernel.com</span></font><font color="blue"><span style="color:blue"><u></u><u></u></span></font></p>
<p class="MsoNormal"><font color="black" size="3" face="Times New Roman"><span style="font-size:12.0pt;color:black">Rich AT EnglishLogicKernel DOT com</span></font><font color="blue"><span style="color:blue"><u></u><u></u></span></font></p>
<p class="MsoNormal"><font color="black" size="3" face="Times New Roman"><span style="font-size:12.0pt;color:black">9 4 9 \ 5 2 5 - 5 7 1 2</span></font><u></u><u></u></p>
</div>
<div>
<div class="MsoNormal" style="text-align:center" align="center"><font size="3" face="Times New Roman"><span style="font-size:12.0pt">
<hr align="center" size="3" width="100%">
</span></font></div>
<p class="MsoNormal"><b><font size="2" face="Tahoma"><span style="font-size:10.0pt;font-family:Tahoma;font-weight:bold">From:</span></font></b><font size="2" face="Tahoma"><span style="font-size:10.0pt;font-family:Tahoma">
<a href="mailto:corpora-bounces@uib.no" target="_blank">corpora-bounces@uib.no</a> [mailto:<a href="mailto:corpora-bounces@uib.no" target="_blank">corpora-bounces@uib.no</a>] <b><span style="font-weight:bold">On Behalf Of </span></b>Siddhartha Jonnalagadda<br>
<b><span style="font-weight:bold">Sent:</span></b> Friday, December 09, 2011
9:07 AM<br>
<b><span style="font-weight:bold">To:</span></b>
<a href="mailto:nlp2rdf@lists.informatik.uni-leipzig.de" target="_blank">nlp2rdf@lists.informatik.uni-leipzig.de</a>; CORPORA List<br>
<b><span style="font-weight:bold">Cc:</span></b> Jens Lehmann<br>
<b><span style="font-weight:bold">Subject:</span></b> Re: [Corpora-List]
[NLP2RDF] Announcement: NLP Interchange Format(NIF)</span></font><u></u><u></u></p>
</div><div><div class="h5">
<p class="MsoNormal"><font size="3" face="Times New Roman"><span style="font-size:12.0pt"><u></u> <u></u></span></font></p>
<p class="MsoNormal" style="margin-bottom:12.0pt"><font size="3" face="Verdana"><span style="font-size:12.0pt;font-family:Verdana">Somewhat related issue:<br>
Since UIMA is seeing increasing use within the NLP community (both in Information
Extraction and in other areas such as Question Answering), I wonder why another
standard is needed, as opposed to an interface between the UIMA type system and one of the
many existing standards. In other words, is there any work on representing the
information we extract in a way that is more computer-interpretable than an RDBMS?<br>
<br clear="all">
Sincerely,<br>
Siddhartha Jonnalagadda, </span></font>Ph.D.<font face="Verdana"><span style="font-family:Verdana"><br>
</span></font><a href="http://sjonnalagadda.wordpress.com" target="_blank"><font face="Verdana"><span style="font-family:Verdana">sjonnalagadda.wordpress.com</span></font></a><font face="Verdana"><span style="font-family:Verdana"><br>
<br>
</span></font><br>
<br>
<u></u><u></u></p>
<div>
<p class="MsoNormal"><font size="3" face="Times New Roman"><span style="font-size:12.0pt">On Fri, Dec 9, 2011 at 10:39 AM, John F. Sowa &lt;<a href="mailto:sowa@bestweb.net" target="_blank">sowa@bestweb.net</a>&gt; wrote:<u></u><u></u></span></font></p>
<div>
<p class="MsoNormal"><font size="3" face="Times New Roman"><span style="font-size:12.0pt">Before making a firm commitment to any notation as a standard for NLP,<br>
I suggest that you poll computational linguists and ask them what they<br>
would prefer for their work. One thing you could ask them is to<br>
look at those five serializations and check which one(s) they prefer.<br>
<br>
Corpora List is a good place to start such a poll.<u></u><u></u></span></font></p>
</div>
</div>
<p class="MsoNormal"><font size="3" face="Times New Roman"><span style="font-size:12.0pt"><u></u> <u></u></span></font></p>
</div></div></div>
</div>
</blockquote></div><br>