<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40"><head><meta http-equiv=Content-Type content="text/html; charset=us-ascii"><meta name=Generator content="Microsoft Word 14 (filtered medium)"><style><!--

/* Font Definitions */

@font-face

        {font-family:Calibri;

        panose-1:2 15 5 2 2 2 4 3 2 4;}

@font-face

        {font-family:Tahoma;

        panose-1:2 11 6 4 3 5 4 4 2 4;}

/* Style Definitions */

p.MsoNormal, li.MsoNormal, div.MsoNormal

        {margin:0in;

        margin-bottom:.0001pt;

        font-size:11.0pt;

        font-family:"Calibri","sans-serif";}

a:link, span.MsoHyperlink

        {mso-style-priority:99;

        color:blue;

        text-decoration:underline;}

a:visited, span.MsoHyperlinkFollowed

        {mso-style-priority:99;

        color:purple;

        text-decoration:underline;}

p.MsoPlainText, li.MsoPlainText, div.MsoPlainText

        {mso-style-priority:99;

        mso-style-link:"Plain Text Char";

        margin:0in;

        margin-bottom:.0001pt;

        font-size:11.0pt;

        font-family:"Calibri","sans-serif";}

p.MsoAcetate, li.MsoAcetate, div.MsoAcetate

        {mso-style-priority:99;

        mso-style-link:"Balloon Text Char";

        margin:0in;

        margin-bottom:.0001pt;

        font-size:8.0pt;

        font-family:"Tahoma","sans-serif";}

span.PlainTextChar

        {mso-style-name:"Plain Text Char";

        mso-style-priority:99;

        mso-style-link:"Plain Text";

        font-family:"Calibri","sans-serif";}

span.BalloonTextChar

        {mso-style-name:"Balloon Text Char";

        mso-style-priority:99;

        mso-style-link:"Balloon Text";

        font-family:"Tahoma","sans-serif";}

span.EmailStyle21

        {mso-style-type:personal;

        font-family:"Calibri","sans-serif";

        color:#1F497D;}

span.EmailStyle22

        {mso-style-type:personal-reply;

        font-family:"Calibri","sans-serif";

        color:#1F497D;}

.MsoChpDefault

        {mso-style-type:export-only;

        font-size:10.0pt;}

@page WordSection1

        {size:8.5in 11.0in;

        margin:1.0in 1.0in 1.0in 1.0in;}

div.WordSection1

        {page:WordSection1;}

--></style><!--[if gte mso 9]><xml>

<o:shapedefaults v:ext="edit" spidmax="1026" />

</xml><![endif]--><!--[if gte mso 9]><xml>

<o:shapelayout v:ext="edit">

<o:idmap v:ext="edit" data="1" />

</o:shapelayout></xml><![endif]--></head><body lang=EN-US link=blue vlink=purple><div class=WordSection1><p class=MsoNormal><span style='color:#1F497D'>In developing ontologies to match corpora samples, as in learning algorithms, what kind of analysis of each document would be useful to compare one patent claim against that patent’s description, and against an arbitrary potential prior art candidate?<o:p></o:p></span></p><p class=MsoNormal><span style='color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal style='margin-left:.5in'><b><span style='color:#1F497D'>Entity recognition, with and without names or descriptions or anaphora; <o:p></o:p></span></b></p><p class=MsoNormal style='margin-left:.5in'><b><span style='color:#1F497D'>Objects and activities mentioned in the claims, as compared to those mentioned in each patent; Mereological relationships among the identified objects and activities; <o:p></o:p></span></b></p><p class=MsoNormal style='margin-left:.5in'><b><span style='color:#1F497D'>Common verb signature database with identified variables and constants,<o:p></o:p></span></b></p><p class=MsoNormal style='margin-left:.5in'><b><span style='color:#1F497D'>Modus ponens interpreter of signature phrases wrt the identified objects and activities,<o:p></o:p></span></b></p><p class=MsoNormal><span style='color:#1F497D'>               <b>Logic language of FOL level, Horne clause, lexical scopes, question answering,<o:p></o:p></b></span></p><p class=MsoNormal><b><span style='color:#1F497D'>               Heuristic search through And/or graphs with FOL parameterization, simple algebra<o:p></o:p></span></b></p><p class=MsoNormal><span style='color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='color:#1F497D'>What have I missed?<o:p></o:p></span></p><p class=MsoNormal><span style='color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='color:#1F497D'>The idea, or long term goal, is to build an ontology of patent claims as encountered in published patents.  If that turns out to be helpful, other document analysis tasks might benefit from the ontology so developed.  <o:p></o:p></span></p><p class=MsoNormal><span style='color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='color:#1F497D'>-Rich<o:p></o:p></span></p><p class=MsoNormal><span style='color:#1F497D'><o:p> </o:p></span></p><div><p class=MsoNormal><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:black'>Sincerely,<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:black'>Rich Cooper<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:black'>EnglishLogicKernel.com</span><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:blue'><o:p></o:p></span></p><p class=MsoNormal><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:black'>Rich AT EnglishLogicKernel DOT com</span><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:blue'><o:p></o:p></span></p></div><p class=MsoNormal><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:black'>9 4 9 \ 5 2 5 - 5 7 1 2</span><span style='color:#1F497D'><o:p></o:p></span></p><div><div style='border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in'><p class=MsoNormal><b><span style='font-size:10.0pt;font-family:"Tahoma","sans-serif"'>From:</span></b><span style='font-size:10.0pt;font-family:"Tahoma","sans-serif"'> corpora-bounces@uib.no [mailto:corpora-bounces@uib.no] <b>On Behalf Of </b>Rich Cooper<br><b>Sent:</b> Friday, August 08, 2014 11:12 AM<br><b>To:</b> 'John F Sowa'; corpora@uib.no<br><b>Cc:</b> '[ontolog-forum] '<br><b>Subject:</b> [Corpora-List] What support should a corpus provide?<o:p></o:p></span></p></div></div><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal><span style='color:#1F497D'>Dear Corpus Analysts and Ontologists,<o:p></o:p></span></p><p class=MsoNormal><span style='color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='color:#1F497D'>I have just made available a corpus of documents from the US Patent and Trademark Office which are available for corpus analysts.  The tools available now are sufficient for supporting attorneys, inventors, scientists, and other similar application legal and technology roles.  <o:p></o:p></span></p><p class=MsoNormal><span style='color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='color:#1F497D'>What additional support should I provide in the software for supporting corpus analysis of selected patent document subsets?  I have a web site with extensive help and tutorial materials – I suggest starting at:<o:p></o:p></span></p><p class=MsoNormal><span style='color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='color:#1F497D'><a href="http://www.EnglishLogicKernel.com/Help/help.htm">www.EnglishLogicKernel.com/Help/help.htm</a><o:p></o:p></span></p><p class=MsoNormal><span style='color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='color:#1F497D'>to see an index of capability descriptions.  I can make available the “frequent words” and the “rare words” lists as text files, along with the patent documents in whole or in sections for data, abstract, description and claims, which are already extracted from the selected document set.  The claim tree is parsed, and the claims are separated into claim elements, all of which can be provided.  <o:p></o:p></span></p><p class=MsoNormal><span style='color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='color:#1F497D'>Is there anything else that corpus analysts would like to see in the software?<o:p></o:p></span></p><p class=MsoNormal><span style='color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='color:#1F497D'>Suggestions highly appreciated,<o:p></o:p></span></p><p class=MsoNormal><span style='color:#1F497D'>-Rich<o:p></o:p></span></p><p class=MsoNormal><span style='color:#1F497D'><o:p> </o:p></span></p><div><p class=MsoNormal><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:black'>Sincerely,<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:black'>Rich Cooper<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:black'>EnglishLogicKernel.com</span><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:blue'><o:p></o:p></span></p><p class=MsoNormal><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:black'>Rich AT EnglishLogicKernel DOT com</span><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:blue'><o:p></o:p></span></p></div><p class=MsoNormal><span style='font-size:12.0pt;font-family:"Times New Roman","serif";color:black'>9 4 9 \ 5 2 5 - 5 7 1 2</span><span style='color:#1F497D'><o:p></o:p></span></p></div></body></html>