[Corpora-List] What support should a corpus provide?

Rich Cooper rich at englishlogickernel.com
Fri Aug 8 18:12:00 UTC 2014


Dear Corpus Analysts and Ontologists,

 

I have just made available a corpus of documents
from the US Patent and Trademark Office which are
available for corpus analysts.  The tools
available now are sufficient for supporting
attorneys, inventors, scientists, and other
similar application legal and technology roles.  

 

What additional support should I provide in the
software for supporting corpus analysis of
selected patent document subsets?  I have a web
site with extensive help and tutorial materials -
I suggest starting at:

 

www.EnglishLogicKernel.com/Help/help.htm

 

to see an index of capability descriptions.  I can
make available the "frequent words" and the "rare
words" lists as text files, along with the patent
documents in whole or in sections for data,
abstract, description and claims, which are
already extracted from the selected document set.
The claim tree is parsed, and the claims are
separated into claim elements, all of which can be
provided.  

 

Is there anything else that corpus analysts would
like to see in the software?

 

Suggestions highly appreciated,

-Rich

 

Sincerely,

Rich Cooper

EnglishLogicKernel.com

Rich AT EnglishLogicKernel DOT com

9 4 9 \ 5 2 5 - 5 7 1 2

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20140808/24b2d4a6/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list