Hello,<br><br>We have just started a project here at the Radboud University of Nijmegen that deals with Passage Retrieval and Text Mining in patent texts. I was wondering if anyone could point me to some literature/research/interesting facts on the linguistic and statistical characteristics of the language used in patent texts (e.g. frequency and hierarchical organisation of PP-attachments, use of gerund clauses vs. the relative clause with an inflected verb, average sentence length in the different sections, ... ).<br>
<br>I will of course post a summary of your replies on this list.<br><br>Thank you ever so much!<br><br> Eva<br><br><br>Eva D'hondt, PhD student<br>Centre for Language and Speech Technology<br>University of Nijmegen<br>
Email: <a href="mailto:e.dhondt@let.ru.nl">e.dhondt@let.ru.nl</a><br><br>