<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
</head><body style="">
<div>
Hello,
</div>
<div>
</div>
<div>
I would like to find co-occurring words in German texts. I have about 1,000,000 (one million) texts, each containing about 10 sentences.
</div>
<div>
</div>
<div>
Does anybody know where I can find software that can run this analysis on such a large number of texts?
</div>
<div>
</div>
<div>
I would prefer Java software, but other tools are also fine provided they run on Ubuntu.
</div>
<div>
</div>
<div>
Any help would be appreciated.
</div>
<div>
</div>
<div>
Regards,
</div>
<div>
</div>
<div>
Drame A.
</div>
</body></html>