Also <dl><dt><ul><li>
                Adam Kilgarriff. 2001. <a href="http://kilgarriff.co.uk/Publications/2001-K-CompCorpIJCL.pdf">Comparing
                Corpora</a>. <em>International Journal of Corpus Linguistics</em> 6(1):
                1-37.
                </li><li>Reprinted in <i>Corpus Linguistics: Critical Concepts in Linguistics.</i> Teubert and Krishnamurthy, editors.  Routledge. 2007.
                </li></ul>

                </dt></dl>(with work on this from back in the 20th century; I think it stands up OK. We are currently reviewing the definition of 'corpus heterogeneity' given there, and implementing an improved version for viewing in the Sketch Engine. In brief, the new definition builds on a definition of corpus similarity: heterogeneity is "the similarity between the two most different parts". We cluster documents to identify the two most different parts.)<br>
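To make the idea concrete, here is a minimal sketch of "the similarity between the two most different parts". It is an illustration only and not the Sketch Engine implementation: the function names, the two-seed clustering step, the cosine similarity measure, and the `top_n` cutoff are all assumptions chosen for brevity (the 2001 paper evaluates other similarity measures).

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def rel_freqs(tokens, vocab):
    """Relative frequency of each vocabulary word in a token list."""
    counts = Counter(tokens)
    total = len(tokens) or 1
    return [counts[w] / total for w in vocab]


def cosine(u, v):
    """Cosine similarity; stands in for whatever similarity measure is used."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def heterogeneity(docs, top_n=500):
    """Return (similarity, part_a, part_b) for a list of tokenised documents.

    Hypothetical procedure: seed two clusters with the least similar pair
    of documents, assign every other document to the nearer seed, then
    score the similarity of the two pooled parts. A lower score means a
    more heterogeneous corpus under this sketch.
    """
    if len(docs) < 2:
        raise ValueError("need at least two documents")
    all_counts = Counter(t for d in docs for t in d)
    vocab = [w for w, _ in all_counts.most_common(top_n)]
    vecs = [rel_freqs(d, vocab) for d in docs]

    # The least similar pair of documents seeds the two parts.
    a, b = min(combinations(range(len(docs)), 2),
               key=lambda p: cosine(vecs[p[0]], vecs[p[1]]))
    part_a, part_b = [], []
    for i, v in enumerate(vecs):
        if i == a:
            part_a.append(i)
        elif i == b:
            part_b.append(i)
        elif cosine(v, vecs[a]) >= cosine(v, vecs[b]):
            part_a.append(i)
        else:
            part_b.append(i)

    pooled_a = [t for i in part_a for t in docs[i]]
    pooled_b = [t for i in part_b for t in docs[i]]
    sim = cosine(rel_freqs(pooled_a, vocab), rel_freqs(pooled_b, vocab))
    return sim, part_a, part_b
```

Called on a list of token lists, `heterogeneity(docs)` returns the score together with the document indices in each part, so the split itself can be inspected. A real implementation would likely use a proper clustering algorithm in place of the two-seed assignment.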

<br>Adam<br><br><div class="gmail_quote">On 6 November 2012 15:33, Stefan Th. Gries <span dir="ltr">&lt;<a href="mailto:stgries@gmail.com" target="_blank">stgries@gmail.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

Dear Alexander<br>
<br>
Please see: Gries, Stefan Th. 2006. Exploring variability within and<br>
between corpora: some methodological considerations. Corpora 1(2).<br>
109-151.<br>
<br>
Cheers,<br>
STG<br>
--<br>
Stefan Th. Gries<br>
-----------------------------------------------<br>
University of California, Santa Barbara<br>
<a href="http://www.linguistics.ucsb.edu/faculty/stgries" target="_blank">http://www.linguistics.ucsb.edu/faculty/stgries</a><br>
-----------------------------------------------<br>
<br>
</blockquote></div><br><br clear="all"><br>-- <br>========================================<br><a href="http://www.kilgarriff.co.uk/" target="_blank">Adam Kilgarriff</a>                  <a href="mailto:adam@lexmasterclass.com" target="_blank">adam@lexmasterclass.com</a>                                             <br>

Director                                    <a href="http://www.sketchengine.co.uk/" target="_blank">Lexical Computing Ltd</a>                <br>Visiting Research Fellow                 <a href="http://leeds.ac.uk" target="_blank">University of Leeds</a>     <div>

<i><font color="#006600">Corpora for all</font></i> with <a href="http://www.sketchengine.co.uk" target="_blank">the Sketch Engine</a>                 </div><div>                        <i><a href="http://www.webdante.com" target="_blank">DANTE: <font color="#009900">a lexical database for English</font></a><font color="#009900"> </font>                 </i><div>

========================================</div></div><br>