I am not an expert in this matter and I have not given enough thought to the questions raised here, but they do interest me, so I suggest two possible (not very technical and perhaps obvious) reasons for why what has happened with DBMS has not happened with corpora:<div>

<br></div><div>1. DBMS (that is, relational DBMS) has a very simple problem to solve. In fact the simplicity of the DBMS structure is its 'genius', if you like. But when we say corpora, we can mean a lot of very different things of very different complexities and properties. It may not be fair to expect the same of, say, the CMS (Corpora Management Systems) community, as has been achieved by the DBMS community. More analogous to corpora will be the XDMS (XML Data  Management Systems) rather than the DBMS. Similarly XPath and XSLT rather than SQL.</div>

<div><br></div><div>2. The researcher and developer community for relational databases is much larger than for corpora, and the support for the former is also much more.</div><div><br></div><div>Another possible reason might be that as soon as you say 'corpora' and you include annotated corpora among them, you bring in the whole battlefield of linguistic theories. No wonder it is hard to achieve any degree of consensus about most of the issues. And when you include phonological information and speech (and may be even visual data), the problems are all compounded.</div>

<div><br></div><div>Having said that, this is still a worthy goal and some people have been working towards it. And (paraphrasing an old popular phrase in Indian English), contemporary Gods permitting, there might be something at the end of it.</div>

<div><br></div><div>- Anil Kumar Singh</div><div> </div><div><br><div class="gmail_quote">On Sat, Jul 7, 2012 at 2:01 AM, Albretch Mueller <span dir="ltr"><<a href="mailto:lbrtchx@gmail.com" target="_blank">lbrtchx@gmail.com</a>></span> wrote:<br>

<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"> Thank you Adam<br>

~<br>

 I read your paper on "getting to know your corpus (content based on<br>

the currency of keyword lists)" and I also found very interesting some<br>

of the referenced papers<br>

~<br>

 and BTW, I am not claiming that everything pertaining to DBMSs is<br>

implemented and documented "by 'the' book", however there is a certain<br>

degree of common understanding and what German people call<br>

"Sachlichkeit" in that culture that we sorely lack. I can't quite get<br>

why<br>

~<br>

 I was more like (very wishfully ;-)) thinking of something like that:<br>

~<br>

 <a href="http://en.wikipedia.org/wiki/Comparison_of_relational_database_management_systems" target="_blank">http://en.wikipedia.org/wiki/Comparison_of_relational_database_management_systems</a><br>

~<br>

 <a href="http://en.wikipedia.org/wiki/Database_testing" target="_blank">http://en.wikipedia.org/wiki/Database_testing</a><br>

~<br>

 <a href="http://en.wikipedia.org/wiki/Structured_Query_Language" target="_blank">http://en.wikipedia.org/wiki/Structured_Query_Language</a><br>

<div class="HOEnZb"><div class="h5">~<br>

 lbrtchx<br>

<br>

_______________________________________________<br>

UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/corpora</a><br>

Corpora mailing list<br>

<a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>

<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>

</div></div></blockquote></div><br><br clear="all"><div><br></div>-- <br>- Anil<br>

</div>