[Corpora-List] Anything resembling TPC benchmarks for corpora?

Adam Kilgarriff adam at lexmasterclass.com
Wed Jul 11 17:33:35 UTC 2012


Dear Albrecht,

I really can't swallow the analogy with DBMS.  That's technology; (corpus)
linguistics is science.  There, the task is getting your house in order and
singing from the same hymnsheet.  Here, the big picture is that we are
trying to find out how language works.  Of course it won't be neat and
tidy, because language is used for all sorts of purposes in all sorts of
situations by all sorts of communities with all sorts of histories, and is
as complex as the minds and societies it emanates from.

adam

On 11 July 2012 12:47, Albretch Mueller <lbrtchx at gmail.com> wrote:

>  Anil et al,
> ~
>  I recently finished reading a number of seminal papers on corpus
> linguistics going as far back as the early 1960's (ISBN-10:
> 082648803X) and, basically, those questions are implicitly (not that
> "implicitly"), as well as very explicitly aired throughout the book.
> ~
>  When we talk about "corpora" it doesn't seem that we are talking from
> the same referent paradigm.
> ~
>  Also the DBMS community dealt with their particular issues. I am old
> enough to remember how much of a deal bitwise addressing was. I don't
> have to deal with storage capacity or computing power, yet, at some
> point, we will have to start making sense. Computers won't do that for
> us.
> ~
>  lbrtchx
>



-- 
========================================
Adam Kilgarriff <http://www.kilgarriff.co.uk/>
adam at lexmasterclass.com
Director, Lexical Computing Ltd <http://www.sketchengine.co.uk/>
Visiting Research Fellow, University of Leeds <http://leeds.ac.uk>

*Corpora for all* with the Sketch Engine <http://www.sketchengine.co.uk>
*DANTE: a lexical database for English* <http://www.webdante.com>
========================================