Corpora: Corp[us|ora]
Darren Pearce
darrenp at cogs.susx.ac.uk
Wed Apr 4 21:27:32 UTC 2001
AltaVista advanced search page counts:
corpuses: ~500
corpora: ~55,500
I have 50M words of BNC data, and the counts I get from it (based on
dependencies) are:
corpora: 11
corpuses: 5
Surprisingly low (and close)...
Darren.
On Wed, 4 Apr 2001, James L. Fidelholtz wrote:
> On Wed, 4 Apr 2001, Harold Somers wrote:
>
> >Could users of this mailing list at least get it right?
> >One corpus.
> >Several corpora.
> >It's not too much to ask is it?
>
> Harold:
> Well, it depends what language you are using. In Spanish, the
> plural of 'corpus' would be 'corpus', especially if you're not trying to
> be snotty or hypercorrect. Even in English, except here on the list, I
> might very well use 'corpuses'. Maybe someone can check the BNC and see
> what they come up with.
> Jim
>
> --
> James L. Fidelholtz e-mail: jfidel at siu.buap.mx
> Posgrado en Ciencias del Lenguaje tel.: +(52-2)229-5500 x5705
> Instituto de Ciencias Sociales y Humanidades fax: +(01-2) 229-5681
> Benemérita Universidad Autónoma de Puebla, MÉXICO
>
+---------------------------------------------+
|                                             |
| Darren Pearce                               |
| COGS, Sussex University, Falmer, Brighton   |
| Mobile: 07950 255 448                       |
| Email: darrenmpearce at bigfoot.com         |
|                                             |
+---------------------------------------------+