Corpora: Corp[us|ora]

Darren Pearce darrenp at cogs.susx.ac.uk
Wed Apr 4 21:27:32 UTC 2001


AltaVista advanced search page counts:
	corpuses: ~500
	corpora:  ~55,500

I have 50M words of BNC data, and the counts from that (based on
dependencies) are:

	corpora: 11
	corpuses: 5

Surprisingly low (and close)...
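
For scale, the ratio of 'corpora' to 'corpuses' in each source can be
worked out from the counts above. A minimal Python sketch, using the
approximate AltaVista figures and the BNC figures as quoted (the
dictionary layout and the function name are only for illustration):

```python
# Counts as quoted above; the AltaVista figures are approximate.
counts = {
    "AltaVista": {"corpora": 55500, "corpuses": 500},
    "BNC (50M words)": {"corpora": 11, "corpuses": 5},
}


def plural_ratio(source):
    """Return occurrences of 'corpora' per occurrence of 'corpuses'."""
    c = counts[source]
    return c["corpora"] / c["corpuses"]


for source in counts:
    print(f"{source}: {plural_ratio(source):.1f} to 1")
```

On these figures the web pages favour 'corpora' by roughly 111 to 1,
against about 2.2 to 1 in the BNC sample, though with only 16 BNC hits
in total the second ratio should be read with caution.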

Darren.

On Wed, 4 Apr 2001, James L. Fidelholtz wrote:

> On Wed, 4 Apr 2001, Harold Somers wrote:
> 
> >Could users of this mailing list at least get it right?
> >One corpus.
> >Several corpora.
> >It's not too much to ask is it?
> 
> Harold:
> 	Well, it depends what language you are using.  In Spanish, the
> plural of 'corpus' would be 'corpus', especially if you're not trying to
> be snotty or hypercorrect.  Even in English, except here on the list, I
> might very well use 'corpuses'.  Maybe someone can check the BNC and see
> what they come up with.
> 		Jim
> 
> -- 
> James L. Fidelholtz			e-mail: jfidel at siu.buap.mx
> Posgrado en Ciencias del Lenguaje	tel.: +(52-2)229-5500 x5705
> Instituto de Ciencias Sociales y Humanidades	fax: +(01-2) 229-5681
> Benemérita Universidad Autónoma de Puebla, MÉXICO
> 
> 
> 

+-------------------------------------------------------------------------+
|                                                                         |
| Darren Pearce                                                           |
| COGS, Sussex University, Falmer, Brighton                               |
| Mobile: 07950 255 448                                                   |
| Email: darrenmpearce at bigfoot.com                                     |
|                                                                         |
+-------------------------------------------------------------------------+


