Corpora: Thank you-Results-Spanish Examples

Thu Sep 27 12:51:01 UTC 2001

First in English then in Spanish

Dear colleagues:

I would like to thank the people listed below for taking time out from their busy schedules to give me on-line addresses of Spanish corpra (which I will share with my Spanish teaching colleagues) and to send me examples of strings which I had requested. For those of you who may be interested I received the following information. If I have forgotten anyone, my sincerest apologies. 
Best wishes,
Linda

 Estimados colegas: 
 Me gustaría dar las gracias a los siguientes por haber tomado el tiempo de sus apretadas agendas para dar me los 'on-line' direcciones de corpra en español (lo cual  me compartiré con mis colegas que enseñan español) y para enviarme ejemplos que yo había pedido. Si estas interesado, recibe la siguiente información. Si he olvide alguien, mis mas sinceros disculpas. 

Saludos,

Linda 

ON-LINE ADDRESSES

http://www.webcorp.org.uk/             Antoinette Renouf 
http://miniappolis.com/KWiCFinder/ Bill Fletcher 

>From Mark Davies I received the following:

Feel free to take a look at http://mdavies.for.ilstu.edu/corpus, which is a 
small three million word sample corpus that shows some of the functionality 
that will be part of the 100 million word corpus of Spanish that I'm 
creating as part of an NEH-funded project.

One of the nice things about this corpus is that for any one of 30,000 
different lemma, you can input the lemma and it will show you all of the 
synonyms in the corpus, and these can be limited by frequency (e.g. all of 
the synonyms of "triunfar" that occur at least 5 times in the 1900s).  You 
can also search by part of speech (35 different categories) as well as by 
lemma (e.g. all forms of a given verb).  In the case of your query, for 
example, you could look for < haber.* que > and it will give you the 
construction in all of the different tenses.        

http://cronos.rae.es/interno/creanet.html Knut Hofland
http://mdavies.for.ilstu.edu/personal/texts.asp   Knut Hofland
http://www.niederlandistik.fu-berlin.de/cgi-bin/web-conc.cgi  Eliska Bokova

http://133.12.37.60/kwic/kwic.html and the person who maintains it is
Antonio Ruiz Tinoco a-ruiz at hoffman.cc.sophia.ac.jp.  Constantin Orasan 

>From Aquilino Sánchez Pérez I received the following information which I'll (try) to translate.

Te aconsejo que adquiers el Gran Diccionario de Uso del Español Actual, de
la editorial SGEL s.a. Con él te regalarán un corpus del español de 2
millones de palabras, con el gestor incluido. En el corpus Cumbre, de 20 millones de palabras, TAL VEZ y QUIZÁ(s) ofrecen
datos muy abundantes, como verás en los ficheros que te adjunto.

I suggest you get thel Gran Diccionario de Uso del Español, from the publishers SGEL S.A. With this you will receive a Spanish corpus of 2 million words free of charge, including the program (I'm not sure I've translated that right). In the Cumbre corpus of 20 million words TAL VEZ y Quizá give abundant results, as you will see in the files which I have attached.

Susana Sotillo offered to look in her corpus  and Dr. Leonel Ruis who wrote the following:
Tengo un corpus escrito de la prensa y escolares cubanos. Si desea, la podemos ayudar al respecto.

In addition, Rachel Virgínia Xavier Aires has a Portugese corpus for those of you are interested. 

Again, thank you all very much for your response.
Linda

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20010927/ed814be2/attachment.htm>