Corpora: Thank you-Results-Spanish Examples
Linda Bawcom
lbawcom at eresmas.net
Thu Sep 27 12:51:01 UTC 2001
First in English then in Spanish
Dear colleagues:
I would like to thank the people listed below for taking time out from their busy schedules to give me on-line addresses of Spanish corpra (which I will share with my Spanish teaching colleagues) and to send me examples of strings which I had requested. For those of you who may be interested I received the following information. If I have forgotten anyone, my sincerest apologies.
Best wishes,
Linda
Estimados colegas:
Me gustaría dar las gracias a los siguientes por haber tomado el tiempo de sus apretadas agendas para dar me los 'on-line' direcciones de corpra en español (lo cual me compartiré con mis colegas que enseñan español) y para enviarme ejemplos que yo había pedido. Si estas interesado, recibe la siguiente información. Si he olvide alguien, mis mas sinceros disculpas.
Saludos,
Linda
ON-LINE ADDRESSES
http://www.webcorp.org.uk/ Antoinette Renouf
http://miniappolis.com/KWiCFinder/ Bill Fletcher
>From Mark Davies I received the following:
Feel free to take a look at http://mdavies.for.ilstu.edu/corpus, which is a
small three million word sample corpus that shows some of the functionality
that will be part of the 100 million word corpus of Spanish that I'm
creating as part of an NEH-funded project.
One of the nice things about this corpus is that for any one of 30,000
different lemma, you can input the lemma and it will show you all of the
synonyms in the corpus, and these can be limited by frequency (e.g. all of
the synonyms of "triunfar" that occur at least 5 times in the 1900s). You
can also search by part of speech (35 different categories) as well as by
lemma (e.g. all forms of a given verb). In the case of your query, for
example, you could look for < haber.* que > and it will give you the
construction in all of the different tenses.
http://cronos.rae.es/interno/creanet.html Knut Hofland
http://mdavies.for.ilstu.edu/personal/texts.asp Knut Hofland
http://www.niederlandistik.fu-berlin.de/cgi-bin/web-conc.cgi Eliska Bokova
http://133.12.37.60/kwic/kwic.html and the person who maintains it is
Antonio Ruiz Tinoco a-ruiz at hoffman.cc.sophia.ac.jp. Constantin Orasan
>From Aquilino Sánchez Pérez I received the following information which I'll (try) to translate.
Te aconsejo que adquiers el Gran Diccionario de Uso del Español Actual, de
la editorial SGEL s.a. Con él te regalarán un corpus del español de 2
millones de palabras, con el gestor incluido. En el corpus Cumbre, de 20 millones de palabras, TAL VEZ y QUIZÁ(s) ofrecen
datos muy abundantes, como verás en los ficheros que te adjunto.
I suggest you get thel Gran Diccionario de Uso del Español, from the publishers SGEL S.A. With this you will receive a Spanish corpus of 2 million words free of charge, including the program (I'm not sure I've translated that right). In the Cumbre corpus of 20 million words TAL VEZ y Quizá give abundant results, as you will see in the files which I have attached.
Susana Sotillo offered to look in her corpus and Dr. Leonel Ruis who wrote the following:
Tengo un corpus escrito de la prensa y escolares cubanos. Si desea, la podemos ayudar al respecto.
In addition, Rachel Virgínia Xavier Aires has a Portugese corpus for those of you are interested.
Again, thank you all very much for your response.
Linda
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20010927/ed814be2/attachment.htm>
More information about the Corpora
mailing list