<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style type="text/css" style="display:none"><!-- p { margin-top: 0px; margin-bottom: 0px; }--></style>
</head>
<body dir="ltr" style="font-size:10pt;color:#000000;background-color:#FFFFFF;font-family:Tahoma,Geneva,sans-serif;">
<p>Mario,<br>
</p>
<p><br>
</p>
<p>&gt;&gt; <span style="color: rgb(33, 33, 33); font-size: 13.63636302948px;">We are studying the degree of repetition of certain words in comparative expressions in Spanish. I wonder if you know of a Spanish corpus that allows the researcher to find expressions such as the
following:</span></p>
<p style="color: rgb(33, 33, 33); font-family: Tahoma, Geneva, sans-serif; font-size: 13.63636302948px; background-color: rgb(255, 255, 255);">
<em>tan</em> <strong>WORD</strong>.adv <em>como lo</em> <strong>WORD</strong>.adv <br>
<em>Tan</em> <strong>WORD</strong>.adj<em> como lo</em> <strong>WORD</strong>.adj<br>
<em>Tant*</em> <strong>WORD</strong>.noun <em>como l*</em> <strong>WORD</strong>.noun <br>
</p>
<p style="color: rgb(33, 33, 33); font-family: Tahoma, Geneva, sans-serif; font-size: 13.63636302948px; background-color: rgb(255, 255, 255);">
&gt;&gt; <span style="color: rgb(33, 33, 33); font-family: Tahoma, Geneva, sans-serif; font-size: 13.63636302948px; background-color: rgb(255, 255, 255);">In all cases, the token expressed as WORD must be the same; that is, the same word occurs twice in the comparative
structure.</span><br>
</p>
<p>&gt;&gt; <span style="color: rgb(33, 33, 33); font-family: 'Segoe UI', 'Segoe WP', 'Segoe UI WPC', Tahoma, Arial, sans-serif; font-size: 15.4545450210571px; background-color: rgb(255, 255, 255);">We have tried Mark Davies's corpus and CREA, but in principle they
do not allow the use of variables with the same value.</span><br>
</p>
<p><br>
</p>
<p>This isn't possible via the publicly accessible Corpus del Español web interface (www.corpusdelespanol.org), but I can easily generate the results on the back end via SQL queries against the corpus.<br>
</p>
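<p>A minimal sketch of how such a back-end query could look, using Python's built-in sqlite3 module. The table layout (one row per token with a sequential id, a word form, and a part-of-speech tag), the tag names, and the sample sentence are all illustrative assumptions; the actual schema of the corpus is not described here. The "same variable" constraint is expressed as a self-join that requires both WORD slots to hold the same word.</p>

```python
import sqlite3

# Hypothetical schema (an assumption, not the real corpus layout):
# one row per token, numbered in text order.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tokens (id INTEGER PRIMARY KEY, word TEXT, pos TEXT)")

# Invented sample data: "es tan bueno como lo bueno y tan alto como lo grande"
sample = [
    ("es", "verb"), ("tan", "adv"), ("bueno", "adj"), ("como", "conj"),
    ("lo", "art"), ("bueno", "adj"), ("y", "conj"), ("tan", "adv"),
    ("alto", "adj"), ("como", "conj"), ("lo", "art"), ("grande", "adj"),
]
conn.executemany("INSERT INTO tokens (word, pos) VALUES (?, ?)", sample)

# Self-join on five consecutive token positions.
# The condition w1.word = w2.word enforces that WORD is repeated.
QUERY = """
SELECT w1.word, COUNT(*) AS freq
FROM tokens AS t1
JOIN tokens AS w1 ON w1.id = t1.id + 1
JOIN tokens AS t2 ON t2.id = t1.id + 2
JOIN tokens AS t3 ON t3.id = t1.id + 3
JOIN tokens AS w2 ON w2.id = t1.id + 4
WHERE lower(t1.word) = 'tan'
  AND t2.word = 'como'
  AND t3.word = 'lo'
  AND w1.pos = ?
  AND w2.pos = w1.pos
  AND w1.word = w2.word
GROUP BY w1.word
ORDER BY freq DESC
"""


def repeated_comparatives(connection, pos):
    """Return (word, frequency) rows for 'tan WORD como lo WORD' with the given POS tag."""
    return connection.execute(QUERY, (pos,)).fetchall()


print(repeated_comparatives(conn, "adj"))
```

<p>In this sketch the wildcard patterns (Tant*, l*) could be handled by swapping the equality tests for LIKE 'tant%' and LIKE 'l%'. Sentence boundaries are ignored here, so a real query would likely also need to keep all five tokens within one sentence.</p>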
<p><br>
</p>
<p>Mark Davies<br>
</p>
<p><br>
</p>
<div id="Signature">
<div style="font-family:Tahoma; font-size:13px">
<div style="font-family:Tahoma; font-size:13px">
<p>============================================<br>
Mark Davies<br>
Professor of Linguistics / Brigham Young University<br>
<a tabindex="0" href="http://davies-linguistics.byu.edu/">http://davies-linguistics.byu.edu/</a></p>
<p>** Corpus design and use // Linguistic databases **<br>
** Historical linguistics // Language variation **<br>
** English, Spanish, and Portuguese **<br>
============================================<br>
</p>
</div>
</div>
</div>
<div style="color: rgb(33, 33, 33);">
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> corpora-bounces@uib.no &lt;corpora-bounces@uib.no&gt; on behalf of Mario Crespo Miguel &lt;mario.crespo@uca.es&gt;<br>
<b>Sent:</b> Monday, October 20, 2014 4:39 AM<br>
<b>To:</b> corpora@uib.no<br>
<b>Cc:</b> pedropablo.devis@uca.es<br>
<b>Subject:</b> [Corpora-List] Use of variables in a Spanish corpus</font>
<div> </div>
</div>
<div>
<p>Dear members of corpora list, </p>
<p>We are studying the degree of repetition of certain words in comparative expressions in Spanish. I wonder if you know of a Spanish corpus that allows the researcher to find expressions such as the following:</p>
<p><em>tan</em> <strong>WORD</strong>.adv <em>como lo</em> <strong>WORD</strong>.adv <br>
<em>Tan</em> <strong>WORD</strong>.adj<em> como lo</em> <strong>WORD</strong>.adj<br>
<em>Tant*</em> <strong>WORD</strong>.noun <em>como l*</em> <strong>WORD</strong>.noun </p>
<p>In all cases, the token expressed as WORD must be the same; that is, the same word occurs twice in the comparative structure.</p>
<p>We have tried Mark Davies's corpus and CREA, but in principle they do not allow the use of variables with the same value.</p>
<br>
<p>Thank you very much in advance, </p>
<p>Mario Crespo</p>
<br>
<br>
<br>
<div id="firmaInstitucionalUCA" style="border-top:1px solid #aaa">
<table border="0" cellspacing="0" style="padding:0; margin:0">
<tbody>
<tr>
<td valign="top">
<div style="float:left; width:150px"><img alt="UCA" width="145" height="100" src="https://webmail.uca.es/img/uca.gif"></div>
</td>
<td valign="top">
<div style="float:left; border-left:2px solid #f29200; padding-left:10px; margin-top:5px; font-family:Helvetica,Garamond,Arial,sans-serif; font-size:12px; color:#000">
<div style="padding-top:10px; font-size:16px; color:#102954">Mario Crespo Miguel</div>
<div style="padding-top:5px; font-size:12px; color:#2c758f">Profesor Sustituto Interino</div>
<div style="padding-top:5px; font-size:12px; color:#2c758f"><strong>Mario Crespo Miguel
<br>
<br>
Área de Lingüística <br>
Departamento de Filología</strong></div>
<div style="padding-top:5px; font-size:12px; color:#2c758f">Universidad de Cádiz</div>
</div>
</td>
</tr>
</tbody>
</table>
</div>
</div>
</div>
</body>
</html>