<HTML><BODY STYLE="font:10pt verdana; border:none;"><DIV> <P><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"><FONT face="Times New Roman, Times, Serif">Dear Corpora Listers:<?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" /><o:p></o:p></FONT></SPAN></P> <P><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"><FONT face="Times New Roman, Times, Serif">Here is a summary of my inquiry concerning the corpus analysis of keywords in literary texts.<o:p></o:p></FONT></SPAN></P> <P class=MsoNormal><FONT face="Times New Roman, Times, Serif"><SPAN style="mso-bidi-font-size: 10.0pt">1. Mick Short noted that there is some discussion of keywords in the play <EM><SPAN style="FONT-STYLE: normal">Romeo and Juliet</SPAN></EM> in chapter 4 of Jonathan Culpeper, <EM><SPAN style="FONT-STYLE: normal">Language and Characterisation</SPAN></EM>, (Longman 2001). Mick also suggested that it might be worth having a look at David Hoover's </SPAN><SPAN lang=EN-GB style="mso-bidi-font-size: 10.0pt; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-GB">Language and Style in The Inheritors (University Press of America 1998), which compares Golding's book with various corpora. </SPAN><SPAN style="mso-bidi-font-size: 10.0pt"><o:p></o:p></SPAN></FONT></P> <P class=MsoNormal><SPAN style="mso-bidi-font-size: 10.0pt"><FONT face="Times New Roman, Times, Serif"> </FONT></SPAN><SPAN style="mso-bidi-font-size: 10.0pt"><FONT face="Times New Roman, Times, Serif">2. Christopher Tribble noted that M. Stubbs, Text and Corpus Analysis (1996) specifically mentions Raymond Williams¡¯ notion of keywords. Christopher also commented that Mike Scott has been doing work on cultural keywords using Guardian newspaper data.<o:p></o:p></FONT></SPAN></P> <P><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO">3</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt">. Adam Kilgarriff remind</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO">s</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt"> me that </SPAN></FONT><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'">Mike Scott's Wordsmith supports this sort of analysis, and that Tony Bernber Sardinha knows a lot about the area but from an EFL rather than a literary perspective</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"><o:p></o:p></SPAN></FONT></P> <P><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"><FONT face="Times New Roman, Times, Serif">The following two leads were very useful:<o:p></o:p></FONT></SPAN></P> <P><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">4. Ramesh Krishnamurthy has written ¡°</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">Ethnic, Racial and Tribal: The Language of Racism?</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">¡± </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">in Texts and</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"> </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">Practices, eds. Caldas-Coulthard & Coulthard, Routledge, London</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">, 1996.<SPAN style="mso-spacerun: yes"> </SPAN>In this article , Ramesh looked at three </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">keywords in the Bank of English corpus (then 121 million</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"> </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">words, now 418 million words) and ma</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">d</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">e specific references to</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"> Ra</SPAN></FONT><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'">ymond Williams' Keywords.<BR><BR></SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">5. </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">Andrius Utka</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">,<B> </B></SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">a master student at Vytautas Magnus University, Faculty of Humanities </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">has done an analysis of </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt">George Orwell¡¯s 1984 </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt; mso-fareast-language: KO">using the s</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt">tatistical </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt; mso-fareast-language: KO">m</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt">ethods of </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt; mso-fareast-language: KO">c</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt">orpus </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt; mso-fareast-language: KO">l</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt">inguistics. </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt; mso-fareast-language: KO">It is available for viewing at<SPAN style="mso-spacerun: yes"> </SPAN></SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'"><A href="http://donelaitis.vdu.lt/">http://donelaitis.vdu.lt</A>, </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">by </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">follow </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">the </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">link </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">from </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">"publications" </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">to </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">"sankirta"</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">. <o:p></o:p></SPAN></FONT></P> <P><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"><FONT face="Times New Roman, Times, Serif">Among other things, this paper suggests a useful method for discovering what the keywords of a given literary text actually are:</FONT></SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'"><BR style="mso-special-character: line-break"><BR style="mso-special-character: line-break"></SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"><o:p></o:p></SPAN></P> <P><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">¡°The </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">following procedure is used for finding key words in 1984: <o:p></o:p></SPAN></FONT></P> <OL type=1> <LI class=MsoNormal style="TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify; tab-stops: list 36.0pt; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1"><FONT face="Times New Roman, Times, Serif">The frequency list of all word forms is produced by the computer program Wordsmith Tools. </FONT></LI> <LI class=MsoNormal style="TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify; tab-stops: list 36.0pt; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1"><FONT face="Times New Roman, Times, Serif">Only 100 most frequent nouns are left and all the other words are removed from the list. </FONT></LI> <LI class=MsoNormal style="TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify; tab-stops: list 36.0pt; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1"><FONT face="Times New Roman, Times, Serif">The nouns are lemmatized. </FONT></LI> <LI class=MsoNormal style="TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify; tab-stops: list 36.0pt; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1"><FONT face="Times New Roman, Times, Serif">The frequency list of these 100 nouns is produced from the large corpus of the Bank of English. </FONT></LI> <LI class=MsoNormal style="TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify; tab-stops: list 36.0pt; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1"><FONT face="Times New Roman, Times, Serif">The occurrences of words in both frequency lists are compared using chi-squared statistical test¡±. </FONT></LI> <LI class=MsoNormal style="TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify; tab-stops: list 36.0pt; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1"><FONT face="Times New Roman, Times, Serif">The key words are sorted out according the chi-square value.</FONT></LI></OL> <P class=MsoNormal><FONT face="Times New Roman, Times, Serif">Finally, there were two respondents working on texts other than literary ones:</FONT></P> <P class=MsoNormal><FONT face="Times New Roman, Times, Serif"> </FONT><FONT face="Times New Roman, Times, Serif">6. <SPAN style="COLOR: black; mso-bidi-font-size: 10.0pt">Wendy J. Anderson, a PhD Student in the Department of French at the University of St Andrews is carrying out keyword analysis </SPAN><SPAN style="mso-bidi-font-size: 10.0pt">on administrative texts in French.<o:p></o:p></SPAN></FONT></P> <P class=MsoNormal><SPAN style="mso-bidi-font-size: 10.0pt"><FONT face="Times New Roman, Times, Serif"> </FONT></SPAN><FONT face="Times New Roman, Times, Serif"><SPAN style="mso-bidi-font-size: 10.0pt">7. Geoffrey Williams has done work on extracting keywords in scientific corpora. Geoffrey also notes that Berry Roghe worked in a similar way on literary texts in the 70's.</SPAN><SPAN style="mso-bidi-font-size: 10.0pt; mso-fareast-language: EN-US"><o:p></o:p></SPAN></FONT></P> <P class=MsoNormal><SPAN style="mso-bidi-font-size: 10.0pt"><FONT face="Times New Roman, Times, Serif"> </FONT></SPAN><SPAN style="mso-bidi-font-size: 10.0pt"><FONT face="Times New Roman, Times, Serif">The references that Geoffrey provided are:<o:p></o:p></FONT></SPAN></P> <P class=MsoNormal><SPAN style="mso-bidi-font-size: 10.0pt"><FONT face="Times New Roman, Times, Serif"> </FONT></SPAN><FONT face="Times New Roman, Times, Serif"><SPAN lang=EN-GB style="mso-bidi-font-size: 10.0pt; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-GB; mso-fareast-language: FR">Berry-Roghe G.L.M. (1973). The computation of collocations and their relevance in lexical studies, dans Aitken A.J,. Bailey R., Hamilton-Smith N., (eds), The Computer and Literary Studies, Edinburgh, Edinburgh University Press </SPAN><SPAN style="mso-bidi-font-size: 10.0pt"><o:p></o:p></SPAN></FONT></P> <P class=editotexte0 style="MARGIN-TOP: 6pt; TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify"><FONT face="Times New Roman, Times, Serif"><SPAN lang=EN-GB style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-ansi-language: EN-GB">Williams, G. 1998. "Collocational Networks: Interlocking Patterns of Lexis in a Corpus of Plant Biology". International Journal of Corpus Linguistics. .</SPAN><SPAN lang=FR style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-ansi-language: FR">3(1): 151-171<o:p></o:p></SPAN></FONT></P> <P class=editotexte0 style="MARGIN-TOP: 6pt; TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify"><FONT face="Times New Roman, Times, Serif"><SPAN lang=FR style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-ansi-language: FR">Williams G. 1999. Les rseaux collocationnels dans la construction et l'exploitation d'un corpus dans le cadre d'une communaut de discours scientifique. Th</SPAN><SPAN lang=FR style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-ansi-language: FR; mso-fareast-language: KO">e</SPAN><SPAN lang=FR style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-ansi-language: FR">se en anglais linguistique de corpus. Universit de Nantes. </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt"><A href="http://perso.wanadoo/geoffrey.williams">http://perso.wanadoo/geoffrey.williams</A></SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO"><o:p></o:p></SPAN></FONT></P> <P class=editotexte0 style="MARGIN-TOP: 6pt; TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify"><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO"><FONT face="Times New Roman, Times, Serif">It seems clear that the field is still in its very early stages of development. I suspect, however, that it may experience some growth over the next few years, although perhaps the non-literary areas may grow more quickly than the literary one that concerns me.<o:p></o:p></FONT></SPAN></P> <P class=editotexte0 style="MARGIN-TOP: 6pt; TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify"><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt">Thanks </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO">very much </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt">to all who responded.</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO"><o:p></o:p></SPAN></FONT></P> <P class=editotexte0 style="MARGIN-TOP: 6pt; TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify"><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO"><SPAN style="mso-spacerun: yes"> </SPAN></SPAN></FONT>Dr. Terry Murphy<BR>Yonsei University<BR>College of Liberal Arts<BR>Dept. of English Language and Literature<BR>Seoul 120--749<BR>Korea</P></DIV></BODY></HTML><br clear=all><hr>Get more from the Web. FREE MSN Explorer download : <a href='http://explorer.msn.com'>http://explorer.msn.com</a><br></p>