<HTML><BODY STYLE="font:10pt verdana; border:none;"><DIV> <P><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"><FONT face="Times New Roman, Times, Serif">Dear Corpora Listers:<?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" /><o:p></o:p></FONT></SPAN></P> <P><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"><FONT face="Times New Roman, Times, Serif">Here is a summary of the responses to my inquiry about the corpus analysis of keywords in literary texts.<o:p></o:p></FONT></SPAN></P> <P class=MsoNormal><FONT face="Times New Roman, Times, Serif"><SPAN style="mso-bidi-font-size: 10.0pt">1. Mick Short noted that there is some discussion of keywords in the play <EM><SPAN style="FONT-STYLE: normal">Romeo and Juliet</SPAN></EM> in chapter 4 of Jonathan Culpeper, <EM><SPAN style="FONT-STYLE: normal">Language and Characterisation</SPAN></EM> (Longman 2001). Mick also suggested that it might be worth having a look at David Hoover's </SPAN><SPAN lang=EN-GB style="mso-bidi-font-size: 10.0pt; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-GB">Language and Style in The Inheritors (University Press of America 1998), which compares Golding's book with various corpora. </SPAN><SPAN style="mso-bidi-font-size: 10.0pt"><o:p></o:p></SPAN></FONT></P> <P class=MsoNormal><SPAN style="mso-bidi-font-size: 10.0pt"><FONT face="Times New Roman, Times, Serif"> </FONT></SPAN><SPAN style="mso-bidi-font-size: 10.0pt"><FONT face="Times New Roman, Times, Serif">2. Christopher Tribble noted that M. Stubbs, Text and Corpus Analysis (1996) specifically mentions Raymond Williams’ notion of keywords. 
Christopher also commented that Mike Scott has been doing work on cultural keywords using Guardian newspaper data.<o:p></o:p></FONT></SPAN></P> <P><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO">3</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt">. Adam Kilgarriff remind</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO">s</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt"> me that </SPAN></FONT><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'">Mike Scott's WordSmith Tools supports this sort of analysis, and that Tony Berber Sardinha knows a lot about the area, but from an EFL rather than a literary perspective.</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"><o:p></o:p></SPAN></FONT></P> <P><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"><FONT face="Times New Roman, Times, Serif">The following two leads were very useful:<o:p></o:p></FONT></SPAN></P> <P><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">4. Ramesh Krishnamurthy has written “</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">Ethnic, Racial and Tribal: The Language of Racism?</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">” </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">in Texts and</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"> </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">Practices, eds. 
Caldas-Coulthard &amp; Coulthard, Routledge, London</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">, 1996.<SPAN style="mso-spacerun: yes"> </SPAN>In this article, Ramesh looked at three </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">keywords in the Bank of English corpus (then 121 million</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"> </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">words, now 418 million words) and ma</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">d</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">e specific references to</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"> Ra</SPAN></FONT><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'">ymond Williams' Keywords.<BR><BR></SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">5. </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">Andrius Utka</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">,<B> </B></SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">a master's student at Vytautas Magnus University, Faculty of Humanities, </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">has done an analysis of </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt">George Orwell’s 1984 </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt; mso-fareast-language: KO">using the s</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt">tatistical </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt; mso-fareast-language: KO">m</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt">ethods of </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt; mso-fareast-language: KO">c</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 
24.0pt">orpus </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt; mso-fareast-language: KO">l</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt">inguistics. </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 24.0pt; mso-fareast-language: KO">It is available for viewing at<SPAN style="mso-spacerun: yes"> </SPAN></SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'"><A href="http://donelaitis.vdu.lt/">http://donelaitis.vdu.lt</A>, </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">by </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">following </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">the </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">link </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">from </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">"publications" </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">to </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">"sankirta"</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">. 
<o:p></o:p></SPAN></FONT></P> <P><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"><FONT face="Times New Roman, Times, Serif">Among other things, this paper suggests a useful method for discovering what the keywords of a given literary text actually are:</FONT></SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'"><BR style="mso-special-character: line-break"><BR style="mso-special-character: line-break"></SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO"><o:p></o:p></SPAN></P> <P><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-fareast-language: KO">“The </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'">following procedure is used for finding key words in 1984: <o:p></o:p></SPAN></FONT></P> <OL type=1> <LI class=MsoNormal style="TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify; tab-stops: list 36.0pt; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1"><FONT face="Times New Roman, Times, Serif">The frequency list of all word forms is produced by the computer program Wordsmith Tools. </FONT></LI> <LI class=MsoNormal style="TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify; tab-stops: list 36.0pt; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1"><FONT face="Times New Roman, Times, Serif">Only 100 most frequent nouns are left and all the other words are removed from the list. </FONT></LI> <LI class=MsoNormal style="TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify; tab-stops: list 36.0pt; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1"><FONT face="Times New Roman, Times, Serif">The nouns are lemmatized. 
</FONT></LI> <LI class=MsoNormal style="TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify; tab-stops: list 36.0pt; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1"><FONT face="Times New Roman, Times, Serif">The frequency list of these 100 nouns is produced from the large corpus of the Bank of English. </FONT></LI> <LI class=MsoNormal style="TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify; tab-stops: list 36.0pt; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1"><FONT face="Times New Roman, Times, Serif">The occurrences of words in both frequency lists are compared using chi-squared statistical test”. </FONT></LI> <LI class=MsoNormal style="TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify; tab-stops: list 36.0pt; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1"><FONT face="Times New Roman, Times, Serif">The key words are sorted according to the chi-square value.</FONT></LI></OL> <P class=MsoNormal><FONT face="Times New Roman, Times, Serif">Finally, there were two respondents working on texts other than literary ones:</FONT></P> <P class=MsoNormal><FONT face="Times New Roman, Times, Serif"> </FONT><FONT face="Times New Roman, Times, Serif">6. <SPAN style="COLOR: black; mso-bidi-font-size: 10.0pt">Wendy J. Anderson, a PhD student in the Department of French at the University of St Andrews, is carrying out keyword analysis </SPAN><SPAN style="mso-bidi-font-size: 10.0pt">on administrative texts in French.<o:p></o:p></SPAN></FONT></P> <P class=MsoNormal><SPAN style="mso-bidi-font-size: 10.0pt"><FONT face="Times New Roman, Times, Serif"> </FONT></SPAN><FONT face="Times New Roman, Times, Serif"><SPAN style="mso-bidi-font-size: 10.0pt">7. Geoffrey Williams has done work on extracting keywords in scientific corpora. 
Geoffrey also notes that Berry-Rogghe worked in a similar way on literary texts in the 1970s.</SPAN><SPAN style="mso-bidi-font-size: 10.0pt; mso-fareast-language: EN-US"><o:p></o:p></SPAN></FONT></P> <P class=MsoNormal><SPAN style="mso-bidi-font-size: 10.0pt"><FONT face="Times New Roman, Times, Serif"> </FONT></SPAN><SPAN style="mso-bidi-font-size: 10.0pt"><FONT face="Times New Roman, Times, Serif">The references that Geoffrey provided are:<o:p></o:p></FONT></SPAN></P> <P class=MsoNormal><SPAN style="mso-bidi-font-size: 10.0pt"><FONT face="Times New Roman, Times, Serif"> </FONT></SPAN><FONT face="Times New Roman, Times, Serif"><SPAN lang=EN-GB style="mso-bidi-font-size: 10.0pt; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-GB; mso-fareast-language: FR">Berry-Rogghe, G.L.M. 1973. "The computation of collocations and their relevance in lexical studies". In Aitken A.J., Bailey R., Hamilton-Smith N. (eds), The Computer and Literary Studies. Edinburgh: Edinburgh University Press. </SPAN><SPAN style="mso-bidi-font-size: 10.0pt"><o:p></o:p></SPAN></FONT></P> <P class=editotexte0 style="MARGIN-TOP: 6pt; TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify"><FONT face="Times New Roman, Times, Serif"><SPAN lang=EN-GB style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-ansi-language: EN-GB">Williams, G. 1998. "Collocational Networks: Interlocking Patterns of Lexis in a Corpus of Plant Biology". International Journal of Corpus Linguistics </SPAN><SPAN lang=FR style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-ansi-language: FR">3(1): 151-171.<o:p></o:p></SPAN></FONT></P> <P class=editotexte0 style="MARGIN-TOP: 6pt; TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify"><FONT face="Times New Roman, Times, Serif"><SPAN lang=FR style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-ansi-language: FR">Williams, G. 1999. 
Les réseaux collocationnels dans la construction et l'exploitation d'un corpus dans le cadre d'une communauté de discours scientifique. Th</SPAN><SPAN lang=FR style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-ansi-language: FR; mso-fareast-language: KO">è</SPAN><SPAN lang=FR style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-ansi-language: FR">se en anglais linguistique de corpus. Université de Nantes. </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt"><A href="http://perso.wanadoo/geoffrey.williams">http://perso.wanadoo/geoffrey.williams</A></SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO"><o:p></o:p></SPAN></FONT></P> <P class=editotexte0 style="MARGIN-TOP: 6pt; TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify"><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO"><FONT face="Times New Roman, Times, Serif">It seems clear that the field is still in its very early stages of development. 
I suspect, however, that it may experience some growth over the next few years, although perhaps the non-literary areas may grow more quickly than the literary one that concerns me.<o:p></o:p></FONT></SPAN></P> <P class=editotexte0 style="MARGIN-TOP: 6pt; TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify"><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt">Thanks </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO">very much </SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt">to all who responded.</SPAN><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO"><o:p></o:p></SPAN></FONT></P> <P class=editotexte0 style="MARGIN-TOP: 6pt; TEXT-JUSTIFY: inter-ideograph; TEXT-ALIGN: justify"><FONT face="Times New Roman, Times, Serif"><SPAN style="FONT-FAMILY: 'Times New Roman'; mso-bidi-font-size: 10.0pt; mso-fareast-language: KO"><SPAN style="mso-spacerun: yes"> </SPAN></SPAN></FONT>Dr. Terry Murphy<BR>Yonsei University<BR>College of Liberal Arts<BR>Dept. of English Language and Literature<BR>Seoul 120--749<BR>Korea</P></DIV></BODY></HTML>