<html><head></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">Im sure you can get something useful from word proximity computed over large corpora; to me the issues are what mechanisms best do that and what you can get out of it. theres a whole tradition of corpus based clustering leading to clumps or graphs, from Karen Sparck Jones thesis (1960s!) to the Pathfinder algorithms that gave nice weighted graphs *** in the 80s. I dont quite see whats new unless vectors and matrices add a lot to those simpler methods--until proved wrong Im prepared to bet they dont--it's just that people have forgotten them as they forget everything, and "matrix" sounds mathematically posher (though in fact Speck Jones did try to compute matrices but EDSAC2 was too small to cope at Cambridge in those days).<div>BUT, I just cant see that what youre describing leads to helpful paraphrases---listing out the "interpretations" of nearby words wont help will it.?</div><div>YW</div><div><br></div><div>***</div><div><span class="Apple-style-span" style="font-family: Georgia; font-size: small; ">McDonald, J. E., Plate, T. A., & Schvaneveldt, R. W. (1990). Using Pathfinder to extract semantic information from text. In R. Schvaneveldt (Ed.), <em>Pathfinder associative networks: Studies in knowledge organization</em>. (pp. 149-164). Norwood, NJ: Ablex.</span></div><div><br></div><div><div><blockquote type="cite"><div><br></div></blockquote></div></div><div><div>On 28 Jan 2011, at 14:59, Katrin Erk wrote:</div><br class="Apple-interchange-newline"><blockquote type="cite"><div><blockquote type="cite">OK but the problem is that you have to know what it is you are looking for closeness TO. Why would you seek relative proximity to<br></blockquote><blockquote type="cite">toy and food unless you already knew those were the words corresponding to or capturing the ambiguity--in other words you already have to know what the choices are (and in the case of "rubber chicken" it has both senses and proximity in a space cannot show ambiguity, can it?). Theres nothing about toys in any o the three components rubber/duck/chicken surely? Cohen and Margalit were asking one right question which is whether and how one could determine combination meaning from component-meanings--I dont see how the proximity analysis you cite can do that.<br></blockquote><br>Baroni and Zamparelli actually don't use predefined words (like toy)<br>to compare to, but determine the nearest neighbors of an expression in<br>space. Then you can interpret each word through its nearest neighbors.<br>So the interpretation of each expression is a list of paraphrases.<br><br>But I think you mean more than that by "determinining combination<br>meaning". Something along the lines of being able to list all<br>appropriate inferences to draw?<br><br>Assuming you mean that, my answer would be: Yes, in the end that's the<br>goal, but for now this goal is too large. For now we need goals that<br>work for the intermediate, smaller steps. And I think models like this<br>one are on the right track because they can use corpus data to predict<br>the meaning of words and expressions in context.<br><br>Katrin<br><br><br><blockquote type="cite">Y<br></blockquote><blockquote type="cite"><br></blockquote><blockquote type="cite">On 28 Jan 2011, at 14:41, Katrin Erk wrote:<br></blockquote><blockquote type="cite"><br></blockquote><blockquote type="cite"><blockquote type="cite">On Fri, Jan 28, 2011 at 1:32 PM, Yorick Wilks <<a href="mailto:Y.Wilks@dcs.shef.ac.uk">Y.Wilks@dcs.shef.ac.uk</a>> wrote:<br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite"><blockquote type="cite">Hmmm...not quite sure what "doing the right thing" for the rubber duck and chicken would be. Surely no method like this could provide a representation so what could it give at best?<br></blockquote></blockquote></blockquote><blockquote type="cite"><blockquote type="cite"><br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite">It does provide a representation, the question is just how to<br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite">interpret it and draw inferences from it. The most straightforward way<br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite">is by testing closeness of the representation of the adj+noun pair to<br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite">the representations of other expressions, say "toy" and "food". If the<br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite">model gets rubber duck and chicken right, then it should predict (by<br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite">measuring similarity/proximity in semantic space) that rubber duck is<br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite">closer to "toy" than "food", and the other way round for the rubber<br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite">chicken.<br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite"><br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite">Katrin<br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite"><br></blockquote></blockquote><blockquote type="cite"><blockquote type="cite"><blockquote type="cite"><br></blockquote></blockquote></blockquote><blockquote type="cite"><blockquote type="cite"><blockquote type="cite">On 28 Jan 2011, at 14:25, Katrin Erk wrote:<br></blockquote></blockquote></blockquote><blockquote type="cite"><blockquote type="cite"><blockquote type="cite"><br></blockquote></blockquote></blockquote><blockquote type="cite"><blockquote type="cite"><blockquote type="cite"><blockquote type="cite">Hi all,<br></blockquote></blockquote></blockquote></blockquote><blockquote type="cite"><blockquote type="cite"><blockquote type="cite"><blockquote type="cite"><br></blockquote></blockquote></blockquote></blockquote></div></blockquote></div><br></body></html>