[Corpora-List] Distance & word context.

Thierry Fontenelle thierryf at microsoft.com
Thu May 1 01:15:31 UTC 2008


Hi Justin,

You might be interested in the following paper:

Grefenstette, G. (1996) "Evaluation Techniques for Automatic Semantic Extraction: Comparing Syntactic and Window Based Approaches", in Boguraev and Pustejovsky (eds) Corpus Processing for Lexical Acquisition, The MIT Press, 205-216.

It's not "new" any more, but it seems to correspond to the opposition between distance and roles within grammatical constructs you are alluding to.

I hope it helps,

Thierry

Thierry Fontenelle
Microsoft Natural Language Group
Redmond, WA
thierryf at microsoft.com


-----Original Message-----
From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of J Washtell
Sent: Wednesday, April 30, 2008 2:50 PM
To: corpora at uib.no
Subject: [Corpora-List] Distance & word context.

Hello all,

This list is stimulating as always. I feel it is my turn to throw some
questions around :-)

Can anybody point me towards works (however old or new) that exploit
the distance between terms in a corpus (such as, but not restricted
to, the use of "distance-weighted" context windows). The specific
applications are not important; I am interested in any works that deal
with the concept of distance as opposed to (or in addition to) say
frequency counts or roles/positions within grammatical constructs.

Related to this, I am also interested in any work that courts the
notion of link-distance between words and texts within hypertext
structures (such as the Web); again, specific applications are
unimportant.

Finally (for the moment :-)), does anybody know of any (perhaps more
linguistically oriented) works that discuss the existence/importance
of *very* long range dependencies and associations in text (e.g.
Dear... Yours, Results... Conclusion, etc), and the role these play
when considering word context.

Kindest regards to all,

Justin Washtell
University of Leeds

----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.


_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list