[Corpora-List] Fwd: Re: Distance & word context.

J Washtell lec3jrw at leeds.ac.uk
Thu May 1 19:24:27 UTC 2008


John,

Thank you very much for your feedback.

I was referring to what one might call the linear "physical distance"  
or "narrative distance" in corpora (as would correlate with the time  
between terms occurring as a reader reads, if you like). Hence my  
citing "distance-weighted context windows" as an example of one way in  
which this is considered (also referred to as "ramped" windows etc).  
As there doesn't seem to be a consistent established terminology - at  
least none that I'm familiar with - Google is unsatisfactory by  
itself. I'm definitely not asking about semantic distance.

What I ceratinly didn't make clear is that I'm particularly interested  
in approaches that have been used to *mine* these relationships from  
corpora, as well as general linguistic discussions concerning their  
existence, rather than formal ways in which they can be expressed (as  
per context-free grammar) - although I am certainly interested in the  
models that have made such mining possible.

Do you have any more insights which I might find useful in light of  
this? Perhaps something that you might expect to have fewer hits  
(owing to our now hopefully increased precision)?

Many thanks!

Justin Washtell
University of Leeds


Quoting "John F. Sowa" <sowa at bestweb.net>:

> JW> I am interested in any works that deal with the concept of
>> distance as opposed to (or in addition to) say frequency counts
>> or roles/positions within grammatical constructs.
>
> There are many different notions of 'distance'.  One example
> is the term "semantic distance", for which Google provides
> 26,700 hits.
>
>> does anybody know of any (perhaps more linguistically oriented)
>> works that discuss the existence/importance of *very* long range
>> dependencies and associations in text (e.g. Dear... Yours,
>> Results... Conclusion, etc), and the role these play when
>> considering word context.
>
> Some such connections can be defined by context-free rules:
>
>    Letter  ->  Salutation  Body  ComplimentaryClose
>
> The two terms "salutation" and "complimentary close" give
> 16,400 hits on Google.
>
> John Sowa



----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.



----- End forwarded message -----


----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.


_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list