[Corpora-List] Fwd: Re: Distance & word context.
J Washtell
lec3jrw at leeds.ac.uk
Thu May 1 19:24:27 UTC 2008
John,
Thank you very much for your feedback.
I was referring to what one might call the linear "physical distance"
or "narrative distance" in corpora (as would correlate with the time
between terms occurring as a reader reads, if you like). Hence my
citing "distance-weighted context windows" as an example of one way in
which this is considered (also referred to as "ramped" windows etc).
As there doesn't seem to be a consistent established terminology - at
least none that I'm familiar with - Google is unsatisfactory by
itself. I'm definitely not asking about semantic distance.
What I ceratinly didn't make clear is that I'm particularly interested
in approaches that have been used to *mine* these relationships from
corpora, as well as general linguistic discussions concerning their
existence, rather than formal ways in which they can be expressed (as
per context-free grammar) - although I am certainly interested in the
models that have made such mining possible.
Do you have any more insights which I might find useful in light of
this? Perhaps something that you might expect to have fewer hits
(owing to our now hopefully increased precision)?
Many thanks!
Justin Washtell
University of Leeds
Quoting "John F. Sowa" <sowa at bestweb.net>:
> JW> I am interested in any works that deal with the concept of
>> distance as opposed to (or in addition to) say frequency counts
>> or roles/positions within grammatical constructs.
>
> There are many different notions of 'distance'. One example
> is the term "semantic distance", for which Google provides
> 26,700 hits.
>
>> does anybody know of any (perhaps more linguistically oriented)
>> works that discuss the existence/importance of *very* long range
>> dependencies and associations in text (e.g. Dear... Yours,
>> Results... Conclusion, etc), and the role these play when
>> considering word context.
>
> Some such connections can be defined by context-free rules:
>
> Letter -> Salutation Body ComplimentaryClose
>
> The two terms "salutation" and "complimentary close" give
> 16,400 hits on Google.
>
> John Sowa
----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.
----- End forwarded message -----
----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list