[Corpora-List] Off-list discussion (about "corpus syntax")
Rich Cooper
Rich at EnglishLogicKernel.com
Thu Sep 20 06:38:34 UTC 2007
I agree David,
The title John suggests: "Theories and Methodologies for Processing Large
Corpora" is a good one. Grammatical completeness was the title for just one
discussion, but there are a huge number of issues in linguistics that I
would like to see discussed by people with the kind of linguistic knowledge
that John and Yorick have.
JMHO,
Rich
-----Original Message-----
From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of
John F. Sowa
Sent: Wednesday, September 19, 2007 4:50 AM
To: David Brooks
Cc: CORPORA at UIB.NO
Subject: Re: [Corpora-List] Off-list discussion (about "corpus syntax")
David,
Thank you very much for setting up that discussion group,
but the title "grammatical incompleteness" is too narrow.
Related issues include
1. Semantic incompleteness.
2. Vagueness and precision.
3. Question answering without deep deduction.
4. Supervised and unsupervised learning methods.
5. Any methodology, formal or informal, that can process
large corpora (gigabyte or more) without relying on tags.
6. Any novel technologies, theories, or ideas for extracting
useful information from large corpora.
As a more general title, I would suggest something like
Theories and Methodologies for Processing Large Corpora
This would allow the current Corpora list to focus on short
announcements and brief answers to questions. Anything
devoted to issues about comparing different theories and
technologies would go to the new group.
Periodically, someone might send a note to Corpora list
with a brief summary of the ongoing threads. But all
discussions, such as the ones we've been having recently
would be moved to the new group.
John Sowa
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list