[Corpora-List] describing languages as [link]-[sem]-[morphsyn] tripplets...
Rich Cooper
rich at englishlogickernel.com
Sun Jan 9 21:01:28 UTC 2011
Hi Robert,
Thanks for the info,
I am looking for small vocabularies that could conceivably be interpreted by
software custom written to do so. The choice of vocabulary I was looking
for was, initially smaller than 2,000 although that is a larger number I may
have to accept.
The requirement for proper interpretation is that each of the N words be
defined in terms of some others, or said word be individually interpretable,
in a way that fits the quirks and angles of the English language as used by
a young person, with little world knowledge other than what has been
directly experienced. Things that are embedded into common English, such as
verbs about liquids, containers, designation phrases, etc.
Wierzbicka's primitive set of 60 or so would be ideally small and easy to
build, but is probably not expressive enough to serve as an effective kernel
vocabulary. Masterman's reported 100 concepts, with 15,000 compositions of
the 100 expressed, along with the specific relations used to represent those
compositions with further concepts, might be a core, if not a full kernel
vocabulary.
Is there a vocabulary with those characteristics that springs to mind? Any
suggestions appreciated
Thanks,
-Rich
Sincerely,
Rich Cooper
EnglishLogicKernel.com
Rich AT EnglishLogicKernel DOT com
9 4 9 \ 5 2 5 - 5 7 1 2
_____
From: Robert Parks [mailto:rqparks at gmail.com]
Sent: Sunday, January 09, 2011 12:39 PM
To: Rich Cooper; corpora at uib.no
Cc: 'David Eddy'
Subject: Re: [Corpora-List] describing languages as [link]-[sem]-[morphsyn]
tripplets...
If you're looking for small vocabularies, there are several associated with
ESL dictionaries, such as Longman's and Macmillans. Both have approximately
2000 words, and Wordsmyth also has a 2000 word vocabulary used in its
Beginner's Dictionary. (http://www.wordsmyth.net) We're also using a 1500
word vocabulary in our 1st reader's dictionary for K-2 school children.
Bob
At 12:05 PM -0800 1/9/11, Rich Cooper wrote:
Hi John and David,
This is quote from John's web site link below:
Margaret Masterman's system at Cambridge University (1961) was the first to
be called a semantic network. She developed a list of 100 primitive concept
types, such as Folk, Stuff, Thing, Do, and Be. In terms of those primitives,
her group defined a conceptual dictionary of 15,000 entries. She organized
the concept types into a lattice, which permits inheritance from multiple
supertypes. The basic principles and even many of the primitive concepts
have survived in more recent systems of preference semantics (Wilks & Fass
1992).
John, is there somewhere I can find the 100 concept types and the 15,000
entries? If they have been time tested as a core of concepts, they should
have value in refinement.
David has been interested in small vocabularies for companies, and may also
be interested in the Masterman concept types.
Wierzbicka's primitive concepts number even less than 100. I wonder if
there is much overlap between the two vocabularies?
-Rich
Sincerely,
Rich Cooper
EnglishLogicKernel.com
Rich AT EnglishLogicKernel DOT com
9 4 9 \ 5 2 5 - 5 7 1 2
-----Original Message-----
From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of
sowa at bestweb.net
Sent: Sunday, January 09, 2011 8:39 AM
To: corpora at uib.no
Subject: Re: [Corpora-List] describing languages as [link]-[sem]-[morphsyn]
tripplets...
> Do you know of these types of language analysis (based on
"triplet"
> formants) as I have explained, or anything
similar to it?
When triplets are joined together on their
common labels, they form
a directed labeled graph. Such graphs have
been used to represent
semantics for centuries. For a brief summary
of the many varieties,
see my article on Semantic Networks in the
Encyclopedia of AI:
http://www.jfsowa.com/pubs/semnet.htm
John Sowa
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
--
* The best dictionary and integrated thesaurus on the web:
http://www.wordsmyth.net
* 119 S. Cayuga Street, Ithaca, NY - (607) 272-2190
* "To imagine a language is to imagine a form of life." "Whereof one cannot
speak, thereof one must remain silent". (LW) "Philosophers have only
interpreted the world. The point, however, is to change it." (KM) "Without
knowing the force of words, it is impossible to know men." (Confucius)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20110109/be0c6e2b/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list