16.1364, Disc: Re: A Challenge to the Minimalist Community

LINGUIST Network linguist at linguistlist.org
Fri May 6 16:36:42 UTC 2005


Editor's note: issues 16.1364 to 16.1372 are being reposted due to
delivery problems we encountered last week.  We apologize for any
inconvenience.


LINGUIST List: Vol-16-1364. Fri Apr 29 2005. ISSN: 1068 - 4875.

Subject: 16.1364, Disc: Re: A Challenge to the Minimalist Community

Moderators: Anthony Aristar, Wayne State U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>

Reviews (reviews at linguistlist.org)
        Sheila Dooley, U of Arizona
        Terry Langendoen, U of Arizona

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.

Editor for this issue: Michael Appleby <michael at linguistlist.org>
================================================================

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.


===========================Directory==============================

1)
Date: 22-Apr-2005
From: Sean Fulop < sfulop at csufresno.edu >
Subject: Re: A Challenge to the Minimalist Community

2)
Date: 24-Apr-2005
From: Ash Asudeh < asudeh at csli.stanford.edu >
Subject: Re: A Challenge to the Minimalist Community

3)
Date: 25-Apr-2005
From: Stefan Müller < Stefan.Mueller at cl.uni-bremen.de >
Subject: Re: A Challenge to the Minimalist Community


-------------------------Message 1 ----------------------------------
Date: Fri, 29 Apr 2005 10:43:49
From: Sean Fulop < sfulop at csufresno.edu >
Subject: Re: A Challenge to the Minimalist Community


I believe the challenge offered by Sproat and Lappin, for someone to
build a successful Principles and Parameters parser that learns from
treebanks, is confused in several respects.  Let me point out some of
the ways the original challenge fails to take into account the goals
of theoretical linguistics, and how it misses an important distinction
between applied natural language processing and theoretical
computational linguistics.

First, let me say that I am not in favor of the Principles and
Parameters framework as a linguistic theory, nor am I in favor of
Minimalism (whatever that actually turns out to be).  I promote type-
logical categorial grammar as a framework for Universal Grammar, but
my goals are broadly the same as those of P&P proponents, and Sproat
and Lappin's challenge might equally be applied to type-logical
grammar.

There have been some spirited defenses of P&P posted already by
various proponents, and I apologize if what follows overlaps a little
with what has been said there.

P&P is an effort to describe and explain what the human language
faculty is, and what it does during learning.  Personally, I don't think
there is much hope for its approach, but that remains debatable, so
let's presume it's a good theory, meaning that some future refinement
of it will turn out to describe effectively, to every linguist's
satisfaction, the fundamental grammars of every language and how those
could be learned given the specific Universal Grammar that would also
be needed.

Now, who said anything about computational tractability?  This is a
complete theory, not an efficiency contest.  I think Ed Stabler's work
on GB and Minimalism over the years should be convincing enough that a
computational implementation is at least possible; it won't be
tractable, because the theory is bereft of performance limitations.
But we have to be happy with our theories when they correctly describe
the input-output relation of the human language acquisition process,
never mind the actual process.  It is hopeless to try to conform to
the "actual process" used by the mind; we have no way of knowing what
that is, or whether our particular computers are even up to the job.
Some cognitive scientists have even suggested that the brain is not
constrained by computability at all, since mathematical computability
is defined in reference to the computational procedures that we can
currently fathom.  Worrying about tractability in P&P is like
denigrating relativity theory because it makes it needlessly harder to
calculate artillery ballistics.  That's of course true, but that's
also why no one invokes such a complete theory of relative motion
simply to calculate artillery ballistics.

Sproat and Lappin note that learning from treebanks is supervised,
meaning the parser is trained on a subset of the right answers.  They
go on to reference work by Klein and Manning which induces grammars in
an "unsupervised" fashion from text.  Well, first of all, it is still
debatable whether anything can ever actually do this (see the
algorithmic learning theory literature, summarized in Jain et al.
1999), and Sproat and Lappin also note that Klein and Manning's scheme
uses part-of-speech tagged text, which is a far cry from raw text.
This is a huge annotation, and could be taken as a component of
Universal Grammar.  It is a component that is argued for in P&P, as
well.
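
To make the annotation contrast concrete, here is a minimal sketch
using NLTK and its Penn Treebank sample (my choice of toolkit for
illustration; none of the posts names one).  It shows the three kinds
of input at issue: full parse trees (supervised treebank training),
gold part-of-speech tag sequences (the input to Klein and Manning's
"unsupervised" induction), and genuinely raw text:

  # Requires: pip install nltk, then nltk.download('treebank').
  import nltk
  from nltk.corpus import treebank

  i = 0  # first sentence of the sample

  # Supervised input: a complete gold-standard parse tree.
  print(treebank.parsed_sents()[i])

  # Klein-and-Manning-style "unsupervised" input: no trees, but still
  # a gold part-of-speech tag for every word; a huge annotation.
  print([tag for _word, tag in treebank.tagged_sents()[i]])
  # e.g. ['NNP', 'NNP', ',', 'CD', 'NNS', ...]

  # Genuinely raw text, which neither setting actually learns from.
  print(' '.join(treebank.sents()[i]))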

I agree with Sproat and Lappin that P&P would be better off with an
attempt to implement its general learning scheme; an analysis of the
informational complexity of this problem is provided in Partha Niyogi's
book.  I disagree that this computational effort should be expected to
succeed in practice, since that would require tractability, and that is
too much to ask of a generally correct theory.  The effort should
instead be mathematically proven to succeed in theory, by defining
exactly what it would learn if you could wait long enough, and by
proving that it would terminate if you waited long enough.
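
For readers outside algorithmic learning theory, the success criterion
being invoked here is Gold-style identification in the limit, the
framework surveyed in Jain et al. (1999).  A standard statement of it
(my wording, not Fulop's) is:

  % Identification in the limit (Gold 1967), as surveyed in Jain et al. 1999.
  % A text for a language $L$ is an infinite sequence $t = s_1, s_2, \ldots$
  % whose range is exactly $L$.  A learner $\varphi$ maps each finite
  % initial segment $t[n] = s_1, \ldots, s_n$ to a grammar $\varphi(t[n])$.
  %
  % $\varphi$ identifies $L$ in the limit iff for every text $t$ for $L$
  % there is an $N$ such that for all $n \geq N$:
  \[
    \varphi(t[n]) = \varphi(t[N])
    \quad\text{and}\quad
    \mathcal{L}\bigl(\varphi(t[N])\bigr) = L,
  \]
  % i.e. the learner's conjectures converge to a single grammar that
  % generates exactly the target language.  A class of languages is
  % learnable iff some one learner identifies every language in the class.

This is "success in theory": it fixes what would be learned in the
limit while saying nothing about how long convergence takes.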

This is the sort of learnability result that I've obtained for type-
logical grammar, given a certain format for Universal Grammar and
certain assumed mathematical properties of all human languages.  What I
don't require is the parts of speech; we learn those, surely.  The
learning scheme is outlined in my book and in some recent papers (see
references).  Instead of the POS annotation, it requires annotation by
skeletal semantic structures, but there is nothing wrong with requiring
annotation.  Children certainly receive "annotated" sentences, in that
they get other clues to meaning when sentences are spoken.

While I support current efforts in statistical NLP, I don't believe
statistical NLP has any future as a linguistic theory.  It is a
computational methodology, inspired by the recognition that it would be
too hard or too slow to invoke the most complete form of linguistic
theory to perform basic NLP tasks.  As I've often told my comp ling
students: "if you've made software that works well, it probably doesn't
actually solve a theoretical problem, which is what makes it part of
applied NLP; if you've made software that implements a full theoretical
solution, it probably won't finish by the end of the day, which is what
makes it theoretical computational linguistics."  That doesn't mean the
twain shall never meet.  But not yet.

References:

Fulop, Sean A. (2005) "Semantic bootstrapping of type-logical
grammar," Journal of Logic, Language and Information 14: 49-86.

Fulop, Sean A. (2004) On the Logic and Learning of Language.
Victoria, Canada: Trafford.

Fulop, Sean A. (2003) "Discovering a new class of languages,"
Proceedings of Mathematics of Language 8, available online at the
MOL website.

Jain, Sanjay, Daniel Osherson, James S. Royer, and Arun Sharma
(1999) Systems That Learn: An Introduction to Learning Theory,
2nd ed. Cambridge, MA: MIT Press.

Niyogi, Partha (1998) The Informational Complexity of Learning.
Kluwer.

Stabler, Edward (1992) The Logical Approach to Syntax.
Cambridge, MA: MIT Press.


Linguistic Field(s): Computational Linguistics
                     Discipline of Linguistics
                     Syntax




-------------------------Message 2 ----------------------------------
Date: Fri, 29 Apr 2005 10:43:52
From: Ash Asudeh < asudeh at csli.stanford.edu >
Subject: Re: A Challenge to the Minimalist Community


Some confusion has arisen in the subsequent discussion of the
Sproat-Lappin challenge. Most of the subsequent posts discuss
statistical parsing versus P&P parsing. However, the challenge has
nothing to do with statistical parsers per se, although it's true that
much of the original challenge discussed such parsers. There are,
however, non-statistical and hybrid broad-coverage parsers in theories
other than P&P, such as the PARC LFG parser mentioned in the original
Sproat-Lappin challenge and the HPSG LinGO/LKB parser at Stanford. For
theorists who for whatever reason think that statistical parsers are a
different sort of thing from the P&P parser that the challenge focuses
on (they are not, for the reasons sketched by Sproat and Lappin
themselves and also by Ken Shan in LL16.1288), these parsers can
perhaps serve as a reference point instead.

There is also, in my opinion, a confusing aspect to the Sproat-Lappin
challenge. They are not only asking for a P&P parser, but also for a
large P&P grammar. Grammars and parsers are different things, at least
for non-statistical "deep grammar" approaches (purely statistical
methods instead induce the grammar), such as the HPSG and LFG ones
noted above and, presumably, the P&P one in question. For example, the
PARC LFG parser has been applied to analyses of a large number of
languages, although the English grammar is what Sproat and Lappin seem
to have in mind in their message, given the citation of Riezler et al.
Another example is the LKB system, which implements an HPSG parser.
The English Resource Grammar (ERG) is used with this parser, but they
are developed separately (although with strong interactions; the same
goes for the PARC XLE parser and the ParGram grammars).
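
The separation described here can be pictured in miniature: one
declarative grammar object handed to two independent parsing engines,
just as the ERG can be loaded into the LKB, or a ParGram grammar into
XLE. The sketch below uses a toy CFG and NLTK (both my own
illustrative choices, not anything from the PARC or LinGO systems):

  # One grammar, two parsers: a toy sketch of the grammar/parser split.
  import nltk

  grammar = nltk.CFG.fromstring("""
      S -> NP VP
      NP -> 'Kim' | 'Sandy'
      VP -> V NP
      V -> 'sees'
  """)

  # The same grammar object drives two different parsing algorithms.
  chart = nltk.ChartParser(grammar)             # dynamic programming
  rdesc = nltk.RecursiveDescentParser(grammar)  # top-down backtracking

  tokens = "Kim sees Sandy".split()
  print(next(iter(chart.parse(tokens))))  # (S (NP Kim) (VP (V sees) (NP Sandy)))
  print(next(iter(rdesc.parse(tokens))))  # the same tree from a different engine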

Carson Schütze (LL16.1288) wrote:

> In addition to capturing the distinction between learnable and
> unlearnable languages, P&P has as an important goal capturing
> the distinction between well-formed (grammatical) and ill-formed
> (ungrammatical) sentences within a language. As I understand
> it, the challenge demands only correct parsing of grammatical
> sentences, not correct rejection of ungrammatical ones. This
> represents another case where the P&P system, by virtue of
> the goals of the theory, is being subjected to greater demands
> than the statistical parsers.

I don't understand the substance of this objection. All grammars,
those used in statistical parsing or otherwise, attempt to reject
ungrammatical sentences: nobody wants their grammar or parser to
overgenerate. Even if the claim is true of statistical parsers (I
don't think it is), it certainly isn't true of the LFG and HPSG
parsers and grammars noted above.
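
To make the accept/reject point concrete: any symbolic grammar draws
this line by construction, since a string is ungrammatical for it
exactly when no parse exists. A minimal sketch with a toy CFG in NLTK
(my illustration; the large LFG and HPSG grammars do the same thing at
vastly greater scale):

  # A grammar rejects a string by assigning it no parse at all.
  import nltk

  grammar = nltk.CFG.fromstring("""
      S -> NP VP
      NP -> Det N
      VP -> V NP
      Det -> 'the'
      N -> 'dog' | 'cat'
      V -> 'chased'
  """)
  parser = nltk.ChartParser(grammar)

  def is_grammatical(sentence):
      """True iff the grammar assigns the string at least one parse."""
      return any(True for _ in parser.parse(sentence.split()))

  print(is_grammatical("the dog chased the cat"))  # True: accepted
  print(is_grammatical("dog the the cat chased"))  # False: rejected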

Peter Hallman (LL16.1251) and Martha McGinnis (also LL16.1251),
although positively inclined toward the challenge, also raise
objections. The substance of the objections is that P&P is attempting
to do much more than just parse sentences (Hallman) and that the goals
of P&P are different from those of computational linguistics
(McGinnis). I think there is merit to both these statements, but they
are ultimately non sequiturs with respect to the challenge. P&P is not
just concerned with how the adult grammar results from the initial
state (although this is a principal goal); it is also concerned with
the state of the final grammar (Chomsky, 1986: "What constitutes
knowledge of language?"). The requirement of capturing the adult
grammar also means that it is immaterial whether the goals of P&P are
those of computational linguistics: P&P is still expected to capture
adult grammatical competence in the end, even if this isn't a
*motivation* for many of its practitioners.

Lastly, I agree with Ken Shan (LL16.1288):
"All other things being equal, poor (or unknown) parsing performance
indicates failure at (resp. disinterest in) answering Q [What is a
possible human language?]."

It is no surprise that the objections to the challenge so far have
sought to argue that all other things are not equal. Like Ken, however,
I think that this challenge is a *good* thing for P&P, and I too am
optimistic about P&P. In any case, the attempt to meet the challenge
will without doubt reap huge rewards, not just for computational
linguistics, but especially for P&P theory.

Ash Asudeh


Linguistic Field(s): Computational Linguistics
                     Discipline of Linguistics
                     Syntax







-------------------------Message 3 ----------------------------------
Date: Fri, 29 Apr 2005 10:43:54
From: Stefan Müller < Stefan.Mueller at cl.uni-bremen.de >
Subject: Re: A Challenge to the Minimalist Community


Carson Schütze wrote:
> So, I would like to suggest a revised version of the challenge that
> incorporates a second corpus consisting of ungrammatical
> sentences that are to be identified as such. (Earlier P&P parsers
> such as Fong's were designed to do this, but it's not obvious that
> this ability will easily scale up with broader coverage, so I don't
> think this is a sucker's bet.) Furthermore, since the
> computationalists got to choose the corpus of good sentences,
> it would seem only fair that the theoreticians get to choose the
> corpus of bad sentences :-)

This is a very important point: negative data have indeed been
collected and are used to evaluate deep linguistic processing systems.

A nice software package for evaluating systems and working with test
suite databases can be found at:

http://www.delph-in.net/itsdb/

Test suites for German, English, French, Spanish, and other
languages are also available there.

You may find test suites for German at:

http://www.cl.uni-bremen.de/Software/TS/

These test suites contain (normalized) examples from the descriptive
literature and from the P&P, HPSG, and other theoretical literature.
With the [incr tsdb()] software it is possible to get a selection of
the sentences that are relevant for a certain phenomenon, since
sentences are cross-classified according to the phenomena they bear on.
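
This cross-classification is easy to picture as a small relational
schema. The sketch below is hypothetical (Python with SQLite; the
tables and names are invented for illustration and are NOT the actual
[incr tsdb()] layout), showing how one might select every test item
relevant to a given phenomenon, together with its grammaticality
judgment:

  # Hypothetical phenomenon-cross-classified test suite; not the real
  # [incr tsdb()] schema.
  import sqlite3

  db = sqlite3.connect(":memory:")
  db.executescript("""
      CREATE TABLE item (item_id INTEGER PRIMARY KEY, sentence TEXT,
                         grammatical INTEGER);  -- 1 = well-formed, 0 = ill-formed
      CREATE TABLE phenomenon (phen_id INTEGER PRIMARY KEY, name TEXT);
      CREATE TABLE item_phenomenon (item_id INTEGER, phen_id INTEGER);

      INSERT INTO item VALUES
          (1, 'Dass Peter schlaeft, weiss ich.', 1),
          (2, '*Dass schlaeft Peter, weiss ich.', 0);
      INSERT INTO phenomenon VALUES (1, 'verb-final clauses');
      INSERT INTO item_phenomenon VALUES (1, 1), (2, 1);
  """)

  def items_for(phenomenon):
      """All test items classified under the given phenomenon."""
      return db.execute("""
          SELECT i.sentence, i.grammatical
          FROM item i
          JOIN item_phenomenon ip ON ip.item_id = i.item_id
          JOIN phenomenon p ON p.phen_id = ip.phen_id
          WHERE p.name = ?""", (phenomenon,)).fetchall()

  for sentence, ok in items_for('verb-final clauses'):
      print('OK ' if ok else 'BAD', sentence)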

The idea is to develop these collections further into a generally
accepted benchmark for linguistic theories in general and for deep
linguistic processing in particular. Of course, the ill-formed
sentences can also be used to check what statistical parsers have to
say about them in comparison with the well-formed examples.

So if you have a look at the German collection and want to contribute,
please send me the relevant examples, along with pointers to the
publications in which the examples are discussed.

Best wishes

Stefan Müller
Universität Bremen/Fachbereich 10

http://www.cl.uni-bremen.de/~stefan/
http://www.cl.uni-bremen.de/~stefan/Babel/Interaktiv/


Linguistic Field(s): Computational Linguistics
                     Discipline of Linguistics
                     Linguistic Theories

Subject Language(s): German, Standard (GER)








-----------------------------------------------------------
LINGUIST List: Vol-16-1364





