21.2874, Books: Computational Linguistics/Lexicography: Westerhout

linguist at LINGUISTLIST.ORG linguist at LINGUISTLIST.ORG
Sun Jul 11 01:47:49 UTC 2010


LINGUIST List: Vol-21-2874. Sat Jul 10 2010. ISSN: 1068 - 4875.

Subject: 21.2874, Books: Computational Linguistics/Lexicography: Westerhout

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
 
Reviews: Monica Macaulay, U of Wisconsin-Madison  
Eric Raimy, U of Wisconsin-Madison  
Joseph Salmons, U of Wisconsin-Madison  
Anja Wanner, U of Wisconsin-Madison  
       <reviews at linguistlist.org> 

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, 
and donations from subscribers and publishers.

Editor for this issue: Maria Moreno-Rollins <maria at linguistlist.org>
================================================================  

Links to the websites of all LINGUIST's supporting publishers
are available at the end of this issue. 

===========================Directory==============================  

1)
Date: 15-Jun-2010
From: Mariëtte Bonenkamp < lot at uu.nl >
Subject: Definition Extraction for Glossary Creation: Westerhout
 

	
-------------------------Message 1 ---------------------------------- 
Date: Sat, 10 Jul 2010 21:46:45
From: Mariëtte Bonenkamp [lot at uu.nl]
Subject: Definition Extraction for Glossary Creation: Westerhout

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=21-2874.html&submissionid=2638260&topicid=2&msgnumber=1
  



Title: Definition Extraction for Glossary Creation 
Subtitle: A study on extracting definitions for semi-automatic glossary creation in
Dutch
 
Series Title: LOT Dissertation Series  

Publication Year: 2010 
Publisher: Netherlands Graduate School of Linguistics / Landelijke - LOT
	   http://www.lotpublications.nl/
	
Author: Eline Westerhout

Paperback: ISBN:  9789460930348 Pages:  Price: U.K. £ 24.86


Abstract:

The central topic of this thesis is the automatic extraction of definitions 
from text. Definition extraction can play a role in various applications
including the semi-automatic development of glossaries in an eLearning
context, which constitutes the main focus of this dissertation. A glossary
provides definitions
for the most important terms that are discussed in a text. The
semi-automatic extraction approach presented in this study consists of two
phases. As a first step, a method entirely based on lexico-syntactic
patterns has been used to distinguish between definitions and
non-definitions. A corpus consisting of 600 definitions has been employed
to identify recurrent definition patterns. Since many of these patterns are
not unique to definitions, a second step was employed to reduce the number
of non-definitions identified. It has been investigated whether other
textual characteristics can contribute to the correct classification of
definitions, in addition to the lexico-syntactic patterns. The properties
that have been examined vary from the importance of the defined word
(phrase) within a text to the layout of the definition. Machine learning
techniques have been employed to identify which are the most relevant
(combinations of) definition properties. The results of this dissertation
are relevant for researchers in linguistics and lexicography as well as for
the development of language technology applications. 



Linguistic Field(s): Computational Linguistics
                     Lexicography

Subject Language(s): Dutch (nld)


Written In: English  (eng)
	
See this book announcement on our website: 
http://linguistlist.org/pubs/books/get-book.cfm?BookID=48869


MAJOR SUPPORTERS

	Brill          
		http://www.brill.nl	

	Cambridge Scholars Publishing          
		http://www.c-s-p.org	

	Cambridge University Press          
		http://us.cambridge.org	

	Cascadilla Press          
		http://www.cascadilla.com/	

	Continuum International Publishing Group Ltd          
		http://www.continuumbooks.com	

	De Gruyter Mouton          
		http://www.degruyter.com/mouton	

	Edinburgh University Press          
		http://www.eup.ed.ac.uk/	

	Elsevier Ltd          
		http://www.elsevier.com/linguistics	

	Emerald Group Publishing Limited          
		http://www.emeraldinsight.com/	

	European Language Resources Association - ELRA          
		http://www.elra.info.	

	Georgetown University Press          
		http://www.press.georgetown.edu	

	John Benjamins          
		http://www.benjamins.com/	

	Lincom GmbH          
		http://www.lincom.eu	

	MIT Press          
		http://mitpress.mit.edu/	

	Multilingual Matters          
		http://www.multilingual-matters.com/	

	Narr Francke Attempto Verlag GmbH + Co. KG          
		http://www.narr.de/	

	Oxford University Press          
		http://www.oup.com/us	

	Palgrave Macmillan          
		http://www.palgrave.com	

	Peter Lang AG          
		http://www.peterlang.com	

	Rodopi          
		http://www.rodopi.nl/	

	Routledge (Taylor and Francis)          
		http://www.routledge.com/	

	Springer          
		http://www.springer.com	

	University of Toronto Press          
		http://www.utpjournals.com/	

	Wiley-Blackwell          
		http://www.wiley.com	

OTHER SUPPORTING PUBLISHERS	

	Association of Editors of the Journal of Portuguese Linguistics
		http://www.fl.ul.pt/revistas/JPL/JPLweb.htm 

	Graduate Linguistic Students' Association, Umass
		http://glsa.hypermart.net/ 

	International Pragmatics Assoc.
		http://www.ipra.be 

	Langues et Linguistique
		http://y.ennaji.free.fr/fr/ 

	Linguistic Association of Finland
		http://www.ling.helsinki.fi/sky/ 

	Netherlands Graduate School of Linguistics / Landelijke - LOT
		http://www.lotpublications.nl/ 

	Pacific Linguistics
		http://pacling.anu.edu.au/ 

	SIL International
		http://www.ethnologue.com/bookstore.asp 

	St. Jerome Publishing Ltd
		http://www.stjerome.co.uk 

	Utrecht institute of Linguistics
		http://www-uilots.let.uu.nl/ 
	



-----------------------------------------------------------
LINGUIST List: Vol-21-2874	

	



More information about the LINGUIST mailing list