17.3080, Software: CLAIRLIB release
LINGUIST Network
linguist at LINGUISTLIST.ORG
Thu Oct 19 17:39:18 UTC 2006
LINGUIST List: Vol-17-3080. Thu Oct 19 2006. ISSN: 1068 - 4875.
Subject: 17.3080, Software: CLAIRLIB release
Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews: Laura Welcher, Rosetta Project / Long Now Foundation
<reviews at linguistlist.org>
Homepage: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.
Editor for this issue: Svetlana Aksenova <svetlana at linguistlist.org>
================================================================
To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.
===========================Directory==============================
1)
Date: 18-Oct-2006
From: Mark Joseph < mtjoseph at umich.edu >
Subject: CLAIRLIB release
-------------------------Message 1 ----------------------------------
Date: Thu, 19 Oct 2006 13:35:28
From: Mark Joseph < mtjoseph at umich.edu >
Subject: CLAIRLIB release
Clairlib, The Clair Library is now available
http://tangra.si.umich.edu/clair/clairlib
INTRODUCTION
The University of Michigan's CLAIR (Computational Linguistics And
Information Retrieval) group (http://tangra.si.umich.edu/clair) is happy to
present the second release of clairlib, the Clair library.
The Clair library is written in Perl and is intended to simplify a number
of generic tasks in Natural Language Processing (NLP) and Information
Retrieval (IR). Its architecture also allows for external software to be
plugged in with very little effort.
Clairlib features a tiered architecture with a core shared by all
applications and subject-specific libraries (currently in political science
and bioinformatics).
FUNCTIONALITY
Native: Tokenization, Summarization, LexRank, Biased LexRank, Document
Clustering, Document Indexing, PageRank, Biased Pagerank, Web Graph
Analysis, Bioinformatics Text Analysis, Political Science Text Analysis,
Network Building, Power Law Distribution Analysis, Network Analysis and
Computation (Watts-Strogatz Clustering Coefficient, Cosines, Random Walks),
Tf, Idf
Imported: Stemming, Sentence Segmentation, Web Page Download, Web Crawling,
XML Parsing, XML Tree Building, XML Writing
FUNDING
This work has been supported in part by grants R01 LM008106 'Representing
and Acquiring Knowledge of Genome Regulation' and U54 DA021519 'National
center for integrative bioinformatics', both from the National Institutes
of Health as well as grants IDM 0329043 'Probabilistic and link-based
Methods for Exploiting Very Large Textual Repositories' and DHB 0527513
'The Dynamics of Politcal Representation and Political Rhetoric,' both from
the National Science Foundation.
ABOUT
The Clair Library is developed by the Clair group at the University of
Michigan. It encompasses the functionality of MEAD and perltree, two of
CLAIR's earlier releases.
Project design: Dragomir R. Radev
Main implementers: Anthony Fader, Mark Hodges, and Dragomir R. Radev
Additional code by: Timothy Allison, Michael Dagitses, Aaron Elkiss, Gunes
Erkan, Scott Gifford, Mark Joseph, Samuela Pollack, and Adam Winkel
Linguistic Field(s): Computational Linguistics
-----------------------------------------------------------
LINGUIST List: Vol-17-3080
More information about the LINGUIST
mailing list