[Corpora-List] New CLAIRLIB Release

mtjoseph at umich.edu mtjoseph at umich.edu
Thu Feb 8 21:57:08 UTC 2007


                     Clairlib, The Clair Library

                    version 0.953 is now available

              http://tangra.si.umich.edu/clair/clairlib


INTRODUCTION

The University of Michigan's CLAIR (Computational Linguistics And
Information Retrieval) group is happy to present the second release of
clairlib, the Clair library.

The Clair library is intended to simplify a number of generic tasks in
Natural Language Processing (NLP) and Information Retrieval (IR), with
additional applications to Bioinformatics and Political Science. Its
architecture also allows for external software to be plugged in with
very little effort.

Clairlib features a tiered architecture with a core shared by all
applications and subject-specific libraries (currently in political
science and bioinformatics).

FUNCTIONALITY

Native: Tokenization, Summarization, LexRank, Biased LexRank, Document
Clustering, Document Indexing, PageRank, Web Graph Analysis,
Bioinformatics Text Analysis, Political Science Text Analysis

Imported: Stemming, Sentence segmentation, Web page download, Web
crawling

DOWNLOAD

Write to radev at umich.edu to get a beta copy.

FUNDING

This work has been supported in part by grants R01 LM008106
"Representing and Acquiring Knowledge of Genome Regulation" and U54
DA021519 "National center for integrative bioinformatics", both from the
National Institutes of Health as well as grants IDM
0329043 "Probabilistic and link-based Methods for Exploiting Very
Large Textual Repositories" and DHB 0527513 "The Dynamics of Politcal
Representation and Political Rhetoric," both from the National Science
Foundation.

ABOUT

The Clair Library is developed by the Clair group at the University of
Michigan.

Project design: Dragomir Radev
Main implementers: Anthony Fader, Mark Hodges, and Dragomir Radev
Additional code by: Timothy Allison, Michael Dagitses, Aaron Elkiss,
Gunes Erkan, Scott Gifford, Mark Joseph, Samuela Pollack, and Adam
Winkel



More information about the Corpora mailing list