[Corpora-List] A tool for corpus management?

Normand Peladeau peladeau at simstat.com
Wed Aug 25 15:09:39 UTC 2010


If you don't mind paying for commercial software, I suggest you 
have a look at our text analysis suite, QDA Miner and WordStat.

QDA Miner is a document management and coding tool that allows one to 
tag and annotate documents, perform searches and statistics on tags 
(frequency, co-occurrences, comparisons, sequences). QDA Miner can 
import MS Word, WordPerfect, Rich Text, HTML, plain text, and most PDF 
files, as well as structured data files such as Excel, 
MS Access, SPSS, and many other database and spreadsheet formats, 
allowing you to combine text with metadata (numerical, categorical, 
dates, etc.). QDA Miner offers several text search tools, including 
Boolean searches, query by example, section retrieval and keyword 
searches. Tags may be automatically applied to retrieved text segments.

WordStat is an add-on that performs text analysis on QDA Miner 
project files. WordStat can compute word frequencies, extract 
phrases, apply text mining techniques to identify themes and 
patterns, and also supports automatic categorization and 
classification based on words, word patterns, phrases and proximity 
rules.  We will release version 6.1 of WordStat in a few days, along 
with a maintenance update of QDA Miner.  Both will offer major 
speed improvements over the currently available versions.  For 
example, WordStat 6 will analyse up to 15 million words per minute 
(a 50% speed increase).  WordStat 6 integrates several language 
dictionaries, two English thesauri and WordNet in order to support 
the dictionary building process (what others may call "taxonomy 
building"). Version 6.1 will introduce thesauri for French, 
Spanish, German and Portuguese.

You can get more information about the software from our web site at:

         http://www.provalisresearch.com

We also have Flash demos of both programs:

         http://www.provalisresearch.com/wordstat/WordStatFlashDemo.html
         http://www.provalisresearch.com/QDAMiner/Flash/DemoQDA.htm

Normand Peladeau
Provalis Research


At 8/25/2010 07:06 AM, Mahdi Mohseni wrote:
>Dear Colleagues,
>
>I need a tool for managing a corpus with the following capabilities:
>    * Adding text files to the corpus
>    * Editing files
>    * Annotating words
>    * Searching
>    * Reporting statistics of words and tags
>Would you please introduce me a suitable tool?
>
>Best,
>Mahdi Mohseni
>
>_______________________________________________
>Corpora mailing list
>Corpora at uib.no
>http://mailman.uib.no/listinfo/corpora