[Corpora-List] A tool for corpus management?
Normand Peladeau
peladeau at simstat.com
Wed Aug 25 15:09:39 UTC 2010
If you don't mind paying for commercial software, I suggest you
have a look at our suite of text analysis software, QDA Miner and WordStat.
QDA Miner is a document management and coding tool that lets you
tag and annotate documents, run searches, and compute statistics on
tags (frequency, co-occurrences, comparisons, sequences). QDA Miner can
import MS Word, WordPerfect, Rich Text, HTML, plain text, and PDF
files (most of them), as well as structured data files such as Excel,
MS Access, SPSS, and many other database and spreadsheet formats,
allowing you to combine text with metadata (numerical, categorical,
dates, etc.). QDA Miner offers several text search tools, including
Boolean searches, query by example, section retrieval and keyword
searches. Tags may be automatically applied to retrieved text segments.
WordStat is an add-on that performs text analysis on QDA Miner
project files. WordStat can compute word frequencies, extract
phrases, and apply text mining techniques to identify themes and
patterns. It also supports automatic categorization and
classification of words, word patterns, phrases and proximity
rules. In a few days we will release version 6.1 of WordStat and a
maintenance update of QDA Miner. Both programs will offer major
speed improvements over the currently available versions. For
example, WordStat 6 will analyse up to 15 million words per minute
(a 50% speed increase). WordStat 6 integrates several language
dictionaries, two English thesauri and WordNet in order to support
the dictionary building process (what others may call "taxonomy
building"). Version 6.1 will introduce thesauri for French,
Spanish, German and Portuguese.
You can get more information about the software from our web site at:
http://www.provalisresearch.com
We also have Flash demos of both programs:
http://www.provalisresearch.com/wordstat/WordStatFlashDemo.html
http://www.provalisresearch.com/QDAMiner/Flash/DemoQDA.htm
Normand Peladeau
Provalis Research
At 8/25/2010 07:06 AM, Mahdi Mohseni wrote:
>Dear Colleagues,
>
>I need a tool for managing a corpus with the following capabilities:
> * Adding text files to the corpus
> * Editing files
> * Annotating words
> * Searching
> * Reporting statistics of words and tags
>Would you please recommend a suitable tool?
>
>Best,
>Mahdi Mohseni
>
>_______________________________________________
>Corpora mailing list
>Corpora at uib.no
>http://mailman.uib.no/listinfo/corpora