[Corpora-List] For summarization community: JRouge tool released

Marina Litvak litvak.marina at gmail.com
Wed Jul 30 08:14:31 UTC 2014


The following announcement may be of interest for summarization community.

Released: JRouge - Java ROUGE Implementation:
https://bitbucket.org/nocgod/jrouge/wiki/Home.
JRouge implements the N-gram based ROUGE metrics in the Java language and
resolves multiple limitations of the original ROUGE toolkit. Namely:

   - JRouge uses Unicode regular expressions to match characters and
   numerals of any Unicode text, making JRouge work on a variety of languages.
   - JRouge calculates scores more precisely than the original ROUGE since
   it does not truncate and round the results to the fifth digit after the
   decimal point. Instead, JRouge uses the whole extent of the Double
   primitive type of Java to provide precise scores.
   - The JRouge implementation of the ROUGE metric is written in the Java
   language. It runs evaluations of summaries on the fly without writing them
   to the HDD and creating the configuration file. Such implementation reduces
   the number of I/O accesses and the tool’s run time significantly.
   - Since JRouge does not use a configuration file, and gets input
   summaries by their location (paths), it requires special naming conventions
   for the input files. These conventions are described in the JRouge
   documentation.
   - For clean texts (no annotations for sentence splitting) JRouge uses
   the Stanford NLP toolkit to split the sentences. Currently this step is
   redundant for n-grams computation in JRouge (JRouge, like an original ROUGE
   toolkit, glues sentences together and produces cross-sentence n-grams).
   However, sentence splitting is required for creating SEE format files, in
   order to provide system/model and configuration (settings.xml) files for
   the original ROUGE. This functionality is required if one needs to compare
   the scores of JRouge and ROUGE toolkits on the same input (see more about
   the utility services in the documentation).
   - JRouge API has an interface adapter for creating delegation of method
   calls [6] from the JRouge library to the data structures used by the
   software. For further information, please consult the JRouge documentation.
   - JRouge provides consistent results in different OSes (contrary to the
   original ROUGE where different results were produced for Windows
and Linux-based
   OSes).


For the details please visit JRouge homepage:
https://bitbucket.org/nocgod/jrouge/wiki/Home
For any questions/comments please contact Dr. Marina Litvak (email:
litvak.marina at gmail.com)

-- 
Best regards,
Marina Litvak.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20140730/d8747106/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list