[Corpora-List] Succeed hackathon at UA

Marco BÜCHLER mbuechler at e-humanities.net
Tue Mar 25 14:28:14 UTC 2014




On the 10th and 11th of April 2014, the Succeed project will hold a
hackathon at the University of Alicante, whose aim is to look at
improving the state-of-the-art open-source tools for the digitisation of
textual content such as books and newspapers.

Over the two days, developers will work together in small groups to
discuss, roadmap and plan the future development of existing tools. Some
of the topics up for discussion are:

  * How to train the Tesseract
    <http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3> OCR
    engine.
  * Creation of XSLT stylesheets for format conversion, e.g. hOCR, PAGE,
    FRXML.
  * Debian package generation.

The hackathon provides a unique opportunity to meet developers involved
in digitisation projects all over Europe. Participation to the event is
FREE OF CHARGE, but please make sure you reserve
<https://www.eventbrite.com/e/2nd-succeed-dev-workshop-hackathon-tickets-10907317079> your
place. Participants are also encouraged to take a look at Last year's
hackathon's outcomes
<http://www.digitisation.eu/blog/1st-succeed-hackathon-kb/> and background
information
<http://succeed-project.eu/wiki/index.php/Developers_workshops_%28hackathons%29>.

-- 
Marco BÜCHLER
Georg-August-Universität Göttingen
Göttingen Centre for Digital Humanities (GCDH)
Papendiek 16
37073 Göttingen (Heynehaus) 

eMail    : mbuechler at e-humanities.net
Web      : http://www.gcdh.de/
Profil   : http://www.gcdh.de/en/people/team/marco-buechler/
Facebook : http://www.facebook.com/marco.buechler
LinkedIn : http://www.linkedin.com/profile/view?id=15098543&trk=tab_pro
Twitter  : https://twitter.com/mabuechler

l-h 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20140325/d3860fc5/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: image/png
Size: 12771 bytes
Desc: not available
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20140325/d3860fc5/attachment-0001.png>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list