[Corpora-List] CALBC challenge II [reminder]

Kerstin Hornbostel kerstin.hornbostel at uni-jena.de
Tue Nov 30 16:55:18 UTC 2010


(sorry for cross posting)

Dear corpora list members,

We would like to remind you that the CALBC challenge II (www.calbc.eu) is
still open for participation. You can register to the challenge by sending a
request to public at calbc.eu.

Participants will be contributing to generation of a large-scale corpus. 
In the previous phases of the project the documents have been annotated with
4 different semantic types (Proteins/genes, diseases/disorders, species,
chemicals).

The CALBC challenge II addresses two tasks:
Task (A) Named entitiy recognition
Task (B) Concept Identification
Participants can contribution any/both of them.

In the challenge scope, we will be seeking for two types of contributions
from participants:
(1) Application of their own named entity recognition solutions to the two
available test datasets (small set - 175k Medline abstracts (mandatory),
large set - about 700k Medline abstracts (optional) )
(2) Usage of the released training dataset (SSC-II- 75k Medline
abstracts) for training a classifier and then applying the model to the two
available test datasets.

Your annotated entities have to be marked up in XML according to the format
described in the challenge guideline (http://guideline.calbc.eu). 
Your submissions will be analysed against a Silver Standard Corpus (SSC).

In the current phase of the project, we harmonise and finalize concept
identifiers that have been provided from earlier submissions. 
Contributions to the entity normalization task will be evaluated against
normalization solution of SSC.

The CALBC project team is looking forward for your contribution.

Best wishes
Dietrich & project partners

--

Dietrich Rebholz-Schuhmann, MD, PhD - Research Group Leader EBI, Wellcome
Trust Genome Campus, Hinxton CB10 1SD (UK)



_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list