Critical Assessment of Information Extraction in Biology - data sets are available from Resources/Corpora and require registration.

Corpora

BioCreative III corpus (Resources) [2010-04-02]

  • IAT Task
  • The PubMed Central collection in XML format for the IAT task of BioCreative III is now available.

  • GN Task
  • The training data for the GN task of BioCreative III are now available.
    The evaluation scripts for the GN task of BioCreative III are now available (06/22/2010).
    The test data for the GN task of BioCreative III are now available (06/28/2010).
    The evaluation data (gold and silver standards) for the GN task are now available (09/29/2010).

  • PPI Task
  • The evaluation data (test set gold standard) for the Article Classification [Sub-]Task (ACT) of the PPI task are available.
    The evaluation data (test set gold standard) for the Method Interaction [Sub-]Task (IMT) of the PPI task are available.
    The test data for the Article Classification [Sub-]Task (ACT) of the PPI task are available.
    The test data for the Method Interaction [Sub-]Task (IMT) of the PPI task are available.
    The training data for the Method Interaction [Sub-]Task (IMT) of the PPI task are available.
    The development data for the Method Interaction [Sub-]Task (IMT) of the PPI task are available.
    The training data for the Article Classification [Sub-]Task (ACT) of the PPI task are available.
    The development data for the Article Classification [Sub-]Task (ACT) of the PPI task are available.
    IMS data from BioCreative II have been added as an additional resource for the Method Interaction [Sub-]Task (IMT) of the PPI task. The package includes 1051 annotated articles in HTML format, in addition to PDF, PubMed XML, and plain text.
    The evaluation library and command-line tool (from BioCreative II.5) for the PPI tasks can be found here.

Downloads