Where academic tradition
meets the exciting future

Extracting Contextualized Complex Biological Events with Rich Graph-Based Feature Sets

Jari Björne, Juho Heimonen, Filip Ginter, Antti Airola, Tapio Pahikkala, Tapio Salakoski, Extracting Contextualized Complex Biological Events with Rich Graph-Based Feature Sets. Computational Intelligence 27, 541–557, 2011.

Abstract:

We describe a system for extracting complex events among genes and proteins from biomedical literature, developed in context of the BioNLP’09 Shared Task on Event Extraction. For each event, the system extracts its text trigger, class, and arguments. In contrast to the approaches prevailing prior to the shared task, events can be arguments of other events, resulting in a nested structure that better captures the underlying biological statements. We divide the task into independent steps which we approach as machine learning problems. We define a wide array of features and in particular make extensive use of dependency parse graphs. A rule-based postprocessing step is used to refine the output in accordance with the restrictions of the extraction task. In the shared task evaluation, the system achieved an F-score of 51.95% on the primary task, the best performance among the participants. Currently, with modifications and improvements described in this article, the system achieves 52.86% F-score on Task 1, the primary task, improving on its original performance. In addition, we extend the system also to Tasks 2 and 3, gaining F-scores of 51.28% and 50.18%, respectively. The system thus addresses the BioNLP’09 Shared Task in its entirety and achieves the best performance on all three subtasks.

BibTeX entry:

@ARTICLE{jBjHeGiAiPaSaxxa,
  title = {Extracting Contextualized Complex Biological Events with Rich Graph-Based Feature Sets},
  author = {Björne, Jari and Heimonen, Juho and Ginter, Filip and Airola, Antti and Pahikkala, Tapio and Salakoski, Tapio},
  journal = {Computational Intelligence},
  volume = {27},
  pages = {541–557},
  year = {2011},
  keywords = {event extraction, BioNLP’09 Shared Task, machine learning, natural language processing, biomedical domain},
}

Belongs to TUCS Research Unit(s): Turku BioNLP Group

Publication Forum rating of this publication: level 1

Edit publication