
Syntax Annotation Guidelines for the Turku Dependency Treebank

Katri Haverinen, Syntax Annotation Guidelines for the Turku Dependency Treebank. TUCS Technical Reports 1034, Turku Centre for Computer Science, 2012.

Abstract:

This document describes the syntax annotation scheme of the Turku Dependency Treebank. The treebank is annotated using a modified version of the well-known Stanford Dependency (SD) scheme, which represents the syntax of a sentence as a tree of labeled, directed dependencies. The SD scheme was originally designed for English, so it was modified during the annotation process to accommodate the specific features of the Finnish language.

We first give a brief description of the original SD scheme and then describe the dependency types used in the Finnish-specific version. Next, we discuss the most important differences between the original and the Finnish-specific schemes, and finally, we give instructions for annotating specific phenomena in the Finnish language.

Files:

Full publication in PDF format

BibTeX entry:

@TECHREPORT{tHaverinen12a,
  title = {Syntax Annotation Guidelines for the Turku Dependency Treebank},
  author = {Haverinen, Katri},
  number = {1034},
  series = {TUCS Technical Reports},
  publisher = {Turku Centre for Computer Science},
  year = {2012},
  keywords = {syntax, parsing, treebanking, Finnish},
  ISBN = {978-952-12-2708-0},
}

Belongs to TUCS Research Unit(s): Turku BioNLP Group
