Syntax Annotation Guidelines for the Turku Dependency Treebank - 2nd edition, revised for the treebank release of July 2013

Katri Haverinen, Syntax Annotation Guidelines for the Turku Dependency Treebank - 2nd edition, revised for the treebank release of July 2013. TUCS Technical Reports 1034, TUCS, 2013.

Abstract:

This document describes the syntax annotation scheme of the Turku Dependency Treebank. The treebank is annotated using a modified version of the well-known Stanford Dependency (SD) scheme, which represents the syntax of a sentence as a tree of labeled, directed dependencies. The SD scheme was originally designed for English, and it has therefore been modified during the annotation process to accommodate the specific features of the Finnish language.

We first give a brief description of the original SD scheme and then describe the dependency types used in the Finnish-specific version. Next, we discuss the most important differences between the original and the Finnish-specific schemes, and finally, we give instructions for annotating specific phenomena of the Finnish language.

This document has been revised to reflect the annotation in the July 2013 release of the treebank, as described in the paper of Haverinen et al. [4]. Most importantly, the revisions describe the second annotation layer of the treebank and related changes, and add a few smaller clarifications.

Files:

Full publication in PDF format

BibTeX entry:

@TECHREPORT{tHaverinen_Katri13a,
  title = {Syntax Annotation Guidelines for the Turku Dependency Treebank - 2nd edition, revised for the treebank release of July 2013},
  author = {Haverinen, Katri},
  number = {1034},
  series = {TUCS Technical Reports},
  publisher = {TUCS},
  year = {2013},
  keywords = {syntax, parsing, treebanking, Finnish},
  ISBN = {978-952-12-2936-7},
}

Belongs to TUCS Research Unit(s): Turku BioNLP Group
