Polish Dependency Bank

Authors

  • Alina Wróblewska Institute of Computer Science, Polish Academy of Sciences

DOI:

https://doi.org/10.33011/lilt.v7i.1261

Keywords:

treebank, Polish, dependency trees, constituent trees

Abstract

The paper outlines the first Polish dependency bank derived from constituent trees. The conversion is a fully automatic and unambiguous process. The converter takes manually disambiguated constituent trees encoded in the XML format as input and produces dependency structures encoded in the column-based CoNLL format. The conversion is a relatively straightforward process, since constituents have their syntactic centers marked in most cases. However, a certain amount of reorganizing is necessary, in order to make the dependency structures meet annotation principles. The main part of the paper will be devoted to characteristics of Polish dependency types. The Polish dependency bank can be used for training or evaluation of Polish parsers.

Downloads

Published

2012-01-01

How to Cite

Wróblewska, A. (2012). Polish Dependency Bank. Linguistic Issues in Language Technology, 7. https://doi.org/10.33011/lilt.v7i.1261