Polish Dependency Bank

Alina Wróblewska

doi:10.33011/lilt.v7i.1261

Polish Dependency Bank

Authors

Alina Wróblewska Institute of Computer Science, Polish Academy of Sciences

DOI:

https://doi.org/10.33011/lilt.v7i.1261

Keywords:

treebank, Polish, dependency trees, constituent trees

Abstract

The paper outlines the first Polish dependency bank derived from constituent trees. The conversion is a fully automatic and unambiguous process. The converter takes manually disambiguated constituent trees encoded in the XML format as input and produces dependency structures encoded in the column-based CoNLL format. The conversion is a relatively straightforward process, since constituents have their syntactic centers marked in most cases. However, a certain amount of reorganizing is necessary, in order to make the dependency structures meet annotation principles. The main part of the paper will be devoted to characteristics of Polish dependency types. The Polish dependency bank can be used for training or evaluation of Polish parsers.

Downloads

Published

2012-01-01

How to Cite

Wróblewska, A. (2012). Polish Dependency Bank. Linguistic Issues in Language Technology, 7. https://doi.org/10.33011/lilt.v7i.1261

Download Citation

Issue

Vol. 7 (2012): Treebanks and Linguistic Theory

Section

Articles

License

This work is licensed under CC BY 4.0, which permits you to use, share, adapt, distribute, and reproduce it in any medium or format, provided you credit the original author(s) and source.

Polish Dependency Bank

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Information