Bootstrapping a Persian Dependency Treebank

Authors

  • Mojgan Seraji, Department of Linguistics and Philology, Uppsala University
  • Beáta Megyesi, Department of Linguistics and Philology, Uppsala University
  • Joakim Nivre, Department of Linguistics and Philology, Uppsala University

DOI:

https://doi.org/10.33011/lilt.v7i.1297

Keywords:

treebank, Persian, dependency trees, MaltParser

Abstract

This paper presents an ongoing project whose goal is to create a freely available dependency treebank for Persian. The data is taken from the Bijankhan corpus, which is already annotated for parts of speech, and a syntactic dependency annotation based on the Stanford Typed Dependencies is added through a bootstrapping procedure involving the open-source dependency parser MaltParser. We report preliminary parsing experiments with promising results after training the parser on a manually annotated seed data set of 215 sentences.
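
The bootstrapping procedure mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the individual steps, so the loop below is only one plausible reading: train on the annotated data so far, parse the next batch, have an annotator correct the output, add the corrected trees to the training data, and repeat. The function name bootstrap_treebank, the callables train, parse and correct, and the batch_size parameter are all hypothetical placeholders, not part of MaltParser or of the authors' setup.

```python
from typing import Any, Callable, List, Sequence


def bootstrap_treebank(
    seed: Sequence[Any],
    unannotated: Sequence[Any],
    train: Callable[[List[Any]], Any],
    parse: Callable[[Any, Any], Any],
    correct: Callable[[Any], Any],
    batch_size: int = 100,
) -> List[Any]:
    """Grow a treebank from a manually annotated seed set.

    All callables are hypothetical stand-ins (assumptions, not a real API):
      train(trees)       -> parser model trained on the current treebank
      parse(model, sent) -> automatically parsed dependency tree
      correct(tree)      -> tree after manual correction by an annotator
    """
    treebank = list(seed)
    for start in range(0, len(unannotated), batch_size):
        # Retrain on everything annotated so far, including corrected batches.
        model = train(treebank)
        batch = unannotated[start:start + batch_size]
        # Parse the next batch automatically, then correct it by hand.
        parsed = [parse(model, sentence) for sentence in batch]
        treebank.extend(correct(tree) for tree in parsed)
    return treebank
```

In such a setup the seed would correspond to the 215 manually annotated sentences and the unannotated input to POS-tagged sentences from the Bijankhan corpus; the batch size and the form of manual correction are assumptions made for illustration only.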

Published

2012-01-01

How to Cite

Seraji, M., Megyesi, B., & Nivre, J. (2012). Bootstrapping a Persian Dependency Treebank. Linguistic Issues in Language Technology, 7. https://doi.org/10.33011/lilt.v7i.1297