Detecting Semantic Ambiguity

Alternative Readings in Treebanks

Authors

  • Kristiina Muhonen, University of Helsinki
  • Tanja Purtonen, University of Helsinki

DOI:

https://doi.org/10.33011/lilt.v7i.1291

Keywords:

treebank, ambiguity detection, constraint grammar, Finnish

Abstract

In this article, we investigate ambiguity in syntactic annotation. The ambiguity in question is inherent: even human annotators interpret the meaning of the affected sentences differently. In our experiment, we detect potentially structurally ambiguous sentences with Constraint Grammar rules. In the linguistic phenomena we investigate, structural ambiguity is primarily caused by word order: the potentially ambiguous particle or adverbial is located between the main verb and the (participial) NP. After detecting the structures, we use the double-blind method to analyze how many of the potentially ambiguous cases are actually ambiguous. We rank the sentences captured by the rules on a scale from 1 to 5 to indicate which reading each annotator regards as the primary one. The results indicate that 67% of the sentences are ambiguous. Introducing ambiguity into the treebank/parsebank increases the informativeness of the representation, since both correct analyses are presented.
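
The word-order pattern described above can be illustrated with a minimal Python sketch. This is not the authors' Constraint Grammar rule set; it is a plain-Python stand-in that flags a particle or adverbial sitting directly between a verb and the first token of an NP. The tag names (`VERB`, `ADV`, `PART`, `PCP`, and so on), the function `find_candidates`, and the English example sentence are all assumptions made for illustration and do not come from the article.

```python
from typing import List, Tuple

# (word form, coarse tag); the tagset is hypothetical, not the one used in the article
Token = Tuple[str, str]

ATTACHABLE = {"ADV", "PART"}                      # particle or adverbial
NP_START = {"NOUN", "ADJ", "PCP", "DET", "PRON"}  # possible first token of a (participial) NP


def find_candidates(sentence: List[Token]) -> List[int]:
    """Return indices of particles/adverbials located directly between a verb and an NP."""
    hits = []
    # Skip the first and last token: a candidate needs a neighbour on both sides.
    for i in range(1, len(sentence) - 1):
        prev_tag = sentence[i - 1][1]
        tag = sentence[i][1]
        next_tag = sentence[i + 1][1]
        if prev_tag == "VERB" and tag in ATTACHABLE and next_tag in NP_START:
            hits.append(i)
    return hits


# Invented English example: "only" could attach to the verb or to the participial NP.
example = [("She", "PRON"), ("saw", "VERB"), ("only", "ADV"),
           ("sleeping", "PCP"), ("dogs", "NOUN")]
# Here the adverb precedes the verb, so the pattern does not apply.
no_match = [("She", "PRON"), ("only", "ADV"), ("saw", "VERB"), ("dogs", "NOUN")]

print(find_candidates(example))
print(find_candidates(no_match))
```

A sentence flagged this way is only potentially ambiguous; in the article, whether it is actually ambiguous is decided afterwards by the annotators' ratings.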

Published

2012-01-01

How to Cite

Muhonen, K., & Purtonen, T. (2012). Detecting Semantic Ambiguity: Alternative Readings in Treebanks. Linguistic Issues in Language Technology, 7. https://doi.org/10.33011/lilt.v7i.1291