Semantic Annotation for the Digital Humanities

– Using Markov Logic Networks for Annotation Consistency Control

Authors

  • Anette Frank Department of Computational Linguistics, Heidelberg University
  • Thomas Bögel Department of Computational Linguistics, Heidelberg University
  • Oliver Hellwig South Asia Institute, Heidelberg University
  • Nils Reiter Department of Computational Linguistics, Heidelberg University

DOI:

https://doi.org/10.33011/lilt.v7i.1275

Keywords:

treebank, annotation, digital humanities, semantic annotation, Markov logic

Abstract

This contribution investigates novel techniques for error detection in automatic semantic annotations, as an attempt to reconcile error-prone NLP processing with high quality standards required for empirical research in Digital Humanities. We demonstrate the state-of-the-art performance of semantic NLP systems on a corpus of ritual texts and report performance gains we obtain using domain adaptation techniques. Our main contribution is to explore new techniques for annotation consistency control, as an attempt to reconcile error-prone NLP processing with high quality requirements. The novelty of our approach lies in its attempt to leverage multi-level semantic annotations by defining interaction constraints between local word-level semantic annotations and global discourse-level annotations. These constraints are defined using Markov Logic Networks, a logical formalism for statistical relational inference that allows for violable constraints. We report first results.

Downloads

Published

2012-01-01

How to Cite

Frank, A., Bögel, T., Hellwig, O., & Reiter, N. (2012). Semantic Annotation for the Digital Humanities: – Using Markov Logic Networks for Annotation Consistency Control. Linguistic Issues in Language Technology, 7. https://doi.org/10.33011/lilt.v7i.1275