Parsed Corpus as a Source for Testing Generalizations in Japanese Syntax

  • Hideki Kishimoto Kobe University
  • Prashant Pardeshi National Institute for Japanese Language and Linguistics (NINJAL)

Abstract

In this paper, we discuss constituent ordering generalizations in Japanese. Japanese has SOV as its basic order, but a significant range of argument order variations brought about by ‘scrambling’ is permitted. Although scrambling does not induce much in the way of semantic effects, it is conceivable that marked orders are derived from the unmarked order under some pragmatic or other motivations. The difference in the effect of basic and derived order is not reflected in native speaker’s grammaticality judgments, but we suggest that the intuition about the ordering of arguments may be attested in corpus data. By using the Keyaki treebank (a proper subset of which is NINJAL Parsed Corpus of Modern Japanese (NPCMJ)), it is shown that the naturally-occurring corpus data confirm that marked orderings of arguments are less frequent than their unmarked ordering counterparts. We suggest some possible motivations lying behind the argument order variations.

Published
2019-07-01
How to Cite
Kishimoto, H., & Pardeshi, P. (2019). Parsed Corpus as a Source for Testing Generalizations in Japanese Syntax. Linguistic Issues in Language Technology, 18. https://doi.org/10.33011/lilt.v18i.1431