Improving Low-Resource Morphological Learning with Intermediate Forms from Finite State Transducers

Sarah Moeller; Ghazaleh Kazeminejad; Andrew Cowell; Mans Hulden

doi:10.33011/computel.v1i.427

Authors

Sarah Moeller University of Colorado
Ghazaleh Kazeminejad University of Colorado
Andrew Cowell University of Colorado
Mans Hulden University of Colorado

DOI:

https://doi.org/10.33011/computel.v1i.427

Abstract

Neural encoder-decoder models are usually applied to morphology learning as an end-to-end process without considering the underlying phonological representations that linguists posit as abstract forms before morphophonological rules are applied. Finite State Transducers for morphology, on the other hand, are developed to contain these underlying forms as an intermediate representation. This paper shows that training a bidirectional two-step encoder-decoder model of Arapaho verbs to learn two separate mappings between tags and abstract morphemes and morphemes and surface allomorphs improves results when training data is limited to 10,000 to 30,000 examples of inflected word forms.

Improving Low-Resource Morphological Learning with Intermediate Forms from Finite State Transducers

Authors

DOI:

Abstract

Downloads

Published

Issue

Section