A Joint Model of Orthography and Morphological Segmentation


Type
Conference Object
Change log
Authors
Cotterell, Ryan 
Vieira, Tim 
Schütze, Hinrich 
Abstract

We present a model of morphological seg- mentation that jointly learns to segment and restore orthographic changes, e.g., funniest → fun-y-est. We term this form of analysis canon- ical segmentation and contrast it with the tra- ditional surface segmentation, which segments a surface form into a sequence of substrings, e.g., funniest → funn-i-est. We derive an im- portance sampling algorithm for approximate inference in the model and report experimental results on English, German and Indonesian.

Description
Keywords
Journal Title
Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Conference Name
Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Journal ISSN
Volume Title
Publisher
Association for Computational Linguistics
Rights
All rights reserved