Paper: Syllable weight encodes mostly the same information for English word segmentation as dictionary stress

ACL ID D14-1091
Title Syllable weight encodes mostly the same information for English word segmentation as dictionary stress
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2014
Authors

Stress is a useful cue for English word segmentation. A wide range of computational models have found that stress cues enable a 2-10% improvement in segmentation accuracy, depending on the kind of model, by using input that has been annotated with stress using a pronouncing dictionary. However, stress is neither invariably produced nor unambiguously identifiable in real speech. Heavy syllables, i.e. those with long vowels or syllable codas, attract stress in English. We devise Adaptor Grammar word segmentation models that exploit either stress, or syllable weight, or both, and evaluate the utility of syllable weight as a cue to word boundaries. Our results suggest that syllable weight encodes largely the same information for word segmentation in English that annotated...
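
The abstract's definition of a heavy syllable (one with a long vowel or a syllable coda) can be illustrated with a minimal sketch. This is not the paper's implementation: the ARPAbet-style phone symbols, the split into long and short vowels, and the function name `syllable_weight` are all assumptions made for illustration only.

```python
# Hypothetical ARPAbet-style vowel inventory (an assumption; the abstract does
# not specify the transcription scheme or which vowels count as long).
LONG_VOWELS = {"IY", "UW", "AA", "AO", "ER", "EY", "AY", "OW", "AW", "OY"}
SHORT_VOWELS = {"IH", "EH", "AE", "AH", "UH"}
VOWELS = LONG_VOWELS | SHORT_VOWELS


def syllable_weight(phones):
    """Classify one syllable, given as a list of phone symbols.

    Returns "H" (heavy) if the syllable has a long vowel or a coda,
    otherwise "L" (light), following the definition in the abstract.
    """
    # Locate the nucleus: the first vowel in the syllable.
    nucleus_idx = next((i for i, p in enumerate(phones) if p in VOWELS), None)
    if nucleus_idx is None:
        raise ValueError(f"no vowel found in syllable: {phones!r}")

    nucleus = phones[nucleus_idx]
    coda = phones[nucleus_idx + 1:]  # everything after the nucleus

    has_long_vowel = nucleus in LONG_VOWELS
    has_coda = len(coda) > 0
    return "H" if has_long_vowel or has_coda else "L"


if __name__ == "__main__":
    # "cat": short vowel plus coda consonant
    print(syllable_weight(["K", "AE", "T"]))
    # "see": long vowel, no coda
    print(syllable_weight(["S", "IY"]))
    # "the" (reduced): short vowel, no coda
    print(syllable_weight(["DH", "AH"]))
```

Under these assumptions, a weight label of this kind could stand in for a dictionary stress mark on each syllable of the segmentation input, which is the substitution the abstract evaluates.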