Paper: Improving speech synthesis quality by reducing pitch peaks in the source recordings

ACL ID N13-1054
Title Improving speech synthesis quality by reducing pitch peaks in the source recordings
Venue Annual Conference of the North American Chapter of the Association for Computational Linguistics
Session Main Conference
Year 2013
Authors

We present a method for improving the perceived naturalness of corpus-based speech synthesizers. It consists in removing pronounced pitch peaks in the original recordings, which typically lead to noticeable discontinuities in the synthesized speech. We perceptually evaluated this method using two concatenative and two HMM-based synthesis systems, and found that using it on the source recordings managed to improve the naturalness of the synthesizers and had no effect on their intelligibility.