See it in Search
This page is a preview of the following resource. Continue onto eagle-i search using the button on the right to see the full record.
Objective Methods for Predicting and Optimizing Synthetic Speech Quality (Synth. Speech Qual.)
eagle-i ID
http://ohsu.eagle-i.net/i/0000012c-3ce7-01df-cc1a-f59980000000
Resource Type
Properties
-
-
Resource Description
-
Software from this project addresses on how humans perceive acoustic discontinuities in speech. Current text-to-speech synthesis ("TTS") technology operates by retrieving intervals of stored digitized speech("units") from a database and splicing ("concatenating") them to form the output utterance. Unavoidably, there are acoustic discontinuities at the time points where the successive speech intervals meet. An unsolved problem is how to predict from the quantitative, acoustic properties of two to-be-concatenated units whether humans will hear a discontinuity. This is of immediate relevance for TTS systems that select units at run time from a large speech corpus. During selection, the systems search through the space of all possible sequences of units that can be used for the utterance and selects the sequence that has the lowest overall objective cost measure, such as the Euclidean distance between the final frame and initial frame of two units. However, research has already shown that this method and related methods do not predict well whether humans will hear a discontinuity. The current research, by being explicitly focused on perceptually optimized objective cost measures, will directly contribute to the perceptual accuracy of cost measures and hence to synthesis quality.
-
-
Used by
-
Center for Spoken Language Understanding
-
-
Website(s)
-
http://nsf.gov/awardsearch/showAward.do?AwardNumber=0313383
-
-
Website(s)
-
http://www.ohsu.edu/xd/education/schools/school-of-medicine/departments/basic-science-departments/biomedical-engineering/center-for-spoken-language-understanding/objective-methods-for-predicti.cfm?WT_rank=1
-
-
Developed by
-
Klabbers-Judd, Esther, Ph.D.
-
-
Developed by
-
van Santen, Jan P.H., Ph.D.
-
-
Software license
-
Open source software license