This page is a preview of the following resource. Continue onto eagle-i search using the button on the right to see the full record.

Objective Methods for Predicting and Optimizing Synthetic Speech Quality (Synth. Speech Qual.)

eagle-i ID

http://ohsu.eagle-i.net/i/0000012c-3ce7-01df-cc1a-f59980000000

Resource Type

Software

Properties

Resource Description

Software from this project addresses on how humans perceive acoustic discontinuities in speech. Current text-to-speech synthesis ("TTS") technology operates by retrieving intervals of stored digitized speech("units") from a database and splicing ("concatenating") them to form the output utterance. Unavoidably, there are acoustic discontinuities at the time points where the successive speech intervals meet. An unsolved problem is how to predict from the quantitative, acoustic properties of two to-be-concatenated units whether humans will hear a discontinuity. This is of immediate relevance for TTS systems that select units at run time from a large speech corpus. During selection, the systems search through the space of all possible sequences of units that can be used for the utterance and selects the sequence that has the lowest overall objective cost measure, such as the Euclidean distance between the final frame and initial frame of two units. However, research has already shown that this method and related methods do not predict well whether humans will hear a discontinuity. The current research, by being explicitly focused on perceptually optimized objective cost measures, will directly contribute to the perceptual accuracy of cost measures and hence to synthesis quality.
Used by

Center for Spoken Language Understanding
Website(s)

http://nsf.gov/awardsearch/showAward.do?AwardNumber=0313383
Website(s)

http://www.ohsu.edu/xd/education/schools/school-of-medicine/departments/basic-science-departments/biomedical-engineering/center-for-spoken-language-understanding/objective-methods-for-predicti.cfm?WT_rank=1
Developed by

Klabbers-Judd, Esther, Ph.D.
Developed by

van Santen, Jan P.H., Ph.D.
Software license

Open source software license

Inferred Types from the eagle-i Ontology (What is an ontology?)

Provenance Metadata About This Resource Record

workflow state

Published
contributor

nvasilevsky (Nicole Vasilevsky)
created

2010-11-11T15:48:01.577-06:00
creator

mhan (Mikyung Han)
modified

2012-11-05T21:12:39.888-06:00