eagle-i Oregon Health & Science UniversityOregon Health & Science University
See it in Search
This page is a preview of the following resource. Continue onto eagle-i search using the button on the right to see the full record.

Objective Methods for Predicting and Optimizing Synthetic Speech Quality (Synth. Speech Qual.)

eagle-i ID

http://ohsu.eagle-i.net/i/0000012c-3ce7-01df-cc1a-f59980000000

Resource Type

  1. Software

Properties

  1. Resource Description
    Software from this project addresses on how humans perceive acoustic discontinuities in speech. Current text-to-speech synthesis ("TTS") technology operates by retrieving intervals of stored digitized speech("units") from a database and splicing ("concatenating") them to form the output utterance. Unavoidably, there are acoustic discontinuities at the time points where the successive speech intervals meet. An unsolved problem is how to predict from the quantitative, acoustic properties of two to-be-concatenated units whether humans will hear a discontinuity. This is of immediate relevance for TTS systems that select units at run time from a large speech corpus. During selection, the systems search through the space of all possible sequences of units that can be used for the utterance and selects the sequence that has the lowest overall objective cost measure, such as the Euclidean distance between the final frame and initial frame of two units. However, research has already shown that this method and related methods do not predict well whether humans will hear a discontinuity. The current research, by being explicitly focused on perceptually optimized objective cost measures, will directly contribute to the perceptual accuracy of cost measures and hence to synthesis quality.
  2. Used by
    Center for Spoken Language Understanding
  3. Website(s)
    http://nsf.gov/awardsearch/showAward.do?AwardNumber=0313383
  4. Website(s)
    http://www.ohsu.edu/xd/education/schools/school-of-medicine/departments/basic-science-departments/biomedical-engineering/center-for-spoken-language-understanding/objective-methods-for-predicti.cfm?WT_rank=1
  5. Developed by
    Klabbers-Judd, Esther, Ph.D.
  6. Developed by
    van Santen, Jan P.H., Ph.D.
  7. Software license
    Open source software license
 
RDFRDF
 
Provenance Metadata About This Resource Record
Copyright © 2016 by the President and Fellows of Harvard College
The eagle-i Consortium is supported by NIH Grant #5U24RR029825-02 / Copyright 2016