eagle-i Oregon Health & Science UniversityOregon Health & Science University
See it in Search
This page is a preview of the following resource. Continue onto eagle-i search using the button on the right to see the full record.

Voice Conversion Algorithm based on Spectral Envelope Mapping and Residual Prediction

eagle-i ID


Resource Type

  1. Algorithmic software component


  1. Resource Description
    "The purpose of a voice conversion (VC) system is to change the perceived speaker identity of a speech signal. We propose a new algorithm based on converting the LPC spectrum and predicting the residual as a function of the target envelope parameters. We conduct listening tests based on speaker discrimination of same/difference pairs to measure the accuracy by which the converted voices match the desired target voices. To establish the level of human performance as a baseline, we firstmeasure the ability of listeners to discriminate between original speech utterances under three conditions: normal, fundamental frequency and duration normalized, and LPC coded. Additionally, the spectral parameter conversion function is tested in isolation by listening to source, target, and converted speakers as LPC coded speech. The results show that the speaker identity of speech whose LPC spectrum has been converted can be recognized as the target speaker with the same level of performance as discriminating between LPC coded speech. However, the level of discrimination of converted utterances produced by the full VC system is significantlybelow that of speaker discrimination of natural speech."
  2. Used by
    Center for Spoken Language Understanding
  3. Related Publication or Documentation
    Design and Evaluation of a Voice Conversion Algorithm based on Spectral Envelope Mapping and Residual Prediction
  4. Developed by
    Kain, Alexander, Ph.D.
  5. Developed by
    Macon, Michael W.
Provenance Metadata About This Resource Record
Copyright © 2016 by the President and Fellows of Harvard College
The eagle-i Consortium is supported by NIH Grant #5U24RR029825-02 / Copyright 2016