eagle-i Oregon Health & Science UniversityOregon Health & Science University
See it in Search
This page is a preview of the following resource. Continue onto eagle-i search using the button on the right to see the full record.

Names Corpus

eagle-i ID


Resource Type

  1. Software


  1. Resource Description
    The Names Corpus is a collection of 24,245 first and last name utterances from 20184 speakers. The utterances were taken from many other telephone speech data collections that have been completed at the CSLU, during which callers were asked to say their first and last names, or asked to leave their name and address to receive an award coupon (addresses are not include in corpus). Each file in the Names corpus has an orthographic transcription following the CSLU Labeling Guide. Also, to take advantage of the phonemic variability, 24245 of the utterance have been phonetically transcribed. The selection of files to phonetically transcribe was constrained by a process that selected files that were suspected to contain phonetic contexts that had not yet been transcribed.
  2. Used by
    Center for Spoken Language Understanding
  3. Version
  4. Website(s)
Provenance Metadata About This Resource Record
Copyright © 2016 by the President and Fellows of Harvard College
The eagle-i Consortium is supported by NIH Grant #5U24RR029825-02 / Copyright 2016