
Names Corpus

http://ohsu.eagle-i.net/i/0000012c-56e1-03bb-b172-130f80000000

Resource Description

The Names Corpus is a collection of 24,245 first and last name utterances from 20,184 speakers. The utterances were taken from many other telephone speech data collections completed at the CSLU, in which callers were asked to say their first and last names, or to leave their name and address to receive an award coupon (addresses are not included in the corpus). Each file in the Names Corpus has an orthographic transcription following the CSLU Labeling Guide. In addition, to take advantage of the phonemic variability, 24,245 of the utterances have been phonetically transcribed. Files were chosen for phonetic transcription by a process that selected those suspected to contain phonetic contexts that had not yet been transcribed.
Used by

Center for Spoken Language Understanding
Version

1.3
Website(s)

http://www.cslu.ogi.edu/corpora/names/

Inferred Types from the eagle-i Ontology

Provenance Metadata About This Resource Record