UCL Speaker Database

Introduction

The UCL Speaker Database was primarily developed for a project on the perception of speaker variability in children and adults funded by the Wellcome Trust. The database contains recordings of a wide range of speech materials for 45 speakers of South-Eastern British English and is now being made available to other researchers investigating speaker variability and speaking styles.

Speakers

The database contains recordings of 45 speakers of British English with a fairly neutral accent or mild South-Eastern English accent. These include 18 women (mean age: 33;11 yrs), 15 men (mean age: 30;7 yrs), 6 girls (mean age: 13;2 yrs) and 6 boys (mean age: 13;2 yrs). To get an impression of the range of voices in the database, here is the word ‘park’ spoken by all 45 speakers (MP3 90k).

Speech materials

Word-level materials

  • Manchester Junior Word lists
  • UCL Markham word lists

Sentence-level materials

  • Semantically-unpredictable sentences

Read texts

  • 'Arthur the Rat' passage
  • 'Rainbow' passage

Semi-spontaneous speech

  • Description of a cartoon
  • Retelling of same cartoon from memory

Recording conditions

Speech recordings were made in the anechoic chamber of the Department of Phonetics and Linguistics, UCL using a Brüel & Kjær sound level meter. Glottal activity was measured using an electrolaryngograph. Recordings were made to DAT at a sampling rate of 44.1 kHz.

File format

Materials are included as WAV files at the original sampling rate of 44.1 kHz. The majority of the materials described above have been assembled onto a set of two DVDs which can be made available to interested researchers at low cost (to cover production costs and postage). A complete set of materials may not be available for all speakers due to time constraints during recordings or to technical problems. A list showing detailed availability for each type of material is supplied.
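Researchers who receive the DVDs may want to confirm that each file has the stated 44.1 kHz sampling rate before further processing. The following is a minimal sketch only, assuming the files are standard PCM WAV files readable by Python's built-in wave module. The function name describe_wav and the way file paths are passed in are illustrative assumptions, not part of the database distribution, and the number of channels per file is not stated here, so the sketch simply reports it.

```python
import sys
import wave

# Sampling rate stated for the database materials (44.1 kHz).
EXPECTED_RATE_HZ = 44100


def describe_wav(path):
    """Return basic header information for a PCM WAV file.

    Hypothetical helper for illustration; not supplied with the database.
    """
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        frames = wav.getnframes()
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sampling_rate_hz": rate,
            "duration_s": frames / rate,
            "matches_expected_rate": rate == EXPECTED_RATE_HZ,
        }


if __name__ == "__main__":
    # Paths are supplied by the user on the command line.
    for wav_path in sys.argv[1:]:
        print(wav_path, describe_wav(wav_path))
```

Run with one or more file paths as arguments; files whose rate differs from 44.1 kHz are reported with matches_expected_rate set to False.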

Note

We cannot offer further support for the materials provided. The recordings other than the UCL Markham test are provided unsegmented (one file per material type) so will require further preparation. Due to space limitations, materials are mostly provided without the laryngographic channel but the original recordings can be provided on request in DAT format.

The use of this material in research projects should be acknowledged in publications relating to the project. Authors can cite the following paper which contains a full description of materials:

D. Markham & V. Hazan, "The UCL Speaker Database", Speech, Hearing and Language: UCL Work in Progress, vol. 14, pp. 1-17, 2002.

For more information, please contact Valerie Hazan.

Acknowledgment

This database was produced as part of a study funded by the Wellcome Trust (055651/Z/98/JRS/JP/JAT).

Database on DVD only £100 (about US$180)  


University College London - Gower Street - London - WC1E 6BT - Telephone: +44 (0)20 7679 2000 - Copyright © 1999-2016 UCL