The following directories contain (maximum-onset) syllables extracted from the LibriTTS corpus using phonetic forced-alignments from the LibriTTSLabel resource.
The audio files are extracted from the clean
subsets of the corpus and the train
files are from the clean-100
subset.
dev
, test
, and train
.An example of the information contained in each filename:
1272_128104_000001_000000_0002_jolly_JH-AA1.wav
This contains the following information separated by underscores:
1272_128104_000001_000000
--> This is the utterance id in the LibriTTS corpus, of which 1272
is the speaker id.0002
--> This is the syllable index (i.e. the third syllable extracted from this utterance).jolly
--> The surface form of the word in the utterance.JH-AA1
--> The phonetic transcription for this syllable (i.e. it is the first syllable in "jolly")For convenience in downloading, organising, selecting and partitioning the data, the following files contain lists of the filenames in each of the sub-directories dev.txt
, test.txt
, and train.txt
.
You are not required to use the complete dataset, however, all selections should be described and motivated appropriately.
If you prefer to download the entire corpus to local storage you may use the gzipped tar files: dev.tar.gz
, test.tar.gz
, and train.tar.gz
.