[an error occurred while processing this directive]
Research
Research Groups
Research Projects
Research Staff
Publications
Reports
Working Papers in Linguistics
Speech, Hearing and Language
Books
Annual PhD Day
Seminars, Workshops, Lectures
Current Events
Past Events
Speech Science Forum
Reading Groups

Research in Speech Recognition

If you are interested in doing graduate research in speech recognition in the department, please contact Mark Huckvale at .

Activities

The focus of our research in speech recognition has been how knowledge of human linguistic processing can aid the development of machine recognition of speech. This has two main threads: the application of traditional linguistic analysis within the decoder, and the analysis of machine and human recognition performance.

Recent work on linguistic analysis has looked at incorporating morphological analysis into the speech recognition decoder. Morphological analysis allows for smaller pronunciation lexicons, while at the same time increasing the dissimilarity of pronunciations. In combination with a word-level language model, we have shown that this approach can lead to improved word accuracy. See the article by Huckvale & Fang (2001).

Work on human-machine comparisons has investigated the psycholinguistic aspects of human morphological processing. Interesting parallels can be drawn between our morph recogniser and the results of priming experiments on human listeners. The most recent work here has been an investigation of how information processing accounts of perception can be informed by knowledge of pattern recognition systems engineering (see seminar).

Current work by Gordon Hunter and Mark Huckvale is looking at the statistical properties of words in dialogue turns. The hope is both to develop superior language models specifically for dialogue and also to find relationships between statistical models and conversational analysis.

Staff

Research work in speech recognition is led by Mark Huckvale with the assistance of postgraduate research students. Mark is an engineer by training who has worked in speech synthesis and recognition for over 15 years. He is also the author of the Speech Filing System tools which incorporate some speech recognition functionality.

Research projects

A recent research project on recognition:

Publications

Recent published papers in the area (in reverse order):

Speech Data

The following data sets are available to research workers in speech recognition at nominal charge:

UCL1000 - 1000 read sentences These are 1000 read sentences taken from the British National Corpus and read by 5 male and 5 female speakers of southern British English (100 sentences each). Recordings and transcriptions in standard format. Hear a sample.

If you are interested in these please contact Mark Huckvale ().

Software Tools

The following software tools are available to research workers in speech recognition free of charge: Speech Filing System
Our general purpose speech analysis toolkit. Contains tools for isolated word recognition, hidden-Markov modelling and semi-automatic annotation. Refer to the Speech Filing System Home Page. British National Corpus pre-processing tool
A tool for pre-processing the BNC texts into a standard format, removing punctuation and performing text normalisation.

Facilities

The Department has very good facilities for conducting research in speech recognition, including: