TXANAL

NAME

txanal -- pitch epoch detector from Speech or Lx

SYNOPSIS

txanal (-I) (-i item) (-l) (-f|-t|-v) (-p polarity) file

DESCRIPTION

txanal is a program to find the locations of larynx excitation points in the voiced regions of a speech signal. The output is saved as a TX item. Options allow the input of an FX item or a TR item to act as a voicing indicator.

The process is essentially: (i) high-pass filter the signal, (ii) pre-emphasise, (iii) find the residual after LPC analysis, (iv) find all local maxima in the residual, (v) perform a dynamic programming (DP) search for the best sequences of peaks.
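
As an illustration only, steps (ii) and (iv) can be sketched in a few lines of Python. This is not the code used by txanal: the function names and the pre-emphasis coefficient of 0.95 are assumptions made for the example, and the high-pass, LPC and DP stages are left out.

```python
def pre_emphasise(x, coeff=0.95):
    """First-order pre-emphasis: y[n] = x[n] - coeff * x[n-1].

    coeff=0.95 is an assumed, typical value, not one taken from txanal.
    """
    if not x:
        return []
    # The first sample has no predecessor, so it is passed through unchanged.
    return [x[0]] + [x[n] - coeff * x[n - 1] for n in range(1, len(x))]


def local_maxima(x):
    """Return indices n where x[n-1] < x[n] and x[n] >= x[n+1]."""
    return [n for n in range(1, len(x) - 1) if x[n - 1] < x[n] >= x[n + 1]]


# Hypothetical usage on a short list of sample values.
emphasised = pre_emphasise([1.0, 1.0, 1.0], coeff=0.5)
candidates = local_maxima([0, 2, 1, 3, 3, 0])
```

In a pipeline of the kind described above, the candidate indices would be taken from the LPC residual and then passed to the DP search, which selects a plausible sequence of excitation points among them.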

This algorithm was originally used in the "epochs" program of ESPS.

LICENCE

Some code in this program was originally written by David Talkin and Derek Lin as part of the Entropic Signal Processing System and is used under licence from Microsoft.

OPTIONS

-I Identify program name and version number.

-i item Select input item number.

-l Use LX signal as source. Some processing is modified for LX signals.

-f Use FX item to define voiced regions. Voiced regions are taken to be regions with an FX value greater than 0. The fxanal(SFS1) or fxrapt(SFS1) programs are a suitable source.

-t Use TR item to define voiced regions. Voiced regions are taken to be regions where the track is greater than 0.5. The vdegree(SFS1) program is a suitable source.

-v Treat whole signal as voiced. The default is to apply a simple voicing decision based on energy and zero-crossing rate.

-p polarity Specify the signal polarity as positive (1) or negative (-1). By default the polarity is automatically detected. Specifying the polarity when known increases the speed of operation.
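
The threshold rules given for -f and -t can be sketched as follows. This is a minimal illustration, not the txanal implementation: the function name, the use of plain Python lists for the FX and TR contours, and the return of half-open frame index ranges are all assumptions made for the example.

```python
def voiced_regions(track, threshold):
    """Return (start, end) frame ranges where track > threshold.

    Ranges are half-open: frames start .. end-1 are voiced.
    """
    regions = []
    start = None
    for i, value in enumerate(track):
        if value > threshold and start is None:
            start = i                      # a voiced run begins
        elif value <= threshold and start is not None:
            regions.append((start, i))     # the voiced run ends
            start = None
    if start is not None:
        regions.append((start, len(track)))  # run continues to the end
    return regions


# FX contour (hypothetical values in Hz): voiced where FX > 0, as for -f.
fx_regions = voiced_regions([0, 120, 118, 0, 0, 95], 0)

# TR voicing track (hypothetical values): voiced where track > 0.5, as for -t.
tr_regions = voiced_regions([0.1, 0.6, 0.9, 0.5, 0.7], 0.5)
```

Note that the comparison is strict in both cases, so a track value of exactly 0.5 counts as unvoiced. The default energy and zero-crossing decision is not sketched here because its details are not given above.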

INPUT ITEMS

SPEECH Any speech signal

LX Any Lx signal

OUTPUT ITEMS

TX Larynx excitation points.

HISTORY

VERSION/AUTHOR

ESPS David Talkin and Derek Lin

SFS Mark Huckvale

SEE ALSO

HQtx(SFS1), vtx(SFS1), pp(SFS1)
Fri Jul 09 14:54:48 2004