ProsodyPro ---- A Praat script for large-scale systematic prosody analysis (Version 4.0; Previously TimeNormalizF0.praat) [Download]

by Yi Xu

An interactive Praat script that allows you to:


Motivation and brief history

ProsodyPro is developed as a convenient tool for our own research. It allows us to systematicaly process large amount of speech data with high precision. It has maximally reduced the amount of human labor by automating tasks that do not require human judgment, such as locating and opening sound files, taking measurements, and saving raw results in formats ready for further graphical and statistical analysis. On the other hand, it also allows human intervention of processes that are prone to error in automatic algorithms such as pitch detection and segmentation.

The f0 trimming time-normalization algorithms, which are part of the core of the script, were developed in my PhD research (Xu, 1993), which were then implemented in a C program working in conjunction with xwaves, which, like Praat, generates automatic vocal cycle markings and saves most of the manual labor in marking the cycle manually as done in my dissertation. The arrival of Praat, thanks to the brilliant invention of Paul Boersma and David Weenink, makes it possible to put these algorithms together in a single script that can run on all major computer platforms. It also solved the problem of having to write a different C program for each new experiment.

The first version of the script was made public in 2005. Since then it has been used in many research projects. Some are listed here.

Instructions

  1. Put ProsodyPro.praat in the folder containing the sound files to be analyzed, and launch Praat;
  2. Select Open Praat Script... from the top menu;
  3. Locate ProsodyPro.praat in the dialogue window and select it;
  4. When the script window opens in Praat, select Run from the Run menu (or type the key shortcut command-r or control-r);
  5. In the startup window, check or uncheck the boxes according to your need, and set appropriate values in the text fields or simply use the default values. Select the task by checking the appropriate radio button.
  6. Click OK and three windows will appear. The first window (PointProcess) displays the waveform together with vocal cycle marks (vertical lines) generated by Praat. This is where you can manually add the missing marks and delete the redundant ones. You need to do this only for the named intervals, as explained next.
  7. The second window (TextGrid) displays the waveform and spectrogram of the current sound together with optional pitch track and formant tracks in the spectrogram panel, and vocal pulse marks in the waveform panel. (These tracks and marks cannot be manually changed. So you can hide them to reduce processing time by using the corresponding menu.)
  8. At the bottom of this window are two TextGrid tiers, where you can insert interval boundaries (Tier 1) and add comments (Tier 2). For any interval that you want to have results saved, a label in Tier 1 is required. The label can be as simple as a, b, c or 1, 2, 3.
  9. The third window (Pause) allows you to control the progression of the analysis. You bring up the next found to be analyzed by changing the number (or leaving it as is) in the current_file box and pressing "Continue". The number indicates the order in the String object "list" in the Object window (a hardcopy is also saved in the current folder). The next sound will be 1 + current_file (So, type 0 to open sound 1).
  10. To end the progression of the current analysis session, press "Finish" in the Pause window, and the last sound analyzed will be shown in the Praat Info window. You can use that number as a starting point in you next analysis session.
  11. After processing individual files, you can run the script again to get ensemble files by checking the third radio button from the top.
  12. You can also change various parameter after processing individual files by runing the script again with the radio button "Process all sounds without pause" checked. Just watch the script run through all the files on its own.
  13. You can also generate mean normf0 contours averaged across repetitions of identical sentences. To do this, set the value of Nrepetitions in the opening window according to the number of repetitions in your data set; run the script again with the "Process all sounds without pause" button checked, and run the script again with the "Get ensemble files" button checked. Make sure that the number of labeled intervals are identical across the repetitions.

Output

Each time you press "Continue" in the Pause window, various analysis results are saved for the current sound as text files:

If you want to change certain analysis parameters after processing all the sound files, you can rerun the script, set the "Input File No" to 1 in the startup window and check the button "Process all sounds without pause" before pressing "OK". The script will then run by itself and cycle through all the sound files in the folder one by one.

After the analysis of all the individual sound files are done, you can gather the analysis results into a number of ensemble files by running the script again and checking the button "Get ensemble results" in the startup window. The following ensemble files will be saved (some are optional):

  1. means.txt
  2. normf0.txt
  3. mean_normf0.txt (optional)
  4. normIntensity.txt
  5. normactutime.txt
  6. samplef0.txt
  7. f0velocity.txt (optional)
  8. maxf0.txt
  9. minf0.txt
  10. excursionsize.txt
  11. meanf0.txt
  12. duration.txt
  13. maxvelocity.txt (optional)
  14. finalf0.txt
  15. finalvelocity.txt (optional)
  16. meanintensity.txt

Note that you can generate the ensemble files only if you have analyzed at least one sound following the steps described earlier.


Examples

The following examples show how functional contrasts can be easily brought out by time-normalized f0 contours, whether plotted on normalized time or mean time.


_ _

_ _ _ _ _ _ _ _ _ _ Data from Xu (1999) _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ Data from Xu & Xu (2005)


Download

Need more help?

Detailed instructions can be also found at the beginning of the script.

For more information, take a look at FAQ, and if you are still stuck, please write me (yi.xu at ucl.ac.uk).

Bug reports, suggestions on improvement and new features are also welcome.

How to cite

Xu, Y. (2005-2012). ProsodyPro.praat. Available from: http://www.phon.ucl.ac.uk/home/yi/ProsodyPro/.

Published research making use of ProsodyPro (or its predecessor TimeNormalizeF0)

  1. Kenstowicz, M. (2008). On the Origin of Tonal Classes in Kinande Noun Stems. Studies in African Linguistics 37: 115-151.
  2. Hsieh, F.-f. and Kenstowicz, M. J. (2008). Phonetic knowledge in tonal adaptation: Mandarin and English loanwords in Lhasa Tibetan. Journal of East Asian Linguistics 17: 279-297.
  3. Ito, C. and Kenstowicz, M. (2009). Mandarin Loanwords in Yanbian Korean II: Tones. Language Research 45: 85-109.
  4. Wu, W. L. (2009). Sentence-final particles in Hong Kong Cantonese: Are they tonal or intonational? In Proceedings of Interspeech 2009.
  5. Zhao, Y. and Jurafsky, D. (2009). The effect of lexical frequency and Lombard reflex on tone hyperarticulation. Journal of Phonetics 37(2): 231-247.
  6. Arnhold, A., Vainio, M., Suni, A. and Jarvikivi, J. (2010). Intonation of Finnish Verbs. In Proceedings of Interspeech 2010.
  7. Chen, S.-w. and Tsay, J. (2010). Phonetic realization of suffix vs. non-suffix morphemes in Taiwanese. In Proceedings of Speech Prosody 2010, Chicago.
  8. Greif, M. (2010). Contrastive Focus in Mandarin Chinese. In Proceedings of Speech Prosody 2010, Chicago.
  9. Lee, Y.-c. and Nambu, S. (2010). Focus-sensitive operator or focus inducer: always and only. In Proceedings of Interspeech 2010.
  10. Liu, F. (2010). Single vs. double focus in English statements and yes/no questions. In Proceedings of Speech Prosody 2010, Chicago.
  11. 王玲、尹巧云、王蓓、刘岩 (2010). 德昂语布雷方言中焦点的韵律编码方式 [Prosodic focus in Bulei dialect of Deang]. Proceedings of The 9th Phonetics Conference of China (PCC2010), Tianjin.
  12. 尹巧云、王玲、杨文华、王蓓、刘岩 (2010). 德昂语中焦点和疑问语气在语调上的共同编码 [Parallel encoding of focus and interrogative modality in Deang]. Proceedings of The 9th Phonetics Conference of China (PCC2010).
  13. Zhang, J. and Liu, J. (2011). Tone Sandhi and Tonal Coarticulation in Tianjin Chinese. Phonetica 68: 161-191.
  14. Soderstrom, M., Ko, E.-S. and Nevzorova, U. (2011). It's a question? Infants attend differently to yes/no questions and declaratives. Infant Behavior and Development 34(1): 107-110.
  15. Hwang, H. K. (2011). Distinct types of focus and wh-question intonation. In Proceedings of The 17th International Congress of Phonetic Sciences, Hong Kong: 922-925.
  16. Zerbian, S. (2011). Intensity in narrow focus across varieties of South African English. In Proceedings of The 17th International Congress of Phonetic Sciences, Hong Kong: 2268-2271.
  17. Wong, P. (2012). Acoustic characteristics of three-year-olds' correct and incorrect monosyllabic Mandarin lexical tone productions. Journal of Phonetics 40: 141-151.
  18. Arunima Choudhury & Elsi Kaiser (2012). Prosodic focus in Bangla: A psycholinguistic investigation of production and perception. Linguistic Society of America Annual Meeting, Portland, OR.
  19. 髙橋 康徳 (2012). 上海語変調ピッチ下降部の音声実現と音韻解釈. コーパスに基づく言語学教育研究報告 No. 8, 51-72.


Yi's other tools