Prosynth DB files

Alex Chengyu Fang (alex@phonetics.ucl.ac.uk)
Thu, 26 Mar 1998 11:34:17 +0000

Dear both,

It seems Mark never received the syntactic parses of the database files,
which I posted to prosynth on 17 Feb.

1. Just to make sure that they are readily available, I've copied them to
p://temp/alex/prosynth/data.

2. The four MARSEC files, which I originally analysed for a study of
grammatical complexities of spoken English, have also been copied into the
same directory.

3. The SGML-style pronunciation lexicon for DB 1-4 is stored in
p://temp/alex/prosynth/lexicon.

4. At the end of this message I list an SGML-style format for the syntactic
parse tree, which I proposed in my progress report, "The Annotation of
Prosynth", dated 10 December 1997. Could Mark please comment and let me
know what changes/additions he'd like to see for XML conversion?

5. I've written a program that parses for IPs, AGs, and feet; its output
is illustrated in the progress report mentioned in 4. However, since Mark
has expressed an interest in writing the program himself, and since the
new program will be so much better with Mark's new segmental parser, I've
put this work on hold.

6. I shall copy all the database files and the lexicon onto the new project
machine once I have its disk mounted on my desktop.

7. My wife and I originally planned to go to France with our kid, but
fearing that there's still too much to finish before our next project
meeting, I'll stay in London without them. With the college closed
from Wed 8 April until Wed 15 April, I'll be working at home and can be
reached on 0181 455 4193.

8. Happy Easter.

Alex

==============================
<Node fun="PU" cat="COORD" att="decl">
<Node fun="CJ" cat="CL" att="act decl indic intr past unm main">
<Node fun="SU" cat="NP">
<Node fun="NPHD" cat="PRON" att="pers sing">
<WORD ID="1">I</WORD>
</Node></Node>
<Node fun="VB" cat="VP" att="act indic intr past">
<Node fun="MVB" cat="V" att="intr past">
<WORD ID="2">got</WORD>
</Node></Node>
<Node fun="A" cat="PP">
<Node fun="P" cat="PREP" att="phras">
<WORD ID="3">into</WORD>
</Node>
<Node fun="PC" cat="NP">
<Node fun="NPHD" cat="PRON" att="dem sing">
<WORD ID="4">this</WORD>
</Node></Node></Node>
<Node fun="A" cat="CL" att="act cxtr indic past sub unm">
<Node fun="SUB" cat="SUBP">
<Node fun="SBHD" cat="CONJUNC" att="subord">
<WORD ID="5">because</WORD>
</Node></Node>
<Node fun="SU" cat="NP">
<Node fun="NPPR" cat="AJP" att="attru">
<Node fun="AJHD" cat="ADJ">
<WORD ID="6">cerebro-spinal</WORD>
</Node></Node>
<Node fun="NPHD" cat="N" att="com sing">
<WORD ID="7">meningitis</WORD>
</Node></Node>
<Node fun="VB" cat="VP" att="act cxtr indic past">
<Node fun="MVB" cat="V" att="cxtr past">
<WORD ID="8">put</WORD>
</Node></Node>
<Node fun="OD" cat="NP">
<Node fun="NPHD" cat="PRON" att="pers sing">
<WORD ID="9">me</WORD>
</Node></Node>
<Node fun="CO" cat="PP">
<Node fun="P" cat="PREP" att="ge">
<WORD ID="10">out</WORD>
<WORD ID="11">of</WORD>
</Node>
<Node fun="PC" cat="NP">
<Node fun="NPHD" cat="N" att="com sing">
<WORD ID="12">action</WORD>
</Node></Node></Node>
<Node fun="A" cat="PP">
<Node fun="P" cat="PREP" att="ge">
<WORD ID="13">for</WORD>
</Node>
<Node fun="PC" cat="NP">
<Node fun="DT" cat="DTP">
<Node fun="DTPS" cat="NUM" att="card sing">
<WORD ID="14">six</WORD>
</Node></Node>
<Node fun="NPHD" cat="N" att="com plu">
<WORD ID="15">months</WORD>
</Node></Node></Node>
<Node fun="A" cat="PP">
<Node fun="P" cat="PREP" att="ge">
<WORD ID="16">in</WORD>
</Node>
<Node fun="PC" cat="NP">
<Node fun="DT" cat="DTP">
<Node fun="DTPS" cat="NUM" att="card sing">
<WORD ID="17">nineteen</WORD>
</Node></Node>
<Node fun="NPHD" cat="NUM" att="card sing">
<WORD ID="18">fifty-two</WORD>
</Node></Node></Node></Node></Node>
<Node fun="COOR" cat="CONJUNC" att="coord">
<WORD ID="19">and</WORD>
</Node>
<Node fun="CJ" cat="CL" att="act decl indic intr past unm main">
<Node fun="A" cat="CL" att="act indic montr pres sub unm">
<Node fun="SUB" cat="SUBP">
<Node fun="SBHD" cat="CONJUNC" att="subord">
<WORD ID="20">as</WORD>
</Node></Node>
<Node fun="SU" cat="NP">
<Node fun="DT" cat="DTP">
<Node fun="DTPE" cat="PRON" att="quant plu">
<WORD ID="21">few</WORD>
</Node>
<Node fun="DTCE" cat="NP" att="genc">
<Node fun="NPHD" cat="N" att="com plu">
<WORD ID="22">barristers</WORD>
</Node>
<Node fun="GENM" cat="GENM">
<WORD ID="23">'</WORD>
</Node></Node></Node>
<Node fun="NPHD" cat="N" att="com plu">
<WORD ID="24">practices</WORD>
</Node></Node>
<Node fun="VB" cat="VP" att="act indic montr pres">
<Node fun="OP" cat="AUX" att="modal pres">
<WORD ID="25">can</WORD>
</Node>
<Node fun="MVB" cat="V" att="montr infin">
<WORD ID="26">stand</WORD>
</Node></Node>
<Node fun="OD" cat="NP">
<Node fun="DT" cat="DTP">
<Node fun="DTCE" cat="PRON" att="dem sing">
<WORD ID="27">that</WORD>
</Node></Node>
<Node fun="NPHD" cat="N" att="com sing">
<WORD ID="28">sort</WORD>
</Node>
<Node fun="NPPO" cat="PP">
<Node fun="P" cat="PREP" att="ge">
<WORD ID="29">of</WORD>
</Node>
<Node fun="PC" cat="NP">
<Node fun="NPHD" cat="N" att="com sing">
<WORD ID="30">interruption</WORD>
</Node></Node></Node></Node></Node>
<Node fun="SU" cat="NP">
<Node fun="NPHD" cat="PRON" att="pers sing">
<WORD ID="31">I</WORD>
</Node></Node>
<Node fun="VB" cat="VP" att="act indic intr past">
<Node fun="MVB" cat="V" att="intr past">
<WORD ID="32">applied</WORD>
</Node></Node>
<Node fun="A" cat="PP">
<Node fun="P" cat="PREP" att="phras">
<WORD ID="33">for</WORD>
</Node>
<Node fun="PC" cat="NP">
<Node fun="DT" cat="DTP">
<Node fun="DTCE" cat="ART" att="indef">
<WORD ID="34">a</WORD>
</Node></Node>
<Node fun="NPHD" cat="N" att="com sing">
<WORD ID="35">job</WORD>
</Node></Node></Node></Node></Node>
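
==============================
With the XML conversion raised in point 4 in mind, here is a minimal sketch
of how a parse in the format above might be read back, assuming the tree is
already well-formed XML as it stands. It uses Python's standard
xml.etree.ElementTree module. The function name leaf_words and the SAMPLE
excerpt (words 1-4, with one closing tag added so the excerpt stands alone)
are illustrative assumptions, not part of any Prosynth program.

```python
import xml.etree.ElementTree as ET

# Excerpt of the parse tree above (words 1-4). The final </Node> is added
# here only so that the excerpt is a complete, well-formed fragment.
SAMPLE = """\
<Node fun="CJ" cat="CL" att="act decl indic intr past unm main">
<Node fun="SU" cat="NP">
<Node fun="NPHD" cat="PRON" att="pers sing">
<WORD ID="1">I</WORD>
</Node></Node>
<Node fun="VB" cat="VP" att="act indic intr past">
<Node fun="MVB" cat="V" att="intr past">
<WORD ID="2">got</WORD>
</Node></Node>
<Node fun="A" cat="PP">
<Node fun="P" cat="PREP" att="phras">
<WORD ID="3">into</WORD>
</Node>
<Node fun="PC" cat="NP">
<Node fun="NPHD" cat="PRON" att="dem sing">
<WORD ID="4">this</WORD>
</Node></Node></Node>
</Node>
"""


def leaf_words(root):
    """Yield (id, word, fun, cat, att list) for every WORD leaf.

    Each WORD is reported with the labels of the Node that directly
    dominates it; a missing att attribute gives an empty list.
    """
    for node in root.iter("Node"):          # document order, root included
        for word in node.findall("WORD"):   # direct WORD children only
            yield (
                int(word.get("ID")),
                word.text,
                node.get("fun"),
                node.get("cat"),
                node.get("att", "").split(),
            )


rows = list(leaf_words(ET.fromstring(SAMPLE)))
for word_id, text, fun, cat, att in rows:
    print(word_id, text, fun, cat, " ".join(att))
```

Because the att value is a space-separated list, splitting it on whitespace
gives the individual features directly; whether these should stay in one
attribute or become separate attributes is one of the open questions for
the conversion.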