Subject: alignment file conversion Date: Sun, 28 Jul 2002 13:14:11 PDT From: Andreas Stolcke the programs converting the trainer alignment file format (produced by go.dump-alignments) from ascii to binary and back are alignments-ascii2bin alignments-bin2ascii They are installed in $EVAL2000/bin/$MACHINE_TYPE. (Chuck Note: see below for more info on $EVAL2000 and $MACHINE_TYPE.) The output of alignments-bin2ascii looks something like this: -pau- 0 0 - 2 6 1.000400 -pau- 0 0 - 2 7 1.000400 -pau- 0 0 - 2 8 0.951517 -pau- 0 0 - 2 9 0.030357 treating 0 0 t 0 8 0.048921 treating 0 0 t 0 9 0.970059 treating 0 0 t 0 10 0.048077 treating 0 0 t 0 11 0.001049 treating 0 0 t 1 10 0.952374 treating 0 0 t 1 11 0.999500 The meaning of the columns is: #1 word label #2 wordnet node index #3 wordnet arc index (out of given node) #4 phone label #5 state number (within phone) #6 frame number #7 posterior probability (of being in that state at that frame) To find the total posterior probability for a phone at a given frame you should sort the output by frame number and phone labels, and add up all the probabilities with the same phone label.The environment variable "$EVAL2000" should be set to:
/u/drspeech/opt/eval2000/systemand the $MACHINE_TYPE environment variable can be set using $EVAL2000/bin/machine-type, which on the sparcs, produces "sparc-elf" and on the linux boxes, produces "i686".
Previous: 7.4 What is forced alignment? - Next: 7.6 What is embedded training?
Back to ICSI Speech FAQ index
Generated by build-faq-index on Tue Mar 24 16:18:17 PDT 2009