Speech:Exps 0310 036

Description
Author: Stephen Thibault (UserID: sdt1001)

Date: 4-23-2018

Purpose:

300hr non-LDA Train with Seen Decode and Scoring using sphinx_train.cfg parameter changes as described in "Details" below with new genTrans.pl developed by the Data Group.

Details:

In the sphinx_train.cfg file, changed the following:
 * change line 101 "CFG_STATESPERHMM = 3" from 3 to 5
 * change line 102 "CFG_SKIPSTATE = 'no'" from no to yes
 * change line 107 "CFG_FINAL_NUM_DENSITIES = 8" from 8 to 32
 * change line 120 "CFG_N_TIED_STATES = 1000" from 1000 to 8000

Performed on drone server Traubadix.

Results: COMPLETED.

Copied the LM from 0310/019. Ran makeTest.pl -t switchboard/300hr 0310/036 0310/036 which with a -d flag would be for Unseen Decode, but I decided to use this command instead of the awk '{print $1}'...  that I used in 0310/019. This yielded the following in command line:
 * LM dir is ready.
 * Decode is ready to be executed.
 * AM pointed to 0310/036
 * LM generated from switchboard/300hr/test/trans/train.trans
 * Note: Generate feats with
 * genFeats.pl -d
 * then execute
 * run_decode.pl 0310/036 0310/036 

nohup run_decode.pl 0310/036 0310/036  & commenced at 11:00 am 26 April.

parseDecode.pl decode.log hyp.trans performed the following day.

sclite -r 036_train.trans -h hyp.trans -i swb >> scoring.log

SYSTEM SUMMARY PERCENTAGES by SPEAKER

,-.     |                            hyp.trans                            | |=================================================================|     | Sum/Avg | 4034  57411 | 74.9   18.9    6.2    8.3   33.4   86.8 | |=================================================================|     |  Mean   |  1.3   18.5 | 77.2   18.2    4.6   16.4   39.2   86.7 | | S.D.   |  0.5   16.1 | 18.2   15.7    7.1   31.6   35.7   31.9 | | Median |  1.0   13.0 | 78.6   16.3    0.0    4.7   33.3  100.0 | `-'

Successful Completion