Speech:Exps 0305 009

Description
Author: Tri Nguyen (UserID: tmn1001)

Date: 3-6-2018

Edit genTrans.pl and parseLMTrans.pl so that square brackets ([]) and hyphens (-) are removed from both the transcripts and the LM.
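The edited Perl scripts are not reproduced on this page, so the following is only a minimal shell sketch of the kind of character removal described above. It assumes the goal is to delete the literal characters "[", "]", and "-" from each line; the helper name strip_marks and the sample line are hypothetical. It would also strip hyphens from utterance IDs if they were present, which the real scripts may handle differently.

```shell
# Hypothetical helper (not one of the experiment scripts):
# delete every "[", "]", and "-" character read from stdin.
strip_marks() {
  # In the bracket expression, "]" is placed first and "-" last
  # so that both are treated as literal characters.
  sed 's/[][-]//g'
}

# Example with a made-up transcript line.
printf '%s\n' '[laughter] uh-huh th- okay' | strip_marks
```

Deleting only the characters keeps the enclosed text (for example "laughter") as an ordinary word; dropping the whole bracketed token would be a different design choice.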

Purpose:

Details:

ssh asterix

cd /mnt/main/Exp/0305/009

makeTrain_2018.pl switchboard 5hr/train

genFeats.pl -t

nohup scripts_pl/RunAll.pl &

mkdir LM

cd LM

cp -i /mnt/main/corpus/switchboard/5hr/train/trans/train.trans trans_unedited

/mnt/main/scripts/user/parseLMTrans_remove_all.pl trans_unedited trans_parsed

cp -i /mnt/main/scripts/user/lm_create.pl .

./lm_create.pl trans_parsed

cd ..

cd etc

awk '{print $1}' /mnt/main/corpus/switchboard/5hr/test/trans/train.trans >> /mnt/main/Exp/0305/009/etc/009_decode.fileids

nohup run_decode.pl 0305/009 0305/009 1000 &

parseDecode.pl decode.log hyp.trans

sclite -r 009_train.trans -h hyp.trans -i swb >> scoring.log

tail -8 scoring.log

Results:

| Sum/Avg | 4172  60048 | 73.0   19.5    7.5    9.1   36.1   89.6 |
|=================================================================|
|  Mean   |  1.3   19.1 | 76.5   18.3    5.2   19.0   42.6   90.1 |
|  S.D.   |  0.5   16.4 | 17.9   15.2    7.2   32.5   34.6   27.5 |
| Median  |  1.0   15.0 | 76.9   16.7    0.0    6.5   33.3  100.0 |
`-----------------------------------------------------------------'
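In the standard sclite summary layout, the six percentage columns are Corr, Sub, Del, Ins, Err, and S.Err. The Sum/Avg row can be sanity-checked from that: Err should equal Sub + Del + Ins, and Corr should equal 100 - Sub - Del. The sketch below recomputes both from the values reported above; the function name wer_check is hypothetical and is not part of the experiment scripts.

```shell
# Hypothetical helper: recompute sclite's Err and Corr percentages
# from the Sub, Del, and Ins percentages passed as arguments.
wer_check() {
  awk -v sub_pct="$1" -v del_pct="$2" -v ins_pct="$3" 'BEGIN {
    # Err = Sub + Del + Ins; Corr = 100 - Sub - Del
    printf "Err=%.1f Corr=%.1f\n", sub_pct + del_pct + ins_pct, 100 - sub_pct - del_pct
  }'
}

# Values taken from the Sum/Avg row.
wer_check 19.5 7.5 9.1
```

For this run the recomputed figures agree with the reported 36.1 (Err) and 73.0 (Corr).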