Speech:Exps 0170

From Openitware
Jump to: navigation, search

Description

Author: Colby Chenard

Date: 17Feb2014

Purpose: The goal of this experiment was to test out different densities values on the first_5hr and 10hr trains to optimize WER. This experiment was #5 of six that were done simultaneously(0162,0164,0166,0168,0170,0182) each has the same senone value
but we increased the densities each train, so 8,16,64. I skipped 32 because that was already done by Colby J in previous experiments.

Details: Colby J and I hypothesized that we may get better results as we increase the densities. The downside to this is that the higher the density the longer it takes to decode. I hope that with these experiments that we can get under 30% error rate.

Corpus/Switchboard:

  • first _5hr train

Sphinx_train.cfg:

  • Semone value: 5000
  • Density: 16

Dictionary:

  • first_5hr_train_full "Master Dictionary"
    • /mnt/main/corpus/dist/custom/first_5hr_train_full.dic

GenTrans:

  • genTrans5.pl


Results The train ran in: 1 Hour 37 Min
The Decode ran in: 32611 Sec (9.05 Hours)

                    SYSTEM SUMMARY PERCENTAGES by SPEAKER
     ,-----------------------------------------------------------------.
     |                            hyp.trans                            |
     |-----------------------------------------------------------------|
     | SPKR    | # Snt # Wrd | Corr    Sub    Del    Ins    Err  S.Err |
     |=================================================================|
     | Sum/Avg | 4659  68616 | 88.1    7.8    4.1   13.5   25.4   89.3 |
     |=================================================================|
     |  Mean   | 58.2  857.7 | 87.9    8.1    3.9   14.8   26.9   90.3 |
     |  S.D.   | 22.1  330.0 |  4.2    3.2    1.9    7.5    9.7    7.9 |
     | Median  | 55.5  813.0 | 88.6    7.4    3.5   13.9   25.6   92.7 |
     `-----------------------------------------------------------------'