Speech:Exps 0182

From Openitware
Jump to: navigation, search

Description

Author: Colby Chenard

Date: 17Feb2014

Purpose: The goal of this experiment was to test out different densities values on the first_5hr and 10hr trains to optimize WER. This experiment was #6 of six that were done simultaneously(0162,0164,0166,0168,0170,0182) each has the same senone value
but we increased the densities each train, so 8,16,64. I skipped 32 because that was already done by Colby J in previous experiments.

Details: Colby J and I hypothesized that we may get better results as we increase the densities. The downside to this is that the higher the density the longer it takes to decode. I hope that with these experiments that we can get under 30% error rate.

Corpus/Switchboard:

  • first _5hr train

Sphinx_train.cfg:

  • Semone value: 5000
  • Density: 64

Dictionary:

  • first_5hr_train_full "Master Dictionary"
    • /mnt/main/corpus/dist/custom/first_5hr_train_full.dic

GenTrans:

  • genTrans5.pl


Results The train ran in: 3 Hours 22 Min The Decode rain in: 32749 Sec (9.09 Hours)

                    SYSTEM SUMMARY PERCENTAGES by SPEAKER
     ,-----------------------------------------------------------------.
     |                            hyp.trans                            |
     |-----------------------------------------------------------------|
     | SPKR    | # Snt # Wrd | Corr    Sub    Del    Ins    Err  S.Err |
     |=================================================================|
     | Sum/Avg | 4659  68616 | 85.7    6.1    8.2    7.4   21.6   78.1 |
     |=================================================================|
     |  Mean   | 58.2  857.7 | 85.7    6.1    8.2    8.0   22.3   79.8 |
     |  S.D.   | 22.1  330.0 |  5.4    1.4    5.0    4.1    6.5    9.5 |
     | Median  | 55.5  813.0 | 85.8    5.9    7.0    7.1   21.5   81.6 |
     `-----------------------------------------------------------------'