Speech:Exps 0162

From Openitware
Jump to: navigation, search

Description

Author: Colby Chenard

Date: 17Feb2014

Purpose: The goal of this experiment was to test out different densities values on the first_5hr and 10hr trains to optimize WER. This experiment was #1 of six that were done simultaneously(0162,0164,0166,0168,0170,0182) each has the same senone value
but we increased the densities each train, so 8,16,64. I skipped 32 because that was already done by Colby J in previous experiments.

Details: Colby J and I hypothesized that we may get better results as we increase the densities. The downside to this is that the higher the density the longer it takes to decode. I hope that with these experiments that we can get under 30% error rate.

Corpus/Switchboard:

  • 10hr train

Sphinx_train.cfg:

  • Semone value: 5000
  • Density: 8

Dictionary:

  • 10hr dictionary "Master Dictionary"
    • /mnt/main/corpus/dist/custom/10hr.dic

GenTrans:

  • genTrans5.pl


Results The train ran successfully in: 2 Hours 57 Minutes Decode completed successfully in: 61848 Seconds (17.18 Hours)

                    SYSTEM SUMMARY PERCENTAGES by SPEAKER
     ,-----------------------------------------------------------------.
     |                            hyp.trans                            |
     |-----------------------------------------------------------------|
     | SPKR    | # Snt # Wrd | Corr    Sub    Del    Ins    Err  S.Err |
     |=================================================================|
     | Sum/Avg | 8860  134108| 79.6   15.6    4.8   17.4   37.8   96.7 |
     |=================================================================|
     |  Mean   | 43.9  663.9 | 79.7   15.7    4.6   18.5   38.8   96.8 |
     |  S.D.   | 19.4  286.0 |  6.8    5.5    2.3    8.6   12.5    4.2 |
     | Median  | 40.0  595.5 | 80.5   14.9    4.3   17.6   37.8   98.1 |
     `-----------------------------------------------------------------'

Successful Completion