Modified Senone Value (3000) Train on Last_5hr
Author: Eric Beikman
Purpose: To determine the effect of forcing the Sphinx trainer use the number of tied states, or Senones, from the default of 1000, to 3000.
Senones are the result of the trainer collecting all the HMM states of all Triphones and training it into a single, collective global state. Tied-States is a synonym of Senones. In other words, Senones are multiple phones which are related to each other.
According to CMU, Specifically [This table], The amount of Senones roughly corresponds to the amount of audio data. We are using a 5 hour corpus, as such, the default value of 1000 is inadequate. For this experiment, we shall utilize 3000 senone, which is slightly more than what is recommended by CMU for this amount of audio data. Our goal is to determine if this has an effect on the quality of the acoustic model.
The remaining aspects of the experiment was taken directly from experiment 0089. We are using its dictionaries, phone list, and using genTrans5.pl to generate the experiment's transcript.
This experiment is similar to Experiment 0102, except with a larger senone value.
Results The train was able to run successfully. This experiment is continued in Experiment 0109.