Speech:Exps 0283 006

Description
 * Authors:

Ben Leith, James Schumacher, Ryan O'Neal, Jon Shallow

Date Created: 3/2/16

 * Purpose:

We found that the 256 hr corpus contains approximately 11,000 corrupt utterances, starting after the first 32,000 files. We have also been told that our 125 hr train configurations will have little effect on the larger 256 hr train, so we need to start running 256 hr trains and cannot reuse the same configurations. This changes our approach to the experiments: we now have to come up with new configurations to reduce the WER, including a new baseline for the new corpus. We also need a way to copy the good corpus data while keeping the corrupted files out of the experiment. We received permission from Prof. Jonas to perform this task.

 * Details:

TBD

 * Results:

Failed due to changes in the Caesar directory structure. See 007 for the updated changes.

 * Concerns:

Finding the best way to run a 256 hr train without feeling that the 125 hr train configurations taught us nothing about the best train configurations.
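
The filtering task described under Purpose (copying the good corpus data while leaving the corrupt utterances out) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the procedure actually used in this experiment: it assumes the utterance files sit in a single flat directory, that each file's name without its extension is the utterance ID, and that a list of corrupt IDs is already available. The function name `copy_clean_corpus` and its parameters are hypothetical.

```python
import shutil
from pathlib import Path


def copy_clean_corpus(src_dir, dst_dir, corrupt_ids):
    """Copy every file in src_dir whose stem is NOT in corrupt_ids.

    Assumptions (hypothetical, not taken from the experiment setup):
    a flat source directory, and utterance ID == file name without extension.
    Returns a (copied, skipped) tuple of counts.
    """
    src, dst = Path(src_dir), Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    corrupt = set(corrupt_ids)  # set gives fast membership checks

    copied, skipped = 0, 0
    for path in sorted(src.iterdir()):
        if not path.is_file():
            continue  # ignore subdirectories
        if path.stem in corrupt:
            skipped += 1  # leave corrupt utterances out of the new corpus
            continue
        shutil.copy2(path, dst / path.name)  # copy contents and metadata
        copied += 1
    return copied, skipped
```

Calling it with a source directory, an empty destination directory, and the set of corrupt utterance IDs returns how many files were copied and how many were skipped, which gives a quick sanity check against the expected count of roughly 11,000 corrupt utterances.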