Speech:Spring 2012 Sphinx Speech Tools Tasks


 * Home
 * Semesters
 * Spring 2012
 * Proposal
 * Report

To Do
This is a list of things that we would like to have done no later than April 4th. A thanks to Mike for composing this list for us.

Todo's
 * Break list list into four different groups for each group member
 * find or create script (call it genDict.pl) that generates dictionary for decoding and training
 * this may already exist...check logs...if not it needs to be written
 * genTrans.pl actually creates your transcripts and also the wave files.
 * take a look at it and you'll see it uses "sox"
 * understand how this works
 * on your assigned machines (i think you have two, right?), pick one and get decode/train to run
 * you can use caesar as a guide but you want to be able to recreate what Brian did
 * I believe all machines have trainer installed but caesar might be the only one with decoder v3
 * the tar ball for decoder v3 is on caesar in INSTALL
 * if you document all these steps, then the entire class should be ready to go do a mini-train
 * i.e. document how to:
 * create a dictionary (and capture this via a perl script)
 * create transcripts and wave files: genTrans.pl does this
 * understand how to use it
 * also understand how it does what it does!!!
 * installing sphinx v3 decoder on a machine
 * document what was involved
 * running a training
 * use the summer wiki but document in your own words what was involved
 * you can use that wiki as a base but create a new spring 12 one and add to it
 * i.e. it's pretty terse at the moment, maybe more explanation would help
 * running a decode
 * focus on test on train
 * know though what decoder's inputs are

There is enough here to subdivide into 4 parts, create a plan accordingly and write this plan up in your proposal with a time line. Obviously we'd like to have at least a month to 6 weeks before the end of the semester to use what you've done to then do a mini-train and possible full-train, so that should help you gauge your timeline (i.e. you don't have the entire semester to do this part).

To be successful you have to become very comfortable with unix, become somewhat proficient in Perl (though not an expert programmer) and get familiar with the basics of speech recognition. So plan your effort accordingly.