Speech:Spring 2015 Data Group


Group Member Logs

 * Krista Cleary
 * Stephen Griffin
 * Dakota Heyman
 * Russell Sweet

Assigned machine is: TBD

Objectives:


 * Learn how to run experiments and training runs (trains)
 * Become familiar with the directory structure and the purpose of the files
 * Organize .wav files and eliminate redundancies
 * Document where all .wav files and scripts can be found
 * Remove duplicate .wav files from folders and replace them with soft links into the new organization system
 * Set up ssh accounts (log in, change passwords)
 * Figure out where the transcripts are and whether the data is still good
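
The .wav cleanup objectives above (find redundant files, then replace them with soft links) could be approached along the following lines. This is only a minimal sketch, not the group's actual script: it assumes that "redundant" means byte-for-byte identical content, and the function names `file_digest`, `find_duplicate_wavs`, and `replace_with_symlink` are hypothetical names made up for illustration.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicate_wavs(root):
    """Group regular (non-symlink) .wav files under root by content hash.

    Returns a list of groups; each group is a sorted list of paths whose
    contents are identical. Files with no duplicate are left out.
    """
    groups = defaultdict(list)
    for path in sorted(Path(root).rglob("*.wav")):
        if path.is_file() and not path.is_symlink():
            groups[file_digest(path)].append(path)
    return [paths for paths in groups.values() if len(paths) > 1]


def replace_with_symlink(duplicate, canonical, dry_run=True):
    """Replace `duplicate` with a soft link pointing at `canonical`.

    With dry_run=True (the default) nothing is changed; the planned
    action is only printed so it can be reviewed first.
    """
    duplicate, canonical = Path(duplicate), Path(canonical)
    if dry_run:
        print(f"would link {duplicate} -> {canonical}")
        return
    duplicate.unlink()
    duplicate.symlink_to(canonical.resolve())
```

A cautious way to use this would be to run `find_duplicate_wavs` on a copy of one subdirectory first, review the dry-run output, and only then call `replace_with_symlink` with `dry_run=False`.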

Objectives for the week ending 2/3:
 * Get access to caesar, set up personal SSH (PuTTY), and do initial account setup
 * Familiarize ourselves with the wiki and past semesters' work
 * Get started on the proposal we are in charge of

Progress for the week ending 3/11:

3/4: The Data Group has been tasked with cleaning up all of the directories within the Switchboard corpus. Each subdirectory (e.g., first_5hr, 256hr, 125hr_3170) contains at least two directories: train and clean. The clean directory is unnecessary and will be deleted.

The train directories will then be edited so that the audio directory inside each one contains both utt and conv. As of right now, some of the audio directories have only utt or only conv.
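
Before deleting or editing anything, it may help to audit which subdirectories are affected. The sketch below is one possible way to do that, under the assumption that each corpus subdirectory is laid out as `train/audio/utt` and `train/audio/conv` with lowercase names; the exact layout and the function name `audit_corpus` are assumptions for illustration and should be checked against the real directories. It only reports and does not modify anything.

```python
from pathlib import Path

# Audio subdirectories every train directory is expected to have.
REQUIRED_AUDIO_DIRS = ("utt", "conv")


def audit_corpus(corpus_root):
    """Report, per corpus subdirectory, what still needs cleanup.

    For each subdirectory (e.g. first_5hr), records which of the
    required audio directories are missing under train/audio and
    whether a clean directory is still present.
    """
    report = {}
    for sub in sorted(Path(corpus_root).iterdir()):
        if not sub.is_dir():
            continue
        audio = sub / "train" / "audio"
        missing = [name for name in REQUIRED_AUDIO_DIRS
                   if not (audio / name).is_dir()]
        report[sub.name] = {
            "missing_audio": missing,
            "has_clean": (sub / "clean").is_dir(),
        }
    return report
```

Printing the returned dictionary gives a checklist: any entry with a non-empty `missing_audio` list needs its audio directory completed, and any entry with `has_clean` set to true still has a clean directory to delete.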