Course Outline

The course outline will change regularly as the semester unfolds. Check regularly for updated readings and assignments.

Dates Topic Reading Assignments
Sep 7 Overview of Speech and the Industry History of Automatic Speech Recognition, Juang & Rabiner
State of the Art, Makhoul & Schwartz
Blog Assignment 1: Speech Application Review
Sep 12 Speech Recognizer Components Interview with Vlingo CTO Mike Phillips, Speech Tuning and Statistical Grammars
Sep 14 Building speech applications Speech Mashup Framework,
Speech Mashup Guide,
Kotelly, Writing Effective Prompts
Speech application review due (postponed til links are up)
Sep 19 LVCSR Applications
Sep 21 Phonetics and dictionaries Jurafsky & Martin Ch. 7
Sep 26 Front End Feature Extraction Jurafsky & Martin, SLP Ch. 9.1, 9.3
CMU Spectograms, Cepstrum, etc
Blog Assignment 2: Speech Toolset Review
Quiz 1, Due Tuesday 9/26 5 pm
Sep 28 Viterbi Algorithm and HMM Review Jurafsky & Martin, Ch. 6, Ch.9.2 Dictionary confusability Due Oct 12
Oct 3 Baum-Welch Eisner paper Eisner spreadsheet
Oct 5 Gaussians Jurafsky & Martin, Ch. 9.3-4
Oct 10 Language Modeling Jurafsky & Martin, Ch. 4 (review)
Two Decades of LM, Rosenfeld
Speech App Design description due
Oct 12 Ngrams & training Goodman, A bit of progress ... Phonetic confusability due
Oct 17 No class Modeling Conversational Speech,
Sentence Level Mixtures
Filtering Web Text
Linguistically Motivated Language Model
Integrating Symbolic and Statistical Approaches
Quiz 2 questions
Oct 19 Decoding & Evaluation SLP Ch. 9.5-9 Quiz 2 due. Submit via Latte
Oct 24 Sphinx: hands on walk through SUN Technical Report & Sphinx Tutorial
Oct 26 Language Modeling Experiment  
Oct 31 Overivew: HTK Speech Recognizer HTK Tutorial
Nov 2 DARPA Speech Evaluations   Speech Analysis
Nov 7 LVCSR vs. Phonetics In class group discussions of speech papers Sphinx Decoding and Evaluation
Nov 9 Speech Recogition Analysis Presentations  
Nov 14 Discussion of speech application prototypesf Working prototype speech application due
Nov 16 Speech Synthesis Jurafsky & Martin Ch. 8  
Nov 21 SPeech Synthesis    
Nov 28 No class Sphinx LM Baslines due
Nov 30 Dialog Jurafsky & Martin Ch. 24  
Dec 5 Dialog PARADISE Dialog Evaluation Dialog Evaluation Blog Assignment
Dec 7 Voice User Interface Design, Guest speaker, Dr. Bernhard Suhm Human Factors in IVR - Chapter 1,
CHI2009 Suhm-Peterson
Sphinx LM improvements (due 12/19)
Dec 12 Dialog Evaluation Dialog Evaluation Slide(s) due at 1 PM
Dec 14
5 pm
Final presentations and demonstrations Final appilcation presentations
Dec 19 Sphinx LM improvements due