Course Description
Speech recognition is a growing part of applications in a wide variety of industries, from call centers and mobile internet applications to medical dictation. However, the technology still falls well short of human performance. This course covers speech recognition and synthesis from both applied and theoretical perspectives. Students will build a speech application using commercial tools, then work through the underlying components and algorithms to understand how the state of the art can be moved forward. Topics include phonetics, Hidden Markov Models, finite state grammars, statistical language models, conversational systems, speech synthesis, and industry standards for implementing applications, such as VXML.
- Time:
- Mondays & Wednesdays, 2:00–3:20
- Location:
- Pearlman Hall 203
- Textbook:
- Speech and Language Processing (Second Edition), by Daniel Jurafsky & James H. Martin
Professor
- Marie Meteer
- Email: mmeteer@cs.brandeis.edu
- Office: Volen 256
- Office Hours:
- Mondays & Wednesdays, 1:00–2:00
- and by appointment
Teaching Assistant
- TBD
Schedule
- Topics and assignments for each class are posted on the schedule page. Please check it regularly, as it may change throughout the year.
- Details on the assignments are posted on the assignments page. Again, please check it regularly; I'll update it as the assignments get closer.
Grading
- Graded work falls into the types listed in the table below. Due dates will be posted on the schedule page and announced in class.
- Policy on working together: Unless the assignment specifically states otherwise, all assignments must be done independently. However, when working with third-party toolsets, you may collaborate on getting the tools installed and running. To keep this collaboration fair for everyone, you must post questions and answers on the class Latte blog, even if it's just a summary of a hallway conversation. If it was helpful, share it.
| Type | Percent of grade | Description |
|---|---|---|
| Programming Assignments | 50% | These will include building a speech recognition application, using speech tools to build new models to improve performance, and analyzing data and writing short reports on how something might change given different conditions. There will be 4–5 programming assignments over the year. |
| Quizzes | 30% | Quizzes are relatively short (30–40 minutes), with 4–6 questions on the material covered in class. They will be given roughly every 3 weeks. If you miss a quiz, you need to make it up. |
| Class Participation | 20% | Attendance, paying attention, answering questions, and participation in class Latte discussions. Throughout the semester, I will post questions about the reading or class material. You should make at least one substantive comment per post. |
