CMU Multilingual NLP 2020 - Automatic Speech Recognition

Overview

This course on automatic speech recognition (ASR) covers the three main components of a recognizer: acoustic modeling, pronunciation modeling, and language modeling. It starts from a voice dialing system that matches spoken input against stored templates in the frequency domain using dynamic time warping (DTW), looks at the issues with DTW and at more reliable matching and distances, and then extends the template model to training an acoustic model. The language model portion covers estimating the cost of a sequence of words in a language. The course also discusses how ASR success is measured. It is intended for anyone interested in multilingual natural language processing and automatic speech recognition.

Syllabus

Automatic Speech Recognition
Voice Dialing System
Matching in Frequency Domain
Dynamic Time Warping
DTW algorithm
Matching Templates
DTW issues
More reliable matching
More reliable distances
Extending template model
Training an acoustic model
Language Model: estimate the cost of a sequence of words in the language; needs appropriate training data
Pronunciation Model
Measuring ASR Success
How good is good?
ASR Discussion Point
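
The template-matching items in the syllabus (Dynamic Time Warping, Matching Templates) can be illustrated with a minimal sketch. It is not taken from the course materials: the function names dtw_distance and match_template are hypothetical, each frame is assumed to be a single number rather than a frequency-domain feature vector, and the absolute difference stands in for whatever frame distance the course uses.

```python
def dtw_distance(a, b):
    """Return the DTW alignment cost between two sequences of numbers.

    Assumption: frames are scalars and the local distance is |x - y|.
    A real system would compare frequency-domain feature vectors.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] is the best cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor cells:
            # advance in a only, advance in b only, or advance in both.
            cost[i][j] = d + min(cost[i - 1][j],
                                 cost[i][j - 1],
                                 cost[i - 1][j - 1])
    return cost[n][m]


def match_template(utterance, templates):
    """Return the label of the stored template with the lowest DTW cost."""
    return min(templates,
               key=lambda label: dtw_distance(utterance, templates[label]))


# Hypothetical stored templates for a two-word voice dialing vocabulary.
templates = {"call": [1, 2, 3], "stop": [5, 5, 1]}
best = match_template([1, 1, 2, 3, 3], templates)
```

Because DTW lets one frame align with several frames of the other sequence, a slower rendition of a word (here with repeated frames) can still match its template at zero cost, which is the motivation for using it in template matching.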

Taught by

Graham Neubig

