OpenAI Whisper - Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI Whisper - Robust Speech Recognition via Large-Scale Weak Supervision

Aleksa Gordić - The AI Epiphany via YouTube Direct link

Intro

1 of 17

1 of 17

Intro

Class Central Classrooms beta

YouTube playlists curated by Class Central.

Classroom Contents

OpenAI Whisper - Robust Speech Recognition via Large-Scale Weak Supervision

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 Paper overview
  3. 3 Collecting a large scale weakly supervised dataset
  4. 4 Evaluation metric issues WER
  5. 5 Effective robustness
  6. 6 Scaling laws in progress
  7. 7 Decoding is hacky
  8. 8 Code walk-through
  9. 9 Model architecture diagram vs code
  10. 10 Transcription task
  11. 11 Loading the audio, mel spectrograms
  12. 12 Language detection
  13. 13 Transcription task continued
  14. 14 Suppressing token logits
  15. 15 Voice activity detection
  16. 16 Decoding and heuristics
  17. 17 Outro

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.