OpenAI transcribed over a million hours of YouTube videos to train its LLMs, Google engaged in same practice
TechSpot
APRIL 8, 2024
In order to access more reputable English language-based text on the internet in 2021, OpenAI researchers created a speech recognition tool called Whisper, writes The New York Times. It was designed to transcribe audio from YouTube videos, giving the company a trove of data to train its LLMs. Read Entire Article
Let's personalize your content