TechCrunch Minute: Over 100K YouTube videos have been scraped to train AI for Apple, Nvidia

What do MrBeast, John Oliver and the Wall Street Journal have in common? The transcripts of their YouTube videos have been scraped to train the AI used by companies like Anthropic, Nvidia, Apple and Salesforce.

An investigation from Wired and Proof News found that this dataset, which is called YouTube Subtitles, contains transcripts from over 173,000 YouTube videos on more than 48,000 different channels.

AI scraping of this kind is a problem across the tech industry. Jingna Zhang, an artist and the founder of the app Cara, has tried to protect artists by building a social platform that won’t sell them out. And researchers at the University of Chicago are working on Nightshade, which can “poison” an image to limit what an AI can glean from it.

But is there really any way for creators to protect themselves from being next? More on the TechCrunch Minute.
