Runway’s AI video generator trained on thousands of scraped YouTube videos

3 months ago 40

Runway trained its AI text-to-video generator connected thousands of YouTube videos and pirated films, according to a study from 404 Media. A spreadsheet of grooming data obtained by the 404 Media lists links to YouTube channels belonging to large amusement companies, specified arsenic Netflix, Disney, Nintendo, and Rockstar Games, on with creators similar MKBHD, LinusTechTips, and Sam Kolder.

There are besides links to channels owned by quality outlets similar The Verge, The New Yorker, Reuters, and Wired. “The channels successful that spreadsheet were a company-wide effort to find bully prime videos to physique the exemplary with,” a erstwhile Runway worker tells 404 Media. “This was past utilized arsenic input to a monolithic web crawler which downloaded each the videos from each those channels, utilizing proxies to debar getting blocked by Google.” 

Runway is an AI startup that has received millions successful funding from Google genitor institution Alphabet and Nvidia. It has created awesome tools that let users to marque realistic-looking AI videos, arsenic good arsenic ones that seizure a peculiar animation type. Runway’s latest tool, Gen-3 Alpha, launched successful June and tin “create videos successful immoderate benignant you tin imagine.” Like different AI models, Gen-3 Alpha needs to ingest a breadth of contented erstwhile training.

In summation to YouTube channels, 404 Media besides recovered that Runway’s dataset contains links to piracy sites similar KissCartoons, which lets you ticker anime and different animated contented for free. It’s inactive not wide whether Runway utilized each of the videos successful this spreadsheet to bid its Gen-3 Alpha exemplary — and we whitethorn ne'er find out. In an interrogation with TechCrunch successful June, Runway co-founder Anastasis Germanidis said the institution uses “curated, interior datasets” to bid its models, but helium didn’t supply further detail.

When reached for comment, Google pointed The Verge to a connection from YouTube CEO Neal Mohan, who told Bloomberg successful April that grooming AI connected the platform’s videos is simply a “clear violation” of its policies. The Verge reached retired to Runway with a petition for remark but didn’t instantly perceive back.

Runway isn’t the lone AI institution that has had its AI grooming information linked to YouTube. Earlier this year, OpenAI CTO Mira Murati said she “wasn’t sure” whether the company’s text-to-video generator Sora trained connected YouTube. Meanwhile, a caller study from Proof News and Wired recovered that Anthopic, Apple, Nvidia, and Salesforce trained their AI models connected much than 170,000 YouTube videos.

Read Entire Article