Publishers Demand Accountability from Common Crawl over Unauthorized Use (www.newsmediaalliance.org)

0 points | 59 days ago

🤖 AI Summary

The News/Media Alliance (NMA), which represents major news publishers, has formally demanded that Common Crawl stop scraping its members' content without authorization and adopt stricter protocols to prevent that content from being used by AI companies. Common Crawl was originally intended as a repository of web data for academic and research purposes, but it has become a key source of training data for commercial AI models, used without publisher consent. This has raised significant concerns about copyright infringement and about harm to publishers' ability to monetize their content. In its letter, the NMA asks Common Crawl to revamp its opt-out registry so that it disallows unauthorized use of scraped content, particularly for AI applications. The letter also calls for clear communication to users about content ownership and the need for explicit permission for commercial use. The demand reflects ongoing tension between content creators and AI developers over intellectual property rights, and a growing push from publishers for accountability as AI models are trained on large amounts of data.
