Publishers sue Microsoft and OpenAI over unauthorized content scraping

by

in

A significant legal challenge has emerged against two of the biggest names in artificial intelligence. A coalition representing nearly 400 newspapers has officially sued Microsoft and OpenAI. The publishers argue that their content was scraped without consent or compensation to train models powering ChatGPT and Copilot.

The core allegations

The lawsuit centers on the data practices used to build large language models. The publishing coalition claims that Microsoft and OpenAI harvested vast amounts of copyrighted material from news websites. According to the filing, this content was used to train their AI systems without permission from the original creators.

Furthermore, the publishers allege they received no compensation for this usage. They contend that the integration of their work into tools like Copilot—which is deeply embedded in Windows 11 and Microsoft 365—constitutes a violation of their intellectual property rights. The suit seeks to establish that consent is required before such data can be utilized.

Impact on Copilot and ChatGPT

This case directly targets the foundation of popular AI assistants. Copilot, available across Windows devices and Microsoft applications, relies on these underlying models to generate responses. If the court rules in favor of the publishers, it could force significant changes in how Microsoft sources training data for its integrated AI features.

Similarly, OpenAI’s ChatGPT faces scrutiny under this lawsuit. The outcome may influence broader industry standards regarding web scraping and copyright compliance. For users, this could eventually lead to changes in how AI assistants handle news-related queries or summarize recent articles.

What this means for you

For everyday Windows users, the immediate impact is likely minimal. Copilot and ChatGPT will continue to function as they do today while legal proceedings unfold. However, this lawsuit highlights the growing tension between tech companies and content creators.

If publishers win, we may see stricter licensing agreements or new fees associated with AI services that rely on news content. This could potentially affect the availability of certain features or the cost of subscriptions in the future. It is a key moment for understanding how AI interacts with established media sources.

Source: Neowin

Over to you: Do you think AI assistants should pay publishers for using their content, or is web scraping necessary for innovation?

You may also like

Microsoft officially confirms Windows 11 26H2, urges IT pros to prepare
Microsoft officially confirms Windows 11 26H2, urges IT pros to prepare

Microsoft Weekly: Windows 11 26H2 goes official, new Surface hardware revealed
Microsoft Weekly: Windows 11 26H2 goes official, new Surface hardware revealed

From Dial-Up Days to Windows 11: Why Microsoft Paint Remains a Hidden Gem
From Dial-Up Days to Windows 11: Why Microsoft Paint Remains a Hidden Gem

The forgotten Microsoft Office feature hiding in plain sight
The forgotten Microsoft Office feature hiding in plain sight