NYT slams Microsoft for building copyright-infringing supercomputer for OpenAI - Ars Technica
Skip to content
AI
Biz & IT
Cars
Culture
Gaming
Health
Policy
Science
Security
Space
Tech
Forum
Subscribe
Story text
Size
Small<br>Standard<br>Large
Width
Standard<br>Wide
Links
Standard<br>Orange
* Subscribers only
Learn more
Pin to story
Theme
Search
Sign In
Sign in dialog...
Text<br>settings
Story text
Size
Small<br>Standard<br>Large
Width
Standard<br>Wide
Links
Standard<br>Orange
* Subscribers only
Learn more
Minimize to nav
In a heavily redacted court filing Thursday, The New York Times proposed to amend its copyright complaint against OpenAI and Microsoft to clarify a claim and allege that Microsoft actively encouraged OpenAI to steal NYT works by building a bespoke supercomputing system ranked among the most powerful in the world.
NYT’s motion comes after the Supreme Court sided with Cox Communications in a case where Sony tried and failed to claim that Cox was contributing to music piracy as an Internet service provider, which set a new standard for contributory infringement. Moving forward, plaintiffs will have to prove that parties intentionally acted to induce illegal conduct. Recognizing that the legal precedent has changed, the NYT now wants to amend its complaint to align its contributory infringement claim against Microsoft with that new standard.
“Today, we asked the court for permission to file an amended complaint that further strengthens our case, clarifying our claim of contributory infringement against Microsoft based on new law and new evidence uncovered during discovery,” Graham James, an NYT spokesperson, said in a statement provided to Ars.
In addition to clarifying one claim, NYT also agreed to voluntarily dismiss two claims of contributory copyright infringement and trademark dilution against all defendants.
A Microsoft spokesperson told Ars that the company views the amended complaint as “a last-ditch effort by the plaintiff to save its claim from unfavorable precedent set in other recent rulings.”
But in its motion, the NYT argued that neither Microsoft nor OpenAI would be prejudiced by allowing the amended complaint. It’s proper to allow plaintiffs to revise arguments when legal standards change, the NYT argued, and the case schedule would not be set back because “The Times does not seek any additional discovery in support of its amended claims.”
“As we have long alleged, Microsoft actively encouraged OpenAI to steal our copyrighted works,” James said. “Beyond amending that claim and streamlining the case to its most potent arguments, our core claims remain the same from the day we filed this lawsuit—that Microsoft and OpenAI stole millions of The Times’s copyrighted works to compete with our products and illegally enrich themselves.”
NYT targets Microsoft supercomputer
In 2023, the NYT became the first major publisher to sue OpenAI. The prominent newspaper alleged that ChatGPT was illegally trained on its articles, infringed on its copyrights by outputting articles verbatim, and caused market harms by positioning ChatGPT as a substitute for a NYT subscription, as well as reputational harms by falsely attributing claims to NYT reporting. Additionally, ChatGPT outputs summarizing Wirecutter reviews robbed writers of commissions from lost clicks on affiliate links, the NYT alleged.
In the initial complaint, the NYT discussed Microsoft’s supercomputing systems as if they were providing generic cloud computing services. The updated complaint seeks to specify that the supercomputer was tailor-made to help OpenAI infringe and allege that it was built for the explicit purpose of training AI on copyrighted works without permission. And as the NYT alleged, its articles were more heavily weighted by this system, as both firms hoped to train models on the highest-quality journalism possible, so that level of writing could be confidently mimicked in outputs.
By building this “unusually complex” machine, Microsoft not only helped select the works that were infringed but also provided a means to seize copyrighted works without permission, the NYT alleged.
“Microsoft specifically designed it for the purpose of using essentially the whole Internet—curated to disproportionately feature Times Works—to train the most capable LLM in history,” the NYT alleged.
And now it’s allegedly unfairly profiting.
“Microsoft’s deployment of Times-trained LLMs throughout its product line helped boost its market capitalization by a trillion dollars in the past year alone,” the NYT alleged.
Model outputs show market harms, NYT alleged
For the NYT, outputs shared during discovery—including a huge chunk of users’ ChatGPT sessions—remain some of the strongest evidence that OpenAI and Microsoft built tools that allegedly replaced the NYT by producing near-verbatim excerpts of its copyrighted works.
In some cases, users told ChatGPT they were trying to skirt paywalls and were able to see...