Contact Info

  • B2 GIL'S MALL,CLUB AVENUE, OGUI, ENUGU STATE
  • info@scoplink.com
  • Mon-friday, 09am -06pm
  • +2348136152940
Get A Quote
Skip to content
  • Home
  • Tech News
  • Contact Us
Contact Me

Blog Details

admin - Comments (0) - 3 min Read


Like pretty much every other tech company in existence, Adobe has leaned heavily into AI over the past several years. The software firm has launched a number of different AI services since 2023, including Firefly — its AI-powered media-generation suite. Now, however, the company’s full-throated embrace of the technology may have led to trouble, as a new lawsuit claims it used pirated books to train one of its AI models.

A proposed class-action lawsuit filed on behalf of Elizabeth Lyon, an author from Oregon, claims that Adobe used pirated versions of numerous books — including her own — to train the company’s SlimLM program.

Adobe describes SlimLM as a small language model series that can be “optimized for document assistance tasks on mobile devices.” It states that SlimLM was pre-trained on SlimPajama-627B, a “deduplicated, multi-corpora, open-source dataset” released by Cerebras in June of 2023. Lyon, who has written a number of guidebooks for non-fiction writing, says that some of her works were included in a pretraining dataset that Adobe had used.

Lyon’s lawsuit, which was originally reported on by Reuters, says that her writing was included in a processed subset of a manipulated dataset that was the basis of Adobe’s program: “The SlimPajama dataset was created by copying and manipulating the RedPajama dataset (including copying Books3),” the lawsuit says. “Thus, because it is a derivative copy of the RedPajama dataset, SlimPajama contains the Books3 dataset, including the copyrighted works of Plaintiff and the Class members.”

“Books3” — a huge collection of 191,000 books that have been used to train GenAI systems — has been an ongoing source of legal trouble for the tech community. RedPajama has also been cited in a number of litigation cases. In September, a lawsuit against Apple claimed the company had used copyrighted material to train its Apple Intelligence model. The litigation mentioned the dataset and accused the tech company of copying protected works “without consent and without credit or compensation.” In October, a similar lawsuit against Salesforce also claimed the company had used RedPajama for training purposes. 

Unfortunately for the tech industry, such lawsuits have, by now, become somewhat commonplace. AI algorithms are trained on massive datasets and, in some cases, those datasets have allegedly included pirated materials. In September, Anthropic agreed to pay $1.5 billion to a number of authors who had sued it and accused it of using pirated versions of their work to train its chatbot, Claude. The case was considered a potential turning point in the ongoing legal battles over copyrighted material in AI training data, of which there are many.

Share:
Prev Post

VCs discuss why most.

Next Post

Ex-Splunk execs' startup Resolve.

Leave A Comment

Recent Posts

  • Shutdown silver lining? Your IPO review comes after investors buy in
    October 10, 2025
    Shutdown silver lining? Your IPO review comes after...
  • Flipkart’s Super.money quietly partners with troubled Juspay as it expands its reach
    October 10, 2025
    Flipkart’s Super.money quietly partners with troubled Juspay as...
  • While OpenAI races to build AI data centers, Nadella reminds us that Microsoft already has them
    October 10, 2025
    While OpenAI races to build AI data centers,...
  • Andreessen Horowitz denies report of India office, calls it ‘fake news’
    October 10, 2025
    Andreessen Horowitz denies report of India office, calls...
  • Reflection AI raises $2B to be America’s open frontier AI lab, challenging DeepSeek
    October 10, 2025
    Reflection AI raises $2B to be America’s open...

Tags

Elevate Your Business

Want to start a project?

START
YOUR PROJECT

info@scoplink.net
My Location
B2 GIL'S MALL,CLUB AVENUE, OGUI, ENUGU, ENUGU STATE
Phone Number
+2348136152940
Mail Me
support@scoplink.net

© 2025 Scoplink. All rights reserved.