It’s getting harder to keep up with copyright lawsuits against generative AI, with a new proposed class action filed in the courts last week. This time, the authors are suing NVIDIA over its NeMo AI platform, a language model that allows businesses to build and train their own chatbots, Ars Technica reported. They claim the company trained him on a controversial data set that illegally uses their books without consent.

Authors Abdi Nazemian, Brian Keene, and Stewart O’Nan filed a lawsuit demanding that Nvidia pay damages and destroy all copies of the Books3 dataset used to power NeMo Large Language Models (LLM). They claim the dataset copied a shadow library called Bibliotek consisting of 196,640 pirated books.

“In short, NVIDIA has admitted to training its NeMo Megatron models on a copy of The Pile dataset,” the claim states. “Therefore, NVIDIA necessarily trained their NeMo Megatron models on a copy of Books3, because Books3 is part of The Pile. Certain books written by Plaintiffs are part of Books3—including the Infringed Works—and thus NVIDIA necessarily trained its NeMo Megatron models on one or more copies of the Infringed Works, thereby directly infringing Plaintiffs’ copyrights.

In response, NVIDIA said The Wall Street Journal that “we respect the rights of all content creators and believe we created NeMo in full compliance with copyright law.”

Last year, OpenAI and Microsoft were hit with a copyright lawsuit from non-fiction authors claiming the companies made money off their works but refused to pay them. A similar lawsuit was filed earlier this year. This is in addition to a lawsuit by news organizations such as The interception and A harsh storyand of course, the lawsuit that started it all New York Times.

https://www.engadget.com/now-its-nvidia-being-sued-over-ai-copyright-infringement-083407300.html?src=rss