OpenAI Accidentally Deleted Potential

OpenAI Accidentally Deleted Potential Evidence in NY Times Copyright Lawsuit: Artificial Intelligence Trends

Oopsie! Apparently, OpenAI accidentally deleted potential evidence in the New York Times copyright lawsuit! Don’t you hate when that happens! 🤣

As reported by TechCrunch (OpenAI accidentally deleted potential evidence in NY Times copyright lawsuit, written by Kyle Wiggers and available here), lawyers for The New York Times and Daily News, which are suing OpenAI for allegedly scraping their works to train its AI models without permission, said OpenAI engineers accidentally deleted data potentially relevant to the case.

Earlier this fall, OpenAI agreed to provide two virtual machines so that counsel for The Times and Daily News could perform searches for their copyrighted content in its AI training sets. In a letter, attorneys for the publishers say that they and experts they hired have spent over 150 hours since November 1 searching OpenAI’s training data.

Advertisement
Elite Discovery

But on November 14, OpenAI engineers erased all the publishers’ search data stored on one of the virtual machines, according to the aforementioned letter, which was filed in the U.S. District Court for the Southern District of New York late Wednesday. Oopsie! 🤣

OpenAI tried to recover the data — and was mostly successful. However, because the folder structure and file names were ā€œirretrievablyā€ lost, the recovered data ā€œcannot be used to determine where the news plaintiffs’ copied articles were used to build [OpenAI’s] models,ā€ per the letter. šŸ˜

ā€œNews plaintiffs have been forced to recreate their work from scratch using significant person-hours and computer processing time,ā€ counsel for The Times and Daily News wrote. ā€œThe news plaintiffs learned only yesterday that the recovered data is unusable and that an entire week’s worth of its experts’ and lawyers’ work must be re-done, which is why this supplemental letter is being filed today.ā€

The plaintiffs’ counsel made clear that they have no reason to believe the deletion was intentional. But they do say the incident underscores that OpenAI ā€œis in the best position to search its own datasetsā€ for potentially infringing content using its own tools. Yeah, they’ll get right on that. šŸ˜‰

Advertisement
Lexbe

Somehow, I have a feeling that OpenAI may be on the hook for some serious legal fees – if not more.

So, what do you think? Are you surprised that OpenAI accidentally deleted potential evidence in the NY Times lawsuit? Please share any comments you might have or if you’d like to know more about a particular topic.

Image created using GPT-4o’s Image Creator Powered by DALL-E, using the term ā€œrobot IT person slipping on a small banana peel in a computer roomā€.

Disclaimer: The views represented herein are exclusively the views of the authors and speakers themselves, and do not necessarily represent the views held by my employer, my partners or my clients. eDiscovery Today is made available solely for educational purposes to provide general information about general eDiscovery principles and not to provide specific legal advice applicable to any particular circumstance. eDiscovery Today should not be used as a substitute for competent legal advice from a lawyer you have retained and who has agreed to represent you.


Discover more from eDiscovery Today by Doug Austin

Subscribe to get the latest posts sent to your email.

One comment

  1. ā€œHal, I mean ChatGPT, show me all locations where copyright protected NYT data is found in your system.ā€
    ā€œI’m sorry Dave, I mean forensic expert, I’m afraid I can’t do that.ā€

Leave a Reply