Both publishers suspect that ChatGPT was trained on their copyrighted works without permission. In one ongoing case, the publishers' legal teams were given access to virtual machines to search OpenAI's training sets for their content.
But on November 14, OpenAI engineers accidentally deleted more than 150 hours' worth of research data collected by the publishers' lawyers and experts. The NY Times lawyers had reportedly spent that time collecting data from the ChatGPT training set. The company later managed to recover most of the data, but only in a format unsuitable for use in legal proceedings.
Source: Ferra
