In a statement on the project’s GitHub page, Spear explained that the internet, which provides most of the data for Wordfreq, is now flooded with AI-generated content. While traditional spam can be filtered, he noted, text generated by large language models “masks itself as real language,” making it impossible to identify real trends in people’s text usage.
Spear emphasized that the inability to obtain reliable information regarding the language’s use after 2021 led to the closure of the project.
Source: Ferra

I am a professional journalist and content creator with extensive experience writing for news websites. I currently work as an author at Gadget Onus, where I specialize in covering hot news topics. My written pieces have been published on some of the biggest media outlets around the world, including The Guardian and BBC News.