Google has filed a lawsuit against Texas-based SerpApi. They accuse the company of using hundreds of millions of fraudulent search requests to scrape and steal copyrighted content from Google’s search results on a massive scale. Meanwhile, Spotify grapples with unlawful scraping of its data, which was released on file-sharing sites. The music platform blames a “pirate activist group.”
Google vs. SerpApi in Data Scraping Copyright Battle
Google’s lawsuit, filed in California federal court, alleges SerpApi circumvents Google’s security protections. After breaching these safeguards, the company can access licensed material that appears in services such as Google Shopping, Google Maps, and Knowledge Panels. The Google results API company then sells this stolen content to third parties.
SerpApi Defends Its Data Scraping
The company defended itself, stating that it only provides publicly visible information. They claim the lawsuit aims to suppress competition from innovators building AI and other applications. The company vowed to fight the case vigorously.
This follows a similar October lawsuit by Reddit against SerpApi and other scrapers over alleged content theft for training Perplexity AI’s search engine. Reddit expressed support for Google’s legal action against what it called “bad actors” exploiting internet openness.
Google seeks monetary damages and an injunction to stop the scraping activities.
Spotify vs. Anti-Copyright Pirates
Spotify has been targeted by what it calls “anti-copyright extremists.” These pirates scraped 256 million rows of track metadata and accessed 86 million audio files from its platform. Only the metadata has been publicly released so far.
The pirate activist group distributed the data through Anna’s Archive, a shadow library search engine. The group claims to have created the world’s first open ‘preservation archive’ for music. Its 300TB dataset could enable anyone to build a Spotify clone, though legal action from rightsholders would likely follow quickly.
Spotify confirmed the breach resulted from third parties scraping public metadata through its web API and using illicit methods to bypass DRM protections. The company has disabled the malicious accounts and implemented new safeguards. They are actively monitoring for suspicious activity.
Scraped Data Could Be Used for AI Music Models
The incident raises concerns beyond piracy. The dataset could potentially be used to train unlicensed AI music models, undermining the industry’s licensing efforts with AI companies. If this occurs, it would be similar to how YouTube datasets have been misused.

