Scraping Next.js web sites in 2025
trickster.dev·16h·

When looking into targets for web scraping, you may come across pages that carry a lot of data in a JSON-like (but not quite JSON) format, passed to self.__next_f.push() JavaScript function calls. What’s going on here, and how do we parse this stuff? To understand it, we must take a short journey across the technological landscape of the modern web. Note that I am not a frontend developer and don’t aspire to be. For the most part, I extract data from web apps and web sites rather than develop them. The following is what I managed to understand from reading around.
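To make the problem concrete, here is a minimal Python sketch of the first step: pulling the raw arguments of the self.__next_f.push() calls out of a page's HTML. It rests on an assumption not stated above, namely that each call receives an array literal that is itself valid JSON (something like [1,"..."]) and that the not-quite-JSON data lives inside the string element. The function names extract_next_f_chunks and join_payload are hypothetical helpers made up for this illustration, and pages that deviate from that shape will need adjustments.

```python
import json
import re

# Matches the start of each call; the argument is parsed separately below.
PUSH_RE = re.compile(r"self\.__next_f\.push\(")


def extract_next_f_chunks(html: str) -> list:
    """Return the decoded argument of every self.__next_f.push(...) call.

    Assumption: the argument is a JSON-compatible array literal that
    starts immediately after the opening parenthesis.
    """
    decoder = json.JSONDecoder()
    chunks = []
    for match in PUSH_RE.finditer(html):
        try:
            # raw_decode parses one JSON value starting at the given index
            # and ignores whatever follows it (the closing parenthesis etc.).
            value, _end = decoder.raw_decode(html, match.end())
        except json.JSONDecodeError:
            continue  # not JSON-compatible; skip rather than guess
        chunks.append(value)
    return chunks


def join_payload(chunks: list) -> str:
    """Concatenate the string parts of two-element [number, string] chunks."""
    return "".join(
        chunk[1]
        for chunk in chunks
        if isinstance(chunk, list) and len(chunk) == 2 and isinstance(chunk[1], str)
    )
```

Calling join_payload(extract_next_f_chunks(html)) yields one long string holding the JSON-like data, which still has to be parsed further. Using raw_decode instead of a regex for the whole argument avoids breaking on brackets and escaped quotes inside the string.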

So, it’s widely known that React is a very prominent frontend framework for developing web apps in JavaScript. Although one can develop web apps with the very basics - H…
