Millions of Locations for Thousands of Brands
tech.marksblogg.comΒ·4hΒ·
Discuss: Hacker News
πŸ”Paradedb
Preview
Report Post

All The Places has a set of spiders and scrapers that extract location information from thousands of brands’ websites.

Their GitHub repo is made up of 145K lines of Python with commits going back ten years. There have been a total of 162 contributors to date.

Their crawlers run weekly and the collected data is published shortly afterwards.

In this post, I’ll examine their latest release.

My Workstation

I’m using a 5.7 GHz AMD Ryzen 9 9950X CPU. It has 16 cores and 32 threads and 1.2 MB of L1, 16 MB of L2 and 64 MB of L3 cache. It has a liquid cooler attached and is housed in a spacious, full-sized Cooler Master HAF 700 computer case.

The system has 96 GB of DDR5 RAM clocked at 4,800 MT/s and a …

Similar Posts

Loading similar posts...