Data Pipelines (ETL)

Airbyte

 🔌MCP Protocol  Content type: Code
github.com·

Transit Data Ingestion Platform Dev 101: Lyondle & Colosse

 🧪Testing  Content type: Blog

Real-time CDC from Aurora PostgreSQL to Amazon S3 Tables using Debezium and Firehose

 🗄️Databases  Content type: Blog
aws.amazon.com·

Modern Data Stack Migration — Day 1: Scaling to 8+ Companies with DRY Architecture and Chasing a $2M Discrepancy

 DevOps  Content type: Blog
dev.to··DEV

Real-time data replication to your data warehouse, self-serve

 📡Event-Driven Architecture

ETLs in the Era of AI and Sandboxes

 🧪Testing

COSS Weekly: Supabase achieves $10B valuation, DeepSeek eyes $7B funding round, Martin Scorsese joins Black Forest Labs, and more

 📱Edge AI  Content type: Blog
dev.to··DEV

Building a Production-Inspired CSV to PostgreSQL ETL Pipeline with Python

 🗄️Databases
pub.towardsai.net·

Beyond Dual Writes: Microservice Integration Strategies

 📡Event-Driven Architecture  Content type: Blog
medium.com·

benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.

 🤖Automation  Content type: Code
github.com··Hacker News

Agents can now provision ClickHouse and Postgres on ClickHouse Cloud

 🎬Film Discovery  Content type: Blog
clickhouse.com·

How to Optimize Enterprise Knowledge Graphs for Scalable Digital Product Platforms

 🗂Knowledge Management
freecodecamp.org·

ETL Pipeline: Fetching Real-Time News Data with Python and Postgres

 🗄️Databases  Content type: Blog
dev.to··DEV

EHR Integration & Data Interoperability with Iguana

 🔒Security  Content type: Discussion
interfaceware.com··Hacker News

Connect Metrics to Traces with Exemplars in Azure Monitor

 💡Observability on a Budget

New comment by aldoakhanov in "Ask HN: Who wants to be hired? (June 2026)"

 🤖Automation

From Data Quality Checks to Analytics-Ready Parquet with Python

 🗄️Databases  Content type: Blog
dev.to··DEV

The Considerate Data Modeler

 🗄️Databases

Introducing GitLab Orbit

 🌳Git  Content type: Blog
about.gitlab.com··Hacker News

How to Scrape E-Commerce Sites for AI Agents Using Playwright and LLMs

 🎭Web Automation
shop.example.com··DEV
