🔧 Data Engineering - widget101 · Scour

AI Agents and the Fight for Customer Data

🔄ETL Pipelines

a16z.simplecast.com

Run an Apache Airflow DAG with Docker Compose and PostgreSQL

pyimagesearch.com

Designing an ETL Application: Why I Started with a Modular Monolith Before Microservices

🔄ETL Pipelines Blog


Exclusive: MotherDuck adds agentic data ingestion to its cloud analytics service

🏗️Data Platforms

siliconangle.com

Introducing Flights: Agent-Native Ingest in MotherDuck

🏗️Data Platforms Blog

motherduck.com

Senior Data Engineer – Climate Friendly

au.seek.com · Hacker News

Deploying Vector High-Performance Observability Data Pipeline on Ubuntu 24.04

🚀DevOps Reference Tutorial

docs.vultr.com · DEV

Data Agent Kit - I Explored GCS, Visualized Data, and Built a Pipeline Without Leaving My Editor

🏗️Data Platforms Blog


Introducing Streamling: Performant and Extensible Data Streaming Framework

⚡DataFusion News

streamingdata.tech

benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.

💾Databases Code

github.com · Hacker News

Choosing the right workflow orchestration service for your use case: Amazon MWAA and AWS Step Functions

☁️Cloud Computing Blog

aws.amazon.com

Integration Patterns: How To Choose for Your Architecture

🔄ETL Pipelines Blog

Real-time data replication to your data warehouse, self-serve

🏗️Data Platforms

artie.com · Hacker News

Embedding pipelines are the new ETL

🔄ETL Pipelines Blog

infoworld.com

Towards Post-Quantum Secure Pharmacovigilance with ML-KEM and ML-DSA

🔄Data Pipelines Academic

SDLC vs. AIDLC: Why Data Engineering is Pushing the Boundaries of Software Development

🔄ETL Pipelines Blog

Azerbaijani Central Bank set to adopt data Lakehouse system in 2026

🏗️Data Platforms

15 years of Software Center – A Look in the Mirror and over the Front Windshield

🚀DevOps Blog

metrics.blogg.gu.se

The Hidden Tax Killing Your ML Team’s Velocity – And the Architecture Decision That Fixes It

📊Data Lineage Blog

DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.

📊Columnar Engines

mariadb.org · Hacker News
