Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Data Engineering
⚙️ Data Engineering
ETL, Data Pipelines, Data Warehousing, Data Processing
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
298
posts in
4.5
ms
Azerbaijani Central Bank set to adopt
data
Lakehouse
system in 2026
🤔
Philosophy
trend.az
·
1d
1 day ago
Actions for Azerbaijani Central Bank set to adopt data Lakehouse system in 2026
15 years of Software Center – A Look in the Mirror and over the Front Windshield
🔄
DevOps
Content type:
Blog
metrics.blogg.gu.se
·
17h
17 hours ago
Actions for 15 years of Software Center – A Look in the Mirror and over the Front Windshield
10 MCP servers to connect LLMs with
databases
🗄️
Databases
infoworld.com
·
2d
2 days ago
Actions for 10 MCP servers to connect LLMs with databases
New comment by thaaff in "Ask HN: Who wants to be hired? (June 2026)"
🔄
DevOps
Content type:
Discussion
news.ycombinator.com
·
6d
6 days ago
·
Hacker News
Actions for New comment by thaaff in "Ask HN: Who wants to be hired? (June 2026)"
Real-time
data
replication to your
data
warehouse
, self-serve
🏗️
System Design
artie.com
·
1d
1 day ago
·
Hacker News
,
Hacker News
Actions for Real-time data replication to your data warehouse, self-serve
The Hidden Tax Killing Your ML Team’s Velocity – And the Architecture Decision That Fixes It
🗄️
Databases
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for The Hidden Tax Killing Your ML Team’s Velocity – And the Architecture Decision That Fixes It
DuckDB Storage
Engine
for MariaDB. When the Sea Lion Learns to Quack.
🗄️
Databases
mariadb.org
·
1d
1 day ago
·
Hacker News
Actions for DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.
benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers,
dbt
+
Airflow
recipes.
🔄
DevOps
Content type:
Code
github.com
·
6d
6 days ago
·
Hacker News
Actions for benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.
Microsoft just shared the frontier
data
engineering
secrets
🤖
Agents
mail.bycloud.ai
·
1d
1 day ago
Actions for Microsoft just shared the frontier data engineering secrets
Central Bank strengthens
data
governance for
AI
solutions
🛡️
Anthropic
Content type:
News
en.apa.az
·
1d
1 day ago
Actions for Central Bank strengthens data governance for AI solutions
AI
Agents and the Fight for Customer
Data
🤖
Agents
a16z.simplecast.com
·
5d
5 days ago
Actions for AI Agents and the Fight for Customer Data
Amazon SageMaker Unified Studio Notebooks now support EMR Serverless
☁️
Cloud Computing
aws.amazon.com
·
1d
1 day ago
Actions for Amazon SageMaker Unified Studio Notebooks now support EMR Serverless
When Feature Importance Lies: Target Encoding at the Noise Floor
💬
Prompt Engineering
flyback.ai
·
2d
2 days ago
·
DEV
Actions for When Feature Importance Lies: Target Encoding at the Noise Floor
Gene dependency-informed inference of response to targeted cancer therapies
💬
LLMs
Content type:
Academic
nature.com
·
3d
3 days ago
Actions for Gene dependency-informed inference of response to targeted cancer therapies
Storage Insights
datasets
: Enabling org-wide operational discovery with activity insights
🗄️
Databases
Content type:
Blog
cloud.google.com
·
2d
2 days ago
Actions for Storage Insights datasets: Enabling org-wide operational discovery with activity insights
New comment by aldoakhanov in "Ask HN: Who wants to be hired? (June 2026)"
🔄
DevOps
castlefootyai.com
·
5d
5 days ago
·
Hacker News
Actions for New comment by aldoakhanov in "Ask HN: Who wants to be hired? (June 2026)"
AI
Security Best Practices for Regulated Industries
🤖
Agents
orca.security
·
1d
1 day ago
Actions for AI Security Best Practices for Regulated Industries
The Considerate
Data
Modeler
🗄️
Databases
oranlooney.com
·
6d
6 days ago
·
Hacker News
Actions for The Considerate Data Modeler
Piper
: A Programmable Distributed Training System
📊
Query Optimization
Content type:
Academic
arxiv.org
·
21h
21 hours ago
Actions for Piper: A Programmable Distributed Training System
Get reliable answers to business questions with Bits
Data
Analysis
🏗️
System Design
Content type:
Blog
datadoghq.com
·
2d
2 days ago
Actions for Get reliable answers to business questions with Bits Data Analysis
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help