Data Engineering
🛠️ Data Engineering
data pipeline, ETL, columnar storage, data formats
Scoured 282 posts in 6.8 ms
Apache Iceberg™ 1.11 Released: A Smarter REST Catalog, Production-Ready Encryption and the Road to v4
📦 Parquet
Content type: Blog
snowflake.com · 6 days ago
Apache Iceberg v4: The Current State, the Proposals, and Why They Matter
🌐 Open Source
books.alexmerced.com · 1 day ago · DEV
Claude Code for Research: Preventing Hallucinations
⚡ Query Engines
Content type: News, Blog
homeeconomics.substack.com · 2 days ago · Substack
Databricks wants to kill the “email me a file” problem for AI agent skills
📦 Parquet
thenewstack.io · 15 hours ago
New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
📦 Parquet
Content type: PDF
markokolarek.com · 4 days ago · Hacker News
LakeQA: An Exploratory QA Benchmark over a Million-Scale Data Lake
⚡ Query Engines
Content type: Academic
arxiv.org · 1 day ago
Choosing the right workflow orchestration service for your use case: Amazon MWAA and AWS Step Functions
📊 OLAP
Content type: Blog
aws.amazon.com · 13 hours ago
Azerbaijani Central Bank set to adopt data Lakehouse system in 2026
📊 OLAP
trend.az · 1 day ago
benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.
⚡ Query Engines
Content type: Code
github.com · 6 days ago · Hacker News
Real-time data replication to your data warehouse, self-serve
📊 OLAP
artie.com · 1 day ago · Hacker News
The Hidden Tax Killing Your ML Team’s Velocity – And the Architecture Decision That Fixes It
🏹 Apache Arrow
Content type: Blog
medium.com · 5 days ago
DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.
⚡ Query Engines
mariadb.org · 1 day ago · Hacker News
Redis Data Integration in Redis Cloud is now GA in AWS
⚡ Query Engines
Content type: Blog
redis.io · 6 days ago
Microsoft just shared the frontier data engineering secrets
⚡ Query Engines
mail.bycloud.ai · 1 day ago
15 years of Software Center – A Look in the Mirror and over the Front Windshield
🌐 Open Source
Content type: Blog
metrics.blogg.gu.se · 21 hours ago
The Considerate Data Modeler
📊 OLAP
oranlooney.com · 6 days ago · Hacker News
Central Bank strengthens data governance for AI solutions
📊 OLAP
Content type: News
en.apa.az · 1 day ago
From Legacy Custom Logging to Native Structured Logging in Dataflow
⚡ Query Engines
Content type: Blog
medium.com · 3 hours ago
When Feature Importance Lies: Target Encoding at the Noise Floor
⚡ Query Engines
flyback.ai · 2 days ago · DEV
Gene dependency-informed inference of response to targeted cancer therapies
⚡ Query Engines
Content type: Academic
nature.com · 3 days ago