Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Data Engineering
🔧 Data Engineering
data pipelines, ETL, data lakes, Apache Spark
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
273
posts in
7.5
ms
Archiving Years of
Dataverse
Audit History
🗄️
Databases
techcommunity.microsoft.com
·
1d
1 day ago
Actions for Archiving Years of Dataverse Audit History
benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers,
dbt
+
Airflow
recipes.
🗄️
Databases
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
Actions for benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.
Announcing
Spark
Connect on Amazon EMR Serverless: Interactive
PySpark
development, anywhere
☁️
Cloud Infrastructure
Content type:
Blog
aws.amazon.com
·
21h
21 hours ago
Actions for Announcing Spark Connect on Amazon EMR Serverless: Interactive PySpark development, anywhere
IPO-bound
Databricks
reportedly eyes $175B valuation after hitting $5.4B revenue run rate — TFN
☁️
Cloud Infrastructure
techfundingnews.com
·
1d
1 day ago
Actions for IPO-bound Databricks reportedly eyes $175B valuation after hitting $5.4B revenue run rate — TFN
Celebal Technologies raises debt funding from BlackSoil
🏗️
Platform Engineering
Content type:
News
theheadandtale.com
·
8h
8 hours ago
Actions for Celebal Technologies raises debt funding from BlackSoil
Redis
Data
Integration in Redis Cloud is now GA in AWS
☁️
Cloud Infrastructure
Content type:
Blog
redis.io
·
5d
5 days ago
Actions for Redis Data Integration in Redis Cloud is now GA in AWS
aws/agent-toolkit-for-aws: Official, AWS-supported MCP servers, skills, and plugins to help
AI
agents build on AWS
☁️
Cloud Infrastructure
Content type:
Code
github.com
·
1d
1 day ago
·
Hacker News
Actions for aws/agent-toolkit-for-aws: Official, AWS-supported MCP servers, skills, and plugins to help AI agents build on AWS
Day 10 of 100 Days of ClickHouse®: What Makes ClickHouse SQL Different?
🗄️
Databases
quantrail-data.com
·
8h
8 hours ago
·
DEV
Actions for Day 10 of 100 Days of ClickHouse®: What Makes ClickHouse SQL Different?
Daily Reading List – June 8, 2026 (#800)
☁️
Cloud Infrastructure
seroter.com
·
1d
1 day ago
Actions for Daily Reading List – June 8, 2026 (#800)
Iceberg Summit 2026: The Adoption Question Is Settled. Now What?
☁️
Cloud Infrastructure
Content type:
Blog
snowflake.com
·
6d
6 days ago
Actions for Iceberg Summit 2026: The Adoption Question Is Settled. Now What?
Piper
: A Programmable Distributed Training System
🌐
Distributed Systems
Content type:
Academic
arxiv.org
·
9h
9 hours ago
Actions for Piper: A Programmable Distributed Training System
AI
Agents and the Fight for Customer
Data
🤖
AI Engineering
a16z.simplecast.com
·
5d
5 days ago
Actions for AI Agents and the Fight for Customer Data
Leaders in Location
📦
Design Patterns
Content type:
Blog
mapbox.com
·
2d
2 days ago
Actions for Leaders in Location
Broadcom VMware security advisory (AV26-548)
📐
System Design
malware.news
·
6d
6 days ago
Actions for Broadcom VMware security advisory (AV26-548)
New comment by Revanthkodati in "Ask HN: Who wants to be hired? (June 2026)"
🤖
AI
drive.google.com
·
1d
1 day ago
·
Hacker News
Actions for New comment by Revanthkodati in "Ask HN: Who wants to be hired? (June 2026)"
New comment by aldoakhanov in "Ask HN: Who wants to be hired? (June 2026)"
🤖
AI
castlefootyai.com
·
5d
5 days ago
·
Hacker News
Actions for New comment by aldoakhanov in "Ask HN: Who wants to be hired? (June 2026)"
Cloudian closes gap between enterprise
AI
ambitions and messy production deployments
🤖
AI Engineering
Content type:
News
blocksandfiles.com
·
1d
1 day ago
Actions for Cloudian closes gap between enterprise AI ambitions and messy production deployments
The Considerate
Data
Modeler
🗄️
Databases
oranlooney.com
·
6d
6 days ago
·
Hacker News
Actions for The Considerate Data Modeler
Build stateful streaming applications with
Apache
Spark
4.0 on Amazon EMR Serverless
☁️
Cloud Infrastructure
Content type:
Blog
aws.amazon.com
·
21h
21 hours ago
Actions for Build stateful streaming applications with Apache Spark 4.0 on Amazon EMR Serverless
Why Snowflake Matters Now More Than Ever
☁️
Cloud Infrastructure
Content type:
News
Content type:
Blog
clouddb.substack.com
·
1d
1 day ago
·
Substack
Actions for Why Snowflake Matters Now More Than Ever
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help