Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Data Engineering
🏗️ Data Engineering
ETL Pipelines, Apache Spark, Kafka, Data Warehouses
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
429
posts in
5.7
ms
Senior
Data
Engineer
– Climate Friendly
💾
Database
au.seek.com
·
5d
5 days ago
·
Hacker News
,
Hacker News
Actions for Senior Data Engineer – Climate Friendly
What Went Wrong with
Data
Lakes
? A 15-Year Reality Check from the Field
🌐
Distributed Systems
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for What Went Wrong with Data Lakes? A 15-Year Reality Check from the Field
Redis
Data
Integration in Redis Cloud is now GA in AWS
💾
Database
Content type:
Blog
redis.io
·
6d
6 days ago
Actions for Redis Data Integration in Redis Cloud is now GA in AWS
Real-time
data
replication to your
data
warehouse
, self-serve
🔗
Distributed Systems & Big Data Techniques
artie.com
·
1d
1 day ago
·
Hacker News
,
Hacker News
Actions for Real-time data replication to your data warehouse, self-serve
“Whoever builds the most joyous product wins”: The agent
war
begins
💾
Database
thenewstack.io
·
4d
4 days ago
Actions for “Whoever builds the most joyous product wins”: The agent war begins
Claude Code for Research: Preventing Hallucinations
🗄
Databases
Content type:
News
Content type:
Blog
homeeconomics.substack.com
·
2d
2 days ago
·
Substack
Actions for Claude Code for Research: Preventing Hallucinations
IPO-bound
Databricks
reportedly eyes $175B valuation after hitting $5.4B revenue run rate — TFN
💾
Database
techfundingnews.com
·
1d
1 day ago
Actions for IPO-bound Databricks reportedly eyes $175B valuation after hitting $5.4B revenue run rate — TFN
Real Estate Lifecycle Analysis with
BigQuery
SQL Graph: Graph Modeling Beyond LLM Record Linkage
💾
Database
Content type:
Blog
medium.com
·
6d
6 days ago
Actions for Real Estate Lifecycle Analysis with BigQuery SQL Graph: Graph Modeling Beyond LLM Record Linkage
Gene dependency-informed inference of response to targeted cancer therapies
🤖
AI
Content type:
Academic
nature.com
·
2d
2 days ago
Actions for Gene dependency-informed inference of response to targeted cancer therapies
Embedding
pipelines
are the new
ETL
🧭
Vector Databases
Content type:
Blog
infoworld.com
·
5d
5 days ago
Actions for Embedding pipelines are the new ETL
On theCUBE:
Snowflake
,
Databricks
and more battle for control of
AI
stack
💾
Database
siliconangle.com
·
2d
2 days ago
Actions for On theCUBE: Snowflake, Databricks and more battle for control of AI stack
10 MCP servers to connect LLMs with
databases
💾
Database
infoworld.com
·
2d
2 days ago
Actions for 10 MCP servers to connect LLMs with databases
Snowflake
CEO says there’s a
big
myth at the heart of every org chart
💾
Database
Content type:
News
fortune.com
·
2d
2 days ago
Actions for Snowflake CEO says there’s a big myth at the heart of every org chart
Anthropic’s Daniela Amodei on trust as the
AI
accelerant
💾
Database
aimagazine.com
·
6d
6 days ago
Actions for Anthropic’s Daniela Amodei on trust as the AI accelerant
Integration Patterns: How To Choose for Your Architecture
💾
Database
Content type:
Blog
blog.n8n.io
·
2d
2 days ago
Actions for Integration Patterns: How To Choose for Your Architecture
Unveiling Enhanced Location Intelligence Features: Distance & Duration, and Isochrones
💾
Database
Content type:
Blog
mapbox.com
·
2d
2 days ago
Actions for Unveiling Enhanced Location Intelligence Features: Distance & Duration, and Isochrones
New comment by thaaff in "Ask HN: Who wants to be hired? (June 2026)"
💾
Database
Content type:
Discussion
news.ycombinator.com
·
6d
6 days ago
·
Hacker News
Actions for New comment by thaaff in "Ask HN: Who wants to be hired? (June 2026)"
Azerbaijani Central Bank set to adopt
data
Lakehouse
system in 2026
🕸️
Graph Databases
trend.az
·
1d
1 day ago
Actions for Azerbaijani Central Bank set to adopt data Lakehouse system in 2026
Build stateful streaming applications with
Apache
Spark
4.0 on Amazon EMR Serverless
🔗
Distributed Systems & Big Data Techniques
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Build stateful streaming applications with Apache Spark 4.0 on Amazon EMR Serverless
benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers,
dbt
+
Airflow
recipes.
🗄
Databases
Content type:
Code
github.com
·
6d
6 days ago
·
Hacker News
Actions for benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help