Data Engineering
🔧 Data Engineering
data pipeline, ETL, data lakehouse, apache spark
Scoured 223 posts in 6.7 ms
Connections, Roles, and Warehouses: Getting CoCo Desktop Production-Ready from Day One
☁️ Cloud Computing
Content type: Blog
towardsai.net · 2 days ago
CSU Student of Distinction: Anthony Arthur
👨 Software engineering
Content type: Academic
csuohio.edu · 16 hours ago
Introducing Flights: Agent-Native Ingest in MotherDuck
🐍 Python
Content type: Blog
motherduck.com · 1 day ago
Embedding pipelines are the new ETL
🧠 LLMs
Content type: Blog
infoworld.com · 5 days ago
Spring security advisory (AV26-574)
🏗️ Software Design
cyber.gc.ca · 13 hours ago
Efficient Snowflake Ingestion: Query-Ready Data at Scale
🧠 LLMs
Content type: Blog
snowflake.com · 1 day ago
Redis Data Integration in Redis Cloud is now GA in AWS
☁️ Cloud Computing
Content type: Blog
redis.io · 6 days ago
Announcing general availability of Apache Spark 4.0 on Amazon EMR
☁️ Cloud Computing
Content type: Blog
aws.amazon.com · 1 day ago
Presentation: Beyond Prompting: Context Engineering and Memory Management for AI Systems at Scale
🤖 AI
Content type: News
infoq.com · 15 hours ago
benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.
🐍 Python
Content type: Code
github.com · 6 days ago · Hacker News
15 years of Software Center – A Look in the Mirror and over the Front Windshield
💻 Software Development
Content type: Blog
metrics.blogg.gu.se · 19 hours ago
Microsoft just shared the frontier data engineering secrets
🤖 AI
mail.bycloud.ai · 1 day ago
Databricks wants to kill the “email me a file” problem for AI agent skills
☁️ Cloud Computing
thenewstack.io · 13 hours ago
The Hidden Tax Killing Your ML Team’s Velocity – And the Architecture Decision That Fixes It
⚙️ MLOps
Content type: Blog
medium.com · 5 days ago
New comment by unkownnomad110 in "Ask HN: Who wants to be hired? (June 2026)"
🟨 JavaScript
Content type: Discussion
news.ycombinator.com · 2 days ago · Hacker News
When Feature Importance Lies: Target Encoding at the Noise Floor
🧠 LLMs
flyback.ai · 2 days ago · DEV
Gene dependency-informed inference of response to targeted cancer therapies
🗂️ Data Modeling
Content type: Academic
nature.com · 3 days ago
Celebal Technologies raises debt funding from BlackSoil
🏦 Fintech
Content type: News
theheadandtale.com · 22 hours ago
Optimize Spark and Databricks jobs with Datadog
📡 Observability
Content type: Blog
datadoghq.com · 2 days ago
AI Agents and the Fight for Customer Data
🗂️ Data Modeling
a16z.simplecast.com · 5 days ago