Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Data Engineering
🔧 Data Engineering
data pipelines, ETL, data lakes, Apache Spark
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
273
posts in
4.4
ms
Best
Data
Engineering
Courses in 2026
☁️
Cloud Infrastructure
Content type:
Blog
dataquest.io
·
6d
6 days ago
Actions for Best Data Engineering Courses in 2026
Calculating speed estimates with
Apache
Spark
🌐
Distributed Systems
Content type:
Blog
mapbox.com
·
2d
2 days ago
Actions for Calculating speed estimates with Apache Spark
Announcing general availability of
Apache
Spark
4.0 on Amazon EMR
🔌
APIs
Content type:
Blog
aws.amazon.com
·
21h
21 hours ago
Actions for Announcing general availability of Apache Spark 4.0 on Amazon EMR
Designing an
ETL
Application: Why I Started with a Modular Monolith Before Microservices
🔗
Microservices
Content type:
Blog
medium.com
·
4h
4 hours ago
Actions for Designing an ETL Application: Why I Started with a Modular Monolith Before Microservices
What Went Wrong with
Data
Lakes
? A 15-Year Reality Check from the Field
🚀
DevOps
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for What Went Wrong with Data Lakes? A 15-Year Reality Check from the Field
SDLC vs. AIDLC: Why
Data
Engineering
is Pushing the Boundaries of Software Development
🤖
AI Engineering
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for SDLC vs. AIDLC: Why Data Engineering is Pushing the Boundaries of Software Development
Exclusive: MotherDuck adds agentic
data
ingestion to its cloud analytics service
☁️
Cloud Infrastructure
siliconangle.com
·
39m
39 minutes ago
Actions for Exclusive: MotherDuck adds agentic data ingestion to its cloud analytics service
Run an
Apache
Airflow
DAG with Docker Compose and PostgreSQL
🗄️
Databases
pyimagesearch.com
·
2d
2 days ago
Actions for Run an Apache Airflow DAG with Docker Compose and PostgreSQL
Introducing Streamling: Performant and Extensible
Data
Streaming Framework
📐
System Design
Content type:
News
streamingdata.tech
·
20h
20 hours ago
Actions for Introducing Streamling: Performant and Extensible Data Streaming Framework
It's official: Fivetran +
dbt
Labs merge to build the
data
foundation for trustworthy
AI
agents (Sponsor)
🤖
AI Engineering
fivetran.com
·
6d
6 days ago
Actions for It's official: Fivetran + dbt Labs merge to build the data foundation for trustworthy AI agents (Sponsor)
Linux Fundamentals for
Data
Engineering
📦
Design Patterns
dev-to-uploads.s3.amazonaws.com
·
1d
1 day ago
·
DEV
Actions for Linux Fundamentals for Data Engineering
Enhancements to Managed Service for
Apache
Spark
clusters
☁️
Cloud Infrastructure
Content type:
Blog
cloud.google.com
·
6d
6 days ago
Actions for Enhancements to Managed Service for Apache Spark clusters
DuckDB Storage
Engine
for MariaDB. When the Sea Lion Learns to Quack.
🗄️
Databases
mariadb.org
·
21h
21 hours ago
·
Hacker News
Actions for DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.
Claude Code for Research: Preventing Hallucinations
🧠
LLMs
Content type:
News
Content type:
Blog
homeeconomics.substack.com
·
2d
2 days ago
·
Substack
Actions for Claude Code for Research: Preventing Hallucinations
Microsoft just shared the frontier
data
engineering
secrets
🤖
AI Engineering
mail.bycloud.ai
·
17h
17 hours ago
Actions for Microsoft just shared the frontier data engineering secrets
Do
data
quality frameworks have to be so complex?
📦
Design Patterns
sparkdq-community.github.io
·
5d
5 days ago
·
r/Python
Actions for Do data quality frameworks have to be so complex?
Streaming and
Batch
Data
Architectures with Microsoft Fabric to Azure Databricks
🔌
APIs
techcommunity.microsoft.com
·
23h
23 hours ago
Actions for Streaming and Batch Data Architectures with Microsoft Fabric to Azure Databricks
Gene dependency-informed inference of response to targeted cancer therapies
🤖
AI Engineering
Content type:
Academic
nature.com
·
2d
2 days ago
Actions for Gene dependency-informed inference of response to targeted cancer therapies
Senior
Data
Engineer
– Climate Friendly
☁️
Cloud Infrastructure
au.seek.com
·
5d
5 days ago
·
Hacker News
,
Hacker News
Actions for Senior Data Engineer – Climate Friendly
Upgrade
PySpark
from
Spark
3.5 to
Spark
4.0 with AWS
Spark
Upgrade Agent
☁️
Cloud Infrastructure
Content type:
Blog
aws.amazon.com
·
21h
21 hours ago
Actions for Upgrade PySpark from Spark 3.5 to Spark 4.0 with AWS Spark Upgrade Agent
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help