Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Data Engineering
🔧 Data Engineering
data pipelines, ETL, Apache Spark, data lakes
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
277
posts in
18.6
ms
Best
Data
Engineering
Courses in 2026
☁️
Cloud Computing
Content type:
Blog
dataquest.io
·
6d
6 days ago
Actions for Best Data Engineering Courses in 2026
Introducing Streamling: Performant and Extensible
Data
Streaming Framework
🔎
Query Engines
Content type:
News
streamingdata.tech
·
16h
16 hours ago
Actions for Introducing Streamling: Performant and Extensible Data Streaming Framework
What Went Wrong with
Data
Lakes
? A 15-Year Reality Check from the Field
⚙️
Distributed Systems
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for What Went Wrong with Data Lakes? A 15-Year Reality Check from the Field
Transit
Data
Ingestion
Platform
Dev 101: Lyondle & Colosse
☁️
Cloud Computing
Content type:
Blog
johanmontorfano.com
·
22h
22 hours ago
·
Lobsters
Actions for Transit Data Ingestion Platform Dev 101: Lyondle & Colosse
Run an
Apache
Airflow
DAG with Docker Compose and PostgreSQL
🗄️
Databases
pyimagesearch.com
·
1d
1 day ago
Actions for Run an Apache Airflow DAG with Docker Compose and PostgreSQL
It's official: Fivetran +
dbt
Labs merge to build the
data
foundation for trustworthy
AI
agents (Sponsor)
🤖
AI
fivetran.com
·
6d
6 days ago
Actions for It's official: Fivetran + dbt Labs merge to build the data foundation for trustworthy AI agents (Sponsor)
Announcing general availability of
Apache
Spark
4.0 on Amazon EMR
☁️
Cloud Computing
Content type:
Blog
aws.amazon.com
·
17h
17 hours ago
Actions for Announcing general availability of Apache Spark 4.0 on Amazon EMR
Linux Fundamentals for
Data
Engineering
⚙️
Systems Programming
dev-to-uploads.s3.amazonaws.com
·
1d
1 day ago
·
DEV
Actions for Linux Fundamentals for Data Engineering
Enhancements to Managed Service for
Apache
Spark
clusters
☁️
Cloud Computing
Content type:
Blog
cloud.google.com
·
6d
6 days ago
Actions for Enhancements to Managed Service for Apache Spark clusters
Spreadsheet native
data
platform
🔎
Query Engines
getarkx.com
·
19h
19 hours ago
·
r/SideProject
Actions for Spreadsheet native data platform
Calculating speed estimates with
Apache
Spark
☁️
Cloud Computing
Content type:
Blog
mapbox.com
·
1d
1 day ago
Actions for Calculating speed estimates with Apache Spark
Senior
Data
Engineer
– Climate Friendly
☁️
Cloud Computing
au.seek.com
·
5d
5 days ago
·
Hacker News
,
Hacker News
Actions for Senior Data Engineer – Climate Friendly
Azerbaijani Central Bank set to adopt
data
Lakehouse
system in 2026
🗄️
Databases
trend.az
·
23h
23 hours ago
Actions for Azerbaijani Central Bank set to adopt data Lakehouse system in 2026
Claude Code for Research: Preventing Hallucinations
🔎
Query Engines
Content type:
News
Content type:
Blog
homeeconomics.substack.com
·
2d
2 days ago
·
Substack
Actions for Claude Code for Research: Preventing Hallucinations
Integration Patterns: How To Choose for Your Architecture
⚙️
Systems Programming
Content type:
Blog
blog.n8n.io
·
1d
1 day ago
Actions for Integration Patterns: How To Choose for Your Architecture
New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
☁️
Cloud Computing
Content type:
PDF
markokolarek.com
·
3d
3 days ago
·
Hacker News
Actions for New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
Microsoft just shared the frontier
data
engineering
secrets
🤖
AI
mail.bycloud.ai
·
13h
13 hours ago
Actions for Microsoft just shared the frontier data engineering secrets
DuckDB Storage
Engine
for MariaDB. When the Sea Lion Learns to Quack.
🔎
Query Engines
mariadb.org
·
17h
17 hours ago
·
Hacker News
Actions for DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.
Gene dependency-informed inference of response to targeted cancer therapies
🐹
Go
Content type:
Academic
nature.com
·
2d
2 days ago
Actions for Gene dependency-informed inference of response to targeted cancer therapies
AI
Agents and the Fight for Customer
Data
🤖
AI
a16z.simplecast.com
·
4d
4 days ago
Actions for AI Agents and the Fight for Customer Data
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help