Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Big Data
📦 Big Data
data pipeline, Spark, Hadoop, large-scale processing
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
97
posts in
7.6
ms
Announcing general availability of
Apache
Spark
4.0 on Amazon EMR
🖥️
Bytecode VMs
Content type:
Blog
aws.amazon.com
·
3d
3 days ago
Actions for Announcing general availability of Apache Spark 4.0 on Amazon EMR
Franz
🗄️
Columnar Storage
flathub.org
·
2d
2 days ago
Actions for Franz
Apache
Spark
: The Complete Deep Dive
🗄️
DB Internals
Content type:
Blog
medium.com
·
19h
19 hours ago
Actions for Apache Spark: The Complete Deep Dive
Deep dive: How Lightning Engine delivers 4.9x faster
Apache
Spark
performance
🖥️
Bytecode VMs
Content type:
Blog
cloud.google.com
·
2d
2 days ago
Actions for Deep dive: How Lightning Engine delivers 4.9x faster Apache Spark performance
How
Kafka
Works in Spring Boot: A Simple Explanation for Backend Developers
🗄️
DB Internals
Content type:
Blog
medium.com
·
4h
4 hours ago
Actions for How Kafka Works in Spring Boot: A Simple Explanation for Backend Developers
Calculating speed estimates with
Apache
Spark
⚡
Performance
Content type:
Blog
mapbox.com
·
4d
4 days ago
Actions for Calculating speed estimates with Apache Spark
What is
distributed
computing
in
big
data?
🌐
Distributed Systems
Content type:
Blog
medium.com
·
1d
1 day ago
Actions for What is distributed computing in big data?
Maestro: Workload-Aware
Cross-Cluster
Scheduling for LLM-Based Multi-Agent Systems
⚙️
Systems Engineering
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Maestro: Workload-Aware Cross-Cluster Scheduling for LLM-Based Multi-Agent Systems
New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
🔍
Query Optimization
Content type:
PDF
markokolarek.com
·
6d
6 days ago
·
Hacker News
Actions for New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
DWH
Spark
MCP: Your Agent Can Read
Spark
History Now
📊
Dataflow
Content type:
Blog
medium.com
·
1d
1 day ago
Actions for DWH Spark MCP: Your Agent Can Read Spark History Now
Lakehouse
Demystified — Part 5: Just enough about Managed Service for
Apache
Airflow
⚙️
Systems Engineering
Content type:
Blog
medium.com
·
19h
19 hours ago
Actions for Lakehouse Demystified — Part 5: Just enough about Managed Service for Apache Airflow
DuckDB Ecosystem Newsletter : June 2026
🗄️
Columnar Storage
Content type:
Blog
motherduck.com
·
21h
21 hours ago
Actions for DuckDB Ecosystem Newsletter : June 2026
Introducing Streamling: Performant and Extensible
Data
Streaming Framework
🖥️
Bytecode VMs
Content type:
News
streamingdata.tech
·
3d
3 days ago
Actions for Introducing Streamling: Performant and Extensible Data Streaming Framework
Minimizing Memory in Parallel Task Graph Scheduling: Focusing on Average Consumption
💻
CPU Architecture
Content type:
Academic
sciencedirect.com
·
2d
2 days ago
Actions for Minimizing Memory in Parallel Task Graph Scheduling: Focusing on Average Consumption
Optimize
Spark
and
Databricks
jobs with
Datadog
🔭
Observability
Content type:
Blog
datadoghq.com
·
3d
3 days ago
Actions for Optimize Spark and Databricks jobs with Datadog
Databricks
Hands Delta Sharing to the Linux Foundation and Levels It Up
🌐
Distributed Systems
Content type:
News
techstrong.ai
·
1d
1 day ago
·
Hacker News
Actions for Databricks Hands Delta Sharing to the Linux Foundation and Levels It Up
make descriptions shorter · vinta/awesome-python@9f156de
🧩
Static Analysis
Content type:
Code
github.com
·
6d
6 days ago
Actions for make descriptions shorter · vinta/awesome-python@9f156de
Access Amazon S3
data
files directly using AWS
Lake
Formation permissions
🗄️
Columnar Storage
Content type:
Blog
aws.amazon.com
·
5h
5 hours ago
Actions for Access Amazon S3 data files directly using AWS Lake Formation permissions
Daily Reading List – June 10, 2026 (#802)
💥
Chaos Engineering
seroter.com
·
1d
1 day ago
Actions for Daily Reading List – June 10, 2026 (#802)
Presentation: Beyond Prompting: Context Engineering and Memory Management for AI Systems at
Scale
📊
Dataflow
Content type:
News
infoq.com
·
2d
2 days ago
Actions for Presentation: Beyond Prompting: Context Engineering and Memory Management for AI Systems at Scale
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help