Big Data

data pipeline, Spark, Hadoop, large-scale processing

Announcing general availability of Apache Spark 4.0 on Amazon EMR

 🖥️Bytecode VMs  Content type: Blog
aws.amazon.com·

Franz

 🗄️Columnar Storage
flathub.org·

Apache Spark: The Complete Deep Dive

 🗄️DB Internals  Content type: Blog
medium.com·

Deep dive: How Lightning Engine delivers 4.9x faster Apache Spark performance

 🖥️Bytecode VMs  Content type: Blog
cloud.google.com·

How Kafka Works in Spring Boot: A Simple Explanation for Backend Developers

 🗄️DB Internals  Content type: Blog
medium.com·

Calculating speed estimates with Apache Spark

 Performance  Content type: Blog
mapbox.com·

What is distributed computing in big data?

 🌐Distributed Systems  Content type: Blog
medium.com·

Maestro: Workload-Aware Cross-Cluster Scheduling for LLM-Based Multi-Agent Systems

 ⚙️Systems Engineering  Content type: Academic
arxiv.org·

New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"

 🔍Query Optimization  Content type: PDF

DWH Spark MCP: Your Agent Can Read Spark History Now

 📊Dataflow  Content type: Blog
medium.com·

Lakehouse Demystified — Part 5: Just enough about Managed Service for Apache Airflow

 ⚙️Systems Engineering  Content type: Blog
medium.com·

DuckDB Ecosystem Newsletter : June 2026

 🗄️Columnar Storage  Content type: Blog
motherduck.com·

Introducing Streamling: Performant and Extensible Data Streaming Framework

 🖥️Bytecode VMs  Content type: News
streamingdata.tech·

Minimizing Memory in Parallel Task Graph Scheduling: Focusing on Average Consumption

 💻CPU Architecture  Content type: Academic
sciencedirect.com·

Optimize Spark and Databricks jobs with Datadog

 🔭Observability  Content type: Blog
datadoghq.com·

Databricks Hands Delta Sharing to the Linux Foundation and Levels It Up

 🌐Distributed Systems  Content type: News

make descriptions shorter · vinta/awesome-python@9f156de

 🧩Static Analysis  Content type: Code
github.com·

Access Amazon S3 data files directly using AWS Lake Formation permissions

 🗄️Columnar Storage  Content type: Blog
aws.amazon.com·

Daily Reading List – June 10, 2026 (#802)

 💥Chaos Engineering
seroter.com·

Presentation: Beyond Prompting: Context Engineering and Memory Management for AI Systems at Scale

 📊Dataflow  Content type: News
infoq.com·
