Apache Spark

Do data quality frameworks have to be so complex?

 🐍Python

Upgrade PySpark from Spark 3.5 to Spark 4.0 with AWS Spark Upgrade Agent

 🔄Data Pipelines  Content type: Blog
aws.amazon.com·
Less-relevant results

A Neurosymbolic Prolog Skill for LLM-Driven Service Placement

 🛠️Data Engineering  Content type: Academic
arxiv.org·

Calculating speed estimates with Apache Spark

 🔄Data Pipelines  Content type: Blog
mapbox.com·

Enhancements to Managed Service for Apache Spark clusters

 🛠️Data Engineering  Content type: Blog
cloud.google.com·

sync with upstream · vinta/awesome-python@eb86241

 🐍Python  Content type: Code
github.com·

Amazon SageMaker Unified Studio Notebooks now support EMR Serverless

 🔄Data Pipelines
aws.amazon.com·

WriterAgent Week 8-9: Adding NumPy and Pandas to LibreOffice

 🐍Python
keithcu.com·

Dynamic Software Updates using CRDTs

 🛠️Data Engineering  Content type: Academic
arxiv.org·

PyCoder’s Weekly: Issue #738: sleep(), Polars Workflows, Iterators, and More (2026-06-09)

 🐍Python
pycoders.com·

DataAgents: How we turned 9 months of analysis into 10 days

 🛠️Data Engineering  Content type: Blog
medium.com·

Announcing general availability of Apache Spark 4.0 on Amazon EMR

 🔄Data Pipelines  Content type: Blog
aws.amazon.com·

Revisiting "Cooler is Better": ITD-Aware Per-CPU Thermal Optimization for Sustainable Data Center Operation

 🔄Data Pipelines  Content type: Academic
arxiv.org·

SDLC vs. AIDLC: Why Data Engineering is Pushing the Boundaries of Software Development

 🛠️Data Engineering  Content type: Blog
medium.com·

Streaming and Batch Data Architectures with Microsoft Fabric to Azure Databricks

 🔄Data Pipelines

Linearizability and State-Machine Replication: Is It a Match?

 🔄Data Pipelines  Content type: Academic
arxiv.org·Hacker News

Announcing Spark Connect on Amazon EMR Serverless: Interactive PySpark development, anywhere

 🔄Data Pipelines  Content type: Blog
aws.amazon.com·

Awesome List Updated on Jun 04, 2026

 🐍Python
trackawesomelist.com·

Piper: A Programmable Distributed Training System

 🛠️Data Engineering  Content type: Academic
arxiv.org·

Build stateful streaming applications with Apache Spark 4.0 on Amazon EMR Serverless

 🔄Data Pipelines  Content type: Blog
aws.amazon.com·
