Apache Spark

Feeds to Scour
SubscribedAll
Scoured 96 posts in 8.9 ms

Do data quality frameworks have to be so complex?

 🐍Python

Upgrade PySpark from Spark 3.5 to Spark 4.0 with AWS Spark Upgrade Agent

 🔄Data Pipelines  Content type: Blog
aws.amazon.com·
Less-relevant results

A Neurosymbolic Prolog Skill for LLM-Driven Service Placement

 🛠️Data Engineering  Content type: Academic
arxiv.org·

Calculating speed estimates with Apache Spark

 🔄Data Pipelines  Content type: Blog
mapbox.com·

Enhancements to Managed Service for Apache Spark clusters

 🛠️Data Engineering  Content type: Blog
cloud.google.com·

sync with upstream · vinta/awesome-python@eb86241

 🐍Python  Content type: Code
github.com·

Amazon SageMaker Unified Studio Notebooks now support EMR Serverless

 🔄Data Pipelines
aws.amazon.com
·

PyCoder’s Weekly: Issue #738: sleep(), Polars Workflows, Iterators, and More (2026-06-09)

 🐍Python
pycoders.com·

WriterAgent Week 8-9: Adding NumPy and Pandas to LibreOffice

 🐍Python
keithcu.com·

FlashCP: Load-Balanced Communication-Efficient Context Parallelism for LLM Training

 🔄Data Pipelines  Content type: Academic
arxiv.org·

Announcing general availability of Apache Spark 4.0 on Amazon EMR

 🔄Data Pipelines  Content type: Blog
aws.amazon.com·

Best Data Engineering Courses in 2026

 🛠️Data Engineering  Content type: Blog
dataquest.io·

Dynamic Software Updates using CRDTs

 🛠️Data Engineering  Content type: Academic
arxiv.org·

Streaming and Batch Data Architectures with Microsoft Fabric to Azure Databricks

 🔄Data Pipelines

DataAgents: How we turned 9 months of analysis into 10 days

 🛠️Data Engineering  Content type: Blog
medium.com
·

Announcing Spark Connect on Amazon EMR Serverless: Interactive PySpark development, anywhere

 🔄Data Pipelines  Content type: Blog
aws.amazon.com·

SDLC vs. AIDLC: Why Data Engineering is Pushing the Boundaries of Software Development

 🛠️Data Engineering  Content type: Blog
medium.com·

Revisiting "Cooler is Better": ITD-Aware Per-CPU Thermal Optimization for Sustainable Data Center Operation

 🔄Data Pipelines  Content type: Academic
arxiv.org·

fcarvajalbrown/MaskOps: High-speed PII masking as a Polars plugin — powered by Rust

 🛠️Data Engineering  Content type: Code
github.com··DEV, Hacker News

Linearizability and State-Machine Replication: Is It a Match?

 🔄Data Pipelines  Content type: Academic
arxiv.org··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help