Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Apache Spark
⚡ Apache Spark
Specific
PySpark, Spark SQL, distributed computing, big data
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
95
posts in
4.4
ms
Do
data
quality frameworks have to be so complex?
🐍
Python
sparkdq-community.github.io
·
6d
6 days ago
·
r/Python
Actions for Do data quality frameworks have to be so complex?
Upgrade
PySpark
from
Spark
3.5 to
Spark
4.0 with AWS
Spark
Upgrade Agent
🔄
Data Pipelines
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Upgrade PySpark from Spark 3.5 to Spark 4.0 with AWS Spark Upgrade Agent
Less-relevant results
A Neurosymbolic Prolog Skill for LLM-Driven Service Placement
🛠️
Data Engineering
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for A Neurosymbolic Prolog Skill for LLM-Driven Service Placement
Calculating speed estimates with
Apache
Spark
🔄
Data Pipelines
Content type:
Blog
mapbox.com
·
2d
2 days ago
Actions for Calculating speed estimates with Apache Spark
Enhancements to Managed Service for
Apache
Spark
clusters
🛠️
Data Engineering
Content type:
Blog
cloud.google.com
·
6d
6 days ago
Actions for Enhancements to Managed Service for Apache Spark clusters
sync with upstream · vinta/awesome-python@eb86241
🐍
Python
Content type:
Code
github.com
·
4d
4 days ago
Actions for sync with upstream · vinta/awesome-python@eb86241
Amazon SageMaker Unified Studio Notebooks now support EMR Serverless
🔄
Data Pipelines
aws.amazon.com
·
1d
1 day ago
Actions for Amazon SageMaker Unified Studio Notebooks now support EMR Serverless
WriterAgent Week 8-9: Adding NumPy and Pandas to LibreOffice
🐍
Python
keithcu.com
·
6d
6 days ago
Actions for WriterAgent Week 8-9: Adding NumPy and Pandas to LibreOffice
Dynamic Software Updates using CRDTs
🛠️
Data Engineering
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Dynamic Software Updates using CRDTs
PyCoder’s Weekly: Issue #738: sleep(), Polars Workflows, Iterators, and More (2026-06-09)
🐍
Python
pycoders.com
·
1d
1 day ago
Actions for PyCoder’s Weekly: Issue #738: sleep(), Polars Workflows, Iterators, and More (2026-06-09)
DataAgents
: How we turned 9 months of analysis into 10 days
🛠️
Data Engineering
Content type:
Blog
medium.com
·
22h
22 hours ago
Actions for DataAgents: How we turned 9 months of analysis into 10 days
Announcing general availability of
Apache
Spark
4.0 on Amazon EMR
🔄
Data Pipelines
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Announcing general availability of Apache Spark 4.0 on Amazon EMR
Revisiting "Cooler is Better": ITD-Aware Per-CPU Thermal Optimization for Sustainable
Data
Center Operation
🔄
Data Pipelines
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Revisiting "Cooler is Better": ITD-Aware Per-CPU Thermal Optimization for Sustainable Data Center Operation
SDLC vs. AIDLC: Why
Data
Engineering is Pushing the Boundaries of Software Development
🛠️
Data Engineering
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for SDLC vs. AIDLC: Why Data Engineering is Pushing the Boundaries of Software Development
Streaming
and Batch
Data
Architectures with Microsoft Fabric to Azure Databricks
🔄
Data Pipelines
techcommunity.microsoft.com
·
1d
1 day ago
Actions for Streaming and Batch Data Architectures with Microsoft Fabric to Azure Databricks
Linearizability and State-Machine Replication: Is It a Match?
🔄
Data Pipelines
Content type:
Academic
arxiv.org
·
5h
5 hours ago
·
Hacker News
Actions for Linearizability and State-Machine Replication: Is It a Match?
Announcing
Spark
Connect on Amazon EMR Serverless: Interactive
PySpark
development, anywhere
🔄
Data Pipelines
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Announcing Spark Connect on Amazon EMR Serverless: Interactive PySpark development, anywhere
Awesome List Updated on Jun 04, 2026
🐍
Python
trackawesomelist.com
·
6d
6 days ago
Actions for Awesome List Updated on Jun 04, 2026
Piper: A Programmable
Distributed
Training System
🛠️
Data Engineering
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Piper: A Programmable Distributed Training System
Build stateful
streaming
applications with
Apache
Spark
4.0 on Amazon EMR Serverless
🔄
Data Pipelines
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Build stateful streaming applications with Apache Spark 4.0 on Amazon EMR Serverless
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help