Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Apache Spark
⚡ Apache Spark
Specific
Distributed Computing, DataFrames, Big Data Processing, Analytics
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
28
posts in
7.5
ms
Amazon SageMaker Unified Studio Notebooks now support EMR Serverless
🔄
ETL Pipelines
aws.amazon.com
·
1d
1 day ago
Actions for Amazon SageMaker Unified Studio Notebooks now support EMR Serverless
Do
data
quality frameworks have to be so complex?
🐍
Python
sparkdq-community.github.io
·
6d
6 days ago
·
r/Python
Actions for Do data quality frameworks have to be so complex?
Deep dive: How Lightning Engine delivers 4.9x faster
Apache
Spark
performance
🔄
ETL Pipelines
Content type:
Blog
cloud.google.com
·
5h
5 hours ago
Actions for Deep dive: How Lightning Engine delivers 4.9x faster Apache Spark performance
Calculating speed estimates with
Apache
Spark
🔄
ETL Pipelines
Content type:
Blog
mapbox.com
·
2d
2 days ago
Actions for Calculating speed estimates with Apache Spark
Less-relevant results
Operationalizing Property-Based Testing for
Data-Intensive
Scalable
Computing
Systems
⚡
OLTP Systems
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for Operationalizing Property-Based Testing for Data-Intensive Scalable Computing Systems
make descriptions shorter · vinta/awesome-python@9f156de
📐
Column Encoding
Content type:
Code
github.com
·
4d
4 days ago
Actions for make descriptions shorter · vinta/awesome-python@9f156de
Databricks
wants to kill the “email me a file” problem for AI agent skills
🏠
Lakehouse
thenewstack.io
·
9h
9 hours ago
Actions for Databricks wants to kill the “email me a file” problem for AI agent skills
New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
🐍
Python
Content type:
PDF
markokolarek.com
·
4d
4 days ago
·
Hacker News
Actions for New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
Linux Fundamentals for
Data
Engineering
🔄
ETL Pipelines
dev-to-uploads.s3.amazonaws.com
·
2d
2 days ago
·
DEV
Actions for Linux Fundamentals for Data Engineering
Optimize
Spark
and
Databricks
jobs with
Datadog
🔄
ETL Pipelines
Content type:
Blog
datadoghq.com
·
1d
1 day ago
Actions for Optimize Spark and Databricks jobs with Datadog
Announcing general availability of
Apache
Spark
4.0 on Amazon EMR
🏞️
Data Lakehouse
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Announcing general availability of Apache Spark 4.0 on Amazon EMR
Enhancements to Managed Service for
Apache
Spark
clusters
🔄
ETL Pipelines
Content type:
Blog
cloud.google.com
·
6d
6 days ago
Actions for Enhancements to Managed Service for Apache Spark clusters
aayush4vedi/drift-spark
:
Spark-native
embedding lifecycle- produce, CDC refresh, model-migrate, audit.
🔌
Data Integration
Content type:
Code
github.com
·
11h
11 hours ago
·
Hacker News
Actions for aayush4vedi/drift-spark: Spark-native embedding lifecycle- produce, CDC refresh, model-migrate, audit.
Databricks
is Hiring! — Non-Phone — Remote HR Operations Associate — Up to $60/hr.
🏠
Lakehouse
ratracerebellion.com
·
5d
5 days ago
Actions for Databricks is Hiring! — Non-Phone — Remote HR Operations Associate — Up to $60/hr.
Upgrade
PySpark
from
Spark
3.5 to
Spark
4.0 with AWS
Spark
Upgrade Agent
🐍
Python
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Upgrade PySpark from Spark 3.5 to Spark 4.0 with AWS Spark Upgrade Agent
Apache
Iceberg™ 1.11 Released: A Smarter REST Catalog, Production-Ready Encryption and the Road to v4
🏠
Lakehouse
Content type:
Blog
snowflake.com
·
6d
6 days ago
Actions for Apache Iceberg™ 1.11 Released: A Smarter REST Catalog, Production-Ready Encryption and the Road to v4
Jupyter Enterprise Gateway - From Notebook to Kubernetes
Cluster
Admin
🔧
Database Internals
Content type:
Blog
elttam.com
·
16h
16 hours ago
·
r/netsec
Actions for Jupyter Enterprise Gateway - From Notebook to Kubernetes Cluster Admin
IPO-bound
Databricks
reportedly eyes $175B valuation after hitting $5.4B revenue run rate — TFN
🔧
Data Engineering
techfundingnews.com
·
1d
1 day ago
Actions for IPO-bound Databricks reportedly eyes $175B valuation after hitting $5.4B revenue run rate — TFN
Announcing
Spark
Connect on Amazon EMR Serverless: Interactive
PySpark
development, anywhere
🔌
Data Integration
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Announcing Spark Connect on Amazon EMR Serverless: Interactive PySpark development, anywhere
sync with upstream · vinta/awesome-python@eb86241
🐍
Python
Content type:
Code
github.com
·
4d
4 days ago
Actions for sync with upstream · vinta/awesome-python@eb86241
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help