Apache Spark

Feeds to Scour
SubscribedAll
Scoured 28 posts in 7.5 ms

Amazon SageMaker Unified Studio Notebooks now support EMR Serverless

 🔄ETL Pipelines
aws.amazon.com
·

Do data quality frameworks have to be so complex?

 🐍Python

Deep dive: How Lightning Engine delivers 4.9x faster Apache Spark performance

 🔄ETL Pipelines  Content type: Blog
cloud.google.com·

Calculating speed estimates with Apache Spark

 🔄ETL Pipelines  Content type: Blog
mapbox.com·
Less-relevant results

Operationalizing Property-Based Testing for Data-Intensive Scalable Computing Systems

 OLTP Systems  Content type: Academic
arxiv.org·

make descriptions shorter · vinta/awesome-python@9f156de

 📐Column Encoding  Content type: Code
github.com·

Databricks wants to kill the “email me a file” problem for AI agent skills

 🏠Lakehouse
thenewstack.io·

New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"

 🐍Python  Content type: PDF

Linux Fundamentals for Data Engineering

 🔄ETL Pipelines

Optimize Spark and Databricks jobs with Datadog

 🔄ETL Pipelines  Content type: Blog
datadoghq.com·

Announcing general availability of Apache Spark 4.0 on Amazon EMR

 🏞️Data Lakehouse  Content type: Blog
aws.amazon.com·

Enhancements to Managed Service for Apache Spark clusters

 🔄ETL Pipelines  Content type: Blog
cloud.google.com·

aayush4vedi/drift-spark: Spark-native embedding lifecycle- produce, CDC refresh, model-migrate, audit.

 🔌Data Integration  Content type: Code
github.com··Hacker News

Databricks is Hiring! — Non-Phone — Remote HR Operations Associate — Up to $60/hr.

 🏠Lakehouse
ratracerebellion.com·

Upgrade PySpark from Spark 3.5 to Spark 4.0 with AWS Spark Upgrade Agent

 🐍Python  Content type: Blog
aws.amazon.com·

Apache Iceberg™ 1.11 Released: A Smarter REST Catalog, Production-Ready Encryption and the Road to v4

 🏠Lakehouse  Content type: Blog
snowflake.com·

Jupyter Enterprise Gateway - From Notebook to Kubernetes Cluster Admin

 🔧Database Internals  Content type: Blog
elttam.com··r/netsec

IPO-bound Databricks reportedly eyes $175B valuation after hitting $5.4B revenue run rate — TFN

 🔧Data Engineering
techfundingnews.com·

Announcing Spark Connect on Amazon EMR Serverless: Interactive PySpark development, anywhere

 🔌Data Integration  Content type: Blog
aws.amazon.com·

sync with upstream · vinta/awesome-python@eb86241

 🐍Python  Content type: Code
github.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help