Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
⚡ Spark
Specific
Apache Spark, Distributed Computing, Big Data, PySpark
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
20
posts in
15.7
ms
Build petabyte-scale synthetic test
data
with Amazon EMR on EC2
🌊
Delta Lake
aws.amazon.com
·
1d
Confluent Current London 2026 - AI requires a
data
re-think
🌊
Delta Lake
diginomica.com
·
11h
Expanded interoperability with Unity
Catalog
Open APIs
🌊
Delta Lake
databricks.com
·
6d
LogRouter: Adaptive Two-Level LLM Routing for Log Question Answering in
Big
Data
Systems
🔧
Feature Engineering
arxiv.org
·
2d
CVE-2026-47237 – Overly Permissive Istio Permissions Allow Kubeflow Authorization Token Stealing
📨
Apache Kafka
insinuator.net
·
18h
The Missing Organizing Principle of Microsoft Fabric: Medallion Architecture Explained :gem:
🔧
Feature Engineering
dattasable.com
·
3d
·
DEV
Bloomberg's Open Source project contributions and utilization
🌟
Open Source
bloomberg.com
·
2d
Why agentic AI systems fail in production without a semantic layer
🔧
Feature Engineering
prometheux.ai
·
6d
·
Hacker News
Mental Models for
Data
Platform
Engineers
(Inspired by Poor Charlie's Almanack)
🔧
Feature Engineering
dagster.io
·
1d
·
Hacker News
Agoda Builds Multimodal Content System to Bridge Images and Reviews in Travel Discovery
🎨
Generative AI
infoq.com
·
1d
Azure Functions
para
developers que nunca usaron serverless
❄️
Snowflake
learn.microsoft.com
·
5d
·
DEV
Databricks
for Good and Virtue Foundation: Partnering to Connect Medical Volunteers to Critical Health Services in 72 Countries
❄️
Snowflake
databricks.com
·
19h
ArroyoSystems/arroyo:
Distributed
stream
processing
engine
in Rust
📨
Apache Kafka
github.com
·
4d
·
DEV
Less-relevant results
Local LLMs are ready for real work
✍️
Prompt Engineering
thelurkreport.beehiiv.com
·
2d
·
r/LocalLLaMA
A systematic approach to benchmarking
SQL
processing
engines
on AWS
🚀
SQL Optimization
aws.amazon.com
·
1d
How often should I update the system? And when?
🌊
Delta Lake
wiki.archlinux.org
·
4d
·
r/archlinux
How to Build Real-Time Fraud Detection using
Spark
Real-Time Mode and Lakebase
🌊
Delta Lake
databricks.com
·
1d
Amazon EMR Serverless is now available in additional AWS Regions
📨
Apache Kafka
aws.amazon.com
·
5d
How Deutsche Börse built a generative AI tool to tackle the large-scale migration of Zeppelin notebooks to
Databricks
🧱
Databricks
databricks.com
·
2d
A Bayesian Additive Regression Tree Model for Learning Conditional Average Treatment Effects in Regression Discontinuity Designs
🔧
Feature Engineering
arxiv.org
·
3d
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help