Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
big data
📊 big data
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
227
posts in
7.2
ms
Enhancements to Managed Service for
Apache
Spark
clusters
☁️
Cloud Deployment
Content type:
Blog
cloud.google.com
·
6d
6 days ago
Actions for Enhancements to Managed Service for Apache Spark clusters
Calculating speed estimates with
Apache
Spark
🗃️
databases
Content type:
Blog
mapbox.com
·
2d
2 days ago
Actions for Calculating speed estimates with Apache Spark
LakeQA
: An Exploratory QA Benchmark over a Million-Scale
Data
Lake
🗃️
databases
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for LakeQA: An Exploratory QA Benchmark over a Million-Scale Data Lake
Franz
🐹
Go
flathub.org
·
8h
8 hours ago
Actions for Franz
Introducing Streamling: Performant and Extensible
Data
Streaming Framework
🦀
rust
Content type:
News
streamingdata.tech
·
1d
1 day ago
Actions for Introducing Streamling: Performant and Extensible Data Streaming Framework
New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
🐍
python
Content type:
PDF
markokolarek.com
·
4d
4 days ago
·
Hacker News
Actions for New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
Designing an
ETL
Application: Why I Started with a Modular Monolith Before Microservices
☁️
Cloud Deployment
Content type:
Blog
medium.com
·
12h
12 hours ago
Actions for Designing an ETL Application: Why I Started with a Modular Monolith Before Microservices
Announcing general availability of
Apache
Spark
4.0 on Amazon EMR
☁️
Cloud Deployment
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Announcing general availability of Apache Spark 4.0 on Amazon EMR
Exclusive: MotherDuck adds agentic
data
ingestion to its cloud analytics service
⚡
Zig
siliconangle.com
·
8h
8 hours ago
Actions for Exclusive: MotherDuck adds agentic data ingestion to its cloud analytics service
Senior
Data
Engineer – Climate Friendly
☁️
Cloud Deployment
au.seek.com
·
5d
5 days ago
·
Hacker News
,
Hacker News
Actions for Senior Data Engineer – Climate Friendly
Claude Code for Research: Preventing Hallucinations
⚙️
Systems Programming
Content type:
News
Content type:
Blog
homeeconomics.substack.com
·
2d
2 days ago
·
Substack
Actions for Claude Code for Research: Preventing Hallucinations
Introducing Flights: Agent-Native Ingest in MotherDuck
🐍
python
Content type:
Blog
motherduck.com
·
21h
21 hours ago
Actions for Introducing Flights: Agent-Native Ingest in MotherDuck
Automating Real-time
Data
Pipelines
: Deploying Pub/Sub to BigQuery with Dataflow Custom Template…
☁️
Cloud Deployment
Content type:
Blog
medium.com
·
6d
6 days ago
Actions for Automating Real-time Data Pipelines: Deploying Pub/Sub to BigQuery with Dataflow Custom Template…
Minimizing Memory in Parallel Task Graph Scheduling: Focusing on Average Consumption
⚙️
Systems Programming
Content type:
Academic
sciencedirect.com
·
8h
8 hours ago
Actions for Minimizing Memory in Parallel Task Graph Scheduling: Focusing on Average Consumption
DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.
🗃️
databases
mariadb.org
·
1d
1 day ago
·
Hacker News
Actions for DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.
Snowflake
Datastream
:
Kafka-native
streaming in Snowflake
🐹
Go
snowflake.com
·
6d
6 days ago
·
Hacker News
Actions for Snowflake Datastream: Kafka-native streaming in Snowflake
AI Security Best Practices for Regulated Industries
⚙️
Systems Programming
orca.security
·
1d
1 day ago
Actions for AI Security Best Practices for Regulated Industries
Deep dive: How Lightning Engine delivers 4.9x faster
Apache
Spark
performance
☁️
Cloud Deployment
Content type:
Blog
cloud.google.com
·
3h
3 hours ago
Actions for Deep dive: How Lightning Engine delivers 4.9x faster Apache Spark performance
SDLC vs. AIDLC: Why
Data
Engineering is Pushing the Boundaries of Software Development
⚙️
engineering
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for SDLC vs. AIDLC: Why Data Engineering is Pushing the Boundaries of Software Development
Presentation: Beyond Prompting: Context Engineering and Memory Management for AI Systems at Scale
⚙️
Systems Programming
Content type:
News
infoq.com
·
9h
9 hours ago
Actions for Presentation: Beyond Prompting: Context Engineering and Memory Management for AI Systems at Scale
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help