📊 big data
Scoured 224 posts in 4.5 ms
New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
🐍
python
Content type:
PDF
markokolarek.com
·
4d
4 days ago
·
Hacker News
What Went Wrong with Data Lakes? A 15-Year Reality Check from the Field
🗃️
databases
Content type:
Academic
arxiv.org
·
1d
1 day ago
Deep dive: How Lightning Engine delivers 4.9x faster Apache Spark performance
☁️
Cloud Deployment
Content type:
Blog
cloud.google.com
·
6h
6 hours ago
Franz
🐹
Go
flathub.org
·
10h
10 hours ago
Calculating speed estimates with Apache Spark
🗃️
databases
Content type:
Blog
mapbox.com
·
2d
2 days ago
Deploying Vector High-Performance Observability Data Pipeline on Ubuntu 24.04
🌐
Networking
Content type:
Reference
Content type:
Tutorial
docs.vultr.com
·
2h
2 hours ago
·
DEV
Senior Data Engineer – Climate Friendly
☁️
Cloud Deployment
au.seek.com
·
5d
5 days ago
·
Hacker News
Designing an ETL Application: Why I Started with a Modular Monolith Before Microservices
☁️
Cloud Deployment
Content type:
Blog
medium.com
·
15h
15 hours ago
Introducing Streamling: Performant and Extensible Data Streaming Framework
🦀
rust
Content type:
News
streamingdata.tech
·
1d
1 day ago
Exclusive: MotherDuck adds agentic data ingestion to its cloud analytics service
⚡
Zig
siliconangle.com
·
11h
11 hours ago
Announcing general availability of Apache Spark 4.0 on Amazon EMR
☁️
Cloud Deployment
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Automating Real-time Data Pipelines: Deploying Pub/Sub to BigQuery with Dataflow Custom Template…
☁️
Cloud Deployment
Content type:
Blog
medium.com
·
6d
6 days ago
Minimizing Memory in Parallel Task Graph Scheduling: Focusing on Average Consumption
⚙️
Systems Programming
Content type:
Academic
sciencedirect.com
·
11h
11 hours ago
Introducing Flights: Agent-Native Ingest in MotherDuck
🐍
python
Content type:
Blog
motherduck.com
·
1d
1 day ago
Snowflake Datastream: Kafka-native streaming in Snowflake
🐹
Go
snowflake.com
·
6d
6 days ago
·
Hacker News
Claude Code for Research: Preventing Hallucinations
⚙️
Systems Programming
Content type:
News
Content type:
Blog
homeeconomics.substack.com
·
2d
2 days ago
·
Substack
SDLC vs. AIDLC: Why Data Engineering is Pushing the Boundaries of Software Development
⚙️
engineering
Content type:
Blog
medium.com
·
5d
5 days ago
Presentation: Beyond Prompting: Context Engineering and Memory Management for AI Systems at Scale
⚙️
Systems Programming
Content type:
News
infoq.com
·
12h
12 hours ago
DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.
🗃️
databases
mariadb.org
·
1d
1 day ago
·
Hacker News
Spring security advisory (AV26-574)
🗃️
databases
cyber.gc.ca
·
10h
10 hours ago