Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Parquet
📊 Parquet
Specific
Columnar Storage, Apache Arrow, Data Serialization, DuckDB
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
53
posts in
7.3
ms
Weekly Bookmarks
🗄️
Databases
inkdroid.org
·
4d
4 days ago
Actions for Weekly Bookmarks
Bloom
Filter
Trick Reduces 170
Object-Storage
Reads to One (2.6s → 89ms)
🔑
Key-Value Stores
Content type:
Blog
openobserve.ai
·
2d
2 days ago
·
Hacker News
Actions for Bloom Filter Trick Reduces 170 Object-Storage Reads to One (2.6s → 89ms)
powerset-co/research-data
: Access public Powerset Research
data
🦆
DuckDB
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
Actions for powerset-co/research-data: Access public Powerset Research data
What's new with Postgres at Microsoft, 2026 edition
🗄️
Databases
techcommunity.microsoft.com
·
11h
11 hours ago
Actions for What's new with Postgres at Microsoft, 2026 edition
Claude Code for Research: Preventing Hallucinations
📈
Data Visualization
Content type:
News
Content type:
Blog
homeeconomics.substack.com
·
2d
2 days ago
·
Substack
Actions for Claude Code for Research: Preventing Hallucinations
Announcing general availability of
Apache
Spark 4.0 on Amazon EMR
🧹
Data Cleaning
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Announcing general availability of Apache Spark 4.0 on Amazon EMR
SDLC vs. AIDLC: Why
Data
Engineering is Pushing the Boundaries of Software Development
🧹
Data Cleaning
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for SDLC vs. AIDLC: Why Data Engineering is Pushing the Boundaries of Software Development
Real-time
data
replication to your
data
warehouse, self-serve
🗄️
Databases
artie.com
·
1d
1 day ago
·
Hacker News
,
Hacker News
Actions for Real-time data replication to your data warehouse, self-serve
Awesome List Updated on Jun 10, 2026
🗄️
Databases
trackawesomelist.com
·
1d
1 day ago
Actions for Awesome List Updated on Jun 10, 2026
Data
Flow Control:
Data
Safety Policies for AI Agents
🗄
Database
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Data Flow Control: Data Safety Policies for AI Agents
Why Blue-Green Deployments Fail at Scale in Kubernetes — and What Works Instead
🐳
Docker
cloudnativenow.com
·
22h
22 hours ago
Actions for Why Blue-Green Deployments Fail at Scale in Kubernetes — and What Works Instead
benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL &
DuckDB
; MCP/REST servers, dbt + Airflow recipes.
🧹
Data Cleaning
Content type:
Code
github.com
·
6d
6 days ago
·
Hacker News
Actions for benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.
Apache
Iceberg v4: The Current State, the Proposals, and Why They Matter
🗄️
Databases
books.alexmerced.com
·
1d
1 day ago
·
DEV
Actions for Apache Iceberg v4: The Current State, the Proposals, and Why They Matter
The First Bite — The Birth of v5
🦆
DuckDB
lifelog.my
·
10h
10 hours ago
Actions for The First Bite — The Birth of v5
When MCP Deployment Security Makes You Say AI, AI, AI (Ouch, Ouch, Ouch)!
💻
Terminal Tools
Content type:
Blog
guidepointsecurity.com
·
1d
1 day ago
Actions for When MCP Deployment Security Makes You Say AI, AI, AI (Ouch, Ouch, Ouch)!
Vibe Coding Is Dangerous, Agentic Engineering Isn't ft. Wes McKinney
🧹
Data Cleaning
Content type:
Blog
motherduck.com
·
6d
6 days ago
Actions for Vibe Coding Is Dangerous, Agentic Engineering Isn't ft. Wes McKinney
Build stateful streaming applications with
Apache
Spark 4.0 on Amazon EMR Serverless
💻
Terminal Tools
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Build stateful streaming applications with Apache Spark 4.0 on Amazon EMR Serverless
Show HN: YourMemory, agentic memory is a pruning problem, not a hoarding problem
🗄️
Databases
Content type:
Discussion
yourmemoryai.vercel.app
·
3d
3 days ago
·
Hacker News
Actions for Show HN: YourMemory, agentic memory is a pruning problem, not a hoarding problem
Snowflake thinks it knows what’s really slowing developers down
🧹
Data Cleaning
thenewstack.io
·
6d
6 days ago
Actions for Snowflake thinks it knows what’s really slowing developers down
Streaming and Batch
Data
Architectures with Microsoft Fabric to Azure Databricks
🧹
Data Cleaning
techcommunity.microsoft.com
·
1d
1 day ago
Actions for Streaming and Batch Data Architectures with Microsoft Fabric to Azure Databricks
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help