🏗️ Data Engineering
data pipeline, ETL, data platform, data infrastructure
Scoured 369 posts in 6.8 ms
Embedding pipelines are the new ETL
📚 RAG · Blog · infoworld.com · 5 days ago
Gene dependency-informed inference of response to targeted cancer therapies
🔢 Vector Databases · Academic · nature.com · 2 days ago
Claude Code for Research: Preventing Hallucinations
🏞️ Data Lakes · News, Blog · homeeconomics.substack.com · 2 days ago · Substack
Redis Data Integration in Redis Cloud is now GA in AWS
⚙️ MLOps · Blog · redis.io · 6 days ago
Snowflake CEO says there’s a big myth at the heart of every org chart
📈 Developer Productivity · News · fortune.com · 2 days ago
TikTok's 'Not Interested' tool beats swiping, but effect may quickly wear off
🏪 Feature Stores · techxplore.com · 6 hours ago
Automating Real-time Data Pipelines: Deploying Pub/Sub to BigQuery with Dataflow Custom Template…
📊 OLAP · Blog · medium.com · 6 days ago
Integration Patterns: How To Choose for Your Architecture
📊 OLAP · Blog · blog.n8n.io · 2 days ago
On theCUBE: Snowflake, Databricks and more battle for control of AI stack
📊 OLAP · siliconangle.com · 2 days ago
Azerbaijani Central Bank set to adopt data Lakehouse system in 2026
🏞️ Data Lakes · trend.az · 1 day ago
Anthropic’s Daniela Amodei on trust as the AI accelerant
🕵️ AI Agents · aimagazine.com · 6 days ago
15 years of Software Center – A Look in the Mirror and over the Front Windshield
⚙️ MLOps · Blog · metrics.blogg.gu.se · 13 hours ago
Unveiling Enhanced Location Intelligence Features: Distance & Duration, and Isochrones
✍️ Prompt Engineering · Blog · mapbox.com · 2 days ago
Announcing general availability of Apache Spark 4.0 on Amazon EMR
🧊 Apache Iceberg · Blog · aws.amazon.com · 1 day ago
The Hidden Tax Killing Your ML Team’s Velocity – And the Architecture Decision That Fixes It
🏪 Feature Stores · Blog · medium.com · 5 days ago
benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.
🏞️ Data Lakes · Code · github.com · 6 days ago · Hacker News
How the Other Half Counts
🗄️ Database Internals · Blog · thebuild.com · 2 days ago · Hacker News
Piper: A Programmable Distributed Training System
⚡ Query Optimization · Academic · arxiv.org · 17 hours ago
DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.
📊 OLAP · mariadb.org · 1 day ago · Hacker News
“Whoever builds the most joyous product wins”: The agent war begins
🕵️ AI Agents · thenewstack.io · 4 days ago