Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Data Engineering
🔧 Data Engineering
data pipelines, ETL, ELT, data infrastructure
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
330
posts in
11.2
ms
Streaming and Batch
Data
Architectures
with Microsoft Fabric to Azure Databricks
🧱
Databricks
techcommunity.microsoft.com
·
1d
1 day ago
Actions for Streaming and Batch Data Architectures with Microsoft Fabric to Azure Databricks
Minerva raises $20M and an OpenAI deal to fix marketers' broken first-party
data
📊
Analytics Engineering
ppc.land
·
5h
5 hours ago
Actions for Minerva raises $20M and an OpenAI deal to fix marketers' broken first-party data
Real Estate Lifecycle Analysis with
BigQuery
SQL Graph: Graph Modeling Beyond LLM Record Linkage
📊
Analytics Engineering
Content type:
Blog
medium.com
·
6d
6 days ago
Actions for Real Estate Lifecycle Analysis with BigQuery SQL Graph: Graph Modeling Beyond LLM Record Linkage
Storage Insights
datasets
: Enabling org-wide operational discovery with activity insights
📊
Analytics Engineering
Content type:
Blog
cloud.google.com
·
2d
2 days ago
Actions for Storage Insights datasets: Enabling org-wide operational discovery with activity insights
AI
start-up Plaud to invest $10 million in Singapore as it expands Asia-Pacific operations
👨💻
Senior Dev
straitstimes.com
·
11h
11 hours ago
Actions for AI start-up Plaud to invest $10 million in Singapore as it expands Asia-Pacific operations
Scaling beyond one: How Airbnb evolved its
data
architecture
for a multi-product world
🏠
Lakehouse
Content type:
Blog
medium.com
·
1d
1 day ago
Actions for Scaling beyond one: How Airbnb evolved its data architecture for a multi-product world
The Hidden Tax Killing Your ML Team’s Velocity – And the
Architecture
Decision That Fixes It
📊
Analytics Engineering
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for The Hidden Tax Killing Your ML Team’s Velocity – And the Architecture Decision That Fixes It
benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers,
dbt
+
Airflow
recipes.
🐍
Python
Content type:
Code
github.com
·
6d
6 days ago
·
Hacker News
Actions for benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.
USAFacts’ new campaign is showing voters that
data
rules everything around them
🗂️
Data Governance
fastcompany.com
·
1d
1 day ago
Actions for USAFacts’ new campaign is showing voters that data rules everything around them
“Whoever builds the most joyous product wins”: The agent
war
begins
📊
Analytics Engineering
thenewstack.io
·
4d
4 days ago
Actions for “Whoever builds the most joyous product wins”: The agent war begins
Get reliable answers to business questions with Bits
Data
Analysis
📊
Analytics Engineering
Content type:
Blog
datadoghq.com
·
2d
2 days ago
Actions for Get reliable answers to business questions with Bits Data Analysis
Meta Chooses Mukesh Ambani's Reliance for Its First
AI-Powered
Data
Centre in India
🔵
Google Cloud
Content type:
News
in.mashable.com
·
18h
18 hours ago
Actions for Meta Chooses Mukesh Ambani's Reliance for Its First AI-Powered Data Centre in India
AI
Security Best Practices for Regulated Industries
🗂️
Data Governance
orca.security
·
1d
1 day ago
Actions for AI Security Best Practices for Regulated Industries
Airbyte
🟠
AWS
Content type:
Code
github.com
·
5d
5 days ago
Actions for Airbyte
Optimize
Spark
and
Databricks
jobs with
Datadog
⚡
Apache Spark
Content type:
Blog
datadoghq.com
·
2d
2 days ago
Actions for Optimize Spark and Databricks jobs with Datadog
Gene dependency-informed inference of response to targeted cancer therapies
📊
Analytics Engineering
Content type:
Academic
nature.com
·
3d
3 days ago
Actions for Gene dependency-informed inference of response to targeted cancer therapies
DuckDB Storage
Engine
for MariaDB. When the Sea Lion Learns to Quack.
📦
Parquet
mariadb.org
·
1d
1 day ago
·
Hacker News
Actions for DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.
AI
Personalization Drives Canva Design Intelligence (3 minute read)
📊
Analytics Engineering
siliconangle.com
·
6d
6 days ago
Actions for AI Personalization Drives Canva Design Intelligence (3 minute read)
🇳🇱 Go/Golang job: Senior Backend
Engineer
(Go) | Studio
AI
at Creative Fabrica (Amsterdam, Netherlands)
👨💻
Senior Dev
golangprojects.com
·
9h
9 hours ago
Actions for 🇳🇱 Go/Golang job: Senior Backend Engineer (Go) | Studio AI at Creative Fabrica (Amsterdam, Netherlands)
Amazon SageMaker Unified Studio Notebooks now support EMR Serverless
⚡
Apache Spark
aws.amazon.com
·
1d
1 day ago
Actions for Amazon SageMaker Unified Studio Notebooks now support EMR Serverless
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help