Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Data Engineering
🔧 Data Engineering
data pipelines, ETL, batch processing, data warehousing
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
315
posts in
20.4
ms
CSU Student of Distinction: Anthony Arthur
🔢
Discrete Math
Content type:
Academic
csuohio.edu
·
8h
8 hours ago
Actions for CSU Student of Distinction: Anthony Arthur
Claude Code for Research: Preventing Hallucinations
🌸
Bloom Filters
Content type:
News
Content type:
Blog
homeeconomics.substack.com
·
2d
2 days ago
·
Substack
Actions for Claude Code for Research: Preventing Hallucinations
New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
☁️
Cloud Infrastructure
Content type:
PDF
markokolarek.com
·
4d
4 days ago
·
Hacker News
Actions for New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
Gene dependency-informed inference of response to targeted cancer therapies
🔐
MVCC
Content type:
Academic
nature.com
·
2d
2 days ago
Actions for Gene dependency-informed inference of response to targeted cancer therapies
Embedding
pipelines
are the new
ETL
🔄
Replication
Content type:
Blog
infoworld.com
·
5d
5 days ago
Actions for Embedding pipelines are the new ETL
Azerbaijani Central Bank set to adopt
data
Lakehouse
system in 2026
🗄️
Cassandra
trend.az
·
1d
1 day ago
Actions for Azerbaijani Central Bank set to adopt data Lakehouse system in 2026
Enhancements to Managed Service for
Apache
Spark
clusters
🔧
DevOps
Content type:
Blog
cloud.google.com
·
6d
6 days ago
Actions for Enhancements to Managed Service for Apache Spark clusters
15 years of Software Center – A Look in the Mirror and over the Front Windshield
🔧
DevOps
Content type:
Blog
metrics.blogg.gu.se
·
12h
12 hours ago
Actions for 15 years of Software Center – A Look in the Mirror and over the Front Windshield
DuckDB Storage
Engine
for MariaDB. When the Sea Lion Learns to Quack.
🔄
Concurrency
mariadb.org
·
1d
1 day ago
·
Hacker News
Actions for DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.
The Hidden Tax Killing Your ML Team’s Velocity – And the Architecture Decision That Fixes It
🏗️
System Design
Content type:
Blog
medium.com
·
4d
4 days ago
Actions for The Hidden Tax Killing Your ML Team’s Velocity – And the Architecture Decision That Fixes It
Microsoft just shared the frontier
data
engineering
secrets
🗄️
Cassandra
mail.bycloud.ai
·
1d
1 day ago
Actions for Microsoft just shared the frontier data engineering secrets
What Went Wrong with
Data
Lakes
? A 15-Year Reality Check from the Field
🌐
Distributed Systems
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for What Went Wrong with Data Lakes? A 15-Year Reality Check from the Field
Choosing the right workflow orchestration service for your use case: Amazon MWAA and AWS Step Functions
☁️
Cloud Infrastructure
Content type:
Blog
aws.amazon.com
·
4h
4 hours ago
Actions for Choosing the right workflow orchestration service for your use case: Amazon MWAA and AWS Step Functions
10 MCP servers to connect LLMs with
databases
🗄️
Databases
infoworld.com
·
2d
2 days ago
Actions for 10 MCP servers to connect LLMs with databases
benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers,
dbt
+
Airflow
recipes.
⚙️
Backend Development
Content type:
Code
github.com
·
6d
6 days ago
·
Hacker News
Actions for benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.
Central Bank strengthens
data
governance for
AI
solutions
🧠
Query Planners
Content type:
News
en.apa.az
·
1d
1 day ago
Actions for Central Bank strengthens data governance for AI solutions
Location: Lubbock, TX, USA Remote: Yes (Remote-friendly, US-based) Technologies:...
⚙️
Backend Development
Content type:
Discussion
news.ycombinator.com
·
4h
4 hours ago
·
Hacker News
Actions for Location: Lubbock, TX, USA Remote: Yes (Remote-friendly, US-based) Technologies:...
FOCUS specification eyes
AI
token economics as
AI
billing complexity hits a new frontier
🔭
Observability
siliconangle.com
·
1d
1 day ago
Actions for FOCUS specification eyes AI token economics as AI billing complexity hits a new frontier
Real Estate Lifecycle Analysis with
BigQuery
SQL Graph: Graph Modeling Beyond LLM Record Linkage
🗄️
Databases
Content type:
Blog
medium.com
·
6d
6 days ago
Actions for Real Estate Lifecycle Analysis with BigQuery SQL Graph: Graph Modeling Beyond LLM Record Linkage
Day 10 of 100 Days of ClickHouse®: What Makes ClickHouse SQL Different?
🗄️
Databases
quantrail-data.com
·
14h
14 hours ago
·
DEV
Actions for Day 10 of 100 Days of ClickHouse®: What Makes ClickHouse SQL Different?
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help