Data Engineering

OpenGov and Snowflake build a knowledge graph to unify government data and AI

 🔄Data Pipelines  Content type: Video
siliconangle.com

Microsoft just shared the frontier data engineering secrets

 🔄Data Pipelines
mail.bycloud.ai

Central Bank strengthens data governance for AI solutions

 🔄Data Pipelines  Content type: News
en.apa.az

Redis Data Integration in Redis Cloud is now GA in AWS

 🔄Data Pipelines  Content type: Blog
redis.io

When Feature Importance Lies: Target Encoding at the Noise Floor

 🔄Data Pipelines
flyback.ai · DEV

Celebal Technologies raises debt funding from BlackSoil

 🔄Data Pipelines  Content type: News
theheadandtale.com

benseverndev-oss/goldenmatch: Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.

 🐍Python  Content type: Code
github.com · Hacker News

Azerbaijani Central Bank set to adopt data Lakehouse system in 2026

 🔄Data Pipelines
trend.az

The Hidden Tax Killing Your ML Team’s Velocity – And the Architecture Decision That Fixes It

 🔄Data Pipelines  Content type: Blog
medium.com

IPO-bound Databricks reportedly eyes $175B valuation after hitting $5.4B revenue run rate - TFN

 🔄Data Pipelines
techfundingnews.com

Amazon SageMaker Unified Studio Notebooks now support EMR Serverless

 ⚑Apache Spark
aws.amazon.com

Operationalizing Property-Based Testing for Data-Intensive Scalable Computing Systems

 🔄Data Pipelines  Content type: Academic
arxiv.org

AI Agents and the Fight for Customer Data

 🔄Data Pipelines
a16z.simplecast.com

AI Security Best Practices for Regulated Industries

 🔄Data Pipelines
orca.security

DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.

 🔄Data Pipelines
mariadb.org · Hacker News

The Considerate Data Modeler

 🔄Data Pipelines
oranlooney.com · Hacker News

New comment by thaaff in "Ask HN: Who wants to be hired? (June 2026)"

 🔄Data Pipelines  Content type: Discussion

Storage Insights datasets: Enabling org-wide operational discovery with activity insights

 🔄Data Pipelines  Content type: Blog
cloud.google.com

New comment by aldoakhanov in "Ask HN: Who wants to be hired? (June 2026)"

 🐍Python
castlefootyai.com · Hacker News

Scaling Zero Copy from 1 Trillion to 120 Trillion Rows with File Federation

 🔄Data Pipelines