Scour
data engineering
PySpark, Polars, Databricks, Spark, Fabric, Azure Synapse
Scoured 15952 posts in 102.2 ms
Structured outputs on Amazon Bedrock: Schema-compliant AI responses
aws.amazon.com · 19h
DataFusion
5 Ways Spark 4.1 Moves Data Engineering From Manual Pipelines to Intent-Driven Design
hackernoon.com · 5d
ETL Pipelines
My Data Lake Runs on MongoDB and PostgreSQL and I’m Not Sorry
dev.to · 2h · Discuss: DEV
Lakehouse Architecture
How I squeezed a BERT sentiment analyzer into 1GB RAM on a $5 VPS
mohammedeabdelaziz.github.io · 32m · Discuss: Hacker News
DataFusion
Why RAG Failed Us for SRE and How We Built Dynamic Memory Retrieval Instead
drdroid.io · 1d · Discuss: Hacker News
Database Internals
Data Integration
databricks.com · 3d
Data Engineering
Mastering Data Cleansing in Python: A DevOps Approach to Dirty Data Without Documentation
dev.to · 2d · Discuss: DEV
CSV Processing
Data Agent Ready Database: Designing the Next-Gen Enterprise Data Warehouse
databend.com · 3d · Discuss: Hacker News
Lakehouse Architecture
Improving atlas-scale single-cell annotation models with hierarchical cross-entropy loss
nature.com · 1d
Vector Databases
A Modern Python Stack for Data Projects (uv + ruff + ty + Marimo + Polars)
mameli.dev · 2d · Discuss: r/programming
Tokei
mstrYoda/goraphdb: A graph database implemented in Golang
github.com · 21h · Discuss: r/programming
Graph Databases
I struggled with system design until I learned these 114 concepts
newsletter.systemdesign.one · 1h
Lakehouse Architecture
Project Management Built for Engineering Teams
velocity.quest · 16m · Discuss: Hacker News
Git
Is Your Machine Learning Pipeline as Efficient as it Could Be?
kdnuggets.com · 1d
Columnar Engines
Kubernetes Operator for automated Jupyter Notebook validation in MLOps pipelines
reddit.com · 16h · Discuss: r/kubernetes
Jupyter
The Future of Systems
novlabs.ai · 5h · Discuss: Hacker News
AWS Infrastructure
Data Agents: Levels, State of the Art, and Open Problems
arxiv.org · 2d
Knowledge Graphs
The Portfolio Challenge by Google AI
gbemisolaportfolio-627390562920.us-west1.run.app · 2h · Discuss: DEV
AI
Pydantic Performance: 4 Tips on How to Validate Large Amounts of Data Efficiently
towardsdatascience.com · 1d
Data Validation
The Rise of Spec Driven Development
dbreunig.com · 23h · Discuss: Hacker News
Tokei