🏗 data engineering
pyspark, Polars, data bricks, spark, fabric, Azure synapse
PySpark: The Big Brain of Data Processing
🌟 spark
dev.to · 2d · DEV
What Category Theory Teaches Us About DataFrames
🧊 Iceberg Tables
mchav.github.io · 4d · Lobsters, Hacker News, Hacker News, r/programming
How Honeylove boosts product quality and service efficiency with BigQuery
🔍 BigQuery
cloud.google.com · 13h
lynxbase/lynxdb: A lightweight schema-on-read analytics in a single binary
⚡ DataFusion
github.com · 2d · Hacker News
From Pipelines to AI Platforms: How Agentic AI Is Redefining the Role of Data Engineers
🔍 AI Detection
hackernoon.com · 3d
Data Inlining in DuckLake: Unlocking Streaming for Data Lakes
🏠 Data Lakehouse
ducklake.select · 1d · Hacker News
Semlib: Semantic Data Processing
🧮 Apache Calcite
anishathalye.com · 2d · Hacker News
GA4 Data Quality Monitoring with BigQuery SQL
⏱️ Real-time Analytics
paolobietolini.com · 5d
Getting Data from Multiple Sources in Power BI: A Practical Guide to Modern Data Integration for Analysts
📋 CSV Processing
yourcompany.sharepoint.com · 2d · DEV
Show HN: Built Loony for builders who want to spin up data infrastructure fast
🧮 Apache Calcite
loony.dev · 6d · Hacker News
Dux: Distributed DuckDB-native DataFrames for Elixir
🌟 spark
dux.now · 5d · Hacker News
Drizby: An Open Source BI Platform Built on a Semantic Layer (and why I built it)
🧮 Apache Calcite
dev.to · 1d · DEV
From SQL Analytics to Predictive Decision Systems: Operationalizing ML Models in Business Operation
⏱️ Real-time Analytics
hackernoon.com · 2d
grove/pg-trickle: A PostgreSQL extension for streaming tables with incremental view maintenance, powered by differential dataflow in Rust.
🧊 Iceberg Tables
github.com · 3d · Hacker News
How I built a data quality API that runs at the edge in milliseconds
⚡ DataFusion
dev.to · 3d · DEV
How to Optimize Big Data Platform Costs Across the Data Lifecycle
🗄️ Storage Tiering
hackernoon.com · 3d
Show HN: Diffly – A Python package to compare polars dataframes
🐻 Polars
github.com · 3d · Hacker News
100 Spark Interview Questions for Data Engineer
🌟 spark
dev.to · 6d · DEV
Building AI Agents That Close the Loop on Pipeline Failures
🤖 Copilot
hackernoon.com · 4d
DataFuse.Net - Data Integration Framework.
⚡ DataFusion
dev.to · 3d · DEV