Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Parquet
📦 Parquet
Specific
parquet, columnar storage, apache parquet, file format
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
39
posts in
6.1
ms
What is
Apache
Arrow
Flight? (8 minute read)
🐻
Polars
confessionsofadataguy.com
·
6d
6 days ago
Actions for What is Apache Arrow Flight? (8 minute read)
Deep dive: How Lightning Engine delivers 4.9x faster
Apache
Spark
performance
⚡
Apache Spark
Content type:
Blog
cloud.google.com
·
3h
3 hours ago
Actions for Deep dive: How Lightning Engine delivers 4.9x faster Apache Spark performance
Calculating speed estimates with
Apache
Spark
⚡
Apache Spark
Content type:
Blog
mapbox.com
·
2d
2 days ago
Actions for Calculating speed estimates with Apache Spark
apache/arrow-nanoarrow
: Helpers for
Arrow
C
Data
&
Arrow
C Stream interfaces
🐻
Polars
Content type:
Code
github.com
·
3h
3 hours ago
·
Hacker News
Actions for apache/arrow-nanoarrow: Helpers for Arrow C Data & Arrow C Stream interfaces
🔗 Documentation | Ladybug
🗂️
Data Governance
hames.id.au
·
2d
2 days ago
Actions for 🔗 Documentation | Ladybug
The tiniest logging stack: Fluent Bit,
Parquet
and DuckDB
⚡
Apache Spark
Content type:
Blog
davidguerrero.fr
·
6d
6 days ago
·
Hacker News
,
r/selfhosted
Actions for The tiniest logging stack: Fluent Bit, Parquet and DuckDB
Less-relevant results
Databricks
wants to kill the “email me a
file
” problem for AI agent skills
🏠
Lakehouse
thenewstack.io
·
7h
7 hours ago
Actions for Databricks wants to kill the “email me a file” problem for AI agent skills
Announcing general availability of
Apache
Spark
4.0 on Amazon EMR
⚡
Apache Spark
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Announcing general availability of Apache Spark 4.0 on Amazon EMR
Row vs
Columnar
Storage
for Analytics: Why PostgreSQL Scans Are Slower Than They Should Be
⚡
Apache Spark
Content type:
Blog
tigerdata.com
·
5d
5 days ago
Actions for Row vs Columnar Storage for Analytics: Why PostgreSQL Scans Are Slower Than They Should Be
Yesterday Was a Good Day
🏠
Lakehouse
Content type:
Blog
kottke.org
·
1d
1 day ago
Actions for Yesterday Was a Good Day
Why Postgres TOAST does almost nothing for time-series, and what TimescaleDB does instead (disclosure: my company blog)
⚡
Apache Spark
roszigit.com
·
1d
1 day ago
·
r/PostgreSQL
Actions for Why Postgres TOAST does almost nothing for time-series, and what TimescaleDB does instead (disclosure: my company blog)
New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
🔧
Data Engineering
Content type:
PDF
markokolarek.com
·
4d
4 days ago
·
Hacker News
Actions for New comment by mkolarek in "Ask HN: Who wants to be hired? (June 2026)"
Operationalizing Property-Based Testing for
Data-Intensive
Scalable Computing Systems
⚡
Apache Spark
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Operationalizing Property-Based Testing for Data-Intensive Scalable Computing Systems
Archiving Years of
Dataverse
Audit History
📊
Analytics Engineering
techcommunity.microsoft.com
·
2d
2 days ago
Actions for Archiving Years of Dataverse Audit History
Icechunk Adopted by the National Weather Service: Earthmover Joins Booz Allen on NWS CIRRUS
🏠
Lakehouse
Content type:
Blog
earthmover.io
·
6d
6 days ago
Actions for Icechunk Adopted by the National Weather Service: Earthmover Joins Booz Allen on NWS CIRRUS
Transforming solar and wind maintenance reports with genie and AI agents
🧱
Databricks
Content type:
Blog
databricks.com
·
2d
2 days ago
Actions for Transforming solar and wind maintenance reports with genie and AI agents
Databricks
is Hiring! — Non-Phone — Remote HR Operations Associate — Up to $60/hr.
🧱
Databricks
ratracerebellion.com
·
5d
5 days ago
Actions for Databricks is Hiring! — Non-Phone — Remote HR Operations Associate — Up to $60/hr.
Introducing Streamling: Performant and Extensible
Data
Streaming Framework
🔧
Data Engineering
Content type:
News
streamingdata.tech
·
1d
1 day ago
Actions for Introducing Streamling: Performant and Extensible Data Streaming Framework
DuckDB vs. SQLite
🗂️
Data Governance
motherduck.com
·
4d
4 days ago
·
Hacker News
Actions for DuckDB vs. SQLite
Linux Fundamentals for
Data
Engineering
🧱
Databricks
dev-to-uploads.s3.amazonaws.com
·
2d
2 days ago
·
DEV
Actions for Linux Fundamentals for Data Engineering
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help