From Mapping Files to Data Plumbing
🔧Data Engineering
Flag this post
An introduction to program synthesis (Part II) - Automatically generating features for machine learning
⚙️Query Compilers
Flag this post
The Data Engineering Agent is now in preview
cloud.google.com·3d
🔧Data Engineering
Flag this post
10 Polars One-Liners for Speeding Up Data Workflows
kdnuggets.com·17h
🐻Polars
Flag this post
DS-STAR: A state-of-the-art versatile data science agent
research.google·13h
🏺Data Archaeology
Flag this post
flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
arxiv.org·3d
🔧Data Engineering
Flag this post
Optimizing Datalog for the GPU
⚡DataFusion
Flag this post
Redpanda 25.3 delivers near-instant disaster recovery, and more
redpanda.com·1d
🏠Data Lakehouse
Flag this post
Announcing Magika 1.0: now faster, smarter, and rebuilt in Rust
blogger.com·11h
📋Tokei
Flag this post
A Step-by-Step Guide to Implementing Microsoft Fabric with a Trusted Partner
🏛️Lakehouse Architecture
Flag this post
Scaling data governance with Amazon DataZone: Covestro success story
aws.amazon.com·3d
🏛️Lakehouse Architecture
Flag this post
Your AI Models Aren’t Slow, but Your Data Pipeline Might Be
thenewstack.io·6d
🌊Stream Processing
Flag this post
Read more
🧭Vector Databases
Flag this post
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·4d
📊Columnar Engines
Flag this post
Supercharging the ML and AI Development Experience at Netflix
netflixtechblog.com·2d
📊Columnar Engines
Flag this post
American Wind Farms
📊Column Stores
Flag this post
Loading...Loading more...