Around 2015, I was leading an extraordinary architecture team in an effort to decompose Pluralsight’s monolithic application into distributed services. The system exhibited classic tight coupling symptoms: changes cascaded unpredictably into other components, deployment required coordinated release windows across multiple teams, and specific engineers became single points of failure for critical subsystems.
During this work to reduce coupling across the organization, I had a revelation: the problems we were addressing in the application architecture seemed eerily similar to the ones plaguing our data infrastructure. In some ways, almost identical.
Coupling Creates Organizational Paralysis
Tight coupling in software systems manifests in predictable patterns. Components depend heavily on each other’s internal implementation details. A database schema modification requires simultaneous updates across multiple services. Developers experience deep “change anxiety,” a hard-learned reluctance to modify code because past changes produced unexpected downstream failures.
This creates measurable organizational effects. Communication overhead increases rapidly with system complexity (a kind of human Amdahl’s law). Teams spend disproportionate time coordinating rather than delivering features. Technical debt accumulates because refactoring risk exceeds tolerance thresholds. Tribal knowledge concentrates in specific individuals who become bottlenecks. These dynamics have been discussed and studied in works like The Mythical Man-Month.
Eventually, organizations structure themselves around the coupling rather than fix it. They create rigid management processes, coordination roles, and approval workflows that treat symptoms while the underlying architectural disease progresses.
Data Infrastructure Has the Same Problems, Amplified
Data systems manifest the same coupling patterns, with additional complexity dimensions that make the problem even more intractable.
Schema coupling occurs when producers and consumers depend on shared data structures. Modifying a schema risks breaking unknown downstream consumers, like dashboards in marketing, machine learning models in production, and billing automation in finance. The blast radius is typically unknown until failures occur.
Temporal coupling compounds the problem. Unlike stateless application code that can be redeployed with corrections, data has state and history. Three months of corrupted pipeline output means three months of compromised analytics, flawed model training, and potentially incorrect business decisions. Remediation through backfilling is often expensive, technically fraught, or outright impossible. At Pluralsight, I worked with a team whose single Python pipeline script ran to twenty thousand lines: patches, fixes, normalizations, and remediations, all baked into a script that had grown hair over the years, and absolutely no one dared touch it.
Semantic coupling is particularly insidious. Data contracts are almost always implicit. Teams make assumptions about field definitions, bake in their understanding that “revenue” excludes returns, that “active user” means login within 30 days, all usually without documentation. When producers change these semantics, consumers break silently. By the time discrepancies surface in executive dashboards, tracing the root cause requires digging into opaque systems and archaeological analysis through multiple transformation layers.
Ownership ambiguity creates coordination nightmares. In microservices, teams clearly own services. In data infrastructure, the “customer table” might be produced by engineering, transformed by five different pipelines, consumed by 20 teams, and definitively owned by no one.
Data Infrastructure Pain Explained by Coupling
This coupling explains several puzzling phenomena in data infrastructure. The first is why data-driven transformation initiatives consistently underdeliver.
When I first started to feel this splinter in my mind, I was at Pluralsight and I would have said they were doing everything right. Data was strategic. They had invested in a top-notch data team, hired data scientists, and deployed modern tooling, yet no one was happy. Analysts spent 80% of their time on data wrangling rather than insight generation. It was very difficult to build products that used data. There were constant disagreements between functional units, like marketing and sales, about base data.
This underlying infrastructure coupling constrains velocity regardless of intent, team size, or tooling sophistication. It is also a clue to why data quality issues prove intractable: bad data propagates through tightly coupled pipelines before detection, and the engineer who introduced erroneous data three systems upstream remains unaware they’ve broken downstream consumers.
We start to see why data team size scales poorly with value delivered. Companies double data headcount but see sublinear value increases. Most incremental effort goes toward managing dependencies and coordinating changes. Coupling friction scales superlinearly with organizational complexity.
We also see why legacy systems persist indefinitely. That 2012-vintage database feeds pipelines feeding other pipelines. Complete dependency graphs don’t exist. Decommissioning risk exceeds tolerance. Systems accumulate indefinitely, accruing technical debt and operational costs.
Data Mesh: Right Problem, Incomplete Solution
Zhamak Dehghani brilliantly identified the core issues with coupling and monolithic centralization. She argued for a new model, data mesh, that applies microservices principles to data infrastructure: domain-owned data products with federated governance and clear interfaces. The concept was directionally right, but practical implementation has struggled.
Data mesh has come to be understood primarily as an organizational model rather than a technical architecture. It prescribes ownership structures but doesn’t address fundamental technical challenges: schema evolution, versioning, backward compatibility, or the state management problem inherent to data.
More fundamentally, a core organizing principle of data mesh, producer-owned data products, recreates the same problematic coupling it aims to solve. When data producers create data products aligned with consumer use cases, they must understand and support those use cases. Data product teams must understand diverse end use cases, support multiple formats and query patterns, and continuously iterate as business needs evolve. This creates bottleneck resources that scale poorly with organizational needs while simultaneously constraining emergent and experimental use cases. The coupling shifts from shared infrastructure to shared teams, but the coordination overhead and velocity constraints persist.
Network effects prevent partial adoption. Three teams implementing data mesh while 17 continue legacy patterns creates operational complexity without solving coupling. Migration requires organization-wide coordination, which the coupling itself prevents.
Data mesh also doesn’t address the legacy state problem. Years of accumulated coupled data with implicit contracts can’t simply be rearchitected. Organizations face a choice between expensive, risky migration and maintaining dual systems indefinitely.
A Practical Solution to Architectural Coupling
At Matterbeam, we’ve built a system that addresses these coupling problems through specific architectural principles. This isn’t theoretical: We’re observing these transformations at customer deployments.
The architecture consists of several key components:
Immutable event storage. All data from source systems is captured and stored as immutable events in a log structure. The log stores only novelty (new or changed data), keeping storage requirements manageable even with complete history retention; a small sketch of this event model follows the component list.
Common record format with inferred schemas. Events are normalized into a common record format. Schema inference runs continuously, detecting structural changes and evolution patterns without requiring producers to explicitly define schemas.
Consumer-owned derived views. Consumers apply composable transformations to event streams, creating materialized views tailored to their specific requirements. These derived datasets are also stored immutably in the common format.
Replay and time travel capabilities. The immutable event log enables replaying data through transformations at any point in time. Consumers can rebuild views with corrected logic or analyze historical states.
Translation to target systems. A translation layer materializes data into appropriate target systems for use or query: relational databases, data warehouses, object stores, each with query characteristics matching consumer requirements.
Collection from heterogeneous sources. Collectors interface with existing systems through change data capture, database polling, or API querying, normalizing diverse sources into the event stream without requiring source system modifications.
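To make the event model above concrete, here is a minimal Python sketch, assuming a content-hash check to decide what counts as novelty. All names here (SourceEvent, EventLog, append_if_novel) are illustrative inventions for this post, not Matterbeam’s actual format or API.

```python
# Minimal sketch of an immutable event log that stores only novelty.
# All names (SourceEvent, EventLog, append_if_novel) are illustrative, not Matterbeam's API.
import hashlib
import json
import time
from dataclasses import dataclass


@dataclass(frozen=True)
class SourceEvent:
    """One immutable record in the common format."""
    source: str         # e.g. "crm.customers"
    key: str            # primary key of the source record
    payload: dict       # the record's fields as observed at capture time
    captured_at: float  # capture timestamp, seconds since epoch

    def content_hash(self) -> str:
        # Hash only the payload so an unchanged record is recognized as "nothing new".
        return hashlib.sha256(json.dumps(self.payload, sort_keys=True).encode()).hexdigest()


class EventLog:
    """Append-only log: events are never updated or deleted, only appended."""

    def __init__(self) -> None:
        self._events: list[SourceEvent] = []
        self._last_hash: dict[tuple[str, str], str] = {}  # (source, key) -> last seen content hash

    def append_if_novel(self, event: SourceEvent) -> bool:
        """Append the event only if the record is new or has changed since last capture."""
        ident = (event.source, event.key)
        digest = event.content_hash()
        if self._last_hash.get(ident) == digest:
            return False  # not novel: storage grows with change, not with polling frequency
        self._events.append(event)
        self._last_hash[ident] = digest
        return True

    def events(self, since: float = 0.0) -> list[SourceEvent]:
        """Read the history (or a suffix of it) for replay."""
        return [e for e in self._events if e.captured_at >= since]


log = EventLog()
log.append_if_novel(SourceEvent("crm.customers", "42", {"name": "Ada", "plan": "pro"}, time.time()))
log.append_if_novel(SourceEvent("crm.customers", "42", {"name": "Ada", "plan": "pro"}, time.time()))   # ignored
log.append_if_novel(SourceEvent("crm.customers", "42", {"name": "Ada", "plan": "team"}, time.time()))  # appended
print(len(log.events()))  # 2
```

Because the log only ever grows, any consumer can later read the full history and rebuild state from it, which is what makes the replay and time travel capabilities described above possible.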
How This Decouples Data Infrastructure
This architecture addresses specific coupling dimensions:
Eliminates temporal coupling. Consumers don’t need to be available when producers emit data. They materialize views on their own schedule, catching up to the current state when ready.
Reduces schema coupling. Producers emit events in their natural structure. Consumers apply transformations to reshape data for their needs. Schema evolution becomes a consumer concern rather than requiring producer-consumer coordination.
Clarifies ownership. Consumers own their derived views. Data quality issues in a consumer’s view are that consumer’s responsibility to debug and correct.
Makes failures explicit. Continuous schema and anomaly inference detect changes. Rather than silent failures propagating downstream, processes pause and require explicit acknowledgment of changes before resuming.
Enables mistake correction. Incorrect transformation logic can be fixed and data replayed through corrected transformations. Mistakes don’t permanently corrupt state.
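As a toy illustration of consumer-owned views and replay (the events and the revenue example are hypothetical, not drawn from any real deployment): a derived view is just a function over the immutable log, so a consumer who discovers their logic was wrong fixes the function and replays, rather than coordinating a backfill.

```python
# Sketch of consumer-owned derived views plus replay; the events and "revenue" logic are made up.
from collections import defaultdict

# A replayable slice of the immutable event log, oldest first.
events = [
    {"source": "billing.orders", "key": "o1", "payload": {"customer": "42", "amount": 90, "kind": "sale"}},
    {"source": "billing.orders", "key": "o2", "payload": {"customer": "42", "amount": 40, "kind": "refund"}},
    {"source": "billing.orders", "key": "o3", "payload": {"customer": "7", "amount": 120, "kind": "sale"}},
]


def revenue_v1(log):
    """First attempt at a revenue-per-customer view: naively sums every order."""
    view = defaultdict(int)
    for event in log:
        view[event["payload"]["customer"]] += event["payload"]["amount"]
    return dict(view)


def revenue_v2(log):
    """Corrected logic, owned by the consumer: refunds are subtracted."""
    view = defaultdict(int)
    for event in log:
        sign = -1 if event["payload"]["kind"] == "refund" else 1
        view[event["payload"]["customer"]] += sign * event["payload"]["amount"]
    return dict(view)


# The view is just a function of the log, so fixing a mistake is a replay, not a backfill project.
print(revenue_v1(events))  # {'42': 130, '7': 120}  <- wrong: counts the refund as revenue
print(revenue_v2(events))  # {'42': 50, '7': 120}   <- same history, corrected transformation
```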
Addressing Legitimate Concerns
Several objections to this architecture warrant examination.
Storage costs. Storing every event immutably appears expensive. This is the most common objection I hear to this model. There are three responses. First, most organizations don’t operate at true big data scale; the distribution is heavily skewed, and most companies work with gigabytes or terabytes, not exabytes. Second, event logs store only novelty, which is orders of magnitude smaller than keeping multiple full state snapshots. Third, we’re in a new world that people haven’t fully internalized: modern cloud storage pricing has changed the economics fundamentally. Amazon Glacier stores a petabyte for approximately $1,000 a month. Smart compression and intelligent lifecycle management make complete history retention economically viable.
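As a rough back-of-the-envelope check on that claim, assuming archival object storage at about $0.001 per GB-month (in the ballpark of published Glacier Deep Archive pricing; actual rates vary by region and tier):

```python
# Back-of-the-envelope archival storage cost; the price per GB-month is an assumption, not a quote.
PRICE_PER_GB_MONTH = 0.001  # ~Glacier Deep Archive ballpark; standard S3 tiers cost roughly 20x more
GB_PER_PB = 1_000_000


def monthly_storage_cost_usd(petabytes: float, price_per_gb_month: float = PRICE_PER_GB_MONTH) -> float:
    return petabytes * GB_PER_PB * price_per_gb_month


print(monthly_storage_cost_usd(1.0))    # ~1000.0 USD/month for a full petabyte
print(monthly_storage_cost_usd(0.005))  # ~5.0 USD/month for 5 TB, closer to a typical event history
```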
Eventual consistency. Most analytics use cases already operate on delayed batch data: daily, hourly at best. Continuous update within seconds meets the practical definition of “real-time” for most data uses like business intelligence and analytics. Operational use cases requiring strong consistency can materialize views with appropriate freshness guarantees. The architecture supports both patterns.
Stream processing complexity. With common record formats and composable transformations, the streaming layer stays lightweight and easy to understand. Consumers primarily interact with materialized views in familiar target systems using standard query languages; the streaming complexity is abstracted by the platform. Analysts work with SQL against databases or warehouses, unaware that the underlying data arrived via event streams.
Performance. The core event logs are not intended for direct query; they are optimized for rapid materialization. Materialization happens continuously into appropriate read-optimized systems, and consumers query the materialized views in databases or warehouses. Creating a new view materialization might take hours, but subsequent updates maintain freshness within seconds. Query performance matches the characteristics of the target system.
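A toy version of that split between the event log and the query path, with an in-memory SQLite database standing in for whatever read-optimized target a consumer actually chooses (the table and field names are made up for illustration):

```python
# Toy materialization: fold events into a read-optimized table that analysts query with plain SQL.
import sqlite3

# A replayable slice of the event log, oldest first; later events supersede earlier ones per key.
events = [
    {"key": "42", "payload": {"name": "Ada", "plan": "pro"}},
    {"key": "7", "payload": {"name": "Grace", "plan": "team"}},
    {"key": "42", "payload": {"name": "Ada", "plan": "team"}},
]

warehouse = sqlite3.connect(":memory:")  # stands in for the consumer's database or warehouse
warehouse.execute("CREATE TABLE customers_current (id TEXT PRIMARY KEY, name TEXT, plan TEXT)")

# "Continuous materialization" reduced to a loop: upsert the latest state for each key.
for event in events:
    warehouse.execute(
        "INSERT INTO customers_current VALUES (?, ?, ?) "
        "ON CONFLICT(id) DO UPDATE SET name = excluded.name, plan = excluded.plan",
        (event["key"], event["payload"]["name"], event["payload"]["plan"]),
    )

# Consumers never query the event log directly; they query the materialized view.
print(warehouse.execute("SELECT plan, COUNT(*) FROM customers_current GROUP BY plan").fetchall())
# [('team', 2)]
```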
Legacy system integration. Collectors adapt to existing systems without requiring source modifications. Change data capture, database polling, and API scraping normalize diverse sources into the event stream. Producers don’t need to understand or adopt event-driven patterns. The event stream becomes an internal representation, not a required producer interface.
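One way to picture a collector that touches nothing in the source: a hypothetical polling loop that uses an updated_at watermark to pull only the rows changed since the last poll and turn them into events. The table, columns, and event shape here are assumptions for illustration, not a description of any real connector.

```python
# Sketch of a watermark-based polling collector; the table, columns, and event shape are hypothetical.
import sqlite3
import time


def poll_changes(conn: sqlite3.Connection, watermark: float):
    """Fetch only rows modified since the watermark; the source system itself is never altered."""
    rows = conn.execute(
        "SELECT id, name, plan, updated_at FROM customers WHERE updated_at > ? ORDER BY updated_at",
        (watermark,),
    ).fetchall()
    events = [
        {"source": "appdb.customers", "key": str(r[0]),
         "payload": {"name": r[1], "plan": r[2]}, "captured_at": r[3]}
        for r in rows
    ]
    new_watermark = rows[-1][3] if rows else watermark
    return events, new_watermark


# Demo against an in-memory database standing in for the legacy system.
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, plan TEXT, updated_at REAL)")
source.execute("INSERT INTO customers VALUES (1, 'Ada', 'pro', ?)", (time.time(),))

events, watermark = poll_changes(source, watermark=0.0)
print(events)                              # one event for Ada, emitted without touching the source schema
print(poll_changes(source, watermark)[0])  # [] -- nothing changed since the last poll, nothing emitted
```

Change data capture plays the same role with lower latency; polling is simply the lowest-common-denominator option when the source offers nothing better.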
Operational complexity. This is delivered as a managed service. Organizations don’t operate Kafka clusters, manage stream processors, run container orchestration, or tune storage systems. Our platform abstracts operational complexity while exposing simple interfaces for defining collectors, transformations, and emitters (materialization targets).
The Migration Path
Another critical insight is the adoption pattern. Organizations don’t need to understand event streams or embrace new paradigms initially.
The system presents as a familiar ETL tool. Teams connect their database to their warehouse. Internally, the platform stores the event stream, but this is transparent to users. They receive immediate value: a working data pipeline built on familiar concepts.
As teams work with the system, they discover new capabilities organically. Someone needs customer data at different granularity. Instead of requesting modifications to the central pipeline, they define their own transformation and materialize a new view. Another team wants to experiment with a metric definition. They create a derived view without coordination overhead.
Adoption follows a pattern: initial use as an ETL replacement, gradual discovery of transformation capabilities, and eventual recognition that views can be created independently without central bottlenecks. Teams don’t adopt a new mental model through training; they discover it through usage.
Organizational Transformation in Practice
At customer deployments, we observe consistent patterns.
Data teams transition from gatekeepers to enablers. They maintain blessed canonical views for official metrics but no longer bottleneck exploratory work. Teams self-serve derived views for analysis and experimentation.
Experimentation frequency increases substantially. The cost of testing hypotheses drops effectively to zero. Teams spin up views, validate approaches, and either formalize successful patterns or abandon unsuccessful attempts. The tight coupling that previously prevented experimentation dissolves.
Migration timelines compress. Reshaping data models no longer requires coordinated cutover events. Teams create new views with updated models, validate correctness, and migrate consumers gradually. Old views continue operating during transition. Migrations become routine rather than high-risk events.
Legacy system decommissioning becomes tractable. Rather than migrating legacy systems before capturing their data, teams point collectors at them. The legacy system continues operating unchanged while its data flows into the event stream and gets reshaped downstream. This inverts the usual migration dependency: data can be reshaped before the source system is replaced.
Debug cycles accelerate dramatically. Time travel enables engineers to identify precisely when metrics began diverging from expectations. They replay views at different timestamps, isolate the transformation that introduced errors, correct it, and replay with fixed logic. The debugging process transforms from guesswork into systematic analysis.
Cross-functional autonomy increases. Product teams can own their analytics transformations rather than depending on centralized data engineering for every analysis. Central teams focus on platform reliability and canonical definitions rather than servicing every bespoke request.
Coupling That Remains
Some coupling dimensions can’t be eliminated through this architecture.
Semantic coupling persists. Event semantics, like what a “purchase completed” event actually represents, remain a human coordination problem. Does it include returns? Gift card redemptions? Producers can change event semantics without breaking schemas, causing consumers to fail silently.
Our approach treats this pragmatically. Blessed canonical views establish official definitions for business-critical metrics. Teams can create non-blessed experimental views, but accountability is clear: when numbers diverge, the burden is on the non-blessed view owner to explain why their definition differs.
Governance requires organizational process. With Matterbeam this should be more tractable, since the platform is effectively an emergent catalog as well as a lineage tool, but a lightweight governance process is still necessary for promoting non-blessed views to canonical status.
Matterbeam is a cloud-native system with strong resilience, but as teams rely on it more, the platform itself becomes critical infrastructure. Platform availability issues affect all consumers. This is analogous to depending on Snowflake or AWS: you trade distributed coordination complexity for centralized platform dependency. The reliability bar is high, and we work hard to build in simplicity and resilience.
This Is Not Theoretical
These patterns aren’t merely extrapolated from principles. We’ve built this system at Matterbeam and deployed it with customers. We observe these organizational transformations in practice.
Teams that initially adopted the platform as an ETL replacement discover transformation capabilities after weeks or months. They begin creating derived views independently. Coordination overhead drops measurably: fewer meetings, shorter approval cycles, faster time to insight. One customer shared that their product managers now ask “can we use Matterbeam for this?” for every new project. They’ve experienced a step change in what’s possible and how quickly it becomes possible.
Data quality issues that previously required days of debugging across teams now resolve in hours through deterministic replay and systematic analysis. Teams experiment more because failure costs drop dramatically.
The most striking observation is how teams internalize decoupling without explicit training. They don’t study event-driven architecture or read whitepapers about loose coupling. They simply discover “I can reshape this data myself without asking permission,” and behavior changes organically. This lets people get their hands ‘dirty’ with data, gaining data literacy in the process. In many ways, this is aligning the way data flows with the way people actually work.
The Path Forward
Application architecture underwent this transformation over the past decades. Monolithic systems decomposed into microservices. Teams gained independence. Deployment velocity increased. Innovation accelerated.
There is an opportunity for data infrastructure to follow this same trajectory, just years behind. The tight coupling that has constrained data teams (the coordination overhead, the fear of change, the tribal knowledge, the inexplicable slowness) now has a name and a solution.
The technology exists now. The architecture is proven. Organizations can choose to decouple their data infrastructure the same way they’ve decoupled application architecture.
When they do, the effects cascade. Faster experimentation. Clearer ownership. Manageable technical debt. Reduced coordination overhead. Teams spend 80% of time generating insights rather than wrangling pipelines.
The constraint isn’t technology. It’s recognizing that tight coupling in data infrastructure isn’t inevitable. In many cases it’s a choice we’ve been making without realizing alternatives existed.
We can choose differently. And at Matterbeam, we’re watching organizations make that choice and seeing them transform as a result.