Core Idea

Data Integrators are architectural forces that create pressure to keep data schemas together rather than distributing them across services.

Definition

Data Integrators are architectural forces that create pressure to keep data schemas together rather than distributing them across services. These forces represent genuine business constraints—ACID transactions, tight relationships, complex joins, consistency guarantees—making data distribution costly or risky.

Key Characteristics

  • ACID transaction requirements: Operations requiring strong consistency favor shared storage—financial transactions, inventory, and orders demand all-or-nothing guarantees that distributed transactions cannot reliably provide
  • Tight data relationships: Entities with referential integrity resist separation—foreign key constraints only work within a single database (Customer → Orders → OrderItems)
  • Complex query patterns: Operations requiring joins favor co-location—reports and analytics introduce latency and complexity when distributed
  • High consistency needs: Real-time systems cannot tolerate eventual consistency—inventory, reservations, and financials require immediate consistency
  • Shared domain semantics: Cohesive bounded contexts belong together—DDD keeps related concepts within aggregates

Examples

  • Financial ledger: Double-entry bookkeeping requires perfect consistency and transactional integrity—splitting risks data corruption
  • Real-time inventory: Multiple concurrent purchases need consistent stock counts—centralized storage prevents overselling

Why It Matters

Data integrators oppose Data-Disintegrators in trade-off analysis for service granularity. Architects evaluate the strength of integration forces against distribution benefits (scalability, deployment autonomy). Strong integrators suggest larger services; weak integrators support finer decomposition.

Distributed data patterns (saga, eventual consistency, CQRS) introduce significant complexity. Integrators identify scenarios where that complexity exceeds the benefit—financial systems and reservation platforms often have forces so strong that distribution becomes untenable. Bounded contexts align with natural integration forces, revealing optimal service boundaries.

Sources

AI Assistance

This content was drafted with assistance from AI tools for research, organization, and initial content generation. All final content has been reviewed, fact-checked, and edited by the author to ensure accuracy and alignment with the author’s intentions and perspective.