# Architecture Decision Records

This page documents the key architectural decisions made in pg_tide's design, their rationale, and the alternatives that were considered.
## ADR-1: Transactional Outbox over WAL-Based CDC

**Decision:** pg_tide uses the transactional outbox pattern rather than WAL-based logical replication for change capture.

**Context:** Most CDC tools (Debezium, `pgoutput` consumers) read the PostgreSQL WAL to capture changes. This is transparent to the application but has limitations: it captures all changes (including internal ones), can't easily enrich events with application context, and requires careful management of replication slots.

**Rationale:**
- Applications control exactly what events are published and when
- Events are guaranteed to be published if and only if the business transaction commits
- No replication slot management or WAL retention concerns
- Events can include derived data not present in any single table
- Schema is explicit and application-controlled
**Trade-offs:**

- Applications must explicitly call `tide.outbox_publish()` (not transparent)
- Slightly more application code vs. transparent CDC
- Cannot capture changes from direct SQL or other tools (unless triggers are used)
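To make the guarantee concrete, here is a minimal sketch of the outbox idea using Python's built-in `sqlite3` as a stand-in for PostgreSQL. The table names and the `place_order` helper are hypothetical; in pg_tide the equivalent of the second insert is a call to `tide.outbox_publish()` inside the same PostgreSQL transaction as the business write.

```python
import sqlite3

# SQLite stands in for PostgreSQL; schema is illustrative only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (id INTEGER PRIMARY KEY, total REAL);
    CREATE TABLE outbox (id INTEGER PRIMARY KEY AUTOINCREMENT,
                         topic TEXT, payload TEXT);
""")

def place_order(order_id: int, total: float) -> None:
    """Business write and event publish commit (or roll back) together."""
    with conn:  # one transaction for both statements
        conn.execute("INSERT INTO orders VALUES (?, ?)", (order_id, total))
        # Analogous to tide.outbox_publish(): the event row is part of
        # the same transaction as the business change, so the event is
        # published if and only if the order commits.
        conn.execute(
            "INSERT INTO outbox (topic, payload) VALUES (?, ?)",
            ("orders.created", f'{{"id": {order_id}, "total": {total}}}'),
        )

place_order(1, 99.50)
rows = conn.execute("SELECT topic FROM outbox").fetchall()
print(rows)
```

If `place_order` raised before commit, neither the order row nor the outbox row would exist, which is exactly the "if and only if" property above.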
## ADR-2: PostgreSQL Advisory Locks for HA Coordination

**Decision:** Use `pg_try_advisory_lock()` for pipeline ownership coordination rather than external consensus systems (etcd, ZooKeeper) or a separate leader-election service.

**Context:** Multiple relay instances need to coordinate which instance processes which pipeline without double-processing.

**Rationale:**
- Zero additional infrastructure — PostgreSQL is already required
- Advisory locks are automatically released on connection close (crash safety)
- Non-blocking `pg_try_advisory_lock()` prevents deadlocks
- Well-understood PostgreSQL primitive with decades of production use

**Trade-offs:**
- Requires all relay instances to connect to the same PostgreSQL instance
- Discovery interval determines failover speed (not instant)
- Lock granularity is per-pipeline (cannot split a single pipeline across instances)
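One way to picture the mechanism: each pipeline name maps to a 64-bit key, each relay instance attempts `SELECT pg_try_advisory_lock(key)`, and whichever connection receives `true` owns that pipeline until its session ends. The key derivation below is a sketch; the actual hashing scheme pg_tide uses may differ.

```python
import hashlib

def advisory_lock_key(pipeline_name: str) -> int:
    """Map a pipeline name to a signed 64-bit key, the argument type
    accepted by pg_try_advisory_lock(bigint). Illustrative scheme only."""
    digest = hashlib.sha256(pipeline_name.encode("utf-8")).digest()
    # Take the first 8 bytes as a signed big-endian integer so the
    # result fits PostgreSQL's bigint range.
    return int.from_bytes(digest[:8], "big", signed=True)

key = advisory_lock_key("orders-to-kafka")
print(key)
# A relay instance would then run, per pipeline:
#   SELECT pg_try_advisory_lock(%s)  -- true => this instance owns it
# The lock vanishes automatically if the owning connection dies,
# which is the crash-safety property noted in the rationale.
```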
## ADR-3: Pipeline Configuration in PostgreSQL Catalog

**Decision:** Store pipeline configurations in PostgreSQL tables (`tide.relay_outbox_config`, `tide.relay_inbox_config`) rather than in TOML/YAML files or environment variables.

**Context:** The relay needs to know which pipelines to run and how to configure each one.

**Rationale:**
- Configuration changes via SQL (hot-reload without restart)
- `LISTEN`/`NOTIFY` enables instant propagation of changes to running relays
- Configuration is transactional (rollback on error)
- All relay instances see the same configuration (single source of truth)
- Easy to manage programmatically (Terraform, application code)
**Trade-offs:**

- Requires database access to view or change configuration
- Secrets must use `${env:...}` substitution (not stored in the catalog)
- Slightly less familiar than file-based config for ops teams
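A minimal sketch of how `${env:...}` substitution could work, in Python; pg_tide's actual resolution rules (escaping, handling of unset variables) may differ. The point is that the catalog stores only the placeholder, and each relay resolves it from its own environment at load time.

```python
import os
import re

# Matches ${env:VAR_NAME}; the pattern is an assumption for this sketch.
_ENV_PATTERN = re.compile(r"\$\{env:([A-Za-z_][A-Za-z0-9_]*)\}")

def resolve_secrets(config_value: str) -> str:
    """Replace ${env:VAR} placeholders with environment values,
    failing loudly if a referenced variable is unset."""
    def repl(match: re.Match) -> str:
        name = match.group(1)
        if name not in os.environ:
            raise ValueError(f"environment variable {name} is not set")
        return os.environ[name]
    return _ENV_PATTERN.sub(repl, config_value)

os.environ["KAFKA_PASSWORD"] = "s3cret"
dsn = resolve_secrets("kafka://user:${env:KAFKA_PASSWORD}@broker:9092")
print(dsn)  # kafka://user:s3cret@broker:9092
```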
## ADR-4: Single Binary Relay

**Decision:** The relay is a single statically linked binary rather than a framework, library, or JVM application.

**Context:** The relay needs to be deployed easily across diverse environments.

**Rationale:**
- Single binary deployment (no runtime dependencies)
- Small container images (~20 MB)
- Fast startup time (< 1 second)
- Low resource consumption (Rust, no GC pauses)
- Cross-compilation for Linux/macOS/ARM
**Trade-offs:**
- Feature-gated compilation (some sinks/sources require build flags)
- Rust ecosystem less familiar than Java/Python for some teams
- Plugin system not possible (all sinks compiled in)
## ADR-5: JMESPath for Transforms

**Decision:** Use JMESPath (not JSONPath, jq, or a custom DSL) for message transforms and filters.

**Context:** Messages need lightweight filtering and reshaping without external tools.

**Rationale:**
- Well-specified language with formal grammar
- Deterministic evaluation (no side effects)
- Good balance of power and simplicity
- Fast compiled evaluation
- Familiar from AWS CLI and other tools
**Trade-offs:**
- Less powerful than jq (no recursion, limited array manipulation)
- No support for array indexing in field paths
- Users must learn JMESPath syntax
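To give a feel for the style of evaluation involved, here is a toy evaluator for dotted field paths such as `payload.customer.id`. This covers only a tiny subset of JMESPath and is not pg_tide's implementation; it is meant to illustrate the deterministic, side-effect-free semantics cited above, including JMESPath's convention of yielding null for missing fields.

```python
def get_path(doc, path):
    """Resolve a dotted field path against a nested dict.
    Toy subset of JMESPath for illustration only."""
    current = doc
    for part in path.split("."):
        if not isinstance(current, dict) or part not in current:
            return None  # JMESPath likewise yields null, never an error
        current = current[part]
    return current

event = {"payload": {"customer": {"id": 42}, "status": "active"}}
print(get_path(event, "payload.customer.id"))    # 42
print(get_path(event, "payload.missing.field"))  # None
```

Because evaluation is a pure function of the message, the same transform always produces the same output, which is what makes compiled, cacheable evaluation possible.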
## ADR-6: At-Least-Once Delivery

**Decision:** pg_tide provides at-least-once delivery semantics by default, not exactly-once.

**Context:** Distributed systems cannot provide exactly-once delivery without end-to-end coordination.

**Rationale:**
- At-least-once is achievable without two-phase commit
- Simpler implementation, more reliable operation
- The inbox provides application-level deduplication for exactly-once processing
- Most sinks (Kafka, NATS) inherently provide at-least-once anyway
- Idempotent consumers are a well-understood pattern
**Trade-offs:**
- Consumers may receive duplicate messages (rare, only on failure/recovery)
- Applications that need exactly-once must implement deduplication
- The inbox pattern adds complexity for cross-system exactly-once
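The application-level deduplication mentioned above can be sketched as follows, again with `sqlite3` standing in for PostgreSQL. The schema and `handle` helper are illustrative, not pg_tide's actual inbox tables: a primary-key constraint on the message id turns a redelivered duplicate into a rollback, so the business effect is applied exactly once even under at-least-once delivery.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE inbox (message_id TEXT PRIMARY KEY);
    CREATE TABLE balances (account TEXT PRIMARY KEY, amount REAL);
    INSERT INTO balances VALUES ('alice', 100.0);
""")

def handle(message_id: str, account: str, delta: float) -> bool:
    """Apply a message at most once; returns False for duplicates."""
    try:
        with conn:  # dedup insert and business update share a transaction
            conn.execute("INSERT INTO inbox VALUES (?)", (message_id,))
            conn.execute(
                "UPDATE balances SET amount = amount + ? WHERE account = ?",
                (delta, account),
            )
        return True
    except sqlite3.IntegrityError:
        return False  # already processed; redelivery is a harmless no-op

first = handle("msg-1", "alice", 25.0)
second = handle("msg-1", "alice", 25.0)  # at-least-once redelivery
amount = conn.execute(
    "SELECT amount FROM balances WHERE account = 'alice'").fetchone()[0]
print(first, second, amount)  # True False 125.0
```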
## Further Reading
- Architecture — System design overview
- Version Compatibility — Version support matrix