Migrating to pg_tide

This guide covers migrating from common messaging and CDC solutions to pg_tide. Whether you're moving from Debezium, a custom outbox implementation, application-level message publishing, or another event streaming platform, this guide provides a step-by-step migration path.

From Debezium

What Changes

Aspect          | Debezium                         | pg_tide
----------------|----------------------------------|-------------------------------------
CDC mechanism   | WAL logical replication          | Transactional outbox
Infrastructure  | Kafka Connect cluster            | Single relay binary
Configuration   | Connector JSON via REST API      | SQL catalog + relay process
Message format  | Debezium JSON/Avro               | Same (use wire_format = "debezium")
Schema changes  | Automatic (WAL captures all)     | Application explicitly publishes
Filtering       | SMTs (Single Message Transforms) | JMESPath transforms

Migration Steps

  1. Install pg_tide extension:

    CREATE EXTENSION pg_tide;
    
  2. Create outbox for each captured table:

    SELECT tide.outbox_create('orders_cdc');
    
  3. Add triggers or modify application to publish events:

    -- Option A: Trigger (captures all changes automatically)
    CREATE TRIGGER orders_cdc AFTER INSERT OR UPDATE OR DELETE ON orders
    FOR EACH ROW EXECUTE FUNCTION your_cdc_trigger();
    
    -- Option B: Application publishes explicitly (recommended)
    SELECT tide.outbox_publish('orders_cdc', 'orders', your_payload);
    
  4. Configure pipeline with Debezium wire format:

    SELECT tide.relay_set_outbox('orders-cdc', 'orders_cdc', '{
        "sink_type": "kafka",
        "brokers": "kafka:9092",
        "topic": "dbserver1.public.orders",
        "wire_format": "debezium",
        "wire_config": { "server_name": "dbserver1" }
    }'::jsonb);
    
  5. Run in parallel: Deploy pg_tide alongside Debezium, publishing to a separate topic. Compare output.

  6. Switch consumers: Once validated, point consumers to the pg_tide topic.

  7. Decommission Debezium: Remove Kafka Connect connectors.
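The `your_cdc_trigger()` function referenced in step 3 is a placeholder. A minimal sketch, assuming the `tide.outbox_publish(outbox, stream, payload)` call shown above and a payload shape of your choosing (the `op`/`before`/`after` keys here are illustrative, not a pg_tide requirement):

    -- Hypothetical CDC trigger function: forwards each row change
    -- through the outbox inside the same transaction.
    CREATE FUNCTION your_cdc_trigger() RETURNS trigger AS $$
    BEGIN
        PERFORM tide.outbox_publish(
            'orders_cdc',
            TG_TABLE_NAME,
            jsonb_build_object(
                'op',     TG_OP,  -- 'INSERT', 'UPDATE', or 'DELETE'
                -- NEW is NULL on DELETE, OLD is NULL on INSERT (PL/pgSQL semantics)
                'before', to_jsonb(OLD),
                'after',  to_jsonb(NEW)
            )
        );
        RETURN COALESCE(NEW, OLD);
    END;
    $$ LANGUAGE plpgsql;

Because the trigger runs in the writing transaction, the event is enqueued atomically with the row change.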

Key Differences to Communicate to Your Team

  • Consumers see identical Debezium-format messages (no consumer changes needed)
  • Events are now guaranteed to be published if and only if the transaction commits
  • You choose what to publish (vs. Debezium capturing everything)
  • No more "snapshot" mode — events exist only from the point the application starts publishing them, so backfill existing rows explicitly if consumers need them

From Custom Outbox Implementations

Many teams have hand-built outbox tables with cron jobs or application-level polling. Migrating to pg_tide replaces the custom infrastructure with a maintained, optimized solution.

Typical Custom Outbox

-- Your existing custom outbox table
CREATE TABLE outbox (
    id BIGSERIAL PRIMARY KEY,
    event_type TEXT,
    payload JSONB,
    published BOOLEAN DEFAULT FALSE,
    created_at TIMESTAMPTZ DEFAULT now()
);
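A table like this is usually paired with a polling worker run from cron or an application background thread. A typical sketch of the loop that pg_tide's relay replaces (batch size and locking strategy are illustrative):

    -- Poll a batch of unpublished events; SKIP LOCKED lets multiple
    -- workers poll concurrently without double-delivering a row.
    BEGIN;
    SELECT id, event_type, payload
    FROM outbox
    WHERE published = FALSE
    ORDER BY id
    LIMIT 100
    FOR UPDATE SKIP LOCKED;

    -- ...publish each row to the broker, then mark the delivered ids:
    UPDATE outbox SET published = TRUE WHERE id = ANY ($1);  -- $1 = delivered ids
    COMMIT;

All of this, plus retries and backpressure, is what the relay takes over.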

Migration Steps

  1. Create pg_tide outbox:

    SELECT tide.outbox_create('events');
    
  2. Backfill unpublished messages (if needed):

    INSERT INTO tide.outbox_events (stream_table, payload)
    SELECT event_type, payload FROM outbox WHERE published = FALSE;
    
  3. Update application code:

    -- Before: INSERT INTO outbox (event_type, payload) VALUES (...)
    -- After:
    SELECT tide.outbox_publish('events', 'orders', '{"order_id": "..."}'::jsonb);
    
  4. Configure relay pipeline:

    SELECT tide.relay_set_outbox('events-pipeline', 'events', '{
        "sink_type": "your-sink",
        ...
    }'::jsonb);
    
  5. Remove old polling infrastructure: Delete cron jobs, background workers, custom retry logic.

From Direct Message Publishing

If your application publishes directly to Kafka/NATS/RabbitMQ (without an outbox), you have a dual-write problem — the database write and message publish can become inconsistent.

The Dual-Write Problem

BEGIN;
  INSERT INTO orders (...);  -- ✓ succeeds
COMMIT;
publish_to_kafka(...);       -- ✗ fails (network error)
-- Order exists but event was never published!

Migration to Transactional Outbox

BEGIN;
  INSERT INTO orders (...);
  SELECT tide.outbox_publish('order_events', 'orders', ...);
COMMIT;
-- Both succeed or both fail. pg_tide relay handles delivery.

Steps

  1. Install pg_tide and create outbox
  2. Replace direct publish calls with tide.outbox_publish() inside the transaction
  3. Configure relay to deliver to the same broker
  4. Remove direct publishing code and client libraries
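In SQL, steps 1–3 above might look like the following, reusing the calls shown earlier in this guide. The pipeline name, topic, and column values are illustrative; point the sink at the broker your direct publisher already uses:

    -- 1. Create the outbox
    SELECT tide.outbox_create('order_events');

    -- 2. Replace the direct publish call inside the business transaction
    BEGIN;
    INSERT INTO orders (id, total) VALUES (42, 99.00);
    SELECT tide.outbox_publish('order_events', 'orders',
        '{"event": "order_created", "order_id": 42}'::jsonb);
    COMMIT;

    -- 3. Relay delivers to the same broker the application used to write to
    SELECT tide.relay_set_outbox('orders-direct', 'order_events', '{
        "sink_type": "kafka",
        "brokers": "kafka:9092",
        "topic": "order-events"
    }'::jsonb);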

From RabbitMQ / ActiveMQ

If you're using a traditional message broker and want to move to pg_tide:

  1. Keep the broker as the transport — pg_tide publishes to RabbitMQ/AMQP
  2. Replace the producer — use transactional outbox instead of direct publish
  3. Consumers stay the same — they still read from the same queues

A relay pipeline targeting RabbitMQ might look like:

SELECT tide.relay_set_outbox('orders-to-rabbit', 'order_events', '{
    "sink_type": "rabbitmq",
    "url": "amqp://guest:guest@rabbitmq:5672",
    "exchange": "orders",
    "routing_key": "order.created"
}'::jsonb);

Rollback Plan

If you need to roll back the migration:

  1. pg_tide outbox tables remain in your database (no data loss)
  2. Re-enable your previous publishing mechanism
  3. Disable the pg_tide relay pipeline:
    UPDATE tide.relay_outbox_config SET enabled = false WHERE name = 'your-pipeline';
    
  4. The extension can be dropped if no longer needed:
    DROP EXTENSION pg_tide CASCADE;
    

Further Reading