
Runbook: Relay Upgrade

Applies to: pg-tide relay binary (pg-tide run)
Scope: Rolling upgrade procedure for high-availability deployments with multiple relay instances.


Overview

The pg-tide relay is stateless between reconcile cycles. Pipeline ownership is coordinated via PostgreSQL advisory locks, so multiple relay instances can run simultaneously without split-brain. This makes rolling upgrades possible without any downtime to message delivery.
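
The coordination primitive is PostgreSQL's session-level advisory lock, which is released automatically when the holding connection closes. A minimal sketch of the behaviour (the lock key 42 here is arbitrary; pg-tide derives its real keys internally):

-- Session 1: the first caller acquires the lock and gets true.
SELECT pg_try_advisory_lock(42);

-- Session 2: the same key is already held, so this returns false without blocking.
SELECT pg_try_advisory_lock(42);

-- Closing session 1's connection releases the lock implicitly;
-- SELECT pg_advisory_unlock(42) releases it explicitly.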


Pre-Upgrade Checklist

  1. Read the CHANGELOG for the target version — note any new required configuration keys or deprecated flags.
  2. Back up the database (or confirm PITR is current).
  3. Confirm current relay health:
    pg-tide status --postgres-url "$PG_TIDE_POSTGRES_URL"
    pg-tide doctor --postgres-url "$PG_TIDE_POSTGRES_URL"
    
  4. Upgrade the PostgreSQL extension first (if the target relay version requires a newer extension schema):
    ALTER EXTENSION pg_tide UPDATE;
    
    The old relay is forward-compatible with the new schema; the new relay is backward-compatible with the old schema. Upgrading the extension first is always safe. A version-check snippet follows below.
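
    To see where you stand before and after the ALTER EXTENSION, the standard PostgreSQL catalog view is enough (no pg-tide-specific tooling assumed):

    SELECT name, installed_version, default_version
    FROM pg_available_extensions
    WHERE name = 'pg_tide';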

Rolling Upgrade Procedure (Kubernetes)

1. Update the Deployment Image Tag

kubectl set image deployment/pg-tide \
  pg-tide=ghcr.io/trickle-labs/pg-tide:0.19.0

Or update image.tag in values.yaml and run helm upgrade:

helm upgrade pg-tide oci://ghcr.io/trickle-labs/helm/pg-tide \
  --set image.tag=0.19.0 \
  --reuse-values
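
Either way, Helm's own commands confirm the release took effect:

helm history pg-tide                          # the new revision should show as deployed
helm get values pg-tide | grep -A 2 image     # image.tag should now read 0.19.0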

2. Watch the Rollout

kubectl rollout status deployment/pg-tide

Kubernetes replaces pods one at a time (controlled by the Deployment's maxUnavailable and maxSurge settings; a sketch of the relevant stanza follows this list). As each old pod is terminated:

  1. PostgreSQL advisory locks held by the old pod are released automatically when the connection closes (within ~1 s of pod termination).
  2. The new pod's coordinator reconciles and reacquires the released pipelines.
  3. Messages may be re-delivered (at-least-once) for any batch that was in-flight at the time of the old pod's termination.
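
For reference, the Deployment strategy that produces this one-at-a-time behaviour looks like the following sketch (adjust to however your chart or manifests expose these fields):

strategy:
  type: RollingUpdate
  rollingUpdate:
    maxUnavailable: 0   # never take an old pod down before its replacement is Ready
    maxSurge: 1         # allow one extra pod while the rollout is in progress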

3. Verify Post-Upgrade

# All pods should be running the new version:
kubectl get pods -l app.kubernetes.io/name=pg-tide -o json \
  | jq '.items[].spec.containers[].image'

# Pipeline ownership should be fully restored:
pg-tide status --postgres-url "$PG_TIDE_POSTGRES_URL"

Rolling Upgrade Procedure (Docker / Systemd)

For single-instance or manual deployments:

  1. Start the new relay alongside the old one (different container name or port is fine; both will contend for advisory locks and share pipeline ownership gracefully).

    docker run -d --name pg-tide-new \
      -e PG_TIDE_POSTGRES_URL="$PG_TIDE_POSTGRES_URL" \
      ghcr.io/trickle-labs/pg-tide:0.19.0
    
  2. Verify the new relay is healthy and has acquired pipelines (a polling sketch that automates this wait appears after this list):

    docker logs pg-tide-new 2>&1 | grep "acquired lock"
    pg-tide status --postgres-url "$PG_TIDE_POSTGRES_URL"
    
  3. Stop the old relay (graceful drain):

    docker stop --time 60 pg-tide-old
    

    The --time 60 flag gives the old relay up to 60 seconds to finish in-flight batches before it is hard-stopped.

  4. Clean up:

    docker rm pg-tide-old
    docker rename pg-tide-new pg-tide
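
Steps 2-3 can be scripted. A minimal sketch that polls for the same log line step 2 greps for, then drains the old relay:

#!/usr/bin/env bash
set -euo pipefail

# Wait up to 60 s for the new relay to report lock acquisition, then drain the old one.
for _ in $(seq 1 60); do
  if docker logs pg-tide-new 2>&1 | grep -q "acquired lock"; then
    docker stop --time 60 pg-tide-old
    exit 0
  fi
  sleep 1
done
echo "pg-tide-new never acquired a lock; leaving pg-tide-old running" >&2
exit 1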
    

Configuration Changes Between Versions

New Required Configuration Keys

Check the CHANGELOG for any new required keys. The relay will fail to start with a clear error message if a required key is missing.

Deprecated Flags

Deprecated flags continue to work until the next major version. A WARN-level log entry is emitted at startup if a deprecated flag is present.
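
To surface those warnings right after a restart (Docker shown; use kubectl logs for the Kubernetes deployment):

docker logs pg-tide 2>&1 | grep "WARN"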

Environment Variable Changes

Old (pre-0.17.0)             New                    Notes
PG_TIDE_RELAY_POSTGRES_URL   PG_TIDE_POSTGRES_URL   Old name no longer recognised
Pre-v0.17 legacy env var     PG_TIDE_POSTGRES_URL   See CHANGELOG v0.17.0 for details
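
In practice the rename is a one-line change wherever the variable is exported; the value itself is unchanged (placeholder DSN below):

# Pre-0.17.0 name, no longer read by the relay:
# export PG_TIDE_RELAY_POSTGRES_URL="postgres://user:pass@db:5432/app"
# 0.17.0 and later:
export PG_TIDE_POSTGRES_URL="postgres://user:pass@db:5432/app"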

Rollback

If the new relay is unhealthy after the upgrade:

  1. Start the old relay binary alongside the new one; it will contend for the advisory locks exactly as during the upgrade.
  2. Stop the new relay. Its locks are released when its connections close, and the old relay reacquires the pipelines.
  3. Investigate logs from the new relay for the root cause.

Because pipeline state lives in PostgreSQL, no data is lost during rollback.
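
On Kubernetes, steps 1-2 collapse into the standard rollout-undo, assuming the Deployment's revision history still retains the previous ReplicaSet:

kubectl rollout undo deployment/pg-tide
kubectl rollout status deployment/pg-tide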


HA Considerations

  • Run at least two relay instances in production so upgrades can proceed with zero downtime.
  • A PodDisruptionBudget with minAvailable: 1 keeps at least one relay running during node drains. (A budget of maxUnavailable: 0 would block voluntary evictions entirely and stall drains.)
  • The Helm chart defaults (helm/pg-tide/values.yaml) set replicaCount: 2 and include a PodDisruptionBudget.
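
A minimal PodDisruptionBudget sketch matching the label used in the verification step above (the Helm chart's bundled PDB may differ in detail):

apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: pg-tide
spec:
  minAvailable: 1        # keep at least one relay running through voluntary disruptions
  selector:
    matchLabels:
      app.kubernetes.io/name: pg-tide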

See Also