Postgres-to-Iceberg Replication That Analysts Query Through the Postgres Wire
Streambed, built by Cloudflare's former Postgres tech lead, streams WAL changes into Iceberg on S3 and serves analytical queries back through the Postgres protocol via embedded DuckDB, drawing 129 HN points and unusually specific practitioner engagement: TOAST column handling, DDL sync, comparisons against olake, supabase/etl, and Sequin, with one commenter noting the alternatives all have subtle issues. Reliable Postgres CDC to open table formats with zero new query surface is infrastructure teams keep half-building themselves.
Problem Statement
A platform team fields BI queries that throttle the production database, so they add read replicas that still buckle under analytical scans, then evaluate CDC pipelines: Debezium needs Kafka operations, hosted CDC costs scale punitively, and the new Go-based tools each carry caveats like TOAST columns requiring REPLICA IDENTITY FULL, raised verbatim by an HN commenter from production experience. Meanwhile dashboard teams just want a Postgres endpoint that does not endanger production, the exact interface Streambed's author built after hitting this at Cloudflare.
The Idea
A replication service for data and platform teams that turns a production Postgres into an analytics-ready Iceberg lakehouse queryable through the Postgres wire their BI tools already speak.
Why Now
Iceberg won the open-table-format consolidation in 2025-2026, and every Postgres shop now wants WAL-to-Iceberg without running Debezium and Kafka. The HN thread documents a field of immature alternatives with named subtle issues, while the analytics-on-Postgres pain, read replicas strangled by long queries, predates them all and worsens with data volume.
Target User
Platform and data engineers at Postgres-centric companies with growing analytical load
Target Market
Data replication and lakehouse infrastructure
The full brief is free to read
Create a free account to unlock the complete build-ready brief for “Postgres-to-Iceberg Replication That Analysts Query Through the Postgres Wire”, including:
- MVP scope & feature boundaries
- Step-by-step validation plan
- Score rationale across 11 dimensions
- Monetization model & pricing angle
- Competitors with links
- Acquisition channels & go-to-market
- Risks & counter-evidence
More Data Integration opportunities
Connector Health Monitor and Pipeline Reliability Dashboard for Airbyte
Buyer reviews for Airbyte Cloud consistently highlight reliability concern friction, specifically: Connector quality varies wildly. Salesforce connector is solid; HubSpot connecto; Sync failure debugging is painful. Error messages are often generic Java stack t. This pain is concentrated among Data engineers managing Airbyte connector reliability across production pipelines and creates demand for a focused tool that resolves the gap without requiring a platform switch. The Data Integration category has matured enough that users have committed to Airbyte Cloud as infrastructure, making adjacent tooling more viable than platform replacement.
View opportunityData IntegrationSelf-Serve Postgres CDC Replication With TOAST and Schema-Drift Handling
Artie opened self-serve access to its real-time database-to-warehouse replication and the first HN question was whether it handles TOAST columns and schema drift better than Debezium. Mid-market data teams still get paged for broken CDC pipelines weekly. A self-serve CDC product that makes the Debezium failure modes (TOAST, drift, replica identity) invisible is a wedge with proven willingness to pay.
View opportunity