Source: Kafka / Redpanda topics (streaming-as-table) #5

@frhack

Add a Kafka consumer source that exposes a topic as a bounded or unbounded SQL table.

Why it matters: Streaming-as-table is the headline value prop of Materialize / RisingWave / Arroyo. "SELECT count(*) FROM orders_topic WHERE …" with a tail-mode option turns dbfy into a real ops/log-analytics tool, not just a read-once federation.

Scope (v1): dbfy_kafka('broker:9092', topic := 'orders', format := 'json', start := 'earliest', end := 'latest'). Schema inference goes through the parser layer (jsonl/avro/protobuf). tail := true enables unbounded streaming via the streaming SQL operator (once DataFusion's stream support stabilises).
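To make the schema-inference part of the scope concrete, here is a minimal sketch of how a sample of JSON messages could be folded into a column-to-type mapping. Everything in it is an assumption for illustration: `infer_schema`, the type names, and the widening rules (integer plus float widens to `DOUBLE`, any other conflict falls back to `VARCHAR`) are hypothetical and not taken from dbfy's parser layer, which may behave differently.

```python
import json


def _type_of(value):
    """Map one decoded JSON value to a (hypothetical) SQL type name."""
    if value is None:
        return "NULL"
    if isinstance(value, bool):  # check bool first: bool is a subclass of int
        return "BOOLEAN"
    if isinstance(value, int):
        return "BIGINT"
    if isinstance(value, float):
        return "DOUBLE"
    if isinstance(value, str):
        return "VARCHAR"
    return "JSON"  # nested objects and arrays stay opaque in this sketch


def _merge(a, b):
    """Widen two observed types into one column type."""
    if a == b:
        return a
    if a == "NULL":
        return b
    if b == "NULL":
        return a
    if {a, b} == {"BIGINT", "DOUBLE"}:
        return "DOUBLE"
    return "VARCHAR"  # assumed fallback for any other conflict


def infer_schema(payloads):
    """Infer {column: type} from an iterable of JSON object strings."""
    schema = {}
    for raw in payloads:
        record = json.loads(raw)  # each message is assumed to be a JSON object
        for key, value in record.items():
            schema[key] = _merge(schema.get(key, "NULL"), _type_of(value))
    return schema


sample = [
    '{"id": 1, "amount": 10}',
    '{"id": 2, "amount": 10.5, "note": null}',
    '{"id": 3, "note": "rush"}',
]
inferred = infer_schema(sample)
```

A column missing from some messages simply keeps the type seen elsewhere, so in practice every inferred column would be nullable. How many messages to sample before fixing the schema is a separate design question this sketch does not address.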

Effort: 5-7 days (the bounded vs unbounded distinction + offset management is the real complexity, not the consumer wire).
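The bounded vs unbounded distinction can be sketched as a planning step that turns `start`, `end`, and `tail` into one offset range per partition. This is an illustrative sketch only: `OffsetRange` and `resolve_ranges` are hypothetical names, the watermarks are assumed to be snapshotted once at plan time as `(low, high)` pairs where `high` is the next offset to be written, and accepting explicit numeric offsets in addition to `'earliest'` / `'latest'` is an assumption not stated in the issue.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class OffsetRange:
    partition: int
    start: int           # first offset to read (inclusive)
    end: Optional[int]   # stop offset (exclusive); None means unbounded

    def is_bounded(self) -> bool:
        return self.end is not None


def resolve_ranges(watermarks, start="earliest", end="latest", tail=False):
    """Resolve per-partition read ranges.

    watermarks: {partition: (low, high)} captured at plan time (assumed).
    """
    ranges = []
    for partition, (low, high) in sorted(watermarks.items()):
        if start == "earliest":
            first = low
        elif start == "latest":
            first = high
        else:
            # Explicit offset (assumed extension): clamp into [low, high].
            first = min(max(int(start), low), high)

        if tail:
            stop = None  # unbounded: keep reading past the snapshot
        elif end == "latest":
            stop = high  # bounded: stop at the plan-time high watermark
        else:
            stop = min(max(int(end), first), high)
        ranges.append(OffsetRange(partition, first, stop))
    return ranges


snapshot = {0: (0, 100), 1: (40, 75)}
bounded = resolve_ranges(snapshot)
tailing = resolve_ranges(snapshot, tail=True)
```

Freezing `end := 'latest'` to the high watermark seen at plan time is what lets a bounded scan terminate even while producers keep writing; with `tail := true` the end is left open and the scan never completes on its own.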

Labels: source-request ("Request for a new source kind to be added"), tier-1 ("Wow factor — opens a new mark")
