⚠️ Work in Progress: This project is under active development. APIs may change.
A Kafka toolkit for Kotlin with two complementary approaches:
| Module | Description |
|---|---|
| kafka-flow | Flow-based consumer/producer with automatic retry, DLT, and metrics |
| ktor-kafka | Ktor plugin for Spring Kafka with suspend listeners and DI bridging |
Both modules are built on Spring Kafka but do not require Spring Boot.
| Requirement | Version | Notes |
|---|---|---|
| Kotlin | 2.0+ | |
| JDK | 21+ | Virtual Threads enabled by default |
| Spring Kafka | 3.3+ | |
A Kotlin Flow-based Kafka consumer/producer library. Write lean consumers that focus purely on business logic—all retry, DLT, and metrics handling is automatic.
- Lean Consumer Pattern: Just implement `suspend fun consume(record)`, no boilerplate
- Flow-based API: Consume Kafka messages as Kotlin Flows with backpressure
- Coroutine-native: Fully suspend-based, non-blocking async publishing
- Two-stage Retry: In-memory retries → Retry Topic → Dead Letter Topic
- Per-consumer Config: All policies configurable via the `@KafkaTopic` annotation
- Exception Classification: Retryable vs NonRetryable, skip retries for validation errors
- TTL Support: Expire messages by retry duration or message age
- Metrics Interface: Pluggable observability (`NoOp`, `Logging`, `Micrometer`)
dependencies {
implementation("io.github.osoykan:kafka-flow:0.1.0") // to be published
}
Write consumers that focus only on business logic:
@KafkaTopic(
name = "orders.created",
retry = "orders.created.retry",
dlt = "orders.created.dlt",
maxInMemoryRetries = 3,
maxRetryTopicAttempts = 2
)
class OrderCreatedConsumer(
private val orderService: OrderService
) : ConsumerAutoAck<String, OrderEvent> {
override suspend fun consume(record: ConsumerRecord<String, OrderEvent>) {
orderService.processOrder(record.value())
}
}
What happens automatically:
- ✅ In-memory retries with exponential backoff
- ✅ Send to retry topic when in-memory retries exhausted
- ✅ Send to DLT when retry topic attempts exhausted
- ✅ Exception classification (validation errors → DLT immediately)
- ✅ TTL checks (expire old messages)
- ✅ Metrics recording
- ✅ Automatic acknowledgment on success
Listen to multiple topics with a single consumer:
@KafkaTopic(
topics = ["orders.created", "orders.updated", "orders.cancelled"],
retry = "orders.retry",
dlt = "orders.dlt"
)
class OrderEventsConsumer : ConsumerAutoAck<String, OrderEvent> {
override suspend fun consume(record: ConsumerRecord<String, OrderEvent>) {
when (record.topic()) {
"orders.created" -> handleCreated(record.value())
"orders.updated" -> handleUpdated(record.value())
"orders.cancelled" -> handleCancelled(record.value())
}
}
}
@KafkaTopic(
// Topic Configuration
name = "payments", // Main topic (or use topics = [...])
retry = "payments.retry", // Retry topic (auto: {name}.retry)
dlt = "payments.dlt", // DLT topic (auto: {name}.dlt)
groupId = "payment-service", // Consumer group
concurrency = 4, // Processing concurrency
multiplePartitions = 2, // Container-level parallelism
// In-Memory Retry Configuration
maxInMemoryRetries = 5,
backoffMs = 100,
backoffMultiplier = 2.0,
maxBackoffMs = 30_000,
// Retry Topic Configuration
maxRetryTopicAttempts = 3,
retryTopicBackoffMs = 1_000,
retryTopicBackoffMultiplier = 2.0,
maxRetryTopicBackoffMs = 60_000,
// TTL Configuration
maxRetryDurationMs = 300_000, // 5 min max from first failure
maxMessageAgeMs = 600_000, // 10 min max from timestamp
// Exception Classification
classifier = ClassifierType.DEFAULT,
nonRetryableExceptions = [ValidationException::class]
)
class PaymentConsumer : ConsumerAutoAck<String, PaymentEvent> {
override suspend fun consume(record: ConsumerRecord<String, PaymentEvent>) {
// Your business logic
}
}
kafka-flow provides two-level parallelism for maximum throughput:
| Parameter | What It Controls | Default |
|---|---|---|
| `multiplePartitions` | Number of Kafka polling threads (Spring Kafka's `concurrency`) | 1 |
| `concurrency` | Number of records processed in parallel (across ALL pollers) | 4 |
⚠️ Important: All polling threads feed into a single Kotlin Flow. The `concurrency` parameter controls how many records from that unified flow are processed simultaneously.
flowchart LR
subgraph Topic["Topic: orders.created (6 partitions)"]
P0["P0"] & P1["P1"] & P2["P2"] & P3["P3"] & P4["P4"] & P5["P5"]
end
subgraph Polling["multiplePartitions = 3"]
PT1["Poller 1"]
PT2["Poller 2"]
PT3["Poller 3"]
end
subgraph Flow["Single Kotlin Flow"]
Channel["Shared Channel"]
end
subgraph Processing["concurrency = 4"]
W1["Worker 1"] & W2["Worker 2"] & W3["Worker 3"] & W4["Worker 4"]
end
P0 & P1 --> PT1
P2 & P3 --> PT2
P4 & P5 --> PT3
PT1 & PT2 & PT3 --> Channel
Channel --> W1 & W2 & W3 & W4
style Topic fill:#1e88e5,color:#fff
style Polling fill:#fb8c00,color:#fff
style Flow fill:#7b1fa2,color:#fff
style Processing fill:#43a047,color:#fff
Processing parallelism = `concurrency` (not multiplePartitions × concurrency).
All records merge into one flow, then `flatMapMerge(concurrency)` fans out to workers.
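The sketch below illustrates that fan-out with plain kotlinx-coroutines primitives: several simulated pollers feed one shared channel, and `flatMapMerge` caps how many records are processed at once. It is a simplified illustration of the behavior described above, not kafka-flow's internal code; the channel capacity, record counts, and delays are arbitrary.
```kotlin
import kotlinx.coroutines.*
import kotlinx.coroutines.channels.Channel
import kotlinx.coroutines.flow.*

@OptIn(FlowPreview::class)
fun main() = runBlocking {
  // Shared buffer between pollers and workers (capacity is arbitrary here)
  val channel = Channel<String>(capacity = 64)

  // Simulate 3 pollers (multiplePartitions = 3) feeding the same channel
  val pollers = (1..3).map { poller ->
    launch(Dispatchers.IO) {
      repeat(4) { i -> channel.send("poller-$poller/record-$i") }
    }
  }
  launch { pollers.joinAll(); channel.close() }

  // One unified flow; flatMapMerge caps processing parallelism at 4,
  // regardless of how many pollers feed the channel
  channel.receiveAsFlow()
    .flatMapMerge(concurrency = 4) { record ->
      flow {
        delay(100) // simulated I/O-bound work
        emit(record)
      }
    }
    .collect { println("processed $it") }
}
```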
flowchart LR
subgraph Topic["Topic (3 partitions)"]
P0["P0"] & P1["P1"] & P2["P2"]
end
subgraph Polling["multiplePartitions = 1"]
PT1["Single Poller"]
end
subgraph Processing["concurrency = 4"]
W1["W1"] & W2["W2"] & W3["W3"] & W4["W4"]
end
P0 & P1 & P2 --> PT1 --> W1 & W2 & W3 & W4
style Topic fill:#1e88e5,color:#fff
style Polling fill:#fb8c00,color:#fff
style Processing fill:#43a047,color:#fff
✅ Simple setup, good for low-volume topics
⚠️ Single thread polls all partitions (may become bottleneck at high volume)
Processing: 4 records in parallel
flowchart LR
subgraph Topic["Topic (6 partitions)"]
P0["P0"] & P1["P1"] & P2["P2"] & P3["P3"] & P4["P4"] & P5["P5"]
end
subgraph Polling["multiplePartitions = 3"]
PT1["Poller 1"]
PT2["Poller 2"]
PT3["Poller 3"]
end
subgraph Processing["concurrency = 4"]
W1["W1"] & W2["W2"] & W3["W3"] & W4["W4"]
end
P0 & P1 --> PT1
P2 & P3 --> PT2
P4 & P5 --> PT3
PT1 & PT2 & PT3 --> W1 & W2 & W3 & W4
style Topic fill:#1e88e5,color:#fff
style Polling fill:#fb8c00,color:#fff
style Processing fill:#43a047,color:#fff
✅ Faster polling from multiple partitions in parallel
✅ Better partition distribution in consumer group
✅ Reduced risk of `max.poll.interval.ms` timeouts
Processing: Still 4 records in parallel (same `concurrency`)
multiplePartitions (Spring Kafka's container concurrency):
- Creates multiple Kafka polling threads within the same consumer group
- Each thread is assigned a subset of partitions by Kafka's group coordinator
- Increases polling throughput - records are fetched faster from Kafka
- Set this ≤ partition count (extra threads will be idle)
- Uses virtual threads (Java 21+) by default
concurrency (kafka-flow's processing parallelism):
- This is your actual processing parallelism
- All records from ALL pollers merge into a single Kotlin Flow
- `flatMapMerge(concurrency)` processes N records simultaneously
- Increase for I/O-bound workloads (HTTP calls, DB queries)
- Ordered commits are still maintained per partition
Example configurations:
// Low-volume topic, simple processing
@KafkaTopic(
name = "notifications",
multiplePartitions = 1, // Single consumer
concurrency = 2 // 2 records at a time
)
// High-volume topic with I/O-heavy processing
@KafkaTopic(
name = "orders.created",
multiplePartitions = 6, // Match partition count
concurrency = 8 // 8 records per consumer = 48 total
)
// CPU-bound processing (limit parallelism)
@KafkaTopic(
name = "reports.generate",
multiplePartitions = 4,
concurrency = 1 // Sequential per consumer
)
Key differences from Spring Kafka:
| Spring Kafka | kafka-flow | Description |
|---|---|---|
| `concurrency` | `multiplePartitions` | Container-level parallelism (Kafka consumer threads) |
| N/A | `concurrency` | Flow-level parallelism (coroutine-based record processing) |
Spring Kafka processes records sequentially per partition by default. kafka-flow adds a second layer of parallelism via Kotlin Flows, allowing concurrent processing of records while still maintaining commit ordering.
When using multiplePartitions > 1, understanding how Kafka assigns partitions to consumers is important for optimal performance.
Under the hood, kafka-flow uses Spring Kafka's ConcurrentMessageListenerContainer, which delegates to multiple KafkaMessageListenerContainer instances:
flowchart LR
subgraph Topic["Topic: orders.created"]
P0["Partition 0"]
P1["Partition 1"]
P2["Partition 2"]
P3["Partition 3"]
P4["Partition 4"]
P5["Partition 5"]
end
subgraph CMLC["ConcurrentMessageListenerContainer (multiplePartitions = 3)"]
subgraph KC1["KafkaMessageListenerContainer-0"]
C1["Consumer Thread 1"]
ID1["group.instance.id-1"]
end
subgraph KC2["KafkaMessageListenerContainer-1"]
C2["Consumer Thread 2"]
ID2["group.instance.id-2"]
end
subgraph KC3["KafkaMessageListenerContainer-2"]
C3["Consumer Thread 3"]
ID3["group.instance.id-3"]
end
end
P0 --> C1
P1 --> C1
P2 --> C2
P3 --> C2
P4 --> C3
P5 --> C3
style Topic fill:#1e88e5,color:#fff
style CMLC fill:#fb8c00,color:#fff
style KC1 fill:#43a047,color:#fff
style KC2 fill:#43a047,color:#fff
style KC3 fill:#43a047,color:#fff
- Each internal container runs on its own thread with an independent Kafka consumer
- Partitions are distributed across containers by Kafka's group coordinator
- Setting `multiplePartitions` > partition count means some containers will be idle
When multiplePartitions > 1, kafka-flow automatically enables Static Membership by suffixing group.instance.id with -n (where n starts at 1). This significantly reduces rebalance events during:
- Application restarts
- Rolling deployments
- Temporary network issues
# application.yaml - Configure static membership
kafka:
consumer:
properties:
group.instance.id: "order-service" # Base instance ID
session.timeout.ms: 300000 # 5 min (increase for static membership)
How it works:
- Each container gets a unique instance ID: `order-service-1`, `order-service-2`, etc.
- When a consumer restarts with the same instance ID, Kafka recognizes it and skips rebalancing
- The increased `session.timeout.ms` gives more time for restarts without triggering a rebalance
| Without Static Membership | With Static Membership |
|---|---|
| Every restart triggers rebalance | Same instance ID = no rebalance |
| Partitions redistributed | Partitions stay assigned |
| Processing pauses during rebalance | Minimal disruption |
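As a rough sketch of the suffixing behavior described above (an assumption about the mechanism, not the library's actual code), a per-container `group.instance.id` could be derived like this:
```kotlin
import org.apache.kafka.clients.consumer.ConsumerConfig

// Conceptual sketch: derive a per-container group.instance.id from the configured base.
// With a zero-based container index, the suffix becomes -1, -2, ... as described above.
fun withStaticMemberId(base: Map<String, Any>, containerIndex: Int): Map<String, Any> {
  val baseId = base[ConsumerConfig.GROUP_INSTANCE_ID_CONFIG] as? String ?: return base
  return base + (ConsumerConfig.GROUP_INSTANCE_ID_CONFIG to "$baseId-${containerIndex + 1}")
}
```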
kafka-flow uses CooperativeStickyAssignor by default, which provides:
- Sticky assignments: Partitions stay with the same consumer across rebalances when possible
- Cooperative rebalancing: Only revoked partitions stop processing during rebalance (not all partitions)
// Default assignment strategy (configured in ConsumerConfig)
val partitionAssignmentStrategy = "org.apache.kafka.clients.consumer.CooperativeStickyAssignor"
Available strategies:
| Strategy | Behavior | Use Case |
|---|---|---|
| `CooperativeStickyAssignor` | Incremental rebalance, sticky assignments | ✅ Recommended default |
| `RangeAssignor` | Assigns contiguous partition ranges | Co-partitioned topics |
| `RoundRobinAssignor` | Even distribution across consumers | Simple even load |
| `StickyAssignor` | Sticky but with stop-the-world rebalance | Legacy compatibility |
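If you need a different assignor while on the classic protocol, the standard Kafka consumer property can be supplied through the consumer properties you pass to the factory. A minimal sketch, assuming the property is passed through to the underlying consumer unchanged:
```kotlin
import org.apache.kafka.clients.consumer.ConsumerConfig
import org.apache.kafka.clients.consumer.RangeAssignor

// Override the default assignor via the standard Kafka property,
// e.g. for co-partitioned topics that need range assignment.
val consumerProperties: Map<String, Any> = mapOf(
  ConsumerConfig.BOOTSTRAP_SERVERS_CONFIG to "localhost:9092",
  ConsumerConfig.GROUP_ID_CONFIG to "order-service",
  ConsumerConfig.PARTITION_ASSIGNMENT_STRATEGY_CONFIG to RangeAssignor::class.java.name
)
```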
Apache Kafka 4.0 introduces a new server-driven consumer rebalance protocol that significantly improves rebalancing performance:
# application.yaml - Enable new protocol
kafka:
consumer:
properties:
group.protocol: consumer # New protocol (Kafka 4.0+)
# group.protocol: classic # Legacy protocol (default)
Benefits of the new protocol:
- Server-driven assignments: Broker manages partition assignments, reducing client coordination
- Incremental rebalancing: Only affected partitions are reassigned
- Reduced downtime: Consumer groups experience minimal disruption during scaling
Important considerations:
| Aspect | Classic Protocol | New Protocol (KIP-848) |
|---|---|---|
| Assignment | Client-side assignors | Server-side (ignores custom assignors) |
| Rebalancing | Stop-the-world or cooperative | Always incremental |
| Callbacks | Single `onPartitionsAssigned` | Multiple calls with smaller partition sets |
| Kafka Version | All versions | 4.0+ only |
// Example: Configure for Kafka 4.0 new protocol
fun consumerProperties(config: KafkaConfig): Map<String, Any> = buildMap {
put(ConsumerConfig.BOOTSTRAP_SERVERS_CONFIG, config.bootstrapServers)
put(ConsumerConfig.GROUP_ID_CONFIG, config.groupId)
// Enable new rebalance protocol (Kafka 4.0+)
put("group.protocol", "consumer")
// Note: partition.assignment.strategy is ignored with new protocol
}
⚠️ Note: When using `group.protocol=consumer`, custom partition assignment strategies are ignored. The server handles all assignments. If you need custom assignors, use `group.protocol=classic`.
// DEFAULT: Validation errors go to DLT immediately
@KafkaTopic(name = "orders", classifier = ClassifierType.DEFAULT)
// ALWAYS_RETRY: Retry all exceptions
@KafkaTopic(name = "notifications", classifier = ClassifierType.ALWAYS_RETRY)
// NEVER_RETRY: Never retry
@KafkaTopic(name = "critical", classifier = ClassifierType.NEVER_RETRY)Default non-retryable exceptions:
IllegalArgumentException, IllegalStateException, NullPointerException, ClassCastException, NumberFormatException, UnsupportedOperationException, IndexOutOfBoundsException, NoSuchElementException
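Conceptually, DEFAULT classification is a type check against that list. A minimal sketch of the idea (not the library's actual classifier):
```kotlin
// Minimal sketch of DEFAULT classification: exceptions in the non-retryable set
// (and their subclasses) skip retries and go straight to the DLT.
val defaultNonRetryable: Set<Class<out Throwable>> = setOf(
  IllegalArgumentException::class.java,
  IllegalStateException::class.java,
  NullPointerException::class.java,
  ClassCastException::class.java,
  NumberFormatException::class.java,
  UnsupportedOperationException::class.java,
  IndexOutOfBoundsException::class.java,
  NoSuchElementException::class.java
)

fun isRetryable(error: Throwable): Boolean =
  defaultNonRetryable.none { it.isInstance(error) }
```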
@KafkaTopic(name = "payments")
class PaymentConsumer : ConsumerManualAck<String, PaymentEvent> {
override suspend fun consume(
record: ConsumerRecord<String, PaymentEvent>,
ack: Acknowledgment
) {
paymentService.process(record.value())
ack.acknowledge() // Explicit commit
}
}
Message → TTL Check → Classification → In-Memory Retries → Retry Topic → DLT
↓ ↓ ↓ ↓
Expired NonRetryable Exhausted Exhausted
↓ ↓ ↓ ↓
DLT DLT Retry Topic DLT
The library implements backpressure using Spring Kafka's pause() and resume():
ListenerConfig(
backpressure = BackpressureConfig(
enabled = true,
pauseThreshold = 0.8, // Pause at 80% buffer fill
resumeThreshold = 0.3 // Resume at 30% buffer fill
)
)
flowchart TB
Topic["Topic"]
Poller["Poller"]
Buffer["Buffer ▓▓▓▓▓▓▓▓░░"]
W1["Worker 1"]
W2["Worker 2"]
W3["Worker 3"]
Topic --> Poller
Poller -->|"records"| Buffer
Buffer -->|"flow"| W1 & W2 & W3
Buffer -.->|"80% full → pause()"| Poller
W1 & W2 & W3 -.->|"30% full → resume()"| Poller
style Topic fill:#1e88e5,color:#fff
style Poller fill:#fb8c00,color:#fff
style Buffer fill:#7b1fa2,color:#fff
style W1 fill:#43a047,color:#fff
style W2 fill:#43a047,color:#fff
style W3 fill:#43a047,color:#fff
How it works:
- Records flow from Kafka → Buffer → Workers
- When buffer reaches 80% capacity → `pause()` stops polling
- Workers continue processing, draining the buffer
- When buffer drops to 30% → `resume()` restarts polling
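A minimal sketch of that threshold logic (illustrative only; in the library, the pause/resume callbacks are driven through Spring Kafka's listener container rather than a standalone class like this):
```kotlin
// Illustrative threshold-based backpressure gate.
class BackpressureGate(
  private val capacity: Int,
  private val pause: () -> Unit,
  private val resume: () -> Unit,
  private val pauseThreshold: Double = 0.8,
  private val resumeThreshold: Double = 0.3
) {
  private var buffered = 0
  private var paused = false

  @Synchronized
  fun onEnqueued() {
    buffered++
    if (!paused && buffered >= capacity * pauseThreshold) {
      paused = true
      pause() // buffer ~80% full → stop polling
    }
  }

  @Synchronized
  fun onProcessed() {
    buffered--
    if (paused && buffered <= capacity * resumeThreshold) {
      paused = false
      resume() // buffer drained to ~30% → start polling again
    }
  }
}
```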
With concurrent processing, records may complete out of order. The OrderedCommitter ensures offsets are committed in order and pauses consumption when gaps are detected.
sequenceDiagram
participant K as Kafka
participant P as Poller
participant W as Workers
participant OC as OrderedCommitter
Note over K,OC: Records arrive in order: 0, 1, 2, 3, 4
K->>P: Poll records 0,1,2,3,4
P->>W: Dispatch to workers
Note over W: Concurrent processing<br/>(out of order completion)
W->>OC: Complete offset 2
Note over OC: Gap detected!<br/>Expected: 0, Got: 2
OC-->>P: onGapDetected → pause()
W->>OC: Complete offset 4
Note over OC: Still waiting for 0,1,3
W->>OC: Complete offset 0
Note over OC: Commit 0<br/>Gap remains (1 missing)
W->>OC: Complete offset 1
Note over OC: Commit 1,2<br/>Gap remains (3 missing)
W->>OC: Complete offset 3
Note over OC: Commit 3,4<br/>No gaps!
OC-->>P: onGapClosed → resume()
K->>P: Resume polling
flowchart LR
subgraph Completion["Completion Order"]
C1["2 ✓"] --> C2["4 ✓"] --> C3["0 ✓"] --> C4["1 ✓"] --> C5["3 ✓"]
end
subgraph Tracking["OrderedCommitter State"]
direction TB
S1["completed: {2}<br/>lastCommitted: -1<br/>⚠️ GAP"]
S2["completed: {2,4}<br/>lastCommitted: -1<br/>⚠️ GAP"]
S3["completed: {2,4}<br/>lastCommitted: 0<br/>⚠️ GAP"]
S4["completed: {4}<br/>lastCommitted: 2<br/>⚠️ GAP"]
S5["completed: {}<br/>lastCommitted: 4<br/>✅ NO GAP"]
end
subgraph Commits["Actual Commits"]
K1["—"]
K2["—"]
K3["Commit 0"]
K4["Commit 1,2"]
K5["Commit 3,4"]
end
C1 --> S1 --> K1
C2 --> S2 --> K2
C3 --> S3 --> K3
C4 --> S4 --> K4
C5 --> S5 --> K5
style S1 fill:#ff9800,color:#000
style S2 fill:#ff9800,color:#000
style S3 fill:#ff9800,color:#000
style S4 fill:#ff9800,color:#000
style S5 fill:#4caf50,color:#fff
Why gap detection pauses consumption:
- Prevents unbounded memory growth from pending completions
- If a record gets "stuck" (slow processing), new records won't pile up indefinitely
- Once the stuck record completes, consumption resumes automatically
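A minimal sketch of the "commit only the highest contiguous offset" idea (not the actual OrderedCommitter implementation):
```kotlin
// Tracks out-of-order completions and commits only when the offset sequence is contiguous.
class ContiguousOffsetTracker(private val commit: (Long) -> Unit) {
  private val completed = sortedSetOf<Long>()
  private var lastCommitted = -1L

  @Synchronized
  fun onCompleted(offset: Long) {
    completed.add(offset)
    val before = lastCommitted
    // Advance as long as the next expected offset has completed (no gap)
    while (completed.remove(lastCommitted + 1)) lastCommitted++
    if (lastCommitted > before) commit(lastCommitted)
  }

  // A non-empty set after advancing means a gap is still open (e.g. pause polling)
  @Synchronized
  fun hasGap(): Boolean = completed.isNotEmpty()
}
```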
interface KafkaFlowMetrics {
fun recordConsumed(topic: String, consumer: String, partition: Int)
fun recordProcessingSuccess(topic: String, consumer: String, duration: Duration)
fun recordProcessingFailure(topic: String, consumer: String, exception: Throwable)
fun recordInMemoryRetry(topic: String, consumer: String, attempt: Int)
fun recordSentToRetryTopic(topic: String, retryTopic: String, attempt: Int)
fun recordSentToDlt(topic: String, dltTopic: String, totalAttempts: Int)
fun recordExpired(topic: String, consumer: String, reason: String)
}
Built-in: `NoOpMetrics`, `LoggingMetrics`, `CompositeMetrics`
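As an example of plugging in your own implementation, here is a hedged sketch of a Micrometer-backed `KafkaFlowMetrics`. The metric names are made up, and the `Duration` parameter is assumed to be `kotlin.time.Duration`:
```kotlin
import io.micrometer.core.instrument.MeterRegistry
import kotlin.time.Duration
import kotlin.time.toJavaDuration

// Hypothetical Micrometer-backed metrics; assumes the KafkaFlowMetrics interface
// shown above is on the classpath.
class MicrometerKafkaFlowMetrics(private val registry: MeterRegistry) : KafkaFlowMetrics {
  override fun recordConsumed(topic: String, consumer: String, partition: Int) =
    registry.counter("kafka.flow.consumed", "topic", topic, "consumer", consumer).increment()

  override fun recordProcessingSuccess(topic: String, consumer: String, duration: Duration) =
    registry.timer("kafka.flow.processing", "topic", topic, "consumer", consumer)
      .record(duration.toJavaDuration())

  override fun recordProcessingFailure(topic: String, consumer: String, exception: Throwable) =
    registry.counter("kafka.flow.failures", "topic", topic, "exception", exception.javaClass.simpleName)
      .increment()

  override fun recordInMemoryRetry(topic: String, consumer: String, attempt: Int) =
    registry.counter("kafka.flow.retries.in_memory", "topic", topic, "consumer", consumer).increment()

  override fun recordSentToRetryTopic(topic: String, retryTopic: String, attempt: Int) =
    registry.counter("kafka.flow.retries.topic", "topic", topic, "retry_topic", retryTopic).increment()

  override fun recordSentToDlt(topic: String, dltTopic: String, totalAttempts: Int) =
    registry.counter("kafka.flow.dlt", "topic", topic, "dlt_topic", dltTopic).increment()

  override fun recordExpired(topic: String, consumer: String, reason: String) =
    registry.counter("kafka.flow.expired", "topic", topic, "reason", reason).increment()
}
```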
While @KafkaTopic is the idiomatic way to configure consumers, you can override any setting via DefaultTopicResolver. This allows for environment-specific settings (like increasing concurrency in production) without recompiling.
Merging Strategy:
Manual configuration is merged with annotation values at the field level. If a field is set in TopicConfig, it overrides the annotation; otherwise, the annotation value (or its default) is used.
val factory = KafkaFlowFactory.create(
KafkaFlowFactoryConfig(
consumerProperties = properties,
producerProperties = properties,
topicResolver = DefaultTopicResolver(
topicConfigs = mapOf(
"OrderCreatedConsumer" to TopicConfig(
concurrency = 8, // Overrides annotation
maxInMemoryRetries = 10 // Overrides annotation
)
)
)
)
)
Supported override fields in `TopicConfig`:
- `topics`, `groupId`, `concurrency`, `multiplePartitions`
- `retryTopic`, `dltTopic`
- `maxInMemoryRetries`, `backoffMs`, `backoffMultiplier`, `maxBackoffMs`
- `maxRetryTopicAttempts`, `retryTopicBackoffMs`, `retryTopicBackoffMultiplier`, `maxRetryTopicBackoffMs`
- `maxRetryDurationMs`, `maxMessageAgeMs`
For production environments, you'll want to externalize consumer configuration to files (YAML, HOCON, etc.) rather than hardcoding in annotations. This enables environment-specific tuning without recompilation.
1. Define your configuration data classes:
data class AppConfig(
val kafka: KafkaConfig
)
data class KafkaConfig(
val bootstrapServers: String,
val groupId: String,
val consumers: Map<String, ConsumerTopicConfig> = emptyMap()
)
data class ConsumerTopicConfig(
val topics: List<String>? = null,
val concurrency: Int? = null,
val multiplePartitions: Int? = null,
val retryTopic: String? = null,
val dltTopic: String? = null,
val maxInMemoryRetries: Int? = null,
val backoffMs: Long? = null,
val backoffMultiplier: Double? = null,
val maxBackoffMs: Long? = null,
val maxRetryTopicAttempts: Int? = null,
val retryTopicBackoffMs: Long? = null,
val retryTopicBackoffMultiplier: Double? = null,
val maxRetryTopicBackoffMs: Long? = null,
val maxRetryDurationMs: Long? = null,
val maxMessageAgeMs: Long? = null
)
2. Create your configuration file (application.yaml):
kafka:
bootstrap-servers: "kafka-prod:9092"
group-id: "order-service"
consumers:
# Override settings for OrderCreatedConsumer
OrderCreatedConsumer:
concurrency: 8 # Scale up in production
max-in-memory-retries: 5
max-retry-topic-attempts: 5
backoff-ms: 500
max-backoff-ms: 60000
# Override settings for PaymentConsumer
PaymentConsumer:
concurrency: 16
multiple-partitions: 4 # More partition consumers
max-retry-duration-ms: 600000 # 10 min max retry
max-message-age-ms: 3600000 # 1 hour max age
# Define a consumer entirely from config (no @KafkaTopic needed)
AuditLogConsumer:
topics:
- "audit.events"
- "audit.changes"
concurrency: 2
retry-topic: "audit.retry"
dlt-topic: "audit.dlt"3. Load configuration using Hoplite:
@OptIn(ExperimentalHoplite::class)
fun loadConfig(args: Array<String> = emptyArray()): AppConfig = ConfigLoaderBuilder
.default()
.addCommandLineSource(args) // CLI args override file config
.addResourceSource("/application.yaml")
.withExplicitSealedTypes()
.build()
.loadConfigOrThrow<AppConfig>()
4. Map configuration to TopicConfig and create the factory:
fun createKafkaFactory(config: AppConfig): KafkaFlowFactory<String, DomainEvent> {
// Convert file config to TopicConfig map
val topicConfigs = config.kafka.consumers.mapValues { (_, consumerConfig) ->
TopicConfig(
topics = consumerConfig.topics ?: emptyList(),
concurrency = consumerConfig.concurrency,
multiplePartitions = consumerConfig.multiplePartitions,
retryTopic = consumerConfig.retryTopic,
dltTopic = consumerConfig.dltTopic,
maxInMemoryRetries = consumerConfig.maxInMemoryRetries,
backoffMs = consumerConfig.backoffMs,
backoffMultiplier = consumerConfig.backoffMultiplier,
maxBackoffMs = consumerConfig.maxBackoffMs,
maxRetryTopicAttempts = consumerConfig.maxRetryTopicAttempts,
retryTopicBackoffMs = consumerConfig.retryTopicBackoffMs,
retryTopicBackoffMultiplier = consumerConfig.retryTopicBackoffMultiplier,
maxRetryTopicBackoffMs = consumerConfig.maxRetryTopicBackoffMs,
maxRetryDurationMs = consumerConfig.maxRetryDurationMs,
maxMessageAgeMs = consumerConfig.maxMessageAgeMs
)
}
return KafkaFlowFactory.create(
KafkaFlowFactoryConfig(
consumerProperties = mapOf(
ConsumerConfig.BOOTSTRAP_SERVERS_CONFIG to config.kafka.bootstrapServers,
ConsumerConfig.GROUP_ID_CONFIG to config.kafka.groupId,
// ... other properties
),
producerProperties = mapOf(/* ... */),
topicResolver = DefaultTopicResolver(
topicConfigs = topicConfigs
)
)
)
}
Merging Strategy:
File-based configuration merges with @KafkaTopic annotations at the field level:
| Source | Priority | Use Case |
|---|---|---|
| `@KafkaTopic` annotation | Base | Default behavior, compile-time checked |
| File config (`topicConfigs`) | Override | Environment-specific tuning |
// Annotation defines defaults
@KafkaTopic(
name = "orders.created",
concurrency = 4,
maxInMemoryRetries = 3
)
class OrderCreatedConsumer : ConsumerAutoAck<String, OrderEvent> { ... }
// application-prod.yaml overrides for production
kafka:
consumers:
OrderCreatedConsumer:
concurrency: 16 # Override: 4 → 16
# maxInMemoryRetries not specified, keeps annotation value: 3
This pattern gives you:
- ✅ Compile-time safety for topic bindings
- ✅ Environment-specific tuning without redeployment
- ✅ Sensible defaults in code, overrides in config
- ✅ Support for multiple config file formats (YAML, HOCON, JSON, TOML via Hoplite)
A Ktor plugin for Spring Kafka integration. Use Spring Kafka's powerful @KafkaListener with Ktor as your main framework.
- Ktor Plugin: Idiomatic `install(SpringKafka) { }` configuration
- Suspend Listeners: Native Kotlin coroutine support via `@KafkaListener`
- Virtual Threads: Enabled by default for all listeners (Java 21+)
- DI Bridging: Inject Koin/other DI beans into Spring Kafka listeners
- Multi-Cluster: Named factories for different Kafka clusters
- No Spring Boot: Just Spring Kafka core
dependencies {
implementation("io.github.osoykan:ktor-kafka:0.1.0")
}
fun main() {
embeddedServer(Netty, port = 8080) {
// Install Koin first (optional, for DI bridging)
install(Koin) {
modules(module {
single { OrderRepository() }
single { NotificationService() }
})
}
// Install Spring Kafka
install(SpringKafka) {
bootstrapServers = "localhost:9092"
groupId = "my-service"
// Bridge Koin dependencies to Spring
dependencyResolver = KoinDependencyResolver(getKoin())
// Scan for @KafkaListener consumers
scanPackages("com.example.consumers")
// Configure consumers
consumer {
concurrency = 4
pollTimeout = 1.seconds
}
// Configure producers
producer {
acks = "all"
compression = "lz4"
}
}
configureRouting()
}.start(wait = true)
}
Spring Kafka supports both regular and suspend listener functions. Each has different commit behavior and performance characteristics.
@Component
class OrderConsumer(
private val orderRepository: OrderRepository, // From Koin!
private val eventMetrics: EventMetricsService // From Spring!
) {
@KafkaListener(topics = ["orders.created"])
suspend fun consume(record: ConsumerRecord<String, OrderEvent>) {
val order = record.value()
orderRepository.save(order)
eventMetrics.recordProcessed("OrderCreated")
logger.info { "Processed order: ${order.id}" }
}
}
Why special handling is needed:
With normal AckMode.BATCH or RECORD, Spring Kafka commits the offset immediately after your listener method returns. But suspend functions (and Mono<Void>, CompletableFuture<Void>) return instantly while actual work happens asynchronously. Without special handling, the offset gets committed before processing finishes—meaning you lose the message if your app crashes mid-processing.
What Spring Kafka does automatically:
When it detects an async return type, Spring:
- Switches to `AckMode.MANUAL` internally
- Waits for the async operation to complete (awaits the suspend function)
- Only then acknowledges the offset
Out-of-order commit handling:
Say you receive records A, B, C from the same partition:
Record A (offset 1) - still processing
Record B (offset 2) - complete ✓
Record C (offset 3) - complete ✓
→ nothing committed yet (waiting for A)
Record A (offset 1) - complete ✓
→ commits offset 3 (covers A, B, C in one commit)
Spring tracks which offsets have completed and only commits the highest contiguous offset. This preserves at-least-once semantics despite out-of-order completion, without forcing sequential processing.
- Requires the `kotlinx-coroutines-reactor` dependency (see the snippet below)
- See Spring Kafka Async Returns for more details
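For reference, that dependency in the Gradle Kotlin DSL (version omitted; align it with your coroutines version or BOM):
```kotlin
dependencies {
  // Needed so Spring Kafka can await suspend listener methods before committing
  implementation("org.jetbrains.kotlinx:kotlinx-coroutines-reactor")
}
```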
@Component
class OrderConsumer(
private val orderRepository: OrderRepository
) {
@KafkaListener(topics = ["orders.created"])
fun consume(record: ConsumerRecord<String, OrderEvent>) {
orderRepository.save(record.value())
}
}How it works:
- Uses `AckMode.BATCH` (Spring's default)
- Commits all offsets after the entire poll batch is processed
- Single network round-trip per batch (efficient)
Calling suspend functions from non-suspend listeners:
If your services use suspend functions, use runBlocking to bridge:
@Component
class OrderConsumer(
private val orderRepository: OrderRepository // has suspend functions
) {
@KafkaListener(topics = ["orders.created"])
fun consume(record: ConsumerRecord<String, OrderEvent>) {
runBlocking(Dispatchers.IO) {
// Bridge to coroutine world
orderRepository.save(record.value()) // suspend fun
notificationService.sendAsync(record.value()) // suspend fun
}
}
}
| Aspect | Suspend Listener | Non-Suspend Listener |
|---|---|---|
| AckMode | `MANUAL` (auto-switched) | `BATCH` (default) |
| Commits | After highest contiguous offset completes | After poll batch completes |
| Processing | Concurrent (many records in parallel) | Sequential (one at a time per thread) |
| Scalability | High (non-blocking I/O) | Limited by thread count |
Use suspend when:
- Your processing involves I/O (HTTP calls, database queries, external services)
- You need high concurrency - many records processed in parallel
- You want to scale throughput without adding threads
- Natural coroutine integration is desired
Use non-suspend when:
- Processing is CPU-bound (no I/O wait time to exploit)
- You want simpler sequential processing per thread
- Predictable, ordered processing is preferred
The plugin bridges external DI containers (like Koin) to Spring:
// 1. Implement DependencyResolver
class KoinDependencyResolver(private val koin: Koin) : DependencyResolver {
override fun <T : Any> resolve(type: KClass<T>): T? =
koin.getOrNull(type)
override fun <T : Any> resolve(type: KClass<T>, name: String): T? =
koin.getOrNull(type, named(name))
override fun <T : Any> resolveAll(type: KClass<T>): List<T> =
koin.getAll(type)
override fun canResolve(type: KClass<*>): Boolean =
koin.getOrNull(type) != null
}
// 2. Configure in plugin
install(SpringKafka) {
dependencyResolver = KoinDependencyResolver(getKoin())
consumerPackages("com.example.consumers")
}
// 3. Now Spring @KafkaListener can inject Koin beans!
@Component
class MyConsumer(
private val koinService: MyKoinService // Resolved from Koin
) {
@KafkaListener(topics = ["my-topic"])
suspend fun handle(record: ConsumerRecord<String, String>) {
koinService.process(record.value())
}
}
fun Application.configureRouting() {
val kafkaTemplate = kafkaTemplate<String, OrderEvent>()
routing {
post("/orders") {
val order = call.receive<CreateOrderRequest>()
val event = OrderCreatedEvent(order.id, order.items)
kafkaTemplate.send("orders.created", order.id, event)
call.respond(HttpStatusCode.Created, order)
}
}
}
Configure multiple Kafka clusters:
install(SpringKafka) {
// Default cluster
bootstrapServers = "cluster-a:9092"
groupId = "my-service"
// Additional consumer factory for cluster B
consumerFactory("clusterB") {
bootstrapServers = "cluster-b:9092"
groupId = "my-service-cluster-b"
concurrency = 2
}
// Additional producer for analytics
producerFactory("analytics") {
bootstrapServers = "analytics:9092"
acks = "1"
compression = "snappy"
}
}
Use named factories:
// Consumer with named factory
@Component
class ClusterBConsumer {
@KafkaListener(
topics = ["events"],
containerFactory = "clusterBKafkaListenerContainerFactory"
)
suspend fun handle(record: ConsumerRecord<String, Event>) { }
}
// Producer with named template
val analyticsTemplate = application.kafkaTemplate<String, AnalyticsEvent>("analytics")
Configure custom error handlers for consumer errors:
install(SpringKafka) {
bootstrapServers = "localhost:9092"
consumer {
// Custom error handler with retry and dead-letter publishing
errorHandler = DefaultErrorHandler(
DeadLetterPublishingRecoverer(kafkaTemplate),
FixedBackOff(1000L, 3L) // 1s delay, 3 retries
)
}
}
Available error handlers:
- `DefaultErrorHandler` - Retries with backoff, optional dead-letter recovery
- `CommonLoggingErrorHandler` - Logs errors only, no retry
- `CommonDelegatingErrorHandler` - Delegates based on exception type
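As an illustration using plain Spring Kafka APIs (not specific to this plugin), handlers can be combined so that, for example, bad input skips retries while everything else gets the default backoff:
```kotlin
import org.springframework.kafka.listener.CommonDelegatingErrorHandler
import org.springframework.kafka.listener.CommonLoggingErrorHandler
import org.springframework.kafka.listener.DefaultErrorHandler
import org.springframework.util.backoff.FixedBackOff

// Default: retry 3 times, 1s apart; IllegalArgumentException (e.g. bad input) is only logged.
val errorHandler = CommonDelegatingErrorHandler(
  DefaultErrorHandler(FixedBackOff(1000L, 3L))
).apply {
  addDelegate(IllegalArgumentException::class.java, CommonLoggingErrorHandler())
}
```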
Named consumer factories can also have their own error handlers:
consumerFactory("critical") {
bootstrapServers = "localhost:9092"
errorHandler = DefaultErrorHandler(
FixedBackOff(5000L, 10L) // More aggressive retry
)
}
For advanced use cases, customize factories after creation:
install(SpringKafka) {
bootstrapServers = "localhost:9092"
// Customize ConsumerFactory
customizeConsumerFactory {
// 'this' is ConsumerFactory<Any, Any>
addListener(object : ConsumerFactory.Listener<Any, Any> {
override fun consumerAdded(id: String, consumer: Consumer<Any, Any>) {
logger.info { "Consumer added: $id" }
}
})
}
// Customize ProducerFactory
customizeProducerFactory {
// 'this' is ProducerFactory<Any, Any>
addListener(object : ProducerFactory.Listener<Any, Any> {
override fun producerAdded(id: String, producer: Producer<Any, Any>) {
logger.info { "Producer added: $id" }
}
})
}
// Customize ConcurrentKafkaListenerContainerFactory
customizeContainerFactory {
// 'this' is ConcurrentKafkaListenerContainerFactory<Any, Any>
// Record interceptor
setRecordInterceptor { record, consumer ->
logger.debug { "Intercepted: ${record.topic()}" }
record
}
// Batch listener mode
setBatchListener(true)
// Custom after-rollback processor
setAfterRollbackProcessor(DefaultAfterRollbackProcessor())
}
}
Named factories also support customizers:
consumerFactory("custom") {
bootstrapServers = "localhost:9092"
customizeConsumerFactory {
// Customize this specific consumer factory
}
customizeContainerFactory {
// Customize this specific container factory
setRecordInterceptor { record, _ ->
// Add tracing headers, etc.
record
}
}
}
producerFactory("custom") {
bootstrapServers = "localhost:9092"
customizeProducerFactory {
// Customize this specific producer factory
}
}
install(SpringKafka) {
// Connection
bootstrapServers = "localhost:9092"
groupId = "my-consumer-group"
// Serialization (default: String)
keySerializer = StringSerializer::class
valueSerializer = JsonSerializer::class
keyDeserializer = StringDeserializer::class
valueDeserializer = JsonDeserializer::class
// DI bridging
dependencyResolver = KoinDependencyResolver(getKoin())
// Component scanning
consumerPackages("com.example.consumers", "com.example.handlers")
// Consumer settings
consumer {
concurrency = 4 // Concurrent consumers
pollTimeout = 1.seconds // Poll timeout
autoOffsetReset = "earliest" // earliest, latest, none
enableAutoCommit = false // Manual commits (recommended)
maxPollRecords = 500 // Max records per poll
errorHandler = DefaultErrorHandler() // Custom error handler
}
// Producer settings
producer {
acks = "all" // 0, 1, all
retries = 3 // Retry count
compression = "lz4" // none, gzip, snappy, lz4, zstd
idempotence = true // Idempotent producer
}
// Additional properties
consumerProperty("max.poll.interval.ms", 300000)
producerProperty("linger.ms", 5)
}
// Get KafkaTemplate
val template = application.kafkaTemplate<String, MyEvent>()
val namedTemplate = application.kafkaTemplate<String, MyEvent>("analytics")
// Get Spring beans
val service = application.springKafkaBean<MySpringService>()
// Check status
val running = application.isSpringKafkaRunning()
val hasFactory = application.hasListenerContainerFactory("clusterB")
val hasTemplate = application.hasKafkaTemplate("analytics")
| Example | Description |
|---|---|
| `examples/ktor-kafka-flow/` | Full kafka-flow library usage with Ktor and Koin |
| `examples/ktor-spring-kafka/` | ktor-kafka plugin with suspend listeners and DI bridging |
# Start Kafka (using Docker)
docker run -d --name kafka -p 9092:9092 \
-e KAFKA_CFG_NODE_ID=0 \
-e KAFKA_CFG_PROCESS_ROLES=controller,broker \
-e KAFKA_CFG_LISTENERS=PLAINTEXT://:9092,CONTROLLER://:9093 \
-e KAFKA_CFG_ADVERTISED_LISTENERS=PLAINTEXT://localhost:9092 \
-e KAFKA_CFG_CONTROLLER_QUORUM_VOTERS=0@localhost:9093 \
-e KAFKA_CFG_CONTROLLER_LISTENER_NAMES=CONTROLLER \
bitnami/kafka:latest
# Run example
./gradlew :examples:ktor-spring-kafka:run
Both libraries support dual testing modes:
| Mode | Implementation | Use Case |
|---|---|---|
| `embedded` | EmbeddedKafkaBroker | Local TDD (~5s startup) |
| `testcontainers` | ConfluentKafkaContainer | CI, production-like |
Automatic selection:
- `CI=true` → Testcontainers
- Otherwise → EmbeddedKafka
Manual override:
KAFKA_TEST_MODE=testcontainers ./gradlew test
With reactor-kafka being discontinued, this project explores an alternative way to build infinite Kafka flows. kafka-kit provides a modern, Kotlin-first approach to Kafka consumption using Kotlin Flows, offering a clean and idiomatic way to handle Kafka messages with built-in retry mechanisms, dead-letter topics, and comprehensive metrics.
Alternatives: While alternatives like kotlin-kafka exist, they provide low-level Flow bindings without high-level consumer abstractions. With kotlin-kafka, you still need to manually wire consumers, retries, and error handling. kafka-kit takes a similar approach but also provides a complete, opinionated solution with automatic retry strategies, dead-letter topic handling, and comprehensive metrics out of the box.
In future releases of kafka-kit, we plan to support plugging in a custom poller (a KafkaPoller interface) so that alternative Kafka client libraries and custom polling strategies can be used. That would also make it possible to use kotlin-kafka as a poller implementation.
Apache License 2.0