Spring Cloud Stream is a framework for building highly scalable, event-driven microservices that are connected by a shared messaging system. In simple terms, it’s a powerful tool that takes away the complexity of communicating with message brokers like RabbitMQ or Apache Kafka, allowing you to focus purely on your application’s business logic.

The framework provides an abstraction layer over the messaging infrastructure. This means your application code doesn’t need to know the specific details of the message broker it’s talking to. Whether you’re using Kafka, RabbitMQ, or another supported system, the code you write to send and receive messages remains virtually the same. This is achieved through the use of binders. A binder is an implementation that connects the framework’s internal messaging channels to a specific external message broker.

Key Concepts

  • Binder: The component that provides integration with a specific messaging system (e.g., a Kafka binder or a RabbitMQ binder). You simply add the correct binder dependency to your project, and Spring Cloud Stream takes care of the rest via Spring Boot’s auto-configuration.
  • Binding: The bridge between your application’s code and the external messaging system. A binding connects a message channel (an input or output channel) to a destination on the message broker.
  • Functions: Spring Cloud Stream has shifted to a functional programming model. Instead of the older annotation-based model (@EnableBinding and @StreamListener, since deprecated and removed in recent versions), you define your messaging logic as Supplier, Consumer, or Function beans.
    • A Supplier provides data to an output channel.
    • A Consumer receives data from an input channel.
    • A Function receives data from an input channel, processes it, and returns a result to an output channel.

This functional approach, combined with the power of Spring Boot, makes it incredibly easy to create robust, production-ready, and portable messaging applications. It’s an excellent choice for a software architect building modern, decoupled microservice architectures in the Java ecosystem.

A Complete Streaming Pipeline: Supplier -> Function -> Consumer

Here is a complete, end-to-end example that connects a Supplier, Function, and Consumer together to create a single message-processing pipeline. We will use three separate beans, but they will be configured to work as a single flow.

1. The Supplier (Producer)

The Supplier bean is the source of our data stream. It will generate a simple message with an incrementing number.

import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Supplier;

import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

@Configuration
public class MySupplierConfig {
    private final AtomicInteger counter = new AtomicInteger();

    @Bean
    public Supplier<String> supplyMessage() {
        // Invoked by the framework's poller to produce each message
        return () -> "Message " + counter.incrementAndGet();
    }
}

Configuration: The supplyMessage bean’s output binding (supplyMessage-out-0) will be configured to send messages to a destination named messages-to-process.
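By default, the framework polls a non-reactive Supplier roughly once per second. The polling rate is tunable; as a sketch, note that the property location depends on your version (Spring Cloud Stream 3.x documents spring.cloud.stream.poller.fixed-delay, while 4.x delegates to Spring Integration's poller properties):

spring:
  integration:
    poller:
      fixed-delay: 500   # poll the supplier every 500 ms (Spring Cloud Stream 4.x form)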

2. The Function (Processor)

The Function bean acts as a processor. It consumes the message from the supplier, converts the text to uppercase, and then sends it to a new destination.

import java.util.function.Function;

import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

@Configuration
public class MyFunctionConfig {

    @Bean
    public Function<String, String> processMessage() {
        return message -> {
            System.out.println("Received by Function: " + message);
            String processedMessage = message.toUpperCase();
            System.out.println("Sending from Function: " + processedMessage);
            return processedMessage;
        };
    }
}

Configuration: The processMessage function’s input binding (processMessage-in-0) will be configured to listen to messages-to-process, and its output binding (processMessage-out-0) will be configured to send the processed message to messages-processed.

3. The Consumer (Sink)

The Consumer bean is the final destination. It receives the processed message and prints it to the console, finalizing the stream.

import java.util.function.Consumer;

import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

@Configuration
public class MyConsumerConfig {

    @Bean
    public Consumer<String> logProcessedMessage() {
        return message -> {
            System.out.println("Final Message received by Consumer: " + message);
        };
    }
}

Configuration: The logProcessedMessage consumer’s input binding (logProcessedMessage-in-0) will be configured to listen to the messages-processed destination.

All-in-One application.yml Configuration

To connect these three components into a single, cohesive pipeline, you would use a single configuration file like this.

spring:
  cloud:
    function:
      definition: supplyMessage;processMessage;logProcessedMessage
    stream:
      bindings:
        # Supplier output binding sends to this destination
        supplyMessage-out-0:
          destination: messages-to-process

        # Function input binding receives from the supplier's destination
        processMessage-in-0:
          destination: messages-to-process

        # Function output binding sends to this destination
        processMessage-out-0:
          destination: messages-processed

        # Consumer input binding receives from the function's destination
        logProcessedMessage-in-0:
          destination: messages-processed

By setting the destinations correctly, you have now created a complete, loosely coupled streaming pipeline. The Supplier microservice is completely unaware of the Function or Consumer, and vice versa. They simply send and receive messages on the configured destinations. Note that because all three beans live in the same application here, spring.cloud.function.definition is set to activate all of them; this property is covered in detail below.

Branching a Stream: One Supplier to Multiple Functions and Consumers

This powerful pattern demonstrates a branching pipeline where a single source of messages is processed independently by different downstream services. This is a core tenet of event-driven architectures and is easily configured with Spring Cloud Stream.

1. The Supplier (Producer)

The supplier remains the same, producing a stream of messages to a single destination.

@Configuration
public class MySupplierConfig {
    private final AtomicInteger counter = new AtomicInteger();

    @Bean
    public Supplier<String> supplyMessage() {
        return () -> "Message " + counter.incrementAndGet();
    }
}

Configuration: The supplier’s output binding will be configured to send messages to a shared destination, messages-to-branch.

2. The Multiple Functions (Processors)

Now, we define two distinct functions that will process the same incoming messages in different ways.

  • Uppercase Function: This function converts the message to uppercase.

@Configuration
public class UppercaseFunctionConfig {

    @Bean
    public Function<String, String> uppercase() {
        return message -> {
            System.out.println("Uppercase Function received: " + message);
            return message.toUpperCase();
        };
    }
}

  • Timestamp Function: This function appends a timestamp to the message.

@Configuration
public class TimestampFunctionConfig {

    @Bean
    public Function<String, String> timestamp() {
        return message -> {
            System.out.println("Timestamp Function received: " + message);
            return message + " at " + Instant.now().toString();
        };
    }
}

Configuration: Both functions will have their input bindings configured to listen to the messages-to-branch destination. Their output bindings will be set to separate destinations.

3. The Multiple Consumers (Sinks)

Finally, we create a consumer for each of the functions to receive their specific output.

  • Uppercase Consumer:

@Configuration
public class UppercaseConsumerConfig {

    @Bean
    public Consumer<String> logUppercase() {
        return message -> {
            System.out.println("Received by Uppercase Consumer: " + message);
        };
    }
}

  • Timestamp Consumer:

@Configuration
public class TimestampConsumerConfig {

    @Bean
    public Consumer<String> logTimestamp() {
        return message -> {
            System.out.println("Received by Timestamp Consumer: " + message);
        };
    }
}

Configuration: Each consumer’s input binding will be configured to listen to the specific destination corresponding to its function.

All-in-One application.yml Configuration for Branching

This single application.yml ties the entire branching pipeline together.

spring:
  cloud:
    function:
      definition: supplyMessage;uppercase;timestamp;logUppercase;logTimestamp
    stream:
      bindings:
        # The single supplier sends to this destination
        supplyMessage-out-0:
          destination: messages-to-branch

        # Both functions listen to the same destination as the supplier's output
        uppercase-in-0:
          destination: messages-to-branch
        timestamp-in-0:
          destination: messages-to-branch

        # Each function sends to a unique destination
        uppercase-out-0:
          destination: uppercase-messages
        timestamp-out-0:
          destination: timestamped-messages

        # Each consumer listens to its unique destination
        logUppercase-in-0:
          destination: uppercase-messages
        logTimestamp-in-0:
          destination: timestamped-messages

The spring.cloud.function.definition property explicitly declares which beans to bind. It is required here: automatic discovery only works when the application contains a single functional bean, and this application has five. This configuration ensures that every message from the supplyMessage supplier is sent to the messages-to-branch destination, where both the uppercase and timestamp functions receive and process it independently. The final output of each function is then routed to its own consumer.

Dynamically Creating Bindings Programmatically

In some scenarios, you may not know the destination names at application startup. For example, you might need to send a message to a topic or queue whose name is determined by user input or a database query at runtime. For these cases, Spring Cloud Stream provides a powerful component called StreamBridge that allows you to dynamically create bindings and send messages.

The StreamBridge is an abstraction over the underlying binder, giving you a way to interact with the messaging system without a pre-defined functional bean.

Code Example: Using StreamBridge with a REST Endpoint

You can inject StreamBridge into any Spring component (like a REST controller or a service class) and use its send method. The first argument to the send method is the destination name, and the second is the message payload. If a binding for that destination doesn’t exist, StreamBridge will provision it on the fly, a process known as dynamic binding provisioning.

import org.springframework.cloud.stream.function.StreamBridge;
import org.springframework.web.bind.annotation.PathVariable;
import org.springframework.web.bind.annotation.PostMapping;
import org.springframework.web.bind.annotation.RequestBody;
import org.springframework.web.bind.annotation.RestController;

@RestController
public class DynamicSenderController {

    private final StreamBridge streamBridge;

    public DynamicSenderController(StreamBridge streamBridge) {
        this.streamBridge = streamBridge;
    }

    @PostMapping("/send/{destination}")
    public String sendToDynamicDestination(@PathVariable String destination, @RequestBody String message) {
        // The destination is determined at runtime from the URL path
        streamBridge.send(destination, message);
        return "Message '" + message + "' sent dynamically to destination: " + destination;
    }
}
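StreamBridge is not limited to String payloads. Any object the configured message converters can handle may be sent, and an overload accepts an explicit content type. A sketch under assumptions (the UserSignupService class and the user-registrations destination are illustrative; UserRegistration is the POJO from the Content-Type section below):

import org.springframework.cloud.stream.function.StreamBridge;
import org.springframework.stereotype.Service;
import org.springframework.util.MimeType;

@Service
public class UserSignupService {

    private final StreamBridge streamBridge;

    public UserSignupService(StreamBridge streamBridge) {
        this.streamBridge = streamBridge;
    }

    public void publish(UserRegistration registration) {
        // The payload goes through the normal message converters (JSON by default);
        // the MimeType argument makes the output content type explicit.
        streamBridge.send("user-registrations", registration,
                MimeType.valueOf("application/json"));
    }
}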

Configuration for Dynamic Bindings

When using StreamBridge for dynamic bindings, you typically don’t need any spring.cloud.stream.bindings configuration for the producer side in your application.yml. Spring Cloud Stream handles the provisioning automatically. However, if you have other functional beans (like Supplier, Function, or Consumer), you might still need to specify them with the spring.cloud.function.definition property to ensure they are all discovered and configured correctly.
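As a sketch, an application.yml for an application that publishes through StreamBridge while also hosting the logProcessedMessage consumer from earlier might look like this; only the consumer side needs explicit bindings:

spring:
  cloud:
    function:
      definition: logProcessedMessage    # functional beans still have to be declared
    stream:
      bindings:
        logProcessedMessage-in-0:
          destination: messages-processed
        # No producer-side bindings needed: StreamBridge provisions them at runtime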

This dynamic approach offers maximum flexibility and is ideal for microservices that produce messages in response to external events, such as REST API calls.

Explicitly Specifying Functions with spring.cloud.function.definition

By default, Spring Cloud Stream automatically detects and binds a single Supplier, Function, or Consumer bean. However, when you have multiple beans of these types in your application, this automatic discovery is disabled to avoid ambiguity. In such cases, you must explicitly tell Spring Cloud Stream which beans to use for messaging by using the spring.cloud.function.definition property in your configuration file.

This property takes a semicolon-delimited list of the bean names (i.e., the @Bean method names) that you want to bind. The | symbol can also be used to compose functions, as shown in Example 2 below.

Example 1: Specifying a Single Function

Even with only one function, it’s considered a best practice to be explicit. If you have a single Function bean named processMessage, you would specify it like this:

spring:
  cloud:
    function:
      definition: processMessage

Example 2: Specifying a Pipeline

To create a simple pipeline from a supplier to a function to a consumer, you can compose them with the | symbol. Composition happens in-process: the framework wires the output of each stage directly into the input of the next, so composed stages do not exchange messages through the broker.

spring:
  cloud:
    function:
      definition: supplyMessage|processMessage|logProcessedMessage

This is a more concise way of defining a linear flow: the composed chain behaves as a single unit, so the intermediate destinations and bindings from the earlier example are no longer needed.

Example 3: Specifying Unrelated Functions

If your application has multiple messaging beans that are not part of a single pipeline, you can simply list their names. The framework will then create bindings for each of them individually based on their default naming conventions.

spring:
  cloud:
    function:
      definition: supplyMessage;uppercase;timestamp;logUppercase;logTimestamp

This example is the one used in the “Branching a Stream” section above, demonstrating how to enable all the individual components in a complex branching scenario.

Consumer Groups for Scalability

In a distributed microservices environment, scalability is a primary concern. Consumer groups are a key mechanism provided by Spring Cloud Stream and underlying message brokers to handle high message volumes by distributing the workload across multiple application instances.

A consumer group ensures that a message from a topic is delivered to only one instance within the group. This prevents duplicate processing when you scale up your application by running multiple instances. When a new instance is added to the group, the message broker automatically rebalances the workload among all the members.

You can configure consumer groups by simply adding the group property to your input binding configuration.

application.yml Example for a Consumer Group:

Let’s imagine you want to scale your logProcessedMessage consumer. You would define a consumer group like this:

spring:
  cloud:
    stream:
      bindings:
        # The consumer's input binding
        logProcessedMessage-in-0:
          destination: messages-processed
          group: my-processing-group

By adding group: my-processing-group, all instances of your application with this configuration will join the same consumer group. The message broker will ensure that messages sent to messages-processed are distributed among these instances. If you start a new instance of your application with this same configuration, it will automatically join the my-processing-group and start receiving messages. This is the simple yet powerful way Spring Cloud Stream enables horizontal scaling for your applications. Conversely, if you omit the group property, each instance joins its own anonymous, auto-generated group and therefore receives every message (publish-subscribe behavior rather than competing consumers).

Content-Type and Serialization

In real-world applications, messages are rarely simple strings. They are often complex data structures like JSON or Avro. Spring Cloud Stream simplifies the process of serialization and deserialization (the conversion between a Java object and a byte stream) by using message converters.

The framework automatically uses the contentType binding property to determine how to convert messages. By default, it uses application/json and handles the conversion automatically, allowing your functional beans to operate directly on Java objects.

Code Example with a Custom Object

First, define a simple Plain Old Java Object (POJO) for your message payload.

public class UserRegistration {
    private String username;
    private String email;

    public UserRegistration() { }  // no-arg constructor needed by Jackson
    public UserRegistration(String username, String email) {
        this.username = username;
        this.email = email;
    }
    public String getUsername() { return username; }
    public void setUsername(String username) { this.username = username; }
    public String getEmail() { return email; }
    public void setEmail(String email) { this.email = email; }
}

Then, you can update your Supplier to produce this object and your Consumer to receive it. The framework will handle the JSON serialization and deserialization for you.

  • Supplier:

@Bean
public Supplier<UserRegistration> supplyUserRegistration() {
    return () -> new UserRegistration("john.doe", "john.doe@example.com");
}

  • Consumer:

@Bean
public Consumer<UserRegistration> logUserRegistration() {
    return registration -> {
        System.out.println("New user registered: " + registration.getUsername());
    };
}

Configuration

To ensure the framework knows to use JSON for the messages, you would explicitly set the contentType property in your application.yml. This is crucial for interoperability with other services.

spring:
  cloud:
    stream:
      bindings:
        supplyUserRegistration-out-0:
          destination: user-registrations
          contentType: application/json
        logUserRegistration-in-0:
          destination: user-registrations
          contentType: application/json

By setting contentType: application/json, Spring Cloud Stream automatically uses the appropriate Jackson converter to handle the conversion of your UserRegistration object to and from a JSON string, eliminating the need for manual serialization code.
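The same conversion applies to Function beans, which can accept and return POJOs directly. A minimal sketch (the normalizeRegistration bean name and its trimming and lowercasing logic are illustrative assumptions):

@Bean
public Function<UserRegistration, UserRegistration> normalizeRegistration() {
    // The incoming JSON payload is deserialized to UserRegistration before this
    // code runs; the returned object is serialized back to JSON on the way out.
    return reg -> new UserRegistration(reg.getUsername().trim(),
                                       reg.getEmail().toLowerCase());
}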

Reactive Programming with Flux and Mono

Spring Cloud Stream has first-class support for reactive programming with Project Reactor’s Flux and Mono. This enables you to build non-blocking, asynchronous message processing pipelines that are highly efficient and scalable. Instead of processing messages one by one in an imperative way, you can operate on continuous streams of data.

Key Reactive Types

  • Flux: A Flux represents a stream of zero, one, or more items, potentially infinite. This is the ideal type for a Supplier that produces a continuous stream of messages or a Function that processes multiple inputs and produces multiple outputs.
  • Mono: A Mono represents a stream of at most one item. It’s often used for a single, final result.

Reactive Examples

The transition to reactive is straightforward; you simply change the return type of your Supplier, Function, or Consumer bean to a reactive type.

Reactive Supplier: Instead of a Supplier<String>, you can return a Supplier<Flux<String>> to emit a continuous, non-blocking stream of messages. Unlike an imperative supplier, which is polled repeatedly, a reactive supplier is invoked only once; the framework subscribes to the returned Flux and forwards everything it emits.

import java.time.Duration;
import java.util.function.Supplier;

import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

import reactor.core.publisher.Flux;

@Configuration
public class ReactiveSupplierConfig {

    @Bean
    public Supplier<Flux<String>> reactiveSupply() {
        return () -> Flux.interval(Duration.ofSeconds(1))
                         .map(i -> "Reactive Message " + i);
    }
}

This bean will emit a new message every second without blocking a thread.

Reactive Function: A reactive Function can transform a Flux of incoming messages into a Flux of outgoing messages, allowing you to perform operations on the stream itself.

import java.util.function.Function;

import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

import reactor.core.publisher.Flux;

@Configuration
public class ReactiveFunctionConfig {

    @Bean
    public Function<Flux<String>, Flux<String>> reactiveUppercase() {
        return flux -> flux.map(String::toUpperCase);
    }
}

This function will consume a stream of messages and produce a new stream with each message converted to uppercase.
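Because a reactive Function receives the entire stream rather than individual messages, you can also apply time-based operators that have no imperative equivalent. A sketch (the countPerWindow bean name and the five-second window are assumptions):

import java.time.Duration;
import java.util.function.Function;

import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

import reactor.core.publisher.Flux;

@Configuration
public class WindowedCountConfig {

    @Bean
    public Function<Flux<String>, Flux<String>> countPerWindow() {
        // Emit one summary message for every five-second window of input
        return flux -> flux.window(Duration.ofSeconds(5))
                           .flatMap(window -> window.count()
                                                    .map(count -> "Messages in window: " + count));
    }
}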

Reactive Consumer: A reactive Consumer will receive a Flux of messages and perform a terminal operation, such as logging each item.

import java.util.function.Consumer;

import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

import reactor.core.publisher.Flux;

@Configuration
public class ReactiveConsumerConfig {

    @Bean
    public Consumer<Flux<String>> reactiveLog() {
        // With Consumer<Flux<...>>, this code receives the stream itself and is
        // responsible for subscribing to it.
        return flux -> flux.subscribe(message -> {
            System.out.println("Received by Reactive Consumer: " + message);
        });
    }
}

This consumer will subscribe to the incoming stream and log each message as it arrives. By embracing these reactive patterns, your Spring Cloud Stream applications can achieve much higher throughput and better resource utilization, especially for I/O-heavy workloads.
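An alternative to Consumer<Flux<String>> (which has to subscribe itself, as above) is to model the sink as a Function returning Mono<Void>, so the framework can subscribe to the completion signal on your behalf; the Spring Cloud Stream documentation generally recommends this style for reactive sinks. A sketch (the reactiveSink bean name is an assumption):

import java.util.function.Function;

import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

import reactor.core.publisher.Flux;
import reactor.core.publisher.Mono;

@Configuration
public class ReactiveSinkConfig {

    @Bean
    public Function<Flux<String>, Mono<Void>> reactiveSink() {
        return flux -> flux
                .doOnNext(message -> System.out.println("Handled reactively: " + message))
                .then();   // Mono<Void> that completes when the stream completes
    }
}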

Error Handling and Dead-Letter Queues (DLQ)

In any production system, handling errors gracefully is essential. Spring Cloud Stream provides robust mechanisms for message retries and for sending messages that repeatedly fail to process to a Dead-Letter Queue (DLQ). This prevents a single corrupt message from blocking your entire processing pipeline.

How It Works

When a Consumer or Function bean throws an exception while processing a message, Spring Cloud Stream’s binder implementation can be configured to automatically retry the operation a set number of times. This handles transient issues, such as a temporary network outage or a database connection failure.

If a message continues to fail after all retry attempts are exhausted, the binder can be configured to send the message to a dedicated DLQ. The DLQ is a separate destination where these “poison pill” messages can be stored, inspected, and manually reprocessed later, preventing them from clogging the main queue.
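Any exception thrown from the functional bean is what triggers this retry-then-DLQ flow. For illustration, a hypothetical variant of the logProcessedMessage consumer that rejects certain payloads:

@Bean
public Consumer<String> logProcessedMessage() {
    return message -> {
        if (message.contains("poison")) {   // hypothetical failure condition
            // Throwing causes the binder to redeliver up to maxAttempts times,
            // then route the message to the DLQ if one is configured.
            throw new IllegalStateException("Cannot process: " + message);
        }
        System.out.println("Final Message received by Consumer: " + message);
    };
}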

Configuration for Dead-Letter Queues

To enable DLQ functionality, you configure retries with the common maxAttempts consumer property, and enable the DLQ itself with binder-specific properties: republishToDlq (together with autoBindDlq) for RabbitMQ, or enableDlq for Kafka.

application.yml Example (RabbitMQ binder):

spring:
  cloud:
    stream:
      bindings:
        # The consumer's input binding with retry configuration
        logProcessedMessage-in-0:
          destination: messages-processed
          group: my-processing-group
          consumer:
            maxAttempts: 3
      rabbit:
        bindings:
          logProcessedMessage-in-0:
            consumer:
              autoBindDlq: true
              republishToDlq: true

  • maxAttempts: The number of times processing is attempted before the message is considered failed. A value of 1 disables retries.
  • autoBindDlq (RabbitMQ): Instructs the binder to provision a dead-letter queue alongside the main queue, named by convention <destination>.<group>.dlq.
  • republishToDlq (RabbitMQ): When true, a message that has exhausted its retries is republished to the DLQ with additional headers describing the failure. The Kafka binder's equivalent property is enableDlq, with a DLQ topic named error.<destination>.<group> by default.
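For comparison, here is a sketch of the equivalent configuration for the Kafka binder, where the DLQ switch is enableDlq:

spring:
  cloud:
    stream:
      bindings:
        logProcessedMessage-in-0:
          destination: messages-processed
          group: my-processing-group
          consumer:
            maxAttempts: 3
      kafka:
        bindings:
          logProcessedMessage-in-0:
            consumer:
              enableDlq: true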

By implementing this configuration, you build a resilient message processing pipeline that can gracefully handle unexpected errors without interrupting the flow of other messages.

Summary of Key YAML Configuration Fields

The following is a quick reference for the most important application.yml properties used to configure Spring Cloud Stream applications.

  • spring.cloud.function.definition: Explicitly defines the beans to be used for messaging. Required when you have more than one Supplier, Function, or Consumer bean in your application. Example: supplyMessage;processMessage
  • spring.cloud.stream.bindings.<name>.destination: The name of the message broker destination (e.g., Kafka topic or RabbitMQ exchange) to which the binding connects. Example: messages-to-process
  • spring.cloud.stream.bindings.<name>.group: The consumer group name, used to distribute messages among multiple instances of the same consumer and prevent duplicate processing. Only applicable to input bindings. Example: my-processing-group
  • spring.cloud.stream.bindings.<name>.contentType: The message content type, which Spring Cloud Stream uses to automatically serialize and deserialize message payloads. Example: application/json
  • spring.cloud.stream.bindings.<name>.consumer.maxAttempts: The number of processing attempts before a message is considered a failure; a value of 1 disables retries. Example: 3
  • spring.cloud.stream.rabbit.bindings.<name>.consumer.republishToDlq: RabbitMQ-specific property; when true, messages that exhaust their retry attempts are automatically sent to a Dead-Letter Queue (DLQ) for later inspection. The Kafka binder's counterpart is spring.cloud.stream.kafka.bindings.<name>.consumer.enableDlq. Example: true

These properties give you fine-grained control over your Spring Cloud Stream applications, from simple bindings to advanced features like scalability and error handling.

Supported Binders and Their Limitations

The core of Spring Cloud Stream’s power lies in its pluggable binder abstraction. A binder is a component that provides a standardized way to connect your application to a specific message broker. By simply changing a dependency, you can switch the underlying messaging technology without altering your core business logic.

The Spring team and community maintain a number of popular binders.

  • Apache Kafka Binder:
    • Description: This is the most widely used binder, leveraging the high-throughput, distributed nature of Apache Kafka. It is ideal for event streaming, real-time analytics, and high-volume data pipelines.
    • Common Use Cases: Log aggregation, metrics, IoT event streams, and other data-intensive applications.
    • Potential Limitations:
      • Version Compatibility: The binder is built against a specific version of the Kafka client library. While it’s often backward compatible with older Kafka brokers, some of the latest features might not be available on older broker versions.
      • Native Partitioning: The Kafka binder maps Spring Cloud Stream partitioning directly to Kafka partitions, which is powerful but requires careful management of partition counts and keying strategies (see the partitioning sketch after this list).
  • RabbitMQ Binder:
    • Description: This binder integrates with RabbitMQ, a versatile message broker known for its robust routing capabilities and flexible messaging patterns. It is an excellent choice for general-purpose messaging, task queues, and complex message routing.
    • Common Use Cases: Command and control messaging, work queues, asynchronous task processing, and publish-subscribe systems.
    • Potential Limitations:
      • No Native Partitioning: RabbitMQ does not have Kafka's native partitioning concept. Spring Cloud Stream emulates partitioning at the application level (using the instanceCount and instanceIndex properties), but it is not as seamless as with the Kafka binder.
      • Retry and DLQ Logic: The default behavior of retries and dead-letter queues can be customized, but it’s important to understand the specific properties that control message rejection and requeueing to avoid infinite loops.
  • Other Binders:
    • The Spring Cloud Stream ecosystem also includes community-maintained or partner-supported binders for various other platforms, such as Azure Event Hubs, Azure Service Bus, Google Cloud Pub/Sub, and AWS Kinesis. These binders extend the same programming model to a wide range of cloud-native messaging services.
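To illustrate the keying strategy mentioned for the Kafka binder above, here is a sketch of producer-side partitioning configuration; partition-key-expression is a SpEL expression evaluated against each outgoing message, and the partitionKey header and count of 3 are illustrative assumptions:

spring:
  cloud:
    stream:
      bindings:
        supplyMessage-out-0:
          destination: messages-to-process
          producer:
            # SpEL evaluated per message; here the key comes from a custom header
            partition-key-expression: headers['partitionKey']
            partition-count: 3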

The key takeaway is that the core of your application remains the same, regardless of the binder you choose. This gives you the flexibility to select the right tool for the job while maintaining a consistent and simple programming model.
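When more than one binder is on the classpath, you choose the default with spring.cloud.stream.default-binder and can override it per binding. A minimal sketch (the binder names rabbit and kafka are the conventional identifiers for the two main binders):

spring:
  cloud:
    stream:
      default-binder: rabbit
      bindings:
        supplyMessage-out-0:
          destination: messages-to-process
          binder: kafka   # this one binding uses Kafka even though RabbitMQ is the default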

