10 use cases made easy with Redpanda Connect

The possibilities are endless when you have 280+ connectors at your fingertips — but here are 10 to get you started

Dunith Danushka

June 18, 2024

Last modified on

TL;DR Takeaways:

How can I use Redpanda Connect for message format transformation?

Redpanda Connect can decode messages from one format, change their structure, and encode them back into a different format. This is useful for data format normalization and system integrations. The processors support formats including Avro, JSON, Protobuf, Parquet, MessagePack, and XML.

How can I use Redpanda Connect for reading from and writing to databases?

The sql_select operator in Redpanda Connect executes a select query against a database and creates a message for each row received. For writes, the sql_insert operator inserts rows into an SQL database for each message. You can also use the sql_raw operator to run an arbitrary SQL query against a database. Both operators support databases including MySQL, Postgres, MSSQL, SQLite, Oracle, ClickHouse, Trino, and Azure CosmosDB.

How can I use Redpanda Connect for working with caches?

The cache operator in Redpanda Connect performs operations against a cache for each message, allowing you to store or retrieve data within message payloads. This enables two important use cases: message deduplication and enrichment. Redpanda Connect supports cache implementations including but not limited to Redis, AWS S3, AWS DynamoDB, Google Cloud Storage, Memcached, MongoDB, CouchDB, memory, SQL databases, and files.

What are some use cases for Redpanda Connect?

With over 300 connectors, Redpanda Connect can be used for a variety of use cases, including message format transformation, executing a code for every message, message mapping and filtering, working with caches, reading from and writing to databases, enqueueing and dequeueing messages from queues, message downsampling, and compressing/decompressing and archiving/extracting messages.

What is a connector in Redpanda Connect?

A connector in Redpanda Connect integrates your Redpanda data with different data systems. There are two types of connectors: input connectors and output connectors. An input connector imports data from a source system into Redpanda Connect, while an output connector exports data from Redpanda Connect and transforms it into a format the target system can use.

Learn more at Redpanda University

Redpanda Connect is a lightweight stream processor that allows you to build streaming data pipelines in a declarative manner.

Formerly known as Benthos, Redpanda Connect is written in Go and deployed as a static binary with zero external dependencies. Its functionality overlaps with enterprise integration frameworks, log aggregators, and ETL workflow engines. So, Redpanda Connect can complement these traditional data engineering tools or act as a simpler alternative.

With an ecosystem of high-performance connectors at your fingertips, you’re spoiled for choice when it comes to potential use cases. To help you kick things off, this post highlights 10 handy use cases you can easily implement by pairing Redpanda Connect with relevant technologies.

But first, let’s get familiar with the basics of Redpanda Connect.

What is a connector in Redpanda Connect?

When working with Redpanda Connect, you’ll find connectors and processors. As a developer or data engineer, you’ll compose streaming data pipelines with Redpanda Connect using YAML. These pipeline definitions are declarative, consisting of one or more connectors and processors.

Connectors

__wf_reserved_inherit — Redpanda connectors

A connector integrates your Redpanda data with different data systems. There are two types of connectors: input connectors and output connectors.

An input connector imports data from a source system into a Redpanda Connect. For example, you can download objects within an Amazon S3 bucket with aws s3 connector.
An output connector exports data from a Redpanda Connect and transforms it into a format the target system can use. For example, inserting a record into a database table with sql_insert connector.

Processors

Data pipeline diagram showing sequential processing from source to sink — Processing sequence with multiple processors

A processor is a function applied to messages passing through a pipeline. The function signature allows a processor to mutate or drop messages depending on the content of the message.

Processors are set via config, and depending on where in the config they’re placed, they’ll run either immediately after a specific input (set in the input section), on all messages (set in the pipeline section), or before a specific output (set in the output section). Most processors apply to all messages and can be placed in the pipeline section.

Here’s an example of how a processor can be configured in the pipeline section:

pipeline: threads: 1 processors: - label: my_cool_mapping mapping: | root.message = this root.meta.link_count = this.links.length()

Browse the Redpanda Connect documentation for the full list of connectors and processors.

10 use cases for Redpanda Connect

With 220+ connectors, you can implement an unlimited number of use cases by combining Redpanda Connect with various technologies, including databases, message queues, data warehouses, etc. These use cases cut through different domains such as data engineering, stream processing, enterprise integration, and more.

Here are 10 handy use cases to get you started.

1. Message format transformation

Different processors in Redpanda Connect can decode messages from one format, change their structure, and encode them back into a different format. This feature is highly useful in several scenarios, including:

Data format normalization where data from different sources in varying formats needs to be converted into a common format for consistent handling and analysis.
System integrations where data from an older system (in an obsolete or less efficient format) needs to be transformed into a format compatible with a new system.

Processors support formats including Avro, JSON, Protobuf, Parquet, MessagePack, and XML.

2. Execute a code for every message

Several processors in Redpanda Connect allow you to execute a code snippet or trigger a function for each message. The contents of the message are the payload of the request, and the result of the invocation will become the new contents of the message.

wasm executes a function exported by a WASM module for each message
javascript executes a provided JavaScript code block or file for each message
command executes a Linux command for each message
aws_lambda invokes an AWS lambda for each message

3. Message mapping and filtering

Execute various expressions on the content, transforming it into a different structure. The result of the expression will replace the original message content. If the expression does not emit any value, the message is filtered — producing no result.

awk: Executes an AWK program on messages, querying and mutating message contents and metadata.
grok: Parses messages into a structured format by attempting to apply a list of Grok expressions.
jmespath: Executes a JMESPath query on JSON documents and replaces the message with the resulting document.
jq: Transforms and filters JSON messages using jq queries.

Additionally, the mapping and mutation processors can do the same with Bloblang, a powerful language that enables a wide range of mapping, transformation, and filtering tasks.

4. Working with caches

The cache operator performs operations against a cache for each message, allowing you to store or retrieve data within message payloads.

This enables two important use cases: message deduplication and enrichment. Deduplication can be achieved by adding a message to the cache with a key, extracted from the message itself. If the add operation fails, it indicates that the key already exists, and we can eliminate the duplicate using the mapping processor.

Similarly, you can enrich message payloads with content previously stored in a cache.

Redpanda Connect supports cache implementations including but not limited to Redis, AWS S3, AWS DynamoDB, Google Cloud Storage, Memcached, MongoDB, CouchDB, memory, SQL databases, and files.

5. Read from and write to databases

The sql_select operator executes a select query against a database and creates a message for each row received. For example, you could use this operator to read records from a table created within the last hour and forward them to a different processor in the pipeline.

For writes, the sql_insert operator inserts rows into an SQL database for each message and leaves the message unchanged.

Additionally, you can use the sql_raw operator to run an arbitrary SQL query against a database, including SELECT, INSERT, UPDATE, and DELETE queries. For SELECT queries it returns the result as an array of objects, one for each row returned.

Both operators support databases including MySQL, Postgres, MSSQL, SQLite, Oracle, ClickHouse, Trino, and Azure CosmosDB. Additionally, you can work with other databases via relevant processors, such as Couchbase, MongoDB, and Apache Cassandra.

6. Enqueue and dequeue messages from queues

Redpanda Connect has the necessary inputs and outputs to both consume messages from message queues and produce messages to them. This is handy if you want to bridge messages between different message brokers.

Supported implementations include AMQP, MQTT, AWS S3, AWS SNS, AWS Kinesis, Azure Queue Storage, Google Pub/Sub, NATS, Pulsar, Redis Pub/Sub, Kafka, and ZeroMQ.

7. Message downsampling

The rate_limit operator controls the throughput of a pipeline based on a specified limit. This is useful when the downstream system has a lower processing capacity than the data producer.

8. Compress/decompress and archive/extract messages

The compress operator compresses messages using the chosen algorithm. The supported compression algorithms include flate, gzip, lz4, pgzip, snappy, and zlib.

The decompress operator does the opposite—decompresses messages according to the selected algorithm. Supported decompression algorithms are [bzip2 flate gzip lz4 pgzip snappy zlib]

The archive operator archives all the messages of a batch into a single message according to the selected archive format. The output can be a tar or a zip file, a JSON array, a concatenated text, or a binary message.

The unarchive operator does the opposite.

9. Retry

The retry operator attempts to execute a series of child processors until succeeding.

For instance, imagine a child processor attempting to write a message into a database, but it receives an error because the database is offline. After a specified backoff period, the retry operator reattempts the same message at a configured interval until the operation succeeds.

10. Call an HTTP endpoint

The https operator performs an HTTP request using a message batch as the request body and replaces the original message parts with the body of the response. This is useful when invoking a Webhook or fetching data from an API.

Get started with Redpanda Connect

Redpanda Connect allows you to compose streaming data pipelines that move data between systems. You’ll author these pipelines with YAML, making them declarative and version control-friendly.

With its 280+ prebuilt connectors, Redpanda Connect complements Redpanda’s data platform offering by providing smooth integrations to various systems, enabling reliable and efficient data flow.

Want to kick off with Redpanda Connect? Check out our quickstart guide.

No items found.

Join the Redpanda Community on Slack

Chat with our team, ask industry experts, and meet fellow data streaming enthusiasts.

FEATURED RESOURCE

Table of contents

Prakhar Garg

Jul 7, 2026

Stream Oracle changes to ClickHouse in real time with Redpanda Connect

A hands-on walkthrough of our new Oracle CDC connector

Text Link

Product

Tutorial

Paul Wilkinson

Jun 23, 2026

Bridge Queries in Redpanda SQL

Have your real-time cake and eat your analytics too

Text Link

Tutorial

Prakhar Garg

Feb 24, 2026

Building with Redpanda Connect: Bloblang and Claude plugin

A workshop on building and debugging real-world streaming pipelines

Text Link

PANDA MAIL

Stay in the loop

Subscribe to our VIP (very important panda) mailing list to pounce on the latest blogs, surprise announcements, and community events!
Opt out anytime.

10 use cases made easy with Redpanda Connect

What is a connector in Redpanda Connect?

Connectors

Processors

10 use cases for Redpanda Connect

1. Message format transformation

2. Execute a code for every message

3. Message mapping and filtering

4. Working with caches

5. Read from and write to databases

6. Enqueue and dequeue messages from queues

7. Message downsampling

8. Compress/decompress and archive/extract messages

9. Retry

10. Call an HTTP endpoint

Get started with Redpanda Connect

Join the Redpanda Community on Slack

Related articles

Stream Oracle changes to ClickHouse in real time with Redpanda Connect

Bridge Queries in Redpanda SQL

Building with Redpanda Connect: Bloblang and Claude plugin

Stay in the loop