Batch to streaming workshop
Explore the integration of Apache Spark, ScyllaDB, PostgreSQL, Redpanda, Debezium, and Benthos to master building advanced real-time data pipelines.
This course focuses on transitioning from traditional batch processing to real-time data systems. You'll learn how to implement technologies like Apache Spark™, ScyllaDB, and PostgreSQL to manage live data feeds. You'll also learn about streaming platforms and Change Data Capture (CDC) using tools like Redpanda and Debezium for continuous data flow.
As a bonus, you'll dive into advanced data querying techniques with Flink’s SQL client so you can easily integrate it with relational and NoSQL databases.