Design and implement analytics infrastructure
We are building Redpanda, a real-time streaming engine for modern applications. Redpanda is used by Fortune 1000 enterprises pushing hundreds of terabytes a day, and by the solo dev prototyping a React application on her laptop. We go beyond the Kafka protocol into the future of streaming, with inline WASM transforms and geo-replicated hierarchical storage. Think of it as a data API platform that scales with you from the smallest projects to petabytes of data distributed across the globe.
We are on a mission to enable every developer to supercharge their real-time applications.
Design, implement, and maintain an analytics pipeline to ingest, process, and query large volumes of data coming in from Redpanda clusters around the world.
Interact with other teams inside Redpanda such as support, marketing, and product to deliver data-driven insights into the usage and behavior of our clusters.
You’ll be part of a diverse team with members in both US (New York City, San Francisco, San Diego, Austin, Denver) and international locations, including Colombia, Denmark, the United Kingdom, Russia, Poland, Czech Republic, Germany, Greece, Japan, and growing!
A passion for working on data analytics
Strong understanding of either AWS, GCP or Azure.
Hands on experience administering and troubleshooting complex cloud environments.
Ability to work with ETL and BI tools
Good understanding of SQL
Willingness to work with a 100% distributed engineering team, collaborating on GitHub, in the open and a self starter.
Excellent written communication skills.
Please highlight any of the following
- Previous demonstrated experience building a data analytics pipeline