Talent.com
Infrastructure DevOps Engineer

coverletter.tech, Bucharest, Romania
3 hours ago
Job description

Location: This is a hybrid role based in our Bucharest office, requiring 3 days a week in the office.

We are seeking a skilled and motivated DevOps Engineer with deep familiarity with the streaming ecosystem to join our elite infrastructure team. This is an operations-first role focused on running, scaling, automating, and monitoring mission-critical streaming infrastructure such as Kafka and RabbitMQ.

Streaming is at the heart of our product. We operate hundreds of streaming applications that transform, aggregate, analyze, and enrich the most valuable data we collect from our clients. We process billions of events and petabytes of raw data daily, and we're rapidly growing.

Your mission is to provide a rock-solid, scalable, and secure infrastructure foundation that empowers our engineers to build and operate streaming services with confidence. If you're excited by the challenge of operating mission-critical systems at scale and optimizing the developer experience through automation and tooling, we’d love to hear from you.

Tasks

What You Will Do:

  • Automate Deployment and Operation: Oversee deployment of Kafka and RabbitMQ clusters (including Confluent Cloud & CFK). Build automation pipelines to ensure repeatability and resiliency across environments.
  • Monitor and Support Production Systems: Own production stability of global Kafka clusters. Handle on-call rotations, incident management, troubleshooting, and scaling challenges.
  • Improve Infrastructure Observability: Build and maintain observability systems including dashboards, alerting pipelines, and metrics collection (e.g., Prometheus, Grafana).
  • Optimize System Performance: Collaborate with peers on benchmarking and optimization initiatives. Work on tuning Kafka brokers, cluster configurations, and runtime parameters.
  • Secure Infrastructure Access: Configure and maintain secure access patterns across streaming infrastructure, ensuring proper authentication and Role-Based Access Controls are enforced for both developers and services.
  • Develop and Maintain Infrastructure: Contribute to building infrastructure tools and scripts (IaC, Helm charts, etc.) that make provisioning and managing clusters reliable and efficient.
  • Provide Developer Support and Training (Infra-focused): Help developers configure topics, quotas, and consumers appropriately. Train service owners to interpret monitoring data and avoid pitfalls.

Requirements

What We Expect:

  • 8+ years of experience in DevOps, SRE, or Infrastructure Engineering roles.
  • Deep hands-on Kafka experience, including deploying, maintaining, scaling, and monitoring clusters.
  • Experience with RabbitMQ.
  • Extensive experience with Docker, Kubernetes, Helm, and GitOps-style deployments.
  • Infrastructure as Code experience (Terraform, Pulumi, etc.).
  • Strong skills in scripting and automation (Python, Bash, etc.).
  • Familiarity with Confluent Cloud, Confluent for Kubernetes, and similar tools.
  • Solid understanding of authentication and authorization mechanisms in distributed systems.
  • Proven production support mindset with a history of troubleshooting and incident resolution.
  • Excellent collaboration and communication skills, especially with development teams depending on platform support.
  • Bonus: Experience with Istio Service Mesh.
  • Bonus: Experience with GovCloud.
  • Bonus Qualities:

  • Mentorship and leadership experience in infrastructure or SRE teams.
  • Contributions to automation or monitoring open-source tooling.
  • Active participant in SRE or DevOps communities.
  • Conference speaker or internal tech trainer.
  • Technical writing about infrastructure automation or reliability.
