See all roles

Site Reliability Engineer

Work from home Full-time role Hiring

About Ververica

Ververica, founded by the original creators of Apache Flink™, empowers businesses to unlock the full potential of real-time data processing and analytics. Our platform provides cutting-edge stream processing and event-driven applications, enabling companies worldwide to build scalable and reliable data-driven solutions.

Role Overview

As a Site Reliability Engineer (SRE) at Ververica, you will design, provision, and maintain the infrastructure for Ververica’s Unified Streaming Data Platform across multiple cloud providers, including AWS, GCP, and Azure. You will collaborate with software engineering teams to develop solutions that enhance feature delivery, optimize performance, and address security vulnerabilities. Your role will involve architectural improvements, implementation ownership, and driving reliability best practices.

Key Responsibilities

  • Build and maintain the infrastructure for Ververica’s Unified Streaming Data Platform across AWS, GCP, and Azure.
  • Design and manage Infrastructure as Code (IaC) using Terraform, ensuring modularity, reusability, and best practices.
  • Implement and enhance observability tooling, including Grafana, Prometheus, logging systems, traces, metrics, dashboards, and alerts.
  • Ensure system reliability through SRE best practices, including defining SLIs, SLOs, and error budgets.
  • Improve infrastructure architecture and engineering efficiency through continuous evaluation and optimization.
  • Enhance CI/CD pipelines to automate development workflows.
  • Monitor, identify, and resolve security vulnerabilities (CVE updates and security enhancements).
  • Contribute to the successful development and launch of new products, features, and services.
  • Periodically participate in on-call rotations to manage incidents in a 24/7 live infrastructure.
  • Maintain and update documentation, including architectural designs and changes.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • Minimum 2 years of hands-on experience with Kubernetes clusters, Helm charts, controllers, and operators.
  • Proficiency in designing and maintaining Terraform code with best practices.
  • Strong knowledge of observability tools and practices, including metrics, logging, and alerting systems.
  • Experience implementing SRE principles such as SLIs, SLOs, and error budgets.
  • Solid understanding of Linux systems and networking in cloud environments.
  • Hands-on experience managing multiple Kubernetes clusters.
  • Familiarity with distributed systems or streaming data platforms.
  • Knowledge of cloud-native security best practices.
About the company

Ververica is a pioneering leader in stream processing technology and the original creator of Apache Flink. Our advanced streaming platform empowers businesses to leverage real-time data, enabling timely and informed decision-making to gain competitive advantages.

We are a globally operating company, originally founded in Berlin, Germany, with a strong commitment to open source and a deep understanding of commercial needs. Operating under an open-core business model and backed by one of the largest tech companies, we are seeking talented individuals to join our team and shape the future of data stream processing.

Apply To This Job

You might like

CRM Administrator

Work from home Full-time role

SEO Content Editor

Work from home Full-time role

Director, Quality Management & Compliance

Work from home Full-time role

Senior Enablement Partner, Customer Success

Work from home Full-time role

HR & Personal Development Specialist

Work from home Full-time role

Senior Accountant

Work from home Full-time role

Director of Finance

Work from home Full-time role

Accounts Receivable Specialist

Work from home Full-time role

Senior Infrastructure Engineer

Work from home Full-time role

Staff Database Reliability Engineer

Work from home Full-time role

Looking for High School Special Education Paraprofessional in Connecticut

Work from home Full-time role

Experienced ACT/SAT English/Reading Prep Tutor Wanted for Part-Time Position in Chattanooga, TN

Work from home Full-time role

Virtual Customer Service/Sales Representative – Family Benefits Specialist

Work from home Full-time role

Experienced Customer Support Representative – Apple Home Advisor at arenaflex

Work from home Full-time role

Experienced Remote Data Entry Specialist – Join the arenaflex Team for a Magical Career Opportunity in Data Management and Entry

Work from home Full-time role

Senior Software Engineer (Golang/Ruby on Rails)

Work from home Full-time role

Lead Insurance Claims Adjuster - Remote

Work from home Full-time role

Representative, Business Development

Work from home Full-time role

Pricing Strategy, Manager/Senior Manager 5 Locations

Work from home Full-time role

Experienced Remote Live Chat Support Specialist – Customer Service and Technical Support for arenaflex, $25-$35/HR

Work from home Full-time role