See all roles

Senior Site Reliability Engineer

Work from home Full-time role Hiring
Company Description

We have an excellent opportunity for a bright, smart, and highly motivated Senior/Principal Site Reliability Engineer to join our mature project team. 

You have a unique chance to become part of our team and work with best practices and methodologies.  This role empowers you to take the lead and excel to your fullest potential.  

CUSTOMER

Our customer is a renowned technology company based in New York, specializing in providing cutting-edge solutions in the realm of video advertising, such as ad platforms for publishers, advertisers, and media buyers. The products field millions of queries per second and consume 125 GB of data per minute.  

With over a decade of experience in the AdTech domain, Sigma Software has played a key role in shaping the digital advertising landscape. Our team of experts has created software solutions that span the entire AdTech ecosystem, working with publishers, advertisers, and intermediary platforms to achieve success.

PROJECT

We're seeking a skilled Senior/Principal Site Reliability Engineer responsible for the Cloud Infrastructure and Observability solutions for the Client`s platform and ensuring all systems run smoothly. If you're passionate about complex tasks, optimizing systems, driving innovation, providing the highest quality, and collaborating with top talent, this is the perfect opportunity.  

The project is an easy-to-use, massive-scale, and highly available demand-side platform. Backed by Amazon Web Services and Kubernetes, the team has embraced Infrastructure as code to manage thousands of applications, servers, and containers running in multiple regions worldwide.  

Bring your expertise to our dynamic and forward-thinking environment! 

Job Description
  • Design and build infrastructure and tooling to provide high scalability, reliability, and sub-second performance levels using security industry best practices 
  • Write code and scripts to support Infrastructure as code (IaC), configuration management, and automated incident resolution 
  • Support and extend the observability stack to capture and alert on any system issues 
  • Participate in on-call rotations and be an escalation contact for service incidents 
  • Write systems documentation, troubleshoot playbooks and other instruction manuals 
  • Other duties and responsibilities as assigned 
Qualifications
  • Bachelor’s or higher degree in computer science, computer engineering, relevant technical field, or equivalent practical experience 
  • Expertise with architecture solutions and system design 
  • Experience in analyzing and troubleshooting large-scale distributed systems 
  • At least 6 years of administration experience with Linux, AWS, and Kubernetes 
  • At least 6 years of experience in configuration management using Cloud Formation, Terraform, and Ansible or similar 
  • At least 3 years of experience with Python
  • Strong problem-solving skills 
  • Strong verbal/written communication skills 
  • At least an Upper-Intermediate level of English 
Additional Information Apply To This Job

You might like

Middle/Strong Junior DevOps Engineer

Work from home Full-time role

Combustible Dust & Hazards Consultant

Work from home Full-time role

Policy Advocacy Manager

Work from home Full-time role

Operations Associate

Work from home Full-time role

Technical Writer

Work from home Full-time role

Senior Quality Assurance Engineer and Analyst

Work from home Full-time role

Software Tester

Work from home Full-time role

HR Ops Lead

Work from home Full-time role

Junior Accountant

Work from home Full-time role

Senior FP&A Analyst

Work from home Full-time role

Join Today: Urgently Require Instructor

Work from home Full-time role

Sales Development Representative

Work from home Full-time role

FLEX Senior Implementation Specialist

Work from home Full-time role

Experienced Freelance Data Entry Specialist – Remote Opportunity with arenaflex

Work from home Full-time role

Teacher - English as a Second Language, Gr. K-2 (SY25-26)

Work from home Full-time role

Senior Accountant (CPA) Havasu Lake, CA

Work from home Full-time role

Experienced Full-Time Remote Data Entry Specialist – Business Insights and Information Management at Blithequark ($23/Hour)

Work from home Full-time role

Experienced Travel Customer Service Representative/Travel Assistant – Remote Group Travel Adventure Expert

Work from home Full-time role

Experienced Part-Time Online Chat Specialist – Remote Customer Service Representative for arenaflex

Work from home Full-time role

Technical Support Engineer

Work from home Full-time role