Site Reliability Engineer

Semrush

Your competitors' favorite marketing platform used by 10,000,000 marketers

About the Company

Semrush is a leading SaaS platform for digital marketing, helping businesses worldwide enhance their online visibility through tools for SEO, PPC, content, social media, and competitive research. With over 10 million users globally and a workforce of 1,700+ employees, Semrush has earned numerous accolades, including G2’s Top 100 Software Products and Deloitte’s Technology Fast 500. The company is publicly traded on the NYSE under the SEMR ticker.

About the Role

Semrush is looking for a Site Reliability Engineer (SRE) to join the SRE team. In this role, you’ll collaborate with development teams to ensure the reliability and performance of critical systems. You’ll be responsible for designing and implementing scalable system architectures, debugging applications, and ensuring their continuous availability. The role includes on-call duties, which may involve night shifts, and offers flexible working hours to support work-life balance.

Responsibilities

  • Collaborate with development teams to design, implement, and scale reliable and efficient system architectures.
  • Define and refine SLOs in partnership with stakeholders to ensure high service reliability and performance.
  • Write code in Python/Go and work with Kubernetes, Helm, and cloud providers to build scalable solutions.
  • Simulate application failures and ensure recovery strategies are in place.
  • Debug applications using metrics, and instrument them with the traces and metrics needed for troubleshooting.
  • Lead changes in engineering practices to enhance the development process.
  • Participate in on-call duties, including possible night shifts, to ensure constant system support.
  • Continuously improve and optimize infrastructure for reliability and performance.

Required Skills

  • Proficiency with Kubernetes, Helm, and cloud providers (e.g., AWS, GCP).
  • Solid coding experience in Python/Go.
  • Strong understanding of application failure points and troubleshooting techniques.
  • Experience with debugging using metrics and adding traces to applications.
  • Ability to collaborate effectively with cross-functional teams and stakeholders.
  • Willingness to work flexible hours and participate in on-call duties.

Preferred Qualifications

  • Familiarity with GCP and cloud-based technologies.
  • Experience with DevOps practices and CI/CD.
  • Understanding of software architecture and scalability.
  • Passion for adapting to continuous change and improving processes.
  • Strong communication skills to effectively share ideas and solutions with the team.

Benefits

  • Flexible working hours and unlimited PTO.
  • Flexi Benefit for hobbies and personal development.
  • Employee support program, including financial aid in the event of family loss.
  • Meals, snacks, and drinks provided at the office.
  • Team-building events and corporate gatherings.
  • Training, courses, and conference access for professional development.

Copyright © 2025 SRE-Jobs.com. All Rights Reserved.