Site Reliability Engineer

Sectigo

CA-agnostic Certificate Lifecycle Management for the modern enterprise. Secure your human & machine identities at scale.

About the Company

Sectigo is a global leader in digital identity management, offering cutting-edge solutions for businesses to secure and manage their online presence. The company thrives on a strong team culture, innovation, and a commitment to delivering impactful results for clients worldwide.

About the Role

Sectigo is looking for a Site Reliability Engineer to join its growing global team. The ideal candidate will play a key role in ensuring the reliability of critical services by designing and implementing solutions that reduce operational overhead and enhance service stability.

Responsibilities

  • Ensure the reliability and performance of Sectigo’s critical products and services, exceeding Site Reliability Engineering (SRE) objectives.
  • Build and maintain production infrastructure using Infrastructure as Code (IaC) and Configuration Management tools.
  • Implement monitoring solutions to track the health and performance of services.
  • Automate deployments, administration, and monitoring following CI/CD practices.
  • Collaborate with engineering and information security teams to improve and document processes, enhancing the operability and security of services.
  • Participate in the team’s on-call rotation and provide incident response and support.
  • Take on additional responsibilities as needed in line with company initiatives.

Required Skills

  • Bachelor’s degree in Computer Science, Information Systems, or a related field, or equivalent work experience.
  • 3+ years of experience in software and/or operational roles, particularly with internet-facing production environments.
  • Strong experience in Linux/Unix systems administration.
  • Familiarity with source control tools, particularly Git.
  • Proficiency with Configuration Management and Infrastructure as Code tools (e.g., Ansible, Puppet, Terraform).
  • Strong understanding of containerization technologies (Docker, Kubernetes).
  • Experience with monitoring tools such as Prometheus, Grafana, or Nagios.
  • Ability to manage large-scale, 24/7 production environments.
  • Familiarity with distributed data processing, databases, and large-scale file systems is a plus.

Preferred Qualifications

  • Strong scripting skills in Bash and Python.
  • Experience with incident management, troubleshooting, and root cause analysis.
  • Experience running and maintaining build systems (e.g., Jenkins, DroneCI).
  • Background in systems architecture, design, and operations.
  • Experience with HTTP Service APIs and virtualization tools (e.g., VMware, Proxmox).
  • Knowledge of network administration is a plus.
  • Exposure to security and testing frameworks is a bonus.
  • Experience in regulated industries such as Finance, Healthcare, or Government is beneficial.

Copyright © 2025 SRE-Jobs.com. All Rights Reserved.