Sectigo
CA agnostic Certificate Lifecycle Management for the modern enterprise. Secure your human & machine identities at scale.
About the Company
Sectigo is a global leader in digital identity management, offering cutting-edge solutions for businesses to secure and manage their online presence. The company thrives on a strong team culture, innovation, and a commitment to delivering impactful results for clients worldwide.
About the Role
Sectigo is looking for a Site Reliability Engineer to join its growing global team. The ideal candidate will play a key role in ensuring the reliability of critical services by designing and implementing solutions that reduce operational overhead and enhance service stability.
Responsibilities
- Ensure the reliability and performance of Sectigo’s critical products and services, exceeding Site Reliability Engineering (SRE) objectives.
- Build and maintain production infrastructure using Infrastructure as Code (IaC) and Configuration Management tools.
- Implement monitoring solutions to track the health and performance of services.
- Automate deployments, administration, and monitoring following CI/CD practices.
- Collaborate with engineering and information security teams to improve and document processes, enhancing the operability and security of services.
- Participate in the team’s on-call rotation and provide incident response and support.
- Take on additional responsibilities as needed in line with company initiatives.
Required Skills
- Bachelor’s degree in Computer Science, Information Systems, or a related field, or equivalent work experience.
- 3+ years of experience in software and/or operational roles, particularly with internet-facing production environments.
- Strong experience in Linux/Unix systems administration.
- Familiarity with source control tools, particularly Git.
- Proficient in Configuration Management and Infrastructure as Code tools (e.g., Ansible, Puppet, Terraform).
- Strong understanding of containerization technologies (Docker, Kubernetes).
- Experience with monitoring tools such as Prometheus, Grafana, or Nagios.
- Ability to manage large-scale, 24/7 production environments.
- Familiarity with distributed data processing, databases, and large-scale file systems is a plus.
Preferred Qualifications
- Strong scripting skills in Bash and Python.
- Experience with incident management, troubleshooting, and root cause analysis.
- Experience in running and maintaining build systems (e.g., Jenkins, DroneCI).
- Background in systems architecture, design, and operations.
- Experience with HTTP Service APIs and virtualization tools (e.g., VMware, Proxmox).
- Knowledge of network administration is a plus.
- Exposure to security and testing frameworks is a bonus.
- Experience in regulated industries such as Finance, Healthcare, or Government is beneficial.