AI that connects online marketing to offline revenue.
About the Company
Invoca is a leader in AI-powered conversation intelligence, transforming how businesses understand and engage with customers. With an AI-driven approach, Invoca unlocks valuable insights from every customer interaction, enabling companies to optimize customer experience, increase revenue, and improve operational efficiency. Invoca is a SaaS company that offers solutions for real-time data processing, AI-powered automation, and security, and it is driven by a passion for pushing the boundaries of technology.
About the Role
The Site Reliability Engineer (SRE) will play a pivotal role in supporting cloud infrastructure and ensuring the seamless operation of Invoca’s platforms. The position focuses on scaling cloud infrastructure, building resilient systems, and applying sound engineering practices to support applications. The role is remote, and the ideal candidate is passionate about improving system reliability and collaborating across teams to enhance the infrastructure and performance of applications.
Responsibilities
- Ensure the reliability and scalability of cloud infrastructure, particularly on AWS or GCP.
- Support and maintain core applications, ensuring 99.99% uptime and high performance.
- Work with software teams to design and implement solutions that enhance infrastructure security, scalability, and performance.
- Lead the migration and deployment of critical services using Kubernetes and Docker.
- Automate and optimize cloud-based operations using Infrastructure as Code (IaC) tools such as Terraform.
- Improve observability and monitoring of infrastructure with tools like Prometheus, Grafana, and the ELK Stack.
- Implement DevSecOps practices, ensuring compliance with industry standards (e.g., SOC 2, PCI DSS).
- Conduct postmortems for incidents, applying learnings to improve infrastructure stability.
- Mentor junior engineers and contribute to technical decision-making and documentation.
Required Skills
- 4+ years of experience in SRE, DevOps, or related engineering roles.
- Expertise in cloud-native computing, specifically AWS or GCP.
- Strong skills in Infrastructure as Code (IaC) using Terraform or similar tools.
- Proficiency in containerization technologies like Docker and Kubernetes.
- Experience with monitoring tools (Prometheus, Grafana) and logging systems (ELK Stack).
- Proficiency in scripting or programming languages such as Python, Bash, or Go.
- Experience with CI/CD tools like GitHub Actions, Jenkins, or similar.
- Hands-on experience with cloud security practices, including IAM, VPC, and security group management.
- Strong understanding of distributed systems and event-driven architectures.
- Familiarity with service mesh technologies (e.g., Istio) is a plus.
Preferred Qualifications
- Experience with SIP, FreeSWITCH, or Kamailio.
- Prior work in FedRAMP- or SOC 2-compliant environments.
- Familiarity with Agile methodologies and working in cross-functional teams.
- Experience with security compliance and risk management frameworks.
- Previous exposure to high-stakes enterprise or defense tech markets.
- Ability to obtain a U.S. Government security clearance.