About the Company
Point Wild helps individuals monitor, manage, and safeguard their digital identities and personal data. Supported by investors like WndrCo, Warburg Pincus, and General Catalyst, we are committed to building the most comprehensive suite of top-tier cybersecurity solutions. Our goal is to be the ultimate resource for all cybersecurity needs that people might face now and in the future.
Come join us on this journey!
Role Overview:
We’re looking for a motivated and skilled Site Reliability Engineer (SRE) to join our energetic engineering team. In this role, you will ensure our systems and applications are reliable, available, and high-performing. You’ll collaborate with both development and operations teams to apply best practices, automate workflows, and scale infrastructure to support our growing business.
Key Responsibilities:
-
System Monitoring & Incident Management: Build and maintain monitoring tools to track system health, respond to issues quickly, and resolve incidents efficiently.
-
Automation & Infrastructure as Code: Create automated solutions for infrastructure and application deployments using tools like Terraform and Ansible.
-
Performance Tuning: Monitor and analyze system performance and capacity to implement improvements that boost reliability and efficiency.
-
Team Collaboration: Partner with developers to enhance system design and deployment, advocating for improved reliability throughout the software lifecycle.
-
Documentation & Reporting: Keep detailed records of system architecture, processes, and incident responses, and provide regular performance and reliability updates.
-
Disaster Recovery & Backup: Develop and maintain recovery plans and ensure robust backup systems are in place.
-
Security Collaboration: Work with security teams to enforce best practices that protect data and systems.
Qualifications:
-
Experience in Site Reliability Engineering, DevOps, or similar roles.
-
Familiarity with cloud platforms like AWS, Azure, or Google Cloud, and container tools such as Kubernetes and Docker.
-
Skilled in scripting languages (Python, Bash, Ansible) and tools for CI/CD (Jenkins, GitLab CI/CD) and infrastructure management (Terraform, Ansible).
-
At least 3 years of experience with production monitoring tools such as Prometheus, ELK, Grafana, and OpsGenie/PagerDuty.
-
3+ years managing Linux systems (preferably Ubuntu).
-
Strong knowledge of networking, security, system architecture, and data center operations in fast-paced, 24/7 environments.
-
Good understanding of networking protocols (TCP/IP, BGP, OSPF) and technologies (LAN, WAN, VPN), along with network monitoring software proficiency.
What You’ll Gain at Point Wild:
-
The chance to tackle real-world problems by delivering solutions that meet customers’ immediate cybersecurity needs, while anticipating their future challenges.
-
A dynamic environment where your individual contributions truly matter and impact the company daily.
-
Opportunities for rapid career growth by working with cutting-edge technologies and expanding into new products and markets.
-
Collaboration with a talented, supportive team in a workplace that values people and inclusivity.
Commitment to Diversity and Inclusion:
Point Wild is dedicated to a workplace free from discrimination or harassment based on any protected characteristic, and strives to create an inclusive community where everyone feels welcome and supported.