Full Time
San Francisco
Posted 7 days ago
0.000000 - 0.000000

Retool, Inc.

ABOUT RETOOL

Nearly every company in the world runs on custom software for critical operations like tracking performance metrics, handling customer support workflows, building admin dashboards, and countless other processes you might not have even thought of. But most companies don’t have adequate resources to properly invest in these tools, leading to a lot of old and clunky internal software or, even worse, users still stuck in manual and spreadsheet flows. At Retool, we’re building the first enterprise AppGen platform: software that transforms natural language into production-ready code, integrates directly with business data, and meets the highest standards of security and governance. AI is redefining what it means to build software—and who gets to build it. The definition of “developer” now includes analysts, operators, and domain experts creating solutions directly. As the pool of builders widens, so does the complexity of what they need to build. The opportunity is enormous, but so is the challenge of enabling this larger community to build production‑grade software safely. That means AI that understands real business data, enforces enterprise policies automatically, and empowers teams to create once and reuse everywhere with shared, trusted components. Over 100 million hours of work has been automated by developers and domain experts using our platform, freeing them to focus on creative problem‑solving and strategic initiatives that drive real business value. The people closest to knowing what needs to be built can now safely create custom solutions within enterprise guardrails. And that’s a mission worth striving for. Let’s build the future together!

WHY WE’RE LOOKING FOR YOU:

Good software has to run where customers need it. For many of Retool’s largest customers, that means running Retool in their own infrastructure, behind their own controls, with the reliability and operational clarity they would expect from any critical system. Retool’s Core Infrastructure team owns the systems that make this possible: Retool Cloud, managed single tenant environments, BYOC (bring-your-own-cloud) environments, Kubernetes and Helm deployments, Docker Compose, and the migration paths between them. It is a broad surface area, and it is one of the biggest levers we have for making Retool work for enterprise customers. The work is not clean‑room infrastructure. Customers run different clouds, different versions, different deployment models, and different levels of operational maturity. A bad upgrade experience can leave a customer many versions behind. A manual Terraform run can become the bottleneck during a launch or incident. We are hiring SREs who want to turn that mess into leverage. You will help us reduce customer toil, automate upgrades and infrastructure changes, build reliability tooling across Retool Cloud and customer‑owned environments, and make Retool easier to deploy and operate at enterprise scale. The strongest candidates are comfortable debugging Kubernetes, Terraform, AWS, Postgres, networking, and deployment problems, then stepping back and building the automation or product surface that prevents the same problem from happening again.

What you’ll do:

Own reliability across Retool Cloud, managed single tenant, BYOC, and self‑hosted deployment paths, including provisioning, upgrades, migrations, configuration changes, and production escalations.
Build the automation that turns today’s manual infrastructure work into repeatable systems: Terraform runs, customer environment updates, upgrade workflows, secret rotations, and migration steps.
Improve observability for Retool Cloud, self‑hosted customers, and internal operators. We care less about exposing every metric and more about turning health signals into clear status, likely causes, and recommended actions.
Design safer deployment, upgrade, and rollback paths so Cloud and managed customers can stay current.
Help move customers from legacy or less‑supported deployment models toward supported paths such as Retool’s official deployment paths (Blueprints, Kubernetes, and Helm), with migration flows that are repeatable enough for customers, Support, and TAMs to trust.
Partner with product engineers on infrastructure requirements for new Retool products, especially when they introduce new dependencies.
Lead through ambiguity, make careful risk calls, and communicate clearly while things are moving quickly.
Write the docs, runbooks, design notes, and migration guides that make complex systems understandable to other engineers and to customers.

What we’re looking for:

Infrastructure fundamentals

Deep experience operating production infrastructure in AWS.
Experience improving reliability for customer‑facing SaaS systems.
Strong Kubernetes fundamentals.
Real Terraform or infrastructure‑as‑code experience.
Good operational judgment around databases, especially Postgres.

Reliability and automation

Experience building or operating observability systems.
Programming ability in a language such as Go, Python, TypeScript, Java, or Ruby.
A bias toward automation. If you find yourself doing the same operational task twice, you should start thinking about the interface, workflow, or tool that eliminates the third time.

Customer and team judgment

Clear written communication.
Comfort working directly with customer‑facing teams and, when useful, customers themselves.

What makes SREs successful here

You will do well here if you like infrastructure that sits close to real customer pain. Some days that means debugging a specific customer environment. Other days it means improving Retool Cloud reliability or designing the migration path so the next 25 customers do not need that same debugging session. We value SREs who are ambitious, curious, energetic, and careful with the details. Retool moves quickly, priorities can change, and the systems are not always as clean as we want them to be. The work needs SREs who can get their hands dirty, tell the truth about tradeoffs, and leave the system better than they found it.

Compensation & Benefits

For candidates based in the United States, the pay range(s) for this role is listed below and represents base salary range for non‑commissionable roles or on‑target earnings (OTE) for commissionable roles. This salary range may be inclusive of several career levels at Retool and will be narrowed during the interview process based on a number of factors such as (but not limited to), scope and responsibilities, the candidate’s experience and qualifications, and location. Additional compensation in the form(s) of equity and/or commission are dependent on the position offered. Retool provides a comprehensive benefit plan, including medical, dental, vision, and 401(k). Pay and benefits are subject to change at any time, consistent with the terms of any applicable compensation or benefit plans. The base pay range for this role is $163,710 – $306,000 per year.

Retool offers generous benefits to all employees and hybrid work location. For more information, please visit the benefits and perks section of our careers page! Retool is currently set up to employ all roles in the US and specific roles in the UK. To find roles that can be employed in the UK, please refer to our careers page and review the indicated locations.

#J-18808-Ljbffr

Apply Now