Engineering Manager, Infrastructure
About the Role
You will define and execute the strategy for internal infrastructure platforms, build and operate scalable cloud and Kubernetes-based systems, and ensure high availability and compliance. You will improve provisioning workflows to speed time to production, maintain uptime SLOs, and implement secure access control and CI/CD infrastructure.
Requirements
- Experience building and operating internal infrastructure platforms on public cloud environments (AWS, GCP, or Azure), including provisioning, scaling, and reliability engineering
- 3+ years leading and managing software or infrastructure engineering teams, including hiring, developing, and retaining talent
- Demonstrated ownership of Kubernetes at scale, including building or leading a Kubernetes-based internal platform
- Track record of operating effectively in startup environments and navigating ambiguity with lean teams
- Experience supporting SOC 2 Type 2 and ISO 27001 compliance in infrastructure provisioning and access control
- Familiarity with CI/CD ecosystems, runners, secrets management, and cloud networking
- Understanding of security best practices in software delivery and secure artifact management
- Exposure to blockchain technologies, decentralized applications, or Web3 infrastructure
- Experience with globally distributed remote-first engineering teams
- Familiarity with infrastructure automation using Go-based tooling or GitHub Actions
Responsibilities
- Deliver a scalable internal infrastructure platform on public cloud environments
- Establish and evolve Kubernetes-based platform capabilities to support production-grade workloads
- Maintain 99.9% uptime SLOs for infrastructure systems critical to product delivery
- Improve provisioning workflows to reduce operational friction and enable faster time to production
- Ensure infrastructure meets SOC 2 Type 2 and ISO 27001 requirements through robust access control and provisioning standards
- Build and maintain secure foundations that support CI/CD pipelines and minimize operational risk
- Enable product teams to reliably provision, deploy, and operate systems at scale
