DevOps Team Lead
Skills
About the Role
You will own and operate the cloud platform on AWS, architecting, implementing, and maintaining infrastructure and CI/CD pipelines. You will automate deployments, harden security, manage backups and disaster recovery, tune RDS databases, and maintain monitoring and alerting. You will lead a small DevOps team, provide technical direction, perform on-call incident response, run post-incident reviews, and continuously optimize cost, performance, and reliability.
Requirements
- 8+ years in DevOps roles
- Deep hands-on AWS experience including EC2 ECS RDS VPC IAM CloudWatch Lambda S3 ALB NLB
- Production experience with Terraform or CloudFormation
- Strong experience with Docker and container management tools such as ECS Portainer Coolify
- Expertise building and optimizing CI/CD pipelines (Bitbucket Pipelines GitHub Actions)
- Proficiency in scripting and automation with Bash Python and/or Node.js
- Solid understanding of TCP/IP DNS VPNs firewalls CDN and secure access solutions
- Experience with RDS PostgreSQL performance tuning backup and recovery
- Professional-level English written and verbal communication
- Proficiency with AI-assisted coding tools such as Claude Code Cursor or GitHub Copilot
- Preferred: AWS certifications (Solutions Architect DevOps Engineer)
- Preferred: Experience in regulated industries such as Fintech Blockchain or Financial Services
- Preferred: Bachelor’s degree in Computer Science Engineering or related field or equivalent practical experience
Responsibilities
- Design implement and maintain high-performance secure highly available AWS infrastructure
- Build and manage infrastructure using Infrastructure as Code with Terraform or CloudFormation
- Act as SME for networking VPCs security groups load balancers and inter-service communication
- Manage CDN DNS and secure network access with Cloudflare WARP and Tailscale
- Architect and implement business continuity and disaster recovery solutions
- Optimize AWS costs through right-sizing reserved instances and lifecycle management
- Own and improve CI/CD pipelines ensuring reliable build test and deployment workflows
- Develop automation scripts and tools using Bash Python or Node.js
- Implement and maintain container orchestration with Docker ECS Portainer and Coolify
- Integrate automated security scanning into CI/CD and enforce IAM and secrets management best practices
- Manage RDS database performance including tuning indexing and backup recovery
- Perform OS-level administration security patching and system hardening
- Design and maintain monitoring observability using CloudWatch Prometheus and Grafana
- Define and enforce SLAs SLOs and run incident response and post-incident reviews
- Lead and coordinate the DevOps team provide technical direction and code reviews
- Maintain infrastructure documentation runbooks and architectural decision records
