Urgently Hiring
DevOps and Infrastructure Engineer
Skills
About the Role
You will run and scale the agent fleet on Railway and AWS with isolated per-user agent processes. You will stand up private networking between user agents, MCP servers, and the trading runtime using Tailscale. You will build the per-user deploy pipeline including containers, secrets, and environment configuration so onboarding a new agent is one click for the user and zero-touch for us. You will own monitoring, alerting, and on-call for the live fleet, ensuring graceful restarts, key rotation, and handling partial outages to prevent loss of in-flight trade state.
Requirements
- Production experience with Railway or equivalent container PaaS such as Fly Render or Northflank.
- Deep AWS expertise including VPC, IAM, Secrets Manager, ECS and Lambda.
- Production experience with Tailscale including ACLs, subnet routing, and ephemeral nodes.
- Strong Docker, CI/CD, and Infrastructure as Code skills.
- On-call and real incident response experience.
- Blockchain or on-chain experience.
- Financial-grade reliability background with emphasis on uptime and PnL.
Responsibilities
- Run and scale the agent fleet on Railway and AWS with isolated per-user agent processes and a predictable cost and clean lifecycle.
- Stand up private networking between user agents, MCP servers, and the trading runtime using Tailscale.
- Build the per-user deploy pipeline including containers, secrets, and environment configuration so onboarding a new agent is one click for the user and zero-touch for us.
- Own monitoring, alerting, and on-call for the live fleet with graceful host restarts, key rotation, and handling partial outages to prevent loss of in-flight trade state.
