Search...
Urgently Hiring

DevOps and Infrastructure Engineer

Skills

About the Role

You will run and scale the agent fleet on Railway and AWS with isolated per-user agent processes. You will stand up private networking between user agents, MCP servers, and the trading runtime using Tailscale. You will build the per-user deploy pipeline including containers, secrets, and environment configuration so onboarding a new agent is one click for the user and zero-touch for us. You will own monitoring, alerting, and on-call for the live fleet, ensuring graceful restarts, key rotation, and handling partial outages to prevent loss of in-flight trade state.

Requirements

  • Production experience with Railway or equivalent container PaaS such as Fly Render or Northflank.
  • Deep AWS expertise including VPC, IAM, Secrets Manager, ECS and Lambda.
  • Production experience with Tailscale including ACLs, subnet routing, and ephemeral nodes.
  • Strong Docker, CI/CD, and Infrastructure as Code skills.
  • On-call and real incident response experience.
  • Blockchain or on-chain experience.
  • Financial-grade reliability background with emphasis on uptime and PnL.

Responsibilities

  • Run and scale the agent fleet on Railway and AWS with isolated per-user agent processes and a predictable cost and clean lifecycle.
  • Stand up private networking between user agents, MCP servers, and the trading runtime using Tailscale.
  • Build the per-user deploy pipeline including containers, secrets, and environment configuration so onboarding a new agent is one click for the user and zero-touch for us.
  • Own monitoring, alerting, and on-call for the live fleet with graceful host restarts, key rotation, and handling partial outages to prevent loss of in-flight trade state.