Urgently Hiring
Senior Payments Platform Engineer
Skills
API Gateway, Stripe, EventBridge, FCA, Kinesis, MSK, ElastiCache, Chaos Engineering, CDK, Solana, IaC, SOC 2, OpenTelemetry, Bash, ECS, RDS, SLO, HSM, EKS, SQS, CloudWatch, GitHub Actions, Terraform, PostgreSQL, Observability, AWS, CI/CD, Grafana, Prometheus, FinOps, Cost Engineering, Python, Plaid, Rust, Docker, Kubernetes, PCI DSS, Disaster Recovery, Secrets Management, Chainalysis, Pulumi, Kafka, KMS, SNS
About the Role
You will design, build, and operate the cloud-native infrastructure that powers fiat-to-crypto payment flows. You will own the Infrastructure as Code lifecycle, implement secure network and key management, and build CI/CD pipelines and observability that surface real payment failures. You will define disaster recovery and backup strategies, run reliability experiments and load tests, and drive cost engineering. You will mentor backend engineers on safe platform patterns and help ensure systems meet audit and regulatory requirements.
Requirements
- Hands-on production expertise with AWS core services
- Payments or financial services infrastructure experience (neobank, PSP, or bank)
- Large-scale Infrastructure as Code ownership experience
- Production Kubernetes (EKS) or ECS cluster operations experience
- Experience running RDS/Aurora PostgreSQL for transactional workloads
- Production experience with Kafka (MSK) or Kinesis event streaming
- Experience working with PCI DSS, SOC 2, or FCA/PRA regulatory obligations
- On-call experience for 24/7 financial systems with runbooks and incident response
Responsibilities
- Architect and operate multi-region AWS payments infrastructure
- Own the Infrastructure as Code lifecycle with Terraform, Pulumi, or CDK
- Design and enforce network security architecture for regulated payments
- Build and maintain CI/CD pipelines supporting safe high-frequency deployments
- Define and operate secrets and key management strategies
- Own observability infrastructure, including logging, tracing, and metrics
- Design platform controls for PCI DSS, SOC 2, and regulatory audit readiness
- Manage payments data platform infrastructure and event streaming
- Build reliability tooling, including SLO dashboards, runbooks, and chaos tests
- Own disaster recovery and backup strategy, including runbook testing
- Drive platform cost engineering and rightsizing
- Support and mentor backend engineers on platform best practices
