Infrastructure Engineer (DevOps)
Skills
About the Role
You will ensure the reliability, scalability, and performance of infrastructure supporting decentralized storage and related services. You will manage cloud and bare-metal environments, operate containerized stacks on Linux, implement automation for builds and deployments, and build observability and incident response practices. You will design and document operational procedures, participate in an on-call rotation, work directly with node development teams to resolve production issues, and collaborate with the wider open-source community to improve infrastructure based on data and research.
Requirements
- Practical knowledge of at least one programming language
- Demonstrable experience with modern Infrastructure as Code tools and CI/CD best practices
- 3+ years managing resources in AWS, GCP, or Azure
- 3+ years implementing observability and incident response best practices
- 3+ years administering and maintaining Linux hosts
- Excellent communication skills with the ability to document technical details clearly
- Ability to work autonomously and collaboratively in a remote setting
- As a plus: Experience with Filecoin or similar decentralized storage networks
- As a plus: Practical knowledge of Golang, TypeScript, Solidity, or Rust
- As a plus: Experience running blockchain nodes or validators
- As a plus: Experience using Terraform, Helm, Ansible
- As a plus: Experience working with bare metal
- As a plus: Understanding at least two of Web Security, Web3 Security, Cloud Security, Systems Security, or Applied Cryptography
- As a plus: Timezone compatibility with EST or CET
Responsibilities
- Build and maintain tooling to support the Filecoin ecosystem and related infrastructure
- Oversee globally distributed environments and optimize performance and cost efficiency
- Implement automation around builds, deployment, and observability
- Support internal node development teams to resolve issues
- Design, implement, and document infrastructure operation procedures
- Participate in on-call rotation and respond to incidents outside business hours
- Collaborate with the community to evolve infrastructure architecture based on data and ecosystem research
Benefits
- Remote work
- Flexible work hours
