Quantinuum is seeking to hire a System Reliability Engineer Engineer for our Cambridge-based cloud platform, Quantinuum Nexus. Our team aims to support the effort of quantum researchers at every stage of an experiment, making working on quantum computers as easy as sending an email.
The successful candidate will be an expert at working with managed Kubernetes instances such as Amazon EKS and the distributed systems that can be built on top of them. Managing the architecture, performance, security and cost of this instance will be at the core of what you do. Experience with tools like Helm, Karpenter and k9s will be essential to meeting the goals of this role.
The ideal candidate will have experience with collecting logs, traces and metrics via Opentelemetry and making those available through AWS products like x-ray and cloudwatch. These readings should then be used to ensure that Nexus meets our high standards for performance and reliability or, if it’s falling short, they can be used to direct the team on how best to improve things.
In the event of issues and outages you’ll be active in reporting, monitoring and diagnosing the cause of issues. You should have the programming experience required to read and understand code in production with the intention of matching it up with readings collected by monitoring tools. You will be working closely with the development team to make sure everyone has the information they need to identify and resolve the issue as soon as possible.
The ideal candidate will have:
- Experience with Kubernetes and Docker.
- Experience collecting logs, traces and metrics for distributed systems.
- Experience using tools such as AWS CloudWatch to locate bugs and performance issues.
- Experience improving declarative Infrastructure as Code tools such as Terraform
- Experience working on cloud based systems where uptime and reliability are crucial.
It would be desirable to have:
- Experience withPostgreSQL.
- Experience working in a continuous deployment environment.
- Experience with triaging and debugging issues in code.
- Professional experience working with Python.
- Familiarity with the OpenTelemetry standard and SDKs.
What is in it for you?
Working alongside a highly talented team, with leading names in the quantum computing industry. We offer a highly competitive package, equity, 28 days of paid holiday (in addition to public holidays), a workplace pension, a positive approach to flexible working and enhanced parental and adoption benefits.
About Us:
Science Led, Enterprise Driven – Accelerating Quantum Computing
Quantinuum is the world’s largest integrated quantum company, pioneering powerful quantum computers and advanced software solutions. Quantinuum’s technology drives breakthroughs in materials discovery, cybersecurity, and next-gen quantum AI. With approximately 500 employees, including 370+ scientists and engineers, Quantinuum leads the quantum computing revolution across continents.
Quantinuum recently secured $300m in funding, visit our news pages to learn more about this and other Quantinuum scientific breakthroughs and achievements:https://www.quantinuum.com/news
Please note that employment with us is subject to successfully passing our pre-employment screening checks. We are an inclusive equal opportunity employer. You will be considered without regard to age, race, creed, color, national origin, ancestry, marital status, affectional or sexual orientation, gender identity or expression, disability, nationality, sex, or veteran status.
PI271266366