Senior Cloud Network Engineer
Graphcore is seeking a Senior Cloud Network Engineer to join our Cloud Platform Team in Bristol, UK. In this role, you will collaborate with Software Platform, Datacentre Operations, and Product Development teams to deploy services on our cutting-edge AI systems. As part of the Software Platform organization, you will be involved in cloud integration, validation, performance benchmarking, optimization, and development of high-performance AI solutions, including in-house AI systems and off-the-shelf high-performance servers, switches, and storage solutions. This hands-on technical role requires a solid background in cloud infrastructure, deployment using Infrastructure-as-Code, observability, high-performance networking, and storage systems.
Key responsibilities include developing and operating high-performance Ethernet infrastructure on our private clouds and supporting internal users by translating end-user and product requirements into deployed services. You will build automation to collect and analyze metrics from the network infrastructure to identify and report issues, and you will work with users to relay information on product-related issues to the Engineering and QA departments. Additionally, you will collaborate with Datacentre Operations Engineers to maintain, tune, and operate the fleet of AI systems at peak performance in our private clouds. You will also engage with external vendors to integrate third-party products into our Cloud Reference Design, focusing on network performance, automation, and resilience.
The ideal candidate will have a bachelor's degree or equivalent practical experience in a relevant subject, with significant hands-on experience with high-end (100Gb/s+) Ethernet switch solutions from one or more vendors. Experience managing on-premises or private-cloud environments is essential, along with a proven track record in software engineering or IT as an individual contributor. Proficiency in Linux scripting (Bash, Python, awk, sed) and system administration (Ubuntu, RHEL and variants) is required. Familiarity with version control systems (preferably Git), Continuous Integration or testing pipelines (GitLab, GitHub), and Infrastructure-as-Code automation tools (Terraform/OpenTofu, Ansible) is also necessary. Strong communication and presentation skills, along with the ability to work independently on critical infrastructure with minimal oversight, are important.
In addition to a competitive salary, Graphcore offers flexible working arrangements, a generous annual leave policy, private medical insurance, a health cash plan, a dental plan, pension (matched up to 5%), life assurance, and income protection. We have a generous parental leave policy and an employee assistance programme that includes health, mental wellbeing, and bereavement support. Our central Bristol office provides a range of healthy food and snacks, along with our own barista bar.
At Graphcore, we are committed to building an inclusive work environment that makes our company a great home for everyone. We understand that there are visible and invisible differences in all of us and offer a flexible approach to interviews, encouraging candidates to discuss any reasonable adjustments they may require.