Staff Engineer - Server Hardware Compute Blade and Rack Validation Lead

🇺🇸 Austin, Texas
$2K - $3K Annual
Posted 2 months ago
Expires July 21, 2026
Full TimeHybridEngineeringProduct

Graphcore is seeking a senior validation lead engineer to spearhead at-scale rack validation efforts for next-generation AI hyperscale systems. This role focuses on post-silicon system validation across the full lifecycle, ensuring functional, electrical, and thermal performance meets product objectives. The successful candidate will own end-to-end blade and rack validation, including planning, development, execution, and debugging, while collaborating across firmware, systems, and hardware teams.

Key responsibilities include leading post-silicon validation of AI compute blades and racks, encompassing test planning, development, and automation. The role involves driving provisioning and integration of system components such as SoC firmware, BMC, RMC, and OS for rack-level readiness. The engineer will own execution against program achievements, report validation progress and risks, triage test failures, collect debug data, and collaborate on root cause analysis. Additionally, the position requires tracking validation coverage and continuously improving test processes and infrastructure, collaborating with ODM/JDM partners on validation and quality, and mentoring engineers to drive engineering excellence.

The ideal candidate will possess a Bachelor's or Master's degree or equivalent experience in Computer Engineering, Electrical Engineering, Computer Science, or a related field. A proven track record in system, rack, or embedded validation with leadership experience is essential. Strong experience in large-scale hardware validation environments, expertise in CPU/GPU, memory, IO, and firmware validation, and proficiency with Linux/server OS and automation using Python/Bash are required. Knowledge of IPMI, Redfish, PLDM, experience with CI/CD pipelines, and hardware interfaces are also necessary. Desirable qualifications include experience in hyperscale environments, familiarity with OpenBMC and processes for verifying firmware functionality, knowledge of firmware security and HIL testing, and experience with test management tools.

Graphcore offers a competitive benefits package, including healthcare options such as day-one medical coverage through Cigna/Kaiser with PPO and HDHP options, employer HSA contributions, dental/vision, life insurance at 3x salary, disability, and mental-health support via Spring Health. Retirement support includes a 401(k) with a 100% company match up to 6% with a year-end true-up. Leave and time-off policies feature flexible or unlimited PTO with 11 paid U.S. holidays and paid family leave.

Joining Graphcore provides an opportunity to be at the forefront of the machine intelligence revolution, enabling innovators from all industries to build AI-native products that expand human potential. The company fosters a culture of continuous learning and constant innovation, bringing together the brightest minds to solve the toughest problems in a place where everyone has the opportunity to make an impact on the company, its products, and the future of artificial intelligence.

More Jobs at Graphcore