Manager, Site Reliability Engineering
Veracode is seeking a Manager of Site Reliability Engineering to lead a global team responsible for the reliability, availability, and operational excellence of its production systems. This role focuses on defining and enforcing reliability standards, managing production risks, and ensuring services meet agreed-upon service levels under real-world conditions.
Key responsibilities include leading a nine-member global Site Reliability Engineering team, setting objectives and key results, managing team performance, and acting as the primary point of accountability for reliability concerns spanning multiple teams. The manager will oversee the team's on-call schedule, escalate alerts and production incidents, and collaborate with software engineering teams to ensure effective monitoring and alerting systems are in place. Additionally, the role involves automating infrastructure deployment and management using tools like Terraform and Kubernetes, and driving improvements in observability and alert hygiene across the organization.
The ideal candidate will have a bachelor's degree in Computer Science, Information Science, Engineering, or a related field, with at least two years of experience as a manager or team lead with direct reports, and more than five years of experience in a Site Reliability Engineering, DevOps, Cloud Engineering, or similar role. Required skills include experience with AWS and automation tools such as Terraform, CloudFormation, or Ansible; hands-on experience with Kubernetes clusters; proficiency with observability tools like Datadog, Sumo Logic, Prometheus, or Grafana; familiarity with CI/CD pipelines and repository management tools; strong programming skills in languages like Python or Go; and a solid understanding of infrastructure as code and GitOps methodologies.
Veracode offers a "Take What You Need" time off policy, extensive development and training opportunities, and a generous 401(k) match to support employees' future savings. The company fosters a community of professionals who take pride in their work and are committed to continuous improvement.
As a global leader in Application Risk Management for the AI era, Veracode provides a platform trusted by organizations worldwide to build and maintain secure software from code creation to cloud deployment. Joining Veracode means becoming part of an innovative, high-growth, multi-award-winning company in one of the hottest segments of the security market.