Model Behavior Architect

🇺🇸 San Francisco, California
$2K - $3K Annual
Posted 4 months ago
Expires June 9, 2026

ABOUT THE ROLE

We're looking for a Model Behavior Architect to help build Perplexity's AI products and evaluations. You'll sit within our AI team and collaborate closely with research and product teams, designing prompt and context engineering strategies to deliver high quality user experiences across multiple domains and models.

This role is equal parts craft and science. You'll develop a deep understanding of our answer engine by pressure-testing model capabilities and working across our AI infrastructure (including system and tool prompts, skills, and evaluations) to create a stellar product experience for our users.

You'll serve as a go-to expert on prompting, model quality, and behavioral consistency across new product features and model releases.

KEY RESPONSIBILITIES

- Context Engineering: Design, test, and optimize context strategies and system prompts that shape answer engine behavior across products, features, and use cases.

- Evaluation Systems: Build automated and semi-automated evaluation pipelines that measure model quality, catch regressions, and scale across product surfaces.

- Model Launch Support: Partner with research and engineering to validate model behavior before and during rollouts, ensuring smooth transitions with no degradation.

- Research & Analysis: Identify inconsistencies and failure modes in model outputs through well-designed research projects — for both internal and production-facing systems.

- Cross-functional Collaboration: Work closely with design, product, and research teams to translate product goals into concrete model behavior requirements.

- Knowledge Sharing: Help engineers across teams build intuition for prompt design, context engineering, and evaluation best practices.

- Staying Current: Track the latest alignment, evaluation, and prompting techniques from industry and academia, and bring the best ideas back to the team.

WHAT WE'RE LOOKING FOR

REQUIRED

- Experience designing evaluations, benchma...

More Jobs at Perplexity