at Apple
Location
Cupertino, United States of America
Compensation
$181k–$318k USD
Type
Full-time
Posted
1 week ago
In this role you'll contribute to the infrastructure, tooling, and pipelines that let us evaluate Siri reliably and at scale. You'll have meaningful autonomy in how you get there, and the work will move across several areas of expansion as priorities evolve. The specific platforms, frameworks, and components will change over time, so we're looking for someone who can transition smoothly across them and bring strong evaluation and systems engineering fundamentals to whatever the team needs next.
Extending evaluation capabilities to new devices, platforms, and runtime environments, with designs that favor portability over any single target
Supporting the evaluation of new Siri features and interaction modalities, working from ambiguous early requirements toward concrete, automated coverage
Diagnosing failures across the stack, from environment provisioning through pipeline execution to scoring, enabling auto-diagnostics and driving durable fixes
Contributing to architecture decisions for the team's evaluation systems
Partnering across engineering, infrastructure, and program teams to align on interfaces, priorities, and shared standards
Strong programming skills in one or more compiled languages (Swift, C++, or Objective-C)
Python scripting skills for tooling and automation
Solid understanding of computer science fundamentals
Ability to quickly learn new technologies and adapt to evolving requirements
Excellent communication skills and ability to collaborate across teams
M.S. or B.S. in Computer Science, Machine Learning, or related field (or equivalent experience)
Experience staging, provisioning, or controlling test or evaluation environments to produce repeatable, deterministic conditions
Experience evaluating ML, LLM, or agent-based systems, including familiarity with metrics, scoring methodology, or trajectory and outcome analysis
Experience designing or operating test infrastructure at scale, such as device provisioning, environment restore, warm pools, or continuous integration systems
Proficiency with Python and Swift in a production setting
A track record of approaching problems flexibly and cutting through ambiguity, adapting your approach to reach the right outcome and setting a clear path when requirements are not yet defined
A talent for focusing and simplifying, stripping away what is not essential and distilling complex decisions down to the factors that matter
A history of collaborating across teams and communicating effectively with both technical and program audiences
At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish.
Do you want to help measure and improve the quality of Siri across the devices, features, and experiences people rely on every day? Apple's Agentic Evaluation Engineering organization builds the infrastructure that determines how Siri's quality is measured, trusted, and improved. You'll join a team focused on expanding what that platform can reach: the devices and environments we evaluate on, the features and interaction modalities we exercise, and the realistic, repeatable conditions we stage to ground each evaluation. The surface area is large and growing. You'll have real autonomy in how you tackle it, and you'll build infrastructure the team can rely on as priorities shift.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant
At Apple, we believe accessibility is a fundamental human right. You’ll find that idea reflected in everything here — in our culture, our benefits and our digital tools. By welcoming as many perspectives as possible, we help you build a career where you feel like you belong.
Learn about accessibility in Apple’s workplace
Learn about reasonable accommodations for job applicants
Apple accepts applications to this posting on an ongoing basis.