at Apple
Location
Seattle, United States of America
Compensation
$172k–$302k USD
Type
full time
Posted
1 months ago
Market range · company + function + seniority
p25 · target · p75 · n=653
Posted $302k · in the market band
Tailor your résumé to this role in 30 seconds.
Free account · ATS keyword check · per-job bullet rewrite by Claude.
Work along side Foundation Model Research team to optimize inference for cutting edge model architectures.
Work closely with product teams to build Production grade solutions to launch models serving millions of customers in real time.
Build tools to understand bottlenecks in Inference for different hardwares and use cases.
Mentor and guide engineers in the organization.
Collaborate with the Foundation Model Research team to optimize inference for cutting edge model architectures
Work closely with product teams to build Production grade solutions to launch models serving millions of customers in real time
Build profiling tools, simulators to understand the bottlenecks
Mentor and guide engineers in the organization
5+ years of experience leading and driving complex, ambiguous projects.
Experience with LLM inference stack
Familiarity with GPU programming concepts using CUDA.
Familiarity with one of the popular ML Frameworks like Pytorch, Tensorflow.
Have experience with high throughput services particularly at supercomputing scale.
Proficient with running applications on Cloud (AWS / Azure or equivalent) using Kubernetes, Docker etc.
Familiar with one of the popular ML Frameworks like Pytorch, Tensorflow.
BS in Computer Science, Artificial Intelligence, Machine Learning, Information Retrieval, Data Science or related field
Proficient in building and maintaining systems written in modern languages (eg: Golang, Python)
Familiar with fundamental Deep Learning architectures such as Transformers, Encoder/Decoder models.
Familiarity with Nvidia TensorRT-LLM, vLLM, DeepSpeed, Nvidia Triton Server etc.
Experience writing custom CUDA kernels using CUDA or OpenAI Triton.
MS in Computer Science, Artificial Intelligence, Machine Learning, Information Retrieval, Data Science or related field.
We are Foundation Model Inference Team, within AI, Search & Knowledge Platform Technologies organization. Our team is responsible to build Inference stack to power Apple Intelligence. It builds frameworks, services and tools that power the largest Apple foundation models on servers. Our Infrastructure powers a wide gamut of services at Apple including Apple Search, Apple Music, AppleTV, AppStore, iMessages, Photos & Camera, Spotlight, Safari, Siri and upcoming ever exciting Apple products serving millions of queries every day with incredible low latencies, drawing every ounce of compute from our hardware. As part of this group, you will get a chance to bring Intelligence to billions of users across the world. You will have an opportunity to make difference in life of people by empowering them with AI. You will have a chance to work on optimizing billions of parameter langauge and vision and speech models using state of the art technologies and make it run at scale of Apple.
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $171,600 and $302,200, and your base pay will depend on your skills, qualifications, experience, and location.Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant
At Apple, we believe accessibility is a fundamental human right. You’ll find that idea reflected in everything here — in our culture, our benefits and our digital tools. By welcoming as many perspectives as possible, we help you build a career where you feel like you belong.
Learn about accessibility in Apple’s workplace
Learn about reasonable accommodations for job applicants
Apple accepts applications to this posting on an ongoing basis.
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $181,100 and $318,400, and your base pay will depend on your skills, qualifications, experience, and location.Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant
At Apple, we believe accessibility is a fundamental human right. You’ll find that idea reflected in everything here — in our culture, our benefits and our digital tools. By welcoming as many perspectives as possible, we help you build a career where you feel like you belong.
Learn about accessibility in Apple’s workplace
Learn about reasonable accommodations for job applicants
Apple accepts applications to this posting on an ongoing basis.
More open roles at Apple
Hiring velocity, headcount trend, and every open posting on one page.
Open postings ranked by description similarity — useful if this role isn't quite right.