AIML - ML Engineer, Machine Learning Platform & InfraDo you feel you think differently, are eager to break the status quo, bold and ambitious, and aren't afraid to take risks? If yes, what better place to be than Apple? At Apple, we think differently and push the boundaries of computing and intelligence. We build products that bring smiles to people's faces. The Foundation Model Infrastructure team, within the Machine Learning Platform Technologies organization, is the backbone of Apple Intelligence. It builds frameworks, services, and tools that power the largest Apple foundation models on servers. Our Infrastructure powers a wide range of services at Apple including Apple Search, Apple Music, AppleTV, AppStore, iMessages, Photos & Camera, Spotlight, Safari, Siri, and upcoming exciting Apple products, serving millions of queries every day with incredible low latencies, drawing every ounce of compute from our hardware. As part of this group, you will have the chance to bring intelligence to billions of users across the world and make a difference in people's lives. You will work on optimizing billions of parameter language, vision, and speech models using state-of-the-art technologies and make them run at the scale of Apple.
Description Work alongside the Foundation Model Research team to optimize inference for cutting-edge model architectures. Work closely with product teams to build production-grade solutions to launch models serving millions of customers in real-time. Build tools to understand bottlenecks in inference for different hardware and use cases. Mentor and guide engineers in the organization.
Minimum Qualifications Demonstrated experience in leading and driving complex, ambiguous projects.Experience with high throughput services, particularly at supercomputing scale.Proficient in running applications on Cloud (AWS, Azure, or equivalent) using Kubernetes and Docker.Familiar with GPU programming concepts using CUDA and with popular machine learning frameworks like PyTorch or TensorFlow.Preferred Qualifications Proficient in building and maintaining systems written in modern languages (e.g., Go, Python).Familiar with fundamental deep learning architectures such as Transformer models and encoder/decoder models.Familiar with NVIDIA TensorRT-LLM, vLLM, DeepSpeed, and NVIDIA Triton Inference Server.Experience in writing custom CUDA kernels using CUDA or OpenAI Triton.At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $166,600 and $296,300, and your base pay will depend on your skills, qualifications, experience, and location.
Apple employees also have the opportunity to become shareholders through participation in Apple's discretionary employee stock programs. Employees are eligible for discretionary restricted stock unit awards and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and reimbursement for certain educational expenses related to advancing your career at Apple. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.
Note: Apple benefits, compensation, and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
Apple is an equal opportunity employer committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.
#J-18808-Ljbffr