Senior Ml Infrastructure Engineer

Details of the offer

Who is Recruiting from Scratch: Recruiting from Scratch is a talent firm that focuses on placing the best candidate for our clients. Our team is 100% remote and we work with teams across North America, South America, and Europe to help them hire. Senior ML Infrastructure Engineer | AI Infrastructure Scale-Up | SF Based Base: $180K - $300K + Equity (0.1-3%) | Visa Sponsorship Available

Are you excited about building the future of AI infrastructure? We're scaling our inference systems to handle millions of LLM requests daily, and we need exceptional talent to drive this growth.

The Role: We're seeking a Senior ML Infrastructure Engineer to architect and implement large-scale, fault-tolerant systems. You'll be joining a team that's pushing the boundaries of AI infrastructure, handling hundreds of millions of API calls daily.

What You'll Do:

Design and implement distributed systems for our inference network Develop resource allocation models across heterogeneous hardware Optimize network performance metrics (latency, throughput, availability) Build robust monitoring and observability systems Drive architectural decisions and best practices Collaborate directly with founders and engineering teams What You Bring:

5+ years building high-performance, scalable distributed systems Strong programming skills in TypeScript, Python, and either Go, Rust, or C++ Experience with Kubernetes/Nomad orchestration Hands-on experience with AI tooling (ChatGPT, Claude, Cursor) GPU programming and optimization skills (CUDA experience is a plus) Startup experience (pre-seed to series A) Bonus Points:

Experience with LLM inference engines (vLLM, TensorRT-LLM) Track record of scaling distributed systems Location & Details:

San Francisco, CA (In-person) Full-time W-2 position




#J-18808-Ljbffr


Nominal Salary: To be agreed

Source: Jobleads

Requirements

Manager, Software Engineering

Company Overview Docusign brings agreements to life. Over 1.5 million customers and more than a billion people in over 180 countries use Docusign solutions t...


Docusign, Inc. - California

Published 12 days ago

Corporate Functions Opportunities Mountain View, Ca (Remote)

At Groq, we believe AI will change humanity forever, and that making it affordable and universally accessible is the key to human agency in an AI economy. We...


Groq Inc. - California

Published 12 days ago

Technical Account Manager

Team Description Pendo's Technical Account Managers play a crucial role by providing proactive, strategic, and technical guidance to ensure customers are abl...


Pendo - California

Published 12 days ago

Senior Software Engineer, Fullstack (Cdp Api)

The Coinbase Developer Platform APIs are the easiest way for developers to get started with building crypto applications. Conceived by Coinbase CEO Brian Arm...


Coinbase Developer Platform - California

Published 12 days ago

Built at: 2024-12-22T10:57:48.308Z