Senior Ai Infrastructure Engineer

Details of the offer

As a Senior AI Infrastructure Engineer, you will be responsible for building the next generation, highly available, global, multi-cloud PaaS platform with open-source technologies to enable and accelerate Together AI's rapid growth.
This system spans many diverse environments (Kubernetes, VMs, bare metal compute, and edge deployments) and provides a cohesive and reliable abstraction for running AI workloads in them.
You will get to be a technology thought leader, evangelize new, cutting-edge technologies, and solve complex problems.
To be successful, you'll need to be deeply technical and possess excellent communication, collaboration, and diplomacy skills.
You have experience practicing infrastructure-as-code, including using tools like Terraform and Ansible.
You have strong software development fundamentals and skills.
In addition, you have strong systems knowledge and troubleshooting abilities.
Requirements 5+ years of professional software development experience and proficiency in at least one backend programming language (Golang desired)Demonstrated experience with high performance or distributed cloud microservices architectures and ideally experience building them in operation at a global scale using multiple cloud providers such as AWS, Azure, or GCPExcellent understanding of low level operating systems concepts including multi-threading, memory management, networking and storage, performance, and scalePragmatic, methodical, well-organized, detail-oriented, and self-startingExperience with Kubernetes and containerization, VPNs, AI workloads, and blockchain based protocols a plusGPU programming, NCCL, CUDA knowledge a plusExperience with Pytorch or Tensorflow a plus5+ years experience writing high-performance, well-tested, production quality codeResponsibilities Perform architecture and research work for decentralized AI workloadsWork on the core, open-source Together AI platformCreate services, tools, and developer documentationCreate testing frameworks for robustness and fault-toleranceAbout Together AI Together AI is a research-driven artificial intelligence company.
We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models.
We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama.
We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.
Compensation We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work.
The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits.
Our salary ranges are determined by location, level and role.
Individual compensation will be determined by experience, skills, and job-related knowledge.
Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

#J-18808-Ljbffr


Nominal Salary: To be agreed

Source: Jobleads

Requirements

Backend Engineer, Developer Sdks, Golang Expert

Stripe is a financial infrastructure platform for businesses. Millions of companies - from the world's largest enterprises to the most ambitious startups - u...


Rollbar, Inc. - California

Published 6 days ago

Senior Software Engineer - Containers And Platform

By making evidence the heart of security, we help customers stay ahead of ever-changing cyber-attacks. Corelight is a distributed first cybersecurity startup...


Job Board - California

Published 6 days ago

Software Engineer, Infrastructure

Why HarveyHarvey is a secure AI platform for professionals in law, tax, and finance that augments productivity and automates complex workflows. Harvey uses a...


Harvey.Ai - California

Published 6 days ago

Senior Software Engineer- Reliability

Luma's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality is critical for intelligence. To go beyo...


Luma Ai - California

Published 6 days ago

Built at: 2025-01-22T07:02:19.722Z