AI Engineer
Company: Diverse Lynx


Details of the offer

AI + HPC infrastructure requirement

Looking for someone with architecture and design experience, along with experience handling clusters of 1000+ nodes.

Technical/Functional Skills -
Proficiency in RoCEv2, K8s, KVM, Ubuntu, Python, Shell, Go, Rust, GPU drivers, and cluster interconnects with 200G/400G networking.
Experience managing GPU clusters and optimizing GPU-based services, tools, and software.

Roles & Responsibilities -

Develop, implement, and maintain GPU-based clusters of 10 to 1000 nodes, ensuring optimal performance and availability.
Administer Client/AI platforms - Distributed Client services, LLMs, Vector-DB and AI inferencing, by managing deployments, resource allocation, monitoring, and security.
Collaborate with cross-functional teams to address AI infrastructure requirements, support AI-related projects, and provide technical expertise.
Monitor and evaluate the performance of AI systems and clusters, ensuring that they adhere to industry best practices and meet company standards.
Compile reports, document procedures, and publish recommendations for improving AI infrastructure and solutions.
Use AI/Client to continuously improve the internal processes and tools used in the team's end-to-end service delivery.

Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.


Source: Grabsjobs_Co
