Research Engineer, Trust & Safety

Details of the offer

About the role

We are looking for Research Engineers to help design and build safety and oversight algorithms for our AI models and products.
As a Trust and Safety Research Engineer, you will design and train ML models, informed by current research, that detect harmful user and model behaviors and help protect society's well-being.
You will apply your research skills to uphold our principles of safety, transparency, and oversight while enforcing our terms of service and acceptable use policies.

What you will be working on:

- Design, iterate on, and build ML models to detect unwanted or anomalous behaviors from both users and LLMs
- Work with T&S ML engineers to review and iterate on experiment ideas. Co-author the experiment success criteria and production deployment roadmaps
- Partner with T&S Policy and Enforcement cross-functional teams to understand emerging and sustained abuse patterns from user prompts and behaviors. Incorporate the insights into T&S research datasets
- Surface abuse patterns to sibling research teams in the company. Collaborate to harden Anthropic's LLMs at the pre/post training stages
- Stay current with state-of-the-art research in AI and machine learning, and propose ways to apply these advancements to T&S systems

You may be a good fit if you:

- Have 4+ years of experience in a research engineering or applied research scientist position, preferably with a focus on trust and safety
- Have significant Python programming experience and machine learning experience
- Have proficiency in building trustworthy and safe AI technology
- Have strong communication skills and the ability to explain complex technical concepts to non-technical stakeholders
- Care about the societal impacts and long-term implications of your work and are results-oriented

Strong candidates may also:

- Have experience fine-tuning large language models with supervised learning or reinforcement learning
- Have experience with machine learning frameworks like scikit-learn, TensorFlow, or PyTorch
- Have experience authoring research papers in machine learning, NLP, or AI alignment, or similar industry experience
- Have developed evaluations for language models

Nominal Salary: To be agreed

Source: Jobleads

Job Function:

Science
