What are the responsibilities and job description for the Machine Learning Engineer, Trust & Safety position at Anthropic?
About the role
We are looking for ML engineers to help build safety and oversight mechanisms for our AI systems. As a Trust and Safety Machine Learning Engineer, you will train models that detect harmful behaviors and help ensure user well-being. You will apply your technical skills to uphold our principles of safety, transparency, and oversight while enforcing our terms of service and acceptable use policies.
Responsibilities:
- Build machine learning models to detect unwanted or anomalous behaviors from users and API partners, and integrate them into our production system
- Improve our automated detection and enforcement systems as needed
- Analyze user reports of inappropriate accounts and build machine learning models to detect similar instances proactively
- Surface abuse patterns to our research teams to harden models at the training stage
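To make the responsibilities above concrete, here is a minimal sketch of the kind of anomaly-detection work involved. This is purely illustrative, not Anthropic's actual pipeline: the feature names, data, and contamination rate are all hypothetical, and the model (scikit-learn's IsolationForest) is just one common choice for flagging accounts whose usage deviates sharply from the population.

```python
import numpy as np
from sklearn.ensemble import IsolationForest

rng = np.random.default_rng(0)

# Hypothetical per-account usage features: [requests_per_hour, error_rate].
# Most accounts cluster around typical usage; a few are extreme outliers.
normal = rng.normal(loc=[50.0, 0.02], scale=[10.0, 0.01], size=(500, 2))
anomalous = rng.normal(loc=[500.0, 0.40], scale=[50.0, 0.05], size=(10, 2))
X = np.vstack([normal, anomalous])

# Unsupervised detector: isolates points that are easy to separate
# from the bulk of the data. contamination=0.02 is an assumed prior
# on the fraction of anomalous accounts.
detector = IsolationForest(contamination=0.02, random_state=0).fit(X)
flags = detector.predict(X)  # -1 = anomalous, 1 = normal

n_flagged = int((flags == -1).sum())
```

In production, a detector like this would typically feed a review queue or an automated enforcement system rather than taking action directly.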
You may be a good fit if you:
- Have 4 years of experience in a research/ML engineering role or as an applied research scientist, preferably with a focus on trust and safety.
- Have proficiency in SQL, Python, and data analysis/data mining tools.
- Have proficiency in building trust and safety AI/ML systems, such as behavioral classifiers or anomaly detection.
- Have strong communication skills and ability to explain complex technical concepts to non-technical stakeholders.
- Care about the societal impacts and long-term implications of your work.
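The "behavioral classifiers" mentioned above might look something like the following toy sketch: a supervised model trained on labeled examples (e.g., derived from user reports) to proactively score similar accounts. Everything here is synthetic and assumed for illustration, including the feature names and the choice of logistic regression.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(1)

# Hypothetical features per account: [messages_per_day, flagged_content_rate].
# Labels: 0 = benign, 1 = abusive (in practice, sourced from reviewed reports).
benign = rng.normal(loc=[20.0, 0.01], scale=[5.0, 0.005], size=(300, 2))
abusive = rng.normal(loc=[80.0, 0.30], scale=[10.0, 0.05], size=(60, 2))
X = np.vstack([benign, abusive])
y = np.array([0] * 300 + [1] * 60)

# Hold out a test split to estimate how well the classifier generalizes.
X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, stratify=y, random_state=0
)
clf = LogisticRegression().fit(X_tr, y_tr)
accuracy = clf.score(X_te, y_te)
```

Real systems would add calibration, threshold tuning against precision/recall targets, and human review before enforcement.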
Strong candidates may also have experience with:
- Machine learning frameworks like scikit-learn, TensorFlow, or PyTorch
- High-performance, large-scale ML systems
- Language modeling with transformers
- Reinforcement learning
- Large-scale ETL