What are the responsibilities and job description for the Model Behavior Architect, Alignment Finetuning position at The Rundown AI, Inc.?

About the Role :

As a Model Behavior Architect at Anthropic, you'll be at the forefront of shaping AI system behavior to ensure it aligns with human values. Working within the Alignment Finetuning team, you'll combine your expertise in model evaluation, prompt engineering, and ethical judgment and knowledge to help create AI systems that respond with good judgment across diverse scenarios.

Responsibilities :

Interact with models to carefully identify where model behavior and judgment can be improved
Gather internal and external feedback on model behavior to document areas for improvement
Design and implement subtle prompting strategies and data generation pipelines that improve model responses
Identify and fix edge case behaviors through rigorous testing of your data generation pipelines
Develop evaluations of language model behaviors across judgment-based domains like honesty, character, and ethics
Work collaboratively with researchers on related teams like Trust and Safety, Alignment Science, and Applied Finetuning

You May Be a Good Fit If You :

Have extensive experience with prompt engineering and chaining for language models

Demonstrate strong skills in evaluating AI system outputs on subtle or fuzzy tasks

Have a background in philosophy, psychology, data science, or related fields

Care about AI safety and the ethical implications of both current and future AI behaviors

Are comfortable using basic Python and running basic scripts

Have a keen eye for identifying subtle issues in AI outputs

Understand how LLMs are trained and are familiar with concepts in reinforcement learning

Have experience finetuning large language models

Are happy to engage in test-driven development and to carefully analyze data and data pipelines

Strong Candidates May Also Have :

Formal training in ethics or moral philosophy or moral psychology

Experience in data science with emphasis on data verification

Conceptual understanding of language model training and finetuning techniques

Previous experience developing evaluation frameworks for large language models

Background in AI safety research or similar fields

Experience with RLHF, constitutional AI, or other alignment techniques

Published work related to AI ethics or safety

Knowledge of model behavior benchmarking

Join us in our mission to ensure advanced AI systems behave reliably and ethically while staying aligned with human values.

J-18808-Ljbffr

Apply for this job

Receive alerts for other Model Behavior Architect, Alignment Finetuning job openings

What is the career path for a Model Behavior Architect, Alignment Finetuning?

Sign up to receive alerts about other jobs on the Model Behavior Architect, Alignment Finetuning career path by checking the boxes next to the positions that interest you.

Model Maker

Income Estimation:

$63,912 - $88,987

Model Maker, Sr.

Income Estimation:

$78,601 - $108,479

Mental Health Technician

Income Estimation:

$36,885 - $46,221

Behavioral Health Supervisor

Income Estimation:

$79,078 - $104,694

Intake Coordinator

Income Estimation:

$55,611 - $73,900

Behavior Analyst

Income Estimation:

$65,218 - $79,682

Behavior Analyst

Income Estimation:

$65,218 - $79,682

Behavioral Health Supervisor

Income Estimation:

$79,078 - $104,694

Job openings at The Rundown AI, Inc.

Corporate Counsel

The Rundown AI, Inc.

California, MO Full Time

About GleanWe’re on a mission to make knowledge work faster and more humane. We believe that AI will fundamentally trans...

(Senior) Medical Science Liaison

The Rundown AI, Inc.

MD Full Time

Passionate about precision medicine and advancing the healthcare industry? Recent advancements in underlying technology ...

Solutions Architect, Llama

The Rundown AI, Inc.

Menlo, CA Full Time

We are seeking an experienced Solutions Architect to join our LlamaX Enterprise & Government team within our Partner Eng...

Corporate Legal Specialist

The Rundown AI, Inc.

San Francisco, CA Full Time

About the role We are seeking an experienced Corporate Legal Specialist to join our dynamic legal team. As we continue t...

Not the job you're looking for? Here are some other Model Behavior Architect, Alignment Finetuning jobs in the San Francisco, CA area that may be a better fit.

Model Behavior Architect, Alignment Finetuning

What are the responsibilities and job description for the Model Behavior Architect, Alignment Finetuning position at The Rundown AI, Inc.?

What is the career path for a Model Behavior Architect, Alignment Finetuning?

Job openings at The Rundown AI, Inc.

Not the job you're looking for? Here are some other Model Behavior Architect, Alignment Finetuning jobs in the San Francisco, CA area that may be a better fit.

We don't have any other Model Behavior Architect, Alignment Finetuning jobs in the San Francisco, CA area right now.

AI Assistant is available now!