What are the responsibilities and job description for the Research Engineer, Model Behavior position at OpenAI?
About the Team
The Model Behavior team shapes how our models interact with people. , aiming for intuitive experiences that exceed user expectations and feel like magic.
The team partners closely with research and product teams across the company to improve the real-world usefulness of our models at scale. Our work directly impacts millions of users globally and contributes to OpenAI's mission of broadly distributing safe AI.
About the Role
As research engineer, you will research and develop improvements to our models. Our team works in research areas combining reinforcement learning and products.
We're looking for individuals with strong ML engineering skills. An ideal candidate is passionate about applying both creative and robust engineering approaches and internally proven state-of-the-art research methods to bring out the magic in OpenAI’s models.
Some experience working with large language models is a bonus.
This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.
In this role, you will :
Do the highest-leverage work to improve models for our users at scale — including the bullet points below.
Use the latest research methods to understand and act on how users use our models and where our models are falling short.
Design, implement, test, and debug code across our product and research stack.
Build robust evaluations for defining and tracking improvements in model behavior.
Own and support experiments that tweak model behavior.
You might thrive in this role if you :
Are willing to own important problems end-to-end, while also having good delegation skills and are willing to pick up whatever knowledge you're missing to get the job done.
Have a working knowledge of relevant models, and building evaluations for model capability improvements.
Are experienced in collaborating with cross-functional teams to ensure that reliability and scalability are considered in the design and development of new systems and tools
Have a humble attitude, an eagerness to help your colleagues, and a desire to do whatever it takes to make the team succeed
Are excited about going deep in the weeds of disparate data points to identify patterns that will inform how we make life better for users
As a bonus, have understanding of AI / ML workloads or working knowledge and experience in building evaluations for large language and multimodal models.