What are the responsibilities and job description for the ML Engineer position at Oumi?
About Oumi
Why we exist: Oumi is on a mission to make frontier AI truly open for all. We are founded on the belief that AI will have a transformative impact on humanity, and that developing it collectively, in the open, is the best path forward to ensure that it is done efficiently and safely.
What we do: Oumi provides an all-in-one platform to build state-of-the-art AI models, end to end, from data preparation to production deployment, empowering innovators to build cutting-edge models at any scale. Oumi also develops open foundation models in collaboration with academic collaborators and the open community.
Our Approach: Oumi is fundamentally an open-source first company, with open-collaboration across the community as a core principle. Our work is:
The ML Engineer will be a crucial part of the team, working to build and maintain the infrastructure that powers Oumi's open AI platform. This role combines platform engineering with machine learning expertise, focusing on creating a reliable and scalable environment for open AI development. As an open-source project and platform, code excellence is key to ensure stability for our thousands of users with access and active contribution to state of the art research on our platform.
What you'll do:
Why we exist: Oumi is on a mission to make frontier AI truly open for all. We are founded on the belief that AI will have a transformative impact on humanity, and that developing it collectively, in the open, is the best path forward to ensure that it is done efficiently and safely.
What we do: Oumi provides an all-in-one platform to build state-of-the-art AI models, end to end, from data preparation to production deployment, empowering innovators to build cutting-edge models at any scale. Oumi also develops open foundation models in collaboration with academic collaborators and the open community.
Our Approach: Oumi is fundamentally an open-source first company, with open-collaboration across the community as a core principle. Our work is:
- Open Source First: All our platform and core technology is open source
- Research-driven: We conduct and publish original research in AI, collaborating with our community of academic research labs and collaborators
- Community-powered: We believe in the power of open-collaboration and welcome contributions from researchers and developers worldwide
The ML Engineer will be a crucial part of the team, working to build and maintain the infrastructure that powers Oumi's open AI platform. This role combines platform engineering with machine learning expertise, focusing on creating a reliable and scalable environment for open AI development. As an open-source project and platform, code excellence is key to ensure stability for our thousands of users with access and active contribution to state of the art research on our platform.
What you'll do:
- Training Infrastructure: Design, develop, and maintain the core platform infrastructure for Oumi, ensuring it is robust, scalable, and efficient for AI model development, training and deployment.
- ML Pipeline Implementation: Implement and optimize machine learning pipelines, including data preparation, model training, evaluation, and deployment.
- Scalability: Design and implement solutions for scaling the platform to handle large datasets and models, ensuring it can meet the needs of the community.
- Performance Optimization: Identify and resolve performance bottlenecks in the platform and ML pipelines, ensuring smooth execution and rapid iteration.
- Automation: Automate infrastructure provisioning, deployment, and monitoring processes to ensure high reliability and efficiency.
- Collaboration: Work closely with the research and engineering teams to support their development workflows and ensure the platform meets their needs.
- Open Source Contribution: Contribute to and help guide the development of Oumi's open-source platform and models.
- Experience: Proven experience in platform engineering, DevOps, or related fields, with a strong understanding of infrastructure-as-code and cloud technologies.
- ML Knowledge: Solid understanding of machine learning concepts and experience with ML workflows, including data preparation, model training, and evaluation.
- Programming Skills: Proficiency in programming languages such as Python, with experience in software development practices.
- Cloud Technologies: Experience with cloud platforms such as AWS, Google Cloud, or Azure.
- Scalability: Experience in designing scalable systems and implementing distributed computing architectures.
- Open Source: Familiarity with open-source projects and a passion for contributing to the open-source community.
- Values: Share Oumi's values: Beneficial for all, Customer-obsessed, Radical Ownership, Exceptional Teammates, Science-grounded.
- Competitive salary: $140,000 - $220,000
- Equity in a high-growth startup
- Comprehensive health, dental and vision insurance
- 21 days PTO
- Regular team offsites and events
Salary : $140,000 - $220,000