Staff Software Engineer, Site Reliability at Character
About the Role
As a founding member of the Site Reliability Engineering function at Character, you will support infrastructure that spans thousands of nodes, holds terabytes of data, and serves millions of daily active users.
Responsibilities
- Maintain production services and keep them operational.
- Develop tools, instrumentation, and automation to monitor and optimize the performance and reliability of our service.
- Collaborate with development teams to design and implement scalable, reliable systems and CI/CD processes for deployment.
- Establish and support SLAs and SLOs for our site.
Requirements
- 5 years of experience in a development-focused DevOps/SRE role within a technology organization with significant scale.
- Deep experience with and proven success in developing software tools and automation using Python and Golang.
- Expertise with SQL, Linux, CI/CD, Kubernetes, and Terraform, applied to supporting a site or application on large multi-node infrastructure with a growing user base.
About Character.AI
Character.AI empowers people to connect, learn, and tell stories through interactive entertainment. Our platform lets users engage with tens of millions of characters, enjoy unlimited conversations, and embark on infinite adventures.
Our Values
We value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, or disability.