What are the responsibilities and job description for the Site Reliability Engineer position at writer.com?
About Writer
Writer is the full-stack generative AI platform delivering transformative ROI for the world’s leading enterprises. Named one of the top 50 companies in AI by Forbes and one of the best places to work by Inc. Magazine, Writer empowers hundreds of customers like Accenture, Intuit, L’Oreal, Mars, Salesforce, and Vanguard to transform the way they work.
Writer’s fully integrated solution makes it easy to deploy secure and reliable AI applications and agents that solve mission-critical business challenges. Our suite of development tools is powered by Palmyra – Writer’s state-of-the-art family of LLMs — alongside our industry-leading graph-based RAG and customizable AI guardrails.
Founded in 2020 with office hubs in San Francisco, New York City, Austin, Chicago, and London, our team of over 250 employees thinks big and moves fast, and we’re looking for smart, hardworking builders and scalers to join us on our journey to create a better future of work.
About this role
We are looking for a foundational member of the Cloud Infrastructure team at Writer. This role will involve contributing to the development and implementation of our Site Reliability Engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of Writer’s critical systems, taking a proactive approach to guarantee that our high-ROI products reach our customers seamlessly.
Your responsibilities :
- Lead the design, implementation, and maintenance of Writer, Inc.’s cloud infrastructure to ensure high availability and performance.
- Design and implement scalable cloud automation to support seamless deployment for our largest enterprise customers.
- Automate infrastructure provisioning and management using Terraform & Python.
- Collaborate with development teams to optimize cloud resources and enhance system reliability.
- Develop and maintain monitoring and alerting systems to proactively identify and resolve issues affecting the reliability of our writing solutions.
- Conduct post-mortem analyses of system failures to identify root causes and implement preventive measures.
- Optimize and scale our cloud infrastructure to support growing user demand and ensure cost efficiency.
- Ensure the security and compliance of our systems, adhering to industry standards and regulations.
- Provide mentorship and technical guidance to junior engineers, fostering a culture of reliability and continuous improvement.
- Stay current with emerging technologies and industry trends to continuously improve our site reliability practices.
Is this you?
Preferred Skills & Experience :
Benefits & perks
Writer is an equal-opportunity employer and is committed to diversity. We don't make hiring or employment decisions based on race, color, religion, creed, gender, national origin, age, disability, veteran status, marital status, pregnancy, sex, gender expression or identity, sexual orientation, citizenship, or any other basis protected by applicable local, state or federal law. Under the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
By submitting your application on the application page, you acknowledge and agree to Writer's Global Candidate Privacy Notice .
J-18808-Ljbffr