What are the responsibilities and job description for the Senior Software Engineer - Asynchronous Processing position at Klaviyo Inc.?
Klaviyo is growing fast and we have openings for all skill levels across all of our teams. Learn more about our engineering culture at https : / / klaviyo.tech
Maximise your chances of a successful application to this job by ensuring your CV and skills are a good match.
This role will be part of our Site Reliability Engineering (SRE) Team. SRE is what you get when you treat system operations as a software engineering problem. The mission of the Site Reliability Engineering team is to provide services, tooling, and guidance to Klaviyo's product engineers to make them more productive and ensure their services are sufficiently reliable, scalable, and secure.
For the Asynchronous Processing Team : Our most active current project is building an async platform for Klaviyo's various async workloads on Apache Pulsar. Klaviyo is mainly a Python shop, so a lot of it involves building client libraries for our internal users in Python, along with operating the actual Pulsar cluster. It's at scale, so all of that is a challenge. If any of this sounds interesting to you, please reach out to me and I can tell you more.
The SRE team builds foundational backend services as well as tooling and automation to allow product teams to release and scale their software reliably and predictably. SREs are team players who work collaboratively among themselves and with engineers from product teams to build the platform Klaviyo relies on to power its products.
You will own multiple foundational Klaviyo services and make a big impact on the productivity of our product engineering teams.
How you will make a difference :
- Ship foundational services to enable Klaviyo engineering to move faster with confidence
- Design and develop systems and processes that enable highly available & scalable systems
- Design, build and deliver software to dramatically improve the availability, scalability, latency, and efficiency of Klaviyo’s services
- Achieve breakthroughs in systems throughput by identifying and eliminating bottlenecks
- Leverage technology such as Python, Go, Bash, Django, AWS, Kubernetes, Terraform, MySQL, Apache Pulsar, Redis, and Clickhouse to advance Klaviyo’s platform
- Champion best practices by actively collaborating with other teams in a culture that values technical design review
- Contribute to the company as a subject matter expert in multiple areas, constantly pushing yourself to be a better engineer and to level up all of your peers within your team and within Klaviyo.
- Mentor and pair with other Klaviyo engineers to build better software by focusing on performance, self-healing systems, configuration as code; defensive programming, application security, etc.
- Participate in periodic on-call duties with a focus on solving issues when they are discovered, preventing recurrences, and minimizing alert fatigue
- Work hand-in-hand with product-facing engineers to ship impactful code
- Perform quantitative analysis to understand and scale Klaviyo systems and manage the cross-functional effort to resolve scalability issues
- Produce and advocate for preventative, upstream solutions with internal stakeholders and external vendors and dependencies
- Confidently make informed, data-driven decisions in a fast-paced environment with competing priorities
- Evangelize Site Reliability best practices across the engineering organization and community
Who You Are :
J-18808-Ljbffr