What are the responsibilities and job description for the Data Engineer I - Personalization position at Spotify?
The Personalization team makes deciding what to play next easier and more enjoyable for every listener. From Daily Mix to Discover Weekly, we’re behind some of Spotify’s most-loved features. We built them by understanding the world of music and podcasts better than anyone else. Join us and you’ll keep millions of users listening by making great recommendations to each and every one of them.
Within this organization, we are looking for a Data Engineer to join the Home Music team, responsible for the music recommendations users see on Spotify’s Home surface. We craft this entry point to Spotify such that it greets our listeners with a variety of music-related content packaged in ways they find engaging and useful. Our work is highly visible within the company and core to many of Spotify’s core strategic objectives.
Our team is a cross-functional group of Machine Learning and Backend engineers, looking to add a Data Engineer into the mix. You will partner with teammates to design and deploy scalable, personalized recommendation systems, including building data solutions that support the production and evaluation of large-scale ML models.
What You'll Do
Today, we are the world’s most popular audio streaming subscription service.
Within this organization, we are looking for a Data Engineer to join the Home Music team, responsible for the music recommendations users see on Spotify’s Home surface. We craft this entry point to Spotify such that it greets our listeners with a variety of music-related content packaged in ways they find engaging and useful. Our work is highly visible within the company and core to many of Spotify’s core strategic objectives.
Our team is a cross-functional group of Machine Learning and Backend engineers, looking to add a Data Engineer into the mix. You will partner with teammates to design and deploy scalable, personalized recommendation systems, including building data solutions that support the production and evaluation of large-scale ML models.
What You'll Do
- Build large-scale batch and streaming data pipelines, and assist the rest of the team in supporting backend services used to serve real-time model predictions.
- Integrate our data systems with their backend counterparts.
- Develop expertise in leveraging the tools, frameworks and languages that make up our stack: Scio, Google Cloud Platform, Scala, BigTable, gRPC, Java, TensorFlow, and more.
- Drive optimization, testing and tooling to improve data quality and efficiency of our systems, especially data pipelines.
- Collaborate with ML engineers and business partners.
- Learn from highly experienced data engineers about data engineering best practices.
- Work on a multi-functional agile team to continuously experiment, iterate and deliver on product objectives.
- You are comfortable with the JVM and object-oriented programming–experience with Scala, Java, and Python is ideal.
- You are familiar with the concepts of data modeling, data access, and data storage techniques.
- You are familiar with distributed data processing frameworks (ex: Beam, Spark).
- You want to work on a team employing agile software development processes, data-driven development, and responsible experimentation.
- You value opportunities to work collaboratively.
- This role is based in NYC or Boston.
- We offer you the flexibility to work where you work best! There will be some in person meetings, but still allows for flexibility to work from home.
Today, we are the world’s most popular audio streaming subscription service.
Salary : $100,705 - $143,865