What are the responsibilities and job description for the Senior Staff Software Engineer, AI Data Infra Lead position at Stack AV?
About the Role:
Data is one of the main drivers in the success of any ML company and the autonomous vehicle industry is no exception to this. The AI Data Infra team, part of the ML Platform org of Stack AV, is responsible for storage, access, and curation of the data across the company and across all the ML use cases.
We are seeking an experienced, visionary, and hands-on technical lead for our data infra team. You will be responsible for designing the architecture and leading a team to build a robust data platform that powers all the next-gen AI Autonomous Vehicle applications in the company. The ideal candidate will have a deep understanding of modern data technologies, excellent leadership skills, and the ability to drive technical excellence.
Responsibilities:
- Architectural Design: Design and oversee the architecture of the data platform, ensuring scalability, reliability, and performance.
- Tech Leadership: Lead and mentor a team of data engineers in building and maintaining the data platform.
- Data Management: Ensure efficient storage, retrieval, data quality, consistency, and governance across the platform. Develop and implement strategies for data lifecycle management, including archiving and purging.
- Collaboration: Collaborate with cross-functional teams to understand data requirements and design appropriate solutions.
- Technology Stack: Stay updated with the latest technologies and trends in data engineering, making recommendations for new tools and best practices.
- Performance Optimization: Identify and resolve performance bottlenecks in data processing and storage.
- Promote Engineering Excellence: Set a culture of engineering excellence within the team and work closely with the management and customer teams to balance between speed of delivery and quality of engineering artifacts.
Qualifications:
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
- 8 years of experience in data engineering or a related field, with at least 3 years in a technical leadership role.
- Proficient in data architecture and design.
- Extensive experience with data lake and data warehouse technologies (e.g., Hadoop, Spark, AWS S3, Redshift, Iceberg, Hive, Trino, etc).
- Experience with real-time data processing and streaming technologies (e.g., Kafka, Flink).
- Strong programming skills in languages such as Python, Java, or Scala.
- Experience with ETL tools and processes.
- Knowledge of machine learning and AI technologies and their data requirements.
- Experience with agile development methodologies.
- Contributions to open-source projects or technical publications.
- Proven ability to lead and mentor a team, manage projects, and drive technical initiatives.
- Strong analytical and problem-solving skills.
- Excellent verbal and written communication skills, with the ability to convey complex technical concepts to non-technical stakeholders. #LI-TT1