What are the responsibilities and job description for the Lead Research Engineer, Speech Foundation Models (AI Labs) position at Krutrim?
Location : Palo Alto (CA, US)
Type of Job : Full-time
About Krutrim :
Krutrim is building AI computing for the future. Our envisioned AI computing stack encompasses the AI computing infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered end applications. We are India’s first AI unicorn and built the first foundation model from the country.
Our AI stack is empowering consumers, startups, enterprises and scientists across India and the world to build their end AI applications or AI models. While we are building foundational models across text, voice, and vision relevant to our focus markets, we are also developing AI training and inference platforms that enable AI research and development across industry domains. The platforms being built by Krutrim have the potential to impact millions of lives in India, across income and education strata, and across languages.
The team at Krutrim represents a convergence of talent across AI research, applied AI, cloud engineering, and semiconductor design. Our teams operate from three locations : Bangalore, Singapore, and San Francisco.
Job Description :
We are seeking a highly skilled and experienced Senior Research Lead for Speech, Audio, and Conversational AI to join our innovative team. In this role, you will spearhead the research and development of cutting-edge technologies in speech processing, text-to-speech (TTS), audio analysis, and real-time conversational AI. You will push the boundaries of what's possible in automatic speech recognition (ASR), speaker identification, diarization, speech synthesis, voice cloning, dubbing, and audio generation. Working closely with a team of talented engineers and researchers, you'll design, implement, and optimize state-of-the-art systems that contribute to creating more natural, human-like, and high-quality speech and audio solutions for a variety of applications.
Key Responsibilities :
- Apply state-of-the-art techniques in audio, speech, and large language models to develop advanced Audio Language Models and Speech Language Models.
- Research, architect, and deploy new generative AI methods such as autoregressive models, causal models, and diffusion models.
- Design and implement low-latency end-to-end models with multilingual speech / audio as both input and output.
- Conduct experiments to evaluate and improve the performance of these models, focusing on accuracy, naturalness, efficiency, and real-time capabilities across multiple languages.
- Stay at the forefront of advancements in speech processing, audio analysis, and large language models, integrating new techniques into our foundation models.
- Collaborate with cross-functional teams to integrate these foundation models into Krutrim's AI stack and products.
- Publish research findings in top-tier conferences and journals such as INTERSPEECH, ICASSP, ICLR, ICML, NeurIPS, and IEEE / ACM Transactions on Audio, Speech, and Language Processing.
- Mentor and guide junior researchers and engineers, fostering a collaborative and innovative team environment.
- Drive the adoption of best practices in model development, including rigorous testing, documentation, and ethical considerations in multilingual AI.
Qualifications :
Join Krutrim to shape the future of AI and make a significant impact on hundreds of millions of lives across India and the world. If you're passionate about pushing the boundaries of AI and want to work with a team at the forefront of innovation, we want to hear from you!