What are the responsibilities and job description for the Senior Staff Research Engineer, On-Device Language Intelligence position at Samsung Research America?
Lab Summary: Samsung AI Research Center (AIC) located in Mountain View, California, is currently recruiting outstanding scientists for the Language Intelligence lab. Our goal is to perform research and development with direct impact on future Samsung products reaching hundreds of millions of users worldwide. We are focused on pushing the state-of-the-art and practice in on-device natural language intelligence.
Position Summary: We are looking for highly skilled and motivated Researchers/Engineers to join our team and contribute to the development of our future natural language intelligence.
Position Responsibilities:
- Conduct cutting-edge research and development of large foundation models (LLM, VLM, and Reasoning) for future, including model design, efficient model training, instruction tuning, prompt engineering, planning, action and related topics
- Collaborate with a multidisciplinary team of researchers, engineers, and domain experts to understand requirements, develop prototypes, and deliver robust solutions
- Conduct thorough evaluations and analysis of model performance, identify areas for improvement, and propose innovative solutions to enhance the overall quality and capabilities of large language models
- Generate creative solutions (patents), publish research results in top conferences (papers)
Required Skills:
- PhD in C.S., EE or related fields or equivalent combination of education, training, and experience
- 10 years of research experience in the fields of AI/NLP/ML
- Experience conducting research and shipping user facing products
- Experience in large language model (LLM), including Transformer model architecture, attention mechanisms, decoder only LLMs, SSM architecture. Foundational LLM training experience is a plus, including data curation, distributed training, and hyperparameter tuning
- Experience in making LLM-based solution deployable on-device with small latency and memory (e.g., knowledge distillation) and on-device acceleration. NPU optimization is a plus
- Experience in LLM alignment, instruction tuning, LoRA, Adapter, etc.
- Experience in Expertise in multi-step reasoning, planning, reinforcement learning (including RLHF), etc.
- Proficiency in deep learning frameworks such as TensorFlow, PyTorch, or similar, and strong hands-on experience with large language models (e.g., GPT, LLaMA, etc.)
- Strong analytical and problem-solving skills, with a keen attention to detail and a passion for pushing the boundaries of AI capabilities
- Excellent written and verbal communication skills, with the ability to present complex concepts and research findings in a clear and concise manner
- Demonstrated ability to work independently as well as collaboratively in a fast-paced research and development environment
- A strong product/commercialization deliverable experience is required