Huaweicanada
Intern Engineer – RL Post-Training for LLMs
Company
Role
Intern Engineer – RL Post-Training for LLMs
Job type
Internship
Found on Mokaru
🔥Recently
Salary
Job description
Huawei Canada has an immediate 6-12 months internship opening for an Intern Researcher.
About the team
The Computing Data Application Acceleration Lab aims to create a leading global data analytics platform organized into three specialized teams using innovative programming technologies. This team focuses on full-stack innovations, including software-hardware co-design and optimizing data efficiency at both the storage and runtime layers. This team also develops next-generation GPU architecture for gaming, cloud rendering, VR/AR, and Metaverse applications. One of the goals of this lab are to enhance algorithm performance and training efficiency across industries, fostering long-term competitiveness.
About the job
•
Develop and optimize RL post-training pipelines for LLMs (e.g., GRPO, reward modeling).
•
Conduct experiments to improve model performance, reasoning, and alignment.
•
Build scalable training, evaluation, and data generation systems.
•
Collaborate with researchers and engineers on cutting-edge LLM projects
•
Stay current with advancements in RL, LLMs, and post-training research.
The total target annual compensation (based on 2,080 hours per year) ranges from $58,000 to $104,000 depending on education, experience, and demonstrated expertise.


