AI Research Engineer (Pre-training - LLM & Multi-Modal)
Tether Operations Limited
Job Description
About the job
As a member of the AI model team, you will drive innovation in architecture development for cuttingāedge models of various scales, including small, large, and multiāmodal systems. Your work will enhance intelligence, improve efficiency, and introduce new capabilities to advance the field.
You will have a deep expertise in Large Language Model (LLM) and MultiāModal architectures, a strong grasp of preātraining optimization, and a handsāon, researchādriven approach. Your mission is to explore and implement novel techniques and algorithms that lead to groundbreaking advancements: multiāmodal data curation and alignment, strengthening baselines, and identifying and resolving existing preātraining bottlenecks to push the limits of crossāmodal AI performance.
Responsibilities
- LargeāScale PreāTraining: Conduct foundational preātraining for LLMs and MultiāModal models on large, distributed servers equipped with thousands of ...