Location
Bengaluru, Karnataka, India
Posted
June 07, 2026
Job Description
Scribie is an AI-powered, Human Verified audio and video transcription service, trusted globally since 2008. We specialize in delivering accurate and reliable transcription solutions by blending advanced AI technology with human expertise. Headquartered in the US, we operate with a hybrid model in our Bangalore office, combining the flexibility of remote work with the collaboration of in-person engagement. This approach offers our team both autonomy and growth opportunities in a dynamic and supportive environment.
We're building production-grade audio foundation models for high-stakes legal and enterprise transcription: real customer data, messy audio, real consequences.
This is not a paper-only research role.
You'll own the full ML lifecycle:
Fine-tuning large audio / multimodal models using SFT, LoRA, and RL-based preference optimization (DPO / PPO / ORPO)
Beating strong baselines such as Whisper-large, GPT-4o, Gemini, and Claude on domain-specific data
Designing WER, di...