Location
islamabad, islamabad capital territory, Pakistan
Posted
June 30, 2026
Job Description
The Role
You’ll own the LLM-powered parts of the platform end-to-end, including:
- Chat and voice intake systems
- Agent orchestration
- Risk and similarity pipelines
- RAG and structured outputs
- Voice integrations
- Evaluation infrastructure
- AI observability, latency, and cost optimization
This is a production engineering role — not just prompt engineering. Reliability, safety, latency, and system design matter.
What You’ll Build
- Multi-step intake workflows across chat and voice
- Low-latency voice agents with function calling and barge-in support
- Background risk scoring and similarity retrieval using pgvector
- Evaluation systems with regression testing and CI gates
- Prompt caching, embedding caching, and model routing for cost and performance optimization
- Structured agent outputs with human takeover flows when required <...