What you’ll do & how you’ll make your mark .
Design & scale async REST/WebSocket APIs with Python 3.11+ + FastAPI, using dependency-injection, type hints, and clean vertical-slice architecture.Implement multi-agent workflows with Semantic Kernel (handoff, sequential, concurrent) to route traffic among specialised LLM agents.Integrate LLM providers (OpenAI GPT-4.1/mini, Google Gemini 2.5 Flash) behind a provider-agnostic layer for A/B and cost-aware routing.Deliver Retrieval-Augmented Generation with vector stores such as Azure AI Search, pgvector, or Chroma.Expose tool-using agents via OpenAI Assistants (Code-Interpreter) for data-analysis / file-manipulation tasks.Evolve schemas with SQLModel / SQLAlchemy 2 & Alembic; tune Postgres for high concurrency async access.Maintain robust CI/CD (Bitbucket Jenkins) that lint, type-check, test, package (Docker), and deploy.