None

Promtec Innovation

Full-time Other-General
Apply Now
Location
mΓ©rida, yucatΓ‘n, Mexico
Posted
June 27, 2026

Job Description

We're building something that doesn't exist yet in

Latin America: a domain-specific large language model

trained on a large Spanish-language corpus, deployed

on proprietary on-premise hardware, solving a real

problem for clients who are already waiting.


We can't tell you exactly what it is yet.

What we can tell you:


β†’ The corpus is real and large

β†’ The clients are real and paying

β†’ The hardware is ready

β†’ The team is small and the decisions matter

β†’ The person who joins now shapes the architecture


This is a fully on-site role in MΓ©rida, YucatΓ‘n.

We want someone in the room β€” not because we don't

trust remote work, but because the knowledge needs

to live in the team, not in one person's laptop.


──────────────────────────────

WHAT YOU'LL BUILD

──────────────────────────────

β†’ Large-scale Spanish text dataset pipeline:

Β cleaning, deduplication, tokenization

β†’ Continual Pre-Training (CPT) on open-source

Β base models (Llama/Qwen family) on dedicated

Β GPU workstation β€” in our office

β†’ Supervised fine-tuning: SFT with LoRA/QLoRA,

Β HuggingFace + TRL

β†’ RLHF/DPO pipeline with domain expert annotators

β†’ Model quantization for on-premise deployment:

Β GGUF, MLX, llama.cpp

β†’ RAG system on PostgreSQL + pgvector

β†’ Evaluation suite + hallucination monitoring


──────────────────────────────

βœ… YOU NEED THIS

──────────────────────────────

β†’ Python β€” advanced and demonstrable

β†’ Real ML framework experience: PyTorch,

Β TensorFlow, or Scikit-learn with actual

Β projects, not just certifications

β†’ Mathematical foundation: linear algebra,

Β stats, calculus β€” things you actually use

β†’ Linux CLI β€” the training workstation runs

Β Linux, full stop

β†’ English β€” reading ML papers and HuggingFace

Β docs is part of the daily job

β†’ Spanish native or C2 β€” the corpus is in

Β Spanish

β†’ Based in MΓ©rida, YucatΓ‘n β€” fully on-site, no exceptions


──────────────────────────────

⭐ BONUS POINTS

──────────────────────────────

β†’ HuggingFace Transformers / TRL / PEFT

β†’ LLM fine-tuning: SFT, LoRA, QLoRA, DPO

β†’ RAG pipelines and vector databases (pgvector)

β†’ Ollama, llama.cpp, MLX (Apple Silicon)

β†’ Data engineering: ETL, scraping, text pipelines

β†’ Docker + CI/CD basics


──────────────────────────────

WHAT YOU GET

──────────────────────────────

Competitive salary β€” first call, directly,

Β Β no invented ranges

On-site in MΓ©rida β€” real team, real

Β Β collaboration, real knowledge transfer

️ Dedicated GPU hardware β€” not your laptop