Job Description
We're building something that doesn't exist yet in
Latin America: a domain-specific large language model
trained on a large Spanish-language corpus, deployed
on proprietary on-premise hardware, solving a real
problem for clients who are already waiting.
We can't tell you exactly what it is yet.
What we can tell you:
β The corpus is real and large
β The clients are real and paying
β The hardware is ready
β The team is small and the decisions matter
β The person who joins now shapes the architecture
This is a fully on-site role in MΓ©rida, YucatΓ‘n.
We want someone in the room β not because we don't
trust remote work, but because the knowledge needs
to live in the team, not in one person's laptop.
ββββββββββββββββββββββββββββββ
WHAT YOU'LL BUILD
ββββββββββββββββββββββββββββββ
β Large-scale Spanish text dataset pipeline:
Β cleaning, deduplication, tokenization
β Continual Pre-Training (CPT) on open-source
Β base models (Llama/Qwen family) on dedicated
Β GPU workstation β in our office
β Supervised fine-tuning: SFT with LoRA/QLoRA,
Β HuggingFace + TRL
β RLHF/DPO pipeline with domain expert annotators
β Model quantization for on-premise deployment:
Β GGUF, MLX, llama.cpp
β RAG system on PostgreSQL + pgvector
β Evaluation suite + hallucination monitoring
ββββββββββββββββββββββββββββββ
β YOU NEED THIS
ββββββββββββββββββββββββββββββ
β Python β advanced and demonstrable
β Real ML framework experience: PyTorch,
Β TensorFlow, or Scikit-learn with actual
Β projects, not just certifications
β Mathematical foundation: linear algebra,
Β stats, calculus β things you actually use
β Linux CLI β the training workstation runs
Β Linux, full stop
β English β reading ML papers and HuggingFace
Β docs is part of the daily job
β Spanish native or C2 β the corpus is in
Β Spanish
β Based in MΓ©rida, YucatΓ‘n β fully on-site, no exceptions
ββββββββββββββββββββββββββββββ
β BONUS POINTS
ββββββββββββββββββββββββββββββ
β HuggingFace Transformers / TRL / PEFT
β LLM fine-tuning: SFT, LoRA, QLoRA, DPO
β RAG pipelines and vector databases (pgvector)
β Ollama, llama.cpp, MLX (Apple Silicon)
β Data engineering: ETL, scraping, text pipelines
β Docker + CI/CD basics
ββββββββββββββββββββββββββββββ
WHAT YOU GET
ββββββββββββββββββββββββββββββ
Competitive salary β first call, directly,
Β Β no invented ranges
On-site in MΓ©rida β real team, real
Β Β collaboration, real knowledge transfer
οΈ Dedicated GPU hardware β not your laptop