Copy of Senior Python Developer (AI Evaluation & Benchmarking)

Lifted, an Upwork Company

Full-time Computer Occupations
Apply Now
Location
Buenos Aires, Buenos Aires, Argentina
Posted
June 30, 2026

Job Description

Job Description

This opportunity is ideal for senior software engineers with strong Python expertise who enjoy writing high-quality code, reviewing technical solutions, and working on AI-related projects.

What You'll Do:

  • Design and develop coding benchmarks used to evaluate frontier AI models.
  • Analyze AI-generated code for correctness, reliability, efficiency, and edge cases.Build and maintain scalable data pipelines that support AI evaluation workflows.
  • Create structured programming scenarios to test reasoning, debugging, and code quality.
  • Work with large codebases and multi-language software environments.
  • Collaborate with teams focused on improving how AI models understand, generate, and evaluate software.
  • Write clean, maintainable, and well-tested Python code following software engineering best practices.

Qualifications

Requirements:

  • 4+ years of professional software engineering ...