Remote Software Engineer – Go
Full Job Description
About the Projects
We are building large-scale evaluation and training datasets for LLMs focused on realistic software engineering problems. Our approach involves creating verifiable Software Engineering (SWE) tasks based on public GitHub repository histories using a synthetic, human-in-the-loop methodology. We aim to expand dataset coverage across diverse programming languages and difficulty levels.
About the Role
We seek experienced Tech Lead-level software engineers with deep familiarity in high-quality open-source repositories. This role focuses on hands-on engineering: automating development environments, triaging issues, evaluating test coverage/quality, and assessing LLM performance in bug-fixing scenarios.
Key Responsibilities:
- Analyze and triage GitHub issues across trending libraries.
- Configure code repositories (Dockerization, environment setup).
- Evaluate unit tests for quality and coverage.
- Modify/run local codebases to assess LLM capabilities.
- Collaborate with researchers to identify challenging repos/issues for AI models.
Opportunity: Lead a team of junior engineers on cutting-edge projects blending practical SWE with AI research.
Required Skills
- Minimum 3+ years of overall experience.
- Strong proficiency in at least one language (e.g., Ruby).
- Mastery of Git, Docker, and pipeline setup basics.
- Ability to navigate complex codebases and test projects locally.
Nice-to-Haves
- Experience with LLM research/evaluation or developer tool automation agents.
Work Model:
- Fully remote environment.
- Commitment: 20–40 hours/week (minimum 20 hrs) with 4-hour PST overlap required.
- Type: Contractor assignment (no medical/paid leave).
Evaluation Process
A streamlined ~75-minute assessment consisting of two rounds: a technical deep dive and a cultural discussion.
Company
Turing
Turing is a global leader in AI talent, connecting top engineering and STEM experts with leading organizations to accelerate model training, evaluation, and real-world application deployment.About Our...