Software Engineer (Systems Design) - Remote
Full Job Description
About the Job
Title: Software Engineer (Remote)
Engagement: Hourly contract (independent contractor)
Rate: USD 45-80/hour
Location: Remote – candidates based in the United States, United Kingdom, Canada, Europe, Singapore, Dubai, or Australia who are legally able to work as independent contractors in their jurisdiction.
Role Overview: One of our clients, a leading AI research organization, is developing advanced conversational AI systems that assist users with real-world software engineering and coding tasks. This role focuses on evaluating and improving how large language models (LLMs) reason about code, generate solutions, and explain technical concepts across a wide range of programming and system design scenarios.
Key Responsibilities:
- Evaluate AI-generated responses to software engineering and coding queries for correctness, clarity, and completeness
- Execute and test code to validate functionality, performance, and edge-case handling
- Perform fact-checking using authoritative technical references and public sources
- Annotate model outputs by identifying strengths, weaknesses, bugs, and conceptual gaps
- Assess code quality, readability, algorithmic soundness, and explanation quality
- Ensure outputs align with established conversational and technical guidelines
- Apply standardized evaluation rubrics and benchmarks consistently
Required Qualifications:
- Bachelor’s, Master’s, or PhD in Computer Science or a closely related field
- Significant professional experience in software engineering or system design
- Expert-level proficiency in at least one major programming language (e.g., Python, Java, C++, JavaScript, Go, Rust)
- Ability to independently solve medium-to-hard algorithmic problems
- Experience contributing to open-source projects with accepted pull requests
- Strong familiarity with using LLMs for coding and understanding their limitations
- Exceptional attention to detail and ability to detect subtle technical errors
Preferred Qualifications:
- Prior experience with RLHF, model evaluation, or technical data annotation
- Background in competitive programming or algorithmic problem solving
- Experience reviewing or maintaining production-level code
- Familiarity with multiple programming paradigms and technology stacks
- Ability to explain complex technical topics to non-technical audiences
What Success Looks Like:
- You consistently identify logical errors, inefficiencies, and misleading explanations in AI-generated code
- Your feedback measurably improves the accuracy, reliability, and clarity of model outputs
- You deliver high-quality, reproducible evaluation artifacts that strengthen AI system performance
Contract & Payment Terms:
- Independent contractor engagement
- Fully remote with flexible scheduling
- Weekly payments via Stripe or Wise
- Project scope and duration may vary based on performance and client needs
- No access to confidential or proprietary employer data is required
- H1-B and STEM OPT sponsorship is not available
Application Process:
- Submit your resume for review
- Selected candidates will complete a short technical and evaluation assessment
Company
Keystone Recruitment
Keystone Recruitment helps build great companies by identifying pivotal talent. We specialize in long-term placements and cultural alignment, ensuring candidates become essential to your company's suc...