Work Snapshot
Type: W2
Location: Remote
Commitment: 40 hours per week
Commission: $70 $100 per hour
What You will Be Doing
- Design complex technical tasks across machine learning, data science, data engineering, and software workflows
- Evaluate model outputs and provide detailed feedback on correctness, efficiency, and reasoning quality
- Develop evaluation frameworks and rubrics for assessing agentic system behavior
- Create accurate, well-documented solutions that serve as high-quality ground truth data
- Collaborate with cross-functional subject matter experts to ensure consistency and technical accuracy
What We are Looking For
- Strong experience in machine learning, data science, software engineering, or related STEM disciplines
- Strong experience in programming, data analysis, statistical methods, or computational workflows
- Ability to commit to full-time weekday availability throughout the engagement
- Experience with data annotation, evaluation, or human feedback workflows is a plus
- Familiarity with LLMs, agentic systems, or evaluation frameworks
- Strong written communication and technical documentation skills
How To Apply
- Upload resume
- Interview
- Submit form