Position: Generalist Evaluator ExpertType: Hourly contractCompensation: $35-$40 per hourLocation: RemoteCommitment: At least 20 hours per week Role Responsibilities<ul><li>Author prompt–golden answer pairs to train and evaluate advanced language models</li><li>Create detailed prompts with multiple constraints and instructions</li><li>Establish expectations for correct responses in general consumer contexts and develop comprehensive rubrics</li><li>Run prompts through models and assess outputs against defined expectations</li><li>Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor and maintain consistency before integration into official benchmarks</li></ul> Requirements<ul><li>BS or BA from a reputable institution, completed or in progress</li><li>Strong writing and critical thinking skills</li><li>Ability to work independently and meet deadlines</li><li>Familiarity with ChatGPT or similar tools for personal decision-making or general interests</li><li>Experience in teaching or research preferred</li></ul> Application Process (Takes 20 Mins)<ul><li>Upload resume</li><li>Interview (15 min)</li><li>Submit form</li></ul>

Remotehey

Work anywhere, Live anywhere

Researcher | $40 Remote