Position: LLM – AI Quality Analyst (Personalization) – English
Type: Short-Term Contract
Location: Remote
Commitment: 30–40 hours/week
Engagement Length: 2 months
Start Date: Immediate
Role Responsibilities
- Design multi-turn conversational prompts based on personal context
- Evaluate personalized AI responses for relevance, grounding, and helpfulness
- Assess correct and incorrect use of personal data in model outputs
- Perform side-by-side (SxS) evaluation and ranking of AI responses
- Identify grounding errors, poor inferences, and forced personalization
- Write clear, structured rationales referencing specific conversation turns
- Extract and verify model debug information and data source usage
- Maintain strict data hygiene by deleting evaluation conversations
Requirements
- English fluency (reading and writing)
- Strong experience in data annotation, AI quality evaluation, content moderation, or related roles
- Strong analytical thinking and attention to detail
- Ability to evaluate nuanced and ambiguous AI responses
- Comfortable using a primary personal Google account with enabled data sources
- BS/BA degree or equivalent experience in a relevant analytical field
- Strong written communication and structured feedback skills
- Self-motivated and able to work independently in a remote setting
- Reliable desktop/laptop with stable internet connection
Application Process
- Fill out the application form
- Complete the ICF
- Assessment