Evals Engineer, Applied AI
Scale AI
🔍 Observed
66
Hiring Activity Score
Likely Active (65-79)
- Base score
- Posted 21 days ago
- has location, quality description (7894 chars)
- 3 skills
- High confidence (90%)
- Direct ATS (greenhouse)
San Francisco, CA; New York, NY
First seen 3 weeks, 1 day ago
Last seen 10 hours, 55 minutes ago
Greenhouse
Apply on Greenhouse
Search Google for This Role
ATS links often expire — Google search finds the latest posting
Job Description
AI Summary
• Develop and deploy LLM-as-a-Judge frameworks and AI-assisted evaluation systems to assess enterprise agent outputs and improve model safety and reliability.
• Work with Scale's operations team and customers to create gold-standard datasets, expert rubrics, and structured evaluation frameworks that drive continuous improvement of GenAI systems.
• Conduct research on novel evaluation methodologies and automated analysis techniques for enterprise AI agents, staying current with frontier evaluation literature.
• Requires 2+ years of ML/applied research experience, strong LLM knowledge, Python proficiency, and solid understanding of model evaluation methodologies.
• Preferred qualifications include advanced degree in ML/CS, published research at top conferences (NeurIPS, ICML, ICLR), and prior experience with LLM evaluation systems.
Skills
aws
python
go
Quick Actions
Job Information
-
Company:
Scale AI -
Location:
San Francisco, CA; New York, NY -
Job Type:
Internship -
Experience Level:
Mid -
Source:
Greenhouse -
Status:
Active
Activity Score
66
/100
Likely Active (66)
Higher scores indicate more likely active hiring based on listing freshness, company activity, and other signals. Learn more →
More from Scale AI
-
Senior Machine Learning Engineer, Public Sector
San Francisco, CA; New York, NY; Washington, DC -
Staff Software Engineer, Full-Stack - Enterprise Gen AI
New York, NY; San Francisco, CA -
Software Engineer - New Grad
San Francisco, CA -
Senior AI Infrastructure Engineer, Model Serving Platform
San Francisco, CA; New York, NY -
Mission Software Engineering Manager, Public Sector
San Francisco, CA; St. Louis, MO; New York, NY; Washington, DC