Researcher, Alignment Oversight
OpenAI
🔍 Observed
83
Hiring Activity Score
Very Active (80-100)
- Base score
- Just posted (1d, flat to 14d)
- has location, quality description (6309 chars)
- 417 new listings in 30d (×0.98 age 1d)
- 3 skills
- Tier 1 reputable (OpenAI) ×0.98 age 1d
- Low confidence (30%)
- Direct ATS (ashby)
San Francisco
First seen 1 day, 17 hours ago
Last seen 5 hours, 12 minutes ago
Ashby
Apply on Ashby
Search Google for This Role
ATS links often expire — Google search finds the latest posting
Job Description
AI Summary
• Design and run experiments to improve oversight of increasingly capable AI models, combining longer-horizon research with practical 3-6 month deployment sprints.
• Develop and deploy monitoring systems for action tracking, red-teaming, and human-in-the-loop control while analyzing real deployment data to identify model failures and alignment gaps.
• Build evaluations for alignment failure modes (overeagerness, instruction following failures, covert actions, scheming) and create techniques to feed oversight signals back into model training.
• Requires strong hands-on experience with large language model training, evaluation, debugging, and empirical ML research (reinforcement learning, preference optimization, or scalable oversight preferred).
• Based in San Francisco with hybrid work (3 days/week in office), relocation assistance available, and opportunity to publish externally while collaborating across research, product, security, and engineering teams.
Skills
rust
aws
go
Quick Actions
Job Information
-
Company:
OpenAI -
Location:
San Francisco -
Job Type:
Full-Time -
Experience Level:
Mid -
Source:
Ashby -
Status:
Active
Activity Score
83
/100
Very Active (83)
Higher scores indicate more likely active hiring based on listing freshness, company activity, and other signals. Learn more →
More from OpenAI
-
Senior Staff Backend Software Engineer, Codex for Finance
San Francisco -
Staff / Senior Staff Product Engineer, Full Stack - Codex for Finance
San Francisco -
Manager, AI Deployment Engineering (Korea)
Seoul, South Korea -
Solutions Architect, Digital Natives
San Francisco -
AI Systems Engineer, Codex Agents
San Francisco