Researcher, Misalignment Research
OpenAI
🔍 Observed
89
Hiring Activity Score
Very Active (80-100)
- Base score
- Just posted (1d, flat to 14d)
- has location, quality description (6694 chars)
- 417 new listings in 30d (×0.98 age 1d)
- 3 skills
- Tier 1 reputable (OpenAI) ×0.98 age 1d
- High confidence (90%)
- Direct ATS (ashby)
San Francisco
First seen 1 day, 17 hours ago
Last seen 4 hours, 31 minutes ago
Ashby
Apply on Ashby
Search Google for This Role
ATS links often expire — Google search finds the latest posting
Job Description
AI Summary
• Design and execute adversarial attacks and worst-case demonstrations to identify AGI misalignment risks, including deceptive behavior, scheming, and reward hacking before they become harmful
• Develop rigorous, repeatable safety evaluations and automated infrastructure to stress-test AI systems and probe product stacks for breaking points under extreme conditions
• Conduct research on why alignment mitigations fail, publish findings to shape safety strategy, and partner across engineering, research, and policy teams to integrate discoveries into product safeguards
• Lead a team focused on four core pillars: worst-case demos, adversarial evaluations, system-level stress testing, and alignment stress-testing research with a mandate to reduce existential AI risk
• Position offers direct influence on OpenAI's product launches and long-term safety roadmap, with opportunities to mentor researchers and collaborate with external labs to advance collective progress on AI alignment
Skills
rust
aws
go
Quick Actions
Job Information
-
Company:
OpenAI -
Location:
San Francisco -
Job Type:
Full-Time -
Experience Level:
Senior -
Source:
Ashby -
Status:
Active
Activity Score
89
/100
Very Active (89)
Higher scores indicate more likely active hiring based on listing freshness, company activity, and other signals. Learn more →
More from OpenAI
-
Senior Staff Backend Software Engineer, Codex for Finance
San Francisco -
Staff / Senior Staff Product Engineer, Full Stack - Codex for Finance
San Francisco -
Manager, AI Deployment Engineering (Korea)
Seoul, South Korea -
Solutions Architect, Digital Natives
San Francisco -
AI Systems Engineer, Codex Agents
San Francisco