Market-Observed Role
This role was detected through OpenAI's hiring system and has not been verified directly by the employer. Our algorithm scored it 92, in the Very Active band (80-100), based on freshness, specificity, and company hiring patterns.

Technical Program Manager, Frontier Evals

OpenAI
Hiring Activity Score: 92 / 100, Very Active (80-100)
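  • Base score
  • Freshness: just posted (1 day old; score stays flat for the first 14 days)
  • Specificity: has location, quality description (6,700 characters), named stack (5 tools)
  • Company activity: 417 new listings in 30 days (×0.98 age factor at 1 day)
  • Skills: 5 listed
  • Reputation: Tier 1 reputable company (OpenAI), ×0.98 age factor at 1 day
  • Confidence: high (90%)
  • Source: direct ATS (Ashby)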
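
The breakdown above can be read as a sum of weighted signals, with some terms damped by an age factor. The Python sketch below is illustrative only. The page does not publish the formula, so every weight, cap, and decay rate is an assumption: the base of 30, the 2%-per-day age factor inferred from the single "×0.98 age 1d" data point, and the 30-day decay window after the flat period. The `Listing` fields and function names are hypothetical as well. The confidence figure is left out because the listing does not say how it enters the score, and the sketch is not expected to reproduce the 92 shown here.

```python
from dataclasses import dataclass


@dataclass
class Listing:
    # Hypothetical fields mirroring the signals named in the breakdown.
    age_days: float
    has_location: bool
    description_chars: int
    named_tools: int
    company_new_listings_30d: int
    skills: int
    tier1_company: bool
    direct_ats: bool


def freshness_points(age_days: float, flat_days: float = 14.0,
                     max_points: float = 20.0, decay_days: float = 30.0) -> float:
    """Full points up to flat_days ("flat to 14d"), then linear decay to zero."""
    if age_days <= flat_days:
        return max_points
    remaining = max(0.0, 1.0 - (age_days - flat_days) / decay_days)
    return max_points * remaining


def age_factor(age_days: float, per_day: float = 0.02) -> float:
    """Assumed linear damping: 0.98 at 1 day, never below zero."""
    return max(0.0, 1.0 - per_day * age_days)


def hiring_activity_score(listing: Listing) -> int:
    damp = age_factor(listing.age_days)
    score = 30.0  # base score (assumed value)
    score += freshness_points(listing.age_days)
    # Specificity signals (assumed weights).
    score += 5.0 if listing.has_location else 0.0
    score += 5.0 if listing.description_chars >= 1000 else 0.0
    score += float(min(listing.named_tools, 5))
    # Company activity, saturating at 100 new listings, damped by age.
    score += min(listing.company_new_listings_30d / 100.0, 1.0) * 10.0 * damp
    score += float(min(listing.skills, 5))
    # Reputation tier, damped by age.
    score += 10.0 * damp if listing.tier1_company else 0.0
    score += 5.0 if listing.direct_ats else 0.0
    # Clamp to the 0-100 range used on the page.
    return round(max(0.0, min(score, 100.0)))


example = Listing(age_days=1, has_location=True, description_chars=6700,
                  named_tools=5, company_new_listings_30d=417, skills=5,
                  tier1_company=True, direct_ats=True)
print(hiring_activity_score(example))
```

Keeping each signal as its own additive term mirrors the itemised breakdown, so any single weight can be changed without touching the others.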
San Francisco | First seen 1 day, 17 hours ago | Last seen 5 hours, 13 minutes ago | Source: Ashby

Job Description

AI Summary
  • Drive frontier model evaluation projects end-to-end, from research questions through design, execution, and analysis of capabilities and limitations.
  • Manage human data campaigns and evaluation workflows while coordinating across researchers, engineers, data teams, vendors, and domain experts to deliver benchmarks.
  • Perform hands-on technical work including prompt iteration, data analysis, scripting, and pipeline debugging; requires Python/SQL proficiency and LLM knowledge.
  • Hybrid IC and PM role requiring comfort with ambiguous research problems, rapid learning on unfamiliar topics, and operating without perfect infrastructure.
  • San Francisco-based position (3 days/week in office) with relocation assistance; ideal candidates have technical program management, research ops, or evaluation experience.

Skills

Rust, Go, SQL, Python, AWS
Job Information
  • Company:
    OpenAI
  • Location:
    San Francisco
  • Job Type:
    Full-Time
  • Work Location:
    Hybrid (San Francisco, 3 days/week in office)
  • Experience Level:
    Mid
  • Source:
    Ashby
  • Status:
    Active
Activity Score: 92 / 100 (Very Active)

Higher scores indicate a greater likelihood of active hiring, based on listing freshness, company activity, and other signals.


We now show two types of job listings

Same commitment to real jobs. More opportunities for you. Here's how it works.

✓ Employer-Verified Posts

These jobs were posted directly to RJRP by the employer. The company has been verified through our multi-step process. This is our gold standard — the employer is real, the job is real, and you can apply with confidence.

✓ 100% employer verified
🔍 Market-Observed Roles

These roles were detected through employer hiring systems like Workday. They haven't been verified by the employer directly, so we score each one using our Hiring Activity Score — an algorithm that analyzes freshness, specificity, company hiring patterns, and more to estimate whether the role is actively being filled.

📊 Only high-scoring listings are shown

Our promise hasn't changed. We will never show you a listing we can't stand behind. Market-observed roles must pass our scoring threshold before they appear on RJRP. Anything that looks like a ghost job, a talent pipeline, or a dead listing gets filtered out — you'll never see it.
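
The threshold rule described above amounts to a simple filter: employer-verified posts always appear, while market-observed roles appear only if their score clears a cutoff. The Python sketch below shows one way this could look. The cutoff of 80 is an assumption (the page does not state the actual threshold), and the `Role` type, the `visible_roles` function, and the sample data are hypothetical.

```python
from typing import TypedDict


class Role(TypedDict):
    title: str
    verified: bool  # posted directly by a verified employer
    score: int      # Hiring Activity Score, 0-100


def visible_roles(roles: list[Role], threshold: int = 80) -> list[Role]:
    """Keep verified posts, plus observed roles at or above the threshold."""
    return [r for r in roles if r["verified"] or r["score"] >= threshold]


# Hypothetical sample data for illustration.
roles: list[Role] = [
    {"title": "Verified post", "verified": True, "score": 0},
    {"title": "Fresh observed role", "verified": False, "score": 92},
    {"title": "Stale observed role", "verified": False, "score": 35},
]
print([r["title"] for r in visible_roles(roles)])
# prints ['Verified post', 'Fresh observed role']
```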