TL, Research Inference
OpenAI
🔍 Observed
Hiring Activity Score: 81/100, Very Active (80-100)
- Base score
- Just posted (1d, flat to 14d)
- Has location and a quality description (5578 chars)
- 417 new listings in 30d (×0.98 age 1d)
- Tier 1 reputable (OpenAI) ×0.98 age 1d
- Low confidence (30%)
- Direct ATS (ashby)
San Francisco
First seen 1 day, 17 hours ago
Last seen 5 hours, 17 minutes ago
Ashby
Job Description
AI Summary
• Lead the design and development of high-performance inference runtimes for large-scale AI models, focusing on efficiency, reliability, and scalability across multiple GPUs and distributed systems.
• Optimize core execution paths including model execution, memory management, batching, scheduling, and inference-critical operators based on real-world workloads and performance profiling.
• Partner with research teams to ensure new model architectures translate efficiently into inference systems, informing how models are designed and evaluated across the organization.
• Requires experience building production inference systems, GPU performance engineering expertise, distributed systems knowledge, and the ability to reason end-to-end about inference pipelines.
• This is a research-enabling systems role (not product-facing) focused on making AI research grounded in real scalability constraints, requiring hands-on technical ownership and problem-solving at scale.
Skills
rust
aws
Job Information
- Company: OpenAI
- Location: San Francisco
- Job Type: Full-Time
- Experience Level: Mid
- Source: Ashby
- Status: Active