Software Engineer, Research Infrastructure

Company: OpenAI
Location: San Francisco
Posted on: April 17, 2025

Job Description:

Scaling - San Francisco

This role will support the fleet infrastructure team at OpenAI. The fleet team focuses on running the world's largest, most reliable, and frictionless GPU fleet to support OpenAI's general purpose model training and deployment. Work on this team ranges from:

- Maximizing GPUs doing useful work by building user-friendly scheduling and quota systems
- Running a reliable and low-maintenance platform by building push-button automation for Kubernetes cluster provisioning and upgrades
- Supporting research workflows with service frameworks and deployment systems
- Ensuring fast model startup times through high-performance snapshot delivery across blob storage down to hardware caching
- Much more!

About the Role

As an engineer within Fleet infrastructure, you will design, write, deploy, and operate infrastructure systems for model deployment and training on one of the world's largest GPU fleets. The scale is immense, the timelines are tight, and the organization is moving fast; this is an opportunity to shape a critical system in support of OpenAI's mission to advance AI capabilities responsibly.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:
- Design, implement and operate components of our compute fleet including job scheduling, cluster management, snapshot delivery, and CI/CD systems.
- Interface with researchers and product teams to understand workload requirements.
- Collaborate with hardware, infrastructure, and business teams to provide a high utilization and high reliability service.

You might thrive in this role if you:
- Have experience with hyperscale compute systems.
- Possess strong programming skills.
- Have experience working in public clouds (especially Azure).
- Have experience working in Kubernetes.
- Have an execution-focused mentality paired with a rigorous focus on user requirements.
- As a bonus, have an understanding of AI/ML workloads.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

Compensation

$360K - $440K + Offers Equity



