CareerAddict

Senior RL (Reinforcement Learning) Engineer - San Francisco, CA

Staffworx Limited

Posted on Jun 2, 2026 by Staffworx Limited
San Francisco, CA, 94101
Engineering
Immediate Start
$400k - $500k Annual
Full-Time

Senior Reinforcement Learning Engineer San Francisco, CA (FiDi) | On-site 5 days | Full-time $300,000 $500,000 + Equity

A small, elite AI research team working on reinforcement learning in open-ended settings. The team includes researchers from leading PhD programmes and tier-one AI organisations. Early-stage, well-resourced, and moving quickly this is a genuine ground-floor opportunity with significant scope and impact.

The Role

We are hiring a Senior RL Engineer to sit at the intersection of research and production engineering. You will own the translation of RL research ideas into reliable, measurable training systems and drive technically complex projects end to end.

Key Responsibilities

  • Build and improve RL training pipelines for language model-based agents
  • Implement reward functions, verifiers, environment interfaces, rollout pipelines, and evaluation harnesses
  • Design experiments to test whether RL methods are improving model behaviour, sample efficiency, robustness, or generalisation
  • Build monitoring tooling: regression tests, eval suites, and reward-hacking checks
  • Debug unstable training runs and diagnose learning dynamics failures across algorithms, rewards, data, infrastructure, and evals
  • Manage GPU clusters, distributed training, and compute efficiency
  • Build 0-to-1 systems for new RL workflows and harden them into reusable infrastructure
  • Own ambiguous technical problems from problem framing through to delivery

Requirements

  • Strong applied ML engineering background: shipped systems, open-source work, competitions, or early-stage startup experience
  • Hands-on experience scaling RL pipelines and debugging training issues
  • Familiarity with RL environments and large language models; diffusion model experience a plus
  • Python proficiency and strong working knowledge of PyTorch or JAX
  • Solid grounding in RL, supervised learning, optimisation, and modern deep learning
  • Independent, intellectually curious, and able to drive ambiguous problems to working solutions
  • Comfortable collaborating with researchers while holding high engineering standards
  • PhD not required strong applied experience equally valued

Package

  • Visa sponsorship available (H1B transfer, TN, OPT, O-1); existing US work authorisation preferred

Reference: 3117115758

https://jobs.careeraddict.com/post/113339580
Staffworx Limited

Senior RL (Reinforcement Learning) Engineer - San Francisco, CA

Staffworx Limited

Posted on Jun 2, 2026 by Staffworx Limited

Print
San Francisco, CA, 94101
Engineering
Immediate Start
$400k - $500k Annual
Full-Time

Senior Reinforcement Learning Engineer San Francisco, CA (FiDi) | On-site 5 days | Full-time $300,000 $500,000 + Equity

A small, elite AI research team working on reinforcement learning in open-ended settings. The team includes researchers from leading PhD programmes and tier-one AI organisations. Early-stage, well-resourced, and moving quickly this is a genuine ground-floor opportunity with significant scope and impact.

The Role

We are hiring a Senior RL Engineer to sit at the intersection of research and production engineering. You will own the translation of RL research ideas into reliable, measurable training systems and drive technically complex projects end to end.

Key Responsibilities

  • Build and improve RL training pipelines for language model-based agents
  • Implement reward functions, verifiers, environment interfaces, rollout pipelines, and evaluation harnesses
  • Design experiments to test whether RL methods are improving model behaviour, sample efficiency, robustness, or generalisation
  • Build monitoring tooling: regression tests, eval suites, and reward-hacking checks
  • Debug unstable training runs and diagnose learning dynamics failures across algorithms, rewards, data, infrastructure, and evals
  • Manage GPU clusters, distributed training, and compute efficiency
  • Build 0-to-1 systems for new RL workflows and harden them into reusable infrastructure
  • Own ambiguous technical problems from problem framing through to delivery

Requirements

  • Strong applied ML engineering background: shipped systems, open-source work, competitions, or early-stage startup experience
  • Hands-on experience scaling RL pipelines and debugging training issues
  • Familiarity with RL environments and large language models; diffusion model experience a plus
  • Python proficiency and strong working knowledge of PyTorch or JAX
  • Solid grounding in RL, supervised learning, optimisation, and modern deep learning
  • Independent, intellectually curious, and able to drive ambiguous problems to working solutions
  • Comfortable collaborating with researchers while holding high engineering standards
  • PhD not required strong applied experience equally valued

Package

  • Visa sponsorship available (H1B transfer, TN, OPT, O-1); existing US work authorisation preferred
Print

Reference: 3117115758

Share this job:
CareerAddict

Alert me to jobs like this:

Amplify your job search:

CV/résumé help

Increase interview chances with our downloads and specialist services.

CV Help

Expert career advice

Increase interview chances with our downloads and specialist services.

Visit Blog

Job compatibility

Increase interview chances with our downloads and specialist services.

Start Test