Reinforcement Learning Researcher (Humanoid)

REK•San Francisco, CA

57d•Onsite

About The Position

Were seeking a Humanoid Robot Reinforcement Learning Researcher to develop and train advanced control policies for our REK humanoid robots. Youll work closely with REKs robotics, simulation, and teleoperation teams to design reinforcement learning pipelines that improve movement quality, stability, responsiveness, and adaptability all optimized for real-time fighting performance. This role sits at the intersection of research and applied engineering: youll design algorithms, implement simulation environments, train agents, and test them on full-scale humanoid robots.

Requirements

2+ years of hands-on experience in reinforcement learning for humanoid robots at a university lab, research institute, or corporation.
Deep understanding of deep RL algorithms (PPO, SAC, TD3, DDPG, etc.).
Experience using Isaac Gym, MuJoCo, PyBullet, or Gazebo for simulation training.
Strong software engineering skills in Python, PyTorch, and C++ .
Understanding of robot kinematics, dynamics, and control systems.
Ability to run large-scale experiments efficiently and interpret quantitative results.
Strong communication skills and comfort working in an experimental, cross-disciplinary team.

Nice To Haves

Experience with Unitree G1 EDU
Background in teleoperation, imitation learning, or motion retargeting.
Familiarity with low-latency communication protocols and embedded robot control.
Understanding of VR systems, motion capture, or human-in-the-loop RL.
Mandarin proficiency a major plus for collaboration with Chinese robotics partners and manufacturers.

Responsibilities

RL Algorithm Development Design and implement state-of-the-art reinforcement learning algorithms for humanoid locomotion, balance, and reactive movement.
Simulation Environments Build and customize physics-based simulation environments (Isaac Gym, MuJoCo, PyBullet, etc.) for efficient training and domain randomization.
Sim-to-Real Transfer Develop robust transfer strategies that ensure trained policies perform reliably on physical humanoid robots.
Policy Evaluation Define performance metrics, run experiments, and benchmark results for both simulated and real-world tests.
Integration & Collaboration Work closely with teleoperation and control system teams to blend RL policies with operator input in hybrid control architectures.
Research & Publication Stay current with cutting-edge humanoid and robotics RL research and contribute to internal whitepapers or external publications as appropriate.
Data & Infrastructure Maintain scalable training pipelines and data logging systems using GPU clusters or cloud resources.

Benefits

Competitive salary based on experience.
Equity participation in a rapidly growing robotics entertainment startup.
Full benefits and access to REKs humanoid robotics lab and training infrastructure.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume