Locomotion Policy Training

Train, preview, and export a K1 velocity policy from this guide. Run the commands inside the Cyclo Lab container from /workspace/cyclo_lab.

Train

python scripts/reinforcement_learning/rsl_rl/train.py \
  --task Cyclo-Velocity-Flat-K1-Rev1-v0 \
  --num_envs 4096 \
  --headless

Training output is written to:

logs/rsl_rl/k1_rev1_velocity/*/

Each run directory contains:

logs/rsl_rl/k1_rev1_velocity/*/

model_*.pt
params/
- agent.yaml
- env.yaml
- sim2real.yaml

Play and Export

Run the matching play task after training. By default, play.py loads the latest run and the latest model_*.pt checkpoint for the selected task. It runs the policy in simulation and exports policy.onnx next to the checkpoint.

python scripts/reinforcement_learning/rsl_rl/play.py \
  --task Cyclo-Velocity-Flat-K1-Rev1-Play-v0 \
  --num_envs 16

Add --headless when exporting without the Isaac Sim window.

To play a specific checkpoint instead of the latest one, add --checkpoint:

python scripts/reinforcement_learning/rsl_rl/play.py \
  --task Cyclo-Velocity-Flat-K1-Rev1-Play-v0 \
  --checkpoint logs/rsl_rl/k1_rev1_velocity/2026-01-01_12-00-00/model_1000.pt \
  --num_envs 16

After play.py finishes, check the same training run directory. The exported/ directory is added by play.py:

logs/rsl_rl/k1_rev1_velocity/*/

model_*.pt
exported/
- policy.onnx
- policy.pt
params/
- agent.yaml
- env.yaml
- sim2real.yaml

Prepare for Sim2Real

Before moving to Sim2Real, make sure these required files are ready:

exported/policy.onnx
params/sim2real.yaml

Then follow How to Deploy Your Own Cyclo Lab-Trained Policy to place the asset under the Sim2Real asset root and connect it to the Velocity mode.

note

exported/policy.onnx and params/sim2real.yaml are a matched pair from one logs/rsl_rl/.../<run>/ directory. Do not pair policy.onnx from one run with sim2real.yaml from another run.

AI Manipulator

OpenManipulator

Locomotion Policy Training

Train

Play and Export

Prepare for Sim2Real

Model Reference

Protocol

Humanoid

Robot Hand

Manipulator

TurtleBot3

AI Manipulator

OpenManipulator

Overview

DYNAMIXEL Software

Arduino IDE

R+ 1.0

R+ Manager 2.0

R+Task 3.0

Controller

Interface

Bioloid

Engineer

FAQ by Topic

Guides

Train​

Play and Export​

Prepare for Sim2Real​

Train

Play and Export

Prepare for Sim2Real