# Humanoidbench RL We provide a basic RL training example for Humanoidbench tasks. RL framework: `stable-baselines3` RL algorithm: `PPO` Simulator: `MuJoCo` and `IsaacGym` and `IsaacLab` ## Installation ```bash pip install stable-baselines3 pip install wandb pip install tensorboard ``` Wandb login, enter your wandb account token. ```bash wandb login ``` ## Training > NOTE: > 1. Modify `task: humanoidbench:Stand` in the config files to the task you want to train. > 2. Modify `use_wandb: true` and `wandb_entity: ` in the config files to use wandb to log the training process. - MuJoCo: ```bash python roboverse_learn/humanoidbench_rl/train_sb3.py mujoco ``` - IsaacGym: ```bash python roboverse_learn/humanoidbench_rl/train_sb3.py isaacgym ``` - IsaacLab: ```bash python roboverse_learn/humanoidbench_rl/train_sb3.py isaaclab ``` ## Task list - [ ] Balance - Not Implemented because of collision detection issues. - [x] Crawl - [x] Cube - [x] Door - [ ] Highbar - Not implemented due to the need to connect H1 and Highbar. - [ ] Hurdle - Not Implemented because of collision detection issues. - [ ] Maze - Not Implemented because of collision detection issues. - [x] Package - [ ] Pole - Not Implemented because of collision detection issues. - [x] Powerlift - [x] Push - [x] Run - [x] Sit - [x] Slide - [ ] Spoon - Not Implemented because of sensor detection issues. - [x] Stair - [x] Stand