RoboVerse Learn# Imitation Learning Diffusion Policy ACT OpenVLA SmolVLA RDT Octo Design Philosophy Configuration Management (Hydra + YAML) Reinforcement Learning PPO Fast TD3 SAC TD3 SkillBlender RL World Model CNN_MLP VQGAN_MLP Diffusion Transformer