RoboVerse Learn# Imitation Learning Diffusion-Policy OpenVLA RDT Octo Reinforcement Learning PPO SAC Dreamer World Model CNN_MLP VQGAN_MLP Diffusion Transformer