MDMini Drone Shop AI
SOLE-R1: Learning Drone Tasks from Video & Language, No Reward Hacking
A new AI model, SOLE-R1, acts as the sole reward for robot reinforcement learning, letting drones learn complex tasks from video and language without explicit rewards or demonstrations, and resists reward hacking.
Mar 31·1 min read·Research