RoboMeter: Learning Dense Rewards from Successes and Failures
RoboMeter trains dense reward models from both successful and failed robot trajectories, addressing a key gap in prior methods that learn only from expert demonstrations.