aHiroakiIshikawa/openarm_mini_dual_camera_blue_sponge_0529_01_right_only_lpf_hybrid_v4
Viewer • Updated • 85.1k • 187
How to use aHiroakiIshikawa/sarm-openarm-blue-sponge with LeRobot:
A Success-Aware Reward Model (SARM) predicts a dense reward signal from observations, typically used downstream for reinforcement learning or human-in-the-loop fine-tuning when task success is not directly observable.
This reward model has been trained and pushed to the Hub using LeRobot. See the full documentation at LeRobot Docs.
lerobot-train \
--dataset.repo_id=${HF_USER}/<dataset> \
--reward_model.type=sarm \
--output_dir=outputs/train/<desired_reward_model_repo_id> \
--job_name=lerobot_reward_training \
--reward_model.device=cuda \
--reward_model.repo_id=${HF_USER}/<desired_reward_model_repo_id> \
--wandb.enable=true
Writes checkpoints to outputs/train/<desired_reward_model_repo_id>/checkpoints/.
from lerobot.rewards import make_reward_model
reward_model = make_reward_model(pretrained_path="<hf_user>/<reward_model_repo_id>")
reward = reward_model.compute_reward(batch)