aswinkumar99 commited on
Commit
fd198e1
·
verified ·
1 Parent(s): c41b303

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +91 -0
README.md ADDED
@@ -0,0 +1,91 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: mit
5
+ tags:
6
+ - SmolVLA
7
+ - LeRobot
8
+ - robotics
9
+ - imitation-learning
10
+ - behavior-cloning
11
+ - so101
12
+ pipeline_tag: reinforcement-learning
13
+ library_name: lerobot
14
+ base_model:
15
+ - lerobot/smolvla_base
16
+ ---
17
+
18
+ # LeRobot SO101 SmolVLA task1-unknown_bs64_s20000
19
+
20
+ ## Summary
21
+
22
+ This repository contains the final checkpoint for a SmolVLA policy fine-tune trained on `aswinkumar99/task1-unknown` for SO101 sponge pick-and-place experiments.
23
+
24
+ Dataset meaning: Task 1: Single Sponge - No Distractors (random-locations / unknown).
25
+
26
+ This SmolVLA policy is a fine-tune of `lerobot/smolvla_base`, as recorded by both the launch command (`--policy.path=lerobot/smolvla_base`) and the saved training config (`pretrained_path: lerobot/smolvla_base`).
27
+
28
+ ## Training Setup
29
+
30
+ - Dataset repo: `aswinkumar99/task1-unknown`
31
+ - Local dataset root during training: `/home/riftuser/datasets_combined/aswinkumar99/task1-unknown`
32
+ - Output directory during training: `/home/riftuser/outputs_matrix/smolvla/task1-unknown_bs64_s20000`
33
+ - Batch size: `64`
34
+ - Training steps: `20000`
35
+ - Checkpoint save frequency: `5000`
36
+ - Data loader workers: `8`
37
+ - WandB project: `so101-layout-generalization`
38
+ - GPU: `NVIDIA RTX PRO 6000 Blackwell Max-Q Workstation Edition`
39
+ - Python: `CPython 3.12.13`
40
+ - CUDA: `12.9`
41
+ - Training start: `2026-04-24T00:24:53.023942+00:00`
42
+ - Training end: `2026-04-24T02:49:03`
43
+ - Approximate training duration: `2h 24m 9s`
44
+
45
+ - Base model: `lerobot/smolvla_base`
46
+ - Observation camera rename map: `{"observation.images.overhead": "observation.images.camera1", "observation.images.wrist": "observation.images.camera2"}`
47
+ - Action chunk size: `50`
48
+ - Action steps predicted: `50`
49
+
50
+ ## Exact Training Command
51
+
52
+ ```bash
53
+ lerobot-train \
54
+ --dataset.repo_id=aswinkumar99/task1-unknown \
55
+ --dataset.root=/home/riftuser/datasets_combined/aswinkumar99/task1-unknown \
56
+ --dataset.video_backend=torchcodec \
57
+ --output_dir=/home/riftuser/outputs_matrix/smolvla/task1-unknown_bs64_s20000 \
58
+ --job_name=smolvla_task1-unknown_bs64 \
59
+ --batch_size=64 \
60
+ --steps=20000 \
61
+ --log_freq=200 \
62
+ --save_freq=5000 \
63
+ --save_checkpoint=true \
64
+ --num_workers=8 \
65
+ --wandb.enable=true \
66
+ --wandb.project=so101-layout-generalization \
67
+ --wandb.mode=online \
68
+ --wandb.disable_artifact=true \
69
+ --policy.path=lerobot/smolvla_base \
70
+ --policy.device=cuda \
71
+ --policy.push_to_hub=false \
72
+ --rename_map={"observation.images.overhead": "observation.images.camera1", "observation.images.wrist": "observation.images.camera2"}
73
+ ```
74
+
75
+ ## Repository Contents
76
+
77
+ - `pretrained_model/`: final downloadable model artifacts for inference/loading
78
+ - `training_state/`: optimizer, RNG, scheduler/state, and step information for resuming or auditability
79
+
80
+ ## Notes
81
+
82
+ - This repo stores the final checkpoint that was uploaded from the cloud training workspace.
83
+ - The checkpoint was trained with LeRobot tooling via `lerobot-train`.
84
+ - For SO101 experiments in this workspace, the dataset source was created by Aswinkumar.
85
+
86
+ ## Creator
87
+
88
+ Aswinkumar
89
+
90
+ - Website: [aswinkumar.me](https://aswinkumar.me)
91
+ - Hugging Face repo: <https://huggingface.co/aswinkumar99/LeRobot-SO101-SmolVLA-task1-unknown_bs64_s20000>