hello-world-yoo commited on
Commit
5cdbd97
·
verified ·
1 Parent(s): 336a9cb

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  license: apache-2.0
3
- base_model: nvidia/GR00T-N1.5-3B
4
  tags:
5
  - robotics
6
  - gr00t
@@ -15,14 +15,14 @@ pipeline_tag: robotics
15
 
16
  # GR00T-N1.6-3B-Pick-Banana-Real
17
 
18
- A fine-tuned version of [nvidia/GR00T-N1.5-3B](https://huggingface.co/nvidia/GR00T-N1.5-3B) for a banana pick-and-place task on a real SO-ARM101 robot, trained on the [hi-space/SO-ARM101-PICK-BANANA](https://huggingface.co/datasets/hi-space/SO-ARM101-PICK-BANANA) dataset.
19
 
20
  ## Model Description
21
 
22
  GR00T-N1.6 (Gr00tN1d6) is a vision-language-action (VLA) model for robot manipulation. This checkpoint is fine-tuned for a pick-and-place task where the robot picks up a banana and places it on a plate using a real SO-ARM101 robot arm.
23
 
24
  - **Architecture:** Gr00tN1d6 with Eagle-Block2A-2B-v2 vision-language backbone + diffusion policy action head
25
- - **Base model:** nvidia/GR00T-N1.5-3B
26
  - **Task:** Pick banana and place on plate (real robot)
27
  - **Robot:** SO-ARM101
28
  - **Action horizon:** 50 steps
@@ -65,4 +65,4 @@ This model is fine-tuned for a real-world robotic banana pick-and-place task usi
65
 
66
  ## License
67
 
68
- This model inherits the license from the base model [nvidia/GR00T-N1.5-3B](https://huggingface.co/nvidia/GR00T-N1.5-3B). Please refer to NVIDIA's terms for usage restrictions.
 
1
  ---
2
  license: apache-2.0
3
+ base_model: nvidia/GR00T-N1.6-3B
4
  tags:
5
  - robotics
6
  - gr00t
 
15
 
16
  # GR00T-N1.6-3B-Pick-Banana-Real
17
 
18
+ A fine-tuned version of [nvidia/GR00T-N1.6-3B](https://huggingface.co/nvidia/GR00T-N1.6-3B) for a banana pick-and-place task on a real SO-ARM101 robot, trained on the [hi-space/SO-ARM101-PICK-BANANA](https://huggingface.co/datasets/hi-space/SO-ARM101-PICK-BANANA) dataset.
19
 
20
  ## Model Description
21
 
22
  GR00T-N1.6 (Gr00tN1d6) is a vision-language-action (VLA) model for robot manipulation. This checkpoint is fine-tuned for a pick-and-place task where the robot picks up a banana and places it on a plate using a real SO-ARM101 robot arm.
23
 
24
  - **Architecture:** Gr00tN1d6 with Eagle-Block2A-2B-v2 vision-language backbone + diffusion policy action head
25
+ - **Base model:** nvidia/GR00T-N1.6-3B
26
  - **Task:** Pick banana and place on plate (real robot)
27
  - **Robot:** SO-ARM101
28
  - **Action horizon:** 50 steps
 
65
 
66
  ## License
67
 
68
+ This model inherits the license from the base model [nvidia/GR00T-N1.6-3B](https://huggingface.co/nvidia/GR00T-N1.6-3B). Please refer to NVIDIA's terms for usage restrictions.