robomotic commited on
Commit
6fc6e7f
·
verified ·
1 Parent(s): e3c44c8

Upload yam_lift_cube_vision/2026-05-03_08-02-44/README.md with huggingface_hub

Browse files
yam_lift_cube_vision/2026-05-03_08-02-44/README.md ADDED
@@ -0,0 +1,100 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - reinforcement-learning
4
+ - robotics
5
+ - mujoco
6
+ - onnx
7
+ - rsl-rl
8
+ library_name: pytorch
9
+ ---
10
+
11
+ # yam_lift_cube_vision Policy
12
+
13
+ **Run:** `2026-05-03_08-02-44`
14
+ **Uploaded:** 2026-05-03 13:39:35 UTC
15
+
16
+ ## Training Configuration
17
+
18
+ | Parameter | Value |
19
+ |-----------|-------|
20
+ | Algorithm | PPO |
21
+ | Max Iterations | unknown |
22
+ | Final Iteration | 2999 |
23
+ | Learning Rate | unknown |
24
+ | Gamma | unknown |
25
+ | Learning Epochs | unknown |
26
+ | Mini Batches | unknown |
27
+ | Entropy Coefficient | unknown |
28
+
29
+ ## Network Architecture
30
+
31
+ | Component | Hidden Dimensions | Activation |
32
+ |-----------|------------------|------------|
33
+ | Actor | unknown | unknown |
34
+ | Critic | unknown | unknown |
35
+
36
+ ## Environment
37
+
38
+ | Parameter | Value |
39
+ |-----------|-------|
40
+ | Num Environments | unknown |
41
+ | Decimation | unknown |
42
+ | Episode Length (s) | unknown |
43
+
44
+ ## Command Ranges
45
+
46
+ - No command ranges found
47
+
48
+ ## Reward Functions
49
+
50
+ - No reward terms found
51
+
52
+ ## Files
53
+
54
+ | File | Description |
55
+ |------|-------------|
56
+ | `model_final.pt` | Final PyTorch checkpoint (iteration 2999) |
57
+ | `policy.onnx` | Exported ONNX policy |
58
+ | `agent.yaml` | Agent configuration |
59
+ | `env.yaml` | Environment configuration |
60
+ | `mjlab.diff` | Git diff snapshot for reproducibility |
61
+
62
+ ## Usage
63
+
64
+ ### Load PyTorch Checkpoint
65
+
66
+ ```python
67
+ import torch
68
+ from mjlab.rl.runner import MjlabOnPolicyRunner
69
+
70
+ checkpoint = torch.load("model_final.pt", map_location="cpu")
71
+ actor_state = checkpoint["actor_state_dict"]
72
+ # Use with your environment setup
73
+ ```
74
+
75
+ ### Load ONNX Policy
76
+
77
+ ```python
78
+ import onnxruntime as ort
79
+ import numpy as np
80
+
81
+ session = ort.InferenceSession("policy.onnx")
82
+ observation = np.zeros((1, obs_dim), dtype=np.float32)
83
+ action = session.run(None, {session.get_inputs()[0].name: observation})[0]
84
+ ```
85
+
86
+ ### Load with MjLab
87
+
88
+ ```bash
89
+ # Option 1: Clone the HF repository
90
+ git lfs install
91
+ git clone https://huggingface.co/robomotic/mjlab-policies
92
+ cd mjlab-policies
93
+ # Navigate to the appropriate directory for this run
94
+
95
+ # Option 2: Download just this run using HF CLI
96
+ huggingface-cli download robomotic/mjlab-policies yam_lift_cube_vision/2026-05-03_08-02-44/model_final.pt
97
+
98
+ # Play the policy
99
+ uv run play --task yam_lift_cube_vision --checkpoint path/to/model_final.pt
100
+ ```