cy0307 commited on
Commit
ee93657
·
verified ·
1 Parent(s): cb2eae0

Add files using upload-large-folder tool

Browse files
TASK_METHOD_20_GAP_AUDIT.md CHANGED
@@ -1,6 +1,6 @@
1
  # Task Method 20-Result Gap Audit
2
 
3
- Generated: `2026-06-19T11:30:03+00:00`
4
 
5
  This audit is the explicit gap ledger for the 9-method x 20-task result matrix.
6
  It keeps missing cells visible while preserving the rule that a numeric score
@@ -9,8 +9,8 @@ requires a real task target and source artifact.
9
  ## Score Summary
10
 
11
  - Method-task records: `180`
12
- - Numeric scored records: `153`
13
- - Scoreless records: `27`
14
  - Proxy-scored records: `4`
15
  - Source matrix: [`docs/data/task_method_20_result_matrix.json`](docs/data/task_method_20_result_matrix.json)
16
 
@@ -24,7 +24,7 @@ requires a real task target and source artifact.
24
  | 128ep Aligned NN | metadata128_neural_mlp | 18/20 | 2 | 0 | not_supported_by_metadata_only_package: 2, scored: 18 |
25
  | 128ep Raw Simple | raw128_simple | 20/20 | 0 | 2 | proxy_scored: 2, scored: 18 |
26
  | 128ep Raw NN | raw128_neural_mlp | 20/20 | 0 | 2 | proxy_scored: 2, scored: 18 |
27
- | Qwen3-Omni v6 LoRA | qwen3_omni_v6_lora | 16/20 | 4 | 0 | not_evaluated_in_verified_package: 4, scored: 16 |
28
  | Cosmos3-Super Reasoner | cosmos3_super_reasoner | 10/20 | 10 | 0 | not_evaluated_in_verified_package: 10, scored: 10 |
29
  | Cosmos3-Nano Future Window | cosmos3_nano_future_window | 11/20 | 9 | 0 | not_evaluated_in_verified_package: 9, scored: 11 |
30
 
@@ -32,7 +32,7 @@ requires a real task target and source artifact.
32
 
33
  | Status | Count | Next step |
34
  | --- | --- | --- |
35
- | not_evaluated_in_verified_package | 23 | Generate verified model outputs for this task contract and score them against the held-out labels. |
36
  | not_supported_by_metadata_only_package | 2 | Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score. |
37
  | unsupported_without_required_target | 2 | Export the missing target field for this 128-episode method, then rerun the same train/validation/test split. |
38
 
@@ -41,13 +41,11 @@ requires a real task target and source artifact.
41
  | Task | Task label | Method | Status | Required evidence |
42
  | --- | --- | --- | --- | --- |
43
  | 02 | Procedure Step Recognition | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
44
- | 05 | Hand Trajectory Forecasting | Qwen3-Omni v6 LoRA | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
45
  | 05 | Hand Trajectory Forecasting | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
46
  | 05 | Hand Trajectory Forecasting | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
47
  | 07 | Object Relevance Prediction | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
48
  | 08 | Language Grounding | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
49
  | 09 | Cross-Modal Retrieval | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
50
- | 10 | Cross-Modal Reconstruction | Qwen3-Omni v6 LoRA | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
51
  | 10 | Cross-Modal Reconstruction | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
52
  | 11 | Temporal Order Verification | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
53
  | 11 | Temporal Order Verification | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
@@ -60,7 +58,6 @@ requires a real task target and source artifact.
60
  | 15 | Interaction Text Prediction | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
61
  | 15 | Interaction Text Prediction | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
62
  | 17 | Future Object-Set Forecasting | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
63
- | 18 | IMU-to-Hand Pose Reconstruction | Qwen3-Omni v6 LoRA | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
64
  | 18 | IMU-to-Hand Pose Reconstruction | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
65
  | 18 | IMU-to-Hand Pose Reconstruction | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
66
  | 19 | Camera-View Synchronization Retrieval | 128ep Aligned Simple | unsupported | Export the missing target field for this 128-episode method, then rerun the same train/validation/test split. |
 
1
  # Task Method 20-Result Gap Audit
2
 
3
+ Generated: `2026-06-19T11:42:08+00:00`
4
 
5
  This audit is the explicit gap ledger for the 9-method x 20-task result matrix.
6
  It keeps missing cells visible while preserving the rule that a numeric score
 
9
  ## Score Summary
10
 
11
  - Method-task records: `180`
12
+ - Numeric scored records: `156`
13
+ - Scoreless records: `24`
14
  - Proxy-scored records: `4`
15
  - Source matrix: [`docs/data/task_method_20_result_matrix.json`](docs/data/task_method_20_result_matrix.json)
16
 
 
24
  | 128ep Aligned NN | metadata128_neural_mlp | 18/20 | 2 | 0 | not_supported_by_metadata_only_package: 2, scored: 18 |
25
  | 128ep Raw Simple | raw128_simple | 20/20 | 0 | 2 | proxy_scored: 2, scored: 18 |
26
  | 128ep Raw NN | raw128_neural_mlp | 20/20 | 0 | 2 | proxy_scored: 2, scored: 18 |
27
+ | Qwen3-Omni v6 LoRA | qwen3_omni_v6_lora | 19/20 | 1 | 0 | not_evaluated_in_verified_package: 1, scored: 19 |
28
  | Cosmos3-Super Reasoner | cosmos3_super_reasoner | 10/20 | 10 | 0 | not_evaluated_in_verified_package: 10, scored: 10 |
29
  | Cosmos3-Nano Future Window | cosmos3_nano_future_window | 11/20 | 9 | 0 | not_evaluated_in_verified_package: 9, scored: 11 |
30
 
 
32
 
33
  | Status | Count | Next step |
34
  | --- | --- | --- |
35
+ | not_evaluated_in_verified_package | 20 | Generate verified model outputs for this task contract and score them against the held-out labels. |
36
  | not_supported_by_metadata_only_package | 2 | Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score. |
37
  | unsupported_without_required_target | 2 | Export the missing target field for this 128-episode method, then rerun the same train/validation/test split. |
38
 
 
41
  | Task | Task label | Method | Status | Required evidence |
42
  | --- | --- | --- | --- | --- |
43
  | 02 | Procedure Step Recognition | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
 
44
  | 05 | Hand Trajectory Forecasting | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
45
  | 05 | Hand Trajectory Forecasting | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
46
  | 07 | Object Relevance Prediction | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
47
  | 08 | Language Grounding | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
48
  | 09 | Cross-Modal Retrieval | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
 
49
  | 10 | Cross-Modal Reconstruction | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
50
  | 11 | Temporal Order Verification | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
51
  | 11 | Temporal Order Verification | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
 
58
  | 15 | Interaction Text Prediction | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
59
  | 15 | Interaction Text Prediction | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
60
  | 17 | Future Object-Set Forecasting | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
 
61
  | 18 | IMU-to-Hand Pose Reconstruction | Cosmos3-Super Reasoner | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
62
  | 18 | IMU-to-Hand Pose Reconstruction | Cosmos3-Nano Future Window | not evaluated | Generate verified model outputs for this task contract and score them against the held-out labels. |
63
  | 19 | Camera-View Synchronization Retrieval | 128ep Aligned Simple | unsupported | Export the missing target field for this 128-episode method, then rerun the same train/validation/test split. |
data/mirror_parity.json CHANGED
The diff for this file is too large to render. See raw diff
 
data/publication_audit.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-19T11:30:51+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
@@ -227,19 +227,19 @@
227
  "hf_space_bundle": {
228
  "root": "hf_publish/space",
229
  "exists": true,
230
- "file_count": 443,
231
- "text_file_count": 337,
232
  "largest_file": {
233
- "path": "results/omni_finetune/xperience10m_qwen3_omni_v6_cross_modal_retrieval_probe_a100_20260618T000000Z/cross_modal_retrieval/predictions.jsonl",
234
- "bytes": 9251901
235
  },
236
  "violations": []
237
  },
238
  "hf_artifact_bundle": {
239
  "root": "hf_publish/artifacts",
240
  "exists": true,
241
- "file_count": 4060,
242
- "text_file_count": 1168,
243
  "largest_file": {
244
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
245
  "bytes": 135591061
@@ -249,8 +249,8 @@
249
  "hf_model_bundle": {
250
  "root": "hf_publish/model",
251
  "exists": true,
252
- "file_count": 4804,
253
- "text_file_count": 1336,
254
  "largest_file": {
255
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
256
  "bytes": 135591061
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-19T12:02:40+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
 
227
  "hf_space_bundle": {
228
  "root": "hf_publish/space",
229
  "exists": true,
230
+ "file_count": 465,
231
+ "text_file_count": 347,
232
  "largest_file": {
233
+ "path": "results/omni_finetune/xperience10m_qwen3_omni_v6_sensor_target_probes_a100_20260619T000000Z/modality_reconstruction/predictions.jsonl",
234
+ "bytes": 10221085
235
  },
236
  "violations": []
237
  },
238
  "hf_artifact_bundle": {
239
  "root": "hf_publish/artifacts",
240
  "exists": true,
241
+ "file_count": 2775,
242
+ "text_file_count": 1178,
243
  "largest_file": {
244
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
245
  "bytes": 135591061
 
249
  "hf_model_bundle": {
250
  "root": "hf_publish/model",
251
  "exists": true,
252
+ "file_count": 3248,
253
+ "text_file_count": 1346,
254
  "largest_file": {
255
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
256
  "bytes": 135591061
data/source_alignment_audit.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-19T11:30:21+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
 
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-19T12:02:10+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
data/task_method_20_gap_audit.json CHANGED
@@ -1,10 +1,10 @@
1
  {
2
- "generated_at_utc": "2026-06-19T11:30:03+00:00",
3
  "immediate_actions": [
4
  {
5
  "artifact": "docs/data/task_method_20_gap_audit.json",
6
  "id": "gap_audit",
7
- "purpose": "Keep the 27 scoreless cells visible and reproducible."
8
  },
9
  {
10
  "artifact": "scripts/omni/score_model_output_probes.py",
@@ -100,11 +100,11 @@
100
  "proxy_scored_task_count": 0,
101
  "result_record_count": 20,
102
  "scope": "128 selected episodes, held-out test",
103
- "scored_task_count": 16,
104
- "scoreless_task_count": 4,
105
  "status_counts": {
106
- "not_evaluated_in_verified_package": 4,
107
- "scored": 16
108
  }
109
  },
110
  "raw128_neural_mlp": {
@@ -139,10 +139,10 @@
139
  "cosmos3_super_reasoner": 10,
140
  "metadata128_neural_mlp": 2,
141
  "metadata128_simple": 2,
142
- "qwen3_omni_v6_lora": 4
143
  },
144
  "missing_by_status": {
145
- "not_evaluated_in_verified_package": 23,
146
  "not_supported_by_metadata_only_package": 2,
147
  "unsupported_without_required_target": 2
148
  },
@@ -152,8 +152,7 @@
152
  ],
153
  "05 Hand Trajectory Forecasting": [
154
  "cosmos3_nano_future_window",
155
- "cosmos3_super_reasoner",
156
- "qwen3_omni_v6_lora"
157
  ],
158
  "07 Object Relevance Prediction": [
159
  "cosmos3_nano_future_window"
@@ -165,8 +164,7 @@
165
  "cosmos3_super_reasoner"
166
  ],
167
  "10 Cross-Modal Reconstruction": [
168
- "cosmos3_super_reasoner",
169
- "qwen3_omni_v6_lora"
170
  ],
171
  "11 Temporal Order Verification": [
172
  "cosmos3_nano_future_window",
@@ -191,8 +189,7 @@
191
  ],
192
  "18 IMU-to-Hand Pose Reconstruction": [
193
  "cosmos3_nano_future_window",
194
- "cosmos3_super_reasoner",
195
- "qwen3_omni_v6_lora"
196
  ],
197
  "19 Camera-View Synchronization Retrieval": [
198
  "cosmos3_nano_future_window",
@@ -215,19 +212,6 @@
215
  "task_label": "Procedure Step Recognition",
216
  "task_number": 2
217
  },
218
- {
219
- "method": "Qwen3-Omni v6 LoRA",
220
- "metric_key": "mpjpe",
221
- "reason": "the verified public model package did not ask this branch to emit that task target; a new task-specific evaluation package is required for a numeric score",
222
- "recommended_next_step": "Generate verified model outputs for this task contract and score them against the held-out labels.",
223
- "scope": "multi_episode_128_partial_model_overlay",
224
- "series_id": "qwen3_omni_v6_lora",
225
- "status": "not_evaluated_in_verified_package",
226
- "status_label": "not evaluated",
227
- "task_id": "hand_trajectory_forecast",
228
- "task_label": "Hand Trajectory Forecasting",
229
- "task_number": 5
230
- },
231
  {
232
  "method": "Cosmos3-Super Reasoner",
233
  "metric_key": "mpjpe",
@@ -293,19 +277,6 @@
293
  "task_label": "Cross-Modal Retrieval",
294
  "task_number": 9
295
  },
296
- {
297
- "method": "Qwen3-Omni v6 LoRA",
298
- "metric_key": "r2",
299
- "reason": "the verified public model package did not ask this branch to emit that task target; a new task-specific evaluation package is required for a numeric score",
300
- "recommended_next_step": "Generate verified model outputs for this task contract and score them against the held-out labels.",
301
- "scope": "multi_episode_128_partial_model_overlay",
302
- "series_id": "qwen3_omni_v6_lora",
303
- "status": "not_evaluated_in_verified_package",
304
- "status_label": "not evaluated",
305
- "task_id": "modality_reconstruction",
306
- "task_label": "Cross-Modal Reconstruction",
307
- "task_number": 10
308
- },
309
  {
310
  "method": "Cosmos3-Super Reasoner",
311
  "metric_key": "r2",
@@ -462,19 +433,6 @@
462
  "task_label": "Future Object-Set Forecasting",
463
  "task_number": 17
464
  },
465
- {
466
- "method": "Qwen3-Omni v6 LoRA",
467
- "metric_key": "mae",
468
- "reason": "the verified public model package did not ask this branch to emit that task target; a new task-specific evaluation package is required for a numeric score",
469
- "recommended_next_step": "Generate verified model outputs for this task contract and score them against the held-out labels.",
470
- "scope": "multi_episode_128_partial_model_overlay",
471
- "series_id": "qwen3_omni_v6_lora",
472
- "status": "not_evaluated_in_verified_package",
473
- "status_label": "not evaluated",
474
- "task_id": "imu_to_hand_pose",
475
- "task_label": "IMU-to-Hand Pose Reconstruction",
476
- "task_number": 18
477
- },
478
  {
479
  "method": "Cosmos3-Super Reasoner",
480
  "metric_key": "mae",
@@ -600,8 +558,8 @@
600
  "method_count": 9,
601
  "method_task_record_count": 180,
602
  "proxy_scored_method_task_count": 4,
603
- "scored_method_task_count": 153,
604
- "scoreless_method_task_count": 27,
605
  "task_count": 20
606
  },
607
  "source_matrix": "docs/data/task_method_20_result_matrix.json",
 
1
  {
2
+ "generated_at_utc": "2026-06-19T11:42:08+00:00",
3
  "immediate_actions": [
4
  {
5
  "artifact": "docs/data/task_method_20_gap_audit.json",
6
  "id": "gap_audit",
7
+ "purpose": "Keep the 24 scoreless cells visible and reproducible."
8
  },
9
  {
10
  "artifact": "scripts/omni/score_model_output_probes.py",
 
100
  "proxy_scored_task_count": 0,
101
  "result_record_count": 20,
102
  "scope": "128 selected episodes, held-out test",
103
+ "scored_task_count": 19,
104
+ "scoreless_task_count": 1,
105
  "status_counts": {
106
+ "not_evaluated_in_verified_package": 1,
107
+ "scored": 19
108
  }
109
  },
110
  "raw128_neural_mlp": {
 
139
  "cosmos3_super_reasoner": 10,
140
  "metadata128_neural_mlp": 2,
141
  "metadata128_simple": 2,
142
+ "qwen3_omni_v6_lora": 1
143
  },
144
  "missing_by_status": {
145
+ "not_evaluated_in_verified_package": 20,
146
  "not_supported_by_metadata_only_package": 2,
147
  "unsupported_without_required_target": 2
148
  },
 
152
  ],
153
  "05 Hand Trajectory Forecasting": [
154
  "cosmos3_nano_future_window",
155
+ "cosmos3_super_reasoner"
 
156
  ],
157
  "07 Object Relevance Prediction": [
158
  "cosmos3_nano_future_window"
 
164
  "cosmos3_super_reasoner"
165
  ],
166
  "10 Cross-Modal Reconstruction": [
167
+ "cosmos3_super_reasoner"
 
168
  ],
169
  "11 Temporal Order Verification": [
170
  "cosmos3_nano_future_window",
 
189
  ],
190
  "18 IMU-to-Hand Pose Reconstruction": [
191
  "cosmos3_nano_future_window",
192
+ "cosmos3_super_reasoner"
 
193
  ],
194
  "19 Camera-View Synchronization Retrieval": [
195
  "cosmos3_nano_future_window",
 
212
  "task_label": "Procedure Step Recognition",
213
  "task_number": 2
214
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
215
  {
216
  "method": "Cosmos3-Super Reasoner",
217
  "metric_key": "mpjpe",
 
277
  "task_label": "Cross-Modal Retrieval",
278
  "task_number": 9
279
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
280
  {
281
  "method": "Cosmos3-Super Reasoner",
282
  "metric_key": "r2",
 
433
  "task_label": "Future Object-Set Forecasting",
434
  "task_number": 17
435
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
436
  {
437
  "method": "Cosmos3-Super Reasoner",
438
  "metric_key": "mae",
 
558
  "method_count": 9,
559
  "method_task_record_count": 180,
560
  "proxy_scored_method_task_count": 4,
561
+ "scored_method_task_count": 156,
562
+ "scoreless_method_task_count": 24,
563
  "task_count": 20
564
  },
565
  "source_matrix": "docs/data/task_method_20_result_matrix.json",
data/website_integrity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-19T11:30:46+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
@@ -351,7 +351,7 @@
351
  },
352
  {
353
  "path": "data/mirror_parity.json",
354
- "bytes": 1115613,
355
  "top_level_type": "dict"
356
  },
357
  {
@@ -486,7 +486,7 @@
486
  },
487
  {
488
  "path": "data/task_method_20_gap_audit.json",
489
- "bytes": 28259,
490
  "top_level_type": "dict"
491
  },
492
  {
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-19T12:02:12+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
 
351
  },
352
  {
353
  "path": "data/mirror_parity.json",
354
+ "bytes": 1161913,
355
  "top_level_type": "dict"
356
  },
357
  {
 
486
  },
487
  {
488
  "path": "data/task_method_20_gap_audit.json",
489
+ "bytes": 26093,
490
  "top_level_type": "dict"
491
  },
492
  {
docs/data/mirror_parity.json CHANGED
The diff for this file is too large to render. See raw diff
 
docs/data/publication_audit.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-19T11:30:51+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
@@ -227,19 +227,19 @@
227
  "hf_space_bundle": {
228
  "root": "hf_publish/space",
229
  "exists": true,
230
- "file_count": 443,
231
- "text_file_count": 337,
232
  "largest_file": {
233
- "path": "results/omni_finetune/xperience10m_qwen3_omni_v6_cross_modal_retrieval_probe_a100_20260618T000000Z/cross_modal_retrieval/predictions.jsonl",
234
- "bytes": 9251901
235
  },
236
  "violations": []
237
  },
238
  "hf_artifact_bundle": {
239
  "root": "hf_publish/artifacts",
240
  "exists": true,
241
- "file_count": 4060,
242
- "text_file_count": 1168,
243
  "largest_file": {
244
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
245
  "bytes": 135591061
@@ -249,8 +249,8 @@
249
  "hf_model_bundle": {
250
  "root": "hf_publish/model",
251
  "exists": true,
252
- "file_count": 4804,
253
- "text_file_count": 1336,
254
  "largest_file": {
255
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
256
  "bytes": 135591061
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-19T12:02:40+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
 
227
  "hf_space_bundle": {
228
  "root": "hf_publish/space",
229
  "exists": true,
230
+ "file_count": 465,
231
+ "text_file_count": 347,
232
  "largest_file": {
233
+ "path": "results/omni_finetune/xperience10m_qwen3_omni_v6_sensor_target_probes_a100_20260619T000000Z/modality_reconstruction/predictions.jsonl",
234
+ "bytes": 10221085
235
  },
236
  "violations": []
237
  },
238
  "hf_artifact_bundle": {
239
  "root": "hf_publish/artifacts",
240
  "exists": true,
241
+ "file_count": 2775,
242
+ "text_file_count": 1178,
243
  "largest_file": {
244
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
245
  "bytes": 135591061
 
249
  "hf_model_bundle": {
250
  "root": "hf_publish/model",
251
  "exists": true,
252
+ "file_count": 3248,
253
+ "text_file_count": 1346,
254
  "largest_file": {
255
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
256
  "bytes": 135591061
docs/data/source_alignment_audit.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-19T11:30:21+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
 
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-19T12:02:10+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
docs/data/task_method_20_gap_audit.json CHANGED
@@ -1,10 +1,10 @@
1
  {
2
- "generated_at_utc": "2026-06-19T11:30:03+00:00",
3
  "immediate_actions": [
4
  {
5
  "artifact": "docs/data/task_method_20_gap_audit.json",
6
  "id": "gap_audit",
7
- "purpose": "Keep the 27 scoreless cells visible and reproducible."
8
  },
9
  {
10
  "artifact": "scripts/omni/score_model_output_probes.py",
@@ -100,11 +100,11 @@
100
  "proxy_scored_task_count": 0,
101
  "result_record_count": 20,
102
  "scope": "128 selected episodes, held-out test",
103
- "scored_task_count": 16,
104
- "scoreless_task_count": 4,
105
  "status_counts": {
106
- "not_evaluated_in_verified_package": 4,
107
- "scored": 16
108
  }
109
  },
110
  "raw128_neural_mlp": {
@@ -139,10 +139,10 @@
139
  "cosmos3_super_reasoner": 10,
140
  "metadata128_neural_mlp": 2,
141
  "metadata128_simple": 2,
142
- "qwen3_omni_v6_lora": 4
143
  },
144
  "missing_by_status": {
145
- "not_evaluated_in_verified_package": 23,
146
  "not_supported_by_metadata_only_package": 2,
147
  "unsupported_without_required_target": 2
148
  },
@@ -152,8 +152,7 @@
152
  ],
153
  "05 Hand Trajectory Forecasting": [
154
  "cosmos3_nano_future_window",
155
- "cosmos3_super_reasoner",
156
- "qwen3_omni_v6_lora"
157
  ],
158
  "07 Object Relevance Prediction": [
159
  "cosmos3_nano_future_window"
@@ -165,8 +164,7 @@
165
  "cosmos3_super_reasoner"
166
  ],
167
  "10 Cross-Modal Reconstruction": [
168
- "cosmos3_super_reasoner",
169
- "qwen3_omni_v6_lora"
170
  ],
171
  "11 Temporal Order Verification": [
172
  "cosmos3_nano_future_window",
@@ -191,8 +189,7 @@
191
  ],
192
  "18 IMU-to-Hand Pose Reconstruction": [
193
  "cosmos3_nano_future_window",
194
- "cosmos3_super_reasoner",
195
- "qwen3_omni_v6_lora"
196
  ],
197
  "19 Camera-View Synchronization Retrieval": [
198
  "cosmos3_nano_future_window",
@@ -215,19 +212,6 @@
215
  "task_label": "Procedure Step Recognition",
216
  "task_number": 2
217
  },
218
- {
219
- "method": "Qwen3-Omni v6 LoRA",
220
- "metric_key": "mpjpe",
221
- "reason": "the verified public model package did not ask this branch to emit that task target; a new task-specific evaluation package is required for a numeric score",
222
- "recommended_next_step": "Generate verified model outputs for this task contract and score them against the held-out labels.",
223
- "scope": "multi_episode_128_partial_model_overlay",
224
- "series_id": "qwen3_omni_v6_lora",
225
- "status": "not_evaluated_in_verified_package",
226
- "status_label": "not evaluated",
227
- "task_id": "hand_trajectory_forecast",
228
- "task_label": "Hand Trajectory Forecasting",
229
- "task_number": 5
230
- },
231
  {
232
  "method": "Cosmos3-Super Reasoner",
233
  "metric_key": "mpjpe",
@@ -293,19 +277,6 @@
293
  "task_label": "Cross-Modal Retrieval",
294
  "task_number": 9
295
  },
296
- {
297
- "method": "Qwen3-Omni v6 LoRA",
298
- "metric_key": "r2",
299
- "reason": "the verified public model package did not ask this branch to emit that task target; a new task-specific evaluation package is required for a numeric score",
300
- "recommended_next_step": "Generate verified model outputs for this task contract and score them against the held-out labels.",
301
- "scope": "multi_episode_128_partial_model_overlay",
302
- "series_id": "qwen3_omni_v6_lora",
303
- "status": "not_evaluated_in_verified_package",
304
- "status_label": "not evaluated",
305
- "task_id": "modality_reconstruction",
306
- "task_label": "Cross-Modal Reconstruction",
307
- "task_number": 10
308
- },
309
  {
310
  "method": "Cosmos3-Super Reasoner",
311
  "metric_key": "r2",
@@ -462,19 +433,6 @@
462
  "task_label": "Future Object-Set Forecasting",
463
  "task_number": 17
464
  },
465
- {
466
- "method": "Qwen3-Omni v6 LoRA",
467
- "metric_key": "mae",
468
- "reason": "the verified public model package did not ask this branch to emit that task target; a new task-specific evaluation package is required for a numeric score",
469
- "recommended_next_step": "Generate verified model outputs for this task contract and score them against the held-out labels.",
470
- "scope": "multi_episode_128_partial_model_overlay",
471
- "series_id": "qwen3_omni_v6_lora",
472
- "status": "not_evaluated_in_verified_package",
473
- "status_label": "not evaluated",
474
- "task_id": "imu_to_hand_pose",
475
- "task_label": "IMU-to-Hand Pose Reconstruction",
476
- "task_number": 18
477
- },
478
  {
479
  "method": "Cosmos3-Super Reasoner",
480
  "metric_key": "mae",
@@ -600,8 +558,8 @@
600
  "method_count": 9,
601
  "method_task_record_count": 180,
602
  "proxy_scored_method_task_count": 4,
603
- "scored_method_task_count": 153,
604
- "scoreless_method_task_count": 27,
605
  "task_count": 20
606
  },
607
  "source_matrix": "docs/data/task_method_20_result_matrix.json",
 
1
  {
2
+ "generated_at_utc": "2026-06-19T11:42:08+00:00",
3
  "immediate_actions": [
4
  {
5
  "artifact": "docs/data/task_method_20_gap_audit.json",
6
  "id": "gap_audit",
7
+ "purpose": "Keep the 24 scoreless cells visible and reproducible."
8
  },
9
  {
10
  "artifact": "scripts/omni/score_model_output_probes.py",
 
100
  "proxy_scored_task_count": 0,
101
  "result_record_count": 20,
102
  "scope": "128 selected episodes, held-out test",
103
+ "scored_task_count": 19,
104
+ "scoreless_task_count": 1,
105
  "status_counts": {
106
+ "not_evaluated_in_verified_package": 1,
107
+ "scored": 19
108
  }
109
  },
110
  "raw128_neural_mlp": {
 
139
  "cosmos3_super_reasoner": 10,
140
  "metadata128_neural_mlp": 2,
141
  "metadata128_simple": 2,
142
+ "qwen3_omni_v6_lora": 1
143
  },
144
  "missing_by_status": {
145
+ "not_evaluated_in_verified_package": 20,
146
  "not_supported_by_metadata_only_package": 2,
147
  "unsupported_without_required_target": 2
148
  },
 
152
  ],
153
  "05 Hand Trajectory Forecasting": [
154
  "cosmos3_nano_future_window",
155
+ "cosmos3_super_reasoner"
 
156
  ],
157
  "07 Object Relevance Prediction": [
158
  "cosmos3_nano_future_window"
 
164
  "cosmos3_super_reasoner"
165
  ],
166
  "10 Cross-Modal Reconstruction": [
167
+ "cosmos3_super_reasoner"
 
168
  ],
169
  "11 Temporal Order Verification": [
170
  "cosmos3_nano_future_window",
 
189
  ],
190
  "18 IMU-to-Hand Pose Reconstruction": [
191
  "cosmos3_nano_future_window",
192
+ "cosmos3_super_reasoner"
 
193
  ],
194
  "19 Camera-View Synchronization Retrieval": [
195
  "cosmos3_nano_future_window",
 
212
  "task_label": "Procedure Step Recognition",
213
  "task_number": 2
214
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
215
  {
216
  "method": "Cosmos3-Super Reasoner",
217
  "metric_key": "mpjpe",
 
277
  "task_label": "Cross-Modal Retrieval",
278
  "task_number": 9
279
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
280
  {
281
  "method": "Cosmos3-Super Reasoner",
282
  "metric_key": "r2",
 
433
  "task_label": "Future Object-Set Forecasting",
434
  "task_number": 17
435
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
436
  {
437
  "method": "Cosmos3-Super Reasoner",
438
  "metric_key": "mae",
 
558
  "method_count": 9,
559
  "method_task_record_count": 180,
560
  "proxy_scored_method_task_count": 4,
561
+ "scored_method_task_count": 156,
562
+ "scoreless_method_task_count": 24,
563
  "task_count": 20
564
  },
565
  "source_matrix": "docs/data/task_method_20_result_matrix.json",
docs/data/website_integrity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-19T11:30:46+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
@@ -351,7 +351,7 @@
351
  },
352
  {
353
  "path": "data/mirror_parity.json",
354
- "bytes": 1115613,
355
  "top_level_type": "dict"
356
  },
357
  {
@@ -486,7 +486,7 @@
486
  },
487
  {
488
  "path": "data/task_method_20_gap_audit.json",
489
- "bytes": 28259,
490
  "top_level_type": "dict"
491
  },
492
  {
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-19T12:02:12+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
 
351
  },
352
  {
353
  "path": "data/mirror_parity.json",
354
+ "bytes": 1161913,
355
  "top_level_type": "dict"
356
  },
357
  {
 
486
  },
487
  {
488
  "path": "data/task_method_20_gap_audit.json",
489
+ "bytes": 26093,
490
  "top_level_type": "dict"
491
  },
492
  {
metrics/mirror_parity.json CHANGED
The diff for this file is too large to render. See raw diff
 
metrics/publication_audit.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-19T11:30:51+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
@@ -227,19 +227,19 @@
227
  "hf_space_bundle": {
228
  "root": "hf_publish/space",
229
  "exists": true,
230
- "file_count": 443,
231
- "text_file_count": 337,
232
  "largest_file": {
233
- "path": "results/omni_finetune/xperience10m_qwen3_omni_v6_cross_modal_retrieval_probe_a100_20260618T000000Z/cross_modal_retrieval/predictions.jsonl",
234
- "bytes": 9251901
235
  },
236
  "violations": []
237
  },
238
  "hf_artifact_bundle": {
239
  "root": "hf_publish/artifacts",
240
  "exists": true,
241
- "file_count": 4060,
242
- "text_file_count": 1168,
243
  "largest_file": {
244
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
245
  "bytes": 135591061
@@ -249,8 +249,8 @@
249
  "hf_model_bundle": {
250
  "root": "hf_publish/model",
251
  "exists": true,
252
- "file_count": 4804,
253
- "text_file_count": 1336,
254
  "largest_file": {
255
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
256
  "bytes": 135591061
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-19T12:02:40+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
 
227
  "hf_space_bundle": {
228
  "root": "hf_publish/space",
229
  "exists": true,
230
+ "file_count": 465,
231
+ "text_file_count": 347,
232
  "largest_file": {
233
+ "path": "results/omni_finetune/xperience10m_qwen3_omni_v6_sensor_target_probes_a100_20260619T000000Z/modality_reconstruction/predictions.jsonl",
234
+ "bytes": 10221085
235
  },
236
  "violations": []
237
  },
238
  "hf_artifact_bundle": {
239
  "root": "hf_publish/artifacts",
240
  "exists": true,
241
+ "file_count": 2775,
242
+ "text_file_count": 1178,
243
  "largest_file": {
244
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
245
  "bytes": 135591061
 
249
  "hf_model_bundle": {
250
  "root": "hf_publish/model",
251
  "exists": true,
252
+ "file_count": 3248,
253
+ "text_file_count": 1346,
254
  "largest_file": {
255
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
256
  "bytes": 135591061
metrics/source_alignment_audit.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-19T11:30:21+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
 
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-19T12:02:10+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
metrics/task_method_20_gap_audit.json CHANGED
@@ -1,10 +1,10 @@
1
  {
2
- "generated_at_utc": "2026-06-19T11:30:03+00:00",
3
  "immediate_actions": [
4
  {
5
  "artifact": "docs/data/task_method_20_gap_audit.json",
6
  "id": "gap_audit",
7
- "purpose": "Keep the 27 scoreless cells visible and reproducible."
8
  },
9
  {
10
  "artifact": "scripts/omni/score_model_output_probes.py",
@@ -100,11 +100,11 @@
100
  "proxy_scored_task_count": 0,
101
  "result_record_count": 20,
102
  "scope": "128 selected episodes, held-out test",
103
- "scored_task_count": 16,
104
- "scoreless_task_count": 4,
105
  "status_counts": {
106
- "not_evaluated_in_verified_package": 4,
107
- "scored": 16
108
  }
109
  },
110
  "raw128_neural_mlp": {
@@ -139,10 +139,10 @@
139
  "cosmos3_super_reasoner": 10,
140
  "metadata128_neural_mlp": 2,
141
  "metadata128_simple": 2,
142
- "qwen3_omni_v6_lora": 4
143
  },
144
  "missing_by_status": {
145
- "not_evaluated_in_verified_package": 23,
146
  "not_supported_by_metadata_only_package": 2,
147
  "unsupported_without_required_target": 2
148
  },
@@ -152,8 +152,7 @@
152
  ],
153
  "05 Hand Trajectory Forecasting": [
154
  "cosmos3_nano_future_window",
155
- "cosmos3_super_reasoner",
156
- "qwen3_omni_v6_lora"
157
  ],
158
  "07 Object Relevance Prediction": [
159
  "cosmos3_nano_future_window"
@@ -165,8 +164,7 @@
165
  "cosmos3_super_reasoner"
166
  ],
167
  "10 Cross-Modal Reconstruction": [
168
- "cosmos3_super_reasoner",
169
- "qwen3_omni_v6_lora"
170
  ],
171
  "11 Temporal Order Verification": [
172
  "cosmos3_nano_future_window",
@@ -191,8 +189,7 @@
191
  ],
192
  "18 IMU-to-Hand Pose Reconstruction": [
193
  "cosmos3_nano_future_window",
194
- "cosmos3_super_reasoner",
195
- "qwen3_omni_v6_lora"
196
  ],
197
  "19 Camera-View Synchronization Retrieval": [
198
  "cosmos3_nano_future_window",
@@ -215,19 +212,6 @@
215
  "task_label": "Procedure Step Recognition",
216
  "task_number": 2
217
  },
218
- {
219
- "method": "Qwen3-Omni v6 LoRA",
220
- "metric_key": "mpjpe",
221
- "reason": "the verified public model package did not ask this branch to emit that task target; a new task-specific evaluation package is required for a numeric score",
222
- "recommended_next_step": "Generate verified model outputs for this task contract and score them against the held-out labels.",
223
- "scope": "multi_episode_128_partial_model_overlay",
224
- "series_id": "qwen3_omni_v6_lora",
225
- "status": "not_evaluated_in_verified_package",
226
- "status_label": "not evaluated",
227
- "task_id": "hand_trajectory_forecast",
228
- "task_label": "Hand Trajectory Forecasting",
229
- "task_number": 5
230
- },
231
  {
232
  "method": "Cosmos3-Super Reasoner",
233
  "metric_key": "mpjpe",
@@ -293,19 +277,6 @@
293
  "task_label": "Cross-Modal Retrieval",
294
  "task_number": 9
295
  },
296
- {
297
- "method": "Qwen3-Omni v6 LoRA",
298
- "metric_key": "r2",
299
- "reason": "the verified public model package did not ask this branch to emit that task target; a new task-specific evaluation package is required for a numeric score",
300
- "recommended_next_step": "Generate verified model outputs for this task contract and score them against the held-out labels.",
301
- "scope": "multi_episode_128_partial_model_overlay",
302
- "series_id": "qwen3_omni_v6_lora",
303
- "status": "not_evaluated_in_verified_package",
304
- "status_label": "not evaluated",
305
- "task_id": "modality_reconstruction",
306
- "task_label": "Cross-Modal Reconstruction",
307
- "task_number": 10
308
- },
309
  {
310
  "method": "Cosmos3-Super Reasoner",
311
  "metric_key": "r2",
@@ -462,19 +433,6 @@
462
  "task_label": "Future Object-Set Forecasting",
463
  "task_number": 17
464
  },
465
- {
466
- "method": "Qwen3-Omni v6 LoRA",
467
- "metric_key": "mae",
468
- "reason": "the verified public model package did not ask this branch to emit that task target; a new task-specific evaluation package is required for a numeric score",
469
- "recommended_next_step": "Generate verified model outputs for this task contract and score them against the held-out labels.",
470
- "scope": "multi_episode_128_partial_model_overlay",
471
- "series_id": "qwen3_omni_v6_lora",
472
- "status": "not_evaluated_in_verified_package",
473
- "status_label": "not evaluated",
474
- "task_id": "imu_to_hand_pose",
475
- "task_label": "IMU-to-Hand Pose Reconstruction",
476
- "task_number": 18
477
- },
478
  {
479
  "method": "Cosmos3-Super Reasoner",
480
  "metric_key": "mae",
@@ -600,8 +558,8 @@
600
  "method_count": 9,
601
  "method_task_record_count": 180,
602
  "proxy_scored_method_task_count": 4,
603
- "scored_method_task_count": 153,
604
- "scoreless_method_task_count": 27,
605
  "task_count": 20
606
  },
607
  "source_matrix": "docs/data/task_method_20_result_matrix.json",
 
1
  {
2
+ "generated_at_utc": "2026-06-19T11:42:08+00:00",
3
  "immediate_actions": [
4
  {
5
  "artifact": "docs/data/task_method_20_gap_audit.json",
6
  "id": "gap_audit",
7
+ "purpose": "Keep the 24 scoreless cells visible and reproducible."
8
  },
9
  {
10
  "artifact": "scripts/omni/score_model_output_probes.py",
 
100
  "proxy_scored_task_count": 0,
101
  "result_record_count": 20,
102
  "scope": "128 selected episodes, held-out test",
103
+ "scored_task_count": 19,
104
+ "scoreless_task_count": 1,
105
  "status_counts": {
106
+ "not_evaluated_in_verified_package": 1,
107
+ "scored": 19
108
  }
109
  },
110
  "raw128_neural_mlp": {
 
139
  "cosmos3_super_reasoner": 10,
140
  "metadata128_neural_mlp": 2,
141
  "metadata128_simple": 2,
142
+ "qwen3_omni_v6_lora": 1
143
  },
144
  "missing_by_status": {
145
+ "not_evaluated_in_verified_package": 20,
146
  "not_supported_by_metadata_only_package": 2,
147
  "unsupported_without_required_target": 2
148
  },
 
152
  ],
153
  "05 Hand Trajectory Forecasting": [
154
  "cosmos3_nano_future_window",
155
+ "cosmos3_super_reasoner"
 
156
  ],
157
  "07 Object Relevance Prediction": [
158
  "cosmos3_nano_future_window"
 
164
  "cosmos3_super_reasoner"
165
  ],
166
  "10 Cross-Modal Reconstruction": [
167
+ "cosmos3_super_reasoner"
 
168
  ],
169
  "11 Temporal Order Verification": [
170
  "cosmos3_nano_future_window",
 
189
  ],
190
  "18 IMU-to-Hand Pose Reconstruction": [
191
  "cosmos3_nano_future_window",
192
+ "cosmos3_super_reasoner"
 
193
  ],
194
  "19 Camera-View Synchronization Retrieval": [
195
  "cosmos3_nano_future_window",
 
212
  "task_label": "Procedure Step Recognition",
213
  "task_number": 2
214
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
215
  {
216
  "method": "Cosmos3-Super Reasoner",
217
  "metric_key": "mpjpe",
 
277
  "task_label": "Cross-Modal Retrieval",
278
  "task_number": 9
279
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
280
  {
281
  "method": "Cosmos3-Super Reasoner",
282
  "metric_key": "r2",
 
433
  "task_label": "Future Object-Set Forecasting",
434
  "task_number": 17
435
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
436
  {
437
  "method": "Cosmos3-Super Reasoner",
438
  "metric_key": "mae",
 
558
  "method_count": 9,
559
  "method_task_record_count": 180,
560
  "proxy_scored_method_task_count": 4,
561
+ "scored_method_task_count": 156,
562
+ "scoreless_method_task_count": 24,
563
  "task_count": 20
564
  },
565
  "source_matrix": "docs/data/task_method_20_result_matrix.json",
metrics/website_integrity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-19T11:30:46+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
@@ -351,7 +351,7 @@
351
  },
352
  {
353
  "path": "data/mirror_parity.json",
354
- "bytes": 1115613,
355
  "top_level_type": "dict"
356
  },
357
  {
@@ -486,7 +486,7 @@
486
  },
487
  {
488
  "path": "data/task_method_20_gap_audit.json",
489
- "bytes": 28259,
490
  "top_level_type": "dict"
491
  },
492
  {
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-19T12:02:12+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
 
351
  },
352
  {
353
  "path": "data/mirror_parity.json",
354
+ "bytes": 1161913,
355
  "top_level_type": "dict"
356
  },
357
  {
 
486
  },
487
  {
488
  "path": "data/task_method_20_gap_audit.json",
489
+ "bytes": 26093,
490
  "top_level_type": "dict"
491
  },
492
  {