cy0307 commited on
Commit
c90629d
·
verified ·
1 Parent(s): 5205c69

Add files using upload-large-folder tool

Browse files
PROJECT_STATUS.md CHANGED
@@ -37,7 +37,7 @@ The current no-new-episode enhancement layer records how to push the selected
37
  | Public package policy | Verified | `DATA_NOTICE.md`, `REPRODUCIBILITY.md` | Raw Xperience-10M data, private gated files, large archives, credentials, and full Qwen weights are not redistributed. |
38
  | Reproducibility | Verified for the public sample | `REPRODUCIBILITY.md`, `docs/data/reproducibility_matrix.json`, `notes/reproducibility_audit.md` | The public sample workflow has explicit commands, expected outputs, and exact-match reproduction evidence. |
39
  | 128-episode aligned baselines | Verified companion result | `results/omni_finetune/multi_episode_128_task_baselines/BASELINE_ALIGNMENT_REPORT.md`, `results/omni_finetune/multi_episode_128_task_baselines/summary_report.json`, `scripts/omni/run_128_task_baselines.py` | The earlier simple and neural baseline framing is aligned to the same selected 96/16/16 episode split used by the Qwen3-Omni pilot. JSON-supported tasks have metadata/text simple and neural MLP metrics; raw-feature-only tasks are explicitly marked unsupported until 128-run sensor feature blocks are available. |
40
- | Qwen3-Omni fine-tuning | Final verified diagnostic held-out result; JSON target met | `docs/data/omni_finetune_verified_result.json`, `results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full/`, `scripts/omni/package_verified_omni_result.py`, `scripts/omni/audit_verified_omni_package.py`, `scripts/omni/analyze_qwen3_omni_errors.py` | The selected 96/16/16 episode split produced a current public-safe v4 four-epoch held-out package with 3,808 exported windows, 512 validation windows, 448 test predictions, validation/audit summaries, and the public LoRA adapter. JSON validity is 100.00%, meeting the 98% target; transition accuracy is 97.32%, contact accuracy is 72.99%, object micro-F1 is 31.10%, next-action accuracy is 3.35%, and action/subtask metrics remain weak, so it is still a diagnostic baseline rather than a strong model-quality claim. |
41
  | Raw Xperience-10M redistribution | Not included | `DATA_NOTICE.md`, `docs/data/publication_audit.json` | Raw MP4, HDF5, RRD files, private gated data, and full Qwen weights are intentionally excluded. |
42
 
43
  ## Fast Research Route
@@ -79,10 +79,11 @@ The current no-new-episode enhancement layer records how to push the selected
79
  current results use one public sample episode.
80
  - Public-facing fine-tuning results should come from the verified result
81
  package, not from live process logs or setup-only artifacts.
82
- - The final Qwen3-Omni v4 held-out package verifies the pipeline and meets the
83
- strict-JSON target, but not strong action/subtask model quality: JSON
84
- validity is 100.00%, action macro-F1 is 0.0019, and subtask accuracy is
85
- 0.0000.
 
86
  - The current 128-episode task suite can be pushed further without more raw
87
  episodes by using dense/multiscale windows, hierarchical action/subtask
88
  targets, stronger label-normalized scoring, and compact raw-feature shards
 
37
  | Public package policy | Verified | `DATA_NOTICE.md`, `REPRODUCIBILITY.md` | Raw Xperience-10M data, private gated files, large archives, credentials, and full Qwen weights are not redistributed. |
38
  | Reproducibility | Verified for the public sample | `REPRODUCIBILITY.md`, `docs/data/reproducibility_matrix.json`, `notes/reproducibility_audit.md` | The public sample workflow has explicit commands, expected outputs, and exact-match reproduction evidence. |
39
  | 128-episode aligned baselines | Verified companion result | `results/omni_finetune/multi_episode_128_task_baselines/BASELINE_ALIGNMENT_REPORT.md`, `results/omni_finetune/multi_episode_128_task_baselines/summary_report.json`, `scripts/omni/run_128_task_baselines.py` | The earlier simple and neural baseline framing is aligned to the same selected 96/16/16 episode split used by the Qwen3-Omni pilot. JSON-supported tasks have metadata/text simple and neural MLP metrics; raw-feature-only tasks are explicitly marked unsupported until 128-run sensor feature blocks are available. |
40
+ | Qwen3-Omni fine-tuning | Latest v6 diagnostic branch verified; JSON target met | `docs/data/omni_finetune_verified_result.json`, `docs/data/qwen3_v5_v6_comparison.json`, `results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md`, `results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full/`, `scripts/omni/package_verified_omni_result.py`, `scripts/omni/audit_verified_omni_package.py`, `scripts/omni/analyze_qwen3_omni_errors.py` | The selected 96/16/16 episode split now has a current public-safe v6 rank64/lr5e-5 held-out package with 34,269 exported windows and 4,032 test predictions. JSON validity is 99.90%, meeting the 98% target; transition accuracy is 98.98%, contact accuracy is 81.77%, object micro-F1 is 30.65%, next-action accuracy is 4.31%, and action/subtask metrics remain weak. v6 improves action macro-F1 and contact accuracy versus v5, but v5 remains stronger on JSON validity, subtask, next-action, transition, and object metrics. |
41
  | Raw Xperience-10M redistribution | Not included | `DATA_NOTICE.md`, `docs/data/publication_audit.json` | Raw MP4, HDF5, RRD files, private gated data, and full Qwen weights are intentionally excluded. |
42
 
43
  ## Fast Research Route
 
79
  current results use one public sample episode.
80
  - Public-facing fine-tuning results should come from the verified result
81
  package, not from live process logs or setup-only artifacts.
82
+ - The latest Qwen3-Omni v6 held-out package verifies the current dense
83
+ multiscale branch and meets the strict-JSON target, but not strong
84
+ action/subtask model quality: JSON validity is 99.90%, action macro-F1 is
85
+ 0.0029, and subtask accuracy is 0.0037. v5 remains the pinned prior release
86
+ row because it is still stronger on several metrics.
87
  - The current 128-episode task suite can be pushed further without more raw
88
  episodes by using dense/multiscale windows, hierarchical action/subtask
89
  targets, stronger label-normalized scoring, and compact raw-feature shards
RESEARCH_ROADMAP.md CHANGED
@@ -11,7 +11,7 @@ should exist before the stage is treated as complete.
11
  | --- | --- | --- | --- | --- |
12
  | Public-Sample Task Lab | Implemented | One public Xperience-10M sample episode is available. | 1,161 aligned windows, 12 task contracts, minimal heads, neural MLP heads, modality atlas, task walkthroughs, and derived figures. | `PROJECT_STATUS.md`, `EVALUATION_PROTOCOL.md`, `RESEARCH_TAKEAWAYS.md`, `docs/data/summary_metrics.json`, `results/episode_task_suite/summary_report.json` |
13
  | Multi-Episode Data Preparation | Implemented for first selected pilot | Gated dataset availability and enough storage for selected episodes. | 128 selected episodes, episode manifest, missing-view manifest, held-out episode split, and source-discovery report. | `results/omni_finetune/DATA_ACCESS_STATUS.md`, `results/omni_finetune/MULTI_EPISODE_ACCESS_STATUS.md`, `results/omni_finetune/xperience10m_128_episode_selection.json` |
14
- | Qwen3-Omni LoRA Final Diagnostic Result | Verified baseline | Selected episodes prepared locally with no train/test episode leakage. | Dataset JSONL/media manifests, LoRA adapter checkpoint, progress logs, validation monitoring, held-out predictions, metrics, confusion matrices, run report, and public LoRA adapter repo. | `docs/data/omni_finetune_verified_result.json`, `results/omni_finetune/verified_public/`, `metrics.json`, `predictions.jsonl`, `RUN_REPORT.md`, `https://huggingface.co/cy0307/ropedia-qwen3-omni-lora-128ep` |
15
  | 128-Episode Same-Split Simple/NN Baselines | Verified companion result | Derived Qwen JSONL export for the selected 96/16/16 split. | Same 12 task ids, simple metadata/text baselines, neural MLP baselines where JSON labels support them, and explicit unsupported markers for tasks that still require raw 128 feature blocks. | `results/omni_finetune/multi_episode_128_task_baselines/BASELINE_ALIGNMENT_REPORT.md`, `summary_report.json`, `scripts/omni/run_128_task_baselines.py` |
16
  | 128-Episode Task Suite Enhancement Pack | Current no-new-episode plan | Same selected 96/16/16 split and current public 3,808-window export. | Dense-window and multiscale export estimates, hierarchical action/subtask target contract, raw-feature shard priorities for unsupported tasks, Qwen v5 and Cosmos continuation run cards, and publication-ready artifacts. | `TASK_SUITE_ENHANCEMENT_128.md`, `docs/data/task_suite_enhancement_128.json`, `results/omni_finetune/task_suite_enhancement_128_v1_20260608/enhancement_plan.json`, `scripts/omni/build_task_suite_enhancement_128.py` |
17
  | Action/Subtask Error-Analysis Pass | Active next step | The final diagnostic package meets strict JSON validity but has weak action/subtask held-out quality. | Same 96/16/16 split, action/subtask confusion analysis, unseen-label analysis, object/action family breakdowns, and comparison to the final verified Qwen baseline. | Updated error-analysis tables, held-out metrics by failure type, and verified public package. |
@@ -24,8 +24,8 @@ should exist before the stage is treated as complete.
24
 
25
  The useful next decision is model-quality improvement plus backbone fit without
26
  requiring more raw episodes first: keep the public-sample task suite as the
27
- development harness, use the verified Qwen3-Omni final diagnostic result as the
28
- first cross-episode baseline, then improve action/subtask quality before
29
  claiming model quality. The earlier simple and neural baseline framing is now
30
  aligned to the same 96/16/16 split through metadata/text baselines for
31
  JSON-supported task ids; raw-feature-only tasks remain marked as needing the
 
11
  | --- | --- | --- | --- | --- |
12
  | Public-Sample Task Lab | Implemented | One public Xperience-10M sample episode is available. | 1,161 aligned windows, 12 task contracts, minimal heads, neural MLP heads, modality atlas, task walkthroughs, and derived figures. | `PROJECT_STATUS.md`, `EVALUATION_PROTOCOL.md`, `RESEARCH_TAKEAWAYS.md`, `docs/data/summary_metrics.json`, `results/episode_task_suite/summary_report.json` |
13
  | Multi-Episode Data Preparation | Implemented for first selected pilot | Gated dataset availability and enough storage for selected episodes. | 128 selected episodes, episode manifest, missing-view manifest, held-out episode split, and source-discovery report. | `results/omni_finetune/DATA_ACCESS_STATUS.md`, `results/omni_finetune/MULTI_EPISODE_ACCESS_STATUS.md`, `results/omni_finetune/xperience10m_128_episode_selection.json` |
14
+ | Qwen3-Omni LoRA Latest Diagnostic Branch | Verified latest branch | Selected episodes prepared locally with no train/test episode leakage. | Dataset JSONL/media manifests, LoRA adapter checkpoint, progress logs, validation monitoring, held-out predictions, metrics, confusion matrices, v5/v6 comparison, run report, and public LoRA adapter repo. | `docs/data/omni_finetune_verified_result.json`, `docs/data/qwen3_v5_v6_comparison.json`, `results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md`, `results/omni_finetune/verified_public/`, `metrics.json`, `predictions.jsonl`, `RUN_REPORT.md`, `https://huggingface.co/cy0307/ropedia-qwen3-omni-lora-128ep` |
15
  | 128-Episode Same-Split Simple/NN Baselines | Verified companion result | Derived Qwen JSONL export for the selected 96/16/16 split. | Same 12 task ids, simple metadata/text baselines, neural MLP baselines where JSON labels support them, and explicit unsupported markers for tasks that still require raw 128 feature blocks. | `results/omni_finetune/multi_episode_128_task_baselines/BASELINE_ALIGNMENT_REPORT.md`, `summary_report.json`, `scripts/omni/run_128_task_baselines.py` |
16
  | 128-Episode Task Suite Enhancement Pack | Current no-new-episode plan | Same selected 96/16/16 split and current public 3,808-window export. | Dense-window and multiscale export estimates, hierarchical action/subtask target contract, raw-feature shard priorities for unsupported tasks, Qwen v5 and Cosmos continuation run cards, and publication-ready artifacts. | `TASK_SUITE_ENHANCEMENT_128.md`, `docs/data/task_suite_enhancement_128.json`, `results/omni_finetune/task_suite_enhancement_128_v1_20260608/enhancement_plan.json`, `scripts/omni/build_task_suite_enhancement_128.py` |
17
  | Action/Subtask Error-Analysis Pass | Active next step | The final diagnostic package meets strict JSON validity but has weak action/subtask held-out quality. | Same 96/16/16 split, action/subtask confusion analysis, unseen-label analysis, object/action family breakdowns, and comparison to the final verified Qwen baseline. | Updated error-analysis tables, held-out metrics by failure type, and verified public package. |
 
24
 
25
  The useful next decision is model-quality improvement plus backbone fit without
26
  requiring more raw episodes first: keep the public-sample task suite as the
27
+ development harness, use the verified Qwen3-Omni v6 diagnostic branch plus the
28
+ pinned v5 row as the current cross-episode references, then improve action/subtask quality before
29
  claiming model quality. The earlier simple and neural baseline framing is now
30
  aligned to the same 96/16/16 split through metadata/text baselines for
31
  JSON-supported task ids; raw-feature-only tasks remain marked as needing the
metrics/artifact_index.json CHANGED
@@ -1,13 +1,13 @@
1
  {
2
  "title": "Ropedia Xperience-10M Task Suite Artifact Index",
3
- "generated_at_utc": "2026-06-12T18:14:49+00:00",
4
  "status": "pass",
5
- "artifact_count": 159,
6
  "missing": [],
7
  "by_kind": {
8
  "project_path": 14,
9
  "scaleup_contract": 7,
10
- "scaleup_status": 36,
11
  "publication_workflow": 5,
12
  "project_scope": 1,
13
  "source_alignment": 5,
@@ -32,7 +32,7 @@
32
  "citation": 1,
33
  "license": 1,
34
  "verified_public_package": 10,
35
- "publication_audit": 6
36
  },
37
  "artifacts": [
38
  {
@@ -65,8 +65,8 @@
65
  "surface": "repo_hf",
66
  "shows": "Gives a compact current-state table for first-pass readers.",
67
  "exists": true,
68
- "bytes": 13535,
69
- "sha256": "595f90e26d75baba46a0827eb2ec38d0c6badb137437f642dff4e88e22b7ca80"
70
  },
71
  {
72
  "id": "project_status_json",
@@ -76,8 +76,8 @@
76
  "surface": "website_hf",
77
  "shows": "Machine-readable copy of the current project status for website and HF mirrors.",
78
  "exists": true,
79
- "bytes": 23198,
80
- "sha256": "07678354fde90a7e15134f5deff036eb200f7bf422ab2350ff64825b902843d5"
81
  },
82
  {
83
  "id": "research_roadmap",
@@ -87,8 +87,8 @@
87
  "surface": "repo_hf",
88
  "shows": "Defines the path from public-sample task development to multi-episode held-out evaluation and larger omni-model extensions.",
89
  "exists": true,
90
- "bytes": 14354,
91
- "sha256": "7d385916e6d07cfa5f95e80b501fe58237f671711691cc8c792479d233991ef5"
92
  },
93
  {
94
  "id": "research_roadmap_json",
@@ -98,8 +98,8 @@
98
  "surface": "website_hf",
99
  "shows": "Machine-readable research roadmap for the website and Hugging Face mirrors.",
100
  "exists": true,
101
- "bytes": 13313,
102
- "sha256": "719a8b5f45810d6b09788391d8549eb5226e480454c9f2a4ed7380ceacfd834b"
103
  },
104
  {
105
  "id": "foundation_model_plan",
@@ -264,7 +264,7 @@
264
  "shows": "Summarizes the 2026-06-09 full-parameter FSDP feasibility gates: 1/8/32/64-step guarded runs passed, the 128-step opportunistic pilot was preempted for Qwen v5 handoff, and no full checkpoints or weights are published.",
265
  "exists": true,
266
  "bytes": 3253,
267
- "sha256": "5c0e49adad861e00bede7f9245808e6e68346e7bc51c0b0142d2b99f944f2408"
268
  },
269
  {
270
  "id": "qwen3_full_parameter_gates_json",
@@ -275,7 +275,29 @@
275
  "shows": "Machine-readable summary of full-parameter feasibility evidence and publication policy for website and Hugging Face mirrors.",
276
  "exists": true,
277
  "bytes": 12183,
278
- "sha256": "50b8b93f0bae34c7fa269ae52d7c2073d6122fdafc9297f380c052d94253bf3f"
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
279
  },
280
  {
281
  "id": "qwen3_full_parameter_gates_builder",
@@ -362,8 +384,8 @@
362
  "surface": "website_hf",
363
  "shows": "Gives a short project path with scope status and public surfaces.",
364
  "exists": true,
365
- "bytes": 10496,
366
- "sha256": "fc21325035c3b8d4892bfad50dd1a7f5f562a662b7da7be73f7349cb515544ef"
367
  },
368
  {
369
  "id": "artifact_guide",
@@ -418,7 +440,7 @@
418
  "shows": "Machine-readable source-alignment pass/fail check for repo, website, and HF surfaces.",
419
  "exists": true,
420
  "bytes": 4432,
421
- "sha256": "d5ada9ec76cace484a779672d636222f1dfa281135508ff1b5d5eac98908bf60"
422
  },
423
  {
424
  "id": "source_alignment_validator",
@@ -671,7 +693,7 @@
671
  "shows": "Machine-readable release-check summary for validators, mirrors, and public project surfaces.",
672
  "exists": true,
673
  "bytes": 8097,
674
- "sha256": "9b4783b42d0a6f58f2bc36bb7990c86dc85af9d346cb59c90dca402e2c4cf0de"
675
  },
676
  {
677
  "id": "public_surface_qa",
@@ -785,8 +807,8 @@
785
  "surface": "repo",
786
  "shows": "Fetches the published GitHub/HF URLs and compares live hashes and public-card markers against the release assets.",
787
  "exists": true,
788
- "bytes": 42224,
789
- "sha256": "069b05229b72b9fa0c0917afd18655f58e5258499b43889dd96a87f29f577890"
790
  },
791
  {
792
  "id": "reproducibility_contract",
@@ -818,8 +840,8 @@
818
  "surface": "repo_hf",
819
  "shows": "Generates the selective artifact catalog from local files.",
820
  "exists": true,
821
- "bytes": 42045,
822
- "sha256": "0954119cda59bb9c77cba362a206ac237eae73c4a1dc44cfa56599f170fc94de"
823
  },
824
  {
825
  "id": "publication_audit",
@@ -842,7 +864,7 @@
842
  "volatile": true,
843
  "shows": "Separates setup paths from completed held-out-episode results.",
844
  "exists": true,
845
- "bytes": 21325,
846
  "hash_policy": "existence_and_size_only"
847
  },
848
  {
@@ -866,7 +888,7 @@
866
  "volatile": true,
867
  "shows": "Confirms local website links, anchors, JSON data files, and referenced images resolve.",
868
  "exists": true,
869
- "bytes": 15654,
870
  "hash_policy": "existence_and_size_only"
871
  },
872
  {
@@ -899,8 +921,8 @@
899
  "surface": "website_hf",
900
  "shows": "Mirrors task metrics for the static dashboard.",
901
  "exists": true,
902
- "bytes": 27604,
903
- "sha256": "ebaf9d598b4cd91118f149cafa01fe4d17629499565b7be0c7ce4a0ffcd70f6b"
904
  },
905
  {
906
  "id": "feature_manifest",
@@ -1119,8 +1141,8 @@
1119
  "surface": "repo_hf",
1120
  "shows": "Reader-facing comparison of the single-episode task suite, 128-episode aligned baselines, Qwen3-Omni packages, and Cosmos3 future-window branch.",
1121
  "exists": true,
1122
- "bytes": 15731,
1123
- "sha256": "1ee5621a825219c1125a9c3f3ac779ac8bfa2fc45668531dbee3a728be13bfb5"
1124
  },
1125
  {
1126
  "id": "omni_model_comparison_json",
@@ -1130,8 +1152,8 @@
1130
  "surface": "repo_hf",
1131
  "shows": "Machine-readable comparison of the current result versions, per-task aligned baselines, verified Qwen3 packages, and Cosmos3 package.",
1132
  "exists": true,
1133
- "bytes": 81593,
1134
- "sha256": "c570f0810dec46f9a14969089245051c2b3a255f074bd9d06f529615fa6fbd73"
1135
  },
1136
  {
1137
  "id": "cosmos3_nano_verified_summary",
@@ -1548,8 +1570,8 @@
1548
  "surface": "repo_hf",
1549
  "shows": "Public-safe verified package for xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full (qwen3_omni_lora, status=verified).",
1550
  "exists": true,
1551
- "file_count": 13,
1552
- "bytes": 12189099
1553
  },
1554
  {
1555
  "id": "verified_public_summary_xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
@@ -1595,6 +1617,17 @@
1595
  "bytes": 49205,
1596
  "sha256": "fc198c3e443877bca42cc33bec6e2a194d6cb20e97c28e931a90736c45538bba"
1597
  },
 
 
 
 
 
 
 
 
 
 
 
1598
  {
1599
  "id": "verified_public_package_xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
1600
  "title": "Verified public package: Qwen3-Omni LoRA",
 
1
  {
2
  "title": "Ropedia Xperience-10M Task Suite Artifact Index",
3
+ "generated_at_utc": "2026-06-13T17:46:38+00:00",
4
  "status": "pass",
5
+ "artifact_count": 162,
6
  "missing": [],
7
  "by_kind": {
8
  "project_path": 14,
9
  "scaleup_contract": 7,
10
+ "scaleup_status": 38,
11
  "publication_workflow": 5,
12
  "project_scope": 1,
13
  "source_alignment": 5,
 
32
  "citation": 1,
33
  "license": 1,
34
  "verified_public_package": 10,
35
+ "publication_audit": 7
36
  },
37
  "artifacts": [
38
  {
 
65
  "surface": "repo_hf",
66
  "shows": "Gives a compact current-state table for first-pass readers.",
67
  "exists": true,
68
+ "bytes": 13755,
69
+ "sha256": "342897ae05ceab83d626765c0052c140e414ba25ebda4fce9fb07bb37a2decef"
70
  },
71
  {
72
  "id": "project_status_json",
 
76
  "surface": "website_hf",
77
  "shows": "Machine-readable copy of the current project status for website and HF mirrors.",
78
  "exists": true,
79
+ "bytes": 23535,
80
+ "sha256": "9ffae32ff0b3750f89179d2ce92205f95a5b53069d0aa344d6342c23b1efebbd"
81
  },
82
  {
83
  "id": "research_roadmap",
 
87
  "surface": "repo_hf",
88
  "shows": "Defines the path from public-sample task development to multi-episode held-out evaluation and larger omni-model extensions.",
89
  "exists": true,
90
+ "bytes": 14503,
91
+ "sha256": "bb06fb0ccf336cafb0305883b7f93c2e1af547c9ff04b1fa6fc87481d54bcf61"
92
  },
93
  {
94
  "id": "research_roadmap_json",
 
98
  "surface": "website_hf",
99
  "shows": "Machine-readable research roadmap for the website and Hugging Face mirrors.",
100
  "exists": true,
101
+ "bytes": 13450,
102
+ "sha256": "668220626950f8e55410b8f829dba6306d0b2feedafdf0198800ad9814992d84"
103
  },
104
  {
105
  "id": "foundation_model_plan",
 
264
  "shows": "Summarizes the 2026-06-09 full-parameter FSDP feasibility gates: 1/8/32/64-step guarded runs passed, the 128-step opportunistic pilot was preempted for Qwen v5 handoff, and no full checkpoints or weights are published.",
265
  "exists": true,
266
  "bytes": 3253,
267
+ "sha256": "b25f1d8cde814207b4c3234bf07140cf99a0ede29af3f53dbc146aab464e8a9b"
268
  },
269
  {
270
  "id": "qwen3_full_parameter_gates_json",
 
275
  "shows": "Machine-readable summary of full-parameter feasibility evidence and publication policy for website and Hugging Face mirrors.",
276
  "exists": true,
277
  "bytes": 12183,
278
+ "sha256": "d051608d3428645778f721e538af93566ab772871a825ac12825dd5f18e94a95"
279
+ },
280
+ {
281
+ "id": "qwen3_v5_v6_comparison",
282
+ "title": "Qwen3-Omni v5/v6 comparison",
283
+ "path": "results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md",
284
+ "kind": "scaleup_status",
285
+ "surface": "repo_hf",
286
+ "shows": "Reader-facing comparison of the verified Qwen3 v5 release row and the latest verified v6 row, including metric deltas and release-tag policy.",
287
+ "exists": true,
288
+ "bytes": 1793,
289
+ "sha256": "890430b05ace20375fc9534f923d170c0509037272ba4ef523e3ca2f3c9ac746"
290
+ },
291
+ {
292
+ "id": "qwen3_v5_v6_comparison_json",
293
+ "title": "Qwen3-Omni v5/v6 comparison JSON",
294
+ "path": "docs/data/qwen3_v5_v6_comparison.json",
295
+ "kind": "scaleup_status",
296
+ "surface": "website_hf",
297
+ "shows": "Machine-readable v5/v6 metric deltas and publication recommendation for website and Hugging Face mirrors.",
298
+ "exists": true,
299
+ "bytes": 2814,
300
+ "sha256": "f5d16e279a82cdc6266a1318584bf38cbc0b105296d437f9b8bf0398403aace5"
301
  },
302
  {
303
  "id": "qwen3_full_parameter_gates_builder",
 
384
  "surface": "website_hf",
385
  "shows": "Gives a short project path with scope status and public surfaces.",
386
  "exists": true,
387
+ "bytes": 10597,
388
+ "sha256": "a64b7c033c54879e0183e7ec794d3197fb483024c25947759287fcd4b7e0fec1"
389
  },
390
  {
391
  "id": "artifact_guide",
 
440
  "shows": "Machine-readable source-alignment pass/fail check for repo, website, and HF surfaces.",
441
  "exists": true,
442
  "bytes": 4432,
443
+ "sha256": "0d0d381f726c1e3787fb3fb15b6fb8879512c26fa0dc06fb943e1a239b0063dd"
444
  },
445
  {
446
  "id": "source_alignment_validator",
 
693
  "shows": "Machine-readable release-check summary for validators, mirrors, and public project surfaces.",
694
  "exists": true,
695
  "bytes": 8097,
696
+ "sha256": "1cdc8b4767b3ca88eada654a3117aa2de253fea7af62573b080088e8f1b311bd"
697
  },
698
  {
699
  "id": "public_surface_qa",
 
807
  "surface": "repo",
808
  "shows": "Fetches the published GitHub/HF URLs and compares live hashes and public-card markers against the release assets.",
809
  "exists": true,
810
+ "bytes": 42394,
811
+ "sha256": "43a70436108eea3fa6692096ebf318fb755ff040d925094ef9de018f212fde18"
812
  },
813
  {
814
  "id": "reproducibility_contract",
 
840
  "surface": "repo_hf",
841
  "shows": "Generates the selective artifact catalog from local files.",
842
  "exists": true,
843
+ "bytes": 42809,
844
+ "sha256": "c03d1b1367ad191fea0be3c634fddf8ee6fdc2118bf17396920c16cc288c4ef0"
845
  },
846
  {
847
  "id": "publication_audit",
 
864
  "volatile": true,
865
  "shows": "Separates setup paths from completed held-out-episode results.",
866
  "exists": true,
867
+ "bytes": 21795,
868
  "hash_policy": "existence_and_size_only"
869
  },
870
  {
 
888
  "volatile": true,
889
  "shows": "Confirms local website links, anchors, JSON data files, and referenced images resolve.",
890
  "exists": true,
891
+ "bytes": 15774,
892
  "hash_policy": "existence_and_size_only"
893
  },
894
  {
 
921
  "surface": "website_hf",
922
  "shows": "Mirrors task metrics for the static dashboard.",
923
  "exists": true,
924
+ "bytes": 27807,
925
+ "sha256": "3a6a5ee59562ae189844cb4ba26d6e261c2f73a8e54bb6e2fbc3e307c2d123fa"
926
  },
927
  {
928
  "id": "feature_manifest",
 
1141
  "surface": "repo_hf",
1142
  "shows": "Reader-facing comparison of the single-episode task suite, 128-episode aligned baselines, Qwen3-Omni packages, and Cosmos3 future-window branch.",
1143
  "exists": true,
1144
+ "bytes": 15999,
1145
+ "sha256": "d5a7118a878b202adbc50e3436bbe134e5de139f2a9e97176efe9ecc0f446088"
1146
  },
1147
  {
1148
  "id": "omni_model_comparison_json",
 
1152
  "surface": "repo_hf",
1153
  "shows": "Machine-readable comparison of the current result versions, per-task aligned baselines, verified Qwen3 packages, and Cosmos3 package.",
1154
  "exists": true,
1155
+ "bytes": 81866,
1156
+ "sha256": "191125098a66ecccfa27395c0f9776616f74b4bf8fb19f16b75cda7ed06cb4b2"
1157
  },
1158
  {
1159
  "id": "cosmos3_nano_verified_summary",
 
1570
  "surface": "repo_hf",
1571
  "shows": "Public-safe verified package for xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full (qwen3_omni_lora, status=verified).",
1572
  "exists": true,
1573
+ "file_count": 14,
1574
+ "bytes": 12189730
1575
  },
1576
  {
1577
  "id": "verified_public_summary_xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
 
1617
  "bytes": 49205,
1618
  "sha256": "fc198c3e443877bca42cc33bec6e2a194d6cb20e97c28e931a90736c45538bba"
1619
  },
1620
+ {
1621
+ "id": "verified_public_package_audit_xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
1622
+ "title": "Verified package audit: Qwen3-Omni LoRA",
1623
+ "path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full/package_audit.json",
1624
+ "kind": "publication_audit",
1625
+ "surface": "repo_hf",
1626
+ "shows": "Package audit for xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full.",
1627
+ "exists": true,
1628
+ "bytes": 631,
1629
+ "sha256": "7cf478ae33c52bae0ba742e81da8e482e06d0853eecd85f895f447a708f81718"
1630
+ },
1631
  {
1632
  "id": "verified_public_package_xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
1633
  "title": "Verified public package: Qwen3-Omni LoRA",
metrics/mirror_parity.json CHANGED
@@ -1,9 +1,9 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-12T18:14:59+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
- "group_count": 363,
7
  "failure_count": 0,
8
  "failures_by_surface": {}
9
  },
@@ -138,45 +138,45 @@
138
  "local": {
139
  "path": "repo:docs/data/artifact_index.json",
140
  "exists": true,
141
- "bytes": 87230,
142
- "sha256": "7426cbf2be990a353dd2f6a289b887aa37b12cb6ee4716de37a16709deb26715"
143
  },
144
  "mirrors": {
145
  "hf_space": {
146
  "path": "hf_space:data/artifact_index.json",
147
  "exists": true,
148
- "bytes": 87230,
149
- "sha256": "7426cbf2be990a353dd2f6a289b887aa37b12cb6ee4716de37a16709deb26715"
150
  },
151
  "hf_artifacts_data": {
152
  "path": "hf_artifacts:data/artifact_index.json",
153
  "exists": true,
154
- "bytes": 87230,
155
- "sha256": "7426cbf2be990a353dd2f6a289b887aa37b12cb6ee4716de37a16709deb26715"
156
  },
157
  "hf_artifacts": {
158
  "path": "hf_artifacts:docs/data/artifact_index.json",
159
  "exists": true,
160
- "bytes": 87230,
161
- "sha256": "7426cbf2be990a353dd2f6a289b887aa37b12cb6ee4716de37a16709deb26715"
162
  },
163
  "hf_model_data": {
164
  "path": "hf_model:data/artifact_index.json",
165
  "exists": true,
166
- "bytes": 87230,
167
- "sha256": "7426cbf2be990a353dd2f6a289b887aa37b12cb6ee4716de37a16709deb26715"
168
  },
169
  "hf_model_docs_data": {
170
  "path": "hf_model:docs/data/artifact_index.json",
171
  "exists": true,
172
- "bytes": 87230,
173
- "sha256": "7426cbf2be990a353dd2f6a289b887aa37b12cb6ee4716de37a16709deb26715"
174
  },
175
  "hf_model": {
176
  "path": "hf_model:metrics/artifact_index.json",
177
  "exists": true,
178
- "bytes": 87230,
179
- "sha256": "7426cbf2be990a353dd2f6a289b887aa37b12cb6ee4716de37a16709deb26715"
180
  }
181
  },
182
  "failures": []
@@ -530,45 +530,45 @@
530
  "local": {
531
  "path": "repo:docs/data/omni_finetune_verified_result.json",
532
  "exists": true,
533
- "bytes": 4431,
534
- "sha256": "cc51c2d7be5e924e7c8fd4847e548d29097e403b24aafbb387a1ca59a505b099"
535
  },
536
  "mirrors": {
537
  "hf_space": {
538
  "path": "hf_space:data/omni_finetune_verified_result.json",
539
  "exists": true,
540
- "bytes": 4431,
541
- "sha256": "cc51c2d7be5e924e7c8fd4847e548d29097e403b24aafbb387a1ca59a505b099"
542
  },
543
  "hf_artifacts_data": {
544
  "path": "hf_artifacts:data/omni_finetune_verified_result.json",
545
  "exists": true,
546
- "bytes": 4431,
547
- "sha256": "cc51c2d7be5e924e7c8fd4847e548d29097e403b24aafbb387a1ca59a505b099"
548
  },
549
  "hf_artifacts": {
550
  "path": "hf_artifacts:docs/data/omni_finetune_verified_result.json",
551
  "exists": true,
552
- "bytes": 4431,
553
- "sha256": "cc51c2d7be5e924e7c8fd4847e548d29097e403b24aafbb387a1ca59a505b099"
554
  },
555
  "hf_model_data": {
556
  "path": "hf_model:data/omni_finetune_verified_result.json",
557
  "exists": true,
558
- "bytes": 4431,
559
- "sha256": "cc51c2d7be5e924e7c8fd4847e548d29097e403b24aafbb387a1ca59a505b099"
560
  },
561
  "hf_model_docs_data": {
562
  "path": "hf_model:docs/data/omni_finetune_verified_result.json",
563
  "exists": true,
564
- "bytes": 4431,
565
- "sha256": "cc51c2d7be5e924e7c8fd4847e548d29097e403b24aafbb387a1ca59a505b099"
566
  },
567
  "hf_model": {
568
  "path": "hf_model:metrics/omni_finetune_verified_result.json",
569
  "exists": true,
570
- "bytes": 4431,
571
- "sha256": "cc51c2d7be5e924e7c8fd4847e548d29097e403b24aafbb387a1ca59a505b099"
572
  }
573
  },
574
  "failures": []
@@ -579,45 +579,45 @@
579
  "local": {
580
  "path": "repo:docs/data/omni_model_comparison.json",
581
  "exists": true,
582
- "bytes": 81593,
583
- "sha256": "c570f0810dec46f9a14969089245051c2b3a255f074bd9d06f529615fa6fbd73"
584
  },
585
  "mirrors": {
586
  "hf_space": {
587
  "path": "hf_space:data/omni_model_comparison.json",
588
  "exists": true,
589
- "bytes": 81593,
590
- "sha256": "c570f0810dec46f9a14969089245051c2b3a255f074bd9d06f529615fa6fbd73"
591
  },
592
  "hf_artifacts_data": {
593
  "path": "hf_artifacts:data/omni_model_comparison.json",
594
  "exists": true,
595
- "bytes": 81593,
596
- "sha256": "c570f0810dec46f9a14969089245051c2b3a255f074bd9d06f529615fa6fbd73"
597
  },
598
  "hf_artifacts": {
599
  "path": "hf_artifacts:docs/data/omni_model_comparison.json",
600
  "exists": true,
601
- "bytes": 81593,
602
- "sha256": "c570f0810dec46f9a14969089245051c2b3a255f074bd9d06f529615fa6fbd73"
603
  },
604
  "hf_model_data": {
605
  "path": "hf_model:data/omni_model_comparison.json",
606
  "exists": true,
607
- "bytes": 81593,
608
- "sha256": "c570f0810dec46f9a14969089245051c2b3a255f074bd9d06f529615fa6fbd73"
609
  },
610
  "hf_model_docs_data": {
611
  "path": "hf_model:docs/data/omni_model_comparison.json",
612
  "exists": true,
613
- "bytes": 81593,
614
- "sha256": "c570f0810dec46f9a14969089245051c2b3a255f074bd9d06f529615fa6fbd73"
615
  },
616
  "hf_model": {
617
  "path": "hf_model:metrics/omni_model_comparison.json",
618
  "exists": true,
619
- "bytes": 81593,
620
- "sha256": "c570f0810dec46f9a14969089245051c2b3a255f074bd9d06f529615fa6fbd73"
621
  }
622
  },
623
  "failures": []
@@ -726,45 +726,45 @@
726
  "local": {
727
  "path": "repo:docs/data/project_packet.json",
728
  "exists": true,
729
- "bytes": 10496,
730
- "sha256": "fc21325035c3b8d4892bfad50dd1a7f5f562a662b7da7be73f7349cb515544ef"
731
  },
732
  "mirrors": {
733
  "hf_space": {
734
  "path": "hf_space:data/project_packet.json",
735
  "exists": true,
736
- "bytes": 10496,
737
- "sha256": "fc21325035c3b8d4892bfad50dd1a7f5f562a662b7da7be73f7349cb515544ef"
738
  },
739
  "hf_artifacts_data": {
740
  "path": "hf_artifacts:data/project_packet.json",
741
  "exists": true,
742
- "bytes": 10496,
743
- "sha256": "fc21325035c3b8d4892bfad50dd1a7f5f562a662b7da7be73f7349cb515544ef"
744
  },
745
  "hf_artifacts": {
746
  "path": "hf_artifacts:docs/data/project_packet.json",
747
  "exists": true,
748
- "bytes": 10496,
749
- "sha256": "fc21325035c3b8d4892bfad50dd1a7f5f562a662b7da7be73f7349cb515544ef"
750
  },
751
  "hf_model_data": {
752
  "path": "hf_model:data/project_packet.json",
753
  "exists": true,
754
- "bytes": 10496,
755
- "sha256": "fc21325035c3b8d4892bfad50dd1a7f5f562a662b7da7be73f7349cb515544ef"
756
  },
757
  "hf_model_docs_data": {
758
  "path": "hf_model:docs/data/project_packet.json",
759
  "exists": true,
760
- "bytes": 10496,
761
- "sha256": "fc21325035c3b8d4892bfad50dd1a7f5f562a662b7da7be73f7349cb515544ef"
762
  },
763
  "hf_model": {
764
  "path": "hf_model:metrics/project_packet.json",
765
  "exists": true,
766
- "bytes": 10496,
767
- "sha256": "fc21325035c3b8d4892bfad50dd1a7f5f562a662b7da7be73f7349cb515544ef"
768
  }
769
  },
770
  "failures": []
@@ -775,45 +775,45 @@
775
  "local": {
776
  "path": "repo:docs/data/project_status.json",
777
  "exists": true,
778
- "bytes": 23198,
779
- "sha256": "07678354fde90a7e15134f5deff036eb200f7bf422ab2350ff64825b902843d5"
780
  },
781
  "mirrors": {
782
  "hf_space": {
783
  "path": "hf_space:data/project_status.json",
784
  "exists": true,
785
- "bytes": 23198,
786
- "sha256": "07678354fde90a7e15134f5deff036eb200f7bf422ab2350ff64825b902843d5"
787
  },
788
  "hf_artifacts_data": {
789
  "path": "hf_artifacts:data/project_status.json",
790
  "exists": true,
791
- "bytes": 23198,
792
- "sha256": "07678354fde90a7e15134f5deff036eb200f7bf422ab2350ff64825b902843d5"
793
  },
794
  "hf_artifacts": {
795
  "path": "hf_artifacts:docs/data/project_status.json",
796
  "exists": true,
797
- "bytes": 23198,
798
- "sha256": "07678354fde90a7e15134f5deff036eb200f7bf422ab2350ff64825b902843d5"
799
  },
800
  "hf_model_data": {
801
  "path": "hf_model:data/project_status.json",
802
  "exists": true,
803
- "bytes": 23198,
804
- "sha256": "07678354fde90a7e15134f5deff036eb200f7bf422ab2350ff64825b902843d5"
805
  },
806
  "hf_model_docs_data": {
807
  "path": "hf_model:docs/data/project_status.json",
808
  "exists": true,
809
- "bytes": 23198,
810
- "sha256": "07678354fde90a7e15134f5deff036eb200f7bf422ab2350ff64825b902843d5"
811
  },
812
  "hf_model": {
813
  "path": "hf_model:metrics/project_status.json",
814
  "exists": true,
815
- "bytes": 23198,
816
- "sha256": "07678354fde90a7e15134f5deff036eb200f7bf422ab2350ff64825b902843d5"
817
  }
818
  },
819
  "failures": []
@@ -825,44 +825,44 @@
825
  "path": "repo:docs/data/publication_audit.json",
826
  "exists": true,
827
  "bytes": 7591,
828
- "sha256": "5e7e31a14197a95c2411225ccda07dbcd257747d8c809627ed119993bf409033"
829
  },
830
  "mirrors": {
831
  "hf_space": {
832
  "path": "hf_space:data/publication_audit.json",
833
  "exists": true,
834
  "bytes": 7591,
835
- "sha256": "5e7e31a14197a95c2411225ccda07dbcd257747d8c809627ed119993bf409033"
836
  },
837
  "hf_artifacts_data": {
838
  "path": "hf_artifacts:data/publication_audit.json",
839
  "exists": true,
840
  "bytes": 7591,
841
- "sha256": "5e7e31a14197a95c2411225ccda07dbcd257747d8c809627ed119993bf409033"
842
  },
843
  "hf_artifacts": {
844
  "path": "hf_artifacts:docs/data/publication_audit.json",
845
  "exists": true,
846
  "bytes": 7591,
847
- "sha256": "5e7e31a14197a95c2411225ccda07dbcd257747d8c809627ed119993bf409033"
848
  },
849
  "hf_model_data": {
850
  "path": "hf_model:data/publication_audit.json",
851
  "exists": true,
852
  "bytes": 7591,
853
- "sha256": "5e7e31a14197a95c2411225ccda07dbcd257747d8c809627ed119993bf409033"
854
  },
855
  "hf_model_docs_data": {
856
  "path": "hf_model:docs/data/publication_audit.json",
857
  "exists": true,
858
  "bytes": 7591,
859
- "sha256": "5e7e31a14197a95c2411225ccda07dbcd257747d8c809627ed119993bf409033"
860
  },
861
  "hf_model": {
862
  "path": "hf_model:metrics/publication_audit.json",
863
  "exists": true,
864
  "bytes": 7591,
865
- "sha256": "5e7e31a14197a95c2411225ccda07dbcd257747d8c809627ed119993bf409033"
866
  }
867
  },
868
  "failures": []
@@ -874,44 +874,44 @@
874
  "path": "repo:docs/data/public_surface_qa.json",
875
  "exists": true,
876
  "bytes": 5640,
877
- "sha256": "1054926dee80548a2a10db93766ffc1d2578bc326136af2ab4afbed54af8334e"
878
  },
879
  "mirrors": {
880
  "hf_space": {
881
  "path": "hf_space:data/public_surface_qa.json",
882
  "exists": true,
883
  "bytes": 5640,
884
- "sha256": "1054926dee80548a2a10db93766ffc1d2578bc326136af2ab4afbed54af8334e"
885
  },
886
  "hf_artifacts_data": {
887
  "path": "hf_artifacts:data/public_surface_qa.json",
888
  "exists": true,
889
  "bytes": 5640,
890
- "sha256": "1054926dee80548a2a10db93766ffc1d2578bc326136af2ab4afbed54af8334e"
891
  },
892
  "hf_artifacts": {
893
  "path": "hf_artifacts:docs/data/public_surface_qa.json",
894
  "exists": true,
895
  "bytes": 5640,
896
- "sha256": "1054926dee80548a2a10db93766ffc1d2578bc326136af2ab4afbed54af8334e"
897
  },
898
  "hf_model_data": {
899
  "path": "hf_model:data/public_surface_qa.json",
900
  "exists": true,
901
  "bytes": 5640,
902
- "sha256": "1054926dee80548a2a10db93766ffc1d2578bc326136af2ab4afbed54af8334e"
903
  },
904
  "hf_model_docs_data": {
905
  "path": "hf_model:docs/data/public_surface_qa.json",
906
  "exists": true,
907
  "bytes": 5640,
908
- "sha256": "1054926dee80548a2a10db93766ffc1d2578bc326136af2ab4afbed54af8334e"
909
  },
910
  "hf_model": {
911
  "path": "hf_model:metrics/public_surface_qa.json",
912
  "exists": true,
913
  "bytes": 5640,
914
- "sha256": "1054926dee80548a2a10db93766ffc1d2578bc326136af2ab4afbed54af8334e"
915
  }
916
  },
917
  "failures": []
@@ -923,44 +923,93 @@
923
  "path": "repo:docs/data/qwen3_full_parameter_gates.json",
924
  "exists": true,
925
  "bytes": 12183,
926
- "sha256": "50b8b93f0bae34c7fa269ae52d7c2073d6122fdafc9297f380c052d94253bf3f"
927
  },
928
  "mirrors": {
929
  "hf_space": {
930
  "path": "hf_space:data/qwen3_full_parameter_gates.json",
931
  "exists": true,
932
  "bytes": 12183,
933
- "sha256": "50b8b93f0bae34c7fa269ae52d7c2073d6122fdafc9297f380c052d94253bf3f"
934
  },
935
  "hf_artifacts_data": {
936
  "path": "hf_artifacts:data/qwen3_full_parameter_gates.json",
937
  "exists": true,
938
  "bytes": 12183,
939
- "sha256": "50b8b93f0bae34c7fa269ae52d7c2073d6122fdafc9297f380c052d94253bf3f"
940
  },
941
  "hf_artifacts": {
942
  "path": "hf_artifacts:docs/data/qwen3_full_parameter_gates.json",
943
  "exists": true,
944
  "bytes": 12183,
945
- "sha256": "50b8b93f0bae34c7fa269ae52d7c2073d6122fdafc9297f380c052d94253bf3f"
946
  },
947
  "hf_model_data": {
948
  "path": "hf_model:data/qwen3_full_parameter_gates.json",
949
  "exists": true,
950
  "bytes": 12183,
951
- "sha256": "50b8b93f0bae34c7fa269ae52d7c2073d6122fdafc9297f380c052d94253bf3f"
952
  },
953
  "hf_model_docs_data": {
954
  "path": "hf_model:docs/data/qwen3_full_parameter_gates.json",
955
  "exists": true,
956
  "bytes": 12183,
957
- "sha256": "50b8b93f0bae34c7fa269ae52d7c2073d6122fdafc9297f380c052d94253bf3f"
958
  },
959
  "hf_model": {
960
  "path": "hf_model:metrics/qwen3_full_parameter_gates.json",
961
  "exists": true,
962
  "bytes": 12183,
963
- "sha256": "50b8b93f0bae34c7fa269ae52d7c2073d6122fdafc9297f380c052d94253bf3f"
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
964
  }
965
  },
966
  "failures": []
@@ -972,44 +1021,44 @@
972
  "path": "repo:docs/data/quality_gates.json",
973
  "exists": true,
974
  "bytes": 8097,
975
- "sha256": "9780a5415c71ae68b708c65e8d194b2601f66eef20c8bd3def2fd01763dc8e55"
976
  },
977
  "mirrors": {
978
  "hf_space": {
979
  "path": "hf_space:data/quality_gates.json",
980
  "exists": true,
981
  "bytes": 8097,
982
- "sha256": "9780a5415c71ae68b708c65e8d194b2601f66eef20c8bd3def2fd01763dc8e55"
983
  },
984
  "hf_artifacts_data": {
985
  "path": "hf_artifacts:data/quality_gates.json",
986
  "exists": true,
987
  "bytes": 8097,
988
- "sha256": "9780a5415c71ae68b708c65e8d194b2601f66eef20c8bd3def2fd01763dc8e55"
989
  },
990
  "hf_artifacts": {
991
  "path": "hf_artifacts:docs/data/quality_gates.json",
992
  "exists": true,
993
  "bytes": 8097,
994
- "sha256": "9780a5415c71ae68b708c65e8d194b2601f66eef20c8bd3def2fd01763dc8e55"
995
  },
996
  "hf_model_data": {
997
  "path": "hf_model:data/quality_gates.json",
998
  "exists": true,
999
  "bytes": 8097,
1000
- "sha256": "9780a5415c71ae68b708c65e8d194b2601f66eef20c8bd3def2fd01763dc8e55"
1001
  },
1002
  "hf_model_docs_data": {
1003
  "path": "hf_model:docs/data/quality_gates.json",
1004
  "exists": true,
1005
  "bytes": 8097,
1006
- "sha256": "9780a5415c71ae68b708c65e8d194b2601f66eef20c8bd3def2fd01763dc8e55"
1007
  },
1008
  "hf_model": {
1009
  "path": "hf_model:metrics/quality_gates.json",
1010
  "exists": true,
1011
  "bytes": 8097,
1012
- "sha256": "9780a5415c71ae68b708c65e8d194b2601f66eef20c8bd3def2fd01763dc8e55"
1013
  }
1014
  },
1015
  "failures": []
@@ -1118,45 +1167,45 @@
1118
  "local": {
1119
  "path": "repo:docs/data/research_roadmap.json",
1120
  "exists": true,
1121
- "bytes": 13313,
1122
- "sha256": "719a8b5f45810d6b09788391d8549eb5226e480454c9f2a4ed7380ceacfd834b"
1123
  },
1124
  "mirrors": {
1125
  "hf_space": {
1126
  "path": "hf_space:data/research_roadmap.json",
1127
  "exists": true,
1128
- "bytes": 13313,
1129
- "sha256": "719a8b5f45810d6b09788391d8549eb5226e480454c9f2a4ed7380ceacfd834b"
1130
  },
1131
  "hf_artifacts_data": {
1132
  "path": "hf_artifacts:data/research_roadmap.json",
1133
  "exists": true,
1134
- "bytes": 13313,
1135
- "sha256": "719a8b5f45810d6b09788391d8549eb5226e480454c9f2a4ed7380ceacfd834b"
1136
  },
1137
  "hf_artifacts": {
1138
  "path": "hf_artifacts:docs/data/research_roadmap.json",
1139
  "exists": true,
1140
- "bytes": 13313,
1141
- "sha256": "719a8b5f45810d6b09788391d8549eb5226e480454c9f2a4ed7380ceacfd834b"
1142
  },
1143
  "hf_model_data": {
1144
  "path": "hf_model:data/research_roadmap.json",
1145
  "exists": true,
1146
- "bytes": 13313,
1147
- "sha256": "719a8b5f45810d6b09788391d8549eb5226e480454c9f2a4ed7380ceacfd834b"
1148
  },
1149
  "hf_model_docs_data": {
1150
  "path": "hf_model:docs/data/research_roadmap.json",
1151
  "exists": true,
1152
- "bytes": 13313,
1153
- "sha256": "719a8b5f45810d6b09788391d8549eb5226e480454c9f2a4ed7380ceacfd834b"
1154
  },
1155
  "hf_model": {
1156
  "path": "hf_model:metrics/research_roadmap.json",
1157
  "exists": true,
1158
- "bytes": 13313,
1159
- "sha256": "719a8b5f45810d6b09788391d8549eb5226e480454c9f2a4ed7380ceacfd834b"
1160
  }
1161
  },
1162
  "failures": []
@@ -1167,45 +1216,45 @@
1167
  "local": {
1168
  "path": "repo:docs/data/research_roadmap_interactive.json",
1169
  "exists": true,
1170
- "bytes": 145083,
1171
- "sha256": "3f3896f728d65efdee792a255a878243a7f1d777f3c28aac680a6755741e8e6f"
1172
  },
1173
  "mirrors": {
1174
  "hf_space": {
1175
  "path": "hf_space:data/research_roadmap_interactive.json",
1176
  "exists": true,
1177
- "bytes": 145083,
1178
- "sha256": "3f3896f728d65efdee792a255a878243a7f1d777f3c28aac680a6755741e8e6f"
1179
  },
1180
  "hf_artifacts_data": {
1181
  "path": "hf_artifacts:data/research_roadmap_interactive.json",
1182
  "exists": true,
1183
- "bytes": 145083,
1184
- "sha256": "3f3896f728d65efdee792a255a878243a7f1d777f3c28aac680a6755741e8e6f"
1185
  },
1186
  "hf_artifacts": {
1187
  "path": "hf_artifacts:docs/data/research_roadmap_interactive.json",
1188
  "exists": true,
1189
- "bytes": 145083,
1190
- "sha256": "3f3896f728d65efdee792a255a878243a7f1d777f3c28aac680a6755741e8e6f"
1191
  },
1192
  "hf_model_data": {
1193
  "path": "hf_model:data/research_roadmap_interactive.json",
1194
  "exists": true,
1195
- "bytes": 145083,
1196
- "sha256": "3f3896f728d65efdee792a255a878243a7f1d777f3c28aac680a6755741e8e6f"
1197
  },
1198
  "hf_model_docs_data": {
1199
  "path": "hf_model:docs/data/research_roadmap_interactive.json",
1200
  "exists": true,
1201
- "bytes": 145083,
1202
- "sha256": "3f3896f728d65efdee792a255a878243a7f1d777f3c28aac680a6755741e8e6f"
1203
  },
1204
  "hf_model": {
1205
  "path": "hf_model:metrics/research_roadmap_interactive.json",
1206
  "exists": true,
1207
- "bytes": 145083,
1208
- "sha256": "3f3896f728d65efdee792a255a878243a7f1d777f3c28aac680a6755741e8e6f"
1209
  }
1210
  },
1211
  "failures": []
@@ -1363,45 +1412,45 @@
1363
  "local": {
1364
  "path": "repo:docs/data/scope_claims_audit.json",
1365
  "exists": true,
1366
- "bytes": 21325,
1367
- "sha256": "09bbffdae552c243e06c874b9791c2f805214fbc0e7cd4fb42c31cb756a5b374"
1368
  },
1369
  "mirrors": {
1370
  "hf_space": {
1371
  "path": "hf_space:data/scope_claims_audit.json",
1372
  "exists": true,
1373
- "bytes": 21325,
1374
- "sha256": "09bbffdae552c243e06c874b9791c2f805214fbc0e7cd4fb42c31cb756a5b374"
1375
  },
1376
  "hf_artifacts_data": {
1377
  "path": "hf_artifacts:data/scope_claims_audit.json",
1378
  "exists": true,
1379
- "bytes": 21325,
1380
- "sha256": "09bbffdae552c243e06c874b9791c2f805214fbc0e7cd4fb42c31cb756a5b374"
1381
  },
1382
  "hf_artifacts": {
1383
  "path": "hf_artifacts:docs/data/scope_claims_audit.json",
1384
  "exists": true,
1385
- "bytes": 21325,
1386
- "sha256": "09bbffdae552c243e06c874b9791c2f805214fbc0e7cd4fb42c31cb756a5b374"
1387
  },
1388
  "hf_model_data": {
1389
  "path": "hf_model:data/scope_claims_audit.json",
1390
  "exists": true,
1391
- "bytes": 21325,
1392
- "sha256": "09bbffdae552c243e06c874b9791c2f805214fbc0e7cd4fb42c31cb756a5b374"
1393
  },
1394
  "hf_model_docs_data": {
1395
  "path": "hf_model:docs/data/scope_claims_audit.json",
1396
  "exists": true,
1397
- "bytes": 21325,
1398
- "sha256": "09bbffdae552c243e06c874b9791c2f805214fbc0e7cd4fb42c31cb756a5b374"
1399
  },
1400
  "hf_model": {
1401
  "path": "hf_model:metrics/scope_claims_audit.json",
1402
  "exists": true,
1403
- "bytes": 21325,
1404
- "sha256": "09bbffdae552c243e06c874b9791c2f805214fbc0e7cd4fb42c31cb756a5b374"
1405
  }
1406
  },
1407
  "failures": []
@@ -1462,44 +1511,44 @@
1462
  "path": "repo:docs/data/source_alignment_audit.json",
1463
  "exists": true,
1464
  "bytes": 4432,
1465
- "sha256": "b5c63578c8a6b24247cb5e9aca175e5004194c76c7cbbc2e510374fcb30277d8"
1466
  },
1467
  "mirrors": {
1468
  "hf_space": {
1469
  "path": "hf_space:data/source_alignment_audit.json",
1470
  "exists": true,
1471
  "bytes": 4432,
1472
- "sha256": "b5c63578c8a6b24247cb5e9aca175e5004194c76c7cbbc2e510374fcb30277d8"
1473
  },
1474
  "hf_artifacts_data": {
1475
  "path": "hf_artifacts:data/source_alignment_audit.json",
1476
  "exists": true,
1477
  "bytes": 4432,
1478
- "sha256": "b5c63578c8a6b24247cb5e9aca175e5004194c76c7cbbc2e510374fcb30277d8"
1479
  },
1480
  "hf_artifacts": {
1481
  "path": "hf_artifacts:docs/data/source_alignment_audit.json",
1482
  "exists": true,
1483
  "bytes": 4432,
1484
- "sha256": "b5c63578c8a6b24247cb5e9aca175e5004194c76c7cbbc2e510374fcb30277d8"
1485
  },
1486
  "hf_model_data": {
1487
  "path": "hf_model:data/source_alignment_audit.json",
1488
  "exists": true,
1489
  "bytes": 4432,
1490
- "sha256": "b5c63578c8a6b24247cb5e9aca175e5004194c76c7cbbc2e510374fcb30277d8"
1491
  },
1492
  "hf_model_docs_data": {
1493
  "path": "hf_model:docs/data/source_alignment_audit.json",
1494
  "exists": true,
1495
  "bytes": 4432,
1496
- "sha256": "b5c63578c8a6b24247cb5e9aca175e5004194c76c7cbbc2e510374fcb30277d8"
1497
  },
1498
  "hf_model": {
1499
  "path": "hf_model:metrics/source_alignment_audit.json",
1500
  "exists": true,
1501
  "bytes": 4432,
1502
- "sha256": "b5c63578c8a6b24247cb5e9aca175e5004194c76c7cbbc2e510374fcb30277d8"
1503
  }
1504
  },
1505
  "failures": []
@@ -1510,45 +1559,45 @@
1510
  "local": {
1511
  "path": "repo:docs/data/summary_metrics.json",
1512
  "exists": true,
1513
- "bytes": 27604,
1514
- "sha256": "ebaf9d598b4cd91118f149cafa01fe4d17629499565b7be0c7ce4a0ffcd70f6b"
1515
  },
1516
  "mirrors": {
1517
  "hf_space": {
1518
  "path": "hf_space:data/summary_metrics.json",
1519
  "exists": true,
1520
- "bytes": 27604,
1521
- "sha256": "ebaf9d598b4cd91118f149cafa01fe4d17629499565b7be0c7ce4a0ffcd70f6b"
1522
  },
1523
  "hf_artifacts_data": {
1524
  "path": "hf_artifacts:data/summary_metrics.json",
1525
  "exists": true,
1526
- "bytes": 27604,
1527
- "sha256": "ebaf9d598b4cd91118f149cafa01fe4d17629499565b7be0c7ce4a0ffcd70f6b"
1528
  },
1529
  "hf_artifacts": {
1530
  "path": "hf_artifacts:docs/data/summary_metrics.json",
1531
  "exists": true,
1532
- "bytes": 27604,
1533
- "sha256": "ebaf9d598b4cd91118f149cafa01fe4d17629499565b7be0c7ce4a0ffcd70f6b"
1534
  },
1535
  "hf_model_data": {
1536
  "path": "hf_model:data/summary_metrics.json",
1537
  "exists": true,
1538
- "bytes": 27604,
1539
- "sha256": "ebaf9d598b4cd91118f149cafa01fe4d17629499565b7be0c7ce4a0ffcd70f6b"
1540
  },
1541
  "hf_model_docs_data": {
1542
  "path": "hf_model:docs/data/summary_metrics.json",
1543
  "exists": true,
1544
- "bytes": 27604,
1545
- "sha256": "ebaf9d598b4cd91118f149cafa01fe4d17629499565b7be0c7ce4a0ffcd70f6b"
1546
  },
1547
  "hf_model": {
1548
  "path": "hf_model:metrics/summary_metrics.json",
1549
  "exists": true,
1550
- "bytes": 27604,
1551
- "sha256": "ebaf9d598b4cd91118f149cafa01fe4d17629499565b7be0c7ce4a0ffcd70f6b"
1552
  }
1553
  },
1554
  "failures": []
@@ -1609,44 +1658,44 @@
1609
  "path": "repo:docs/data/task_surface_integrity.json",
1610
  "exists": true,
1611
  "bytes": 45779,
1612
- "sha256": "f3078c3993b127ab63afec6973cadb7007549ff3ea5582aa7eb5bc00bcd1c17b"
1613
  },
1614
  "mirrors": {
1615
  "hf_space": {
1616
  "path": "hf_space:data/task_surface_integrity.json",
1617
  "exists": true,
1618
  "bytes": 45779,
1619
- "sha256": "f3078c3993b127ab63afec6973cadb7007549ff3ea5582aa7eb5bc00bcd1c17b"
1620
  },
1621
  "hf_artifacts_data": {
1622
  "path": "hf_artifacts:data/task_surface_integrity.json",
1623
  "exists": true,
1624
  "bytes": 45779,
1625
- "sha256": "f3078c3993b127ab63afec6973cadb7007549ff3ea5582aa7eb5bc00bcd1c17b"
1626
  },
1627
  "hf_artifacts": {
1628
  "path": "hf_artifacts:docs/data/task_surface_integrity.json",
1629
  "exists": true,
1630
  "bytes": 45779,
1631
- "sha256": "f3078c3993b127ab63afec6973cadb7007549ff3ea5582aa7eb5bc00bcd1c17b"
1632
  },
1633
  "hf_model_data": {
1634
  "path": "hf_model:data/task_surface_integrity.json",
1635
  "exists": true,
1636
  "bytes": 45779,
1637
- "sha256": "f3078c3993b127ab63afec6973cadb7007549ff3ea5582aa7eb5bc00bcd1c17b"
1638
  },
1639
  "hf_model_docs_data": {
1640
  "path": "hf_model:docs/data/task_surface_integrity.json",
1641
  "exists": true,
1642
  "bytes": 45779,
1643
- "sha256": "f3078c3993b127ab63afec6973cadb7007549ff3ea5582aa7eb5bc00bcd1c17b"
1644
  },
1645
  "hf_model": {
1646
  "path": "hf_model:metrics/task_surface_integrity.json",
1647
  "exists": true,
1648
  "bytes": 45779,
1649
- "sha256": "f3078c3993b127ab63afec6973cadb7007549ff3ea5582aa7eb5bc00bcd1c17b"
1650
  }
1651
  },
1652
  "failures": []
@@ -1706,45 +1755,45 @@
1706
  "local": {
1707
  "path": "repo:docs/data/website_integrity.json",
1708
  "exists": true,
1709
- "bytes": 15654,
1710
- "sha256": "b5235c9c9a9394977ec707ed5c42b50a26f4538bd499bff04861da6be45ba404"
1711
  },
1712
  "mirrors": {
1713
  "hf_space": {
1714
  "path": "hf_space:data/website_integrity.json",
1715
  "exists": true,
1716
- "bytes": 15654,
1717
- "sha256": "b5235c9c9a9394977ec707ed5c42b50a26f4538bd499bff04861da6be45ba404"
1718
  },
1719
  "hf_artifacts_data": {
1720
  "path": "hf_artifacts:data/website_integrity.json",
1721
  "exists": true,
1722
- "bytes": 15654,
1723
- "sha256": "b5235c9c9a9394977ec707ed5c42b50a26f4538bd499bff04861da6be45ba404"
1724
  },
1725
  "hf_artifacts": {
1726
  "path": "hf_artifacts:docs/data/website_integrity.json",
1727
  "exists": true,
1728
- "bytes": 15654,
1729
- "sha256": "b5235c9c9a9394977ec707ed5c42b50a26f4538bd499bff04861da6be45ba404"
1730
  },
1731
  "hf_model_data": {
1732
  "path": "hf_model:data/website_integrity.json",
1733
  "exists": true,
1734
- "bytes": 15654,
1735
- "sha256": "b5235c9c9a9394977ec707ed5c42b50a26f4538bd499bff04861da6be45ba404"
1736
  },
1737
  "hf_model_docs_data": {
1738
  "path": "hf_model:docs/data/website_integrity.json",
1739
  "exists": true,
1740
- "bytes": 15654,
1741
- "sha256": "b5235c9c9a9394977ec707ed5c42b50a26f4538bd499bff04861da6be45ba404"
1742
  },
1743
  "hf_model": {
1744
  "path": "hf_model:metrics/website_integrity.json",
1745
  "exists": true,
1746
- "bytes": 15654,
1747
- "sha256": "b5235c9c9a9394977ec707ed5c42b50a26f4538bd499bff04861da6be45ba404"
1748
  }
1749
  },
1750
  "failures": []
@@ -2520,21 +2569,21 @@
2520
  "local": {
2521
  "path": "repo:scripts/omni/build_omni_model_comparison.py",
2522
  "exists": true,
2523
- "bytes": 44466,
2524
- "sha256": "5e89e608edcbc69e8fbc5beb1370ccc443110fe5125d32400cb0aeafb56356aa"
2525
  },
2526
  "mirrors": {
2527
  "hf_artifacts": {
2528
  "path": "hf_artifacts:scripts/omni/build_omni_model_comparison.py",
2529
  "exists": true,
2530
- "bytes": 44466,
2531
- "sha256": "5e89e608edcbc69e8fbc5beb1370ccc443110fe5125d32400cb0aeafb56356aa"
2532
  },
2533
  "hf_model": {
2534
  "path": "hf_model:scripts/omni/build_omni_model_comparison.py",
2535
  "exists": true,
2536
- "bytes": 44466,
2537
- "sha256": "5e89e608edcbc69e8fbc5beb1370ccc443110fe5125d32400cb0aeafb56356aa"
2538
  }
2539
  },
2540
  "failures": []
@@ -2895,21 +2944,21 @@
2895
  "local": {
2896
  "path": "repo:scripts/build_artifact_index.py",
2897
  "exists": true,
2898
- "bytes": 42045,
2899
- "sha256": "0954119cda59bb9c77cba362a206ac237eae73c4a1dc44cfa56599f170fc94de"
2900
  },
2901
  "mirrors": {
2902
  "hf_artifacts": {
2903
  "path": "hf_artifacts:scripts/build_artifact_index.py",
2904
  "exists": true,
2905
- "bytes": 42045,
2906
- "sha256": "0954119cda59bb9c77cba362a206ac237eae73c4a1dc44cfa56599f170fc94de"
2907
  },
2908
  "hf_model": {
2909
  "path": "hf_model:scripts/build_artifact_index.py",
2910
  "exists": true,
2911
- "bytes": 42045,
2912
- "sha256": "0954119cda59bb9c77cba362a206ac237eae73c4a1dc44cfa56599f170fc94de"
2913
  }
2914
  },
2915
  "failures": []
@@ -3170,21 +3219,21 @@
3170
  "local": {
3171
  "path": "repo:scripts/verify_live_publication.py",
3172
  "exists": true,
3173
- "bytes": 42224,
3174
- "sha256": "069b05229b72b9fa0c0917afd18655f58e5258499b43889dd96a87f29f577890"
3175
  },
3176
  "mirrors": {
3177
  "hf_artifacts": {
3178
  "path": "hf_artifacts:scripts/verify_live_publication.py",
3179
  "exists": true,
3180
- "bytes": 42224,
3181
- "sha256": "069b05229b72b9fa0c0917afd18655f58e5258499b43889dd96a87f29f577890"
3182
  },
3183
  "hf_model": {
3184
  "path": "hf_model:scripts/verify_live_publication.py",
3185
  "exists": true,
3186
- "bytes": 42224,
3187
- "sha256": "069b05229b72b9fa0c0917afd18655f58e5258499b43889dd96a87f29f577890"
3188
  }
3189
  },
3190
  "failures": []
@@ -3195,21 +3244,21 @@
3195
  "local": {
3196
  "path": "repo:scripts/validate_mirror_parity.py",
3197
  "exists": true,
3198
- "bytes": 20941,
3199
- "sha256": "edbc633ae7482ce190a02bc956d0c0e3fda0d102e1b1448e9d15fe966442ab38"
3200
  },
3201
  "mirrors": {
3202
  "hf_artifacts": {
3203
  "path": "hf_artifacts:scripts/validate_mirror_parity.py",
3204
  "exists": true,
3205
- "bytes": 20941,
3206
- "sha256": "edbc633ae7482ce190a02bc956d0c0e3fda0d102e1b1448e9d15fe966442ab38"
3207
  },
3208
  "hf_model": {
3209
  "path": "hf_model:scripts/validate_mirror_parity.py",
3210
  "exists": true,
3211
- "bytes": 20941,
3212
- "sha256": "edbc633ae7482ce190a02bc956d0c0e3fda0d102e1b1448e9d15fe966442ab38"
3213
  }
3214
  },
3215
  "failures": []
@@ -3245,21 +3294,21 @@
3245
  "local": {
3246
  "path": "repo:scripts/validate_scope_claims.py",
3247
  "exists": true,
3248
- "bytes": 12028,
3249
- "sha256": "1aff00ee149f3e49e7c5043afea24311128be633ef45c27e4d5bab4f16f87bd4"
3250
  },
3251
  "mirrors": {
3252
  "hf_artifacts": {
3253
  "path": "hf_artifacts:scripts/validate_scope_claims.py",
3254
  "exists": true,
3255
- "bytes": 12028,
3256
- "sha256": "1aff00ee149f3e49e7c5043afea24311128be633ef45c27e4d5bab4f16f87bd4"
3257
  },
3258
  "hf_model": {
3259
  "path": "hf_model:scripts/validate_scope_claims.py",
3260
  "exists": true,
3261
- "bytes": 12028,
3262
- "sha256": "1aff00ee149f3e49e7c5043afea24311128be633ef45c27e4d5bab4f16f87bd4"
3263
  }
3264
  },
3265
  "failures": []
@@ -3445,21 +3494,21 @@
3445
  "local": {
3446
  "path": "repo:docs/index.html",
3447
  "exists": true,
3448
- "bytes": 186490,
3449
- "sha256": "f0d74ebaf983f1d49643069efe505cd45a39731349af85329b89fe8cbc5b1758"
3450
  },
3451
  "mirrors": {
3452
  "hf_space": {
3453
  "path": "hf_space:index.html",
3454
  "exists": true,
3455
- "bytes": 186490,
3456
- "sha256": "f0d74ebaf983f1d49643069efe505cd45a39731349af85329b89fe8cbc5b1758"
3457
  },
3458
  "hf_artifacts_docs": {
3459
  "path": "hf_artifacts:docs/index.html",
3460
  "exists": true,
3461
- "bytes": 186490,
3462
- "sha256": "f0d74ebaf983f1d49643069efe505cd45a39731349af85329b89fe8cbc5b1758"
3463
  }
3464
  },
3465
  "failures": []
@@ -3731,27 +3780,27 @@
3731
  "local": {
3732
  "path": "repo:results/omni_finetune/OMNI_MODEL_COMPARISON.md",
3733
  "exists": true,
3734
- "bytes": 15731,
3735
- "sha256": "1ee5621a825219c1125a9c3f3ac779ac8bfa2fc45668531dbee3a728be13bfb5"
3736
  },
3737
  "mirrors": {
3738
  "hf_space": {
3739
  "path": "hf_space:results/omni_finetune/OMNI_MODEL_COMPARISON.md",
3740
  "exists": true,
3741
- "bytes": 15731,
3742
- "sha256": "1ee5621a825219c1125a9c3f3ac779ac8bfa2fc45668531dbee3a728be13bfb5"
3743
  },
3744
  "hf_artifacts": {
3745
  "path": "hf_artifacts:results/omni_finetune/OMNI_MODEL_COMPARISON.md",
3746
  "exists": true,
3747
- "bytes": 15731,
3748
- "sha256": "1ee5621a825219c1125a9c3f3ac779ac8bfa2fc45668531dbee3a728be13bfb5"
3749
  },
3750
  "hf_model": {
3751
  "path": "hf_model:results/omni_finetune/OMNI_MODEL_COMPARISON.md",
3752
  "exists": true,
3753
- "bytes": 15731,
3754
- "sha256": "1ee5621a825219c1125a9c3f3ac779ac8bfa2fc45668531dbee3a728be13bfb5"
3755
  }
3756
  },
3757
  "failures": []
@@ -3763,26 +3812,57 @@
3763
  "path": "repo:results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md",
3764
  "exists": true,
3765
  "bytes": 3253,
3766
- "sha256": "5c0e49adad861e00bede7f9245808e6e68346e7bc51c0b0142d2b99f944f2408"
3767
  },
3768
  "mirrors": {
3769
  "hf_space": {
3770
  "path": "hf_space:results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md",
3771
  "exists": true,
3772
  "bytes": 3253,
3773
- "sha256": "5c0e49adad861e00bede7f9245808e6e68346e7bc51c0b0142d2b99f944f2408"
3774
  },
3775
  "hf_artifacts": {
3776
  "path": "hf_artifacts:results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md",
3777
  "exists": true,
3778
  "bytes": 3253,
3779
- "sha256": "5c0e49adad861e00bede7f9245808e6e68346e7bc51c0b0142d2b99f944f2408"
3780
  },
3781
  "hf_model": {
3782
  "path": "hf_model:results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md",
3783
  "exists": true,
3784
  "bytes": 3253,
3785
- "sha256": "5c0e49adad861e00bede7f9245808e6e68346e7bc51c0b0142d2b99f944f2408"
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3786
  }
3787
  },
3788
  "failures": []
@@ -7972,6 +8052,37 @@
7972
  },
7973
  "failures": []
7974
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7975
  {
7976
  "name": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full/training/progress.jsonl",
7977
  "status": "pass",
@@ -11574,27 +11685,27 @@
11574
  "local": {
11575
  "path": "repo:RESEARCH_ROADMAP.md",
11576
  "exists": true,
11577
- "bytes": 14354,
11578
- "sha256": "7d385916e6d07cfa5f95e80b501fe58237f671711691cc8c792479d233991ef5"
11579
  },
11580
  "mirrors": {
11581
  "hf_space": {
11582
  "path": "hf_space:RESEARCH_ROADMAP.md",
11583
  "exists": true,
11584
- "bytes": 14354,
11585
- "sha256": "7d385916e6d07cfa5f95e80b501fe58237f671711691cc8c792479d233991ef5"
11586
  },
11587
  "hf_artifacts": {
11588
  "path": "hf_artifacts:RESEARCH_ROADMAP.md",
11589
  "exists": true,
11590
- "bytes": 14354,
11591
- "sha256": "7d385916e6d07cfa5f95e80b501fe58237f671711691cc8c792479d233991ef5"
11592
  },
11593
  "hf_model": {
11594
  "path": "hf_model:RESEARCH_ROADMAP.md",
11595
  "exists": true,
11596
- "bytes": 14354,
11597
- "sha256": "7d385916e6d07cfa5f95e80b501fe58237f671711691cc8c792479d233991ef5"
11598
  }
11599
  },
11600
  "failures": []
@@ -11605,27 +11716,27 @@
11605
  "local": {
11606
  "path": "repo:PROJECT_STATUS.md",
11607
  "exists": true,
11608
- "bytes": 13535,
11609
- "sha256": "595f90e26d75baba46a0827eb2ec38d0c6badb137437f642dff4e88e22b7ca80"
11610
  },
11611
  "mirrors": {
11612
  "hf_space": {
11613
  "path": "hf_space:PROJECT_STATUS.md",
11614
  "exists": true,
11615
- "bytes": 13535,
11616
- "sha256": "595f90e26d75baba46a0827eb2ec38d0c6badb137437f642dff4e88e22b7ca80"
11617
  },
11618
  "hf_artifacts": {
11619
  "path": "hf_artifacts:PROJECT_STATUS.md",
11620
  "exists": true,
11621
- "bytes": 13535,
11622
- "sha256": "595f90e26d75baba46a0827eb2ec38d0c6badb137437f642dff4e88e22b7ca80"
11623
  },
11624
  "hf_model": {
11625
  "path": "hf_model:PROJECT_STATUS.md",
11626
  "exists": true,
11627
- "bytes": 13535,
11628
- "sha256": "595f90e26d75baba46a0827eb2ec38d0c6badb137437f642dff4e88e22b7ca80"
11629
  }
11630
  },
11631
  "failures": []
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-13T17:47:14+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
+ "group_count": 366,
7
  "failure_count": 0,
8
  "failures_by_surface": {}
9
  },
 
138
  "local": {
139
  "path": "repo:docs/data/artifact_index.json",
140
  "exists": true,
141
+ "bytes": 88913,
142
+ "sha256": "ad08fe33d7563a5be1a92158fb6e86c28ca990d6009cbef62d8c0bb616cf5ff6"
143
  },
144
  "mirrors": {
145
  "hf_space": {
146
  "path": "hf_space:data/artifact_index.json",
147
  "exists": true,
148
+ "bytes": 88913,
149
+ "sha256": "ad08fe33d7563a5be1a92158fb6e86c28ca990d6009cbef62d8c0bb616cf5ff6"
150
  },
151
  "hf_artifacts_data": {
152
  "path": "hf_artifacts:data/artifact_index.json",
153
  "exists": true,
154
+ "bytes": 88913,
155
+ "sha256": "ad08fe33d7563a5be1a92158fb6e86c28ca990d6009cbef62d8c0bb616cf5ff6"
156
  },
157
  "hf_artifacts": {
158
  "path": "hf_artifacts:docs/data/artifact_index.json",
159
  "exists": true,
160
+ "bytes": 88913,
161
+ "sha256": "ad08fe33d7563a5be1a92158fb6e86c28ca990d6009cbef62d8c0bb616cf5ff6"
162
  },
163
  "hf_model_data": {
164
  "path": "hf_model:data/artifact_index.json",
165
  "exists": true,
166
+ "bytes": 88913,
167
+ "sha256": "ad08fe33d7563a5be1a92158fb6e86c28ca990d6009cbef62d8c0bb616cf5ff6"
168
  },
169
  "hf_model_docs_data": {
170
  "path": "hf_model:docs/data/artifact_index.json",
171
  "exists": true,
172
+ "bytes": 88913,
173
+ "sha256": "ad08fe33d7563a5be1a92158fb6e86c28ca990d6009cbef62d8c0bb616cf5ff6"
174
  },
175
  "hf_model": {
176
  "path": "hf_model:metrics/artifact_index.json",
177
  "exists": true,
178
+ "bytes": 88913,
179
+ "sha256": "ad08fe33d7563a5be1a92158fb6e86c28ca990d6009cbef62d8c0bb616cf5ff6"
180
  }
181
  },
182
  "failures": []
 
530
  "local": {
531
  "path": "repo:docs/data/omni_finetune_verified_result.json",
532
  "exists": true,
533
+ "bytes": 4325,
534
+ "sha256": "861d451551143d28759a9e475abc88868148c5184bb23821d9a1584cad0a195f"
535
  },
536
  "mirrors": {
537
  "hf_space": {
538
  "path": "hf_space:data/omni_finetune_verified_result.json",
539
  "exists": true,
540
+ "bytes": 4325,
541
+ "sha256": "861d451551143d28759a9e475abc88868148c5184bb23821d9a1584cad0a195f"
542
  },
543
  "hf_artifacts_data": {
544
  "path": "hf_artifacts:data/omni_finetune_verified_result.json",
545
  "exists": true,
546
+ "bytes": 4325,
547
+ "sha256": "861d451551143d28759a9e475abc88868148c5184bb23821d9a1584cad0a195f"
548
  },
549
  "hf_artifacts": {
550
  "path": "hf_artifacts:docs/data/omni_finetune_verified_result.json",
551
  "exists": true,
552
+ "bytes": 4325,
553
+ "sha256": "861d451551143d28759a9e475abc88868148c5184bb23821d9a1584cad0a195f"
554
  },
555
  "hf_model_data": {
556
  "path": "hf_model:data/omni_finetune_verified_result.json",
557
  "exists": true,
558
+ "bytes": 4325,
559
+ "sha256": "861d451551143d28759a9e475abc88868148c5184bb23821d9a1584cad0a195f"
560
  },
561
  "hf_model_docs_data": {
562
  "path": "hf_model:docs/data/omni_finetune_verified_result.json",
563
  "exists": true,
564
+ "bytes": 4325,
565
+ "sha256": "861d451551143d28759a9e475abc88868148c5184bb23821d9a1584cad0a195f"
566
  },
567
  "hf_model": {
568
  "path": "hf_model:metrics/omni_finetune_verified_result.json",
569
  "exists": true,
570
+ "bytes": 4325,
571
+ "sha256": "861d451551143d28759a9e475abc88868148c5184bb23821d9a1584cad0a195f"
572
  }
573
  },
574
  "failures": []
 
579
  "local": {
580
  "path": "repo:docs/data/omni_model_comparison.json",
581
  "exists": true,
582
+ "bytes": 81866,
583
+ "sha256": "191125098a66ecccfa27395c0f9776616f74b4bf8fb19f16b75cda7ed06cb4b2"
584
  },
585
  "mirrors": {
586
  "hf_space": {
587
  "path": "hf_space:data/omni_model_comparison.json",
588
  "exists": true,
589
+ "bytes": 81866,
590
+ "sha256": "191125098a66ecccfa27395c0f9776616f74b4bf8fb19f16b75cda7ed06cb4b2"
591
  },
592
  "hf_artifacts_data": {
593
  "path": "hf_artifacts:data/omni_model_comparison.json",
594
  "exists": true,
595
+ "bytes": 81866,
596
+ "sha256": "191125098a66ecccfa27395c0f9776616f74b4bf8fb19f16b75cda7ed06cb4b2"
597
  },
598
  "hf_artifacts": {
599
  "path": "hf_artifacts:docs/data/omni_model_comparison.json",
600
  "exists": true,
601
+ "bytes": 81866,
602
+ "sha256": "191125098a66ecccfa27395c0f9776616f74b4bf8fb19f16b75cda7ed06cb4b2"
603
  },
604
  "hf_model_data": {
605
  "path": "hf_model:data/omni_model_comparison.json",
606
  "exists": true,
607
+ "bytes": 81866,
608
+ "sha256": "191125098a66ecccfa27395c0f9776616f74b4bf8fb19f16b75cda7ed06cb4b2"
609
  },
610
  "hf_model_docs_data": {
611
  "path": "hf_model:docs/data/omni_model_comparison.json",
612
  "exists": true,
613
+ "bytes": 81866,
614
+ "sha256": "191125098a66ecccfa27395c0f9776616f74b4bf8fb19f16b75cda7ed06cb4b2"
615
  },
616
  "hf_model": {
617
  "path": "hf_model:metrics/omni_model_comparison.json",
618
  "exists": true,
619
+ "bytes": 81866,
620
+ "sha256": "191125098a66ecccfa27395c0f9776616f74b4bf8fb19f16b75cda7ed06cb4b2"
621
  }
622
  },
623
  "failures": []
 
726
  "local": {
727
  "path": "repo:docs/data/project_packet.json",
728
  "exists": true,
729
+ "bytes": 10597,
730
+ "sha256": "a64b7c033c54879e0183e7ec794d3197fb483024c25947759287fcd4b7e0fec1"
731
  },
732
  "mirrors": {
733
  "hf_space": {
734
  "path": "hf_space:data/project_packet.json",
735
  "exists": true,
736
+ "bytes": 10597,
737
+ "sha256": "a64b7c033c54879e0183e7ec794d3197fb483024c25947759287fcd4b7e0fec1"
738
  },
739
  "hf_artifacts_data": {
740
  "path": "hf_artifacts:data/project_packet.json",
741
  "exists": true,
742
+ "bytes": 10597,
743
+ "sha256": "a64b7c033c54879e0183e7ec794d3197fb483024c25947759287fcd4b7e0fec1"
744
  },
745
  "hf_artifacts": {
746
  "path": "hf_artifacts:docs/data/project_packet.json",
747
  "exists": true,
748
+ "bytes": 10597,
749
+ "sha256": "a64b7c033c54879e0183e7ec794d3197fb483024c25947759287fcd4b7e0fec1"
750
  },
751
  "hf_model_data": {
752
  "path": "hf_model:data/project_packet.json",
753
  "exists": true,
754
+ "bytes": 10597,
755
+ "sha256": "a64b7c033c54879e0183e7ec794d3197fb483024c25947759287fcd4b7e0fec1"
756
  },
757
  "hf_model_docs_data": {
758
  "path": "hf_model:docs/data/project_packet.json",
759
  "exists": true,
760
+ "bytes": 10597,
761
+ "sha256": "a64b7c033c54879e0183e7ec794d3197fb483024c25947759287fcd4b7e0fec1"
762
  },
763
  "hf_model": {
764
  "path": "hf_model:metrics/project_packet.json",
765
  "exists": true,
766
+ "bytes": 10597,
767
+ "sha256": "a64b7c033c54879e0183e7ec794d3197fb483024c25947759287fcd4b7e0fec1"
768
  }
769
  },
770
  "failures": []
 
775
  "local": {
776
  "path": "repo:docs/data/project_status.json",
777
  "exists": true,
778
+ "bytes": 23535,
779
+ "sha256": "9ffae32ff0b3750f89179d2ce92205f95a5b53069d0aa344d6342c23b1efebbd"
780
  },
781
  "mirrors": {
782
  "hf_space": {
783
  "path": "hf_space:data/project_status.json",
784
  "exists": true,
785
+ "bytes": 23535,
786
+ "sha256": "9ffae32ff0b3750f89179d2ce92205f95a5b53069d0aa344d6342c23b1efebbd"
787
  },
788
  "hf_artifacts_data": {
789
  "path": "hf_artifacts:data/project_status.json",
790
  "exists": true,
791
+ "bytes": 23535,
792
+ "sha256": "9ffae32ff0b3750f89179d2ce92205f95a5b53069d0aa344d6342c23b1efebbd"
793
  },
794
  "hf_artifacts": {
795
  "path": "hf_artifacts:docs/data/project_status.json",
796
  "exists": true,
797
+ "bytes": 23535,
798
+ "sha256": "9ffae32ff0b3750f89179d2ce92205f95a5b53069d0aa344d6342c23b1efebbd"
799
  },
800
  "hf_model_data": {
801
  "path": "hf_model:data/project_status.json",
802
  "exists": true,
803
+ "bytes": 23535,
804
+ "sha256": "9ffae32ff0b3750f89179d2ce92205f95a5b53069d0aa344d6342c23b1efebbd"
805
  },
806
  "hf_model_docs_data": {
807
  "path": "hf_model:docs/data/project_status.json",
808
  "exists": true,
809
+ "bytes": 23535,
810
+ "sha256": "9ffae32ff0b3750f89179d2ce92205f95a5b53069d0aa344d6342c23b1efebbd"
811
  },
812
  "hf_model": {
813
  "path": "hf_model:metrics/project_status.json",
814
  "exists": true,
815
+ "bytes": 23535,
816
+ "sha256": "9ffae32ff0b3750f89179d2ce92205f95a5b53069d0aa344d6342c23b1efebbd"
817
  }
818
  },
819
  "failures": []
 
825
  "path": "repo:docs/data/publication_audit.json",
826
  "exists": true,
827
  "bytes": 7591,
828
+ "sha256": "5670d3c1f7592d96a342d4b6cd2f2cedfbe29ac309db282f2082fc2414f4469a"
829
  },
830
  "mirrors": {
831
  "hf_space": {
832
  "path": "hf_space:data/publication_audit.json",
833
  "exists": true,
834
  "bytes": 7591,
835
+ "sha256": "5670d3c1f7592d96a342d4b6cd2f2cedfbe29ac309db282f2082fc2414f4469a"
836
  },
837
  "hf_artifacts_data": {
838
  "path": "hf_artifacts:data/publication_audit.json",
839
  "exists": true,
840
  "bytes": 7591,
841
+ "sha256": "5670d3c1f7592d96a342d4b6cd2f2cedfbe29ac309db282f2082fc2414f4469a"
842
  },
843
  "hf_artifacts": {
844
  "path": "hf_artifacts:docs/data/publication_audit.json",
845
  "exists": true,
846
  "bytes": 7591,
847
+ "sha256": "5670d3c1f7592d96a342d4b6cd2f2cedfbe29ac309db282f2082fc2414f4469a"
848
  },
849
  "hf_model_data": {
850
  "path": "hf_model:data/publication_audit.json",
851
  "exists": true,
852
  "bytes": 7591,
853
+ "sha256": "5670d3c1f7592d96a342d4b6cd2f2cedfbe29ac309db282f2082fc2414f4469a"
854
  },
855
  "hf_model_docs_data": {
856
  "path": "hf_model:docs/data/publication_audit.json",
857
  "exists": true,
858
  "bytes": 7591,
859
+ "sha256": "5670d3c1f7592d96a342d4b6cd2f2cedfbe29ac309db282f2082fc2414f4469a"
860
  },
861
  "hf_model": {
862
  "path": "hf_model:metrics/publication_audit.json",
863
  "exists": true,
864
  "bytes": 7591,
865
+ "sha256": "5670d3c1f7592d96a342d4b6cd2f2cedfbe29ac309db282f2082fc2414f4469a"
866
  }
867
  },
868
  "failures": []
 
874
  "path": "repo:docs/data/public_surface_qa.json",
875
  "exists": true,
876
  "bytes": 5640,
877
+ "sha256": "e4557f23f199b87063d8a97aca64253d58bed5be3538a51526ac86ce3258ee7c"
878
  },
879
  "mirrors": {
880
  "hf_space": {
881
  "path": "hf_space:data/public_surface_qa.json",
882
  "exists": true,
883
  "bytes": 5640,
884
+ "sha256": "e4557f23f199b87063d8a97aca64253d58bed5be3538a51526ac86ce3258ee7c"
885
  },
886
  "hf_artifacts_data": {
887
  "path": "hf_artifacts:data/public_surface_qa.json",
888
  "exists": true,
889
  "bytes": 5640,
890
+ "sha256": "e4557f23f199b87063d8a97aca64253d58bed5be3538a51526ac86ce3258ee7c"
891
  },
892
  "hf_artifacts": {
893
  "path": "hf_artifacts:docs/data/public_surface_qa.json",
894
  "exists": true,
895
  "bytes": 5640,
896
+ "sha256": "e4557f23f199b87063d8a97aca64253d58bed5be3538a51526ac86ce3258ee7c"
897
  },
898
  "hf_model_data": {
899
  "path": "hf_model:data/public_surface_qa.json",
900
  "exists": true,
901
  "bytes": 5640,
902
+ "sha256": "e4557f23f199b87063d8a97aca64253d58bed5be3538a51526ac86ce3258ee7c"
903
  },
904
  "hf_model_docs_data": {
905
  "path": "hf_model:docs/data/public_surface_qa.json",
906
  "exists": true,
907
  "bytes": 5640,
908
+ "sha256": "e4557f23f199b87063d8a97aca64253d58bed5be3538a51526ac86ce3258ee7c"
909
  },
910
  "hf_model": {
911
  "path": "hf_model:metrics/public_surface_qa.json",
912
  "exists": true,
913
  "bytes": 5640,
914
+ "sha256": "e4557f23f199b87063d8a97aca64253d58bed5be3538a51526ac86ce3258ee7c"
915
  }
916
  },
917
  "failures": []
 
923
  "path": "repo:docs/data/qwen3_full_parameter_gates.json",
924
  "exists": true,
925
  "bytes": 12183,
926
+ "sha256": "d051608d3428645778f721e538af93566ab772871a825ac12825dd5f18e94a95"
927
  },
928
  "mirrors": {
929
  "hf_space": {
930
  "path": "hf_space:data/qwen3_full_parameter_gates.json",
931
  "exists": true,
932
  "bytes": 12183,
933
+ "sha256": "d051608d3428645778f721e538af93566ab772871a825ac12825dd5f18e94a95"
934
  },
935
  "hf_artifacts_data": {
936
  "path": "hf_artifacts:data/qwen3_full_parameter_gates.json",
937
  "exists": true,
938
  "bytes": 12183,
939
+ "sha256": "d051608d3428645778f721e538af93566ab772871a825ac12825dd5f18e94a95"
940
  },
941
  "hf_artifacts": {
942
  "path": "hf_artifacts:docs/data/qwen3_full_parameter_gates.json",
943
  "exists": true,
944
  "bytes": 12183,
945
+ "sha256": "d051608d3428645778f721e538af93566ab772871a825ac12825dd5f18e94a95"
946
  },
947
  "hf_model_data": {
948
  "path": "hf_model:data/qwen3_full_parameter_gates.json",
949
  "exists": true,
950
  "bytes": 12183,
951
+ "sha256": "d051608d3428645778f721e538af93566ab772871a825ac12825dd5f18e94a95"
952
  },
953
  "hf_model_docs_data": {
954
  "path": "hf_model:docs/data/qwen3_full_parameter_gates.json",
955
  "exists": true,
956
  "bytes": 12183,
957
+ "sha256": "d051608d3428645778f721e538af93566ab772871a825ac12825dd5f18e94a95"
958
  },
959
  "hf_model": {
960
  "path": "hf_model:metrics/qwen3_full_parameter_gates.json",
961
  "exists": true,
962
  "bytes": 12183,
963
+ "sha256": "d051608d3428645778f721e538af93566ab772871a825ac12825dd5f18e94a95"
964
+ }
965
+ },
966
+ "failures": []
967
+ },
968
+ {
969
+ "name": "data/qwen3_v5_v6_comparison.json",
970
+ "status": "pass",
971
+ "local": {
972
+ "path": "repo:docs/data/qwen3_v5_v6_comparison.json",
973
+ "exists": true,
974
+ "bytes": 2814,
975
+ "sha256": "f5d16e279a82cdc6266a1318584bf38cbc0b105296d437f9b8bf0398403aace5"
976
+ },
977
+ "mirrors": {
978
+ "hf_space": {
979
+ "path": "hf_space:data/qwen3_v5_v6_comparison.json",
980
+ "exists": true,
981
+ "bytes": 2814,
982
+ "sha256": "f5d16e279a82cdc6266a1318584bf38cbc0b105296d437f9b8bf0398403aace5"
983
+ },
984
+ "hf_artifacts_data": {
985
+ "path": "hf_artifacts:data/qwen3_v5_v6_comparison.json",
986
+ "exists": true,
987
+ "bytes": 2814,
988
+ "sha256": "f5d16e279a82cdc6266a1318584bf38cbc0b105296d437f9b8bf0398403aace5"
989
+ },
990
+ "hf_artifacts": {
991
+ "path": "hf_artifacts:docs/data/qwen3_v5_v6_comparison.json",
992
+ "exists": true,
993
+ "bytes": 2814,
994
+ "sha256": "f5d16e279a82cdc6266a1318584bf38cbc0b105296d437f9b8bf0398403aace5"
995
+ },
996
+ "hf_model_data": {
997
+ "path": "hf_model:data/qwen3_v5_v6_comparison.json",
998
+ "exists": true,
999
+ "bytes": 2814,
1000
+ "sha256": "f5d16e279a82cdc6266a1318584bf38cbc0b105296d437f9b8bf0398403aace5"
1001
+ },
1002
+ "hf_model_docs_data": {
1003
+ "path": "hf_model:docs/data/qwen3_v5_v6_comparison.json",
1004
+ "exists": true,
1005
+ "bytes": 2814,
1006
+ "sha256": "f5d16e279a82cdc6266a1318584bf38cbc0b105296d437f9b8bf0398403aace5"
1007
+ },
1008
+ "hf_model": {
1009
+ "path": "hf_model:metrics/qwen3_v5_v6_comparison.json",
1010
+ "exists": true,
1011
+ "bytes": 2814,
1012
+ "sha256": "f5d16e279a82cdc6266a1318584bf38cbc0b105296d437f9b8bf0398403aace5"
1013
  }
1014
  },
1015
  "failures": []
 
1021
  "path": "repo:docs/data/quality_gates.json",
1022
  "exists": true,
1023
  "bytes": 8097,
1024
+ "sha256": "1cdc8b4767b3ca88eada654a3117aa2de253fea7af62573b080088e8f1b311bd"
1025
  },
1026
  "mirrors": {
1027
  "hf_space": {
1028
  "path": "hf_space:data/quality_gates.json",
1029
  "exists": true,
1030
  "bytes": 8097,
1031
+ "sha256": "1cdc8b4767b3ca88eada654a3117aa2de253fea7af62573b080088e8f1b311bd"
1032
  },
1033
  "hf_artifacts_data": {
1034
  "path": "hf_artifacts:data/quality_gates.json",
1035
  "exists": true,
1036
  "bytes": 8097,
1037
+ "sha256": "1cdc8b4767b3ca88eada654a3117aa2de253fea7af62573b080088e8f1b311bd"
1038
  },
1039
  "hf_artifacts": {
1040
  "path": "hf_artifacts:docs/data/quality_gates.json",
1041
  "exists": true,
1042
  "bytes": 8097,
1043
+ "sha256": "1cdc8b4767b3ca88eada654a3117aa2de253fea7af62573b080088e8f1b311bd"
1044
  },
1045
  "hf_model_data": {
1046
  "path": "hf_model:data/quality_gates.json",
1047
  "exists": true,
1048
  "bytes": 8097,
1049
+ "sha256": "1cdc8b4767b3ca88eada654a3117aa2de253fea7af62573b080088e8f1b311bd"
1050
  },
1051
  "hf_model_docs_data": {
1052
  "path": "hf_model:docs/data/quality_gates.json",
1053
  "exists": true,
1054
  "bytes": 8097,
1055
+ "sha256": "1cdc8b4767b3ca88eada654a3117aa2de253fea7af62573b080088e8f1b311bd"
1056
  },
1057
  "hf_model": {
1058
  "path": "hf_model:metrics/quality_gates.json",
1059
  "exists": true,
1060
  "bytes": 8097,
1061
+ "sha256": "1cdc8b4767b3ca88eada654a3117aa2de253fea7af62573b080088e8f1b311bd"
1062
  }
1063
  },
1064
  "failures": []
 
1167
  "local": {
1168
  "path": "repo:docs/data/research_roadmap.json",
1169
  "exists": true,
1170
+ "bytes": 13450,
1171
+ "sha256": "668220626950f8e55410b8f829dba6306d0b2feedafdf0198800ad9814992d84"
1172
  },
1173
  "mirrors": {
1174
  "hf_space": {
1175
  "path": "hf_space:data/research_roadmap.json",
1176
  "exists": true,
1177
+ "bytes": 13450,
1178
+ "sha256": "668220626950f8e55410b8f829dba6306d0b2feedafdf0198800ad9814992d84"
1179
  },
1180
  "hf_artifacts_data": {
1181
  "path": "hf_artifacts:data/research_roadmap.json",
1182
  "exists": true,
1183
+ "bytes": 13450,
1184
+ "sha256": "668220626950f8e55410b8f829dba6306d0b2feedafdf0198800ad9814992d84"
1185
  },
1186
  "hf_artifacts": {
1187
  "path": "hf_artifacts:docs/data/research_roadmap.json",
1188
  "exists": true,
1189
+ "bytes": 13450,
1190
+ "sha256": "668220626950f8e55410b8f829dba6306d0b2feedafdf0198800ad9814992d84"
1191
  },
1192
  "hf_model_data": {
1193
  "path": "hf_model:data/research_roadmap.json",
1194
  "exists": true,
1195
+ "bytes": 13450,
1196
+ "sha256": "668220626950f8e55410b8f829dba6306d0b2feedafdf0198800ad9814992d84"
1197
  },
1198
  "hf_model_docs_data": {
1199
  "path": "hf_model:docs/data/research_roadmap.json",
1200
  "exists": true,
1201
+ "bytes": 13450,
1202
+ "sha256": "668220626950f8e55410b8f829dba6306d0b2feedafdf0198800ad9814992d84"
1203
  },
1204
  "hf_model": {
1205
  "path": "hf_model:metrics/research_roadmap.json",
1206
  "exists": true,
1207
+ "bytes": 13450,
1208
+ "sha256": "668220626950f8e55410b8f829dba6306d0b2feedafdf0198800ad9814992d84"
1209
  }
1210
  },
1211
  "failures": []
 
1216
  "local": {
1217
  "path": "repo:docs/data/research_roadmap_interactive.json",
1218
  "exists": true,
1219
+ "bytes": 145234,
1220
+ "sha256": "c11a51b33e2af6b1afaf42c3cec1c38c1aed912ebbc815f466c5f085adea6f6f"
1221
  },
1222
  "mirrors": {
1223
  "hf_space": {
1224
  "path": "hf_space:data/research_roadmap_interactive.json",
1225
  "exists": true,
1226
+ "bytes": 145234,
1227
+ "sha256": "c11a51b33e2af6b1afaf42c3cec1c38c1aed912ebbc815f466c5f085adea6f6f"
1228
  },
1229
  "hf_artifacts_data": {
1230
  "path": "hf_artifacts:data/research_roadmap_interactive.json",
1231
  "exists": true,
1232
+ "bytes": 145234,
1233
+ "sha256": "c11a51b33e2af6b1afaf42c3cec1c38c1aed912ebbc815f466c5f085adea6f6f"
1234
  },
1235
  "hf_artifacts": {
1236
  "path": "hf_artifacts:docs/data/research_roadmap_interactive.json",
1237
  "exists": true,
1238
+ "bytes": 145234,
1239
+ "sha256": "c11a51b33e2af6b1afaf42c3cec1c38c1aed912ebbc815f466c5f085adea6f6f"
1240
  },
1241
  "hf_model_data": {
1242
  "path": "hf_model:data/research_roadmap_interactive.json",
1243
  "exists": true,
1244
+ "bytes": 145234,
1245
+ "sha256": "c11a51b33e2af6b1afaf42c3cec1c38c1aed912ebbc815f466c5f085adea6f6f"
1246
  },
1247
  "hf_model_docs_data": {
1248
  "path": "hf_model:docs/data/research_roadmap_interactive.json",
1249
  "exists": true,
1250
+ "bytes": 145234,
1251
+ "sha256": "c11a51b33e2af6b1afaf42c3cec1c38c1aed912ebbc815f466c5f085adea6f6f"
1252
  },
1253
  "hf_model": {
1254
  "path": "hf_model:metrics/research_roadmap_interactive.json",
1255
  "exists": true,
1256
+ "bytes": 145234,
1257
+ "sha256": "c11a51b33e2af6b1afaf42c3cec1c38c1aed912ebbc815f466c5f085adea6f6f"
1258
  }
1259
  },
1260
  "failures": []
 
1412
  "local": {
1413
  "path": "repo:docs/data/scope_claims_audit.json",
1414
  "exists": true,
1415
+ "bytes": 21795,
1416
+ "sha256": "289f6522b8d7dd8efd98cd1fa5cf0e766710439b4caac9cd634973abfdc73846"
1417
  },
1418
  "mirrors": {
1419
  "hf_space": {
1420
  "path": "hf_space:data/scope_claims_audit.json",
1421
  "exists": true,
1422
+ "bytes": 21795,
1423
+ "sha256": "289f6522b8d7dd8efd98cd1fa5cf0e766710439b4caac9cd634973abfdc73846"
1424
  },
1425
  "hf_artifacts_data": {
1426
  "path": "hf_artifacts:data/scope_claims_audit.json",
1427
  "exists": true,
1428
+ "bytes": 21795,
1429
+ "sha256": "289f6522b8d7dd8efd98cd1fa5cf0e766710439b4caac9cd634973abfdc73846"
1430
  },
1431
  "hf_artifacts": {
1432
  "path": "hf_artifacts:docs/data/scope_claims_audit.json",
1433
  "exists": true,
1434
+ "bytes": 21795,
1435
+ "sha256": "289f6522b8d7dd8efd98cd1fa5cf0e766710439b4caac9cd634973abfdc73846"
1436
  },
1437
  "hf_model_data": {
1438
  "path": "hf_model:data/scope_claims_audit.json",
1439
  "exists": true,
1440
+ "bytes": 21795,
1441
+ "sha256": "289f6522b8d7dd8efd98cd1fa5cf0e766710439b4caac9cd634973abfdc73846"
1442
  },
1443
  "hf_model_docs_data": {
1444
  "path": "hf_model:docs/data/scope_claims_audit.json",
1445
  "exists": true,
1446
+ "bytes": 21795,
1447
+ "sha256": "289f6522b8d7dd8efd98cd1fa5cf0e766710439b4caac9cd634973abfdc73846"
1448
  },
1449
  "hf_model": {
1450
  "path": "hf_model:metrics/scope_claims_audit.json",
1451
  "exists": true,
1452
+ "bytes": 21795,
1453
+ "sha256": "289f6522b8d7dd8efd98cd1fa5cf0e766710439b4caac9cd634973abfdc73846"
1454
  }
1455
  },
1456
  "failures": []
 
1511
  "path": "repo:docs/data/source_alignment_audit.json",
1512
  "exists": true,
1513
  "bytes": 4432,
1514
+ "sha256": "b277329512546c334ce68deda9d7d2c8f6e5df3e04a28b5a2fcc6fd7aa5036dd"
1515
  },
1516
  "mirrors": {
1517
  "hf_space": {
1518
  "path": "hf_space:data/source_alignment_audit.json",
1519
  "exists": true,
1520
  "bytes": 4432,
1521
+ "sha256": "b277329512546c334ce68deda9d7d2c8f6e5df3e04a28b5a2fcc6fd7aa5036dd"
1522
  },
1523
  "hf_artifacts_data": {
1524
  "path": "hf_artifacts:data/source_alignment_audit.json",
1525
  "exists": true,
1526
  "bytes": 4432,
1527
+ "sha256": "b277329512546c334ce68deda9d7d2c8f6e5df3e04a28b5a2fcc6fd7aa5036dd"
1528
  },
1529
  "hf_artifacts": {
1530
  "path": "hf_artifacts:docs/data/source_alignment_audit.json",
1531
  "exists": true,
1532
  "bytes": 4432,
1533
+ "sha256": "b277329512546c334ce68deda9d7d2c8f6e5df3e04a28b5a2fcc6fd7aa5036dd"
1534
  },
1535
  "hf_model_data": {
1536
  "path": "hf_model:data/source_alignment_audit.json",
1537
  "exists": true,
1538
  "bytes": 4432,
1539
+ "sha256": "b277329512546c334ce68deda9d7d2c8f6e5df3e04a28b5a2fcc6fd7aa5036dd"
1540
  },
1541
  "hf_model_docs_data": {
1542
  "path": "hf_model:docs/data/source_alignment_audit.json",
1543
  "exists": true,
1544
  "bytes": 4432,
1545
+ "sha256": "b277329512546c334ce68deda9d7d2c8f6e5df3e04a28b5a2fcc6fd7aa5036dd"
1546
  },
1547
  "hf_model": {
1548
  "path": "hf_model:metrics/source_alignment_audit.json",
1549
  "exists": true,
1550
  "bytes": 4432,
1551
+ "sha256": "b277329512546c334ce68deda9d7d2c8f6e5df3e04a28b5a2fcc6fd7aa5036dd"
1552
  }
1553
  },
1554
  "failures": []
 
1559
  "local": {
1560
  "path": "repo:docs/data/summary_metrics.json",
1561
  "exists": true,
1562
+ "bytes": 27807,
1563
+ "sha256": "3a6a5ee59562ae189844cb4ba26d6e261c2f73a8e54bb6e2fbc3e307c2d123fa"
1564
  },
1565
  "mirrors": {
1566
  "hf_space": {
1567
  "path": "hf_space:data/summary_metrics.json",
1568
  "exists": true,
1569
+ "bytes": 27807,
1570
+ "sha256": "3a6a5ee59562ae189844cb4ba26d6e261c2f73a8e54bb6e2fbc3e307c2d123fa"
1571
  },
1572
  "hf_artifacts_data": {
1573
  "path": "hf_artifacts:data/summary_metrics.json",
1574
  "exists": true,
1575
+ "bytes": 27807,
1576
+ "sha256": "3a6a5ee59562ae189844cb4ba26d6e261c2f73a8e54bb6e2fbc3e307c2d123fa"
1577
  },
1578
  "hf_artifacts": {
1579
  "path": "hf_artifacts:docs/data/summary_metrics.json",
1580
  "exists": true,
1581
+ "bytes": 27807,
1582
+ "sha256": "3a6a5ee59562ae189844cb4ba26d6e261c2f73a8e54bb6e2fbc3e307c2d123fa"
1583
  },
1584
  "hf_model_data": {
1585
  "path": "hf_model:data/summary_metrics.json",
1586
  "exists": true,
1587
+ "bytes": 27807,
1588
+ "sha256": "3a6a5ee59562ae189844cb4ba26d6e261c2f73a8e54bb6e2fbc3e307c2d123fa"
1589
  },
1590
  "hf_model_docs_data": {
1591
  "path": "hf_model:docs/data/summary_metrics.json",
1592
  "exists": true,
1593
+ "bytes": 27807,
1594
+ "sha256": "3a6a5ee59562ae189844cb4ba26d6e261c2f73a8e54bb6e2fbc3e307c2d123fa"
1595
  },
1596
  "hf_model": {
1597
  "path": "hf_model:metrics/summary_metrics.json",
1598
  "exists": true,
1599
+ "bytes": 27807,
1600
+ "sha256": "3a6a5ee59562ae189844cb4ba26d6e261c2f73a8e54bb6e2fbc3e307c2d123fa"
1601
  }
1602
  },
1603
  "failures": []
 
1658
  "path": "repo:docs/data/task_surface_integrity.json",
1659
  "exists": true,
1660
  "bytes": 45779,
1661
+ "sha256": "71efc018d2ccb2ce4d2efc67adca9cae4e25b23b1abd96ca1cbb0ff7fc578f9b"
1662
  },
1663
  "mirrors": {
1664
  "hf_space": {
1665
  "path": "hf_space:data/task_surface_integrity.json",
1666
  "exists": true,
1667
  "bytes": 45779,
1668
+ "sha256": "71efc018d2ccb2ce4d2efc67adca9cae4e25b23b1abd96ca1cbb0ff7fc578f9b"
1669
  },
1670
  "hf_artifacts_data": {
1671
  "path": "hf_artifacts:data/task_surface_integrity.json",
1672
  "exists": true,
1673
  "bytes": 45779,
1674
+ "sha256": "71efc018d2ccb2ce4d2efc67adca9cae4e25b23b1abd96ca1cbb0ff7fc578f9b"
1675
  },
1676
  "hf_artifacts": {
1677
  "path": "hf_artifacts:docs/data/task_surface_integrity.json",
1678
  "exists": true,
1679
  "bytes": 45779,
1680
+ "sha256": "71efc018d2ccb2ce4d2efc67adca9cae4e25b23b1abd96ca1cbb0ff7fc578f9b"
1681
  },
1682
  "hf_model_data": {
1683
  "path": "hf_model:data/task_surface_integrity.json",
1684
  "exists": true,
1685
  "bytes": 45779,
1686
+ "sha256": "71efc018d2ccb2ce4d2efc67adca9cae4e25b23b1abd96ca1cbb0ff7fc578f9b"
1687
  },
1688
  "hf_model_docs_data": {
1689
  "path": "hf_model:docs/data/task_surface_integrity.json",
1690
  "exists": true,
1691
  "bytes": 45779,
1692
+ "sha256": "71efc018d2ccb2ce4d2efc67adca9cae4e25b23b1abd96ca1cbb0ff7fc578f9b"
1693
  },
1694
  "hf_model": {
1695
  "path": "hf_model:metrics/task_surface_integrity.json",
1696
  "exists": true,
1697
  "bytes": 45779,
1698
+ "sha256": "71efc018d2ccb2ce4d2efc67adca9cae4e25b23b1abd96ca1cbb0ff7fc578f9b"
1699
  }
1700
  },
1701
  "failures": []
 
1755
  "local": {
1756
  "path": "repo:docs/data/website_integrity.json",
1757
  "exists": true,
1758
+ "bytes": 15774,
1759
+ "sha256": "baafcf64bfdb562f588d5f8d2c6ee1a3cb7f381de50e6f8d9400409e07cc7116"
1760
  },
1761
  "mirrors": {
1762
  "hf_space": {
1763
  "path": "hf_space:data/website_integrity.json",
1764
  "exists": true,
1765
+ "bytes": 15774,
1766
+ "sha256": "baafcf64bfdb562f588d5f8d2c6ee1a3cb7f381de50e6f8d9400409e07cc7116"
1767
  },
1768
  "hf_artifacts_data": {
1769
  "path": "hf_artifacts:data/website_integrity.json",
1770
  "exists": true,
1771
+ "bytes": 15774,
1772
+ "sha256": "baafcf64bfdb562f588d5f8d2c6ee1a3cb7f381de50e6f8d9400409e07cc7116"
1773
  },
1774
  "hf_artifacts": {
1775
  "path": "hf_artifacts:docs/data/website_integrity.json",
1776
  "exists": true,
1777
+ "bytes": 15774,
1778
+ "sha256": "baafcf64bfdb562f588d5f8d2c6ee1a3cb7f381de50e6f8d9400409e07cc7116"
1779
  },
1780
  "hf_model_data": {
1781
  "path": "hf_model:data/website_integrity.json",
1782
  "exists": true,
1783
+ "bytes": 15774,
1784
+ "sha256": "baafcf64bfdb562f588d5f8d2c6ee1a3cb7f381de50e6f8d9400409e07cc7116"
1785
  },
1786
  "hf_model_docs_data": {
1787
  "path": "hf_model:docs/data/website_integrity.json",
1788
  "exists": true,
1789
+ "bytes": 15774,
1790
+ "sha256": "baafcf64bfdb562f588d5f8d2c6ee1a3cb7f381de50e6f8d9400409e07cc7116"
1791
  },
1792
  "hf_model": {
1793
  "path": "hf_model:metrics/website_integrity.json",
1794
  "exists": true,
1795
+ "bytes": 15774,
1796
+ "sha256": "baafcf64bfdb562f588d5f8d2c6ee1a3cb7f381de50e6f8d9400409e07cc7116"
1797
  }
1798
  },
1799
  "failures": []
 
2569
  "local": {
2570
  "path": "repo:scripts/omni/build_omni_model_comparison.py",
2571
  "exists": true,
2572
+ "bytes": 45217,
2573
+ "sha256": "14a326dbd62737ece3947b0af42685394c6018807f62f705f236dca17a5bee61"
2574
  },
2575
  "mirrors": {
2576
  "hf_artifacts": {
2577
  "path": "hf_artifacts:scripts/omni/build_omni_model_comparison.py",
2578
  "exists": true,
2579
+ "bytes": 45217,
2580
+ "sha256": "14a326dbd62737ece3947b0af42685394c6018807f62f705f236dca17a5bee61"
2581
  },
2582
  "hf_model": {
2583
  "path": "hf_model:scripts/omni/build_omni_model_comparison.py",
2584
  "exists": true,
2585
+ "bytes": 45217,
2586
+ "sha256": "14a326dbd62737ece3947b0af42685394c6018807f62f705f236dca17a5bee61"
2587
  }
2588
  },
2589
  "failures": []
 
2944
  "local": {
2945
  "path": "repo:scripts/build_artifact_index.py",
2946
  "exists": true,
2947
+ "bytes": 42809,
2948
+ "sha256": "c03d1b1367ad191fea0be3c634fddf8ee6fdc2118bf17396920c16cc288c4ef0"
2949
  },
2950
  "mirrors": {
2951
  "hf_artifacts": {
2952
  "path": "hf_artifacts:scripts/build_artifact_index.py",
2953
  "exists": true,
2954
+ "bytes": 42809,
2955
+ "sha256": "c03d1b1367ad191fea0be3c634fddf8ee6fdc2118bf17396920c16cc288c4ef0"
2956
  },
2957
  "hf_model": {
2958
  "path": "hf_model:scripts/build_artifact_index.py",
2959
  "exists": true,
2960
+ "bytes": 42809,
2961
+ "sha256": "c03d1b1367ad191fea0be3c634fddf8ee6fdc2118bf17396920c16cc288c4ef0"
2962
  }
2963
  },
2964
  "failures": []
 
3219
  "local": {
3220
  "path": "repo:scripts/verify_live_publication.py",
3221
  "exists": true,
3222
+ "bytes": 42394,
3223
+ "sha256": "43a70436108eea3fa6692096ebf318fb755ff040d925094ef9de018f212fde18"
3224
  },
3225
  "mirrors": {
3226
  "hf_artifacts": {
3227
  "path": "hf_artifacts:scripts/verify_live_publication.py",
3228
  "exists": true,
3229
+ "bytes": 42394,
3230
+ "sha256": "43a70436108eea3fa6692096ebf318fb755ff040d925094ef9de018f212fde18"
3231
  },
3232
  "hf_model": {
3233
  "path": "hf_model:scripts/verify_live_publication.py",
3234
  "exists": true,
3235
+ "bytes": 42394,
3236
+ "sha256": "43a70436108eea3fa6692096ebf318fb755ff040d925094ef9de018f212fde18"
3237
  }
3238
  },
3239
  "failures": []
 
3244
  "local": {
3245
  "path": "repo:scripts/validate_mirror_parity.py",
3246
  "exists": true,
3247
+ "bytes": 21032,
3248
+ "sha256": "cee8801466f5b0593e4a43258d8d8f90c4b87737bfcb966d8a72a010eae12c5b"
3249
  },
3250
  "mirrors": {
3251
  "hf_artifacts": {
3252
  "path": "hf_artifacts:scripts/validate_mirror_parity.py",
3253
  "exists": true,
3254
+ "bytes": 21032,
3255
+ "sha256": "cee8801466f5b0593e4a43258d8d8f90c4b87737bfcb966d8a72a010eae12c5b"
3256
  },
3257
  "hf_model": {
3258
  "path": "hf_model:scripts/validate_mirror_parity.py",
3259
  "exists": true,
3260
+ "bytes": 21032,
3261
+ "sha256": "cee8801466f5b0593e4a43258d8d8f90c4b87737bfcb966d8a72a010eae12c5b"
3262
  }
3263
  },
3264
  "failures": []
 
3294
  "local": {
3295
  "path": "repo:scripts/validate_scope_claims.py",
3296
  "exists": true,
3297
+ "bytes": 13424,
3298
+ "sha256": "080867b4e05c97e63506625b2063ad88f841dba7327a39fc41b17dbfcaa17f92"
3299
  },
3300
  "mirrors": {
3301
  "hf_artifacts": {
3302
  "path": "hf_artifacts:scripts/validate_scope_claims.py",
3303
  "exists": true,
3304
+ "bytes": 13424,
3305
+ "sha256": "080867b4e05c97e63506625b2063ad88f841dba7327a39fc41b17dbfcaa17f92"
3306
  },
3307
  "hf_model": {
3308
  "path": "hf_model:scripts/validate_scope_claims.py",
3309
  "exists": true,
3310
+ "bytes": 13424,
3311
+ "sha256": "080867b4e05c97e63506625b2063ad88f841dba7327a39fc41b17dbfcaa17f92"
3312
  }
3313
  },
3314
  "failures": []
 
3494
  "local": {
3495
  "path": "repo:docs/index.html",
3496
  "exists": true,
3497
+ "bytes": 186593,
3498
+ "sha256": "13094cd7413fb4dc5c4cdd5d6f72b45e2b4ef35bef5e36d28fa073998b1abfcb"
3499
  },
3500
  "mirrors": {
3501
  "hf_space": {
3502
  "path": "hf_space:index.html",
3503
  "exists": true,
3504
+ "bytes": 186593,
3505
+ "sha256": "13094cd7413fb4dc5c4cdd5d6f72b45e2b4ef35bef5e36d28fa073998b1abfcb"
3506
  },
3507
  "hf_artifacts_docs": {
3508
  "path": "hf_artifacts:docs/index.html",
3509
  "exists": true,
3510
+ "bytes": 186593,
3511
+ "sha256": "13094cd7413fb4dc5c4cdd5d6f72b45e2b4ef35bef5e36d28fa073998b1abfcb"
3512
  }
3513
  },
3514
  "failures": []
 
3780
  "local": {
3781
  "path": "repo:results/omni_finetune/OMNI_MODEL_COMPARISON.md",
3782
  "exists": true,
3783
+ "bytes": 15999,
3784
+ "sha256": "d5a7118a878b202adbc50e3436bbe134e5de139f2a9e97176efe9ecc0f446088"
3785
  },
3786
  "mirrors": {
3787
  "hf_space": {
3788
  "path": "hf_space:results/omni_finetune/OMNI_MODEL_COMPARISON.md",
3789
  "exists": true,
3790
+ "bytes": 15999,
3791
+ "sha256": "d5a7118a878b202adbc50e3436bbe134e5de139f2a9e97176efe9ecc0f446088"
3792
  },
3793
  "hf_artifacts": {
3794
  "path": "hf_artifacts:results/omni_finetune/OMNI_MODEL_COMPARISON.md",
3795
  "exists": true,
3796
+ "bytes": 15999,
3797
+ "sha256": "d5a7118a878b202adbc50e3436bbe134e5de139f2a9e97176efe9ecc0f446088"
3798
  },
3799
  "hf_model": {
3800
  "path": "hf_model:results/omni_finetune/OMNI_MODEL_COMPARISON.md",
3801
  "exists": true,
3802
+ "bytes": 15999,
3803
+ "sha256": "d5a7118a878b202adbc50e3436bbe134e5de139f2a9e97176efe9ecc0f446088"
3804
  }
3805
  },
3806
  "failures": []
 
3812
  "path": "repo:results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md",
3813
  "exists": true,
3814
  "bytes": 3253,
3815
+ "sha256": "b25f1d8cde814207b4c3234bf07140cf99a0ede29af3f53dbc146aab464e8a9b"
3816
  },
3817
  "mirrors": {
3818
  "hf_space": {
3819
  "path": "hf_space:results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md",
3820
  "exists": true,
3821
  "bytes": 3253,
3822
+ "sha256": "b25f1d8cde814207b4c3234bf07140cf99a0ede29af3f53dbc146aab464e8a9b"
3823
  },
3824
  "hf_artifacts": {
3825
  "path": "hf_artifacts:results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md",
3826
  "exists": true,
3827
  "bytes": 3253,
3828
+ "sha256": "b25f1d8cde814207b4c3234bf07140cf99a0ede29af3f53dbc146aab464e8a9b"
3829
  },
3830
  "hf_model": {
3831
  "path": "hf_model:results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md",
3832
  "exists": true,
3833
  "bytes": 3253,
3834
+ "sha256": "b25f1d8cde814207b4c3234bf07140cf99a0ede29af3f53dbc146aab464e8a9b"
3835
+ }
3836
+ },
3837
+ "failures": []
3838
+ },
3839
+ {
3840
+ "name": "results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md",
3841
+ "status": "pass",
3842
+ "local": {
3843
+ "path": "repo:results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md",
3844
+ "exists": true,
3845
+ "bytes": 1793,
3846
+ "sha256": "890430b05ace20375fc9534f923d170c0509037272ba4ef523e3ca2f3c9ac746"
3847
+ },
3848
+ "mirrors": {
3849
+ "hf_space": {
3850
+ "path": "hf_space:results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md",
3851
+ "exists": true,
3852
+ "bytes": 1793,
3853
+ "sha256": "890430b05ace20375fc9534f923d170c0509037272ba4ef523e3ca2f3c9ac746"
3854
+ },
3855
+ "hf_artifacts": {
3856
+ "path": "hf_artifacts:results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md",
3857
+ "exists": true,
3858
+ "bytes": 1793,
3859
+ "sha256": "890430b05ace20375fc9534f923d170c0509037272ba4ef523e3ca2f3c9ac746"
3860
+ },
3861
+ "hf_model": {
3862
+ "path": "hf_model:results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md",
3863
+ "exists": true,
3864
+ "bytes": 1793,
3865
+ "sha256": "890430b05ace20375fc9534f923d170c0509037272ba4ef523e3ca2f3c9ac746"
3866
  }
3867
  },
3868
  "failures": []
 
8052
  },
8053
  "failures": []
8054
  },
8055
+ {
8056
+ "name": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full/package_audit.json",
8057
+ "status": "pass",
8058
+ "local": {
8059
+ "path": "repo:results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full/package_audit.json",
8060
+ "exists": true,
8061
+ "bytes": 631,
8062
+ "sha256": "7cf478ae33c52bae0ba742e81da8e482e06d0853eecd85f895f447a708f81718"
8063
+ },
8064
+ "mirrors": {
8065
+ "hf_space": {
8066
+ "path": "hf_space:results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full/package_audit.json",
8067
+ "exists": true,
8068
+ "bytes": 631,
8069
+ "sha256": "7cf478ae33c52bae0ba742e81da8e482e06d0853eecd85f895f447a708f81718"
8070
+ },
8071
+ "hf_artifacts": {
8072
+ "path": "hf_artifacts:results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full/package_audit.json",
8073
+ "exists": true,
8074
+ "bytes": 631,
8075
+ "sha256": "7cf478ae33c52bae0ba742e81da8e482e06d0853eecd85f895f447a708f81718"
8076
+ },
8077
+ "hf_model": {
8078
+ "path": "hf_model:results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full/package_audit.json",
8079
+ "exists": true,
8080
+ "bytes": 631,
8081
+ "sha256": "7cf478ae33c52bae0ba742e81da8e482e06d0853eecd85f895f447a708f81718"
8082
+ }
8083
+ },
8084
+ "failures": []
8085
+ },
8086
  {
8087
  "name": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full/training/progress.jsonl",
8088
  "status": "pass",
 
11685
  "local": {
11686
  "path": "repo:RESEARCH_ROADMAP.md",
11687
  "exists": true,
11688
+ "bytes": 14503,
11689
+ "sha256": "bb06fb0ccf336cafb0305883b7f93c2e1af547c9ff04b1fa6fc87481d54bcf61"
11690
  },
11691
  "mirrors": {
11692
  "hf_space": {
11693
  "path": "hf_space:RESEARCH_ROADMAP.md",
11694
  "exists": true,
11695
+ "bytes": 14503,
11696
+ "sha256": "bb06fb0ccf336cafb0305883b7f93c2e1af547c9ff04b1fa6fc87481d54bcf61"
11697
  },
11698
  "hf_artifacts": {
11699
  "path": "hf_artifacts:RESEARCH_ROADMAP.md",
11700
  "exists": true,
11701
+ "bytes": 14503,
11702
+ "sha256": "bb06fb0ccf336cafb0305883b7f93c2e1af547c9ff04b1fa6fc87481d54bcf61"
11703
  },
11704
  "hf_model": {
11705
  "path": "hf_model:RESEARCH_ROADMAP.md",
11706
  "exists": true,
11707
+ "bytes": 14503,
11708
+ "sha256": "bb06fb0ccf336cafb0305883b7f93c2e1af547c9ff04b1fa6fc87481d54bcf61"
11709
  }
11710
  },
11711
  "failures": []
 
11716
  "local": {
11717
  "path": "repo:PROJECT_STATUS.md",
11718
  "exists": true,
11719
+ "bytes": 13755,
11720
+ "sha256": "342897ae05ceab83d626765c0052c140e414ba25ebda4fce9fb07bb37a2decef"
11721
  },
11722
  "mirrors": {
11723
  "hf_space": {
11724
  "path": "hf_space:PROJECT_STATUS.md",
11725
  "exists": true,
11726
+ "bytes": 13755,
11727
+ "sha256": "342897ae05ceab83d626765c0052c140e414ba25ebda4fce9fb07bb37a2decef"
11728
  },
11729
  "hf_artifacts": {
11730
  "path": "hf_artifacts:PROJECT_STATUS.md",
11731
  "exists": true,
11732
+ "bytes": 13755,
11733
+ "sha256": "342897ae05ceab83d626765c0052c140e414ba25ebda4fce9fb07bb37a2decef"
11734
  },
11735
  "hf_model": {
11736
  "path": "hf_model:PROJECT_STATUS.md",
11737
  "exists": true,
11738
+ "bytes": 13755,
11739
+ "sha256": "342897ae05ceab83d626765c0052c140e414ba25ebda4fce9fb07bb37a2decef"
11740
  }
11741
  },
11742
  "failures": []
metrics/omni_finetune_verified_result.json CHANGED
@@ -1,98 +1,96 @@
1
  {
2
- "title": "Verified Qwen3-Omni LoRA 128-Episode Held-Out Result",
3
- "status": "verified_full_128_episode_diagnostic_result",
4
- "status_date": "2026-06-08",
5
- "backbone": "Qwen/Qwen3-Omni-30B-A3B-Instruct",
6
- "adapter": "Qwen3-Omni LoRA",
7
- "dataset": "Ropedia Xperience-10M selected 128-episode pilot",
8
- "split_policy": {
9
- "unit": "episode",
10
- "selected_episode_counts": {
11
- "test": 16,
12
- "train": 96,
13
- "val": 16
14
- },
15
- "exported_window_counts": {
16
- "train": 2848,
17
- "val": 512,
18
- "test": 448
19
- },
20
- "exported_episode_counts": {
21
- "train": 89,
22
- "val": 16,
23
- "test": 14
24
- },
25
- "skipped_selected_episodes": 9,
26
- "leakage_policy": "Train, validation, and test are separated by episode/session; test windows are used only for held-out evaluation."
27
  },
28
- "training": {
29
- "num_processes": 8,
30
- "epochs": 4,
31
- "lora_rank": 16,
32
- "lora_alpha": 32,
33
- "lora_dropout": 0.05,
34
- "num_train_samples": 2848,
35
- "num_val_samples": 512,
36
- "history": [
37
- {
38
- "epoch": 1,
39
- "train_loss": 0.40796751019628613,
40
- "val_loss": 0.03258896619081497,
41
- "global_step": 356
42
- },
43
- {
44
- "epoch": 2,
45
- "train_loss": 0.027628723937453012,
46
- "val_loss": 0.027754632756114006,
47
- "global_step": 712
48
- },
49
- {
50
- "epoch": 3,
51
- "train_loss": 0.02446955946807781,
52
- "val_loss": 0.026343274861574173,
53
- "global_step": 1068
54
- },
55
- {
56
- "epoch": 4,
57
- "train_loss": 0.022728607045444712,
58
- "val_loss": 0.025629229843616486,
59
- "global_step": 1424
60
- }
61
- ],
62
- "loss": "answer-token cross entropy over supervised JSON tokens",
63
- "note": "This current Qwen3-Omni LoRA result is the v4 four-epoch full held-out evaluation on the selected 96/16/16 episode setup."
64
  },
65
- "evaluation": {
66
- "split": "test",
67
- "num_samples": 448,
68
- "held_out_episode_count": 14,
69
- "json_validity_rate": 1.0,
70
- "action_macro_f1": 0.0018678269676001454,
71
- "subtask_accuracy": 0.0,
72
- "transition_accuracy": 0.9732142857142857,
73
- "next_action_accuracy": 0.033482142857142856,
74
- "contact_accuracy": 0.7299107142857143,
75
- "object_micro_f1": 0.31099781500364165,
76
- "quality_target": {
77
- "json_validity_rate": 0.98,
78
- "status": "met"
79
- },
80
- "previous_strict_label_v3_action_macro_f1": 0.0021983997167007384,
81
- "previous_structured_json_v2_json_validity_rate": 0.9977678571428571
82
  },
83
- "interpretation": "This is the current verified Qwen3-Omni LoRA diagnostic result for the selected 128-episode setup. The v4 four-epoch package reaches 100% JSON validity and slightly improves next-action, contact, and object metrics versus the prior strict-label v3 package, while action and subtask classification remain weak on held-out episodes. Treat it as a diagnostic baseline and error-analysis source, not as a strong Xperience-10M action recognizer.",
84
- "public_package": {
85
- "path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full",
86
- "audit_status": "pass",
87
- "contains_raw_xperience10m_data": false,
88
- "contains_qwen_base_weights": false,
89
- "contains_lora_weights": false,
90
- "adapter_weights_repo": "cy0307/ropedia-qwen3-omni-lora-128ep"
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
91
  },
92
- "required_next_steps": [
93
- "Use the v4 predictions for action/subtask error analysis, unseen-label debugging, and hierarchical action-family scoring.",
94
- "Use TASK_SUITE_ENHANCEMENT_128.md and docs/data/task_suite_enhancement_128.json for the no-new-episode suite push before requesting more storage.",
95
- "Keep the existing Qwen LoRA adapter repository as the weight-bearing artifact and publish future Qwen v5 runs as separate verified packages.",
96
- "Use the verified Cosmos3-Super Forward-Dynamics LoRA package as a separate world-model branch: it updates adapter weights over camera-pose proxy future-vision-velocity targets, not Qwen-style JSON action labels."
97
- ]
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
98
  }
 
1
  {
2
+ "title": "Verified Qwen3-Omni LoRA 128-Episode Held-Out Result",
3
+ "status": "verified_latest_qwen3_v6_diagnostic_result",
4
+ "status_date": "2026-06-14",
5
+ "backbone": "Qwen/Qwen3-Omni-30B-A3B-Instruct",
6
+ "adapter": "Qwen3-Omni LoRA",
7
+ "dataset": "Ropedia Xperience-10M selected 128-episode pilot",
8
+ "split_policy": {
9
+ "unit": "episode",
10
+ "selected_episode_counts": {
11
+ "train": 96,
12
+ "val": 16,
13
+ "test": 16
 
 
 
 
 
 
 
 
 
 
 
 
 
14
  },
15
+ "exported_window_counts": {
16
+ "train": 25629,
17
+ "val": 4608,
18
+ "test": 4032
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
19
  },
20
+ "exported_episode_counts": {
21
+ "train": 89,
22
+ "val": 16,
23
+ "test": 14
 
 
 
 
 
 
 
 
 
 
 
 
 
24
  },
25
+ "skipped_selected_episodes": 9,
26
+ "leakage_policy": "Train, validation, and test are separated by episode/session; test windows are used only for held-out evaluation."
27
+ },
28
+ "training": {
29
+ "num_processes": 8,
30
+ "epochs": 2,
31
+ "lora_rank": 64,
32
+ "lora_alpha": 128,
33
+ "lora_dropout": 0.05,
34
+ "learning_rate": 0.00005,
35
+ "num_train_samples": 25629,
36
+ "num_val_samples": 2048,
37
+ "history": [
38
+ {
39
+ "epoch": 1,
40
+ "train_loss": 0.05208605339353295,
41
+ "val_loss": 0.026512427255511284,
42
+ "global_step": 3204
43
+ },
44
+ {
45
+ "epoch": 2,
46
+ "train_loss": 0.013760763933660042,
47
+ "val_loss": 0.032345958054065704,
48
+ "global_step": 6408
49
+ }
50
+ ],
51
+ "loss": "answer-token cross entropy over supervised JSON tokens",
52
+ "note": "This current Qwen3-Omni LoRA result is the v6 rank64/lr5e-5 dense multiscale held-out evaluation on the selected 96/16/16 episode setup."
53
+ },
54
+ "evaluation": {
55
+ "split": "test",
56
+ "num_samples": 4032,
57
+ "held_out_episode_count": 14,
58
+ "json_validity_rate": 0.9990079365079365,
59
+ "action_macro_f1": 0.0028830723979596335,
60
+ "subtask_accuracy": 0.0037313432835820895,
61
+ "transition_accuracy": 0.9898313492063492,
62
+ "next_action_accuracy": 0.04305335446381405,
63
+ "contact_accuracy": 0.8177083333333334,
64
+ "object_micro_f1": 0.3064982378331287,
65
+ "quality_target": {
66
+ "json_validity_rate": 0.98,
67
+ "status": "met"
68
  },
69
+ "previous_v5_json_validity_rate": 1.0,
70
+ "previous_v5_action_macro_f1": 0.002289711036077459,
71
+ "previous_v5_subtask_accuracy": 0.011194029850746268,
72
+ "previous_v5_next_action_accuracy": 0.053618594823032224,
73
+ "previous_v5_contact_accuracy": 0.7864583333333334,
74
+ "previous_v5_object_micro_f1": 0.31614599936244814
75
+ },
76
+ "interpretation": "This is the latest verified Qwen3-Omni LoRA diagnostic result for the selected 128-episode setup. The v6 rank64/lr5e-5 package keeps JSON validity above the 98% target and improves action macro-F1 and contact accuracy versus the pinned v5 release row, but slightly regresses JSON validity, subtask accuracy, next-action accuracy, transition accuracy, and object micro-F1. Treat it as the latest diagnostic branch, not as a globally stronger replacement for v5.",
77
+ "public_package": {
78
+ "path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
79
+ "audit_status": "pass",
80
+ "contains_raw_xperience10m_data": false,
81
+ "contains_qwen_base_weights": false,
82
+ "contains_lora_weights": false,
83
+ "adapter_weights_repo": "cy0307/ropedia-qwen3-omni-lora-128ep"
84
+ },
85
+ "release_policy": {
86
+ "latest_verified_qwen_row": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
87
+ "pinned_release_tag": "ropedia-xperience-10m-v5",
88
+ "pinned_release_reason": "v5 remains the prior stable release tag; v6 is published on main/HF as the latest verified branch and can receive a separate v6 release tag."
89
+ },
90
+ "required_next_steps": [
91
+ "Use results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md before deciding whether v6 should become a formal release tag.",
92
+ "Use the v6 predictions for action/contact error analysis, and compare v5 for subtask, next-action, and object regressions.",
93
+ "Keep full-parameter Qwen runs as feasibility gates until there is a storage plan for checkpoints or mergeable full-weight deltas.",
94
+ "Use the verified Cosmos3-Super Forward-Dynamics LoRA package as a separate world-model branch: it updates adapter weights over camera-pose proxy future-vision-velocity targets, not Qwen-style JSON action labels."
95
+ ]
96
  }
metrics/omni_model_comparison.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "title": "Ropedia Xperience-10M Current Result Versions and Model Groups",
3
- "generated_at_utc": "2026-06-12T18:14:48+00:00",
4
  "status": "pass",
5
  "version_count": 3,
6
  "model_group_count": 5,
@@ -588,8 +588,8 @@
588
  "global_step": 3204
589
  }
590
  ],
591
- "is_current": true,
592
- "weights_repository": "https://huggingface.co/cy0307/ropedia-qwen3-omni-lora-128ep"
593
  },
594
  {
595
  "id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
@@ -640,8 +640,8 @@
640
  "global_step": 6408
641
  }
642
  ],
643
- "is_current": false,
644
- "weights_repository": "historical diagnostic package; keep separate from the final 128-episode adapter repo"
645
  },
646
  {
647
  "id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
@@ -1202,8 +1202,8 @@
1202
  "global_step": 3204
1203
  }
1204
  ],
1205
- "is_current": true,
1206
- "weights_repository": "https://huggingface.co/cy0307/ropedia-qwen3-omni-lora-128ep"
1207
  },
1208
  {
1209
  "id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
@@ -1254,8 +1254,8 @@
1254
  "global_step": 6408
1255
  }
1256
  ],
1257
- "is_current": false,
1258
- "weights_repository": "historical diagnostic package; keep separate from the final 128-episode adapter repo"
1259
  },
1260
  {
1261
  "id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
@@ -1753,6 +1753,7 @@
1753
  "Cosmos3-Super now has both a 128-episode base-weight Reasoner evaluation on the JSON task and a fine-tuned forward-dynamics LoRA branch over camera-pose proxy targets."
1754
  ],
1755
  "pending": [
1756
- "Use the verified Qwen3 v5 dense multiscale full-eval package as the current Qwen row; older Qwen package rows remain historical diagnostics for comparison."
 
1757
  ]
1758
  }
 
1
  {
2
  "title": "Ropedia Xperience-10M Current Result Versions and Model Groups",
3
+ "generated_at_utc": "2026-06-13T17:41:35+00:00",
4
  "status": "pass",
5
  "version_count": 3,
6
  "model_group_count": 5,
 
588
  "global_step": 3204
589
  }
590
  ],
591
+ "is_current": false,
592
+ "weights_repository": "historical diagnostic package; keep separate from the final 128-episode adapter repo"
593
  },
594
  {
595
  "id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
 
640
  "global_step": 6408
641
  }
642
  ],
643
+ "is_current": true,
644
+ "weights_repository": "https://huggingface.co/cy0307/ropedia-qwen3-omni-lora-128ep"
645
  },
646
  {
647
  "id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
 
1202
  "global_step": 3204
1203
  }
1204
  ],
1205
+ "is_current": false,
1206
+ "weights_repository": "historical diagnostic package; keep separate from the final 128-episode adapter repo"
1207
  },
1208
  {
1209
  "id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
 
1254
  "global_step": 6408
1255
  }
1256
  ],
1257
+ "is_current": true,
1258
+ "weights_repository": "https://huggingface.co/cy0307/ropedia-qwen3-omni-lora-128ep"
1259
  },
1260
  {
1261
  "id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
 
1753
  "Cosmos3-Super now has both a 128-episode base-weight Reasoner evaluation on the JSON task and a fine-tuned forward-dynamics LoRA branch over camera-pose proxy targets."
1754
  ],
1755
  "pending": [
1756
+ "Use the verified Qwen3 v6 rank64/lr5e-5 dense multiscale full-eval package as the latest current Qwen row; the v5 release tag remains pinned as the previous verified release.",
1757
+ "Read results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md before claiming v6 is globally better than v5, because v6 improves action macro-F1 and contact accuracy but regresses subtask, next-action, object micro-F1, and JSON validity slightly."
1758
  ]
1759
  }
metrics/project_packet.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "title": "Ropedia Xperience-10M Task Suite Project Packet",
3
- "version": "2026-06-08",
4
  "scope_status": {
5
  "validated_data": "one public Xperience-10M sample episode",
6
  "aligned_frames": 5821,
@@ -12,7 +12,7 @@
12
  "raw_xperience10m_data_in_repo": false,
13
  "audio_feature_status": "Audio is one of the synchronized source modalities in the current task representation.",
14
  "qwen3_omni_32_episode_claim": false,
15
- "qwen3_omni_status": "The selected 96/16/16 Qwen3-Omni v4 final diagnostic result is verified, meets the strict-JSON target, and still has weak action/subtask metrics that guide the next error-analysis pass.",
16
  "cosmos3_super_forward_dynamics_lora_status": "The first Cosmos3-Super fine-tuned adapter branch is verified as a forward-dynamics LoRA over camera-pose proxy targets; it reports loss metrics, not JSON action-label accuracy.",
17
  "task_suite_enhancement_128_status": "Current no-new-episode enhancement pack recommends multiscale_20s10_40s20_80s40, hierarchical action/subtask targets, label-normalized scoring, and raw-feature shards before adding more episodes."
18
  },
@@ -118,7 +118,7 @@
118
  "scripts/omni/discover_xperience10m_sources.py",
119
  "docs/data/omni_finetune_verified_result.json"
120
  ],
121
- "readout": "The selected-episode held-out Qwen3-Omni final diagnostic result is verified and JSON-format reliability meets the 98% target. The same public comparison also includes the verified 128-episode baselines, Cosmos3-Nano compatibility result, Cosmos3-Super Reasoner evaluation, and Cosmos3-Super Forward-Dynamics LoRA package. The next milestone is action/subtask error analysis and stronger model-quality runs on the same split."
122
  },
123
  {
124
  "step": 9,
@@ -155,7 +155,7 @@
155
  "hf_model_baselines": "https://huggingface.co/cy0307/ropedia-xperience-10m-task-baselines"
156
  },
157
  "current_reading_notes": [
158
- "The first cross-episode Qwen3-Omni v4 diagnostic pilot is verified, but strong model quality is not yet shown; action/subtask metrics remain weak.",
159
  "The current 128-episode suite has a no-new-episode enhancement plan: multiscale_20s10_40s20_80s40 windows, hierarchical labels, label-normalized scoring, and raw-feature shard export.",
160
  "Cosmos3-Super Forward-Dynamics LoRA is verified as a loss-based world-model adapter branch, not as JSON action-token prediction.",
161
  "Older Qwen3-Omni setup artifacts are separate from the verified selected-episode diagnostic package.",
 
1
  {
2
  "title": "Ropedia Xperience-10M Task Suite Project Packet",
3
+ "version": "2026-06-14",
4
  "scope_status": {
5
  "validated_data": "one public Xperience-10M sample episode",
6
  "aligned_frames": 5821,
 
12
  "raw_xperience10m_data_in_repo": false,
13
  "audio_feature_status": "Audio is one of the synchronized source modalities in the current task representation.",
14
  "qwen3_omni_32_episode_claim": false,
15
+ "qwen3_omni_status": "The selected 96/16/16 Qwen3-Omni v6 diagnostic branch is verified, meets the strict-JSON target, improves action macro-F1/contact accuracy versus v5, and still has weak action/subtask metrics that guide the next error-analysis pass.",
16
  "cosmos3_super_forward_dynamics_lora_status": "The first Cosmos3-Super fine-tuned adapter branch is verified as a forward-dynamics LoRA over camera-pose proxy targets; it reports loss metrics, not JSON action-label accuracy.",
17
  "task_suite_enhancement_128_status": "Current no-new-episode enhancement pack recommends multiscale_20s10_40s20_80s40, hierarchical action/subtask targets, label-normalized scoring, and raw-feature shards before adding more episodes."
18
  },
 
118
  "scripts/omni/discover_xperience10m_sources.py",
119
  "docs/data/omni_finetune_verified_result.json"
120
  ],
121
+ "readout": "The selected-episode held-out Qwen3-Omni v6 diagnostic branch is verified and JSON-format reliability meets the 98% target. The same public comparison also includes the verified 128-episode baselines, Cosmos3-Nano compatibility result, Cosmos3-Super Reasoner evaluation, and Cosmos3-Super Forward-Dynamics LoRA package. The next milestone is action/subtask error analysis and stronger model-quality runs on the same split."
122
  },
123
  {
124
  "step": 9,
 
155
  "hf_model_baselines": "https://huggingface.co/cy0307/ropedia-xperience-10m-task-baselines"
156
  },
157
  "current_reading_notes": [
158
+ "The latest cross-episode Qwen3-Omni v6 diagnostic branch is verified, but strong model quality is not yet shown; action/subtask metrics remain weak and v5 remains stronger on several non-contact metrics.",
159
  "The current 128-episode suite has a no-new-episode enhancement plan: multiscale_20s10_40s20_80s40 windows, hierarchical labels, label-normalized scoring, and raw-feature shard export.",
160
  "Cosmos3-Super Forward-Dynamics LoRA is verified as a loss-based world-model adapter branch, not as JSON action-token prediction.",
161
  "Older Qwen3-Omni setup artifacts are separate from the verified selected-episode diagnostic package.",
metrics/public_surface_qa.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Public Project Surface",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-12T18:14:49+00:00",
5
  "scope": "Repo README, GitHub Pages HTML, Hugging Face Space card, artifact dataset card, and model card.",
6
  "checks": [
7
  {
@@ -18,7 +18,7 @@
18
  "website_integrity": {
19
  "exists": true,
20
  "status": "pass",
21
- "generated_at_utc": "2026-06-12T17:58:57+00:00"
22
  },
23
  "rendered_site_check": {
24
  "exists": true,
@@ -28,27 +28,27 @@
28
  "task_surface_integrity": {
29
  "exists": true,
30
  "status": "pass",
31
- "generated_at_utc": "2026-06-12T17:58:56+00:00"
32
  },
33
  "source_alignment": {
34
  "exists": true,
35
  "status": "pass",
36
- "generated_at_utc": "2026-06-12T17:58:56+00:00"
37
  },
38
  "scale_up_status": {
39
  "exists": true,
40
  "status": "pass",
41
- "generated_at_utc": "2026-06-12T17:58:56+00:00"
42
  },
43
  "publication_package": {
44
  "exists": true,
45
  "status": "pass",
46
- "generated_at_utc": "2026-06-12T17:59:02+00:00"
47
  },
48
  "mirror_parity": {
49
  "exists": true,
50
  "status": "pass",
51
- "generated_at_utc": "2026-06-12T17:59:14+00:00"
52
  },
53
  "live_publication": {
54
  "exists": true,
 
1
  {
2
  "title": "Ropedia Xperience-10M Public Project Surface",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-13T17:46:37+00:00",
5
  "scope": "Repo README, GitHub Pages HTML, Hugging Face Space card, artifact dataset card, and model card.",
6
  "checks": [
7
  {
 
18
  "website_integrity": {
19
  "exists": true,
20
  "status": "pass",
21
+ "generated_at_utc": "2026-06-13T17:46:31+00:00"
22
  },
23
  "rendered_site_check": {
24
  "exists": true,
 
28
  "task_surface_integrity": {
29
  "exists": true,
30
  "status": "pass",
31
+ "generated_at_utc": "2026-06-13T17:42:17+00:00"
32
  },
33
  "source_alignment": {
34
  "exists": true,
35
  "status": "pass",
36
+ "generated_at_utc": "2026-06-13T17:42:17+00:00"
37
  },
38
  "scale_up_status": {
39
  "exists": true,
40
  "status": "pass",
41
+ "generated_at_utc": "2026-06-13T17:46:07+00:00"
42
  },
43
  "publication_package": {
44
  "exists": true,
45
  "status": "pass",
46
+ "generated_at_utc": "2026-06-13T17:44:16+00:00"
47
  },
48
  "mirror_parity": {
49
  "exists": true,
50
  "status": "pass",
51
+ "generated_at_utc": "2026-06-12T18:14:59+00:00"
52
  },
53
  "live_publication": {
54
  "exists": true,
metrics/quality_gates.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-12T18:14:49+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
 
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-13T17:46:37+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
metrics/qwen3_full_parameter_gates.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "title": "Qwen3-Omni Full-Parameter Feasibility Gates",
3
- "generated_at_utc": "2026-06-12T18:14:48+00:00",
4
  "status": "pass",
5
  "decision": "full_parameter_feasible_for_guarded_short_runs_not_promoted",
6
  "interpretation": "The 2026-06-09 gates prove that Qwen3-Omni full-parameter FSDP can load, prepare, run backward/optimizer steps, and complete guarded pilots up to 128 optimizer steps on an 8-GPU remote worker. They do not prove a production full-parameter fine-tune, and they intentionally save no full checkpoints or public weights.",
 
1
  {
2
  "title": "Qwen3-Omni Full-Parameter Feasibility Gates",
3
+ "generated_at_utc": "2026-06-13T17:41:13+00:00",
4
  "status": "pass",
5
  "decision": "full_parameter_feasible_for_guarded_short_runs_not_promoted",
6
  "interpretation": "The 2026-06-09 gates prove that Qwen3-Omni full-parameter FSDP can load, prepare, run backward/optimizer steps, and complete guarded pilots up to 128 optimizer steps on an 8-GPU remote worker. They do not prove a production full-parameter fine-tune, and they intentionally save no full checkpoints or public weights.",
metrics/qwen3_v5_v6_comparison.json ADDED
@@ -0,0 +1,68 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "title": "Qwen3-Omni v5 versus v6 verified comparison",
3
+ "status": "pass",
4
+ "generated_at_utc": "2026-06-14T00:00:00+00:00",
5
+ "comparison_scope": "Verified Qwen3-Omni LoRA held-out test packages on the same dense multiscale selected 128-episode dataset.",
6
+ "release_policy": {
7
+ "latest_verified_qwen_row": "v6",
8
+ "pinned_release_tag": "ropedia-xperience-10m-v5",
9
+ "recommendation": "Publish v6 as the latest verified branch and create a separate v6 tag only if the project wants a formal experimental release; do not move the v5 tag."
10
+ },
11
+ "runs": {
12
+ "v5": {
13
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full",
14
+ "train_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora",
15
+ "epochs": 1,
16
+ "eval_samples": 4032,
17
+ "held_out_episode_count": 14,
18
+ "metrics": {
19
+ "json_validity_rate": 1.0,
20
+ "action_macro_f1": 0.002289711036077459,
21
+ "subtask_accuracy": 0.011194029850746268,
22
+ "transition_accuracy": 0.9908234126984127,
23
+ "next_action_accuracy": 0.053618594823032224,
24
+ "contact_accuracy": 0.7864583333333334,
25
+ "object_micro_f1": 0.31614599936244814
26
+ }
27
+ },
28
+ "v6": {
29
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
30
+ "train_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora",
31
+ "epochs": 2,
32
+ "lora_rank": 64,
33
+ "learning_rate": 0.00005,
34
+ "eval_samples": 4032,
35
+ "held_out_episode_count": 14,
36
+ "metrics": {
37
+ "json_validity_rate": 0.9990079365079365,
38
+ "action_macro_f1": 0.0028830723979596335,
39
+ "subtask_accuracy": 0.0037313432835820895,
40
+ "transition_accuracy": 0.9898313492063492,
41
+ "next_action_accuracy": 0.04305335446381405,
42
+ "contact_accuracy": 0.8177083333333334,
43
+ "object_micro_f1": 0.3064982378331287
44
+ }
45
+ }
46
+ },
47
+ "deltas_v6_minus_v5": {
48
+ "json_validity_rate": -0.0009920634920634888,
49
+ "action_macro_f1": 0.0005933613618821745,
50
+ "subtask_accuracy": -0.007462686567164178,
51
+ "transition_accuracy": -0.0009920634920634888,
52
+ "next_action_accuracy": -0.010565240359218173,
53
+ "contact_accuracy": 0.03125,
54
+ "object_micro_f1": -0.009647761529319436
55
+ },
56
+ "wins_for_v6": [
57
+ "action_macro_f1",
58
+ "contact_accuracy"
59
+ ],
60
+ "wins_for_v5": [
61
+ "json_validity_rate",
62
+ "subtask_accuracy",
63
+ "transition_accuracy",
64
+ "next_action_accuracy",
65
+ "object_micro_f1"
66
+ ],
67
+ "interpretation": "v6 is the newest verified Qwen LoRA branch and is better for action macro-F1 and contact accuracy, but v5 remains the safer pinned release row for JSON perfection, subtask/next-action accuracy, transition accuracy, and object micro-F1."
68
+ }
metrics/research_roadmap.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Research Roadmap",
3
  "summary": "Staged path from the public-sample task lab to verified Qwen3-Omni, Cosmos3-Nano, and Cosmos3-Super diagnostics, same-split 128-episode baseline alignment, a no-new-episode 128-suite enhancement pack, action/subtask error analysis, world/policy branches, and a future Xperience-native embodied foundation model.",
4
- "current_decision_point": "Push the current selected 128 episodes harder before requesting more storage: keep the public-sample task suite as the development harness, use the final verified selected-episode Qwen3-Omni v4 diagnostic result and same-split 128-episode simple/NN metadata baselines as structured-task references, read Cosmos3-Nano and Cosmos3-Super Forward-Dynamics LoRA as separate world-model results, export multiscale_20s10_40s20_80s40 windows plus hierarchical action/subtask targets, and defer policy-model experiments until robot-compatible targets are implemented. The Xperience Embodied Foundation Model is a later full-corpus pretraining goal, not a current result.",
5
  "additional_development_directions": {
6
  "source_document": "ADDITIONAL_DEVELOPMENT_DIRECTIONS.md",
7
  "source_json": "docs/data/additional_development_directions.json",
@@ -52,8 +52,8 @@
52
  },
53
  {
54
  "id": "qwen3_omni_lora_diagnostic_pilot",
55
- "name": "Qwen3-Omni LoRA Final Diagnostic Result",
56
- "status": "verified_baseline",
57
  "entry_condition": "Selected episodes are prepared locally with no train/test episode leakage.",
58
  "deliverables": [
59
  "dataset JSONL/media manifests",
@@ -64,10 +64,13 @@
64
  "metrics",
65
  "confusion matrices",
66
  "run report",
 
67
  "public LoRA adapter repo"
68
  ],
69
  "completion_evidence": [
70
  "docs/data/omni_finetune_verified_result.json",
 
 
71
  "results/omni_finetune/verified_public/",
72
  "dataset_manifest.json",
73
  "training_metadata.json",
 
1
  {
2
  "title": "Ropedia Xperience-10M Research Roadmap",
3
  "summary": "Staged path from the public-sample task lab to verified Qwen3-Omni, Cosmos3-Nano, and Cosmos3-Super diagnostics, same-split 128-episode baseline alignment, a no-new-episode 128-suite enhancement pack, action/subtask error analysis, world/policy branches, and a future Xperience-native embodied foundation model.",
4
+ "current_decision_point": "Push the current selected 128 episodes harder before requesting more storage: keep the public-sample task suite as the development harness, use the latest verified selected-episode Qwen3-Omni v6 diagnostic branch plus the pinned v5 row as structured-task references, read Cosmos3-Nano and Cosmos3-Super Forward-Dynamics LoRA as separate world-model results, continue with hierarchical action/subtask targets and label-normalized scoring, and defer policy-model experiments until robot-compatible targets are implemented. The Xperience Embodied Foundation Model is a later full-corpus pretraining goal, not a current result.",
5
  "additional_development_directions": {
6
  "source_document": "ADDITIONAL_DEVELOPMENT_DIRECTIONS.md",
7
  "source_json": "docs/data/additional_development_directions.json",
 
52
  },
53
  {
54
  "id": "qwen3_omni_lora_diagnostic_pilot",
55
+ "name": "Qwen3-Omni LoRA Latest Diagnostic Branch",
56
+ "status": "verified_latest_branch",
57
  "entry_condition": "Selected episodes are prepared locally with no train/test episode leakage.",
58
  "deliverables": [
59
  "dataset JSONL/media manifests",
 
64
  "metrics",
65
  "confusion matrices",
66
  "run report",
67
+ "v5/v6 comparison",
68
  "public LoRA adapter repo"
69
  ],
70
  "completion_evidence": [
71
  "docs/data/omni_finetune_verified_result.json",
72
+ "docs/data/qwen3_v5_v6_comparison.json",
73
+ "results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md",
74
  "results/omni_finetune/verified_public/",
75
  "dataset_manifest.json",
76
  "training_metadata.json",
metrics/source_alignment_audit.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-12T18:14:51+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
 
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-13T17:46:47+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
metrics/summary_metrics.json CHANGED
@@ -14,7 +14,7 @@
14
  "visualization.rrd"
15
  ],
16
  "access_status": "The gated Xperience-10M dataset is available for selected multi-episode pilot preparation.",
17
- "current_scope": "The selected-episode Qwen3-Omni diagnostic pilot is verified on the 96/16/16 split and meets the 98% target for JSON validity; action/subtask quality remains weak, so it is a structured-task baseline. Cosmos3-Nano future-window compatibility and Cosmos3-Super Forward-Dynamics LoRA are also verified as separate world-model diagnostics with different metrics."
18
  },
19
  "models": {
20
  "motion_action": {
 
14
  "visualization.rrd"
15
  ],
16
  "access_status": "The gated Xperience-10M dataset is available for selected multi-episode pilot preparation.",
17
+ "current_scope": "The selected-episode Qwen3-Omni v6 diagnostic branch is verified on the 96/16/16 split and meets the 98% target for JSON validity; action/subtask quality remains weak, so it is a structured-task baseline rather than a strong model-quality claim. v6 improves action macro-F1 and contact accuracy versus v5, while v5 remains stronger on JSON validity, subtask, next-action, transition, and object metrics. Cosmos3-Nano future-window compatibility and Cosmos3-Super Forward-Dynamics LoRA are also verified as separate world-model diagnostics with different metrics."
18
  },
19
  "models": {
20
  "motion_action": {
metrics/task_surface_integrity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-12T18:14:51+00:00",
4
  "summary": {
5
  "task_count": 12,
6
  "expected_task_count": 12,
@@ -64,45 +64,45 @@
64
  "observed": "timeline_action"
65
  },
66
  {
67
- "name": "timeline_action: public_field_output_short_is_human_readable",
68
  "status": "pass",
69
- "value": "current action class",
70
  "raw_hits": []
71
  },
72
  {
73
- "name": "timeline_action: public_field_plain_goal_is_human_readable",
74
  "status": "pass",
75
- "value": "Look at one short multimodal window and name what action is happening now.",
76
  "raw_hits": []
77
  },
78
  {
79
- "name": "timeline_action: public_field_display_name_is_human_readable",
80
  "status": "pass",
81
- "value": "Action Recognition",
82
  "raw_hits": []
83
  },
84
  {
85
- "name": "timeline_action: public_field_input_short_is_human_readable",
86
  "status": "pass",
87
- "value": "20-frame multimodal window",
88
  "raw_hits": []
89
  },
90
  {
91
- "name": "timeline_action: public_field_card_blurb_is_human_readable",
92
  "status": "pass",
93
- "value": "Recognize the current manipulation action from synchronized visual, motion, inertial, pose, and annotation context.",
94
  "raw_hits": []
95
  },
96
  {
97
- "name": "timeline_action: public_field_process_short_is_human_readable",
98
  "status": "pass",
99
- "value": "window features -> action label builder -> classifier",
100
  "raw_hits": []
101
  },
102
  {
103
- "name": "timeline_action: public_field_research_name_is_human_readable",
104
  "status": "pass",
105
- "value": "Egocentric Action Recognition",
106
  "raw_hits": []
107
  },
108
  {
@@ -184,45 +184,45 @@
184
  "observed": "timeline_subtask"
185
  },
186
  {
187
- "name": "timeline_subtask: public_field_output_short_is_human_readable",
188
  "status": "pass",
189
- "value": "current procedure step",
190
  "raw_hits": []
191
  },
192
  {
193
- "name": "timeline_subtask: public_field_plain_goal_is_human_readable",
194
  "status": "pass",
195
- "value": "Predict the higher-level task stage for the current window.",
196
  "raw_hits": []
197
  },
198
  {
199
- "name": "timeline_subtask: public_field_display_name_is_human_readable",
200
  "status": "pass",
201
- "value": "Procedure Step Recognition",
202
  "raw_hits": []
203
  },
204
  {
205
- "name": "timeline_subtask: public_field_input_short_is_human_readable",
206
  "status": "pass",
207
- "value": "20-frame multimodal window",
208
  "raw_hits": []
209
  },
210
  {
211
- "name": "timeline_subtask: public_field_card_blurb_is_human_readable",
212
  "status": "pass",
213
- "value": "Recognize the broader activity stage so fine actions become a readable procedure timeline.",
214
  "raw_hits": []
215
  },
216
  {
217
- "name": "timeline_subtask: public_field_process_short_is_human_readable",
218
  "status": "pass",
219
- "value": "window features -> subtask label builder -> classifier",
220
  "raw_hits": []
221
  },
222
  {
223
- "name": "timeline_subtask: public_field_research_name_is_human_readable",
224
  "status": "pass",
225
- "value": "Temporal Subtask Recognition",
226
  "raw_hits": []
227
  },
228
  {
@@ -304,45 +304,45 @@
304
  "observed": "transition_detection"
305
  },
306
  {
307
- "name": "transition_detection: public_field_output_short_is_human_readable",
308
  "status": "pass",
309
- "value": "boundary or steady",
310
  "raw_hits": []
311
  },
312
  {
313
- "name": "transition_detection: public_field_plain_goal_is_human_readable",
314
  "status": "pass",
315
- "value": "Detect whether the current window is near a boundary between actions.",
316
  "raw_hits": []
317
  },
318
  {
319
- "name": "transition_detection: public_field_display_name_is_human_readable",
320
  "status": "pass",
321
- "value": "Action Boundary Detection",
322
  "raw_hits": []
323
  },
324
  {
325
- "name": "transition_detection: public_field_input_short_is_human_readable",
326
  "status": "pass",
327
- "value": "current window with boundary target",
328
  "raw_hits": []
329
  },
330
  {
331
- "name": "transition_detection: public_field_card_blurb_is_human_readable",
332
  "status": "pass",
333
- "value": "Detect the local moment where the episode changes from one action segment to the next.",
334
  "raw_hits": []
335
  },
336
  {
337
- "name": "transition_detection: public_field_process_short_is_human_readable",
338
  "status": "pass",
339
- "value": "action changes -> boundary labels -> binary classifier",
340
  "raw_hits": []
341
  },
342
  {
343
- "name": "transition_detection: public_field_research_name_is_human_readable",
344
  "status": "pass",
345
- "value": "Temporal Action Segmentation",
346
  "raw_hits": []
347
  },
348
  {
@@ -422,45 +422,45 @@
422
  "observed": "next_action"
423
  },
424
  {
425
- "name": "next_action: public_field_output_short_is_human_readable",
426
  "status": "pass",
427
- "value": "action at t+20 frames",
428
  "raw_hits": []
429
  },
430
  {
431
- "name": "next_action: public_field_plain_goal_is_human_readable",
432
  "status": "pass",
433
- "value": "Use the current window to guess the action that will happen shortly after it.",
434
  "raw_hits": []
435
  },
436
  {
437
- "name": "next_action: public_field_display_name_is_human_readable",
438
  "status": "pass",
439
- "value": "Next-Action Prediction",
440
  "raw_hits": []
441
  },
442
  {
443
- "name": "next_action: public_field_input_short_is_human_readable",
444
  "status": "pass",
445
- "value": "current window at time t",
446
  "raw_hits": []
447
  },
448
  {
449
- "name": "next_action: public_field_card_blurb_is_human_readable",
450
  "status": "pass",
451
- "value": "Forecast the near-future action from the current observations only.",
452
  "raw_hits": []
453
  },
454
  {
455
- "name": "next_action: public_field_process_short_is_human_readable",
456
  "status": "pass",
457
- "value": "current features -> future label shift -> classifier",
458
  "raw_hits": []
459
  },
460
  {
461
- "name": "next_action: public_field_research_name_is_human_readable",
462
  "status": "pass",
463
- "value": "Short-Horizon Intention Prediction",
464
  "raw_hits": []
465
  },
466
  {
@@ -540,45 +540,45 @@
540
  "observed": "hand_trajectory_forecast"
541
  },
542
  {
543
- "name": "hand_trajectory_forecast: public_field_output_short_is_human_readable",
544
  "status": "pass",
545
- "value": "future hand-joint trajectory",
546
  "raw_hits": []
547
  },
548
  {
549
- "name": "hand_trajectory_forecast: public_field_plain_goal_is_human_readable",
550
  "status": "pass",
551
- "value": "Predict where the hands will move over the next few frames.",
552
  "raw_hits": []
553
  },
554
  {
555
- "name": "hand_trajectory_forecast: public_field_display_name_is_human_readable",
556
  "status": "pass",
557
- "value": "Hand Trajectory Forecasting",
558
  "raw_hits": []
559
  },
560
  {
561
- "name": "hand_trajectory_forecast: public_field_input_short_is_human_readable",
562
  "status": "pass",
563
- "value": "current multimodal window",
564
  "raw_hits": []
565
  },
566
  {
567
- "name": "hand_trajectory_forecast: public_field_card_blurb_is_human_readable",
568
  "status": "pass",
569
- "value": "Predict the future 3D left/right hand path from the current multimodal state.",
570
  "raw_hits": []
571
  },
572
  {
573
- "name": "hand_trajectory_forecast: public_field_process_short_is_human_readable",
574
  "status": "pass",
575
- "value": "current features -> future mocap target -> regression head",
576
  "raw_hits": []
577
  },
578
  {
579
- "name": "hand_trajectory_forecast: public_field_research_name_is_human_readable",
580
  "status": "pass",
581
- "value": "3D Hand Motion Forecasting",
582
  "raw_hits": []
583
  },
584
  {
@@ -658,45 +658,45 @@
658
  "observed": "contact_prediction"
659
  },
660
  {
661
- "name": "contact_prediction: public_field_output_short_is_human_readable",
662
  "status": "pass",
663
- "value": "contact or no contact",
664
  "raw_hits": []
665
  },
666
  {
667
- "name": "contact_prediction: public_field_plain_goal_is_human_readable",
668
  "status": "pass",
669
- "value": "Predict whether the body or hand is in contact with something.",
670
  "raw_hits": []
671
  },
672
  {
673
- "name": "contact_prediction: public_field_display_name_is_human_readable",
674
  "status": "pass",
675
- "value": "Contact State Prediction",
676
  "raw_hits": []
677
  },
678
  {
679
- "name": "contact_prediction: public_field_input_short_is_human_readable",
680
  "status": "pass",
681
- "value": "non-contact, non-caption features",
682
  "raw_hits": []
683
  },
684
  {
685
- "name": "contact_prediction: public_field_card_blurb_is_human_readable",
686
  "status": "pass",
687
- "value": "Predict whether body or hand contact with the scene is occurring without leaking contact labels.",
688
  "raw_hits": []
689
  },
690
  {
691
- "name": "contact_prediction: public_field_process_short_is_human_readable",
692
  "status": "pass",
693
- "value": "feature filter -> contact target -> binary classifier",
694
  "raw_hits": []
695
  },
696
  {
697
- "name": "contact_prediction: public_field_research_name_is_human_readable",
698
  "status": "pass",
699
- "value": "Human-Object Contact Prediction",
700
  "raw_hits": []
701
  },
702
  {
@@ -774,45 +774,45 @@
774
  "observed": "object_relevance"
775
  },
776
  {
777
- "name": "object_relevance: public_field_output_short_is_human_readable",
778
  "status": "pass",
779
- "value": "relevant object set",
780
  "raw_hits": []
781
  },
782
  {
783
- "name": "object_relevance: public_field_plain_goal_is_human_readable",
784
  "status": "pass",
785
- "value": "Predict which objects matter in the current window.",
786
  "raw_hits": []
787
  },
788
  {
789
- "name": "object_relevance: public_field_display_name_is_human_readable",
790
  "status": "pass",
791
- "value": "Object Relevance Prediction",
792
  "raw_hits": []
793
  },
794
  {
795
- "name": "object_relevance: public_field_input_short_is_human_readable",
796
  "status": "pass",
797
- "value": "non-caption multimodal features",
798
  "raw_hits": []
799
  },
800
  {
801
- "name": "object_relevance: public_field_card_blurb_is_human_readable",
802
  "status": "pass",
803
- "value": "Infer which objects are relevant to the current manipulation window from non-caption features.",
804
  "raw_hits": []
805
  },
806
  {
807
- "name": "object_relevance: public_field_process_short_is_human_readable",
808
  "status": "pass",
809
- "value": "object vocabulary -> multi-hot labels -> sigmoid heads",
810
  "raw_hits": []
811
  },
812
  {
813
- "name": "object_relevance: public_field_research_name_is_human_readable",
814
  "status": "pass",
815
- "value": "Object-Centric Interaction Recognition",
816
  "raw_hits": []
817
  },
818
  {
@@ -892,45 +892,45 @@
892
  "observed": "caption_grounding"
893
  },
894
  {
895
- "name": "caption_grounding: public_field_output_short_is_human_readable",
896
  "status": "pass",
897
- "value": "ranked matching moments",
898
  "raw_hits": []
899
  },
900
  {
901
- "name": "caption_grounding: public_field_plain_goal_is_human_readable",
902
  "status": "pass",
903
- "value": "Given a text-like query from annotation, find the matching time window.",
904
  "raw_hits": []
905
  },
906
  {
907
- "name": "caption_grounding: public_field_display_name_is_human_readable",
908
  "status": "pass",
909
- "value": "Language Grounding",
910
  "raw_hits": []
911
  },
912
  {
913
- "name": "caption_grounding: public_field_input_short_is_human_readable",
914
  "status": "pass",
915
- "value": "text-like query and candidate windows",
916
  "raw_hits": []
917
  },
918
  {
919
- "name": "caption_grounding: public_field_card_blurb_is_human_readable",
920
  "status": "pass",
921
- "value": "Retrieve the matching time window for an annotation-derived text query.",
922
  "raw_hits": []
923
  },
924
  {
925
- "name": "caption_grounding: public_field_process_short_is_human_readable",
926
  "status": "pass",
927
- "value": "query features -> candidate index -> cosine ranker",
928
  "raw_hits": []
929
  },
930
  {
931
- "name": "caption_grounding: public_field_research_name_is_human_readable",
932
  "status": "pass",
933
- "value": "Language-to-Moment Grounding",
934
  "raw_hits": []
935
  },
936
  {
@@ -1008,45 +1008,45 @@
1008
  "observed": "cross_modal_retrieval"
1009
  },
1010
  {
1011
- "name": "cross_modal_retrieval: public_field_output_short_is_human_readable",
1012
  "status": "pass",
1013
- "value": "ranked visual windows",
1014
  "raw_hits": []
1015
  },
1016
  {
1017
- "name": "cross_modal_retrieval: public_field_plain_goal_is_human_readable",
1018
  "status": "pass",
1019
- "value": "Use one group of modalities to retrieve the matching window from another group.",
1020
  "raw_hits": []
1021
  },
1022
  {
1023
- "name": "cross_modal_retrieval: public_field_display_name_is_human_readable",
1024
  "status": "pass",
1025
- "value": "Cross-Modal Retrieval",
1026
  "raw_hits": []
1027
  },
1028
  {
1029
- "name": "cross_modal_retrieval: public_field_input_short_is_human_readable",
1030
  "status": "pass",
1031
- "value": "motion/IMU/pose query; depth/video candidates",
1032
  "raw_hits": []
1033
  },
1034
  {
1035
- "name": "cross_modal_retrieval: public_field_card_blurb_is_human_readable",
1036
  "status": "pass",
1037
- "value": "Use motion, IMU, and camera-pose signals to retrieve the matching depth/video window.",
1038
  "raw_hits": []
1039
  },
1040
  {
1041
- "name": "cross_modal_retrieval: public_field_process_short_is_human_readable",
1042
  "status": "pass",
1043
- "value": "modality split -> projection -> nearest-neighbor ranker",
1044
  "raw_hits": []
1045
  },
1046
  {
1047
- "name": "cross_modal_retrieval: public_field_research_name_is_human_readable",
1048
  "status": "pass",
1049
- "value": "Multimodal Representation Retrieval",
1050
  "raw_hits": []
1051
  },
1052
  {
@@ -1126,45 +1126,45 @@
1126
  "observed": "modality_reconstruction"
1127
  },
1128
  {
1129
- "name": "modality_reconstruction: public_field_output_short_is_human_readable",
1130
  "status": "pass",
1131
- "value": "reconstructed depth/video vector",
1132
  "raw_hits": []
1133
  },
1134
  {
1135
- "name": "modality_reconstruction: public_field_plain_goal_is_human_readable",
1136
  "status": "pass",
1137
- "value": "Predict one modality feature block from other modality blocks.",
1138
  "raw_hits": []
1139
  },
1140
  {
1141
- "name": "modality_reconstruction: public_field_display_name_is_human_readable",
1142
  "status": "pass",
1143
- "value": "Cross-Modal Reconstruction",
1144
  "raw_hits": []
1145
  },
1146
  {
1147
- "name": "modality_reconstruction: public_field_input_short_is_human_readable",
1148
  "status": "pass",
1149
- "value": "motion, IMU, and camera/pose features",
1150
  "raw_hits": []
1151
  },
1152
  {
1153
- "name": "modality_reconstruction: public_field_card_blurb_is_human_readable",
1154
  "status": "pass",
1155
- "value": "Predict compressed depth/video feature vectors from motion, IMU, and camera-pose features.",
1156
  "raw_hits": []
1157
  },
1158
  {
1159
- "name": "modality_reconstruction: public_field_process_short_is_human_readable",
1160
  "status": "pass",
1161
- "value": "source-target split -> scaler -> regression head",
1162
  "raw_hits": []
1163
  },
1164
  {
1165
- "name": "modality_reconstruction: public_field_research_name_is_human_readable",
1166
  "status": "pass",
1167
- "value": "Modality Feature Reconstruction",
1168
  "raw_hits": []
1169
  },
1170
  {
@@ -1244,43 +1244,43 @@
1244
  "observed": "temporal_order"
1245
  },
1246
  {
1247
- "name": "temporal_order: public_field_output_short_is_human_readable",
1248
  "status": "pass",
1249
- "value": "correct or reversed",
1250
  "raw_hits": []
1251
  },
1252
  {
1253
- "name": "temporal_order: public_field_plain_goal_is_human_readable",
1254
  "status": "pass",
1255
- "value": "Tell whether two nearby windows are in the correct time order.",
1256
  "raw_hits": []
1257
  },
1258
  {
1259
- "name": "temporal_order: public_field_display_name_is_human_readable",
1260
  "status": "pass",
1261
  "value": "Temporal Order Verification",
1262
  "raw_hits": []
1263
  },
1264
  {
1265
- "name": "temporal_order: public_field_input_short_is_human_readable",
1266
  "status": "pass",
1267
- "value": "two adjacent windows plus difference vector",
1268
  "raw_hits": []
1269
  },
1270
  {
1271
- "name": "temporal_order: public_field_card_blurb_is_human_readable",
1272
  "status": "pass",
1273
- "value": "Tell whether two neighboring windows are in chronological order or reversed.",
1274
  "raw_hits": []
1275
  },
1276
  {
1277
- "name": "temporal_order: public_field_process_short_is_human_readable",
1278
  "status": "pass",
1279
- "value": "pair builder -> feature combiner -> binary classifier",
1280
  "raw_hits": []
1281
  },
1282
  {
1283
- "name": "temporal_order: public_field_research_name_is_human_readable",
1284
  "status": "pass",
1285
  "value": "Temporal Order Verification",
1286
  "raw_hits": []
@@ -1360,45 +1360,45 @@
1360
  "observed": "misalignment_detection"
1361
  },
1362
  {
1363
- "name": "misalignment_detection: public_field_output_short_is_human_readable",
1364
  "status": "pass",
1365
- "value": "aligned or shifted",
1366
  "raw_hits": []
1367
  },
1368
  {
1369
- "name": "misalignment_detection: public_field_plain_goal_is_human_readable",
1370
  "status": "pass",
1371
- "value": "Detect when modalities that should match are shifted out of sync.",
1372
  "raw_hits": []
1373
  },
1374
  {
1375
- "name": "misalignment_detection: public_field_display_name_is_human_readable",
1376
  "status": "pass",
1377
- "value": "Multimodal Synchronization Detection",
1378
  "raw_hits": []
1379
  },
1380
  {
1381
- "name": "misalignment_detection: public_field_input_short_is_human_readable",
1382
  "status": "pass",
1383
- "value": "motion-side and visual/depth-side feature groups",
1384
  "raw_hits": []
1385
  },
1386
  {
1387
- "name": "misalignment_detection: public_field_card_blurb_is_human_readable",
1388
  "status": "pass",
1389
- "value": "Detect whether motion and visual/depth streams have been artificially shifted out of sync.",
1390
  "raw_hits": []
1391
  },
1392
  {
1393
- "name": "misalignment_detection: public_field_process_short_is_human_readable",
1394
  "status": "pass",
1395
- "value": "aligned/shifted pairs -> feature combiner -> binary classifier",
1396
  "raw_hits": []
1397
  },
1398
  {
1399
- "name": "misalignment_detection: public_field_research_name_is_human_readable",
1400
  "status": "pass",
1401
- "value": "Cross-Modal Misalignment Detection",
1402
  "raw_hits": []
1403
  },
1404
  {
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-13T17:46:47+00:00",
4
  "summary": {
5
  "task_count": 12,
6
  "expected_task_count": 12,
 
64
  "observed": "timeline_action"
65
  },
66
  {
67
+ "name": "timeline_action: public_field_input_short_is_human_readable",
68
  "status": "pass",
69
+ "value": "20-frame multimodal window",
70
  "raw_hits": []
71
  },
72
  {
73
+ "name": "timeline_action: public_field_process_short_is_human_readable",
74
  "status": "pass",
75
+ "value": "window features -> action label builder -> classifier",
76
  "raw_hits": []
77
  },
78
  {
79
+ "name": "timeline_action: public_field_research_name_is_human_readable",
80
  "status": "pass",
81
+ "value": "Egocentric Action Recognition",
82
  "raw_hits": []
83
  },
84
  {
85
+ "name": "timeline_action: public_field_card_blurb_is_human_readable",
86
  "status": "pass",
87
+ "value": "Recognize the current manipulation action from synchronized visual, motion, inertial, pose, and annotation context.",
88
  "raw_hits": []
89
  },
90
  {
91
+ "name": "timeline_action: public_field_output_short_is_human_readable",
92
  "status": "pass",
93
+ "value": "current action class",
94
  "raw_hits": []
95
  },
96
  {
97
+ "name": "timeline_action: public_field_plain_goal_is_human_readable",
98
  "status": "pass",
99
+ "value": "Look at one short multimodal window and name what action is happening now.",
100
  "raw_hits": []
101
  },
102
  {
103
+ "name": "timeline_action: public_field_display_name_is_human_readable",
104
  "status": "pass",
105
+ "value": "Action Recognition",
106
  "raw_hits": []
107
  },
108
  {
 
184
  "observed": "timeline_subtask"
185
  },
186
  {
187
+ "name": "timeline_subtask: public_field_input_short_is_human_readable",
188
  "status": "pass",
189
+ "value": "20-frame multimodal window",
190
  "raw_hits": []
191
  },
192
  {
193
+ "name": "timeline_subtask: public_field_process_short_is_human_readable",
194
  "status": "pass",
195
+ "value": "window features -> subtask label builder -> classifier",
196
  "raw_hits": []
197
  },
198
  {
199
+ "name": "timeline_subtask: public_field_research_name_is_human_readable",
200
  "status": "pass",
201
+ "value": "Temporal Subtask Recognition",
202
  "raw_hits": []
203
  },
204
  {
205
+ "name": "timeline_subtask: public_field_card_blurb_is_human_readable",
206
  "status": "pass",
207
+ "value": "Recognize the broader activity stage so fine actions become a readable procedure timeline.",
208
  "raw_hits": []
209
  },
210
  {
211
+ "name": "timeline_subtask: public_field_output_short_is_human_readable",
212
  "status": "pass",
213
+ "value": "current procedure step",
214
  "raw_hits": []
215
  },
216
  {
217
+ "name": "timeline_subtask: public_field_plain_goal_is_human_readable",
218
  "status": "pass",
219
+ "value": "Predict the higher-level task stage for the current window.",
220
  "raw_hits": []
221
  },
222
  {
223
+ "name": "timeline_subtask: public_field_display_name_is_human_readable",
224
  "status": "pass",
225
+ "value": "Procedure Step Recognition",
226
  "raw_hits": []
227
  },
228
  {
 
304
  "observed": "transition_detection"
305
  },
306
  {
307
+ "name": "transition_detection: public_field_input_short_is_human_readable",
308
  "status": "pass",
309
+ "value": "current window with boundary target",
310
  "raw_hits": []
311
  },
312
  {
313
+ "name": "transition_detection: public_field_process_short_is_human_readable",
314
  "status": "pass",
315
+ "value": "action changes -> boundary labels -> binary classifier",
316
  "raw_hits": []
317
  },
318
  {
319
+ "name": "transition_detection: public_field_research_name_is_human_readable",
320
  "status": "pass",
321
+ "value": "Temporal Action Segmentation",
322
  "raw_hits": []
323
  },
324
  {
325
+ "name": "transition_detection: public_field_card_blurb_is_human_readable",
326
  "status": "pass",
327
+ "value": "Detect the local moment where the episode changes from one action segment to the next.",
328
  "raw_hits": []
329
  },
330
  {
331
+ "name": "transition_detection: public_field_output_short_is_human_readable",
332
  "status": "pass",
333
+ "value": "boundary or steady",
334
  "raw_hits": []
335
  },
336
  {
337
+ "name": "transition_detection: public_field_plain_goal_is_human_readable",
338
  "status": "pass",
339
+ "value": "Detect whether the current window is near a boundary between actions.",
340
  "raw_hits": []
341
  },
342
  {
343
+ "name": "transition_detection: public_field_display_name_is_human_readable",
344
  "status": "pass",
345
+ "value": "Action Boundary Detection",
346
  "raw_hits": []
347
  },
348
  {
 
422
  "observed": "next_action"
423
  },
424
  {
425
+ "name": "next_action: public_field_input_short_is_human_readable",
426
  "status": "pass",
427
+ "value": "current window at time t",
428
  "raw_hits": []
429
  },
430
  {
431
+ "name": "next_action: public_field_process_short_is_human_readable",
432
  "status": "pass",
433
+ "value": "current features -> future label shift -> classifier",
434
  "raw_hits": []
435
  },
436
  {
437
+ "name": "next_action: public_field_research_name_is_human_readable",
438
  "status": "pass",
439
+ "value": "Short-Horizon Intention Prediction",
440
  "raw_hits": []
441
  },
442
  {
443
+ "name": "next_action: public_field_card_blurb_is_human_readable",
444
  "status": "pass",
445
+ "value": "Forecast the near-future action from the current observations only.",
446
  "raw_hits": []
447
  },
448
  {
449
+ "name": "next_action: public_field_output_short_is_human_readable",
450
  "status": "pass",
451
+ "value": "action at t+20 frames",
452
  "raw_hits": []
453
  },
454
  {
455
+ "name": "next_action: public_field_plain_goal_is_human_readable",
456
  "status": "pass",
457
+ "value": "Use the current window to guess the action that will happen shortly after it.",
458
  "raw_hits": []
459
  },
460
  {
461
+ "name": "next_action: public_field_display_name_is_human_readable",
462
  "status": "pass",
463
+ "value": "Next-Action Prediction",
464
  "raw_hits": []
465
  },
466
  {
 
540
  "observed": "hand_trajectory_forecast"
541
  },
542
  {
543
+ "name": "hand_trajectory_forecast: public_field_input_short_is_human_readable",
544
  "status": "pass",
545
+ "value": "current multimodal window",
546
  "raw_hits": []
547
  },
548
  {
549
+ "name": "hand_trajectory_forecast: public_field_process_short_is_human_readable",
550
  "status": "pass",
551
+ "value": "current features -> future mocap target -> regression head",
552
  "raw_hits": []
553
  },
554
  {
555
+ "name": "hand_trajectory_forecast: public_field_research_name_is_human_readable",
556
  "status": "pass",
557
+ "value": "3D Hand Motion Forecasting",
558
  "raw_hits": []
559
  },
560
  {
561
+ "name": "hand_trajectory_forecast: public_field_card_blurb_is_human_readable",
562
  "status": "pass",
563
+ "value": "Predict the future 3D left/right hand path from the current multimodal state.",
564
  "raw_hits": []
565
  },
566
  {
567
+ "name": "hand_trajectory_forecast: public_field_output_short_is_human_readable",
568
  "status": "pass",
569
+ "value": "future hand-joint trajectory",
570
  "raw_hits": []
571
  },
572
  {
573
+ "name": "hand_trajectory_forecast: public_field_plain_goal_is_human_readable",
574
  "status": "pass",
575
+ "value": "Predict where the hands will move over the next few frames.",
576
  "raw_hits": []
577
  },
578
  {
579
+ "name": "hand_trajectory_forecast: public_field_display_name_is_human_readable",
580
  "status": "pass",
581
+ "value": "Hand Trajectory Forecasting",
582
  "raw_hits": []
583
  },
584
  {
 
658
  "observed": "contact_prediction"
659
  },
660
  {
661
+ "name": "contact_prediction: public_field_input_short_is_human_readable",
662
  "status": "pass",
663
+ "value": "non-contact, non-caption features",
664
  "raw_hits": []
665
  },
666
  {
667
+ "name": "contact_prediction: public_field_process_short_is_human_readable",
668
  "status": "pass",
669
+ "value": "feature filter -> contact target -> binary classifier",
670
  "raw_hits": []
671
  },
672
  {
673
+ "name": "contact_prediction: public_field_research_name_is_human_readable",
674
  "status": "pass",
675
+ "value": "Human-Object Contact Prediction",
676
  "raw_hits": []
677
  },
678
  {
679
+ "name": "contact_prediction: public_field_card_blurb_is_human_readable",
680
  "status": "pass",
681
+ "value": "Predict whether body or hand contact with the scene is occurring without leaking contact labels.",
682
  "raw_hits": []
683
  },
684
  {
685
+ "name": "contact_prediction: public_field_output_short_is_human_readable",
686
  "status": "pass",
687
+ "value": "contact or no contact",
688
  "raw_hits": []
689
  },
690
  {
691
+ "name": "contact_prediction: public_field_plain_goal_is_human_readable",
692
  "status": "pass",
693
+ "value": "Predict whether the body or hand is in contact with something.",
694
  "raw_hits": []
695
  },
696
  {
697
+ "name": "contact_prediction: public_field_display_name_is_human_readable",
698
  "status": "pass",
699
+ "value": "Contact State Prediction",
700
  "raw_hits": []
701
  },
702
  {
 
774
  "observed": "object_relevance"
775
  },
776
  {
777
+ "name": "object_relevance: public_field_input_short_is_human_readable",
778
  "status": "pass",
779
+ "value": "non-caption multimodal features",
780
  "raw_hits": []
781
  },
782
  {
783
+ "name": "object_relevance: public_field_process_short_is_human_readable",
784
  "status": "pass",
785
+ "value": "object vocabulary -> multi-hot labels -> sigmoid heads",
786
  "raw_hits": []
787
  },
788
  {
789
+ "name": "object_relevance: public_field_research_name_is_human_readable",
790
  "status": "pass",
791
+ "value": "Object-Centric Interaction Recognition",
792
  "raw_hits": []
793
  },
794
  {
795
+ "name": "object_relevance: public_field_card_blurb_is_human_readable",
796
  "status": "pass",
797
+ "value": "Infer which objects are relevant to the current manipulation window from non-caption features.",
798
  "raw_hits": []
799
  },
800
  {
801
+ "name": "object_relevance: public_field_output_short_is_human_readable",
802
  "status": "pass",
803
+ "value": "relevant object set",
804
  "raw_hits": []
805
  },
806
  {
807
+ "name": "object_relevance: public_field_plain_goal_is_human_readable",
808
  "status": "pass",
809
+ "value": "Predict which objects matter in the current window.",
810
  "raw_hits": []
811
  },
812
  {
813
+ "name": "object_relevance: public_field_display_name_is_human_readable",
814
  "status": "pass",
815
+ "value": "Object Relevance Prediction",
816
  "raw_hits": []
817
  },
818
  {
 
892
  "observed": "caption_grounding"
893
  },
894
  {
895
+ "name": "caption_grounding: public_field_input_short_is_human_readable",
896
  "status": "pass",
897
+ "value": "text-like query and candidate windows",
898
  "raw_hits": []
899
  },
900
  {
901
+ "name": "caption_grounding: public_field_process_short_is_human_readable",
902
  "status": "pass",
903
+ "value": "query features -> candidate index -> cosine ranker",
904
  "raw_hits": []
905
  },
906
  {
907
+ "name": "caption_grounding: public_field_research_name_is_human_readable",
908
  "status": "pass",
909
+ "value": "Language-to-Moment Grounding",
910
  "raw_hits": []
911
  },
912
  {
913
+ "name": "caption_grounding: public_field_card_blurb_is_human_readable",
914
  "status": "pass",
915
+ "value": "Retrieve the matching time window for an annotation-derived text query.",
916
  "raw_hits": []
917
  },
918
  {
919
+ "name": "caption_grounding: public_field_output_short_is_human_readable",
920
  "status": "pass",
921
+ "value": "ranked matching moments",
922
  "raw_hits": []
923
  },
924
  {
925
+ "name": "caption_grounding: public_field_plain_goal_is_human_readable",
926
  "status": "pass",
927
+ "value": "Given a text-like query from annotation, find the matching time window.",
928
  "raw_hits": []
929
  },
930
  {
931
+ "name": "caption_grounding: public_field_display_name_is_human_readable",
932
  "status": "pass",
933
+ "value": "Language Grounding",
934
  "raw_hits": []
935
  },
936
  {
 
1008
  "observed": "cross_modal_retrieval"
1009
  },
1010
  {
1011
+ "name": "cross_modal_retrieval: public_field_input_short_is_human_readable",
1012
  "status": "pass",
1013
+ "value": "motion/IMU/pose query; depth/video candidates",
1014
  "raw_hits": []
1015
  },
1016
  {
1017
+ "name": "cross_modal_retrieval: public_field_process_short_is_human_readable",
1018
  "status": "pass",
1019
+ "value": "modality split -> projection -> nearest-neighbor ranker",
1020
  "raw_hits": []
1021
  },
1022
  {
1023
+ "name": "cross_modal_retrieval: public_field_research_name_is_human_readable",
1024
  "status": "pass",
1025
+ "value": "Multimodal Representation Retrieval",
1026
  "raw_hits": []
1027
  },
1028
  {
1029
+ "name": "cross_modal_retrieval: public_field_card_blurb_is_human_readable",
1030
  "status": "pass",
1031
+ "value": "Use motion, IMU, and camera-pose signals to retrieve the matching depth/video window.",
1032
  "raw_hits": []
1033
  },
1034
  {
1035
+ "name": "cross_modal_retrieval: public_field_output_short_is_human_readable",
1036
  "status": "pass",
1037
+ "value": "ranked visual windows",
1038
  "raw_hits": []
1039
  },
1040
  {
1041
+ "name": "cross_modal_retrieval: public_field_plain_goal_is_human_readable",
1042
  "status": "pass",
1043
+ "value": "Use one group of modalities to retrieve the matching window from another group.",
1044
  "raw_hits": []
1045
  },
1046
  {
1047
+ "name": "cross_modal_retrieval: public_field_display_name_is_human_readable",
1048
  "status": "pass",
1049
+ "value": "Cross-Modal Retrieval",
1050
  "raw_hits": []
1051
  },
1052
  {
 
1126
  "observed": "modality_reconstruction"
1127
  },
1128
  {
1129
+ "name": "modality_reconstruction: public_field_input_short_is_human_readable",
1130
  "status": "pass",
1131
+ "value": "motion, IMU, and camera/pose features",
1132
  "raw_hits": []
1133
  },
1134
  {
1135
+ "name": "modality_reconstruction: public_field_process_short_is_human_readable",
1136
  "status": "pass",
1137
+ "value": "source-target split -> scaler -> regression head",
1138
  "raw_hits": []
1139
  },
1140
  {
1141
+ "name": "modality_reconstruction: public_field_research_name_is_human_readable",
1142
  "status": "pass",
1143
+ "value": "Modality Feature Reconstruction",
1144
  "raw_hits": []
1145
  },
1146
  {
1147
+ "name": "modality_reconstruction: public_field_card_blurb_is_human_readable",
1148
  "status": "pass",
1149
+ "value": "Predict compressed depth/video feature vectors from motion, IMU, and camera-pose features.",
1150
  "raw_hits": []
1151
  },
1152
  {
1153
+ "name": "modality_reconstruction: public_field_output_short_is_human_readable",
1154
  "status": "pass",
1155
+ "value": "reconstructed depth/video vector",
1156
  "raw_hits": []
1157
  },
1158
  {
1159
+ "name": "modality_reconstruction: public_field_plain_goal_is_human_readable",
1160
  "status": "pass",
1161
+ "value": "Predict one modality feature block from other modality blocks.",
1162
  "raw_hits": []
1163
  },
1164
  {
1165
+ "name": "modality_reconstruction: public_field_display_name_is_human_readable",
1166
  "status": "pass",
1167
+ "value": "Cross-Modal Reconstruction",
1168
  "raw_hits": []
1169
  },
1170
  {
 
1244
  "observed": "temporal_order"
1245
  },
1246
  {
1247
+ "name": "temporal_order: public_field_input_short_is_human_readable",
1248
  "status": "pass",
1249
+ "value": "two adjacent windows plus difference vector",
1250
  "raw_hits": []
1251
  },
1252
  {
1253
+ "name": "temporal_order: public_field_process_short_is_human_readable",
1254
  "status": "pass",
1255
+ "value": "pair builder -> feature combiner -> binary classifier",
1256
  "raw_hits": []
1257
  },
1258
  {
1259
+ "name": "temporal_order: public_field_research_name_is_human_readable",
1260
  "status": "pass",
1261
  "value": "Temporal Order Verification",
1262
  "raw_hits": []
1263
  },
1264
  {
1265
+ "name": "temporal_order: public_field_card_blurb_is_human_readable",
1266
  "status": "pass",
1267
+ "value": "Tell whether two neighboring windows are in chronological order or reversed.",
1268
  "raw_hits": []
1269
  },
1270
  {
1271
+ "name": "temporal_order: public_field_output_short_is_human_readable",
1272
  "status": "pass",
1273
+ "value": "correct or reversed",
1274
  "raw_hits": []
1275
  },
1276
  {
1277
+ "name": "temporal_order: public_field_plain_goal_is_human_readable",
1278
  "status": "pass",
1279
+ "value": "Tell whether two nearby windows are in the correct time order.",
1280
  "raw_hits": []
1281
  },
1282
  {
1283
+ "name": "temporal_order: public_field_display_name_is_human_readable",
1284
  "status": "pass",
1285
  "value": "Temporal Order Verification",
1286
  "raw_hits": []
 
1360
  "observed": "misalignment_detection"
1361
  },
1362
  {
1363
+ "name": "misalignment_detection: public_field_input_short_is_human_readable",
1364
  "status": "pass",
1365
+ "value": "motion-side and visual/depth-side feature groups",
1366
  "raw_hits": []
1367
  },
1368
  {
1369
+ "name": "misalignment_detection: public_field_process_short_is_human_readable",
1370
  "status": "pass",
1371
+ "value": "aligned/shifted pairs -> feature combiner -> binary classifier",
1372
  "raw_hits": []
1373
  },
1374
  {
1375
+ "name": "misalignment_detection: public_field_research_name_is_human_readable",
1376
  "status": "pass",
1377
+ "value": "Cross-Modal Misalignment Detection",
1378
  "raw_hits": []
1379
  },
1380
  {
1381
+ "name": "misalignment_detection: public_field_card_blurb_is_human_readable",
1382
  "status": "pass",
1383
+ "value": "Detect whether motion and visual/depth streams have been artificially shifted out of sync.",
1384
  "raw_hits": []
1385
  },
1386
  {
1387
+ "name": "misalignment_detection: public_field_output_short_is_human_readable",
1388
  "status": "pass",
1389
+ "value": "aligned or shifted",
1390
  "raw_hits": []
1391
  },
1392
  {
1393
+ "name": "misalignment_detection: public_field_plain_goal_is_human_readable",
1394
  "status": "pass",
1395
+ "value": "Detect when modalities that should match are shifted out of sync.",
1396
  "raw_hits": []
1397
  },
1398
  {
1399
+ "name": "misalignment_detection: public_field_display_name_is_human_readable",
1400
  "status": "pass",
1401
+ "value": "Multimodal Synchronization Detection",
1402
  "raw_hits": []
1403
  },
1404
  {
metrics/website_integrity.json CHANGED
@@ -1,13 +1,13 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-12T18:14:52+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
7
  "html_pages": 4,
8
  "local_references": 142,
9
  "external_reference_count": 111,
10
- "json_files": 37,
11
  "image_assets_referenced": 22,
12
  "failure_count": 0
13
  },
@@ -75,7 +75,7 @@
75
  "status": "pass",
76
  "reason": "The project overview should appear before the deeper progress ledger.",
77
  "overview_index": 67412,
78
- "evidence_index": 93026
79
  },
80
  {
81
  "name": "project_status_links_json",
@@ -137,7 +137,7 @@
137
  "statuses": [
138
  "implemented",
139
  "implemented_for_first_pilot",
140
- "verified_baseline",
141
  "verified_companion_result",
142
  "current",
143
  "active_next_step",
@@ -154,8 +154,8 @@
154
  "status": "pass",
155
  "reason": "The evaluation protocol should appear before the deeper evidence ledger.",
156
  "overview_index": 67412,
157
- "protocol_index": 89208,
158
- "evidence_index": 93026
159
  },
160
  {
161
  "name": "evaluation_protocol_links_json",
@@ -253,7 +253,7 @@
253
  },
254
  {
255
  "path": "data/artifact_index.json",
256
- "bytes": 87230,
257
  "top_level_type": "dict"
258
  },
259
  {
@@ -303,12 +303,12 @@
303
  },
304
  {
305
  "path": "data/omni_finetune_verified_result.json",
306
- "bytes": 4431,
307
  "top_level_type": "dict"
308
  },
309
  {
310
  "path": "data/omni_model_comparison.json",
311
- "bytes": 81593,
312
  "top_level_type": "dict"
313
  },
314
  {
@@ -323,12 +323,12 @@
323
  },
324
  {
325
  "path": "data/project_packet.json",
326
- "bytes": 10496,
327
  "top_level_type": "dict"
328
  },
329
  {
330
  "path": "data/project_status.json",
331
- "bytes": 23198,
332
  "top_level_type": "dict"
333
  },
334
  {
@@ -351,6 +351,11 @@
351
  "bytes": 12183,
352
  "top_level_type": "dict"
353
  },
 
 
 
 
 
354
  {
355
  "path": "data/rendered_site_check.json",
356
  "bytes": 4032,
@@ -373,12 +378,12 @@
373
  },
374
  {
375
  "path": "data/research_roadmap.json",
376
- "bytes": 13313,
377
  "top_level_type": "dict"
378
  },
379
  {
380
  "path": "data/research_roadmap_interactive.json",
381
- "bytes": 145083,
382
  "top_level_type": "dict"
383
  },
384
  {
@@ -388,7 +393,7 @@
388
  },
389
  {
390
  "path": "data/scope_claims_audit.json",
391
- "bytes": 21325,
392
  "top_level_type": "dict"
393
  },
394
  {
@@ -403,7 +408,7 @@
403
  },
404
  {
405
  "path": "data/summary_metrics.json",
406
- "bytes": 27604,
407
  "top_level_type": "dict"
408
  },
409
  {
@@ -423,7 +428,7 @@
423
  },
424
  {
425
  "path": "data/website_integrity.json",
426
- "bytes": 15654,
427
  "top_level_type": "dict"
428
  },
429
  {
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-13T17:46:49+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
7
  "html_pages": 4,
8
  "local_references": 142,
9
  "external_reference_count": 111,
10
+ "json_files": 38,
11
  "image_assets_referenced": 22,
12
  "failure_count": 0
13
  },
 
75
  "status": "pass",
76
  "reason": "The project overview should appear before the deeper progress ledger.",
77
  "overview_index": 67412,
78
+ "evidence_index": 93059
79
  },
80
  {
81
  "name": "project_status_links_json",
 
137
  "statuses": [
138
  "implemented",
139
  "implemented_for_first_pilot",
140
+ "verified_latest_branch",
141
  "verified_companion_result",
142
  "current",
143
  "active_next_step",
 
154
  "status": "pass",
155
  "reason": "The evaluation protocol should appear before the deeper evidence ledger.",
156
  "overview_index": 67412,
157
+ "protocol_index": 89241,
158
+ "evidence_index": 93059
159
  },
160
  {
161
  "name": "evaluation_protocol_links_json",
 
253
  },
254
  {
255
  "path": "data/artifact_index.json",
256
+ "bytes": 88913,
257
  "top_level_type": "dict"
258
  },
259
  {
 
303
  },
304
  {
305
  "path": "data/omni_finetune_verified_result.json",
306
+ "bytes": 4325,
307
  "top_level_type": "dict"
308
  },
309
  {
310
  "path": "data/omni_model_comparison.json",
311
+ "bytes": 81866,
312
  "top_level_type": "dict"
313
  },
314
  {
 
323
  },
324
  {
325
  "path": "data/project_packet.json",
326
+ "bytes": 10597,
327
  "top_level_type": "dict"
328
  },
329
  {
330
  "path": "data/project_status.json",
331
+ "bytes": 23535,
332
  "top_level_type": "dict"
333
  },
334
  {
 
351
  "bytes": 12183,
352
  "top_level_type": "dict"
353
  },
354
+ {
355
+ "path": "data/qwen3_v5_v6_comparison.json",
356
+ "bytes": 2814,
357
+ "top_level_type": "dict"
358
+ },
359
  {
360
  "path": "data/rendered_site_check.json",
361
  "bytes": 4032,
 
378
  },
379
  {
380
  "path": "data/research_roadmap.json",
381
+ "bytes": 13450,
382
  "top_level_type": "dict"
383
  },
384
  {
385
  "path": "data/research_roadmap_interactive.json",
386
+ "bytes": 145234,
387
  "top_level_type": "dict"
388
  },
389
  {
 
393
  },
394
  {
395
  "path": "data/scope_claims_audit.json",
396
+ "bytes": 21795,
397
  "top_level_type": "dict"
398
  },
399
  {
 
408
  },
409
  {
410
  "path": "data/summary_metrics.json",
411
+ "bytes": 27807,
412
  "top_level_type": "dict"
413
  },
414
  {
 
428
  },
429
  {
430
  "path": "data/website_integrity.json",
431
+ "bytes": 15774,
432
  "top_level_type": "dict"
433
  },
434
  {