cy0307 commited on
Commit
f590137
·
verified ·
1 Parent(s): 4bf9c22

Add files using upload-large-folder tool

Browse files
PROJECT_BRIEF.md CHANGED
@@ -1,18 +1,18 @@
1
  # Project Brief
2
 
3
- This project turns the public Ropedia Xperience-10M sample into a concrete
4
- research task lab for embodied AI. It is designed to answer a practical
5
- question: what can be built, measured, and extended from a richly synchronized
6
- egocentric episode before scaling to held-out multi-episode training?
7
 
8
  ## Research Intent
9
 
10
- The public sample is treated as a small but real research system. The project
11
- does not try to inflate one episode into a final benchmark. Instead, it shows
12
- the full path from data inspection to task design, baseline modeling,
13
- evaluation, artifact packaging, and a guarded scale-up plan. A reader should be
14
- able to trace one model input, understand each task, reproduce the public-sample
15
- results, and see what remains before multi-episode model-quality claims.
16
 
17
  ## Capability Map
18
 
@@ -21,23 +21,25 @@ results, and see what remains before multi-episode model-quality claims.
21
  | Data understanding | `feature_manifest.json`, `available_modalities.json`, modality atlas, episode-window HF viewer |
22
  | Task design | 20 unified task contracts, task cards, case-study walkthroughs, and four research-direction extension probes |
23
  | Evaluation rigor | chronological split, per-task metrics, predictions, confusion matrices, leakage notes, and generated takeaways |
24
- | Scale-up planning | Final verified 96/16/16 Qwen3-Omni diagnostic result, same-split 128-episode baseline alignment, Cosmos3-Nano compatibility branch, and policy-model candidates after action-space conversion |
25
 
26
  ## What Exists Now
27
 
28
- | Layer | Current artifact |
29
  | --- | --- |
30
- | Data unit | 1 public sample episode, 5,821 frames, 1,161 synchronized 20-frame windows |
 
31
  | Modalities | Video-derived features, audio, depth, pose/SLAM, mocap, IMU, calibration, and language-derived features |
32
  | Task suite | 20 embodied-AI task contracts with inputs, targets, metrics, predictions, and setup alignment |
33
- | Models | Minimal linear/ridge/logistic baselines plus compact PyTorch MLP heads for the unified 20-task public-sample suite |
 
34
  | Research map | Four Ropedia research directions with direct, proxy, diagnostic, and extension-task coverage |
35
- | Scale-up path | A selected 96/16/16 Qwen3-Omni LoRA final diagnostic result is verified; strict-JSON validity meets target, while weak action/subtask metrics guide the next error-analysis pass |
36
 
37
  ## How To Read It
38
 
39
  1. Start with `PUBLIC_READER_MAP.md` if you need to choose between GitHub,
40
- the website, Hugging Face artifacts, baseline weights, model branches, or
41
  release-health files.
42
  2. Start with the website or this brief to understand the project shape.
43
  3. Open `RESEARCH_ROADMAP.md` to see how the work scales from the public
@@ -46,17 +48,17 @@ results, and see what remains before multi-episode model-quality claims.
46
  5. Use `RESEARCH_TAKEAWAYS.md` for the current metric interpretation.
47
  6. Inspect `results/episode_task_suite/feature_manifest.json` to understand one model input.
48
  7. Use `TASK_SUITE_20.md` and `docs/data/task_suite_20.json` to read the unified 20-task suite; the historical `docs/data/tier2_task_suite.json` path stores the tasks 13-20 result bundle.
49
- 8. Use `docs/data/omni_finetune_verified_result.json` for the current multi-episode Qwen3-Omni pilot result.
 
50
 
51
  ## What This Enables
52
 
53
- The public sample is enough to build and verify task definitions, feature
54
- contracts, metrics, visualization, and baseline code. It is not enough to
55
- measure final model quality for a general embodied-AI model. The first
56
- multi-episode Qwen3-Omni diagnostic pilot now verifies the held-out training
57
- loop with validation loss recorded; the next research stage is to improve
58
- JSON-format reliability and error analysis before larger robustness or
59
- alternative backbone claims.
60
 
61
  ## Best Entry Points
62
 
 
1
  # Project Brief
2
 
3
+ This project presents Ropedia Xperience-10M through two public evidence lines.
4
+ Line 1 turns one public sample episode into a concrete 20-task embodied-AI
5
+ task lab. Line 2 compares selected 128-episode public-safe artifacts across
6
+ aligned baselines, Qwen3-Omni v6, Cosmos3-Super, and Cosmos3-Nano.
7
 
8
  ## Research Intent
9
 
10
+ The public sample is treated as a small but real research system, while the
11
+ selected-128 line shows the first same-split scale-up comparison. The project
12
+ does not blend those two evidence types. A reader should be able to trace one
13
+ model input, understand each task, reproduce the public-sample results, compare
14
+ the 128-episode method rows, and see what remains before stronger
15
+ model-quality claims.
16
 
17
  ## Capability Map
18
 
 
21
  | Data understanding | `feature_manifest.json`, `available_modalities.json`, modality atlas, episode-window HF viewer |
22
  | Task design | 20 unified task contracts, task cards, case-study walkthroughs, and four research-direction extension probes |
23
  | Evaluation rigor | chronological split, per-task metrics, predictions, confusion matrices, leakage notes, and generated takeaways |
24
+ | Scale-up planning | Final verified 96/16/16 Qwen3-Omni v6 diagnostic row, same-split 128-episode baseline alignment, Cosmos3-Nano compatibility diagnostics, Cosmos3-Super diagnostics, and policy-model candidates after action-space conversion |
25
 
26
  ## What Exists Now
27
 
28
+ | Evidence view | Current artifact |
29
  | --- | --- |
30
+ | Line 1 data unit | 1 public sample episode, 5,821 frames, 1,161 synchronized 20-frame windows |
31
+ | Line 2 data unit | Selected 96/16/16 split over 128 source episodes, 34,269 Qwen3-Omni v6 multiscale windows, and public-safe processed features linked to official gated episode paths |
32
  | Modalities | Video-derived features, audio, depth, pose/SLAM, mocap, IMU, calibration, and language-derived features |
33
  | Task suite | 20 embodied-AI task contracts with inputs, targets, metrics, predictions, and setup alignment |
34
+ | Line 1 models | Minimal linear/ridge/logistic baselines plus compact PyTorch MLP heads for the unified 20-task public-sample suite |
35
+ | Line 2 methods | Metadata simple/NN, raw-feature simple/NN, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future Window; 140/140 selected-128 scores, including 6 marked compact-proxy cells |
36
  | Research map | Four Ropedia research directions with direct, proxy, diagnostic, and extension-task coverage |
37
+ | Qwen3 lineage | Qwen3-Omni v1-v6 are run versions inside Line 2: v1-v4 are pipeline-hardening/ablation evidence, v5 is the pinned prior multiscale release, and v6 is the current 20-task Qwen3-Omni row |
38
 
39
  ## How To Read It
40
 
41
  1. Start with `PUBLIC_READER_MAP.md` if you need to choose between GitHub,
42
+ the website, Hugging Face artifacts, baseline weights, model-result repos, or
43
  release-health files.
44
  2. Start with the website or this brief to understand the project shape.
45
  3. Open `RESEARCH_ROADMAP.md` to see how the work scales from the public
 
48
  5. Use `RESEARCH_TAKEAWAYS.md` for the current metric interpretation.
49
  6. Inspect `results/episode_task_suite/feature_manifest.json` to understand one model input.
50
  7. Use `TASK_SUITE_20.md` and `docs/data/task_suite_20.json` to read the unified 20-task suite; the historical `docs/data/tier2_task_suite.json` path stores the tasks 13-20 result bundle.
51
+ 8. Use `QWEN3_OMNI_RUN_LINEAGE.md` and `docs/data/qwen3_omni_run_lineage.json` to read v1-v6 correctly.
52
+ 9. Use `docs/data/omni_finetune_verified_result.json` for the current multi-episode Qwen3-Omni v6 result.
53
 
54
  ## What This Enables
55
 
56
+ Line 1 is enough to build and verify task definitions, feature contracts,
57
+ metrics, visualization, and baseline code. It is not enough to measure final
58
+ general embodied-AI model quality. Line 2 verifies the selected-128 held-out
59
+ comparison surface and the Qwen3-Omni v6 diagnostic row; the next research
60
+ stage is action/subtask error analysis, stronger structured-output training,
61
+ and policy-target conversion before larger backbone claims.
 
62
 
63
  ## Best Entry Points
64
 
PROJECT_README.md CHANGED
@@ -35,11 +35,11 @@
35
  </p>
36
 
37
 
38
- **Ropedia Xperience-10M Task Suite** is organized as two public result lines, not one blended benchmark. The 1-sample line is a fully inspectable task lab. The selected-128 line is the comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Every score points back to a source artifact and keeps direct-vs-proxy status visible.
39
 
40
  **Updated:** 2026-06-21.
41
 
42
- **Scope:** one public sample episode for raw-file inspection and reproducible task construction; selected 128-episode public-safe artifacts for same-split metadata/raw baselines, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future Window. Raw Xperience-10M MP4/HDF5/RRD files, Qwen3 base weights, Cosmos3 base weights, and gated data are not redistributed here.
43
 
44
  ## Contents
45
 
@@ -59,6 +59,8 @@
59
 
60
  Use the two evidence lines first, then choose the artifact that answers your question. The dashboard is the best visual overview; the GitHub repo is the source of truth for scripts and generated JSON; Hugging Face mirrors contain public-safe cards, metrics, figures, and model artifacts.
61
 
 
 
62
  The multilingual README files are reader guides. The canonical technical evidence is still the committed task contracts, result matrices, validation JSON, and public-safe result packages.
63
 
64
  ## At A Glance
@@ -72,8 +74,8 @@ The multilingual README files are reader guides. The canonical technical evidenc
72
  </thead>
73
  <tbody>
74
  <tr>
75
- <td><strong>Two result lines</strong></td>
76
- <td><strong>1 sample episode</strong> for task construction and reproducibility. <strong>128 selected episodes</strong> for same-split metadata/raw baselines plus Qwen3-Omni v6 and Cosmos3 diagnostics.</td>
77
  </tr>
78
  <tr>
79
  <td><strong>180 method-task records</strong></td>
@@ -104,7 +106,7 @@ The multilingual README files are reader guides. The canonical technical evidenc
104
 
105
  ## Two Evidence Lines
106
 
107
- The public suite is organized around two result lines. Keep them separate when reading metrics.
108
 
109
  <p align="center">
110
  <img src="docs/assets/charts/two_evidence_line_map.svg" alt="Two evidence-line map: 1 sample episode and 128 selected episodes combine into 180 scored method-task records" width="100%">
@@ -645,10 +647,12 @@ robotics, spatial intelligence, and world modeling. The public
645
  [`ropedia-ai/xperience-10m-sample`](https://huggingface.co/datasets/ropedia-ai/xperience-10m-sample)
646
  repo provides the sample episode used for the implemented task suite here.
647
 
648
- This project keeps those layers separate: the public sample supports the
649
- current 20-task study, while the gated full dataset is used only for the
650
- selected multi-episode Qwen3-Omni pilot. Raw Xperience-10M MP4/HDF5/RRD files
651
- are not redistributed in this repo or in the Hugging Face mirrors.
 
 
652
 
653
  The current verified public-sample subset is:
654
 
@@ -663,9 +667,9 @@ Detailed dataset notes are available in
663
  [`XPERIENCE10M_DATASET_CARD_ALIGNMENT.md`](XPERIENCE10M_DATASET_CARD_ALIGNMENT.md)
664
  and [`docs/data/xperience10m_dataset_card_alignment.json`](docs/data/xperience10m_dataset_card_alignment.json)
665
  for readers who need the full upstream-card and access-term context. The
666
- practical boundary is simple: current task-suite results come from the public
667
- sample, and the first multi-episode Qwen3-Omni diagnostic pilot is verified but
668
- not yet strong model quality.
669
 
670
  Start with the visual dashboard:
671
 
@@ -675,12 +679,12 @@ Hugging Face Space app:
675
 
676
  **[cy0307-ropedia-xperience-10m-task-suite.hf.space](https://cy0307-ropedia-xperience-10m-task-suite.hf.space/)**
677
 
678
- ## Read This Project In Three Layers
679
 
680
  <table>
681
  <thead>
682
  <tr>
683
- <th width="24%">Layer</th>
684
  <th width="34%">What to inspect</th>
685
  <th>Why it matters</th>
686
  </tr>
@@ -697,7 +701,7 @@ Hugging Face Space app:
697
  <tr><td><strong>Minimal heads</strong></td><td>softmax<br>ridge projection/regression<br>multi-label logistic heads</td><td>Keeps every input/output contract visible and inspectable.</td></tr>
698
  <tr><td><strong>Neural heads</strong></td><td>PyTorch MLP classifiers/regressors under <a href="results/episode_task_suite/neural_mlp/">neural_mlp/</a></td><td>Checks whether nonlinear heads improve each task without changing features.</td></tr>
699
  <tr><td><strong>Evidence</strong></td><td>metrics<br>predictions<br>confusion matrices<br>diagrams<br>dashboard</td><td>Makes the single-episode task development inspectable without rerunning first.</td></tr>
700
- <tr><td><strong>Artifact guide</strong></td><td><a href="ARTIFACT_GUIDE.md">ARTIFACT_GUIDE.md</a></td><td>Groups the public evidence into research-project layers after the first-pass overview.</td></tr>
701
  <tr><td><strong>Reproducibility contract</strong></td><td><a href="REPRODUCIBILITY.md">REPRODUCIBILITY.md</a><br><a href="docs/data/reproducibility_matrix.json">reproducibility_matrix.json</a></td><td>States public commands, expected outputs, exact-match reproduction evidence, and non-reproducible boundaries.</td></tr>
702
  <tr><td><strong>Citation metadata</strong></td><td><a href="CITATION.cff">CITATION.cff</a><br><a href="codemeta.json">codemeta.json</a><br><a href="LICENSE">LICENSE</a></td><td>Makes the repo easier to cite, index, and reuse without confusing code license and dataset terms.</td></tr>
703
  </tbody>
@@ -820,9 +824,10 @@ overlays exact labels, dimensions, and metrics from the committed result files.
820
 
821
  ## Scope
822
 
823
- This is a learning, inspection, and pipeline-validation repo built from one
824
- public sample episode. The next model-quality stage is to run the same suite
825
- over many episodes and split train/test by held-out episode.
 
826
 
827
  ## What Is Inside
828
 
@@ -1068,14 +1073,17 @@ baselines, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future W
1068
  files also include `model_groups`: a model-first view that pairs 1-episode and
1069
  128-episode entries for the same family. Use that section when comparing task
1070
  heads against task heads, Qwen3-Omni smoke/LoRA against Qwen3-Omni LoRA, or
1071
- Cosmos3-Nano compatibility against future Cosmos weight releases.
 
 
 
1072
 
1073
- The no-new-episode enhancement layer is recorded in
1074
  [`docs/data/task_suite_enhancement_128.json`](docs/data/task_suite_enhancement_128.json)
1075
  and [`TASK_SUITE_ENHANCEMENT_128.md`](TASK_SUITE_ENHANCEMENT_128.md). It keeps
1076
  the current Qwen3-Omni v6 and Cosmos3 packages as baselines, then defines dense-window
1077
  scenarios, hierarchical action/subtask targets, task bottlenecks, and experiment
1078
- cards for a stronger 128-episode v5 run without overwriting earlier results.
1079
 
1080
  ### Sample Count Decision
1081
 
 
35
  </p>
36
 
37
 
38
+ **Ropedia Xperience-10M Task Suite** has two public evidence lines. **Line 1** is the 1-sample task lab for raw-file inspection, task construction, and reproducibility. **Line 2** is the selected-128 comparison surface for aligned metadata/raw baselines, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future Window. Every score points to a source artifact and keeps direct-vs-proxy status visible.
39
 
40
  **Updated:** 2026-06-21.
41
 
42
+ **Scope:** Line 1 uses one public sample episode. Line 2 uses selected 128-episode public-safe artifacts linked back to official gated episode paths. Raw Xperience-10M MP4/HDF5/RRD files, Qwen3 base weights, Cosmos3 base weights, and gated data are not redistributed here.
43
 
44
  ## Contents
45
 
 
59
 
60
  Use the two evidence lines first, then choose the artifact that answers your question. The dashboard is the best visual overview; the GitHub repo is the source of truth for scripts and generated JSON; Hugging Face mirrors contain public-safe cards, metrics, figures, and model artifacts.
61
 
62
+ Quick rule: use **Line 1** for “can I inspect and reproduce the task?” Use **Line 2** for “how do aligned baselines and model diagnostics compare on the selected 128 episodes?”
63
+
64
  The multilingual README files are reader guides. The canonical technical evidence is still the committed task contracts, result matrices, validation JSON, and public-safe result packages.
65
 
66
  ## At A Glance
 
74
  </thead>
75
  <tbody>
76
  <tr>
77
+ <td><strong>Two-line contract</strong></td>
78
+ <td><strong>Line 1: 1 sample episode</strong> for task construction and reproducibility. <strong>Line 2: 128 selected episodes</strong> for same-split metadata/raw baselines, Qwen3-Omni v6, and Cosmos3 diagnostics.</td>
79
  </tr>
80
  <tr>
81
  <td><strong>180 method-task records</strong></td>
 
106
 
107
  ## Two Evidence Lines
108
 
109
+ The public suite is organized around two evidence lines. Keep them separate when reading metrics.
110
 
111
  <p align="center">
112
  <img src="docs/assets/charts/two_evidence_line_map.svg" alt="Two evidence-line map: 1 sample episode and 128 selected episodes combine into 180 scored method-task records" width="100%">
 
647
  [`ropedia-ai/xperience-10m-sample`](https://huggingface.co/datasets/ropedia-ai/xperience-10m-sample)
648
  repo provides the sample episode used for the implemented task suite here.
649
 
650
+ This project keeps two evidence lines separate. Line 1 uses the public sample
651
+ for raw-file inspection, task construction, and local reproducibility. Line 2
652
+ uses selected 128-episode public-safe artifacts for same-split method
653
+ comparison, Qwen3-Omni v6 diagnostics, and Cosmos3 diagnostics. Raw
654
+ Xperience-10M MP4/HDF5/RRD files are not redistributed in this repo or in the
655
+ Hugging Face mirrors.
656
 
657
  The current verified public-sample subset is:
658
 
 
667
  [`XPERIENCE10M_DATASET_CARD_ALIGNMENT.md`](XPERIENCE10M_DATASET_CARD_ALIGNMENT.md)
668
  and [`docs/data/xperience10m_dataset_card_alignment.json`](docs/data/xperience10m_dataset_card_alignment.json)
669
  for readers who need the full upstream-card and access-term context. The
670
+ practical boundary is simple: task-lab claims come from Line 1, selected-128
671
+ comparison claims come from Line 2, and compact-proxy cells stay explicitly
672
+ marked where direct raw targets are missing.
673
 
674
  Start with the visual dashboard:
675
 
 
679
 
680
  **[cy0307-ropedia-xperience-10m-task-suite.hf.space](https://cy0307-ropedia-xperience-10m-task-suite.hf.space/)**
681
 
682
+ ## Read This Project By Evidence View
683
 
684
  <table>
685
  <thead>
686
  <tr>
687
+ <th width="24%">View</th>
688
  <th width="34%">What to inspect</th>
689
  <th>Why it matters</th>
690
  </tr>
 
701
  <tr><td><strong>Minimal heads</strong></td><td>softmax<br>ridge projection/regression<br>multi-label logistic heads</td><td>Keeps every input/output contract visible and inspectable.</td></tr>
702
  <tr><td><strong>Neural heads</strong></td><td>PyTorch MLP classifiers/regressors under <a href="results/episode_task_suite/neural_mlp/">neural_mlp/</a></td><td>Checks whether nonlinear heads improve each task without changing features.</td></tr>
703
  <tr><td><strong>Evidence</strong></td><td>metrics<br>predictions<br>confusion matrices<br>diagrams<br>dashboard</td><td>Makes the single-episode task development inspectable without rerunning first.</td></tr>
704
+ <tr><td><strong>Artifact guide</strong></td><td><a href="ARTIFACT_GUIDE.md">ARTIFACT_GUIDE.md</a></td><td>Groups the public evidence into reader-facing views after the first-pass overview.</td></tr>
705
  <tr><td><strong>Reproducibility contract</strong></td><td><a href="REPRODUCIBILITY.md">REPRODUCIBILITY.md</a><br><a href="docs/data/reproducibility_matrix.json">reproducibility_matrix.json</a></td><td>States public commands, expected outputs, exact-match reproduction evidence, and non-reproducible boundaries.</td></tr>
706
  <tr><td><strong>Citation metadata</strong></td><td><a href="CITATION.cff">CITATION.cff</a><br><a href="codemeta.json">codemeta.json</a><br><a href="LICENSE">LICENSE</a></td><td>Makes the repo easier to cite, index, and reuse without confusing code license and dataset terms.</td></tr>
707
  </tbody>
 
824
 
825
  ## Scope
826
 
827
+ This is a learning, inspection, and pipeline-validation repo with two public
828
+ evidence lines. Line 1 is built from one public sample episode. Line 2 uses a
829
+ selected 96/16/16 split over 128 episode paths, public-safe processed features,
830
+ and verified Qwen3-Omni/Cosmos3 diagnostic artifacts.
831
 
832
  ## What Is Inside
833
 
 
1073
  files also include `model_groups`: a model-first view that pairs 1-episode and
1074
  128-episode entries for the same family. Use that section when comparing task
1075
  heads against task heads, Qwen3-Omni smoke/LoRA against Qwen3-Omni LoRA, or
1076
+ Cosmos3-Nano compatibility against future Cosmos weight releases. For
1077
+ Qwen3-Omni specifically, read `QWEN3_OMNI_RUN_LINEAGE.md`: v1-v4 are
1078
+ pipeline-hardening and ablation evidence, v5 is the pinned prior multiscale
1079
+ release, and v6 is the current public 20-task Qwen row.
1080
 
1081
+ The no-new-episode enhancement plan is recorded in
1082
  [`docs/data/task_suite_enhancement_128.json`](docs/data/task_suite_enhancement_128.json)
1083
  and [`TASK_SUITE_ENHANCEMENT_128.md`](TASK_SUITE_ENHANCEMENT_128.md). It keeps
1084
  the current Qwen3-Omni v6 and Cosmos3 packages as baselines, then defines dense-window
1085
  scenarios, hierarchical action/subtask targets, task bottlenecks, and experiment
1086
+ cards for stronger selected-128 runs without overwriting earlier results.
1087
 
1088
  ### Sample Count Decision
1089
 
PROJECT_STATUS.md CHANGED
@@ -2,21 +2,31 @@
2
 
3
  This is the fastest way to understand the current research project state.
4
  It summarizes what has already been implemented from the public
5
- Xperience-10M sample, what the first multi-episode Qwen3-Omni and Cosmos3
6
- diagnostic branches show, and which artifacts support the next development
7
- step.
8
 
9
  ## Research Positioning
10
 
11
  The project is a research-engineering study of Xperience-10M rather than a
12
  single demo result. It makes the public sample episode inspectable, defines
13
  embodied-AI tasks over synchronized modalities, records baseline behavior, and
14
- keeps the next multi-episode modeling stage explicit. The current evidence is
15
  useful for judging data understanding, task design, evaluation discipline, and
16
  scale-up readiness; it is not presented as final full-dataset model quality.
17
- The current no-new-episode enhancement layer records how to push the selected
18
  128-episode setup harder before asking for more raw storage.
19
 
 
 
 
 
 
 
 
 
 
 
 
20
  | Area | Current state | Evidence | Research readout |
21
  | --- | --- | --- | --- |
22
  | Public-sample pipeline | Verified | `results/episode_task_suite/summary_report.json`, `results/episode_task_suite/windows.csv`, `results/episode_task_suite/feature_manifest.json` | One public Xperience-10M sample episode is converted into 5,821 frames, 1,161 aligned 20-frame windows, and an 8,546-dimensional current feature contract. |
@@ -25,12 +35,12 @@ The current no-new-episode enhancement layer records how to push the selected
25
  | Neural heads | Verified | `scripts/neural_task_models.py`, `results/episode_task_suite/neural_mlp/` | Each task also has a compact PyTorch MLP run over the same feature tensor and chronological split. |
26
  | Audio contribution study | Verified | `scripts/audio_ablation_and_raw_upgrade.py`, `results/audio_ablation/`, `docs/data/audio_ablation_summary.json` | Audio variants are compared across the original task contracts; audio improves the primary metric on 6 of those contracts, and a 588-d audio-window representation improves over the baseline audio variant on 6 of those contracts. |
27
  | Research takeaways | Verified | `RESEARCH_TAKEAWAYS.md`, `docs/data/research_takeaways.json`, `scripts/build_research_takeaways.py` | The main result interpretation is generated from committed metrics: chronological class shift, neural gains on dynamics/order/alignment, open retrieval/reconstruction problems, and the need for held-out episodes. |
28
- | Research roadmap | Current | `RESEARCH_ROADMAP.md`, `docs/data/research_roadmap.json` | The roadmap connects public-sample task development to the final verified Qwen3-Omni diagnostic result, same-split baseline alignment, action/subtask error analysis, robustness runs, world/policy branches, and the future Xperience-native pretraining goal. |
29
- | 128-episode task-suite enhancement pack | Current no-new-episode plan | `TASK_SUITE_ENHANCEMENT_128.md`, `docs/data/task_suite_enhancement_128.json`, `results/omni_finetune/task_suite_enhancement_128_v1_20260608/enhancement_plan.json`, `scripts/omni/build_task_suite_enhancement_128.py` | The current 3,808-window selected split can be stressed without more episodes by exporting denser and multiscale windows. The recommended next export is `multiscale_20s10_40s20_80s40`, estimated at 106,095 windows from the observed frame spans; the pack also defines hierarchical action/subtask targets, raw-feature shard priorities for unsupported tasks, and Qwen/Cosmos follow-up run cards. |
30
- | Foundation-model plan | Current | `FOUNDATION_MODEL_PLAN.md`, `docs/data/foundation_model_plan.json` | Qwen3-Omni remains the first structured JSON LoRA baseline; Cosmos3-Nano is verified as a future-window compatibility branch; Cosmos3-Super is represented by a base-weight Reasoner evaluation and a fine-tuned Forward-Dynamics LoRA branch. The Super LoRA target is camera-pose-conditioned future vision velocity, not supervised JSON action-token prediction. OpenVLA/openpi/GR00T remain policy candidates after robot-compatible action targets are explicit. |
31
- | Cosmos3-Super action-target contract | Superseded by verified forward-dynamics LoRA | `scripts/omni/export_cosmos3_camera_pose_targets.py`, `scripts/omni/pack_cosmos3_super_action_batch.py`, `results/omni_finetune/xperience10m_cosmos3_camera_pose_targets_20260608/target_manifest.json`, `results/omni_finetune/xperience10m_cosmos3_super_training_contract_audit_camera_pose_20260608/training_contract_audit.json`, `results/omni_finetune/xperience10m_cosmos3_super_action_packer_schema_smoke_20260608/packer_summary.json` | The selected 128-episode JSONL is augmented with 3,808/3,808 valid `camera_pose` proxy `cosmos_action_target` records from SLAM pose deltas. The schema packer and contract audit are now supporting evidence for the trained forward-dynamics branch; they still do not supervise `preds_action`, so action-token prediction needs a separate policy or inverse-dynamics target export. |
32
- | Cosmos3-Super Forward-Dynamics LoRA | Verified fine-tuned adapter branch | `configs/omni_backbones/cosmos3_super_forward_dynamics.json`, `scripts/omni/train_cosmos3_super_forward_dynamics_lora.py`, `scripts/omni/eval_cosmos3_super_forward_dynamics_lora.py`, `results/omni_finetune/verified_public/xperience10m_cosmos3_super_forward_dynamics_lora_128ep_train1epoch_256_attn_full8gpu_20260608_eval_test_full_fsdp/verified_result_summary.json`, `results/omni_finetune/verified_public/xperience10m_cosmos3_super_forward_dynamics_lora_128ep_train1epoch_256_attn_full8gpu_20260608_eval_test_full_fsdp/package_audit.json` | The first fine-tuned Cosmos3-Super adapter branch is locally verified as a public-safe package: 26.2M LoRA parameters, 2,848 train rows, 512 validation rows, 448 held-out test rows, validation MSE 4.0082, and test MSE 3.6853. The package excludes adapter safetensors; weights are published separately at `cy0307/ropedia-cosmos3-super-forward-dynamics-lora-128ep`. |
33
- | Omni model extension contract | Current | `OMNI_MODEL_EXTENSION_CONTRACT.md`, `configs/omni_backbones/`, `scripts/omni/backbone_registry.py`, `scripts/omni/smoke_test_backbone_packaging.py` | Future model branches must keep the same episode split discipline, held-out metrics, validation gate, public-safe package contract, and explicit forbidden-artifact policy before reporting results. |
34
  | Xperience Embodied Foundation Model | Future goal | `XPERIENCE_EMBODIED_FOUNDATION_MODEL_PRETRAINING.md` | A future full-corpus pretraining plan describes target modules, objectives, staged scale-up, hardware ranges, and evaluation for a domain-specific embodied foundation model. |
35
  | Evaluation protocol | Verified | `EVALUATION_PROTOCOL.md`, `docs/data/evaluation_protocol.json`, `scripts/build_evaluation_protocol.py` | Windowing, chronological split, per-task metrics, leakage controls, and current limitations are generated from committed metric artifacts. |
36
  | Dataset context | Verified | `XPERIENCE10M_DATASET_CARD_ALIGNMENT.md`, official Xperience-10M and sample cards | The README and dashboard distinguish the public sample used here from the gated full dataset used for the selected multi-episode pilot. |
@@ -38,7 +48,7 @@ The current no-new-episode enhancement layer records how to push the selected
38
  | Public package policy | Verified | `DATA_NOTICE.md`, `REPRODUCIBILITY.md` | Raw Xperience-10M data, private gated files, large archives, credentials, and full Qwen weights are not redistributed. |
39
  | Reproducibility | Verified for the public sample | `REPRODUCIBILITY.md`, `docs/data/reproducibility_matrix.json`, `notes/reproducibility_audit.md` | The public sample workflow has explicit commands, expected outputs, and exact-match reproduction evidence. |
40
  | 128-episode aligned baselines | Verified companion result | `results/omni_finetune/multi_episode_128_task_baselines/BASELINE_ALIGNMENT_REPORT.md`, `results/omni_finetune/a100_128_metadata_task_baselines_20260616_v2/`, `scripts/omni/run_128_task_baselines.py`, `scripts/omni/run_128_raw20_task_baselines.py` | The earlier simple and neural baseline framing is aligned to the same selected 96/16/16 episode split used by the Qwen3-Omni pilot. Metadata/text simple and neural heads now have 20/20 rows, raw-feature simple and neural heads have 20/20 rows, and compact proxies remain marked for missing direct raw targets. |
41
- | Qwen3-Omni fine-tuning | Latest v6 diagnostic branch verified; JSON target met | `docs/data/omni_finetune_verified_result.json`, `docs/data/qwen3_v5_v6_comparison.json`, `results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md`, `results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full/`, `scripts/omni/package_verified_omni_result.py`, `scripts/omni/audit_verified_omni_package.py`, `scripts/omni/analyze_qwen3_omni_errors.py` | The selected 96/16/16 episode split now has a current public-safe v6 rank64/lr5e-5 held-out package with 34,269 exported windows and 4,032 test predictions. JSON validity is 99.90%, meeting the 98% target; transition accuracy is 98.98%, contact accuracy is 81.77%, object micro-F1 is 30.65%, next-action accuracy is 4.31%, and action/subtask metrics remain weak. v6 improves action macro-F1 and contact accuracy versus v5, but v5 remains stronger on JSON validity, subtask, next-action, transition, and object metrics. |
42
  | Raw Xperience-10M redistribution | Not included | `DATA_NOTICE.md`, `docs/data/publication_audit.json` | Raw MP4, HDF5, RRD files, private gated data, and full Qwen weights are intentionally excluded. |
43
 
44
  ## Fast Research Route
@@ -53,10 +63,10 @@ The current no-new-episode enhancement layer records how to push the selected
53
  5. Inspect `RESEARCH_ROADMAP.md` and `docs/data/research_roadmap.json` for
54
  the path from public-sample task work to multi-episode modeling.
55
  6. Inspect `FOUNDATION_MODEL_PLAN.md` and
56
- `docs/data/foundation_model_plan.json` before choosing a backbone branch.
57
  7. Inspect `OMNI_MODEL_EXTENSION_CONTRACT.md` and run
58
  `python scripts/omni/backbone_registry.py --validate --json` before adding
59
- a new Qwen, Cosmos-style, or VLA/policy branch.
60
  8. Inspect `XPERIENCE_EMBODIED_FOUNDATION_MODEL_PRETRAINING.md` for the
61
  long-term full-corpus pretraining goal.
62
  9. Inspect `TASK_SUITE_20.md`, `docs/data/task_suite_20.json`,
@@ -81,12 +91,13 @@ The current no-new-episode enhancement layer records how to push the selected
81
 
82
  ## Current Reading Notes
83
 
84
- - Cross-episode generalization is a later multi-episode evaluation target; the
85
- current results use one public sample episode.
 
86
  - Public-facing fine-tuning results should come from the verified result
87
  package, not from live process logs or setup-only artifacts.
88
  - The latest Qwen3-Omni v6 held-out package verifies the current dense
89
- multiscale branch and meets the strict-JSON target, but not strong
90
  action/subtask model quality: JSON validity is 99.90%, action macro-F1 is
91
  0.0029, and subtask accuracy is 0.0037. v5 remains the pinned prior release
92
  row because it is still stronger on several metrics.
@@ -104,11 +115,11 @@ The current no-new-episode enhancement layer records how to push the selected
104
  - Audio contribution is evaluated across the original task contracts in
105
  `results/audio_ablation/`.
106
  - Foundation-model selection is now explicit: Qwen3-Omni is the immediate
107
- trainable pilot, Cosmos 3 is the first world-model branch, and Cosmos3-Super
108
  has a camera-pose proxy forward-dynamics contract ready for trainer
109
  implementation; policy models such as OpenVLA/openpi/GR00T still wait for
110
  robot-compatible action-target conversion.
111
- - Future model branches should be added through the backbone registry and
112
  verified package contract, not by creating one-off result folders with
113
  incompatible metrics or publication rules.
114
  - The Xperience Embodied Foundation Model is a future native-pretraining goal,
 
2
 
3
  This is the fastest way to understand the current research project state.
4
  It summarizes what has already been implemented from the public
5
+ Xperience-10M sample, what the selected-128 Qwen3-Omni and Cosmos3
6
+ diagnostic rows show, and which artifacts support the next development step.
 
7
 
8
  ## Research Positioning
9
 
10
  The project is a research-engineering study of Xperience-10M rather than a
11
  single demo result. It makes the public sample episode inspectable, defines
12
  embodied-AI tasks over synchronized modalities, records baseline behavior, and
13
+ keeps the selected-128 modeling stage explicit. The current evidence is
14
  useful for judging data understanding, task design, evaluation discipline, and
15
  scale-up readiness; it is not presented as final full-dataset model quality.
16
+ The current no-new-episode enhancement plan records how to push the selected
17
  128-episode setup harder before asking for more raw storage.
18
 
19
+ ## Two Evidence Lines
20
+
21
+ | Evidence line | Data unit | Public result status | Correct reading |
22
+ | --- | --- | --- | --- |
23
+ | Line 1: 1 sample episode | One public sample episode: 5,821 frames and 1,161 aligned 20-frame windows | 2 methods x 20 tasks = 40/40 direct scores | Task construction, raw-sample inspection, reproducibility, and controlled single-episode baselines |
24
+ | Line 2: 128 selected episodes | Selected 96/16/16 split over 128 official episode paths, with public-safe processed features | 7 methods x 20 tasks = 140/140 scores: 134 direct + 6 compact-proxy | Same-split metadata/raw baselines, Qwen3-Omni v6 diagnostics, Cosmos3 diagnostics, and scale-up planning |
25
+
26
+ Qwen3-Omni v1-v6 are run versions inside Line 2, not six project evidence
27
+ lines. v1-v4 are pipeline-hardening and ablation evidence, v5 is the pinned
28
+ prior multiscale release, and v6 is the current public 20-task Qwen3-Omni row.
29
+
30
  | Area | Current state | Evidence | Research readout |
31
  | --- | --- | --- | --- |
32
  | Public-sample pipeline | Verified | `results/episode_task_suite/summary_report.json`, `results/episode_task_suite/windows.csv`, `results/episode_task_suite/feature_manifest.json` | One public Xperience-10M sample episode is converted into 5,821 frames, 1,161 aligned 20-frame windows, and an 8,546-dimensional current feature contract. |
 
35
  | Neural heads | Verified | `scripts/neural_task_models.py`, `results/episode_task_suite/neural_mlp/` | Each task also has a compact PyTorch MLP run over the same feature tensor and chronological split. |
36
  | Audio contribution study | Verified | `scripts/audio_ablation_and_raw_upgrade.py`, `results/audio_ablation/`, `docs/data/audio_ablation_summary.json` | Audio variants are compared across the original task contracts; audio improves the primary metric on 6 of those contracts, and a 588-d audio-window representation improves over the baseline audio variant on 6 of those contracts. |
37
  | Research takeaways | Verified | `RESEARCH_TAKEAWAYS.md`, `docs/data/research_takeaways.json`, `scripts/build_research_takeaways.py` | The main result interpretation is generated from committed metrics: chronological class shift, neural gains on dynamics/order/alignment, open retrieval/reconstruction problems, and the need for held-out episodes. |
38
+ | Research roadmap | Current | `RESEARCH_ROADMAP.md`, `docs/data/research_roadmap.json` | The roadmap connects public-sample task development to the final verified Qwen3-Omni diagnostic result, same-split baseline alignment, action/subtask error analysis, robustness runs, world/policy tracks, and the future Xperience-native pretraining goal. |
39
+ | 128-episode task-suite enhancement pack | Current no-new-episode plan | `TASK_SUITE_ENHANCEMENT_128.md`, `docs/data/task_suite_enhancement_128.json`, `results/omni_finetune/task_suite_enhancement_128_v1_20260608/enhancement_plan.json`, `scripts/omni/build_task_suite_enhancement_128.py` | The current 3,808-window selected split can be stressed without more episodes by exporting denser and multiscale windows. The recommended next export is `multiscale_20s10_40s20_80s40`, estimated at 106,095 windows from the observed frame spans; the pack also defines hierarchical action/subtask targets, raw-feature shard priorities for unsupported tasks, and Qwen3-Omni/Cosmos3 follow-up run cards. |
40
+ | Foundation-model plan | Current | `FOUNDATION_MODEL_PLAN.md`, `docs/data/foundation_model_plan.json` | Qwen3-Omni remains the first structured JSON LoRA baseline; Cosmos3-Nano is verified as a future-window compatibility diagnostic; Cosmos3-Super is represented by a base-weight Reasoner evaluation and a fine-tuned Forward-Dynamics LoRA adapter. The Super LoRA target is camera-pose-conditioned future vision velocity, not supervised JSON action-token prediction. OpenVLA/openpi/GR00T remain policy candidates after robot-compatible action targets are explicit. |
41
+ | Cosmos3-Super action-target contract | Superseded by verified forward-dynamics LoRA | `scripts/omni/export_cosmos3_camera_pose_targets.py`, `scripts/omni/pack_cosmos3_super_action_batch.py`, `results/omni_finetune/xperience10m_cosmos3_camera_pose_targets_20260608/target_manifest.json`, `results/omni_finetune/xperience10m_cosmos3_super_training_contract_audit_camera_pose_20260608/training_contract_audit.json`, `results/omni_finetune/xperience10m_cosmos3_super_action_packer_schema_smoke_20260608/packer_summary.json` | The selected 128-episode JSONL is augmented with 3,808/3,808 valid `camera_pose` proxy `cosmos_action_target` records from SLAM pose deltas. The schema packer and contract audit are now supporting evidence for the trained forward-dynamics adapter; they still do not supervise `preds_action`, so action-token prediction needs a separate policy or inverse-dynamics target export. |
42
+ | Cosmos3-Super Forward-Dynamics LoRA | Verified fine-tuned adapter | `configs/omni_backbones/cosmos3_super_forward_dynamics.json`, `scripts/omni/train_cosmos3_super_forward_dynamics_lora.py`, `scripts/omni/eval_cosmos3_super_forward_dynamics_lora.py`, `results/omni_finetune/verified_public/xperience10m_cosmos3_super_forward_dynamics_lora_128ep_train1epoch_256_attn_full8gpu_20260608_eval_test_full_fsdp/verified_result_summary.json`, `results/omni_finetune/verified_public/xperience10m_cosmos3_super_forward_dynamics_lora_128ep_train1epoch_256_attn_full8gpu_20260608_eval_test_full_fsdp/package_audit.json` | The first fine-tuned Cosmos3-Super adapter is locally verified as a public-safe package: 26.2M LoRA parameters, 2,848 train rows, 512 validation rows, 448 held-out test rows, validation MSE 4.0082, and test MSE 3.6853. The package excludes adapter safetensors; weights are published separately at `cy0307/ropedia-cosmos3-super-forward-dynamics-lora-128ep`. |
43
+ | Omni model extension contract | Current | `OMNI_MODEL_EXTENSION_CONTRACT.md`, `configs/omni_backbones/`, `scripts/omni/backbone_registry.py`, `scripts/omni/smoke_test_backbone_packaging.py` | Future model rows/adapters must keep the same episode split discipline, held-out metrics, validation gate, public-safe package contract, and explicit forbidden-artifact policy before reporting results. |
44
  | Xperience Embodied Foundation Model | Future goal | `XPERIENCE_EMBODIED_FOUNDATION_MODEL_PRETRAINING.md` | A future full-corpus pretraining plan describes target modules, objectives, staged scale-up, hardware ranges, and evaluation for a domain-specific embodied foundation model. |
45
  | Evaluation protocol | Verified | `EVALUATION_PROTOCOL.md`, `docs/data/evaluation_protocol.json`, `scripts/build_evaluation_protocol.py` | Windowing, chronological split, per-task metrics, leakage controls, and current limitations are generated from committed metric artifacts. |
46
  | Dataset context | Verified | `XPERIENCE10M_DATASET_CARD_ALIGNMENT.md`, official Xperience-10M and sample cards | The README and dashboard distinguish the public sample used here from the gated full dataset used for the selected multi-episode pilot. |
 
48
  | Public package policy | Verified | `DATA_NOTICE.md`, `REPRODUCIBILITY.md` | Raw Xperience-10M data, private gated files, large archives, credentials, and full Qwen weights are not redistributed. |
49
  | Reproducibility | Verified for the public sample | `REPRODUCIBILITY.md`, `docs/data/reproducibility_matrix.json`, `notes/reproducibility_audit.md` | The public sample workflow has explicit commands, expected outputs, and exact-match reproduction evidence. |
50
  | 128-episode aligned baselines | Verified companion result | `results/omni_finetune/multi_episode_128_task_baselines/BASELINE_ALIGNMENT_REPORT.md`, `results/omni_finetune/a100_128_metadata_task_baselines_20260616_v2/`, `scripts/omni/run_128_task_baselines.py`, `scripts/omni/run_128_raw20_task_baselines.py` | The earlier simple and neural baseline framing is aligned to the same selected 96/16/16 episode split used by the Qwen3-Omni pilot. Metadata/text simple and neural heads now have 20/20 rows, raw-feature simple and neural heads have 20/20 rows, and compact proxies remain marked for missing direct raw targets. |
51
+ | Qwen3-Omni fine-tuning | Latest v6 diagnostic row verified; JSON target met | `QWEN3_OMNI_RUN_LINEAGE.md`, `docs/data/qwen3_omni_run_lineage.json`, `docs/data/omni_finetune_verified_result.json`, `docs/data/qwen3_v5_v6_comparison.json`, `results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md`, `results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full/`, `scripts/omni/package_verified_omni_result.py`, `scripts/omni/audit_verified_omni_package.py`, `scripts/omni/analyze_qwen3_omni_errors.py` | Qwen3-Omni v1-v6 are one selected-128 run lineage: v1-v4 harden the pipeline and evaluate ablations, v5 is the pinned prior multiscale release, and v6 is the current public 20-task row. The v6 rank64/lr5e-5 package has 34,269 exported windows and 4,032 test predictions. JSON validity is 99.90%, meeting the 98% target; transition accuracy is 98.98%, contact accuracy is 81.77%, object micro-F1 is 30.65%, next-action accuracy is 4.31%, and action/subtask metrics remain weak. v6 improves action macro-F1 and contact accuracy versus v5, but v5 remains stronger on JSON validity, subtask, next-action, transition, and object metrics. |
52
  | Raw Xperience-10M redistribution | Not included | `DATA_NOTICE.md`, `docs/data/publication_audit.json` | Raw MP4, HDF5, RRD files, private gated data, and full Qwen weights are intentionally excluded. |
53
 
54
  ## Fast Research Route
 
63
  5. Inspect `RESEARCH_ROADMAP.md` and `docs/data/research_roadmap.json` for
64
  the path from public-sample task work to multi-episode modeling.
65
  6. Inspect `FOUNDATION_MODEL_PLAN.md` and
66
+ `docs/data/foundation_model_plan.json` before choosing a backbone track.
67
  7. Inspect `OMNI_MODEL_EXTENSION_CONTRACT.md` and run
68
  `python scripts/omni/backbone_registry.py --validate --json` before adding
69
+ a new Qwen, Cosmos-style, or VLA/policy row.
70
  8. Inspect `XPERIENCE_EMBODIED_FOUNDATION_MODEL_PRETRAINING.md` for the
71
  long-term full-corpus pretraining goal.
72
  9. Inspect `TASK_SUITE_20.md`, `docs/data/task_suite_20.json`,
 
91
 
92
  ## Current Reading Notes
93
 
94
+ - Line 1 uses one public sample episode. Line 2 uses selected-128 public-safe
95
+ processed artifacts linked to official gated episode paths. Do not merge
96
+ those claims when reading scores.
97
  - Public-facing fine-tuning results should come from the verified result
98
  package, not from live process logs or setup-only artifacts.
99
  - The latest Qwen3-Omni v6 held-out package verifies the current dense
100
+ multiscale run and meets the strict-JSON target, but not strong
101
  action/subtask model quality: JSON validity is 99.90%, action macro-F1 is
102
  0.0029, and subtask accuracy is 0.0037. v5 remains the pinned prior release
103
  row because it is still stronger on several metrics.
 
115
  - Audio contribution is evaluated across the original task contracts in
116
  `results/audio_ablation/`.
117
  - Foundation-model selection is now explicit: Qwen3-Omni is the immediate
118
+ trainable pilot, Cosmos 3 is the first world-model track, and Cosmos3-Super
119
  has a camera-pose proxy forward-dynamics contract ready for trainer
120
  implementation; policy models such as OpenVLA/openpi/GR00T still wait for
121
  robot-compatible action-target conversion.
122
+ - Future model rows/adapters should be added through the backbone registry and
123
  verified package contract, not by creating one-off result folders with
124
  incompatible metrics or publication rules.
125
  - The Xperience Embodied Foundation Model is a future native-pretraining goal,
PUBLIC_READER_MAP.md CHANGED
@@ -10,12 +10,14 @@ trail.
10
  | Reader goal | Start here | Then inspect |
11
  | --- | --- | --- |
12
  | Understand the project in one pass | `PROJECT_BRIEF.md` | `PROJECT_STATUS.md`, `RESEARCH_TAKEAWAYS.md` |
 
13
  | See the visual public dashboard | GitHub Pages or the HF Space | `docs/index.html`, `docs/data/project_packet.json` |
14
  | Understand the data unit | `results/episode_task_suite/windows.csv` | `results/episode_task_suite/feature_manifest.json`, `docs/data/raw_sample_files.json` |
15
  | Trace the 128-episode split | `XPERIENCE10M_128_EPISODE_FEATURE_INDEX.md` | `docs/data/xperience10m_128_episode_feature_index.json`, `results/omni_finetune/xperience10m_128_episode_selection.csv` |
16
  | Inspect the 20-task benchmark | `TASK_SUITE_20.md` | `docs/data/task_suite_20.json`, `EVALUATION_PROTOCOL.md` |
17
  | Compare current results | `RESEARCH_TAKEAWAYS.md` | `docs/data/task_method_20_result_matrix.json`, `docs/data/unified_task_model_radar.json` |
18
  | Compare 1-episode and 128-episode methods | Homepage radar section | `docs/data/single_episode_task_model_radar.json`, `docs/data/episode128_task_model_radar.json` |
 
19
  | Find all derived artifacts | `ARTIFACT_GUIDE.md` | HF artifact dataset, `docs/data/artifact_index.json` |
20
  | Download model weights with their matching results | Hugging Face weights/results repo | `manifest.json`, `analysis/docs/data/task_method_20_result_matrix.json`, `results/` |
21
  | Reproduce or extend the work | `REPRODUCIBILITY.md` | `QUALITY_GATES.md`, `scripts/`, `results/` |
@@ -34,7 +36,7 @@ trail.
34
  | HF weights/results repo | Consolidated baseline weights, Qwen3-Omni v6 LoRA, Cosmos3-Super adapter/result artifacts, verified results, analysis files, and file-level manifest | Auditing all public-safe weight-bearing artifacts from one repo |
35
  | Qwen3-Omni and Cosmos3 model repos | Adapter-specific public weights or package cards when a run is verified and publishable | Inspecting Qwen3-Omni and Cosmos3 artifacts |
36
 
37
- ## Evidence Layers
38
 
39
  1. Dataset/source boundary: upstream Xperience-10M links, public sample scope,
40
  raw-data exclusion, and derived-file policy.
@@ -42,8 +44,9 @@ trail.
42
  split policy, and leakage controls.
43
  3. Task suite: 20 named tasks with inputs, outputs, metrics, baseline
44
  artifacts, and walkthroughs.
45
- 4. Results: minimal heads, neural heads, 128-episode aligned baselines,
46
- Qwen3-Omni diagnostics, Cosmos diagnostics, radar views, and explicit gaps.
 
47
  5. Foundation directions: spatial intelligence, human-video world modeling, and
48
  vision-language-action training pipelines.
49
  6. Public-release checks: website integrity, source alignment, mirror parity,
@@ -54,6 +57,7 @@ trail.
54
  | Claim type | Public evidence | Boundary |
55
  | --- | --- | --- |
56
  | Single public-sample task behavior | `results/episode_task_suite/`, `docs/data/task_suite_20.json` | Describes one public sample episode, not the full dataset distribution |
57
- | 128-episode baseline comparison | `XPERIENCE10M_128_EPISODE_FEATURE_INDEX.md`, `docs/data/xperience10m_128_episode_feature_index.json`, `results/omni_finetune/*128*`, `docs/data/omni_model_comparison.json` | Uses selected held-out episodes and derived public-safe summaries; official raw files remain gated upstream |
 
58
  | Foundation-model track quality | Verified Qwen3-Omni and Cosmos3 result packages and model cards | Numeric task scores appear only when a task-specific eval/probe exists |
59
  | Reproducibility | `REPRODUCIBILITY.md`, `QUALITY_GATES.md`, release validators | Raw gated Xperience-10M files and full foundation weights are not redistributed |
 
10
  | Reader goal | Start here | Then inspect |
11
  | --- | --- | --- |
12
  | Understand the project in one pass | `PROJECT_BRIEF.md` | `PROJECT_STATUS.md`, `RESEARCH_TAKEAWAYS.md` |
13
+ | Understand the two evidence lines | `TWO_EVIDENCE_LINES.md` | `docs/data/two_evidence_lines.json`, `docs/data/two_evidence_line_result_summary.json` |
14
  | See the visual public dashboard | GitHub Pages or the HF Space | `docs/index.html`, `docs/data/project_packet.json` |
15
  | Understand the data unit | `results/episode_task_suite/windows.csv` | `results/episode_task_suite/feature_manifest.json`, `docs/data/raw_sample_files.json` |
16
  | Trace the 128-episode split | `XPERIENCE10M_128_EPISODE_FEATURE_INDEX.md` | `docs/data/xperience10m_128_episode_feature_index.json`, `results/omni_finetune/xperience10m_128_episode_selection.csv` |
17
  | Inspect the 20-task benchmark | `TASK_SUITE_20.md` | `docs/data/task_suite_20.json`, `EVALUATION_PROTOCOL.md` |
18
  | Compare current results | `RESEARCH_TAKEAWAYS.md` | `docs/data/task_method_20_result_matrix.json`, `docs/data/unified_task_model_radar.json` |
19
  | Compare 1-episode and 128-episode methods | Homepage radar section | `docs/data/single_episode_task_model_radar.json`, `docs/data/episode128_task_model_radar.json` |
20
+ | Read Qwen3-Omni v1-v6 correctly | `QWEN3_OMNI_RUN_LINEAGE.md` | `docs/data/qwen3_omni_run_lineage.json`, `docs/data/qwen3_v5_v6_comparison.json` |
21
  | Find all derived artifacts | `ARTIFACT_GUIDE.md` | HF artifact dataset, `docs/data/artifact_index.json` |
22
  | Download model weights with their matching results | Hugging Face weights/results repo | `manifest.json`, `analysis/docs/data/task_method_20_result_matrix.json`, `results/` |
23
  | Reproduce or extend the work | `REPRODUCIBILITY.md` | `QUALITY_GATES.md`, `scripts/`, `results/` |
 
36
  | HF weights/results repo | Consolidated baseline weights, Qwen3-Omni v6 LoRA, Cosmos3-Super adapter/result artifacts, verified results, analysis files, and file-level manifest | Auditing all public-safe weight-bearing artifacts from one repo |
37
  | Qwen3-Omni and Cosmos3 model repos | Adapter-specific public weights or package cards when a run is verified and publishable | Inspecting Qwen3-Omni and Cosmos3 artifacts |
38
 
39
+ ## Evidence Views
40
 
41
  1. Dataset/source boundary: upstream Xperience-10M links, public sample scope,
42
  raw-data exclusion, and derived-file policy.
 
44
  split policy, and leakage controls.
45
  3. Task suite: 20 named tasks with inputs, outputs, metrics, baseline
46
  artifacts, and walkthroughs.
47
+ 4. Results: Line 1 minimal/neural heads, Line 2 selected-128 aligned baselines,
48
+ Qwen3-Omni v6 diagnostics, Cosmos diagnostics, radar views, and explicit
49
+ direct-vs-proxy labels.
50
  5. Foundation directions: spatial intelligence, human-video world modeling, and
51
  vision-language-action training pipelines.
52
  6. Public-release checks: website integrity, source alignment, mirror parity,
 
57
  | Claim type | Public evidence | Boundary |
58
  | --- | --- | --- |
59
  | Single public-sample task behavior | `results/episode_task_suite/`, `docs/data/task_suite_20.json` | Describes one public sample episode, not the full dataset distribution |
60
+ | 128-episode method comparison | `XPERIENCE10M_128_EPISODE_FEATURE_INDEX.md`, `docs/data/xperience10m_128_episode_feature_index.json`, `results/omni_finetune/*128*`, `docs/data/omni_model_comparison.json` | Uses selected held-out episodes and derived public-safe summaries; official raw files remain gated upstream |
61
+ | Qwen3-Omni v1-v6 lineage | `QWEN3_OMNI_RUN_LINEAGE.md`, `docs/data/qwen3_omni_run_lineage.json` | v1-v4 are pipeline/ablation evidence, v5 is the pinned prior release, and v6 is the current public 20-task Qwen row |
62
  | Foundation-model track quality | Verified Qwen3-Omni and Cosmos3 result packages and model cards | Numeric task scores appear only when a task-specific eval/probe exists |
63
  | Reproducibility | `REPRODUCIBILITY.md`, `QUALITY_GATES.md`, release validators | Raw gated Xperience-10M files and full foundation weights are not redistributed |
QWEN3_OMNI_RUN_LINEAGE.md CHANGED
@@ -1,6 +1,6 @@
1
  # Qwen3-Omni v1-v6 Run Lineage
2
 
3
- Generated: `2026-06-21T10:54:46+00:00`.
4
 
5
  Scope: Verified public-safe Qwen3-Omni LoRA/eval packages over the selected Xperience-10M 128-episode surface.
6
 
 
1
  # Qwen3-Omni v1-v6 Run Lineage
2
 
3
+ Generated: `2026-06-21T11:47:45+00:00`.
4
 
5
  Scope: Verified public-safe Qwen3-Omni LoRA/eval packages over the selected Xperience-10M 128-episode surface.
6
 
README.md CHANGED
@@ -57,11 +57,11 @@ metrics:
57
  </p>
58
 
59
 
60
- **Ropedia Xperience-10M Task Suite** is organized as two public result lines, not one blended benchmark. The 1-sample line is a fully inspectable task lab. The selected-128 line is the comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Every score points back to a source artifact and keeps direct-vs-proxy status visible.
61
 
62
  **Updated:** 2026-06-21.
63
 
64
- **Scope:** one public sample episode for raw-file inspection and reproducible task construction; selected 128-episode public-safe artifacts for same-split metadata/raw baselines, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future Window. Raw Xperience-10M MP4/HDF5/RRD files, Qwen3 base weights, Cosmos3 base weights, and gated data are not redistributed here.
65
 
66
  ## Contents
67
 
@@ -81,6 +81,8 @@ metrics:
81
 
82
  Use the two evidence lines first, then choose the artifact that answers your question. The dashboard is the best visual overview; the GitHub repo is the source of truth for scripts and generated JSON; Hugging Face mirrors contain public-safe cards, metrics, figures, and model artifacts.
83
 
 
 
84
  The multilingual README files are reader guides. The canonical technical evidence is still the committed task contracts, result matrices, validation JSON, and public-safe result packages.
85
 
86
  ## At A Glance
@@ -94,8 +96,8 @@ The multilingual README files are reader guides. The canonical technical evidenc
94
  </thead>
95
  <tbody>
96
  <tr>
97
- <td><strong>Two result lines</strong></td>
98
- <td><strong>1 sample episode</strong> for task construction and reproducibility. <strong>128 selected episodes</strong> for same-split metadata/raw baselines plus Qwen3-Omni v6 and Cosmos3 diagnostics.</td>
99
  </tr>
100
  <tr>
101
  <td><strong>180 method-task records</strong></td>
@@ -126,7 +128,7 @@ The multilingual README files are reader guides. The canonical technical evidenc
126
 
127
  ## Two Evidence Lines
128
 
129
- The public suite is organized around two result lines. Keep them separate when reading metrics.
130
 
131
  <p align="center">
132
  <img src="docs/assets/charts/two_evidence_line_map.svg" alt="Two evidence-line map: 1 sample episode and 128 selected episodes combine into 180 scored method-task records" width="100%">
@@ -667,10 +669,12 @@ robotics, spatial intelligence, and world modeling. The public
667
  [`ropedia-ai/xperience-10m-sample`](https://huggingface.co/datasets/ropedia-ai/xperience-10m-sample)
668
  repo provides the sample episode used for the implemented task suite here.
669
 
670
- This project keeps those layers separate: the public sample supports the
671
- current 20-task study, while the gated full dataset is used only for the
672
- selected multi-episode Qwen3-Omni pilot. Raw Xperience-10M MP4/HDF5/RRD files
673
- are not redistributed in this repo or in the Hugging Face mirrors.
 
 
674
 
675
  The current verified public-sample subset is:
676
 
@@ -685,9 +689,9 @@ Detailed dataset notes are available in
685
  [`XPERIENCE10M_DATASET_CARD_ALIGNMENT.md`](XPERIENCE10M_DATASET_CARD_ALIGNMENT.md)
686
  and [`docs/data/xperience10m_dataset_card_alignment.json`](docs/data/xperience10m_dataset_card_alignment.json)
687
  for readers who need the full upstream-card and access-term context. The
688
- practical boundary is simple: current task-suite results come from the public
689
- sample, and the first multi-episode Qwen3-Omni diagnostic pilot is verified but
690
- not yet strong model quality.
691
 
692
  Start with the visual dashboard:
693
 
@@ -697,12 +701,12 @@ Hugging Face Space app:
697
 
698
  **[cy0307-ropedia-xperience-10m-task-suite.hf.space](https://cy0307-ropedia-xperience-10m-task-suite.hf.space/)**
699
 
700
- ## Read This Project In Three Layers
701
 
702
  <table>
703
  <thead>
704
  <tr>
705
- <th width="24%">Layer</th>
706
  <th width="34%">What to inspect</th>
707
  <th>Why it matters</th>
708
  </tr>
@@ -719,7 +723,7 @@ Hugging Face Space app:
719
  <tr><td><strong>Minimal heads</strong></td><td>softmax<br>ridge projection/regression<br>multi-label logistic heads</td><td>Keeps every input/output contract visible and inspectable.</td></tr>
720
  <tr><td><strong>Neural heads</strong></td><td>PyTorch MLP classifiers/regressors under <a href="results/episode_task_suite/neural_mlp/">neural_mlp/</a></td><td>Checks whether nonlinear heads improve each task without changing features.</td></tr>
721
  <tr><td><strong>Evidence</strong></td><td>metrics<br>predictions<br>confusion matrices<br>diagrams<br>dashboard</td><td>Makes the single-episode task development inspectable without rerunning first.</td></tr>
722
- <tr><td><strong>Artifact guide</strong></td><td><a href="ARTIFACT_GUIDE.md">ARTIFACT_GUIDE.md</a></td><td>Groups the public evidence into research-project layers after the first-pass overview.</td></tr>
723
  <tr><td><strong>Reproducibility contract</strong></td><td><a href="REPRODUCIBILITY.md">REPRODUCIBILITY.md</a><br><a href="docs/data/reproducibility_matrix.json">reproducibility_matrix.json</a></td><td>States public commands, expected outputs, exact-match reproduction evidence, and non-reproducible boundaries.</td></tr>
724
  <tr><td><strong>Citation metadata</strong></td><td><a href="CITATION.cff">CITATION.cff</a><br><a href="codemeta.json">codemeta.json</a><br><a href="LICENSE">LICENSE</a></td><td>Makes the repo easier to cite, index, and reuse without confusing code license and dataset terms.</td></tr>
725
  </tbody>
@@ -842,9 +846,10 @@ overlays exact labels, dimensions, and metrics from the committed result files.
842
 
843
  ## Scope
844
 
845
- This is a learning, inspection, and pipeline-validation repo built from one
846
- public sample episode. The next model-quality stage is to run the same suite
847
- over many episodes and split train/test by held-out episode.
 
848
 
849
  ## What Is Inside
850
 
@@ -1090,14 +1095,17 @@ baselines, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future W
1090
  files also include `model_groups`: a model-first view that pairs 1-episode and
1091
  128-episode entries for the same family. Use that section when comparing task
1092
  heads against task heads, Qwen3-Omni smoke/LoRA against Qwen3-Omni LoRA, or
1093
- Cosmos3-Nano compatibility against future Cosmos weight releases.
 
 
 
1094
 
1095
- The no-new-episode enhancement layer is recorded in
1096
  [`docs/data/task_suite_enhancement_128.json`](docs/data/task_suite_enhancement_128.json)
1097
  and [`TASK_SUITE_ENHANCEMENT_128.md`](TASK_SUITE_ENHANCEMENT_128.md). It keeps
1098
  the current Qwen3-Omni v6 and Cosmos3 packages as baselines, then defines dense-window
1099
  scenarios, hierarchical action/subtask targets, task bottlenecks, and experiment
1100
- cards for a stronger 128-episode v5 run without overwriting earlier results.
1101
 
1102
  ### Sample Count Decision
1103
 
 
57
  </p>
58
 
59
 
60
+ **Ropedia Xperience-10M Task Suite** has two public evidence lines. **Line 1** is the 1-sample task lab for raw-file inspection, task construction, and reproducibility. **Line 2** is the selected-128 comparison surface for aligned metadata/raw baselines, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future Window. Every score points to a source artifact and keeps direct-vs-proxy status visible.
61
 
62
  **Updated:** 2026-06-21.
63
 
64
+ **Scope:** Line 1 uses one public sample episode. Line 2 uses selected 128-episode public-safe artifacts linked back to official gated episode paths. Raw Xperience-10M MP4/HDF5/RRD files, Qwen3 base weights, Cosmos3 base weights, and gated data are not redistributed here.
65
 
66
  ## Contents
67
 
 
81
 
82
  Use the two evidence lines first, then choose the artifact that answers your question. The dashboard is the best visual overview; the GitHub repo is the source of truth for scripts and generated JSON; Hugging Face mirrors contain public-safe cards, metrics, figures, and model artifacts.
83
 
84
+ Quick rule: use **Line 1** for “can I inspect and reproduce the task?” Use **Line 2** for “how do aligned baselines and model diagnostics compare on the selected 128 episodes?”
85
+
86
  The multilingual README files are reader guides. The canonical technical evidence is still the committed task contracts, result matrices, validation JSON, and public-safe result packages.
87
 
88
  ## At A Glance
 
96
  </thead>
97
  <tbody>
98
  <tr>
99
+ <td><strong>Two-line contract</strong></td>
100
+ <td><strong>Line 1: 1 sample episode</strong> for task construction and reproducibility. <strong>Line 2: 128 selected episodes</strong> for same-split metadata/raw baselines, Qwen3-Omni v6, and Cosmos3 diagnostics.</td>
101
  </tr>
102
  <tr>
103
  <td><strong>180 method-task records</strong></td>
 
128
 
129
  ## Two Evidence Lines
130
 
131
+ The public suite is organized around two evidence lines. Keep them separate when reading metrics.
132
 
133
  <p align="center">
134
  <img src="docs/assets/charts/two_evidence_line_map.svg" alt="Two evidence-line map: 1 sample episode and 128 selected episodes combine into 180 scored method-task records" width="100%">
 
669
  [`ropedia-ai/xperience-10m-sample`](https://huggingface.co/datasets/ropedia-ai/xperience-10m-sample)
670
  repo provides the sample episode used for the implemented task suite here.
671
 
672
+ This project keeps two evidence lines separate. Line 1 uses the public sample
673
+ for raw-file inspection, task construction, and local reproducibility. Line 2
674
+ uses selected 128-episode public-safe artifacts for same-split method
675
+ comparison, Qwen3-Omni v6 diagnostics, and Cosmos3 diagnostics. Raw
676
+ Xperience-10M MP4/HDF5/RRD files are not redistributed in this repo or in the
677
+ Hugging Face mirrors.
678
 
679
  The current verified public-sample subset is:
680
 
 
689
  [`XPERIENCE10M_DATASET_CARD_ALIGNMENT.md`](XPERIENCE10M_DATASET_CARD_ALIGNMENT.md)
690
  and [`docs/data/xperience10m_dataset_card_alignment.json`](docs/data/xperience10m_dataset_card_alignment.json)
691
  for readers who need the full upstream-card and access-term context. The
692
+ practical boundary is simple: task-lab claims come from Line 1, selected-128
693
+ comparison claims come from Line 2, and compact-proxy cells stay explicitly
694
+ marked where direct raw targets are missing.
695
 
696
  Start with the visual dashboard:
697
 
 
701
 
702
  **[cy0307-ropedia-xperience-10m-task-suite.hf.space](https://cy0307-ropedia-xperience-10m-task-suite.hf.space/)**
703
 
704
+ ## Read This Project By Evidence View
705
 
706
  <table>
707
  <thead>
708
  <tr>
709
+ <th width="24%">View</th>
710
  <th width="34%">What to inspect</th>
711
  <th>Why it matters</th>
712
  </tr>
 
723
  <tr><td><strong>Minimal heads</strong></td><td>softmax<br>ridge projection/regression<br>multi-label logistic heads</td><td>Keeps every input/output contract visible and inspectable.</td></tr>
724
  <tr><td><strong>Neural heads</strong></td><td>PyTorch MLP classifiers/regressors under <a href="results/episode_task_suite/neural_mlp/">neural_mlp/</a></td><td>Checks whether nonlinear heads improve each task without changing features.</td></tr>
725
  <tr><td><strong>Evidence</strong></td><td>metrics<br>predictions<br>confusion matrices<br>diagrams<br>dashboard</td><td>Makes the single-episode task development inspectable without rerunning first.</td></tr>
726
+ <tr><td><strong>Artifact guide</strong></td><td><a href="ARTIFACT_GUIDE.md">ARTIFACT_GUIDE.md</a></td><td>Groups the public evidence into reader-facing views after the first-pass overview.</td></tr>
727
  <tr><td><strong>Reproducibility contract</strong></td><td><a href="REPRODUCIBILITY.md">REPRODUCIBILITY.md</a><br><a href="docs/data/reproducibility_matrix.json">reproducibility_matrix.json</a></td><td>States public commands, expected outputs, exact-match reproduction evidence, and non-reproducible boundaries.</td></tr>
728
  <tr><td><strong>Citation metadata</strong></td><td><a href="CITATION.cff">CITATION.cff</a><br><a href="codemeta.json">codemeta.json</a><br><a href="LICENSE">LICENSE</a></td><td>Makes the repo easier to cite, index, and reuse without confusing code license and dataset terms.</td></tr>
729
  </tbody>
 
846
 
847
  ## Scope
848
 
849
+ This is a learning, inspection, and pipeline-validation repo with two public
850
+ evidence lines. Line 1 is built from one public sample episode. Line 2 uses a
851
+ selected 96/16/16 split over 128 episode paths, public-safe processed features,
852
+ and verified Qwen3-Omni/Cosmos3 diagnostic artifacts.
853
 
854
  ## What Is Inside
855
 
 
1095
  files also include `model_groups`: a model-first view that pairs 1-episode and
1096
  128-episode entries for the same family. Use that section when comparing task
1097
  heads against task heads, Qwen3-Omni smoke/LoRA against Qwen3-Omni LoRA, or
1098
+ Cosmos3-Nano compatibility against future Cosmos weight releases. For
1099
+ Qwen3-Omni specifically, read `QWEN3_OMNI_RUN_LINEAGE.md`: v1-v4 are
1100
+ pipeline-hardening and ablation evidence, v5 is the pinned prior multiscale
1101
+ release, and v6 is the current public 20-task Qwen row.
1102
 
1103
+ The no-new-episode enhancement plan is recorded in
1104
  [`docs/data/task_suite_enhancement_128.json`](docs/data/task_suite_enhancement_128.json)
1105
  and [`TASK_SUITE_ENHANCEMENT_128.md`](TASK_SUITE_ENHANCEMENT_128.md). It keeps
1106
  the current Qwen3-Omni v6 and Cosmos3 packages as baselines, then defines dense-window
1107
  scenarios, hierarchical action/subtask targets, task bottlenecks, and experiment
1108
+ cards for stronger selected-128 runs without overwriting earlier results.
1109
 
1110
  ### Sample Count Decision
1111
 
TWO_EVIDENCE_LINES.md CHANGED
@@ -1,6 +1,6 @@
1
  # Two Evidence Lines
2
 
3
- The public Xperience-10M suite has two result lines. Read them separately.
4
 
5
  ![Two evidence-line map](docs/assets/charts/two_evidence_line_map.svg)
6
 
@@ -9,7 +9,7 @@ Score formula: 2 single-episode methods x 20 tasks = 40 records; 7 selected-128
9
  | Line | Data unit | Score statement | Valid claim | Do not claim |
10
  | --- | --- | --- | --- | --- |
11
  | 1 sample episode | One public sample episode; 5,821 frames; 1,161 aligned 20-frame windows; 8,546 feature dimensions. | 40/40 direct scores from Minimal and Neural MLP heads. | Task construction, raw-file inspection, local reproducibility, and controlled single-episode baselines. | Multi-episode generalization. |
12
- | 128 selected episodes | Selected held-out 96/16/16 split; 34,269 exported windows; public-safe processed features linked to official gated episode paths. | 140/140 selected-128 scores: 134 direct + 6 compact-proxy. | Same-split baseline/model comparison, Qwen3-Omni v6 LoRA diagnostics, Cosmos3-Super/Cosmos3-Nano diagnostics, and scale-up planning. | Reading compact-proxy cells as direct raw-target measurements. |
13
 
14
  ## Result Ledger
15
 
@@ -28,7 +28,7 @@ Score formula: 2 single-episode methods x 20 tasks = 40 records; 7 selected-128
28
  | 128 selected episodes | Qwen3-Omni series | Qwen3-Omni v6 LoRA | 20/20 direct scores from verified selected-128 LoRA and task-specific probes. | Current trainable Qwen3-Omni diagnostic baseline on the selected-128 surface. |
29
  | 128 selected episodes | Cosmos3 series | Cosmos3-Super Reasoner; Cosmos3-Nano Future Window | 40/40 direct scores from verified public-safe reasoner and future-window artifacts. | Cosmos3 reasoner and future-window diagnostics on the selected-128 surface. |
30
 
31
- Qwen3 run v1-v6 is a LoRA/evaluation lineage, not the project-level result layer numbering. The 20-task matrix uses Qwen3-Omni v6 LoRA; v5 remains the pinned prior release. Cosmos3-Super Forward-Dynamics LoRA is a separate adapter artifact and is not counted as a 20-task matrix method row.
32
 
33
  ## Result Files
34
 
@@ -46,7 +46,7 @@ Qwen3 run v1-v6 is a LoRA/evaluation lineage, not the project-level result layer
46
  ## Interpretation Rule
47
 
48
  Use the 1-episode line for task construction and reproducibility claims.
49
- Use the 128-episode line for held-out comparison and model-branch claims.
50
  Do not mix those claims without naming the evidence line.
51
 
52
  ## Reading Order
 
1
  # Two Evidence Lines
2
 
3
+ The public Xperience-10M suite has two evidence lines. Read them separately.
4
 
5
  ![Two evidence-line map](docs/assets/charts/two_evidence_line_map.svg)
6
 
 
9
  | Line | Data unit | Score statement | Valid claim | Do not claim |
10
  | --- | --- | --- | --- | --- |
11
  | 1 sample episode | One public sample episode; 5,821 frames; 1,161 aligned 20-frame windows; 8,546 feature dimensions. | 40/40 direct scores from Minimal and Neural MLP heads. | Task construction, raw-file inspection, local reproducibility, and controlled single-episode baselines. | Multi-episode generalization. |
12
+ | 128 selected episodes | Selected held-out 96/16/16 split; 34,269 exported windows; public-safe processed features linked to official gated episode paths. | 140/140 selected-128 scores: 134 direct + 6 compact-proxy. | Same-split method comparison, Qwen3-Omni v6 LoRA diagnostics, Cosmos3-Super/Cosmos3-Nano diagnostics, and scale-up planning. | Reading compact-proxy cells as direct raw-target measurements. |
13
 
14
  ## Result Ledger
15
 
 
28
  | 128 selected episodes | Qwen3-Omni series | Qwen3-Omni v6 LoRA | 20/20 direct scores from verified selected-128 LoRA and task-specific probes. | Current trainable Qwen3-Omni diagnostic baseline on the selected-128 surface. |
29
  | 128 selected episodes | Cosmos3 series | Cosmos3-Super Reasoner; Cosmos3-Nano Future Window | 40/40 direct scores from verified public-safe reasoner and future-window artifacts. | Cosmos3 reasoner and future-window diagnostics on the selected-128 surface. |
30
 
31
+ Qwen3 run v1-v6 is a LoRA/evaluation lineage inside the 128-episode line, not the project evidence-line numbering. The 20-task matrix uses Qwen3-Omni v6 LoRA; v5 remains the pinned prior release. Cosmos3-Super Forward-Dynamics LoRA is a separate adapter artifact and is not counted as a 20-task matrix method row.
32
 
33
  ## Result Files
34
 
 
46
  ## Interpretation Rule
47
 
48
  Use the 1-episode line for task construction and reproducibility claims.
49
+ Use the 128-episode line for held-out same-split comparison and model-diagnostic claims.
50
  Do not mix those claims without naming the evidence line.
51
 
52
  ## Reading Order
TWO_EVIDENCE_LINE_RESULT_SUMMARY.md CHANGED
@@ -1,6 +1,6 @@
1
  # Two Evidence-Line Result Summary
2
 
3
- Generated: `2026-06-21T10:47:04+00:00`.
4
 
5
  Source matrix: [`docs/data/task_method_20_result_matrix.json`](docs/data/task_method_20_result_matrix.json)
6
 
@@ -8,7 +8,7 @@ Interpretation rule: Use the 1-episode line for task construction and reproducib
8
 
9
  ## Read This First
10
 
11
- The suite has two public result lines. Line 1 is the fully inspectable one-episode task lab. Line 2 is the 128-episode comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Do not mix the two when reading scores.
12
 
13
  Score formula: 2 single-episode methods x 20 tasks = 40 records; 7 selected-128 methods x 20 tasks = 140 records; total public matrix = 180/180 scored records.
14
 
 
1
  # Two Evidence-Line Result Summary
2
 
3
+ Generated: `2026-06-21T11:49:06+00:00`.
4
 
5
  Source matrix: [`docs/data/task_method_20_result_matrix.json`](docs/data/task_method_20_result_matrix.json)
6
 
 
8
 
9
  ## Read This First
10
 
11
+ The suite has two public evidence lines. Line 1 is the fully inspectable one-episode task lab. Line 2 is the 128-episode comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Do not mix the two when reading scores.
12
 
13
  Score formula: 2 single-episode methods x 20 tasks = 40 records; 7 selected-128 methods x 20 tasks = 140 records; total public matrix = 180/180 scored records.
14
 
data/mirror_parity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T11:28:34+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
  "group_count": 1256,
@@ -824,45 +824,45 @@
824
  "local": {
825
  "path": "repo:docs/data/public_reader_map.json",
826
  "exists": true,
827
- "bytes": 5990,
828
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
829
  },
830
  "mirrors": {
831
  "hf_space": {
832
  "path": "hf_space:data/public_reader_map.json",
833
  "exists": true,
834
- "bytes": 5990,
835
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
836
  },
837
  "hf_artifacts_data": {
838
  "path": "hf_artifacts:data/public_reader_map.json",
839
  "exists": true,
840
- "bytes": 5990,
841
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
842
  },
843
  "hf_artifacts": {
844
  "path": "hf_artifacts:docs/data/public_reader_map.json",
845
  "exists": true,
846
- "bytes": 5990,
847
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
848
  },
849
  "hf_model_data": {
850
  "path": "hf_model:data/public_reader_map.json",
851
  "exists": true,
852
- "bytes": 5990,
853
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
854
  },
855
  "hf_model_docs_data": {
856
  "path": "hf_model:docs/data/public_reader_map.json",
857
  "exists": true,
858
- "bytes": 5990,
859
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
860
  },
861
  "hf_model": {
862
  "path": "hf_model:metrics/public_reader_map.json",
863
  "exists": true,
864
- "bytes": 5990,
865
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
866
  }
867
  },
868
  "failures": []
@@ -873,45 +873,45 @@
873
  "local": {
874
  "path": "repo:docs/data/project_status.json",
875
  "exists": true,
876
- "bytes": 23054,
877
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
878
  },
879
  "mirrors": {
880
  "hf_space": {
881
  "path": "hf_space:data/project_status.json",
882
  "exists": true,
883
- "bytes": 23054,
884
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
885
  },
886
  "hf_artifacts_data": {
887
  "path": "hf_artifacts:data/project_status.json",
888
  "exists": true,
889
- "bytes": 23054,
890
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
891
  },
892
  "hf_artifacts": {
893
  "path": "hf_artifacts:docs/data/project_status.json",
894
  "exists": true,
895
- "bytes": 23054,
896
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
897
  },
898
  "hf_model_data": {
899
  "path": "hf_model:data/project_status.json",
900
  "exists": true,
901
- "bytes": 23054,
902
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
903
  },
904
  "hf_model_docs_data": {
905
  "path": "hf_model:docs/data/project_status.json",
906
  "exists": true,
907
- "bytes": 23054,
908
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
909
  },
910
  "hf_model": {
911
  "path": "hf_model:metrics/project_status.json",
912
  "exists": true,
913
- "bytes": 23054,
914
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
915
  }
916
  },
917
  "failures": []
@@ -923,44 +923,44 @@
923
  "path": "repo:docs/data/publication_audit.json",
924
  "exists": true,
925
  "bytes": 10877,
926
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
927
  },
928
  "mirrors": {
929
  "hf_space": {
930
  "path": "hf_space:data/publication_audit.json",
931
  "exists": true,
932
  "bytes": 10877,
933
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
934
  },
935
  "hf_artifacts_data": {
936
  "path": "hf_artifacts:data/publication_audit.json",
937
  "exists": true,
938
  "bytes": 10877,
939
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
940
  },
941
  "hf_artifacts": {
942
  "path": "hf_artifacts:docs/data/publication_audit.json",
943
  "exists": true,
944
  "bytes": 10877,
945
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
946
  },
947
  "hf_model_data": {
948
  "path": "hf_model:data/publication_audit.json",
949
  "exists": true,
950
  "bytes": 10877,
951
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
952
  },
953
  "hf_model_docs_data": {
954
  "path": "hf_model:docs/data/publication_audit.json",
955
  "exists": true,
956
  "bytes": 10877,
957
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
958
  },
959
  "hf_model": {
960
  "path": "hf_model:metrics/publication_audit.json",
961
  "exists": true,
962
  "bytes": 10877,
963
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
964
  }
965
  },
966
  "failures": []
@@ -972,44 +972,44 @@
972
  "path": "repo:docs/data/public_surface_qa.json",
973
  "exists": true,
974
  "bytes": 7363,
975
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
976
  },
977
  "mirrors": {
978
  "hf_space": {
979
  "path": "hf_space:data/public_surface_qa.json",
980
  "exists": true,
981
  "bytes": 7363,
982
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
983
  },
984
  "hf_artifacts_data": {
985
  "path": "hf_artifacts:data/public_surface_qa.json",
986
  "exists": true,
987
  "bytes": 7363,
988
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
989
  },
990
  "hf_artifacts": {
991
  "path": "hf_artifacts:docs/data/public_surface_qa.json",
992
  "exists": true,
993
  "bytes": 7363,
994
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
995
  },
996
  "hf_model_data": {
997
  "path": "hf_model:data/public_surface_qa.json",
998
  "exists": true,
999
  "bytes": 7363,
1000
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
1001
  },
1002
  "hf_model_docs_data": {
1003
  "path": "hf_model:docs/data/public_surface_qa.json",
1004
  "exists": true,
1005
  "bytes": 7363,
1006
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
1007
  },
1008
  "hf_model": {
1009
  "path": "hf_model:metrics/public_surface_qa.json",
1010
  "exists": true,
1011
  "bytes": 7363,
1012
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
1013
  }
1014
  },
1015
  "failures": []
@@ -1070,44 +1070,44 @@
1070
  "path": "repo:docs/data/qwen3_omni_run_lineage.json",
1071
  "exists": true,
1072
  "bytes": 11468,
1073
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1074
  },
1075
  "mirrors": {
1076
  "hf_space": {
1077
  "path": "hf_space:data/qwen3_omni_run_lineage.json",
1078
  "exists": true,
1079
  "bytes": 11468,
1080
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1081
  },
1082
  "hf_artifacts_data": {
1083
  "path": "hf_artifacts:data/qwen3_omni_run_lineage.json",
1084
  "exists": true,
1085
  "bytes": 11468,
1086
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1087
  },
1088
  "hf_artifacts": {
1089
  "path": "hf_artifacts:docs/data/qwen3_omni_run_lineage.json",
1090
  "exists": true,
1091
  "bytes": 11468,
1092
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1093
  },
1094
  "hf_model_data": {
1095
  "path": "hf_model:data/qwen3_omni_run_lineage.json",
1096
  "exists": true,
1097
  "bytes": 11468,
1098
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1099
  },
1100
  "hf_model_docs_data": {
1101
  "path": "hf_model:docs/data/qwen3_omni_run_lineage.json",
1102
  "exists": true,
1103
  "bytes": 11468,
1104
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1105
  },
1106
  "hf_model": {
1107
  "path": "hf_model:metrics/qwen3_omni_run_lineage.json",
1108
  "exists": true,
1109
  "bytes": 11468,
1110
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1111
  }
1112
  },
1113
  "failures": []
@@ -1168,44 +1168,44 @@
1168
  "path": "repo:docs/data/quality_gates.json",
1169
  "exists": true,
1170
  "bytes": 8640,
1171
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1172
  },
1173
  "mirrors": {
1174
  "hf_space": {
1175
  "path": "hf_space:data/quality_gates.json",
1176
  "exists": true,
1177
  "bytes": 8640,
1178
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1179
  },
1180
  "hf_artifacts_data": {
1181
  "path": "hf_artifacts:data/quality_gates.json",
1182
  "exists": true,
1183
  "bytes": 8640,
1184
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1185
  },
1186
  "hf_artifacts": {
1187
  "path": "hf_artifacts:docs/data/quality_gates.json",
1188
  "exists": true,
1189
  "bytes": 8640,
1190
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1191
  },
1192
  "hf_model_data": {
1193
  "path": "hf_model:data/quality_gates.json",
1194
  "exists": true,
1195
  "bytes": 8640,
1196
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1197
  },
1198
  "hf_model_docs_data": {
1199
  "path": "hf_model:docs/data/quality_gates.json",
1200
  "exists": true,
1201
  "bytes": 8640,
1202
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1203
  },
1204
  "hf_model": {
1205
  "path": "hf_model:metrics/quality_gates.json",
1206
  "exists": true,
1207
  "bytes": 8640,
1208
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1209
  }
1210
  },
1211
  "failures": []
@@ -1609,44 +1609,44 @@
1609
  "path": "repo:docs/data/scope_claims_audit.json",
1610
  "exists": true,
1611
  "bytes": 21313,
1612
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1613
  },
1614
  "mirrors": {
1615
  "hf_space": {
1616
  "path": "hf_space:data/scope_claims_audit.json",
1617
  "exists": true,
1618
  "bytes": 21313,
1619
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1620
  },
1621
  "hf_artifacts_data": {
1622
  "path": "hf_artifacts:data/scope_claims_audit.json",
1623
  "exists": true,
1624
  "bytes": 21313,
1625
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1626
  },
1627
  "hf_artifacts": {
1628
  "path": "hf_artifacts:docs/data/scope_claims_audit.json",
1629
  "exists": true,
1630
  "bytes": 21313,
1631
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1632
  },
1633
  "hf_model_data": {
1634
  "path": "hf_model:data/scope_claims_audit.json",
1635
  "exists": true,
1636
  "bytes": 21313,
1637
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1638
  },
1639
  "hf_model_docs_data": {
1640
  "path": "hf_model:docs/data/scope_claims_audit.json",
1641
  "exists": true,
1642
  "bytes": 21313,
1643
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1644
  },
1645
  "hf_model": {
1646
  "path": "hf_model:metrics/scope_claims_audit.json",
1647
  "exists": true,
1648
  "bytes": 21313,
1649
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1650
  }
1651
  },
1652
  "failures": []
@@ -1707,44 +1707,44 @@
1707
  "path": "repo:docs/data/source_alignment_audit.json",
1708
  "exists": true,
1709
  "bytes": 4432,
1710
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1711
  },
1712
  "mirrors": {
1713
  "hf_space": {
1714
  "path": "hf_space:data/source_alignment_audit.json",
1715
  "exists": true,
1716
  "bytes": 4432,
1717
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1718
  },
1719
  "hf_artifacts_data": {
1720
  "path": "hf_artifacts:data/source_alignment_audit.json",
1721
  "exists": true,
1722
  "bytes": 4432,
1723
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1724
  },
1725
  "hf_artifacts": {
1726
  "path": "hf_artifacts:docs/data/source_alignment_audit.json",
1727
  "exists": true,
1728
  "bytes": 4432,
1729
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1730
  },
1731
  "hf_model_data": {
1732
  "path": "hf_model:data/source_alignment_audit.json",
1733
  "exists": true,
1734
  "bytes": 4432,
1735
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1736
  },
1737
  "hf_model_docs_data": {
1738
  "path": "hf_model:docs/data/source_alignment_audit.json",
1739
  "exists": true,
1740
  "bytes": 4432,
1741
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1742
  },
1743
  "hf_model": {
1744
  "path": "hf_model:metrics/source_alignment_audit.json",
1745
  "exists": true,
1746
  "bytes": 4432,
1747
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1748
  }
1749
  },
1750
  "failures": []
@@ -1951,45 +1951,45 @@
1951
  "local": {
1952
  "path": "repo:docs/data/two_evidence_lines.json",
1953
  "exists": true,
1954
- "bytes": 7349,
1955
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1956
  },
1957
  "mirrors": {
1958
  "hf_space": {
1959
  "path": "hf_space:data/two_evidence_lines.json",
1960
  "exists": true,
1961
- "bytes": 7349,
1962
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1963
  },
1964
  "hf_artifacts_data": {
1965
  "path": "hf_artifacts:data/two_evidence_lines.json",
1966
  "exists": true,
1967
- "bytes": 7349,
1968
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1969
  },
1970
  "hf_artifacts": {
1971
  "path": "hf_artifacts:docs/data/two_evidence_lines.json",
1972
  "exists": true,
1973
- "bytes": 7349,
1974
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1975
  },
1976
  "hf_model_data": {
1977
  "path": "hf_model:data/two_evidence_lines.json",
1978
  "exists": true,
1979
- "bytes": 7349,
1980
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1981
  },
1982
  "hf_model_docs_data": {
1983
  "path": "hf_model:docs/data/two_evidence_lines.json",
1984
  "exists": true,
1985
- "bytes": 7349,
1986
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1987
  },
1988
  "hf_model": {
1989
  "path": "hf_model:metrics/two_evidence_lines.json",
1990
  "exists": true,
1991
- "bytes": 7349,
1992
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1993
  }
1994
  },
1995
  "failures": []
@@ -2000,45 +2000,45 @@
2000
  "local": {
2001
  "path": "repo:docs/data/two_evidence_line_result_summary.json",
2002
  "exists": true,
2003
- "bytes": 17414,
2004
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2005
  },
2006
  "mirrors": {
2007
  "hf_space": {
2008
  "path": "hf_space:data/two_evidence_line_result_summary.json",
2009
  "exists": true,
2010
- "bytes": 17414,
2011
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2012
  },
2013
  "hf_artifacts_data": {
2014
  "path": "hf_artifacts:data/two_evidence_line_result_summary.json",
2015
  "exists": true,
2016
- "bytes": 17414,
2017
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2018
  },
2019
  "hf_artifacts": {
2020
  "path": "hf_artifacts:docs/data/two_evidence_line_result_summary.json",
2021
  "exists": true,
2022
- "bytes": 17414,
2023
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2024
  },
2025
  "hf_model_data": {
2026
  "path": "hf_model:data/two_evidence_line_result_summary.json",
2027
  "exists": true,
2028
- "bytes": 17414,
2029
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2030
  },
2031
  "hf_model_docs_data": {
2032
  "path": "hf_model:docs/data/two_evidence_line_result_summary.json",
2033
  "exists": true,
2034
- "bytes": 17414,
2035
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2036
  },
2037
  "hf_model": {
2038
  "path": "hf_model:metrics/two_evidence_line_result_summary.json",
2039
  "exists": true,
2040
- "bytes": 17414,
2041
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2042
  }
2043
  },
2044
  "failures": []
@@ -2148,44 +2148,44 @@
2148
  "path": "repo:docs/data/task_surface_integrity.json",
2149
  "exists": true,
2150
  "bytes": 46246,
2151
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2152
  },
2153
  "mirrors": {
2154
  "hf_space": {
2155
  "path": "hf_space:data/task_surface_integrity.json",
2156
  "exists": true,
2157
  "bytes": 46246,
2158
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2159
  },
2160
  "hf_artifacts_data": {
2161
  "path": "hf_artifacts:data/task_surface_integrity.json",
2162
  "exists": true,
2163
  "bytes": 46246,
2164
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2165
  },
2166
  "hf_artifacts": {
2167
  "path": "hf_artifacts:docs/data/task_surface_integrity.json",
2168
  "exists": true,
2169
  "bytes": 46246,
2170
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2171
  },
2172
  "hf_model_data": {
2173
  "path": "hf_model:data/task_surface_integrity.json",
2174
  "exists": true,
2175
  "bytes": 46246,
2176
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2177
  },
2178
  "hf_model_docs_data": {
2179
  "path": "hf_model:docs/data/task_surface_integrity.json",
2180
  "exists": true,
2181
  "bytes": 46246,
2182
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2183
  },
2184
  "hf_model": {
2185
  "path": "hf_model:metrics/task_surface_integrity.json",
2186
  "exists": true,
2187
  "bytes": 46246,
2188
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2189
  }
2190
  },
2191
  "failures": []
@@ -2344,44 +2344,44 @@
2344
  "path": "repo:docs/data/task_method_20_source_audit.json",
2345
  "exists": true,
2346
  "bytes": 561,
2347
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2348
  },
2349
  "mirrors": {
2350
  "hf_space": {
2351
  "path": "hf_space:data/task_method_20_source_audit.json",
2352
  "exists": true,
2353
  "bytes": 561,
2354
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2355
  },
2356
  "hf_artifacts_data": {
2357
  "path": "hf_artifacts:data/task_method_20_source_audit.json",
2358
  "exists": true,
2359
  "bytes": 561,
2360
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2361
  },
2362
  "hf_artifacts": {
2363
  "path": "hf_artifacts:docs/data/task_method_20_source_audit.json",
2364
  "exists": true,
2365
  "bytes": 561,
2366
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2367
  },
2368
  "hf_model_data": {
2369
  "path": "hf_model:data/task_method_20_source_audit.json",
2370
  "exists": true,
2371
  "bytes": 561,
2372
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2373
  },
2374
  "hf_model_docs_data": {
2375
  "path": "hf_model:docs/data/task_method_20_source_audit.json",
2376
  "exists": true,
2377
  "bytes": 561,
2378
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2379
  },
2380
  "hf_model": {
2381
  "path": "hf_model:metrics/task_method_20_source_audit.json",
2382
  "exists": true,
2383
  "bytes": 561,
2384
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2385
  }
2386
  },
2387
  "failures": []
@@ -2491,44 +2491,44 @@
2491
  "path": "repo:docs/data/website_integrity.json",
2492
  "exists": true,
2493
  "bytes": 20658,
2494
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2495
  },
2496
  "mirrors": {
2497
  "hf_space": {
2498
  "path": "hf_space:data/website_integrity.json",
2499
  "exists": true,
2500
  "bytes": 20658,
2501
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2502
  },
2503
  "hf_artifacts_data": {
2504
  "path": "hf_artifacts:data/website_integrity.json",
2505
  "exists": true,
2506
  "bytes": 20658,
2507
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2508
  },
2509
  "hf_artifacts": {
2510
  "path": "hf_artifacts:docs/data/website_integrity.json",
2511
  "exists": true,
2512
  "bytes": 20658,
2513
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2514
  },
2515
  "hf_model_data": {
2516
  "path": "hf_model:data/website_integrity.json",
2517
  "exists": true,
2518
  "bytes": 20658,
2519
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2520
  },
2521
  "hf_model_docs_data": {
2522
  "path": "hf_model:docs/data/website_integrity.json",
2523
  "exists": true,
2524
  "bytes": 20658,
2525
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2526
  },
2527
  "hf_model": {
2528
  "path": "hf_model:metrics/website_integrity.json",
2529
  "exists": true,
2530
  "bytes": 20658,
2531
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2532
  }
2533
  },
2534
  "failures": []
@@ -5503,21 +5503,21 @@
5503
  "local": {
5504
  "path": "repo:scripts/build_multilingual_public_readmes.py",
5505
  "exists": true,
5506
- "bytes": 51318,
5507
- "sha256": "12e32d1395d32f54fcc946826ff41658641b7dfadaef763db824ff9b8460e9f8"
5508
  },
5509
  "mirrors": {
5510
  "hf_artifacts": {
5511
  "path": "hf_artifacts:scripts/build_multilingual_public_readmes.py",
5512
  "exists": true,
5513
- "bytes": 51318,
5514
- "sha256": "12e32d1395d32f54fcc946826ff41658641b7dfadaef763db824ff9b8460e9f8"
5515
  },
5516
  "hf_model": {
5517
  "path": "hf_model:scripts/build_multilingual_public_readmes.py",
5518
  "exists": true,
5519
- "bytes": 51318,
5520
- "sha256": "12e32d1395d32f54fcc946826ff41658641b7dfadaef763db824ff9b8460e9f8"
5521
  }
5522
  },
5523
  "failures": []
@@ -6282,39 +6282,39 @@
6282
  "local": {
6283
  "path": "repo:docs/index.html",
6284
  "exists": true,
6285
- "bytes": 300323,
6286
- "sha256": "8237864aa6a7717d6d3d3395436a9477ed78fbf7fe5fd36ecdf55b675c68b838"
6287
  },
6288
  "mirrors": {
6289
  "hf_space": {
6290
  "path": "hf_space:index.html",
6291
  "exists": true,
6292
- "bytes": 300323,
6293
- "sha256": "8237864aa6a7717d6d3d3395436a9477ed78fbf7fe5fd36ecdf55b675c68b838"
6294
  },
6295
  "hf_artifacts_root": {
6296
  "path": "hf_artifacts:index.html",
6297
  "exists": true,
6298
- "bytes": 300323,
6299
- "sha256": "8237864aa6a7717d6d3d3395436a9477ed78fbf7fe5fd36ecdf55b675c68b838"
6300
  },
6301
  "hf_artifacts_docs": {
6302
  "path": "hf_artifacts:docs/index.html",
6303
  "exists": true,
6304
- "bytes": 300323,
6305
- "sha256": "8237864aa6a7717d6d3d3395436a9477ed78fbf7fe5fd36ecdf55b675c68b838"
6306
  },
6307
  "hf_model": {
6308
  "path": "hf_model:index.html",
6309
  "exists": true,
6310
- "bytes": 300323,
6311
- "sha256": "8237864aa6a7717d6d3d3395436a9477ed78fbf7fe5fd36ecdf55b675c68b838"
6312
  },
6313
  "hf_model_docs": {
6314
  "path": "hf_model:docs/index.html",
6315
  "exists": true,
6316
- "bytes": 300323,
6317
- "sha256": "8237864aa6a7717d6d3d3395436a9477ed78fbf7fe5fd36ecdf55b675c68b838"
6318
  }
6319
  },
6320
  "failures": []
@@ -31102,27 +31102,27 @@
31102
  "local": {
31103
  "path": "repo:TWO_EVIDENCE_LINES.md",
31104
  "exists": true,
31105
- "bytes": 4293,
31106
- "sha256": "f7c47021534898a4cf2d047be78931a88f9cf1efa98f740d7573fdc67d9c2294"
31107
  },
31108
  "mirrors": {
31109
  "hf_space": {
31110
  "path": "hf_space:TWO_EVIDENCE_LINES.md",
31111
  "exists": true,
31112
- "bytes": 4293,
31113
- "sha256": "f7c47021534898a4cf2d047be78931a88f9cf1efa98f740d7573fdc67d9c2294"
31114
  },
31115
  "hf_artifacts": {
31116
  "path": "hf_artifacts:TWO_EVIDENCE_LINES.md",
31117
  "exists": true,
31118
- "bytes": 4293,
31119
- "sha256": "f7c47021534898a4cf2d047be78931a88f9cf1efa98f740d7573fdc67d9c2294"
31120
  },
31121
  "hf_model": {
31122
  "path": "hf_model:TWO_EVIDENCE_LINES.md",
31123
  "exists": true,
31124
- "bytes": 4293,
31125
- "sha256": "f7c47021534898a4cf2d047be78931a88f9cf1efa98f740d7573fdc67d9c2294"
31126
  }
31127
  },
31128
  "failures": []
@@ -31133,27 +31133,27 @@
31133
  "local": {
31134
  "path": "repo:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31135
  "exists": true,
31136
- "bytes": 8724,
31137
- "sha256": "ae09f8d4fccbb4e5d9953bf98b65d619a092f844f892801deeb095ba0ac148e3"
31138
  },
31139
  "mirrors": {
31140
  "hf_space": {
31141
  "path": "hf_space:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31142
  "exists": true,
31143
- "bytes": 8724,
31144
- "sha256": "ae09f8d4fccbb4e5d9953bf98b65d619a092f844f892801deeb095ba0ac148e3"
31145
  },
31146
  "hf_artifacts": {
31147
  "path": "hf_artifacts:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31148
  "exists": true,
31149
- "bytes": 8724,
31150
- "sha256": "ae09f8d4fccbb4e5d9953bf98b65d619a092f844f892801deeb095ba0ac148e3"
31151
  },
31152
  "hf_model": {
31153
  "path": "hf_model:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31154
  "exists": true,
31155
- "bytes": 8724,
31156
- "sha256": "ae09f8d4fccbb4e5d9953bf98b65d619a092f844f892801deeb095ba0ac148e3"
31157
  }
31158
  },
31159
  "failures": []
@@ -31165,26 +31165,26 @@
31165
  "path": "repo:QWEN3_OMNI_RUN_LINEAGE.md",
31166
  "exists": true,
31167
  "bytes": 6287,
31168
- "sha256": "07e3d0576131200d15d31dc824303fe9d87cd93050725dd06aebedbc1388aa98"
31169
  },
31170
  "mirrors": {
31171
  "hf_space": {
31172
  "path": "hf_space:QWEN3_OMNI_RUN_LINEAGE.md",
31173
  "exists": true,
31174
  "bytes": 6287,
31175
- "sha256": "07e3d0576131200d15d31dc824303fe9d87cd93050725dd06aebedbc1388aa98"
31176
  },
31177
  "hf_artifacts": {
31178
  "path": "hf_artifacts:QWEN3_OMNI_RUN_LINEAGE.md",
31179
  "exists": true,
31180
  "bytes": 6287,
31181
- "sha256": "07e3d0576131200d15d31dc824303fe9d87cd93050725dd06aebedbc1388aa98"
31182
  },
31183
  "hf_model": {
31184
  "path": "hf_model:QWEN3_OMNI_RUN_LINEAGE.md",
31185
  "exists": true,
31186
  "bytes": 6287,
31187
- "sha256": "07e3d0576131200d15d31dc824303fe9d87cd93050725dd06aebedbc1388aa98"
31188
  }
31189
  },
31190
  "failures": []
@@ -31195,27 +31195,27 @@
31195
  "local": {
31196
  "path": "repo:PROJECT_BRIEF.md",
31197
  "exists": true,
31198
- "bytes": 4343,
31199
- "sha256": "fa002a54a46f1789eab79c74b5203e03e9389ff83abea7dd491fd20a676f1da5"
31200
  },
31201
  "mirrors": {
31202
  "hf_space": {
31203
  "path": "hf_space:PROJECT_BRIEF.md",
31204
  "exists": true,
31205
- "bytes": 4343,
31206
- "sha256": "fa002a54a46f1789eab79c74b5203e03e9389ff83abea7dd491fd20a676f1da5"
31207
  },
31208
  "hf_artifacts": {
31209
  "path": "hf_artifacts:PROJECT_BRIEF.md",
31210
  "exists": true,
31211
- "bytes": 4343,
31212
- "sha256": "fa002a54a46f1789eab79c74b5203e03e9389ff83abea7dd491fd20a676f1da5"
31213
  },
31214
  "hf_model": {
31215
  "path": "hf_model:PROJECT_BRIEF.md",
31216
  "exists": true,
31217
- "bytes": 4343,
31218
- "sha256": "fa002a54a46f1789eab79c74b5203e03e9389ff83abea7dd491fd20a676f1da5"
31219
  }
31220
  },
31221
  "failures": []
@@ -31226,27 +31226,27 @@
31226
  "local": {
31227
  "path": "repo:PUBLIC_READER_MAP.md",
31228
  "exists": true,
31229
- "bytes": 4948,
31230
- "sha256": "7a7128fdde08f770338c3fe2d473565918c5633f948dec6a78a6b2a67938e91a"
31231
  },
31232
  "mirrors": {
31233
  "hf_space": {
31234
  "path": "hf_space:PUBLIC_READER_MAP.md",
31235
  "exists": true,
31236
- "bytes": 4948,
31237
- "sha256": "7a7128fdde08f770338c3fe2d473565918c5633f948dec6a78a6b2a67938e91a"
31238
  },
31239
  "hf_artifacts": {
31240
  "path": "hf_artifacts:PUBLIC_READER_MAP.md",
31241
  "exists": true,
31242
- "bytes": 4948,
31243
- "sha256": "7a7128fdde08f770338c3fe2d473565918c5633f948dec6a78a6b2a67938e91a"
31244
  },
31245
  "hf_model": {
31246
  "path": "hf_model:PUBLIC_READER_MAP.md",
31247
  "exists": true,
31248
- "bytes": 4948,
31249
- "sha256": "7a7128fdde08f770338c3fe2d473565918c5633f948dec6a78a6b2a67938e91a"
31250
  }
31251
  },
31252
  "failures": []
@@ -31319,27 +31319,27 @@
31319
  "local": {
31320
  "path": "repo:PROJECT_STATUS.md",
31321
  "exists": true,
31322
- "bytes": 14845,
31323
- "sha256": "128daeed89b672d89b4c422956bf4900ffe8efd54356ec657fb9cf6dcb880ba5"
31324
  },
31325
  "mirrors": {
31326
  "hf_space": {
31327
  "path": "hf_space:PROJECT_STATUS.md",
31328
  "exists": true,
31329
- "bytes": 14845,
31330
- "sha256": "128daeed89b672d89b4c422956bf4900ffe8efd54356ec657fb9cf6dcb880ba5"
31331
  },
31332
  "hf_artifacts": {
31333
  "path": "hf_artifacts:PROJECT_STATUS.md",
31334
  "exists": true,
31335
- "bytes": 14845,
31336
- "sha256": "128daeed89b672d89b4c422956bf4900ffe8efd54356ec657fb9cf6dcb880ba5"
31337
  },
31338
  "hf_model": {
31339
  "path": "hf_model:PROJECT_STATUS.md",
31340
  "exists": true,
31341
- "bytes": 14845,
31342
- "sha256": "128daeed89b672d89b4c422956bf4900ffe8efd54356ec657fb9cf6dcb880ba5"
31343
  }
31344
  },
31345
  "failures": []
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T11:53:22+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
  "group_count": 1256,
 
824
  "local": {
825
  "path": "repo:docs/data/public_reader_map.json",
826
  "exists": true,
827
+ "bytes": 6757,
828
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
829
  },
830
  "mirrors": {
831
  "hf_space": {
832
  "path": "hf_space:data/public_reader_map.json",
833
  "exists": true,
834
+ "bytes": 6757,
835
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
836
  },
837
  "hf_artifacts_data": {
838
  "path": "hf_artifacts:data/public_reader_map.json",
839
  "exists": true,
840
+ "bytes": 6757,
841
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
842
  },
843
  "hf_artifacts": {
844
  "path": "hf_artifacts:docs/data/public_reader_map.json",
845
  "exists": true,
846
+ "bytes": 6757,
847
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
848
  },
849
  "hf_model_data": {
850
  "path": "hf_model:data/public_reader_map.json",
851
  "exists": true,
852
+ "bytes": 6757,
853
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
854
  },
855
  "hf_model_docs_data": {
856
  "path": "hf_model:docs/data/public_reader_map.json",
857
  "exists": true,
858
+ "bytes": 6757,
859
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
860
  },
861
  "hf_model": {
862
  "path": "hf_model:metrics/public_reader_map.json",
863
  "exists": true,
864
+ "bytes": 6757,
865
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
866
  }
867
  },
868
  "failures": []
 
873
  "local": {
874
  "path": "repo:docs/data/project_status.json",
875
  "exists": true,
876
+ "bytes": 23255,
877
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
878
  },
879
  "mirrors": {
880
  "hf_space": {
881
  "path": "hf_space:data/project_status.json",
882
  "exists": true,
883
+ "bytes": 23255,
884
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
885
  },
886
  "hf_artifacts_data": {
887
  "path": "hf_artifacts:data/project_status.json",
888
  "exists": true,
889
+ "bytes": 23255,
890
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
891
  },
892
  "hf_artifacts": {
893
  "path": "hf_artifacts:docs/data/project_status.json",
894
  "exists": true,
895
+ "bytes": 23255,
896
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
897
  },
898
  "hf_model_data": {
899
  "path": "hf_model:data/project_status.json",
900
  "exists": true,
901
+ "bytes": 23255,
902
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
903
  },
904
  "hf_model_docs_data": {
905
  "path": "hf_model:docs/data/project_status.json",
906
  "exists": true,
907
+ "bytes": 23255,
908
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
909
  },
910
  "hf_model": {
911
  "path": "hf_model:metrics/project_status.json",
912
  "exists": true,
913
+ "bytes": 23255,
914
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
915
  }
916
  },
917
  "failures": []
 
923
  "path": "repo:docs/data/publication_audit.json",
924
  "exists": true,
925
  "bytes": 10877,
926
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
927
  },
928
  "mirrors": {
929
  "hf_space": {
930
  "path": "hf_space:data/publication_audit.json",
931
  "exists": true,
932
  "bytes": 10877,
933
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
934
  },
935
  "hf_artifacts_data": {
936
  "path": "hf_artifacts:data/publication_audit.json",
937
  "exists": true,
938
  "bytes": 10877,
939
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
940
  },
941
  "hf_artifacts": {
942
  "path": "hf_artifacts:docs/data/publication_audit.json",
943
  "exists": true,
944
  "bytes": 10877,
945
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
946
  },
947
  "hf_model_data": {
948
  "path": "hf_model:data/publication_audit.json",
949
  "exists": true,
950
  "bytes": 10877,
951
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
952
  },
953
  "hf_model_docs_data": {
954
  "path": "hf_model:docs/data/publication_audit.json",
955
  "exists": true,
956
  "bytes": 10877,
957
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
958
  },
959
  "hf_model": {
960
  "path": "hf_model:metrics/publication_audit.json",
961
  "exists": true,
962
  "bytes": 10877,
963
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
964
  }
965
  },
966
  "failures": []
 
972
  "path": "repo:docs/data/public_surface_qa.json",
973
  "exists": true,
974
  "bytes": 7363,
975
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
976
  },
977
  "mirrors": {
978
  "hf_space": {
979
  "path": "hf_space:data/public_surface_qa.json",
980
  "exists": true,
981
  "bytes": 7363,
982
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
983
  },
984
  "hf_artifacts_data": {
985
  "path": "hf_artifacts:data/public_surface_qa.json",
986
  "exists": true,
987
  "bytes": 7363,
988
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
989
  },
990
  "hf_artifacts": {
991
  "path": "hf_artifacts:docs/data/public_surface_qa.json",
992
  "exists": true,
993
  "bytes": 7363,
994
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
995
  },
996
  "hf_model_data": {
997
  "path": "hf_model:data/public_surface_qa.json",
998
  "exists": true,
999
  "bytes": 7363,
1000
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
1001
  },
1002
  "hf_model_docs_data": {
1003
  "path": "hf_model:docs/data/public_surface_qa.json",
1004
  "exists": true,
1005
  "bytes": 7363,
1006
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
1007
  },
1008
  "hf_model": {
1009
  "path": "hf_model:metrics/public_surface_qa.json",
1010
  "exists": true,
1011
  "bytes": 7363,
1012
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
1013
  }
1014
  },
1015
  "failures": []
 
1070
  "path": "repo:docs/data/qwen3_omni_run_lineage.json",
1071
  "exists": true,
1072
  "bytes": 11468,
1073
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1074
  },
1075
  "mirrors": {
1076
  "hf_space": {
1077
  "path": "hf_space:data/qwen3_omni_run_lineage.json",
1078
  "exists": true,
1079
  "bytes": 11468,
1080
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1081
  },
1082
  "hf_artifacts_data": {
1083
  "path": "hf_artifacts:data/qwen3_omni_run_lineage.json",
1084
  "exists": true,
1085
  "bytes": 11468,
1086
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1087
  },
1088
  "hf_artifacts": {
1089
  "path": "hf_artifacts:docs/data/qwen3_omni_run_lineage.json",
1090
  "exists": true,
1091
  "bytes": 11468,
1092
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1093
  },
1094
  "hf_model_data": {
1095
  "path": "hf_model:data/qwen3_omni_run_lineage.json",
1096
  "exists": true,
1097
  "bytes": 11468,
1098
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1099
  },
1100
  "hf_model_docs_data": {
1101
  "path": "hf_model:docs/data/qwen3_omni_run_lineage.json",
1102
  "exists": true,
1103
  "bytes": 11468,
1104
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1105
  },
1106
  "hf_model": {
1107
  "path": "hf_model:metrics/qwen3_omni_run_lineage.json",
1108
  "exists": true,
1109
  "bytes": 11468,
1110
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1111
  }
1112
  },
1113
  "failures": []
 
1168
  "path": "repo:docs/data/quality_gates.json",
1169
  "exists": true,
1170
  "bytes": 8640,
1171
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1172
  },
1173
  "mirrors": {
1174
  "hf_space": {
1175
  "path": "hf_space:data/quality_gates.json",
1176
  "exists": true,
1177
  "bytes": 8640,
1178
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1179
  },
1180
  "hf_artifacts_data": {
1181
  "path": "hf_artifacts:data/quality_gates.json",
1182
  "exists": true,
1183
  "bytes": 8640,
1184
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1185
  },
1186
  "hf_artifacts": {
1187
  "path": "hf_artifacts:docs/data/quality_gates.json",
1188
  "exists": true,
1189
  "bytes": 8640,
1190
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1191
  },
1192
  "hf_model_data": {
1193
  "path": "hf_model:data/quality_gates.json",
1194
  "exists": true,
1195
  "bytes": 8640,
1196
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1197
  },
1198
  "hf_model_docs_data": {
1199
  "path": "hf_model:docs/data/quality_gates.json",
1200
  "exists": true,
1201
  "bytes": 8640,
1202
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1203
  },
1204
  "hf_model": {
1205
  "path": "hf_model:metrics/quality_gates.json",
1206
  "exists": true,
1207
  "bytes": 8640,
1208
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1209
  }
1210
  },
1211
  "failures": []
 
1609
  "path": "repo:docs/data/scope_claims_audit.json",
1610
  "exists": true,
1611
  "bytes": 21313,
1612
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1613
  },
1614
  "mirrors": {
1615
  "hf_space": {
1616
  "path": "hf_space:data/scope_claims_audit.json",
1617
  "exists": true,
1618
  "bytes": 21313,
1619
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1620
  },
1621
  "hf_artifacts_data": {
1622
  "path": "hf_artifacts:data/scope_claims_audit.json",
1623
  "exists": true,
1624
  "bytes": 21313,
1625
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1626
  },
1627
  "hf_artifacts": {
1628
  "path": "hf_artifacts:docs/data/scope_claims_audit.json",
1629
  "exists": true,
1630
  "bytes": 21313,
1631
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1632
  },
1633
  "hf_model_data": {
1634
  "path": "hf_model:data/scope_claims_audit.json",
1635
  "exists": true,
1636
  "bytes": 21313,
1637
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1638
  },
1639
  "hf_model_docs_data": {
1640
  "path": "hf_model:docs/data/scope_claims_audit.json",
1641
  "exists": true,
1642
  "bytes": 21313,
1643
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1644
  },
1645
  "hf_model": {
1646
  "path": "hf_model:metrics/scope_claims_audit.json",
1647
  "exists": true,
1648
  "bytes": 21313,
1649
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1650
  }
1651
  },
1652
  "failures": []
 
1707
  "path": "repo:docs/data/source_alignment_audit.json",
1708
  "exists": true,
1709
  "bytes": 4432,
1710
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1711
  },
1712
  "mirrors": {
1713
  "hf_space": {
1714
  "path": "hf_space:data/source_alignment_audit.json",
1715
  "exists": true,
1716
  "bytes": 4432,
1717
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1718
  },
1719
  "hf_artifacts_data": {
1720
  "path": "hf_artifacts:data/source_alignment_audit.json",
1721
  "exists": true,
1722
  "bytes": 4432,
1723
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1724
  },
1725
  "hf_artifacts": {
1726
  "path": "hf_artifacts:docs/data/source_alignment_audit.json",
1727
  "exists": true,
1728
  "bytes": 4432,
1729
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1730
  },
1731
  "hf_model_data": {
1732
  "path": "hf_model:data/source_alignment_audit.json",
1733
  "exists": true,
1734
  "bytes": 4432,
1735
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1736
  },
1737
  "hf_model_docs_data": {
1738
  "path": "hf_model:docs/data/source_alignment_audit.json",
1739
  "exists": true,
1740
  "bytes": 4432,
1741
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1742
  },
1743
  "hf_model": {
1744
  "path": "hf_model:metrics/source_alignment_audit.json",
1745
  "exists": true,
1746
  "bytes": 4432,
1747
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1748
  }
1749
  },
1750
  "failures": []
 
1951
  "local": {
1952
  "path": "repo:docs/data/two_evidence_lines.json",
1953
  "exists": true,
1954
+ "bytes": 7351,
1955
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1956
  },
1957
  "mirrors": {
1958
  "hf_space": {
1959
  "path": "hf_space:data/two_evidence_lines.json",
1960
  "exists": true,
1961
+ "bytes": 7351,
1962
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1963
  },
1964
  "hf_artifacts_data": {
1965
  "path": "hf_artifacts:data/two_evidence_lines.json",
1966
  "exists": true,
1967
+ "bytes": 7351,
1968
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1969
  },
1970
  "hf_artifacts": {
1971
  "path": "hf_artifacts:docs/data/two_evidence_lines.json",
1972
  "exists": true,
1973
+ "bytes": 7351,
1974
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1975
  },
1976
  "hf_model_data": {
1977
  "path": "hf_model:data/two_evidence_lines.json",
1978
  "exists": true,
1979
+ "bytes": 7351,
1980
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1981
  },
1982
  "hf_model_docs_data": {
1983
  "path": "hf_model:docs/data/two_evidence_lines.json",
1984
  "exists": true,
1985
+ "bytes": 7351,
1986
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1987
  },
1988
  "hf_model": {
1989
  "path": "hf_model:metrics/two_evidence_lines.json",
1990
  "exists": true,
1991
+ "bytes": 7351,
1992
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1993
  }
1994
  },
1995
  "failures": []
 
2000
  "local": {
2001
  "path": "repo:docs/data/two_evidence_line_result_summary.json",
2002
  "exists": true,
2003
+ "bytes": 17416,
2004
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2005
  },
2006
  "mirrors": {
2007
  "hf_space": {
2008
  "path": "hf_space:data/two_evidence_line_result_summary.json",
2009
  "exists": true,
2010
+ "bytes": 17416,
2011
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2012
  },
2013
  "hf_artifacts_data": {
2014
  "path": "hf_artifacts:data/two_evidence_line_result_summary.json",
2015
  "exists": true,
2016
+ "bytes": 17416,
2017
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2018
  },
2019
  "hf_artifacts": {
2020
  "path": "hf_artifacts:docs/data/two_evidence_line_result_summary.json",
2021
  "exists": true,
2022
+ "bytes": 17416,
2023
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2024
  },
2025
  "hf_model_data": {
2026
  "path": "hf_model:data/two_evidence_line_result_summary.json",
2027
  "exists": true,
2028
+ "bytes": 17416,
2029
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2030
  },
2031
  "hf_model_docs_data": {
2032
  "path": "hf_model:docs/data/two_evidence_line_result_summary.json",
2033
  "exists": true,
2034
+ "bytes": 17416,
2035
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2036
  },
2037
  "hf_model": {
2038
  "path": "hf_model:metrics/two_evidence_line_result_summary.json",
2039
  "exists": true,
2040
+ "bytes": 17416,
2041
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2042
  }
2043
  },
2044
  "failures": []
 
2148
  "path": "repo:docs/data/task_surface_integrity.json",
2149
  "exists": true,
2150
  "bytes": 46246,
2151
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2152
  },
2153
  "mirrors": {
2154
  "hf_space": {
2155
  "path": "hf_space:data/task_surface_integrity.json",
2156
  "exists": true,
2157
  "bytes": 46246,
2158
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2159
  },
2160
  "hf_artifacts_data": {
2161
  "path": "hf_artifacts:data/task_surface_integrity.json",
2162
  "exists": true,
2163
  "bytes": 46246,
2164
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2165
  },
2166
  "hf_artifacts": {
2167
  "path": "hf_artifacts:docs/data/task_surface_integrity.json",
2168
  "exists": true,
2169
  "bytes": 46246,
2170
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2171
  },
2172
  "hf_model_data": {
2173
  "path": "hf_model:data/task_surface_integrity.json",
2174
  "exists": true,
2175
  "bytes": 46246,
2176
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2177
  },
2178
  "hf_model_docs_data": {
2179
  "path": "hf_model:docs/data/task_surface_integrity.json",
2180
  "exists": true,
2181
  "bytes": 46246,
2182
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2183
  },
2184
  "hf_model": {
2185
  "path": "hf_model:metrics/task_surface_integrity.json",
2186
  "exists": true,
2187
  "bytes": 46246,
2188
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2189
  }
2190
  },
2191
  "failures": []
 
2344
  "path": "repo:docs/data/task_method_20_source_audit.json",
2345
  "exists": true,
2346
  "bytes": 561,
2347
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2348
  },
2349
  "mirrors": {
2350
  "hf_space": {
2351
  "path": "hf_space:data/task_method_20_source_audit.json",
2352
  "exists": true,
2353
  "bytes": 561,
2354
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2355
  },
2356
  "hf_artifacts_data": {
2357
  "path": "hf_artifacts:data/task_method_20_source_audit.json",
2358
  "exists": true,
2359
  "bytes": 561,
2360
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2361
  },
2362
  "hf_artifacts": {
2363
  "path": "hf_artifacts:docs/data/task_method_20_source_audit.json",
2364
  "exists": true,
2365
  "bytes": 561,
2366
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2367
  },
2368
  "hf_model_data": {
2369
  "path": "hf_model:data/task_method_20_source_audit.json",
2370
  "exists": true,
2371
  "bytes": 561,
2372
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2373
  },
2374
  "hf_model_docs_data": {
2375
  "path": "hf_model:docs/data/task_method_20_source_audit.json",
2376
  "exists": true,
2377
  "bytes": 561,
2378
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2379
  },
2380
  "hf_model": {
2381
  "path": "hf_model:metrics/task_method_20_source_audit.json",
2382
  "exists": true,
2383
  "bytes": 561,
2384
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2385
  }
2386
  },
2387
  "failures": []
 
2491
  "path": "repo:docs/data/website_integrity.json",
2492
  "exists": true,
2493
  "bytes": 20658,
2494
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2495
  },
2496
  "mirrors": {
2497
  "hf_space": {
2498
  "path": "hf_space:data/website_integrity.json",
2499
  "exists": true,
2500
  "bytes": 20658,
2501
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2502
  },
2503
  "hf_artifacts_data": {
2504
  "path": "hf_artifacts:data/website_integrity.json",
2505
  "exists": true,
2506
  "bytes": 20658,
2507
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2508
  },
2509
  "hf_artifacts": {
2510
  "path": "hf_artifacts:docs/data/website_integrity.json",
2511
  "exists": true,
2512
  "bytes": 20658,
2513
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2514
  },
2515
  "hf_model_data": {
2516
  "path": "hf_model:data/website_integrity.json",
2517
  "exists": true,
2518
  "bytes": 20658,
2519
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2520
  },
2521
  "hf_model_docs_data": {
2522
  "path": "hf_model:docs/data/website_integrity.json",
2523
  "exists": true,
2524
  "bytes": 20658,
2525
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2526
  },
2527
  "hf_model": {
2528
  "path": "hf_model:metrics/website_integrity.json",
2529
  "exists": true,
2530
  "bytes": 20658,
2531
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2532
  }
2533
  },
2534
  "failures": []
 
5503
  "local": {
5504
  "path": "repo:scripts/build_multilingual_public_readmes.py",
5505
  "exists": true,
5506
+ "bytes": 51467,
5507
+ "sha256": "d4fd285ff1da0479f06fb4a15bd123142f8b72d752709bd7be4c3375dfb544d3"
5508
  },
5509
  "mirrors": {
5510
  "hf_artifacts": {
5511
  "path": "hf_artifacts:scripts/build_multilingual_public_readmes.py",
5512
  "exists": true,
5513
+ "bytes": 51467,
5514
+ "sha256": "d4fd285ff1da0479f06fb4a15bd123142f8b72d752709bd7be4c3375dfb544d3"
5515
  },
5516
  "hf_model": {
5517
  "path": "hf_model:scripts/build_multilingual_public_readmes.py",
5518
  "exists": true,
5519
+ "bytes": 51467,
5520
+ "sha256": "d4fd285ff1da0479f06fb4a15bd123142f8b72d752709bd7be4c3375dfb544d3"
5521
  }
5522
  },
5523
  "failures": []
 
6282
  "local": {
6283
  "path": "repo:docs/index.html",
6284
  "exists": true,
6285
+ "bytes": 300335,
6286
+ "sha256": "37a16c8dcb18740c2e0a2c816b3919a3b89a8eb2f59b2549d957a51a2c5c4c19"
6287
  },
6288
  "mirrors": {
6289
  "hf_space": {
6290
  "path": "hf_space:index.html",
6291
  "exists": true,
6292
+ "bytes": 300335,
6293
+ "sha256": "37a16c8dcb18740c2e0a2c816b3919a3b89a8eb2f59b2549d957a51a2c5c4c19"
6294
  },
6295
  "hf_artifacts_root": {
6296
  "path": "hf_artifacts:index.html",
6297
  "exists": true,
6298
+ "bytes": 300335,
6299
+ "sha256": "37a16c8dcb18740c2e0a2c816b3919a3b89a8eb2f59b2549d957a51a2c5c4c19"
6300
  },
6301
  "hf_artifacts_docs": {
6302
  "path": "hf_artifacts:docs/index.html",
6303
  "exists": true,
6304
+ "bytes": 300335,
6305
+ "sha256": "37a16c8dcb18740c2e0a2c816b3919a3b89a8eb2f59b2549d957a51a2c5c4c19"
6306
  },
6307
  "hf_model": {
6308
  "path": "hf_model:index.html",
6309
  "exists": true,
6310
+ "bytes": 300335,
6311
+ "sha256": "37a16c8dcb18740c2e0a2c816b3919a3b89a8eb2f59b2549d957a51a2c5c4c19"
6312
  },
6313
  "hf_model_docs": {
6314
  "path": "hf_model:docs/index.html",
6315
  "exists": true,
6316
+ "bytes": 300335,
6317
+ "sha256": "37a16c8dcb18740c2e0a2c816b3919a3b89a8eb2f59b2549d957a51a2c5c4c19"
6318
  }
6319
  },
6320
  "failures": []
 
31102
  "local": {
31103
  "path": "repo:TWO_EVIDENCE_LINES.md",
31104
  "exists": true,
31105
+ "bytes": 4325,
31106
+ "sha256": "db6768959591dc8501bf53f1c398f0db51027f66e1c3b15ceb25396d94705bfc"
31107
  },
31108
  "mirrors": {
31109
  "hf_space": {
31110
  "path": "hf_space:TWO_EVIDENCE_LINES.md",
31111
  "exists": true,
31112
+ "bytes": 4325,
31113
+ "sha256": "db6768959591dc8501bf53f1c398f0db51027f66e1c3b15ceb25396d94705bfc"
31114
  },
31115
  "hf_artifacts": {
31116
  "path": "hf_artifacts:TWO_EVIDENCE_LINES.md",
31117
  "exists": true,
31118
+ "bytes": 4325,
31119
+ "sha256": "db6768959591dc8501bf53f1c398f0db51027f66e1c3b15ceb25396d94705bfc"
31120
  },
31121
  "hf_model": {
31122
  "path": "hf_model:TWO_EVIDENCE_LINES.md",
31123
  "exists": true,
31124
+ "bytes": 4325,
31125
+ "sha256": "db6768959591dc8501bf53f1c398f0db51027f66e1c3b15ceb25396d94705bfc"
31126
  }
31127
  },
31128
  "failures": []
 
31133
  "local": {
31134
  "path": "repo:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31135
  "exists": true,
31136
+ "bytes": 8726,
31137
+ "sha256": "23a459e2903af252b1e99a8fe5bf63e3e282d7d126f76b54141700ab5fb07ca8"
31138
  },
31139
  "mirrors": {
31140
  "hf_space": {
31141
  "path": "hf_space:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31142
  "exists": true,
31143
+ "bytes": 8726,
31144
+ "sha256": "23a459e2903af252b1e99a8fe5bf63e3e282d7d126f76b54141700ab5fb07ca8"
31145
  },
31146
  "hf_artifacts": {
31147
  "path": "hf_artifacts:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31148
  "exists": true,
31149
+ "bytes": 8726,
31150
+ "sha256": "23a459e2903af252b1e99a8fe5bf63e3e282d7d126f76b54141700ab5fb07ca8"
31151
  },
31152
  "hf_model": {
31153
  "path": "hf_model:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31154
  "exists": true,
31155
+ "bytes": 8726,
31156
+ "sha256": "23a459e2903af252b1e99a8fe5bf63e3e282d7d126f76b54141700ab5fb07ca8"
31157
  }
31158
  },
31159
  "failures": []
 
31165
  "path": "repo:QWEN3_OMNI_RUN_LINEAGE.md",
31166
  "exists": true,
31167
  "bytes": 6287,
31168
+ "sha256": "4d0f01d7ea321b8f7d4c75f4ae00b3ba4a454b100b3238d320c00982959e71d0"
31169
  },
31170
  "mirrors": {
31171
  "hf_space": {
31172
  "path": "hf_space:QWEN3_OMNI_RUN_LINEAGE.md",
31173
  "exists": true,
31174
  "bytes": 6287,
31175
+ "sha256": "4d0f01d7ea321b8f7d4c75f4ae00b3ba4a454b100b3238d320c00982959e71d0"
31176
  },
31177
  "hf_artifacts": {
31178
  "path": "hf_artifacts:QWEN3_OMNI_RUN_LINEAGE.md",
31179
  "exists": true,
31180
  "bytes": 6287,
31181
+ "sha256": "4d0f01d7ea321b8f7d4c75f4ae00b3ba4a454b100b3238d320c00982959e71d0"
31182
  },
31183
  "hf_model": {
31184
  "path": "hf_model:QWEN3_OMNI_RUN_LINEAGE.md",
31185
  "exists": true,
31186
  "bytes": 6287,
31187
+ "sha256": "4d0f01d7ea321b8f7d4c75f4ae00b3ba4a454b100b3238d320c00982959e71d0"
31188
  }
31189
  },
31190
  "failures": []
 
31195
  "local": {
31196
  "path": "repo:PROJECT_BRIEF.md",
31197
  "exists": true,
31198
+ "bytes": 4817,
31199
+ "sha256": "4bf611b5ab6959424d8a31ff6ccbaa460bb97496a59b6405830a3878036e49cb"
31200
  },
31201
  "mirrors": {
31202
  "hf_space": {
31203
  "path": "hf_space:PROJECT_BRIEF.md",
31204
  "exists": true,
31205
+ "bytes": 4817,
31206
+ "sha256": "4bf611b5ab6959424d8a31ff6ccbaa460bb97496a59b6405830a3878036e49cb"
31207
  },
31208
  "hf_artifacts": {
31209
  "path": "hf_artifacts:PROJECT_BRIEF.md",
31210
  "exists": true,
31211
+ "bytes": 4817,
31212
+ "sha256": "4bf611b5ab6959424d8a31ff6ccbaa460bb97496a59b6405830a3878036e49cb"
31213
  },
31214
  "hf_model": {
31215
  "path": "hf_model:PROJECT_BRIEF.md",
31216
  "exists": true,
31217
+ "bytes": 4817,
31218
+ "sha256": "4bf611b5ab6959424d8a31ff6ccbaa460bb97496a59b6405830a3878036e49cb"
31219
  }
31220
  },
31221
  "failures": []
 
31226
  "local": {
31227
  "path": "repo:PUBLIC_READER_MAP.md",
31228
  "exists": true,
31229
+ "bytes": 5497,
31230
+ "sha256": "332ddbeddd908794fdd2a7e18ee1c9d657e672844f8f65a305f147ffde065133"
31231
  },
31232
  "mirrors": {
31233
  "hf_space": {
31234
  "path": "hf_space:PUBLIC_READER_MAP.md",
31235
  "exists": true,
31236
+ "bytes": 5497,
31237
+ "sha256": "332ddbeddd908794fdd2a7e18ee1c9d657e672844f8f65a305f147ffde065133"
31238
  },
31239
  "hf_artifacts": {
31240
  "path": "hf_artifacts:PUBLIC_READER_MAP.md",
31241
  "exists": true,
31242
+ "bytes": 5497,
31243
+ "sha256": "332ddbeddd908794fdd2a7e18ee1c9d657e672844f8f65a305f147ffde065133"
31244
  },
31245
  "hf_model": {
31246
  "path": "hf_model:PUBLIC_READER_MAP.md",
31247
  "exists": true,
31248
+ "bytes": 5497,
31249
+ "sha256": "332ddbeddd908794fdd2a7e18ee1c9d657e672844f8f65a305f147ffde065133"
31250
  }
31251
  },
31252
  "failures": []
 
31319
  "local": {
31320
  "path": "repo:PROJECT_STATUS.md",
31321
  "exists": true,
31322
+ "bytes": 15977,
31323
+ "sha256": "6ce8ca736b0ad2f8fef463c62530294e414cfc0a0e4ed501907fbf1536de0aa9"
31324
  },
31325
  "mirrors": {
31326
  "hf_space": {
31327
  "path": "hf_space:PROJECT_STATUS.md",
31328
  "exists": true,
31329
+ "bytes": 15977,
31330
+ "sha256": "6ce8ca736b0ad2f8fef463c62530294e414cfc0a0e4ed501907fbf1536de0aa9"
31331
  },
31332
  "hf_artifacts": {
31333
  "path": "hf_artifacts:PROJECT_STATUS.md",
31334
  "exists": true,
31335
+ "bytes": 15977,
31336
+ "sha256": "6ce8ca736b0ad2f8fef463c62530294e414cfc0a0e4ed501907fbf1536de0aa9"
31337
  },
31338
  "hf_model": {
31339
  "path": "hf_model:PROJECT_STATUS.md",
31340
  "exists": true,
31341
+ "bytes": 15977,
31342
+ "sha256": "6ce8ca736b0ad2f8fef463c62530294e414cfc0a0e4ed501907fbf1536de0aa9"
31343
  }
31344
  },
31345
  "failures": []
data/project_status.json CHANGED
@@ -2,7 +2,7 @@
2
  "title": "Ropedia Xperience-10M Task Suite Project Status",
3
  "version": "2026-06-20",
4
  "decision": "public_sample_pipeline_verified_128_enhancement_qwen3_v6_cosmos_comparison",
5
- "research_positioning": "A research-engineering study that makes one public Xperience-10M sample episode inspectable, defines embodied-AI tasks over synchronized modalities, records baseline behavior, aligns simple/NN baselines to the selected 128-episode split, compares verified Qwen3-Omni and Cosmos3 packages as early cross-episode diagnostics, and now records a no-new-episode enhancement pack for pushing the current 128-episode suite harder.",
6
  "scope_boundary": {
7
  "validated_episode_count": 1,
8
  "aligned_frames": 5821,
@@ -259,6 +259,8 @@
259
  "area": "Qwen3-Omni fine-tuning",
260
  "status": "final_verified_diagnostic_result_json_target_met",
261
  "evidence": [
 
 
262
  "docs/data/omni_finetune_verified_result.json",
263
  "docs/data/qwen3_v5_v6_comparison.json",
264
  "results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md",
@@ -268,7 +270,7 @@
268
  "scripts/omni/audit_verified_omni_package.py",
269
  "scripts/omni/analyze_qwen3_omni_errors.py"
270
  ],
271
- "readout": "The selected 96/16/16 episode split now has a current v6 rank64/lr5e-5 public-safe held-out package with 34,269 exported windows, 4,032 test predictions, validation/audit summaries, and a public LoRA adapter repo. JSON validity is 99.90%, meeting the 98% target; transition accuracy is 98.98%, contact accuracy is 81.77%, object micro-F1 is 30.65%, next-action accuracy is 4.31%, and action/subtask metrics remain weak. v6 improves action macro-F1 and contact accuracy versus v5, but v5 remains stronger on JSON validity, subtask, next-action, transition, and object metrics."
272
  },
273
  {
274
  "area": "Cosmos3-Nano future-window package",
 
2
  "title": "Ropedia Xperience-10M Task Suite Project Status",
3
  "version": "2026-06-20",
4
  "decision": "public_sample_pipeline_verified_128_enhancement_qwen3_v6_cosmos_comparison",
5
+ "research_positioning": "A research-engineering study with two public evidence lines: Line 1 makes one public Xperience-10M sample episode inspectable and reproducible as a 20-task lab; Line 2 aligns selected 128-episode baselines with verified Qwen3-Omni v6, Cosmos3-Super, and Cosmos3-Nano diagnostics, then records a no-new-episode enhancement pack for pushing the 128-episode suite harder.",
6
  "scope_boundary": {
7
  "validated_episode_count": 1,
8
  "aligned_frames": 5821,
 
259
  "area": "Qwen3-Omni fine-tuning",
260
  "status": "final_verified_diagnostic_result_json_target_met",
261
  "evidence": [
262
+ "QWEN3_OMNI_RUN_LINEAGE.md",
263
+ "docs/data/qwen3_omni_run_lineage.json",
264
  "docs/data/omni_finetune_verified_result.json",
265
  "docs/data/qwen3_v5_v6_comparison.json",
266
  "results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md",
 
270
  "scripts/omni/audit_verified_omni_package.py",
271
  "scripts/omni/analyze_qwen3_omni_errors.py"
272
  ],
273
+ "readout": "Qwen3-Omni v1-v6 are one selected-128 run lineage, not six project evidence lines. v1-v4 harden the pipeline and record ablations, v5 is the pinned prior multiscale release, and v6 is the current public 20-task Qwen row. The v6 rank64/lr5e-5 public-safe held-out package has 34,269 exported windows, 4,032 test predictions, validation/audit summaries, and a public LoRA adapter repo. JSON validity is 99.90%, meeting the 98% target; transition accuracy is 98.98%, contact accuracy is 81.77%, object micro-F1 is 30.65%, next-action accuracy is 4.31%, and action/subtask metrics remain weak. v6 improves action macro-F1 and contact accuracy versus v5, but v5 remains stronger on JSON validity, subtask, next-action, transition, and object metrics."
274
  },
275
  {
276
  "area": "Cosmos3-Nano future-window package",
data/publication_audit.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T11:07:41+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
@@ -255,7 +255,7 @@
255
  "hf_artifact_bundle": {
256
  "root": "hf_publish/artifacts",
257
  "exists": true,
258
- "file_count": 3049,
259
  "text_file_count": 1283,
260
  "largest_file": {
261
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
@@ -266,7 +266,7 @@
266
  "hf_model_bundle": {
267
  "root": "hf_publish/model",
268
  "exists": true,
269
- "file_count": 3533,
270
  "text_file_count": 1455,
271
  "largest_file": {
272
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T11:49:39+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
 
255
  "hf_artifact_bundle": {
256
  "root": "hf_publish/artifacts",
257
  "exists": true,
258
+ "file_count": 4519,
259
  "text_file_count": 1283,
260
  "largest_file": {
261
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
 
266
  "hf_model_bundle": {
267
  "root": "hf_publish/model",
268
  "exists": true,
269
+ "file_count": 5281,
270
  "text_file_count": 1455,
271
  "largest_file": {
272
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
data/scope_claims_audit.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T11:08:09+00:00",
4
  "summary": {
5
  "qwen3_omni_verified_diagnostic_pilot": true,
6
  "dataset_manifest_num_episodes": 119,
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T11:49:59+00:00",
4
  "summary": {
5
  "qwen3_omni_verified_diagnostic_pilot": true,
6
  "dataset_manifest_num_episodes": 119,
data/two_evidence_line_result_summary.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "generated_at_utc": "2026-06-21T10:47:04+00:00",
3
  "interpretation_rule": "Use the 1-episode line for task construction and reproducibility claims. Use the 128-episode line for same-split metadata/raw baselines, Qwen3-Omni v6 LoRA diagnostics, and Cosmos3 diagnostics.",
4
  "lines": [
5
  {
@@ -337,7 +337,7 @@
337
  "selected_128_episode_surface": "Use for held-out comparison, metadata/raw-feature baselines, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, Cosmos3-Nano Future Window, and scale-up decisions.",
338
  "single_public_sample_episode": "Use for task construction, raw-file inspection, local reproducibility, and controlled Minimal-vs-Neural baseline behavior."
339
  },
340
- "reader_summary": "The suite has two public result lines. Line 1 is the fully inspectable one-episode task lab. Line 2 is the 128-episode comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Do not mix the two when reading scores.",
341
  "reading_order": [
342
  {
343
  "reason": "Line 1 answers task-lab and reproducibility questions; line 2 answers selected-128 comparison questions.",
 
1
  {
2
+ "generated_at_utc": "2026-06-21T11:49:06+00:00",
3
  "interpretation_rule": "Use the 1-episode line for task construction and reproducibility claims. Use the 128-episode line for same-split metadata/raw baselines, Qwen3-Omni v6 LoRA diagnostics, and Cosmos3 diagnostics.",
4
  "lines": [
5
  {
 
337
  "selected_128_episode_surface": "Use for held-out comparison, metadata/raw-feature baselines, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, Cosmos3-Nano Future Window, and scale-up decisions.",
338
  "single_public_sample_episode": "Use for task construction, raw-file inspection, local reproducibility, and controlled Minimal-vs-Neural baseline behavior."
339
  },
340
+ "reader_summary": "The suite has two public evidence lines. Line 1 is the fully inspectable one-episode task lab. Line 2 is the 128-episode comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Do not mix the two when reading scores.",
341
  "reading_order": [
342
  {
343
  "reason": "Line 1 answers task-lab and reproducibility questions; line 2 answers selected-128 comparison questions.",
docs/data/public_reader_map.json CHANGED
@@ -8,6 +8,11 @@
8
  "start_here": "PROJECT_BRIEF.md",
9
  "then_inspect": ["PROJECT_STATUS.md", "RESEARCH_TAKEAWAYS.md"]
10
  },
 
 
 
 
 
11
  {
12
  "reader_goal": "See the visual public dashboard",
13
  "start_here": "GitHub Pages dashboard or Hugging Face Space",
@@ -38,6 +43,11 @@
38
  "start_here": "Homepage radar section",
39
  "then_inspect": ["docs/data/single_episode_task_model_radar.json", "docs/data/episode128_task_model_radar.json"]
40
  },
 
 
 
 
 
41
  {
42
  "reader_goal": "Find all derived artifacts",
43
  "start_here": "ARTIFACT_GUIDE.md",
@@ -101,11 +111,11 @@
101
  "best_use": "Inspecting Qwen3-Omni and Cosmos3 artifacts."
102
  }
103
  ],
104
- "evidence_layers": [
105
  "Dataset/source boundary",
106
  "Data contract",
107
  "Task suite",
108
- "Results",
109
  "Foundation directions",
110
  "Public-release checks"
111
  ],
@@ -116,10 +126,15 @@
116
  "boundary": "Describes one public sample episode, not the full dataset distribution."
117
  },
118
  {
119
- "claim_type": "128-episode baseline comparison",
120
  "public_evidence": ["XPERIENCE10M_128_EPISODE_FEATURE_INDEX.md", "docs/data/xperience10m_128_episode_feature_index.json", "results/omni_finetune/*128*", "docs/data/omni_model_comparison.json"],
121
  "boundary": "Uses selected held-out episodes and derived public-safe summaries; official raw files remain gated upstream."
122
  },
 
 
 
 
 
123
  {
124
  "claim_type": "Foundation-model track quality",
125
  "public_evidence": ["Verified Qwen3-Omni and Cosmos3 result packages", "model cards"],
 
8
  "start_here": "PROJECT_BRIEF.md",
9
  "then_inspect": ["PROJECT_STATUS.md", "RESEARCH_TAKEAWAYS.md"]
10
  },
11
+ {
12
+ "reader_goal": "Understand the two evidence lines",
13
+ "start_here": "TWO_EVIDENCE_LINES.md",
14
+ "then_inspect": ["docs/data/two_evidence_lines.json", "docs/data/two_evidence_line_result_summary.json"]
15
+ },
16
  {
17
  "reader_goal": "See the visual public dashboard",
18
  "start_here": "GitHub Pages dashboard or Hugging Face Space",
 
43
  "start_here": "Homepage radar section",
44
  "then_inspect": ["docs/data/single_episode_task_model_radar.json", "docs/data/episode128_task_model_radar.json"]
45
  },
46
+ {
47
+ "reader_goal": "Read Qwen3-Omni v1-v6 correctly",
48
+ "start_here": "QWEN3_OMNI_RUN_LINEAGE.md",
49
+ "then_inspect": ["docs/data/qwen3_omni_run_lineage.json", "docs/data/qwen3_v5_v6_comparison.json"]
50
+ },
51
  {
52
  "reader_goal": "Find all derived artifacts",
53
  "start_here": "ARTIFACT_GUIDE.md",
 
111
  "best_use": "Inspecting Qwen3-Omni and Cosmos3 artifacts."
112
  }
113
  ],
114
+ "evidence_views": [
115
  "Dataset/source boundary",
116
  "Data contract",
117
  "Task suite",
118
+ "Results by Line 1 and Line 2",
119
  "Foundation directions",
120
  "Public-release checks"
121
  ],
 
126
  "boundary": "Describes one public sample episode, not the full dataset distribution."
127
  },
128
  {
129
+ "claim_type": "128-episode method comparison",
130
  "public_evidence": ["XPERIENCE10M_128_EPISODE_FEATURE_INDEX.md", "docs/data/xperience10m_128_episode_feature_index.json", "results/omni_finetune/*128*", "docs/data/omni_model_comparison.json"],
131
  "boundary": "Uses selected held-out episodes and derived public-safe summaries; official raw files remain gated upstream."
132
  },
133
+ {
134
+ "claim_type": "Qwen3-Omni v1-v6 lineage",
135
+ "public_evidence": ["QWEN3_OMNI_RUN_LINEAGE.md", "docs/data/qwen3_omni_run_lineage.json"],
136
+ "boundary": "v1-v4 are pipeline and ablation evidence, v5 is the pinned prior release, and v6 is the current public 20-task Qwen row."
137
+ },
138
  {
139
  "claim_type": "Foundation-model track quality",
140
  "public_evidence": ["Verified Qwen3-Omni and Cosmos3 result packages", "model cards"],
docs/data/public_surface_qa.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Public Project Surface",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T11:08:07+00:00",
5
  "scope": "Repo README, GitHub Pages HTML, Hugging Face Space card, artifact dataset card, and model card.",
6
  "checks": [
7
  {
@@ -18,7 +18,7 @@
18
  "website_integrity": {
19
  "exists": true,
20
  "status": "pass",
21
- "generated_at_utc": "2026-06-21T11:07:26+00:00"
22
  },
23
  "rendered_site_check": {
24
  "exists": true,
@@ -28,27 +28,27 @@
28
  "task_surface_integrity": {
29
  "exists": true,
30
  "status": "pass",
31
- "generated_at_utc": "2026-06-21T11:04:16+00:00"
32
  },
33
  "source_alignment": {
34
  "exists": true,
35
  "status": "pass",
36
- "generated_at_utc": "2026-06-21T11:04:16+00:00"
37
  },
38
  "scale_up_status": {
39
  "exists": true,
40
  "status": "pass",
41
- "generated_at_utc": "2026-06-21T11:03:20+00:00"
42
  },
43
  "publication_package": {
44
  "exists": true,
45
  "status": "pass",
46
- "generated_at_utc": "2026-06-21T11:07:41+00:00"
47
  },
48
  "mirror_parity": {
49
  "exists": true,
50
  "status": "pass",
51
- "generated_at_utc": "2026-06-21T11:05:04+00:00"
52
  }
53
  },
54
  "failures": {}
@@ -98,7 +98,7 @@
98
  "Ropedia Xperience-10M Task Suite": 20,
99
  "Xperience-10M": 166,
100
  "20-task": 89,
101
- "Qwen3-Omni": 241,
102
  "128-episode pilot": 1
103
  }
104
  },
 
1
  {
2
  "title": "Ropedia Xperience-10M Public Project Surface",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T11:49:55+00:00",
5
  "scope": "Repo README, GitHub Pages HTML, Hugging Face Space card, artifact dataset card, and model card.",
6
  "checks": [
7
  {
 
18
  "website_integrity": {
19
  "exists": true,
20
  "status": "pass",
21
+ "generated_at_utc": "2026-06-21T11:49:24+00:00"
22
  },
23
  "rendered_site_check": {
24
  "exists": true,
 
28
  "task_surface_integrity": {
29
  "exists": true,
30
  "status": "pass",
31
+ "generated_at_utc": "2026-06-21T11:08:07+00:00"
32
  },
33
  "source_alignment": {
34
  "exists": true,
35
  "status": "pass",
36
+ "generated_at_utc": "2026-06-21T11:49:23+00:00"
37
  },
38
  "scale_up_status": {
39
  "exists": true,
40
  "status": "pass",
41
+ "generated_at_utc": "2026-06-21T11:08:09+00:00"
42
  },
43
  "publication_package": {
44
  "exists": true,
45
  "status": "pass",
46
+ "generated_at_utc": "2026-06-21T11:49:39+00:00"
47
  },
48
  "mirror_parity": {
49
  "exists": true,
50
  "status": "pass",
51
+ "generated_at_utc": "2026-06-21T11:28:34+00:00"
52
  }
53
  },
54
  "failures": {}
 
98
  "Ropedia Xperience-10M Task Suite": 20,
99
  "Xperience-10M": 166,
100
  "20-task": 89,
101
+ "Qwen3-Omni": 242,
102
  "128-episode pilot": 1
103
  }
104
  },
docs/data/quality_gates.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T11:28:42+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
 
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T11:52:46+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
docs/data/task_method_20_source_audit.json CHANGED
@@ -2,7 +2,7 @@
2
  "checked_json_metric_count": 180,
3
  "failure_count": 0,
4
  "failures": [],
5
- "generated_at_utc": "2026-06-21T11:07:42+00:00",
6
  "method_task_record_count": 180,
7
  "rule": "Every scored row that declares a JSON metric source must have the same numeric value under that row's metric_key.",
8
  "scored_method_task_count": 180,
 
2
  "checked_json_metric_count": 180,
3
  "failure_count": 0,
4
  "failures": [],
5
+ "generated_at_utc": "2026-06-21T11:49:23+00:00",
6
  "method_task_record_count": 180,
7
  "rule": "Every scored row that declares a JSON metric source must have the same numeric value under that row's metric_key.",
8
  "scored_method_task_count": 180,
docs/data/task_surface_integrity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T11:08:07+00:00",
4
  "summary": {
5
  "original_walkthrough_task_count": 12,
6
  "expected_original_walkthrough_task_count": 12,
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T11:49:55+00:00",
4
  "summary": {
5
  "original_walkthrough_task_count": 12,
6
  "expected_original_walkthrough_task_count": 12,
docs/data/two_evidence_line_result_summary.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "generated_at_utc": "2026-06-21T10:47:04+00:00",
3
  "interpretation_rule": "Use the 1-episode line for task construction and reproducibility claims. Use the 128-episode line for same-split metadata/raw baselines, Qwen3-Omni v6 LoRA diagnostics, and Cosmos3 diagnostics.",
4
  "lines": [
5
  {
@@ -337,7 +337,7 @@
337
  "selected_128_episode_surface": "Use for held-out comparison, metadata/raw-feature baselines, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, Cosmos3-Nano Future Window, and scale-up decisions.",
338
  "single_public_sample_episode": "Use for task construction, raw-file inspection, local reproducibility, and controlled Minimal-vs-Neural baseline behavior."
339
  },
340
- "reader_summary": "The suite has two public result lines. Line 1 is the fully inspectable one-episode task lab. Line 2 is the 128-episode comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Do not mix the two when reading scores.",
341
  "reading_order": [
342
  {
343
  "reason": "Line 1 answers task-lab and reproducibility questions; line 2 answers selected-128 comparison questions.",
 
1
  {
2
+ "generated_at_utc": "2026-06-21T11:49:06+00:00",
3
  "interpretation_rule": "Use the 1-episode line for task construction and reproducibility claims. Use the 128-episode line for same-split metadata/raw baselines, Qwen3-Omni v6 LoRA diagnostics, and Cosmos3 diagnostics.",
4
  "lines": [
5
  {
 
337
  "selected_128_episode_surface": "Use for held-out comparison, metadata/raw-feature baselines, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, Cosmos3-Nano Future Window, and scale-up decisions.",
338
  "single_public_sample_episode": "Use for task construction, raw-file inspection, local reproducibility, and controlled Minimal-vs-Neural baseline behavior."
339
  },
340
+ "reader_summary": "The suite has two public evidence lines. Line 1 is the fully inspectable one-episode task lab. Line 2 is the 128-episode comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Do not mix the two when reading scores.",
341
  "reading_order": [
342
  {
343
  "reason": "Line 1 answers task-lab and reproducibility questions; line 2 answers selected-128 comparison questions.",
docs/data/two_evidence_lines.json CHANGED
@@ -2,7 +2,7 @@
2
  "status": "current",
3
  "updated_utc": "2026-06-21T00:00:00Z",
4
  "interpretation_rule": "Use the 1-episode line for task construction and reproducibility claims. Use the 128-episode line for same-split metadata/raw baselines, Qwen3-Omni v6 LoRA diagnostics, and Cosmos3 diagnostics.",
5
- "reader_summary": "The suite has two public result lines. Line 1 is the fully inspectable one-episode task lab. Line 2 is the 128-episode comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Do not mix the two when reading scores.",
6
  "score_formula": "2 single-episode methods x 20 tasks = 40 records; 7 selected-128 methods x 20 tasks = 140 records; total public matrix = 180/180 scored records.",
7
  "lines": [
8
  {
 
2
  "status": "current",
3
  "updated_utc": "2026-06-21T00:00:00Z",
4
  "interpretation_rule": "Use the 1-episode line for task construction and reproducibility claims. Use the 128-episode line for same-split metadata/raw baselines, Qwen3-Omni v6 LoRA diagnostics, and Cosmos3 diagnostics.",
5
+ "reader_summary": "The suite has two public evidence lines. Line 1 is the fully inspectable one-episode task lab. Line 2 is the 128-episode comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Do not mix the two when reading scores.",
6
  "score_formula": "2 single-episode methods x 20 tasks = 40 records; 7 selected-128 methods x 20 tasks = 140 records; total public matrix = 180/180 scored records.",
7
  "lines": [
8
  {
docs/data/website_integrity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T11:07:26+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
@@ -80,8 +80,8 @@
80
  "name": "project_overview_precedes_progress_ledger",
81
  "status": "pass",
82
  "reason": "The project overview should appear before the deeper progress ledger.",
83
- "overview_index": 118524,
84
- "evidence_index": 163802
85
  },
86
  {
87
  "name": "project_status_links_json",
@@ -159,9 +159,9 @@
159
  "name": "evaluation_protocol_between_overview_and_progress",
160
  "status": "pass",
161
  "reason": "The evaluation protocol should appear before the deeper evidence ledger.",
162
- "overview_index": 118524,
163
- "protocol_index": 159990,
164
- "evidence_index": 163802
165
  },
166
  {
167
  "name": "evaluation_protocol_links_json",
@@ -386,12 +386,12 @@
386
  },
387
  {
388
  "path": "data/project_status.json",
389
- "bytes": 23054,
390
  "top_level_type": "dict"
391
  },
392
  {
393
  "path": "data/public_reader_map.json",
394
- "bytes": 5990,
395
  "top_level_type": "dict"
396
  },
397
  {
@@ -536,12 +536,12 @@
536
  },
537
  {
538
  "path": "data/two_evidence_line_result_summary.json",
539
- "bytes": 17414,
540
  "top_level_type": "dict"
541
  },
542
  {
543
  "path": "data/two_evidence_lines.json",
544
- "bytes": 7349,
545
  "top_level_type": "dict"
546
  },
547
  {
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T11:49:24+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
 
80
  "name": "project_overview_precedes_progress_ledger",
81
  "status": "pass",
82
  "reason": "The project overview should appear before the deeper progress ledger.",
83
+ "overview_index": 118530,
84
+ "evidence_index": 163804
85
  },
86
  {
87
  "name": "project_status_links_json",
 
159
  "name": "evaluation_protocol_between_overview_and_progress",
160
  "status": "pass",
161
  "reason": "The evaluation protocol should appear before the deeper evidence ledger.",
162
+ "overview_index": 118530,
163
+ "protocol_index": 159992,
164
+ "evidence_index": 163804
165
  },
166
  {
167
  "name": "evaluation_protocol_links_json",
 
386
  },
387
  {
388
  "path": "data/project_status.json",
389
+ "bytes": 23255,
390
  "top_level_type": "dict"
391
  },
392
  {
393
  "path": "data/public_reader_map.json",
394
+ "bytes": 6757,
395
  "top_level_type": "dict"
396
  },
397
  {
 
536
  },
537
  {
538
  "path": "data/two_evidence_line_result_summary.json",
539
+ "bytes": 17416,
540
  "top_level_type": "dict"
541
  },
542
  {
543
  "path": "data/two_evidence_lines.json",
544
+ "bytes": 7351,
545
  "top_level_type": "dict"
546
  },
547
  {
docs/index.html CHANGED
@@ -3892,7 +3892,7 @@
3892
  <div class="eyebrow">two evidence lines / 180 scored records</div>
3893
  <h1>Ropedia Xperience-10M Task Suite.</h1>
3894
  <p class="hero-copy">
3895
- The public suite has two result lines. Line 1 uses one public sample
3896
  episode to make the 20-task lab inspectable and reproducible. Line 2
3897
  uses 128 selected episodes to compare aligned baselines, Qwen3-Omni
3898
  v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future Window. The public matrix is complete at 180/180 scored
@@ -3922,7 +3922,7 @@
3922
  </article>
3923
  <article class="suite-line-card">
3924
  <small>line 2 / 128 selected episodes</small>
3925
- <h3>128 selected episodes: comparison layer</h3>
3926
  <p>Seven methods share the selected-episode surface and the same 20 task axes.</p>
3927
  <div class="line-claim">
3928
  <div><span>valid claim</span><p>Same-split method comparison and scale-up planning.</p></div>
@@ -4045,7 +4045,7 @@
4045
  <div class="wrap section-tabs" id="sectionTabs" role="tablist" aria-label="Sections inside the selected project tab"></div>
4046
  <div class="wrap section-orientation" id="sectionOrientation" aria-live="polite">
4047
  <strong>You are in: Start / Project Overview</strong>
4048
- <span>Use the section pills to move within this layer, or jump to the full reader map.</span>
4049
  <a href="#reader-map">Reader map</a>
4050
  </div>
4051
  </div>
@@ -4186,7 +4186,7 @@
4186
  </tr>
4187
  </tbody>
4188
  </table>
4189
- <p class="table-note">Qwen v1-v6 are run-lineage labels inside the selected-128 evidence line, not project-level result lines. Use v6 for the public 20-task Qwen3-Omni row; keep v5 as the pinned prior multiscale comparator; read v1-v4 as pipeline-hardening and ablation evidence. Full details: <a href="data/qwen3_omni_run_lineage.json">qwen3_omni_run_lineage.json</a> and <a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/blob/main/QWEN3_OMNI_RUN_LINEAGE.md">QWEN3_OMNI_RUN_LINEAGE.md</a>.</p>
4190
  <div class="reader-journey" aria-label="Recommended reader journeys">
4191
  <article class="reader-step">
4192
  <small>01 understand</small>
@@ -5679,7 +5679,7 @@
5679
  <p>Use these files to navigate the whole project, open the published mirrors, or reproduce the public-sample pipeline.</p>
5680
  </div>
5681
  <div class="artifact-grid">
5682
- <article class="artifact primary-artifact"><div><h3>Public reader map</h3><p>Single navigation layer for GitHub, GitHub Pages, HF Space, artifact dataset, baseline model repo, Qwen3-Omni/Cosmos3 repos, and public claim boundaries.</p></div><a href="data/public_reader_map.json">reader map</a></article>
5683
  <article class="artifact primary-artifact"><div><h3>Artifact guide</h3><p>Human-readable map from project scope to data contract, task evidence, platform mirrors, and scale-up status.</p></div><a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/blob/main/ARTIFACT_GUIDE.md">artifact guide</a></article>
5684
  <article class="artifact"><h3>Reproduction scripts</h3><p>Training, visualization, taxonomy, walkthrough, validator, and omni-readiness scripts.</p><a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/tree/main/scripts">scripts/</a></article>
5685
  <article class="artifact"><h3>Hugging Face Space</h3><p>The dashboard packaged as a public static Space.</p><a href="https://huggingface.co/spaces/cy0307/ropedia-xperience-10m-task-suite">HF Space</a></article>
@@ -5761,7 +5761,7 @@
5761
  <article class="artifact"><h3>Reproducibility matrix</h3><p>Machine-readable command matrix covering sample download, baselines, the unified 20-task suite, figures, and validation.</p><a href="data/reproducibility_matrix.json">reproducibility matrix</a></article>
5762
  <article class="artifact"><h3>Exact-match reproduction record</h3><p>The last metric rebuild reproduced the public-sample outputs from a fresh cache and matched the committed metrics.</p><a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/blob/main/notes/reproducibility_audit.md">reproduction audit</a></article>
5763
  <article class="artifact"><h3>Project dashboard</h3><p>The website organizes the dataset sample, tasks, methods, results, directions, and scale-up path in one tabbed reader flow.</p><a href="#artifacts">project materials</a></article>
5764
- <article class="artifact"><h3>Line 2 model status</h3><p>The comparison JSON groups selected-128 baselines, Qwen3-Omni v6 LoRA, Cosmos3-Nano Future Window, and Cosmos3-Super Reasoner. Qwen v5/v6 detail stays in a separate lineage audit.</p><a href="data/omni_model_comparison.json">comparison</a><a href="data/qwen3_v5_v6_comparison.json">Qwen v5/v6</a></article>
5765
  </div>
5766
  <p class="repro-note">Minimal path: install the toolkit dependencies, download the official sample, run the task suite with neural heads, regenerate tasks 13-20, build the unified 20-task index, regenerate visualizations, then rebuild the supporting project reports.</p>
5767
  <pre class="code-panel"><button type="button" data-copy="setup">Copy</button><code id="setup">git clone https://github.com/Ropedia/HOMIE-toolkit.git
@@ -6132,7 +6132,7 @@ python scripts/validate_publication_package.py</code></pre>
6132
  if (!sectionOrientation) return;
6133
  const tabLabel = tabLabels[tabKey] || tabKey;
6134
  const sectionLabel = sectionLabels[sectionId] || labelForSection(document.getElementById(sectionId));
6135
- const description = sectionDescriptions[sectionId] || "Use the section pills to move within this layer without losing the evidence trail.";
6136
  sectionOrientation.innerHTML = `
6137
  <strong>You are in: ${tabLabel} / ${sectionLabel}</strong>
6138
  <span>${description}</span>
 
3892
  <div class="eyebrow">two evidence lines / 180 scored records</div>
3893
  <h1>Ropedia Xperience-10M Task Suite.</h1>
3894
  <p class="hero-copy">
3895
+ The public suite has two evidence lines. Line 1 uses one public sample
3896
  episode to make the 20-task lab inspectable and reproducible. Line 2
3897
  uses 128 selected episodes to compare aligned baselines, Qwen3-Omni
3898
  v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future Window. The public matrix is complete at 180/180 scored
 
3922
  </article>
3923
  <article class="suite-line-card">
3924
  <small>line 2 / 128 selected episodes</small>
3925
+ <h3>128 selected episodes: comparison surface</h3>
3926
  <p>Seven methods share the selected-episode surface and the same 20 task axes.</p>
3927
  <div class="line-claim">
3928
  <div><span>valid claim</span><p>Same-split method comparison and scale-up planning.</p></div>
 
4045
  <div class="wrap section-tabs" id="sectionTabs" role="tablist" aria-label="Sections inside the selected project tab"></div>
4046
  <div class="wrap section-orientation" id="sectionOrientation" aria-live="polite">
4047
  <strong>You are in: Start / Project Overview</strong>
4048
+ <span>Use the section pills to move within this section, or jump to the full reader map.</span>
4049
  <a href="#reader-map">Reader map</a>
4050
  </div>
4051
  </div>
 
4186
  </tr>
4187
  </tbody>
4188
  </table>
4189
+ <p class="table-note">Qwen v1-v6 are run-lineage labels inside the selected-128 evidence line, not project evidence lines. Use v6 for the public 20-task Qwen3-Omni row; keep v5 as the pinned prior multiscale comparator; read v1-v4 as pipeline-hardening and ablation evidence. Full details: <a href="data/qwen3_omni_run_lineage.json">qwen3_omni_run_lineage.json</a> and <a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/blob/main/QWEN3_OMNI_RUN_LINEAGE.md">QWEN3_OMNI_RUN_LINEAGE.md</a>.</p>
4190
  <div class="reader-journey" aria-label="Recommended reader journeys">
4191
  <article class="reader-step">
4192
  <small>01 understand</small>
 
5679
  <p>Use these files to navigate the whole project, open the published mirrors, or reproduce the public-sample pipeline.</p>
5680
  </div>
5681
  <div class="artifact-grid">
5682
+ <article class="artifact primary-artifact"><div><h3>Public reader map</h3><p>Single navigation view for GitHub, GitHub Pages, HF Space, artifact dataset, baseline model repo, Qwen3-Omni/Cosmos3 repos, and public claim boundaries.</p></div><a href="data/public_reader_map.json">reader map</a></article>
5683
  <article class="artifact primary-artifact"><div><h3>Artifact guide</h3><p>Human-readable map from project scope to data contract, task evidence, platform mirrors, and scale-up status.</p></div><a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/blob/main/ARTIFACT_GUIDE.md">artifact guide</a></article>
5684
  <article class="artifact"><h3>Reproduction scripts</h3><p>Training, visualization, taxonomy, walkthrough, validator, and omni-readiness scripts.</p><a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/tree/main/scripts">scripts/</a></article>
5685
  <article class="artifact"><h3>Hugging Face Space</h3><p>The dashboard packaged as a public static Space.</p><a href="https://huggingface.co/spaces/cy0307/ropedia-xperience-10m-task-suite">HF Space</a></article>
 
5761
  <article class="artifact"><h3>Reproducibility matrix</h3><p>Machine-readable command matrix covering sample download, baselines, the unified 20-task suite, figures, and validation.</p><a href="data/reproducibility_matrix.json">reproducibility matrix</a></article>
5762
  <article class="artifact"><h3>Exact-match reproduction record</h3><p>The last metric rebuild reproduced the public-sample outputs from a fresh cache and matched the committed metrics.</p><a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/blob/main/notes/reproducibility_audit.md">reproduction audit</a></article>
5763
  <article class="artifact"><h3>Project dashboard</h3><p>The website organizes the dataset sample, tasks, methods, results, directions, and scale-up path in one tabbed reader flow.</p><a href="#artifacts">project materials</a></article>
5764
+ <article class="artifact"><h3>Line 2 model status</h3><p>The comparison JSON groups selected-128 baselines, Qwen3-Omni v6 LoRA, Cosmos3-Nano Future Window, and Cosmos3-Super Reasoner. Full Qwen v1-v6 detail stays in a separate lineage audit.</p><a href="data/omni_model_comparison.json">comparison</a><a href="data/qwen3_omni_run_lineage.json">Qwen v1-v6</a></article>
5765
  </div>
5766
  <p class="repro-note">Minimal path: install the toolkit dependencies, download the official sample, run the task suite with neural heads, regenerate tasks 13-20, build the unified 20-task index, regenerate visualizations, then rebuild the supporting project reports.</p>
5767
  <pre class="code-panel"><button type="button" data-copy="setup">Copy</button><code id="setup">git clone https://github.com/Ropedia/HOMIE-toolkit.git
 
6132
  if (!sectionOrientation) return;
6133
  const tabLabel = tabLabels[tabKey] || tabKey;
6134
  const sectionLabel = sectionLabels[sectionId] || labelForSection(document.getElementById(sectionId));
6135
+ const description = sectionDescriptions[sectionId] || "Use the section pills to move within this section without losing the evidence trail.";
6136
  sectionOrientation.innerHTML = `
6137
  <strong>You are in: ${tabLabel} / ${sectionLabel}</strong>
6138
  <span>${description}</span>
index.html CHANGED
@@ -3892,7 +3892,7 @@
3892
  <div class="eyebrow">two evidence lines / 180 scored records</div>
3893
  <h1>Ropedia Xperience-10M Task Suite.</h1>
3894
  <p class="hero-copy">
3895
- The public suite has two result lines. Line 1 uses one public sample
3896
  episode to make the 20-task lab inspectable and reproducible. Line 2
3897
  uses 128 selected episodes to compare aligned baselines, Qwen3-Omni
3898
  v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future Window. The public matrix is complete at 180/180 scored
@@ -3922,7 +3922,7 @@
3922
  </article>
3923
  <article class="suite-line-card">
3924
  <small>line 2 / 128 selected episodes</small>
3925
- <h3>128 selected episodes: comparison layer</h3>
3926
  <p>Seven methods share the selected-episode surface and the same 20 task axes.</p>
3927
  <div class="line-claim">
3928
  <div><span>valid claim</span><p>Same-split method comparison and scale-up planning.</p></div>
@@ -4045,7 +4045,7 @@
4045
  <div class="wrap section-tabs" id="sectionTabs" role="tablist" aria-label="Sections inside the selected project tab"></div>
4046
  <div class="wrap section-orientation" id="sectionOrientation" aria-live="polite">
4047
  <strong>You are in: Start / Project Overview</strong>
4048
- <span>Use the section pills to move within this layer, or jump to the full reader map.</span>
4049
  <a href="#reader-map">Reader map</a>
4050
  </div>
4051
  </div>
@@ -4186,7 +4186,7 @@
4186
  </tr>
4187
  </tbody>
4188
  </table>
4189
- <p class="table-note">Qwen v1-v6 are run-lineage labels inside the selected-128 evidence line, not project-level result lines. Use v6 for the public 20-task Qwen3-Omni row; keep v5 as the pinned prior multiscale comparator; read v1-v4 as pipeline-hardening and ablation evidence. Full details: <a href="data/qwen3_omni_run_lineage.json">qwen3_omni_run_lineage.json</a> and <a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/blob/main/QWEN3_OMNI_RUN_LINEAGE.md">QWEN3_OMNI_RUN_LINEAGE.md</a>.</p>
4190
  <div class="reader-journey" aria-label="Recommended reader journeys">
4191
  <article class="reader-step">
4192
  <small>01 understand</small>
@@ -5679,7 +5679,7 @@
5679
  <p>Use these files to navigate the whole project, open the published mirrors, or reproduce the public-sample pipeline.</p>
5680
  </div>
5681
  <div class="artifact-grid">
5682
- <article class="artifact primary-artifact"><div><h3>Public reader map</h3><p>Single navigation layer for GitHub, GitHub Pages, HF Space, artifact dataset, baseline model repo, Qwen3-Omni/Cosmos3 repos, and public claim boundaries.</p></div><a href="data/public_reader_map.json">reader map</a></article>
5683
  <article class="artifact primary-artifact"><div><h3>Artifact guide</h3><p>Human-readable map from project scope to data contract, task evidence, platform mirrors, and scale-up status.</p></div><a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/blob/main/ARTIFACT_GUIDE.md">artifact guide</a></article>
5684
  <article class="artifact"><h3>Reproduction scripts</h3><p>Training, visualization, taxonomy, walkthrough, validator, and omni-readiness scripts.</p><a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/tree/main/scripts">scripts/</a></article>
5685
  <article class="artifact"><h3>Hugging Face Space</h3><p>The dashboard packaged as a public static Space.</p><a href="https://huggingface.co/spaces/cy0307/ropedia-xperience-10m-task-suite">HF Space</a></article>
@@ -5761,7 +5761,7 @@
5761
  <article class="artifact"><h3>Reproducibility matrix</h3><p>Machine-readable command matrix covering sample download, baselines, the unified 20-task suite, figures, and validation.</p><a href="data/reproducibility_matrix.json">reproducibility matrix</a></article>
5762
  <article class="artifact"><h3>Exact-match reproduction record</h3><p>The last metric rebuild reproduced the public-sample outputs from a fresh cache and matched the committed metrics.</p><a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/blob/main/notes/reproducibility_audit.md">reproduction audit</a></article>
5763
  <article class="artifact"><h3>Project dashboard</h3><p>The website organizes the dataset sample, tasks, methods, results, directions, and scale-up path in one tabbed reader flow.</p><a href="#artifacts">project materials</a></article>
5764
- <article class="artifact"><h3>Line 2 model status</h3><p>The comparison JSON groups selected-128 baselines, Qwen3-Omni v6 LoRA, Cosmos3-Nano Future Window, and Cosmos3-Super Reasoner. Qwen v5/v6 detail stays in a separate lineage audit.</p><a href="data/omni_model_comparison.json">comparison</a><a href="data/qwen3_v5_v6_comparison.json">Qwen v5/v6</a></article>
5765
  </div>
5766
  <p class="repro-note">Minimal path: install the toolkit dependencies, download the official sample, run the task suite with neural heads, regenerate tasks 13-20, build the unified 20-task index, regenerate visualizations, then rebuild the supporting project reports.</p>
5767
  <pre class="code-panel"><button type="button" data-copy="setup">Copy</button><code id="setup">git clone https://github.com/Ropedia/HOMIE-toolkit.git
@@ -6132,7 +6132,7 @@ python scripts/validate_publication_package.py</code></pre>
6132
  if (!sectionOrientation) return;
6133
  const tabLabel = tabLabels[tabKey] || tabKey;
6134
  const sectionLabel = sectionLabels[sectionId] || labelForSection(document.getElementById(sectionId));
6135
- const description = sectionDescriptions[sectionId] || "Use the section pills to move within this layer without losing the evidence trail.";
6136
  sectionOrientation.innerHTML = `
6137
  <strong>You are in: ${tabLabel} / ${sectionLabel}</strong>
6138
  <span>${description}</span>
 
3892
  <div class="eyebrow">two evidence lines / 180 scored records</div>
3893
  <h1>Ropedia Xperience-10M Task Suite.</h1>
3894
  <p class="hero-copy">
3895
+ The public suite has two evidence lines. Line 1 uses one public sample
3896
  episode to make the 20-task lab inspectable and reproducible. Line 2
3897
  uses 128 selected episodes to compare aligned baselines, Qwen3-Omni
3898
  v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future Window. The public matrix is complete at 180/180 scored
 
3922
  </article>
3923
  <article class="suite-line-card">
3924
  <small>line 2 / 128 selected episodes</small>
3925
+ <h3>128 selected episodes: comparison surface</h3>
3926
  <p>Seven methods share the selected-episode surface and the same 20 task axes.</p>
3927
  <div class="line-claim">
3928
  <div><span>valid claim</span><p>Same-split method comparison and scale-up planning.</p></div>
 
4045
  <div class="wrap section-tabs" id="sectionTabs" role="tablist" aria-label="Sections inside the selected project tab"></div>
4046
  <div class="wrap section-orientation" id="sectionOrientation" aria-live="polite">
4047
  <strong>You are in: Start / Project Overview</strong>
4048
+ <span>Use the section pills to move within this section, or jump to the full reader map.</span>
4049
  <a href="#reader-map">Reader map</a>
4050
  </div>
4051
  </div>
 
4186
  </tr>
4187
  </tbody>
4188
  </table>
4189
+ <p class="table-note">Qwen v1-v6 are run-lineage labels inside the selected-128 evidence line, not project evidence lines. Use v6 for the public 20-task Qwen3-Omni row; keep v5 as the pinned prior multiscale comparator; read v1-v4 as pipeline-hardening and ablation evidence. Full details: <a href="data/qwen3_omni_run_lineage.json">qwen3_omni_run_lineage.json</a> and <a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/blob/main/QWEN3_OMNI_RUN_LINEAGE.md">QWEN3_OMNI_RUN_LINEAGE.md</a>.</p>
4190
  <div class="reader-journey" aria-label="Recommended reader journeys">
4191
  <article class="reader-step">
4192
  <small>01 understand</small>
 
5679
  <p>Use these files to navigate the whole project, open the published mirrors, or reproduce the public-sample pipeline.</p>
5680
  </div>
5681
  <div class="artifact-grid">
5682
+ <article class="artifact primary-artifact"><div><h3>Public reader map</h3><p>Single navigation view for GitHub, GitHub Pages, HF Space, artifact dataset, baseline model repo, Qwen3-Omni/Cosmos3 repos, and public claim boundaries.</p></div><a href="data/public_reader_map.json">reader map</a></article>
5683
  <article class="artifact primary-artifact"><div><h3>Artifact guide</h3><p>Human-readable map from project scope to data contract, task evidence, platform mirrors, and scale-up status.</p></div><a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/blob/main/ARTIFACT_GUIDE.md">artifact guide</a></article>
5684
  <article class="artifact"><h3>Reproduction scripts</h3><p>Training, visualization, taxonomy, walkthrough, validator, and omni-readiness scripts.</p><a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/tree/main/scripts">scripts/</a></article>
5685
  <article class="artifact"><h3>Hugging Face Space</h3><p>The dashboard packaged as a public static Space.</p><a href="https://huggingface.co/spaces/cy0307/ropedia-xperience-10m-task-suite">HF Space</a></article>
 
5761
  <article class="artifact"><h3>Reproducibility matrix</h3><p>Machine-readable command matrix covering sample download, baselines, the unified 20-task suite, figures, and validation.</p><a href="data/reproducibility_matrix.json">reproducibility matrix</a></article>
5762
  <article class="artifact"><h3>Exact-match reproduction record</h3><p>The last metric rebuild reproduced the public-sample outputs from a fresh cache and matched the committed metrics.</p><a href="https://github.com/ChaoYue0307/ropedia-xperience-10m-task-suite/blob/main/notes/reproducibility_audit.md">reproduction audit</a></article>
5763
  <article class="artifact"><h3>Project dashboard</h3><p>The website organizes the dataset sample, tasks, methods, results, directions, and scale-up path in one tabbed reader flow.</p><a href="#artifacts">project materials</a></article>
5764
+ <article class="artifact"><h3>Line 2 model status</h3><p>The comparison JSON groups selected-128 baselines, Qwen3-Omni v6 LoRA, Cosmos3-Nano Future Window, and Cosmos3-Super Reasoner. Full Qwen v1-v6 detail stays in a separate lineage audit.</p><a href="data/omni_model_comparison.json">comparison</a><a href="data/qwen3_omni_run_lineage.json">Qwen v1-v6</a></article>
5765
  </div>
5766
  <p class="repro-note">Minimal path: install the toolkit dependencies, download the official sample, run the task suite with neural heads, regenerate tasks 13-20, build the unified 20-task index, regenerate visualizations, then rebuild the supporting project reports.</p>
5767
  <pre class="code-panel"><button type="button" data-copy="setup">Copy</button><code id="setup">git clone https://github.com/Ropedia/HOMIE-toolkit.git
 
6132
  if (!sectionOrientation) return;
6133
  const tabLabel = tabLabels[tabKey] || tabKey;
6134
  const sectionLabel = sectionLabels[sectionId] || labelForSection(document.getElementById(sectionId));
6135
+ const description = sectionDescriptions[sectionId] || "Use the section pills to move within this section without losing the evidence trail.";
6136
  sectionOrientation.innerHTML = `
6137
  <strong>You are in: ${tabLabel} / ${sectionLabel}</strong>
6138
  <span>${description}</span>
metrics/mirror_parity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T11:28:34+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
  "group_count": 1256,
@@ -824,45 +824,45 @@
824
  "local": {
825
  "path": "repo:docs/data/public_reader_map.json",
826
  "exists": true,
827
- "bytes": 5990,
828
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
829
  },
830
  "mirrors": {
831
  "hf_space": {
832
  "path": "hf_space:data/public_reader_map.json",
833
  "exists": true,
834
- "bytes": 5990,
835
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
836
  },
837
  "hf_artifacts_data": {
838
  "path": "hf_artifacts:data/public_reader_map.json",
839
  "exists": true,
840
- "bytes": 5990,
841
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
842
  },
843
  "hf_artifacts": {
844
  "path": "hf_artifacts:docs/data/public_reader_map.json",
845
  "exists": true,
846
- "bytes": 5990,
847
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
848
  },
849
  "hf_model_data": {
850
  "path": "hf_model:data/public_reader_map.json",
851
  "exists": true,
852
- "bytes": 5990,
853
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
854
  },
855
  "hf_model_docs_data": {
856
  "path": "hf_model:docs/data/public_reader_map.json",
857
  "exists": true,
858
- "bytes": 5990,
859
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
860
  },
861
  "hf_model": {
862
  "path": "hf_model:metrics/public_reader_map.json",
863
  "exists": true,
864
- "bytes": 5990,
865
- "sha256": "45b89ce3c3da32856ecaff0902bc6903edcd16032595dc8f850e3fbf16cb0dd6"
866
  }
867
  },
868
  "failures": []
@@ -873,45 +873,45 @@
873
  "local": {
874
  "path": "repo:docs/data/project_status.json",
875
  "exists": true,
876
- "bytes": 23054,
877
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
878
  },
879
  "mirrors": {
880
  "hf_space": {
881
  "path": "hf_space:data/project_status.json",
882
  "exists": true,
883
- "bytes": 23054,
884
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
885
  },
886
  "hf_artifacts_data": {
887
  "path": "hf_artifacts:data/project_status.json",
888
  "exists": true,
889
- "bytes": 23054,
890
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
891
  },
892
  "hf_artifacts": {
893
  "path": "hf_artifacts:docs/data/project_status.json",
894
  "exists": true,
895
- "bytes": 23054,
896
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
897
  },
898
  "hf_model_data": {
899
  "path": "hf_model:data/project_status.json",
900
  "exists": true,
901
- "bytes": 23054,
902
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
903
  },
904
  "hf_model_docs_data": {
905
  "path": "hf_model:docs/data/project_status.json",
906
  "exists": true,
907
- "bytes": 23054,
908
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
909
  },
910
  "hf_model": {
911
  "path": "hf_model:metrics/project_status.json",
912
  "exists": true,
913
- "bytes": 23054,
914
- "sha256": "5ea0e111a04ad255f153567d603f06990240121351fd9cfce850a7935a8dd6f9"
915
  }
916
  },
917
  "failures": []
@@ -923,44 +923,44 @@
923
  "path": "repo:docs/data/publication_audit.json",
924
  "exists": true,
925
  "bytes": 10877,
926
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
927
  },
928
  "mirrors": {
929
  "hf_space": {
930
  "path": "hf_space:data/publication_audit.json",
931
  "exists": true,
932
  "bytes": 10877,
933
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
934
  },
935
  "hf_artifacts_data": {
936
  "path": "hf_artifacts:data/publication_audit.json",
937
  "exists": true,
938
  "bytes": 10877,
939
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
940
  },
941
  "hf_artifacts": {
942
  "path": "hf_artifacts:docs/data/publication_audit.json",
943
  "exists": true,
944
  "bytes": 10877,
945
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
946
  },
947
  "hf_model_data": {
948
  "path": "hf_model:data/publication_audit.json",
949
  "exists": true,
950
  "bytes": 10877,
951
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
952
  },
953
  "hf_model_docs_data": {
954
  "path": "hf_model:docs/data/publication_audit.json",
955
  "exists": true,
956
  "bytes": 10877,
957
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
958
  },
959
  "hf_model": {
960
  "path": "hf_model:metrics/publication_audit.json",
961
  "exists": true,
962
  "bytes": 10877,
963
- "sha256": "43304fea0b61a3a63d595c9de2176e058e75bda3bebc8768918e1fb6b900bb30"
964
  }
965
  },
966
  "failures": []
@@ -972,44 +972,44 @@
972
  "path": "repo:docs/data/public_surface_qa.json",
973
  "exists": true,
974
  "bytes": 7363,
975
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
976
  },
977
  "mirrors": {
978
  "hf_space": {
979
  "path": "hf_space:data/public_surface_qa.json",
980
  "exists": true,
981
  "bytes": 7363,
982
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
983
  },
984
  "hf_artifacts_data": {
985
  "path": "hf_artifacts:data/public_surface_qa.json",
986
  "exists": true,
987
  "bytes": 7363,
988
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
989
  },
990
  "hf_artifacts": {
991
  "path": "hf_artifacts:docs/data/public_surface_qa.json",
992
  "exists": true,
993
  "bytes": 7363,
994
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
995
  },
996
  "hf_model_data": {
997
  "path": "hf_model:data/public_surface_qa.json",
998
  "exists": true,
999
  "bytes": 7363,
1000
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
1001
  },
1002
  "hf_model_docs_data": {
1003
  "path": "hf_model:docs/data/public_surface_qa.json",
1004
  "exists": true,
1005
  "bytes": 7363,
1006
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
1007
  },
1008
  "hf_model": {
1009
  "path": "hf_model:metrics/public_surface_qa.json",
1010
  "exists": true,
1011
  "bytes": 7363,
1012
- "sha256": "ccd0b14986276d41a41d567b40ef5e6e460abbd7b75126511d635e86e61ce8d8"
1013
  }
1014
  },
1015
  "failures": []
@@ -1070,44 +1070,44 @@
1070
  "path": "repo:docs/data/qwen3_omni_run_lineage.json",
1071
  "exists": true,
1072
  "bytes": 11468,
1073
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1074
  },
1075
  "mirrors": {
1076
  "hf_space": {
1077
  "path": "hf_space:data/qwen3_omni_run_lineage.json",
1078
  "exists": true,
1079
  "bytes": 11468,
1080
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1081
  },
1082
  "hf_artifacts_data": {
1083
  "path": "hf_artifacts:data/qwen3_omni_run_lineage.json",
1084
  "exists": true,
1085
  "bytes": 11468,
1086
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1087
  },
1088
  "hf_artifacts": {
1089
  "path": "hf_artifacts:docs/data/qwen3_omni_run_lineage.json",
1090
  "exists": true,
1091
  "bytes": 11468,
1092
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1093
  },
1094
  "hf_model_data": {
1095
  "path": "hf_model:data/qwen3_omni_run_lineage.json",
1096
  "exists": true,
1097
  "bytes": 11468,
1098
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1099
  },
1100
  "hf_model_docs_data": {
1101
  "path": "hf_model:docs/data/qwen3_omni_run_lineage.json",
1102
  "exists": true,
1103
  "bytes": 11468,
1104
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1105
  },
1106
  "hf_model": {
1107
  "path": "hf_model:metrics/qwen3_omni_run_lineage.json",
1108
  "exists": true,
1109
  "bytes": 11468,
1110
- "sha256": "ec71e0b80a3eb16d25aa78040fe7247cf0dd889bcce789167b54a3bfda7722c5"
1111
  }
1112
  },
1113
  "failures": []
@@ -1168,44 +1168,44 @@
1168
  "path": "repo:docs/data/quality_gates.json",
1169
  "exists": true,
1170
  "bytes": 8640,
1171
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1172
  },
1173
  "mirrors": {
1174
  "hf_space": {
1175
  "path": "hf_space:data/quality_gates.json",
1176
  "exists": true,
1177
  "bytes": 8640,
1178
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1179
  },
1180
  "hf_artifacts_data": {
1181
  "path": "hf_artifacts:data/quality_gates.json",
1182
  "exists": true,
1183
  "bytes": 8640,
1184
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1185
  },
1186
  "hf_artifacts": {
1187
  "path": "hf_artifacts:docs/data/quality_gates.json",
1188
  "exists": true,
1189
  "bytes": 8640,
1190
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1191
  },
1192
  "hf_model_data": {
1193
  "path": "hf_model:data/quality_gates.json",
1194
  "exists": true,
1195
  "bytes": 8640,
1196
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1197
  },
1198
  "hf_model_docs_data": {
1199
  "path": "hf_model:docs/data/quality_gates.json",
1200
  "exists": true,
1201
  "bytes": 8640,
1202
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1203
  },
1204
  "hf_model": {
1205
  "path": "hf_model:metrics/quality_gates.json",
1206
  "exists": true,
1207
  "bytes": 8640,
1208
- "sha256": "2078de7f914b19d768c9c529adfc6d83f7513a53660a9279d8d6d3041747179f"
1209
  }
1210
  },
1211
  "failures": []
@@ -1609,44 +1609,44 @@
1609
  "path": "repo:docs/data/scope_claims_audit.json",
1610
  "exists": true,
1611
  "bytes": 21313,
1612
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1613
  },
1614
  "mirrors": {
1615
  "hf_space": {
1616
  "path": "hf_space:data/scope_claims_audit.json",
1617
  "exists": true,
1618
  "bytes": 21313,
1619
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1620
  },
1621
  "hf_artifacts_data": {
1622
  "path": "hf_artifacts:data/scope_claims_audit.json",
1623
  "exists": true,
1624
  "bytes": 21313,
1625
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1626
  },
1627
  "hf_artifacts": {
1628
  "path": "hf_artifacts:docs/data/scope_claims_audit.json",
1629
  "exists": true,
1630
  "bytes": 21313,
1631
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1632
  },
1633
  "hf_model_data": {
1634
  "path": "hf_model:data/scope_claims_audit.json",
1635
  "exists": true,
1636
  "bytes": 21313,
1637
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1638
  },
1639
  "hf_model_docs_data": {
1640
  "path": "hf_model:docs/data/scope_claims_audit.json",
1641
  "exists": true,
1642
  "bytes": 21313,
1643
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1644
  },
1645
  "hf_model": {
1646
  "path": "hf_model:metrics/scope_claims_audit.json",
1647
  "exists": true,
1648
  "bytes": 21313,
1649
- "sha256": "ee829a14548686984894909cecae2f9c8012366c2760df62745f4ef409b04aac"
1650
  }
1651
  },
1652
  "failures": []
@@ -1707,44 +1707,44 @@
1707
  "path": "repo:docs/data/source_alignment_audit.json",
1708
  "exists": true,
1709
  "bytes": 4432,
1710
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1711
  },
1712
  "mirrors": {
1713
  "hf_space": {
1714
  "path": "hf_space:data/source_alignment_audit.json",
1715
  "exists": true,
1716
  "bytes": 4432,
1717
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1718
  },
1719
  "hf_artifacts_data": {
1720
  "path": "hf_artifacts:data/source_alignment_audit.json",
1721
  "exists": true,
1722
  "bytes": 4432,
1723
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1724
  },
1725
  "hf_artifacts": {
1726
  "path": "hf_artifacts:docs/data/source_alignment_audit.json",
1727
  "exists": true,
1728
  "bytes": 4432,
1729
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1730
  },
1731
  "hf_model_data": {
1732
  "path": "hf_model:data/source_alignment_audit.json",
1733
  "exists": true,
1734
  "bytes": 4432,
1735
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1736
  },
1737
  "hf_model_docs_data": {
1738
  "path": "hf_model:docs/data/source_alignment_audit.json",
1739
  "exists": true,
1740
  "bytes": 4432,
1741
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1742
  },
1743
  "hf_model": {
1744
  "path": "hf_model:metrics/source_alignment_audit.json",
1745
  "exists": true,
1746
  "bytes": 4432,
1747
- "sha256": "feb9f59b793e1e2222d8f0ef80c75bb346bf8a3fd7baa7da0bc28dd156f4d55d"
1748
  }
1749
  },
1750
  "failures": []
@@ -1951,45 +1951,45 @@
1951
  "local": {
1952
  "path": "repo:docs/data/two_evidence_lines.json",
1953
  "exists": true,
1954
- "bytes": 7349,
1955
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1956
  },
1957
  "mirrors": {
1958
  "hf_space": {
1959
  "path": "hf_space:data/two_evidence_lines.json",
1960
  "exists": true,
1961
- "bytes": 7349,
1962
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1963
  },
1964
  "hf_artifacts_data": {
1965
  "path": "hf_artifacts:data/two_evidence_lines.json",
1966
  "exists": true,
1967
- "bytes": 7349,
1968
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1969
  },
1970
  "hf_artifacts": {
1971
  "path": "hf_artifacts:docs/data/two_evidence_lines.json",
1972
  "exists": true,
1973
- "bytes": 7349,
1974
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1975
  },
1976
  "hf_model_data": {
1977
  "path": "hf_model:data/two_evidence_lines.json",
1978
  "exists": true,
1979
- "bytes": 7349,
1980
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1981
  },
1982
  "hf_model_docs_data": {
1983
  "path": "hf_model:docs/data/two_evidence_lines.json",
1984
  "exists": true,
1985
- "bytes": 7349,
1986
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1987
  },
1988
  "hf_model": {
1989
  "path": "hf_model:metrics/two_evidence_lines.json",
1990
  "exists": true,
1991
- "bytes": 7349,
1992
- "sha256": "f742b95626bcbd69d1798533b98b5ac9e3156b10442f8d310113812c5131c3cf"
1993
  }
1994
  },
1995
  "failures": []
@@ -2000,45 +2000,45 @@
2000
  "local": {
2001
  "path": "repo:docs/data/two_evidence_line_result_summary.json",
2002
  "exists": true,
2003
- "bytes": 17414,
2004
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2005
  },
2006
  "mirrors": {
2007
  "hf_space": {
2008
  "path": "hf_space:data/two_evidence_line_result_summary.json",
2009
  "exists": true,
2010
- "bytes": 17414,
2011
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2012
  },
2013
  "hf_artifacts_data": {
2014
  "path": "hf_artifacts:data/two_evidence_line_result_summary.json",
2015
  "exists": true,
2016
- "bytes": 17414,
2017
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2018
  },
2019
  "hf_artifacts": {
2020
  "path": "hf_artifacts:docs/data/two_evidence_line_result_summary.json",
2021
  "exists": true,
2022
- "bytes": 17414,
2023
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2024
  },
2025
  "hf_model_data": {
2026
  "path": "hf_model:data/two_evidence_line_result_summary.json",
2027
  "exists": true,
2028
- "bytes": 17414,
2029
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2030
  },
2031
  "hf_model_docs_data": {
2032
  "path": "hf_model:docs/data/two_evidence_line_result_summary.json",
2033
  "exists": true,
2034
- "bytes": 17414,
2035
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2036
  },
2037
  "hf_model": {
2038
  "path": "hf_model:metrics/two_evidence_line_result_summary.json",
2039
  "exists": true,
2040
- "bytes": 17414,
2041
- "sha256": "ab22372862456258c27eb429b8bb7a2520e8be178d466df7595767f2eb5815fc"
2042
  }
2043
  },
2044
  "failures": []
@@ -2148,44 +2148,44 @@
2148
  "path": "repo:docs/data/task_surface_integrity.json",
2149
  "exists": true,
2150
  "bytes": 46246,
2151
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2152
  },
2153
  "mirrors": {
2154
  "hf_space": {
2155
  "path": "hf_space:data/task_surface_integrity.json",
2156
  "exists": true,
2157
  "bytes": 46246,
2158
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2159
  },
2160
  "hf_artifacts_data": {
2161
  "path": "hf_artifacts:data/task_surface_integrity.json",
2162
  "exists": true,
2163
  "bytes": 46246,
2164
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2165
  },
2166
  "hf_artifacts": {
2167
  "path": "hf_artifacts:docs/data/task_surface_integrity.json",
2168
  "exists": true,
2169
  "bytes": 46246,
2170
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2171
  },
2172
  "hf_model_data": {
2173
  "path": "hf_model:data/task_surface_integrity.json",
2174
  "exists": true,
2175
  "bytes": 46246,
2176
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2177
  },
2178
  "hf_model_docs_data": {
2179
  "path": "hf_model:docs/data/task_surface_integrity.json",
2180
  "exists": true,
2181
  "bytes": 46246,
2182
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2183
  },
2184
  "hf_model": {
2185
  "path": "hf_model:metrics/task_surface_integrity.json",
2186
  "exists": true,
2187
  "bytes": 46246,
2188
- "sha256": "08b4b4d5922ace3e1465b17ec594529df96901be0d53a0e8b2a49a814315096a"
2189
  }
2190
  },
2191
  "failures": []
@@ -2344,44 +2344,44 @@
2344
  "path": "repo:docs/data/task_method_20_source_audit.json",
2345
  "exists": true,
2346
  "bytes": 561,
2347
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2348
  },
2349
  "mirrors": {
2350
  "hf_space": {
2351
  "path": "hf_space:data/task_method_20_source_audit.json",
2352
  "exists": true,
2353
  "bytes": 561,
2354
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2355
  },
2356
  "hf_artifacts_data": {
2357
  "path": "hf_artifacts:data/task_method_20_source_audit.json",
2358
  "exists": true,
2359
  "bytes": 561,
2360
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2361
  },
2362
  "hf_artifacts": {
2363
  "path": "hf_artifacts:docs/data/task_method_20_source_audit.json",
2364
  "exists": true,
2365
  "bytes": 561,
2366
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2367
  },
2368
  "hf_model_data": {
2369
  "path": "hf_model:data/task_method_20_source_audit.json",
2370
  "exists": true,
2371
  "bytes": 561,
2372
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2373
  },
2374
  "hf_model_docs_data": {
2375
  "path": "hf_model:docs/data/task_method_20_source_audit.json",
2376
  "exists": true,
2377
  "bytes": 561,
2378
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2379
  },
2380
  "hf_model": {
2381
  "path": "hf_model:metrics/task_method_20_source_audit.json",
2382
  "exists": true,
2383
  "bytes": 561,
2384
- "sha256": "1ec3670e868bd6ed60110fb4703e9794f2c425016800121177ae5aa57b661edb"
2385
  }
2386
  },
2387
  "failures": []
@@ -2491,44 +2491,44 @@
2491
  "path": "repo:docs/data/website_integrity.json",
2492
  "exists": true,
2493
  "bytes": 20658,
2494
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2495
  },
2496
  "mirrors": {
2497
  "hf_space": {
2498
  "path": "hf_space:data/website_integrity.json",
2499
  "exists": true,
2500
  "bytes": 20658,
2501
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2502
  },
2503
  "hf_artifacts_data": {
2504
  "path": "hf_artifacts:data/website_integrity.json",
2505
  "exists": true,
2506
  "bytes": 20658,
2507
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2508
  },
2509
  "hf_artifacts": {
2510
  "path": "hf_artifacts:docs/data/website_integrity.json",
2511
  "exists": true,
2512
  "bytes": 20658,
2513
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2514
  },
2515
  "hf_model_data": {
2516
  "path": "hf_model:data/website_integrity.json",
2517
  "exists": true,
2518
  "bytes": 20658,
2519
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2520
  },
2521
  "hf_model_docs_data": {
2522
  "path": "hf_model:docs/data/website_integrity.json",
2523
  "exists": true,
2524
  "bytes": 20658,
2525
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2526
  },
2527
  "hf_model": {
2528
  "path": "hf_model:metrics/website_integrity.json",
2529
  "exists": true,
2530
  "bytes": 20658,
2531
- "sha256": "936202947e3e2f1a1003205e4a7cecf2eabb0dd5fd4f5235af662094da3b5b5a"
2532
  }
2533
  },
2534
  "failures": []
@@ -5503,21 +5503,21 @@
5503
  "local": {
5504
  "path": "repo:scripts/build_multilingual_public_readmes.py",
5505
  "exists": true,
5506
- "bytes": 51318,
5507
- "sha256": "12e32d1395d32f54fcc946826ff41658641b7dfadaef763db824ff9b8460e9f8"
5508
  },
5509
  "mirrors": {
5510
  "hf_artifacts": {
5511
  "path": "hf_artifacts:scripts/build_multilingual_public_readmes.py",
5512
  "exists": true,
5513
- "bytes": 51318,
5514
- "sha256": "12e32d1395d32f54fcc946826ff41658641b7dfadaef763db824ff9b8460e9f8"
5515
  },
5516
  "hf_model": {
5517
  "path": "hf_model:scripts/build_multilingual_public_readmes.py",
5518
  "exists": true,
5519
- "bytes": 51318,
5520
- "sha256": "12e32d1395d32f54fcc946826ff41658641b7dfadaef763db824ff9b8460e9f8"
5521
  }
5522
  },
5523
  "failures": []
@@ -6282,39 +6282,39 @@
6282
  "local": {
6283
  "path": "repo:docs/index.html",
6284
  "exists": true,
6285
- "bytes": 300323,
6286
- "sha256": "8237864aa6a7717d6d3d3395436a9477ed78fbf7fe5fd36ecdf55b675c68b838"
6287
  },
6288
  "mirrors": {
6289
  "hf_space": {
6290
  "path": "hf_space:index.html",
6291
  "exists": true,
6292
- "bytes": 300323,
6293
- "sha256": "8237864aa6a7717d6d3d3395436a9477ed78fbf7fe5fd36ecdf55b675c68b838"
6294
  },
6295
  "hf_artifacts_root": {
6296
  "path": "hf_artifacts:index.html",
6297
  "exists": true,
6298
- "bytes": 300323,
6299
- "sha256": "8237864aa6a7717d6d3d3395436a9477ed78fbf7fe5fd36ecdf55b675c68b838"
6300
  },
6301
  "hf_artifacts_docs": {
6302
  "path": "hf_artifacts:docs/index.html",
6303
  "exists": true,
6304
- "bytes": 300323,
6305
- "sha256": "8237864aa6a7717d6d3d3395436a9477ed78fbf7fe5fd36ecdf55b675c68b838"
6306
  },
6307
  "hf_model": {
6308
  "path": "hf_model:index.html",
6309
  "exists": true,
6310
- "bytes": 300323,
6311
- "sha256": "8237864aa6a7717d6d3d3395436a9477ed78fbf7fe5fd36ecdf55b675c68b838"
6312
  },
6313
  "hf_model_docs": {
6314
  "path": "hf_model:docs/index.html",
6315
  "exists": true,
6316
- "bytes": 300323,
6317
- "sha256": "8237864aa6a7717d6d3d3395436a9477ed78fbf7fe5fd36ecdf55b675c68b838"
6318
  }
6319
  },
6320
  "failures": []
@@ -31102,27 +31102,27 @@
31102
  "local": {
31103
  "path": "repo:TWO_EVIDENCE_LINES.md",
31104
  "exists": true,
31105
- "bytes": 4293,
31106
- "sha256": "f7c47021534898a4cf2d047be78931a88f9cf1efa98f740d7573fdc67d9c2294"
31107
  },
31108
  "mirrors": {
31109
  "hf_space": {
31110
  "path": "hf_space:TWO_EVIDENCE_LINES.md",
31111
  "exists": true,
31112
- "bytes": 4293,
31113
- "sha256": "f7c47021534898a4cf2d047be78931a88f9cf1efa98f740d7573fdc67d9c2294"
31114
  },
31115
  "hf_artifacts": {
31116
  "path": "hf_artifacts:TWO_EVIDENCE_LINES.md",
31117
  "exists": true,
31118
- "bytes": 4293,
31119
- "sha256": "f7c47021534898a4cf2d047be78931a88f9cf1efa98f740d7573fdc67d9c2294"
31120
  },
31121
  "hf_model": {
31122
  "path": "hf_model:TWO_EVIDENCE_LINES.md",
31123
  "exists": true,
31124
- "bytes": 4293,
31125
- "sha256": "f7c47021534898a4cf2d047be78931a88f9cf1efa98f740d7573fdc67d9c2294"
31126
  }
31127
  },
31128
  "failures": []
@@ -31133,27 +31133,27 @@
31133
  "local": {
31134
  "path": "repo:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31135
  "exists": true,
31136
- "bytes": 8724,
31137
- "sha256": "ae09f8d4fccbb4e5d9953bf98b65d619a092f844f892801deeb095ba0ac148e3"
31138
  },
31139
  "mirrors": {
31140
  "hf_space": {
31141
  "path": "hf_space:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31142
  "exists": true,
31143
- "bytes": 8724,
31144
- "sha256": "ae09f8d4fccbb4e5d9953bf98b65d619a092f844f892801deeb095ba0ac148e3"
31145
  },
31146
  "hf_artifacts": {
31147
  "path": "hf_artifacts:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31148
  "exists": true,
31149
- "bytes": 8724,
31150
- "sha256": "ae09f8d4fccbb4e5d9953bf98b65d619a092f844f892801deeb095ba0ac148e3"
31151
  },
31152
  "hf_model": {
31153
  "path": "hf_model:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31154
  "exists": true,
31155
- "bytes": 8724,
31156
- "sha256": "ae09f8d4fccbb4e5d9953bf98b65d619a092f844f892801deeb095ba0ac148e3"
31157
  }
31158
  },
31159
  "failures": []
@@ -31165,26 +31165,26 @@
31165
  "path": "repo:QWEN3_OMNI_RUN_LINEAGE.md",
31166
  "exists": true,
31167
  "bytes": 6287,
31168
- "sha256": "07e3d0576131200d15d31dc824303fe9d87cd93050725dd06aebedbc1388aa98"
31169
  },
31170
  "mirrors": {
31171
  "hf_space": {
31172
  "path": "hf_space:QWEN3_OMNI_RUN_LINEAGE.md",
31173
  "exists": true,
31174
  "bytes": 6287,
31175
- "sha256": "07e3d0576131200d15d31dc824303fe9d87cd93050725dd06aebedbc1388aa98"
31176
  },
31177
  "hf_artifacts": {
31178
  "path": "hf_artifacts:QWEN3_OMNI_RUN_LINEAGE.md",
31179
  "exists": true,
31180
  "bytes": 6287,
31181
- "sha256": "07e3d0576131200d15d31dc824303fe9d87cd93050725dd06aebedbc1388aa98"
31182
  },
31183
  "hf_model": {
31184
  "path": "hf_model:QWEN3_OMNI_RUN_LINEAGE.md",
31185
  "exists": true,
31186
  "bytes": 6287,
31187
- "sha256": "07e3d0576131200d15d31dc824303fe9d87cd93050725dd06aebedbc1388aa98"
31188
  }
31189
  },
31190
  "failures": []
@@ -31195,27 +31195,27 @@
31195
  "local": {
31196
  "path": "repo:PROJECT_BRIEF.md",
31197
  "exists": true,
31198
- "bytes": 4343,
31199
- "sha256": "fa002a54a46f1789eab79c74b5203e03e9389ff83abea7dd491fd20a676f1da5"
31200
  },
31201
  "mirrors": {
31202
  "hf_space": {
31203
  "path": "hf_space:PROJECT_BRIEF.md",
31204
  "exists": true,
31205
- "bytes": 4343,
31206
- "sha256": "fa002a54a46f1789eab79c74b5203e03e9389ff83abea7dd491fd20a676f1da5"
31207
  },
31208
  "hf_artifacts": {
31209
  "path": "hf_artifacts:PROJECT_BRIEF.md",
31210
  "exists": true,
31211
- "bytes": 4343,
31212
- "sha256": "fa002a54a46f1789eab79c74b5203e03e9389ff83abea7dd491fd20a676f1da5"
31213
  },
31214
  "hf_model": {
31215
  "path": "hf_model:PROJECT_BRIEF.md",
31216
  "exists": true,
31217
- "bytes": 4343,
31218
- "sha256": "fa002a54a46f1789eab79c74b5203e03e9389ff83abea7dd491fd20a676f1da5"
31219
  }
31220
  },
31221
  "failures": []
@@ -31226,27 +31226,27 @@
31226
  "local": {
31227
  "path": "repo:PUBLIC_READER_MAP.md",
31228
  "exists": true,
31229
- "bytes": 4948,
31230
- "sha256": "7a7128fdde08f770338c3fe2d473565918c5633f948dec6a78a6b2a67938e91a"
31231
  },
31232
  "mirrors": {
31233
  "hf_space": {
31234
  "path": "hf_space:PUBLIC_READER_MAP.md",
31235
  "exists": true,
31236
- "bytes": 4948,
31237
- "sha256": "7a7128fdde08f770338c3fe2d473565918c5633f948dec6a78a6b2a67938e91a"
31238
  },
31239
  "hf_artifacts": {
31240
  "path": "hf_artifacts:PUBLIC_READER_MAP.md",
31241
  "exists": true,
31242
- "bytes": 4948,
31243
- "sha256": "7a7128fdde08f770338c3fe2d473565918c5633f948dec6a78a6b2a67938e91a"
31244
  },
31245
  "hf_model": {
31246
  "path": "hf_model:PUBLIC_READER_MAP.md",
31247
  "exists": true,
31248
- "bytes": 4948,
31249
- "sha256": "7a7128fdde08f770338c3fe2d473565918c5633f948dec6a78a6b2a67938e91a"
31250
  }
31251
  },
31252
  "failures": []
@@ -31319,27 +31319,27 @@
31319
  "local": {
31320
  "path": "repo:PROJECT_STATUS.md",
31321
  "exists": true,
31322
- "bytes": 14845,
31323
- "sha256": "128daeed89b672d89b4c422956bf4900ffe8efd54356ec657fb9cf6dcb880ba5"
31324
  },
31325
  "mirrors": {
31326
  "hf_space": {
31327
  "path": "hf_space:PROJECT_STATUS.md",
31328
  "exists": true,
31329
- "bytes": 14845,
31330
- "sha256": "128daeed89b672d89b4c422956bf4900ffe8efd54356ec657fb9cf6dcb880ba5"
31331
  },
31332
  "hf_artifacts": {
31333
  "path": "hf_artifacts:PROJECT_STATUS.md",
31334
  "exists": true,
31335
- "bytes": 14845,
31336
- "sha256": "128daeed89b672d89b4c422956bf4900ffe8efd54356ec657fb9cf6dcb880ba5"
31337
  },
31338
  "hf_model": {
31339
  "path": "hf_model:PROJECT_STATUS.md",
31340
  "exists": true,
31341
- "bytes": 14845,
31342
- "sha256": "128daeed89b672d89b4c422956bf4900ffe8efd54356ec657fb9cf6dcb880ba5"
31343
  }
31344
  },
31345
  "failures": []
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T11:53:22+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
  "group_count": 1256,
 
824
  "local": {
825
  "path": "repo:docs/data/public_reader_map.json",
826
  "exists": true,
827
+ "bytes": 6757,
828
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
829
  },
830
  "mirrors": {
831
  "hf_space": {
832
  "path": "hf_space:data/public_reader_map.json",
833
  "exists": true,
834
+ "bytes": 6757,
835
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
836
  },
837
  "hf_artifacts_data": {
838
  "path": "hf_artifacts:data/public_reader_map.json",
839
  "exists": true,
840
+ "bytes": 6757,
841
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
842
  },
843
  "hf_artifacts": {
844
  "path": "hf_artifacts:docs/data/public_reader_map.json",
845
  "exists": true,
846
+ "bytes": 6757,
847
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
848
  },
849
  "hf_model_data": {
850
  "path": "hf_model:data/public_reader_map.json",
851
  "exists": true,
852
+ "bytes": 6757,
853
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
854
  },
855
  "hf_model_docs_data": {
856
  "path": "hf_model:docs/data/public_reader_map.json",
857
  "exists": true,
858
+ "bytes": 6757,
859
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
860
  },
861
  "hf_model": {
862
  "path": "hf_model:metrics/public_reader_map.json",
863
  "exists": true,
864
+ "bytes": 6757,
865
+ "sha256": "b08deaeaeeb19e8522966e85076250724541370a30d487b67272fd544f0398fc"
866
  }
867
  },
868
  "failures": []
 
873
  "local": {
874
  "path": "repo:docs/data/project_status.json",
875
  "exists": true,
876
+ "bytes": 23255,
877
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
878
  },
879
  "mirrors": {
880
  "hf_space": {
881
  "path": "hf_space:data/project_status.json",
882
  "exists": true,
883
+ "bytes": 23255,
884
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
885
  },
886
  "hf_artifacts_data": {
887
  "path": "hf_artifacts:data/project_status.json",
888
  "exists": true,
889
+ "bytes": 23255,
890
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
891
  },
892
  "hf_artifacts": {
893
  "path": "hf_artifacts:docs/data/project_status.json",
894
  "exists": true,
895
+ "bytes": 23255,
896
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
897
  },
898
  "hf_model_data": {
899
  "path": "hf_model:data/project_status.json",
900
  "exists": true,
901
+ "bytes": 23255,
902
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
903
  },
904
  "hf_model_docs_data": {
905
  "path": "hf_model:docs/data/project_status.json",
906
  "exists": true,
907
+ "bytes": 23255,
908
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
909
  },
910
  "hf_model": {
911
  "path": "hf_model:metrics/project_status.json",
912
  "exists": true,
913
+ "bytes": 23255,
914
+ "sha256": "874f1133ee75f060735f0c9e763cf81463f304432f1dbca3ebc9837225c0259d"
915
  }
916
  },
917
  "failures": []
 
923
  "path": "repo:docs/data/publication_audit.json",
924
  "exists": true,
925
  "bytes": 10877,
926
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
927
  },
928
  "mirrors": {
929
  "hf_space": {
930
  "path": "hf_space:data/publication_audit.json",
931
  "exists": true,
932
  "bytes": 10877,
933
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
934
  },
935
  "hf_artifacts_data": {
936
  "path": "hf_artifacts:data/publication_audit.json",
937
  "exists": true,
938
  "bytes": 10877,
939
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
940
  },
941
  "hf_artifacts": {
942
  "path": "hf_artifacts:docs/data/publication_audit.json",
943
  "exists": true,
944
  "bytes": 10877,
945
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
946
  },
947
  "hf_model_data": {
948
  "path": "hf_model:data/publication_audit.json",
949
  "exists": true,
950
  "bytes": 10877,
951
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
952
  },
953
  "hf_model_docs_data": {
954
  "path": "hf_model:docs/data/publication_audit.json",
955
  "exists": true,
956
  "bytes": 10877,
957
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
958
  },
959
  "hf_model": {
960
  "path": "hf_model:metrics/publication_audit.json",
961
  "exists": true,
962
  "bytes": 10877,
963
+ "sha256": "b856e7450eaa019f17364c2d99f70ac4a99f313a57984263fede4ed999f0d167"
964
  }
965
  },
966
  "failures": []
 
972
  "path": "repo:docs/data/public_surface_qa.json",
973
  "exists": true,
974
  "bytes": 7363,
975
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
976
  },
977
  "mirrors": {
978
  "hf_space": {
979
  "path": "hf_space:data/public_surface_qa.json",
980
  "exists": true,
981
  "bytes": 7363,
982
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
983
  },
984
  "hf_artifacts_data": {
985
  "path": "hf_artifacts:data/public_surface_qa.json",
986
  "exists": true,
987
  "bytes": 7363,
988
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
989
  },
990
  "hf_artifacts": {
991
  "path": "hf_artifacts:docs/data/public_surface_qa.json",
992
  "exists": true,
993
  "bytes": 7363,
994
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
995
  },
996
  "hf_model_data": {
997
  "path": "hf_model:data/public_surface_qa.json",
998
  "exists": true,
999
  "bytes": 7363,
1000
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
1001
  },
1002
  "hf_model_docs_data": {
1003
  "path": "hf_model:docs/data/public_surface_qa.json",
1004
  "exists": true,
1005
  "bytes": 7363,
1006
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
1007
  },
1008
  "hf_model": {
1009
  "path": "hf_model:metrics/public_surface_qa.json",
1010
  "exists": true,
1011
  "bytes": 7363,
1012
+ "sha256": "ce1d103b0f514339819c809f4def9dfff1400a3adf7d1f891fa228dc753c7dbd"
1013
  }
1014
  },
1015
  "failures": []
 
1070
  "path": "repo:docs/data/qwen3_omni_run_lineage.json",
1071
  "exists": true,
1072
  "bytes": 11468,
1073
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1074
  },
1075
  "mirrors": {
1076
  "hf_space": {
1077
  "path": "hf_space:data/qwen3_omni_run_lineage.json",
1078
  "exists": true,
1079
  "bytes": 11468,
1080
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1081
  },
1082
  "hf_artifacts_data": {
1083
  "path": "hf_artifacts:data/qwen3_omni_run_lineage.json",
1084
  "exists": true,
1085
  "bytes": 11468,
1086
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1087
  },
1088
  "hf_artifacts": {
1089
  "path": "hf_artifacts:docs/data/qwen3_omni_run_lineage.json",
1090
  "exists": true,
1091
  "bytes": 11468,
1092
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1093
  },
1094
  "hf_model_data": {
1095
  "path": "hf_model:data/qwen3_omni_run_lineage.json",
1096
  "exists": true,
1097
  "bytes": 11468,
1098
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1099
  },
1100
  "hf_model_docs_data": {
1101
  "path": "hf_model:docs/data/qwen3_omni_run_lineage.json",
1102
  "exists": true,
1103
  "bytes": 11468,
1104
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1105
  },
1106
  "hf_model": {
1107
  "path": "hf_model:metrics/qwen3_omni_run_lineage.json",
1108
  "exists": true,
1109
  "bytes": 11468,
1110
+ "sha256": "668b61d087351e6573c3d0aaddbe34dab425da4cc2b5f77252961f27c661aea8"
1111
  }
1112
  },
1113
  "failures": []
 
1168
  "path": "repo:docs/data/quality_gates.json",
1169
  "exists": true,
1170
  "bytes": 8640,
1171
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1172
  },
1173
  "mirrors": {
1174
  "hf_space": {
1175
  "path": "hf_space:data/quality_gates.json",
1176
  "exists": true,
1177
  "bytes": 8640,
1178
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1179
  },
1180
  "hf_artifacts_data": {
1181
  "path": "hf_artifacts:data/quality_gates.json",
1182
  "exists": true,
1183
  "bytes": 8640,
1184
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1185
  },
1186
  "hf_artifacts": {
1187
  "path": "hf_artifacts:docs/data/quality_gates.json",
1188
  "exists": true,
1189
  "bytes": 8640,
1190
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1191
  },
1192
  "hf_model_data": {
1193
  "path": "hf_model:data/quality_gates.json",
1194
  "exists": true,
1195
  "bytes": 8640,
1196
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1197
  },
1198
  "hf_model_docs_data": {
1199
  "path": "hf_model:docs/data/quality_gates.json",
1200
  "exists": true,
1201
  "bytes": 8640,
1202
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1203
  },
1204
  "hf_model": {
1205
  "path": "hf_model:metrics/quality_gates.json",
1206
  "exists": true,
1207
  "bytes": 8640,
1208
+ "sha256": "66ba956b430f916c8e623ba8eacb1b6ccc3cfa6b9b4f795ea2f0436a93b0702b"
1209
  }
1210
  },
1211
  "failures": []
 
1609
  "path": "repo:docs/data/scope_claims_audit.json",
1610
  "exists": true,
1611
  "bytes": 21313,
1612
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1613
  },
1614
  "mirrors": {
1615
  "hf_space": {
1616
  "path": "hf_space:data/scope_claims_audit.json",
1617
  "exists": true,
1618
  "bytes": 21313,
1619
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1620
  },
1621
  "hf_artifacts_data": {
1622
  "path": "hf_artifacts:data/scope_claims_audit.json",
1623
  "exists": true,
1624
  "bytes": 21313,
1625
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1626
  },
1627
  "hf_artifacts": {
1628
  "path": "hf_artifacts:docs/data/scope_claims_audit.json",
1629
  "exists": true,
1630
  "bytes": 21313,
1631
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1632
  },
1633
  "hf_model_data": {
1634
  "path": "hf_model:data/scope_claims_audit.json",
1635
  "exists": true,
1636
  "bytes": 21313,
1637
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1638
  },
1639
  "hf_model_docs_data": {
1640
  "path": "hf_model:docs/data/scope_claims_audit.json",
1641
  "exists": true,
1642
  "bytes": 21313,
1643
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1644
  },
1645
  "hf_model": {
1646
  "path": "hf_model:metrics/scope_claims_audit.json",
1647
  "exists": true,
1648
  "bytes": 21313,
1649
+ "sha256": "cbc55e80e98652c0bdca92721df3dba5c975aeec8b894d1fa48dfc5c820b85d4"
1650
  }
1651
  },
1652
  "failures": []
 
1707
  "path": "repo:docs/data/source_alignment_audit.json",
1708
  "exists": true,
1709
  "bytes": 4432,
1710
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1711
  },
1712
  "mirrors": {
1713
  "hf_space": {
1714
  "path": "hf_space:data/source_alignment_audit.json",
1715
  "exists": true,
1716
  "bytes": 4432,
1717
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1718
  },
1719
  "hf_artifacts_data": {
1720
  "path": "hf_artifacts:data/source_alignment_audit.json",
1721
  "exists": true,
1722
  "bytes": 4432,
1723
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1724
  },
1725
  "hf_artifacts": {
1726
  "path": "hf_artifacts:docs/data/source_alignment_audit.json",
1727
  "exists": true,
1728
  "bytes": 4432,
1729
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1730
  },
1731
  "hf_model_data": {
1732
  "path": "hf_model:data/source_alignment_audit.json",
1733
  "exists": true,
1734
  "bytes": 4432,
1735
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1736
  },
1737
  "hf_model_docs_data": {
1738
  "path": "hf_model:docs/data/source_alignment_audit.json",
1739
  "exists": true,
1740
  "bytes": 4432,
1741
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1742
  },
1743
  "hf_model": {
1744
  "path": "hf_model:metrics/source_alignment_audit.json",
1745
  "exists": true,
1746
  "bytes": 4432,
1747
+ "sha256": "395bb7b9cc1399be3e736172ce051a2281bc3274c1e05902e85ba9b48c262d9d"
1748
  }
1749
  },
1750
  "failures": []
 
1951
  "local": {
1952
  "path": "repo:docs/data/two_evidence_lines.json",
1953
  "exists": true,
1954
+ "bytes": 7351,
1955
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1956
  },
1957
  "mirrors": {
1958
  "hf_space": {
1959
  "path": "hf_space:data/two_evidence_lines.json",
1960
  "exists": true,
1961
+ "bytes": 7351,
1962
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1963
  },
1964
  "hf_artifacts_data": {
1965
  "path": "hf_artifacts:data/two_evidence_lines.json",
1966
  "exists": true,
1967
+ "bytes": 7351,
1968
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1969
  },
1970
  "hf_artifacts": {
1971
  "path": "hf_artifacts:docs/data/two_evidence_lines.json",
1972
  "exists": true,
1973
+ "bytes": 7351,
1974
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1975
  },
1976
  "hf_model_data": {
1977
  "path": "hf_model:data/two_evidence_lines.json",
1978
  "exists": true,
1979
+ "bytes": 7351,
1980
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1981
  },
1982
  "hf_model_docs_data": {
1983
  "path": "hf_model:docs/data/two_evidence_lines.json",
1984
  "exists": true,
1985
+ "bytes": 7351,
1986
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1987
  },
1988
  "hf_model": {
1989
  "path": "hf_model:metrics/two_evidence_lines.json",
1990
  "exists": true,
1991
+ "bytes": 7351,
1992
+ "sha256": "3e9870122c495361d237acdcfdd05a0900b52179fc0db611291d3edc186b0667"
1993
  }
1994
  },
1995
  "failures": []
 
2000
  "local": {
2001
  "path": "repo:docs/data/two_evidence_line_result_summary.json",
2002
  "exists": true,
2003
+ "bytes": 17416,
2004
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2005
  },
2006
  "mirrors": {
2007
  "hf_space": {
2008
  "path": "hf_space:data/two_evidence_line_result_summary.json",
2009
  "exists": true,
2010
+ "bytes": 17416,
2011
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2012
  },
2013
  "hf_artifacts_data": {
2014
  "path": "hf_artifacts:data/two_evidence_line_result_summary.json",
2015
  "exists": true,
2016
+ "bytes": 17416,
2017
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2018
  },
2019
  "hf_artifacts": {
2020
  "path": "hf_artifacts:docs/data/two_evidence_line_result_summary.json",
2021
  "exists": true,
2022
+ "bytes": 17416,
2023
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2024
  },
2025
  "hf_model_data": {
2026
  "path": "hf_model:data/two_evidence_line_result_summary.json",
2027
  "exists": true,
2028
+ "bytes": 17416,
2029
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2030
  },
2031
  "hf_model_docs_data": {
2032
  "path": "hf_model:docs/data/two_evidence_line_result_summary.json",
2033
  "exists": true,
2034
+ "bytes": 17416,
2035
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2036
  },
2037
  "hf_model": {
2038
  "path": "hf_model:metrics/two_evidence_line_result_summary.json",
2039
  "exists": true,
2040
+ "bytes": 17416,
2041
+ "sha256": "33f8d9280a6b77f5922b37d32d17949683b27eee362be8cf220a7bda80282408"
2042
  }
2043
  },
2044
  "failures": []
 
2148
  "path": "repo:docs/data/task_surface_integrity.json",
2149
  "exists": true,
2150
  "bytes": 46246,
2151
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2152
  },
2153
  "mirrors": {
2154
  "hf_space": {
2155
  "path": "hf_space:data/task_surface_integrity.json",
2156
  "exists": true,
2157
  "bytes": 46246,
2158
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2159
  },
2160
  "hf_artifacts_data": {
2161
  "path": "hf_artifacts:data/task_surface_integrity.json",
2162
  "exists": true,
2163
  "bytes": 46246,
2164
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2165
  },
2166
  "hf_artifacts": {
2167
  "path": "hf_artifacts:docs/data/task_surface_integrity.json",
2168
  "exists": true,
2169
  "bytes": 46246,
2170
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2171
  },
2172
  "hf_model_data": {
2173
  "path": "hf_model:data/task_surface_integrity.json",
2174
  "exists": true,
2175
  "bytes": 46246,
2176
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2177
  },
2178
  "hf_model_docs_data": {
2179
  "path": "hf_model:docs/data/task_surface_integrity.json",
2180
  "exists": true,
2181
  "bytes": 46246,
2182
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2183
  },
2184
  "hf_model": {
2185
  "path": "hf_model:metrics/task_surface_integrity.json",
2186
  "exists": true,
2187
  "bytes": 46246,
2188
+ "sha256": "e674404f5a8f0a9b92fc2fd7ff3a745e3f42a5fdbad52e17558790d5ecbddd91"
2189
  }
2190
  },
2191
  "failures": []
 
2344
  "path": "repo:docs/data/task_method_20_source_audit.json",
2345
  "exists": true,
2346
  "bytes": 561,
2347
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2348
  },
2349
  "mirrors": {
2350
  "hf_space": {
2351
  "path": "hf_space:data/task_method_20_source_audit.json",
2352
  "exists": true,
2353
  "bytes": 561,
2354
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2355
  },
2356
  "hf_artifacts_data": {
2357
  "path": "hf_artifacts:data/task_method_20_source_audit.json",
2358
  "exists": true,
2359
  "bytes": 561,
2360
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2361
  },
2362
  "hf_artifacts": {
2363
  "path": "hf_artifacts:docs/data/task_method_20_source_audit.json",
2364
  "exists": true,
2365
  "bytes": 561,
2366
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2367
  },
2368
  "hf_model_data": {
2369
  "path": "hf_model:data/task_method_20_source_audit.json",
2370
  "exists": true,
2371
  "bytes": 561,
2372
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2373
  },
2374
  "hf_model_docs_data": {
2375
  "path": "hf_model:docs/data/task_method_20_source_audit.json",
2376
  "exists": true,
2377
  "bytes": 561,
2378
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2379
  },
2380
  "hf_model": {
2381
  "path": "hf_model:metrics/task_method_20_source_audit.json",
2382
  "exists": true,
2383
  "bytes": 561,
2384
+ "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
2385
  }
2386
  },
2387
  "failures": []
 
2491
  "path": "repo:docs/data/website_integrity.json",
2492
  "exists": true,
2493
  "bytes": 20658,
2494
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2495
  },
2496
  "mirrors": {
2497
  "hf_space": {
2498
  "path": "hf_space:data/website_integrity.json",
2499
  "exists": true,
2500
  "bytes": 20658,
2501
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2502
  },
2503
  "hf_artifacts_data": {
2504
  "path": "hf_artifacts:data/website_integrity.json",
2505
  "exists": true,
2506
  "bytes": 20658,
2507
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2508
  },
2509
  "hf_artifacts": {
2510
  "path": "hf_artifacts:docs/data/website_integrity.json",
2511
  "exists": true,
2512
  "bytes": 20658,
2513
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2514
  },
2515
  "hf_model_data": {
2516
  "path": "hf_model:data/website_integrity.json",
2517
  "exists": true,
2518
  "bytes": 20658,
2519
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2520
  },
2521
  "hf_model_docs_data": {
2522
  "path": "hf_model:docs/data/website_integrity.json",
2523
  "exists": true,
2524
  "bytes": 20658,
2525
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2526
  },
2527
  "hf_model": {
2528
  "path": "hf_model:metrics/website_integrity.json",
2529
  "exists": true,
2530
  "bytes": 20658,
2531
+ "sha256": "e1a0b46f5645783ee8430c45d88862fa0ce9fe20c980574d77ddc6abc84fd745"
2532
  }
2533
  },
2534
  "failures": []
 
5503
  "local": {
5504
  "path": "repo:scripts/build_multilingual_public_readmes.py",
5505
  "exists": true,
5506
+ "bytes": 51467,
5507
+ "sha256": "d4fd285ff1da0479f06fb4a15bd123142f8b72d752709bd7be4c3375dfb544d3"
5508
  },
5509
  "mirrors": {
5510
  "hf_artifacts": {
5511
  "path": "hf_artifacts:scripts/build_multilingual_public_readmes.py",
5512
  "exists": true,
5513
+ "bytes": 51467,
5514
+ "sha256": "d4fd285ff1da0479f06fb4a15bd123142f8b72d752709bd7be4c3375dfb544d3"
5515
  },
5516
  "hf_model": {
5517
  "path": "hf_model:scripts/build_multilingual_public_readmes.py",
5518
  "exists": true,
5519
+ "bytes": 51467,
5520
+ "sha256": "d4fd285ff1da0479f06fb4a15bd123142f8b72d752709bd7be4c3375dfb544d3"
5521
  }
5522
  },
5523
  "failures": []
 
6282
  "local": {
6283
  "path": "repo:docs/index.html",
6284
  "exists": true,
6285
+ "bytes": 300335,
6286
+ "sha256": "37a16c8dcb18740c2e0a2c816b3919a3b89a8eb2f59b2549d957a51a2c5c4c19"
6287
  },
6288
  "mirrors": {
6289
  "hf_space": {
6290
  "path": "hf_space:index.html",
6291
  "exists": true,
6292
+ "bytes": 300335,
6293
+ "sha256": "37a16c8dcb18740c2e0a2c816b3919a3b89a8eb2f59b2549d957a51a2c5c4c19"
6294
  },
6295
  "hf_artifacts_root": {
6296
  "path": "hf_artifacts:index.html",
6297
  "exists": true,
6298
+ "bytes": 300335,
6299
+ "sha256": "37a16c8dcb18740c2e0a2c816b3919a3b89a8eb2f59b2549d957a51a2c5c4c19"
6300
  },
6301
  "hf_artifacts_docs": {
6302
  "path": "hf_artifacts:docs/index.html",
6303
  "exists": true,
6304
+ "bytes": 300335,
6305
+ "sha256": "37a16c8dcb18740c2e0a2c816b3919a3b89a8eb2f59b2549d957a51a2c5c4c19"
6306
  },
6307
  "hf_model": {
6308
  "path": "hf_model:index.html",
6309
  "exists": true,
6310
+ "bytes": 300335,
6311
+ "sha256": "37a16c8dcb18740c2e0a2c816b3919a3b89a8eb2f59b2549d957a51a2c5c4c19"
6312
  },
6313
  "hf_model_docs": {
6314
  "path": "hf_model:docs/index.html",
6315
  "exists": true,
6316
+ "bytes": 300335,
6317
+ "sha256": "37a16c8dcb18740c2e0a2c816b3919a3b89a8eb2f59b2549d957a51a2c5c4c19"
6318
  }
6319
  },
6320
  "failures": []
 
31102
  "local": {
31103
  "path": "repo:TWO_EVIDENCE_LINES.md",
31104
  "exists": true,
31105
+ "bytes": 4325,
31106
+ "sha256": "db6768959591dc8501bf53f1c398f0db51027f66e1c3b15ceb25396d94705bfc"
31107
  },
31108
  "mirrors": {
31109
  "hf_space": {
31110
  "path": "hf_space:TWO_EVIDENCE_LINES.md",
31111
  "exists": true,
31112
+ "bytes": 4325,
31113
+ "sha256": "db6768959591dc8501bf53f1c398f0db51027f66e1c3b15ceb25396d94705bfc"
31114
  },
31115
  "hf_artifacts": {
31116
  "path": "hf_artifacts:TWO_EVIDENCE_LINES.md",
31117
  "exists": true,
31118
+ "bytes": 4325,
31119
+ "sha256": "db6768959591dc8501bf53f1c398f0db51027f66e1c3b15ceb25396d94705bfc"
31120
  },
31121
  "hf_model": {
31122
  "path": "hf_model:TWO_EVIDENCE_LINES.md",
31123
  "exists": true,
31124
+ "bytes": 4325,
31125
+ "sha256": "db6768959591dc8501bf53f1c398f0db51027f66e1c3b15ceb25396d94705bfc"
31126
  }
31127
  },
31128
  "failures": []
 
31133
  "local": {
31134
  "path": "repo:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31135
  "exists": true,
31136
+ "bytes": 8726,
31137
+ "sha256": "23a459e2903af252b1e99a8fe5bf63e3e282d7d126f76b54141700ab5fb07ca8"
31138
  },
31139
  "mirrors": {
31140
  "hf_space": {
31141
  "path": "hf_space:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31142
  "exists": true,
31143
+ "bytes": 8726,
31144
+ "sha256": "23a459e2903af252b1e99a8fe5bf63e3e282d7d126f76b54141700ab5fb07ca8"
31145
  },
31146
  "hf_artifacts": {
31147
  "path": "hf_artifacts:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31148
  "exists": true,
31149
+ "bytes": 8726,
31150
+ "sha256": "23a459e2903af252b1e99a8fe5bf63e3e282d7d126f76b54141700ab5fb07ca8"
31151
  },
31152
  "hf_model": {
31153
  "path": "hf_model:TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
31154
  "exists": true,
31155
+ "bytes": 8726,
31156
+ "sha256": "23a459e2903af252b1e99a8fe5bf63e3e282d7d126f76b54141700ab5fb07ca8"
31157
  }
31158
  },
31159
  "failures": []
 
31165
  "path": "repo:QWEN3_OMNI_RUN_LINEAGE.md",
31166
  "exists": true,
31167
  "bytes": 6287,
31168
+ "sha256": "4d0f01d7ea321b8f7d4c75f4ae00b3ba4a454b100b3238d320c00982959e71d0"
31169
  },
31170
  "mirrors": {
31171
  "hf_space": {
31172
  "path": "hf_space:QWEN3_OMNI_RUN_LINEAGE.md",
31173
  "exists": true,
31174
  "bytes": 6287,
31175
+ "sha256": "4d0f01d7ea321b8f7d4c75f4ae00b3ba4a454b100b3238d320c00982959e71d0"
31176
  },
31177
  "hf_artifacts": {
31178
  "path": "hf_artifacts:QWEN3_OMNI_RUN_LINEAGE.md",
31179
  "exists": true,
31180
  "bytes": 6287,
31181
+ "sha256": "4d0f01d7ea321b8f7d4c75f4ae00b3ba4a454b100b3238d320c00982959e71d0"
31182
  },
31183
  "hf_model": {
31184
  "path": "hf_model:QWEN3_OMNI_RUN_LINEAGE.md",
31185
  "exists": true,
31186
  "bytes": 6287,
31187
+ "sha256": "4d0f01d7ea321b8f7d4c75f4ae00b3ba4a454b100b3238d320c00982959e71d0"
31188
  }
31189
  },
31190
  "failures": []
 
31195
  "local": {
31196
  "path": "repo:PROJECT_BRIEF.md",
31197
  "exists": true,
31198
+ "bytes": 4817,
31199
+ "sha256": "4bf611b5ab6959424d8a31ff6ccbaa460bb97496a59b6405830a3878036e49cb"
31200
  },
31201
  "mirrors": {
31202
  "hf_space": {
31203
  "path": "hf_space:PROJECT_BRIEF.md",
31204
  "exists": true,
31205
+ "bytes": 4817,
31206
+ "sha256": "4bf611b5ab6959424d8a31ff6ccbaa460bb97496a59b6405830a3878036e49cb"
31207
  },
31208
  "hf_artifacts": {
31209
  "path": "hf_artifacts:PROJECT_BRIEF.md",
31210
  "exists": true,
31211
+ "bytes": 4817,
31212
+ "sha256": "4bf611b5ab6959424d8a31ff6ccbaa460bb97496a59b6405830a3878036e49cb"
31213
  },
31214
  "hf_model": {
31215
  "path": "hf_model:PROJECT_BRIEF.md",
31216
  "exists": true,
31217
+ "bytes": 4817,
31218
+ "sha256": "4bf611b5ab6959424d8a31ff6ccbaa460bb97496a59b6405830a3878036e49cb"
31219
  }
31220
  },
31221
  "failures": []
 
31226
  "local": {
31227
  "path": "repo:PUBLIC_READER_MAP.md",
31228
  "exists": true,
31229
+ "bytes": 5497,
31230
+ "sha256": "332ddbeddd908794fdd2a7e18ee1c9d657e672844f8f65a305f147ffde065133"
31231
  },
31232
  "mirrors": {
31233
  "hf_space": {
31234
  "path": "hf_space:PUBLIC_READER_MAP.md",
31235
  "exists": true,
31236
+ "bytes": 5497,
31237
+ "sha256": "332ddbeddd908794fdd2a7e18ee1c9d657e672844f8f65a305f147ffde065133"
31238
  },
31239
  "hf_artifacts": {
31240
  "path": "hf_artifacts:PUBLIC_READER_MAP.md",
31241
  "exists": true,
31242
+ "bytes": 5497,
31243
+ "sha256": "332ddbeddd908794fdd2a7e18ee1c9d657e672844f8f65a305f147ffde065133"
31244
  },
31245
  "hf_model": {
31246
  "path": "hf_model:PUBLIC_READER_MAP.md",
31247
  "exists": true,
31248
+ "bytes": 5497,
31249
+ "sha256": "332ddbeddd908794fdd2a7e18ee1c9d657e672844f8f65a305f147ffde065133"
31250
  }
31251
  },
31252
  "failures": []
 
31319
  "local": {
31320
  "path": "repo:PROJECT_STATUS.md",
31321
  "exists": true,
31322
+ "bytes": 15977,
31323
+ "sha256": "6ce8ca736b0ad2f8fef463c62530294e414cfc0a0e4ed501907fbf1536de0aa9"
31324
  },
31325
  "mirrors": {
31326
  "hf_space": {
31327
  "path": "hf_space:PROJECT_STATUS.md",
31328
  "exists": true,
31329
+ "bytes": 15977,
31330
+ "sha256": "6ce8ca736b0ad2f8fef463c62530294e414cfc0a0e4ed501907fbf1536de0aa9"
31331
  },
31332
  "hf_artifacts": {
31333
  "path": "hf_artifacts:PROJECT_STATUS.md",
31334
  "exists": true,
31335
+ "bytes": 15977,
31336
+ "sha256": "6ce8ca736b0ad2f8fef463c62530294e414cfc0a0e4ed501907fbf1536de0aa9"
31337
  },
31338
  "hf_model": {
31339
  "path": "hf_model:PROJECT_STATUS.md",
31340
  "exists": true,
31341
+ "bytes": 15977,
31342
+ "sha256": "6ce8ca736b0ad2f8fef463c62530294e414cfc0a0e4ed501907fbf1536de0aa9"
31343
  }
31344
  },
31345
  "failures": []
metrics/project_status.json CHANGED
@@ -2,7 +2,7 @@
2
  "title": "Ropedia Xperience-10M Task Suite Project Status",
3
  "version": "2026-06-20",
4
  "decision": "public_sample_pipeline_verified_128_enhancement_qwen3_v6_cosmos_comparison",
5
- "research_positioning": "A research-engineering study that makes one public Xperience-10M sample episode inspectable, defines embodied-AI tasks over synchronized modalities, records baseline behavior, aligns simple/NN baselines to the selected 128-episode split, compares verified Qwen3-Omni and Cosmos3 packages as early cross-episode diagnostics, and now records a no-new-episode enhancement pack for pushing the current 128-episode suite harder.",
6
  "scope_boundary": {
7
  "validated_episode_count": 1,
8
  "aligned_frames": 5821,
@@ -259,6 +259,8 @@
259
  "area": "Qwen3-Omni fine-tuning",
260
  "status": "final_verified_diagnostic_result_json_target_met",
261
  "evidence": [
 
 
262
  "docs/data/omni_finetune_verified_result.json",
263
  "docs/data/qwen3_v5_v6_comparison.json",
264
  "results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md",
@@ -268,7 +270,7 @@
268
  "scripts/omni/audit_verified_omni_package.py",
269
  "scripts/omni/analyze_qwen3_omni_errors.py"
270
  ],
271
- "readout": "The selected 96/16/16 episode split now has a current v6 rank64/lr5e-5 public-safe held-out package with 34,269 exported windows, 4,032 test predictions, validation/audit summaries, and a public LoRA adapter repo. JSON validity is 99.90%, meeting the 98% target; transition accuracy is 98.98%, contact accuracy is 81.77%, object micro-F1 is 30.65%, next-action accuracy is 4.31%, and action/subtask metrics remain weak. v6 improves action macro-F1 and contact accuracy versus v5, but v5 remains stronger on JSON validity, subtask, next-action, transition, and object metrics."
272
  },
273
  {
274
  "area": "Cosmos3-Nano future-window package",
 
2
  "title": "Ropedia Xperience-10M Task Suite Project Status",
3
  "version": "2026-06-20",
4
  "decision": "public_sample_pipeline_verified_128_enhancement_qwen3_v6_cosmos_comparison",
5
+ "research_positioning": "A research-engineering study with two public evidence lines: Line 1 makes one public Xperience-10M sample episode inspectable and reproducible as a 20-task lab; Line 2 aligns selected 128-episode baselines with verified Qwen3-Omni v6, Cosmos3-Super, and Cosmos3-Nano diagnostics, then records a no-new-episode enhancement pack for pushing the 128-episode suite harder.",
6
  "scope_boundary": {
7
  "validated_episode_count": 1,
8
  "aligned_frames": 5821,
 
259
  "area": "Qwen3-Omni fine-tuning",
260
  "status": "final_verified_diagnostic_result_json_target_met",
261
  "evidence": [
262
+ "QWEN3_OMNI_RUN_LINEAGE.md",
263
+ "docs/data/qwen3_omni_run_lineage.json",
264
  "docs/data/omni_finetune_verified_result.json",
265
  "docs/data/qwen3_v5_v6_comparison.json",
266
  "results/omni_finetune/QWEN3_V5_V6_COMPARISON_20260614.md",
 
270
  "scripts/omni/audit_verified_omni_package.py",
271
  "scripts/omni/analyze_qwen3_omni_errors.py"
272
  ],
273
+ "readout": "Qwen3-Omni v1-v6 are one selected-128 run lineage, not six project evidence lines. v1-v4 harden the pipeline and record ablations, v5 is the pinned prior multiscale release, and v6 is the current public 20-task Qwen row. The v6 rank64/lr5e-5 public-safe held-out package has 34,269 exported windows, 4,032 test predictions, validation/audit summaries, and a public LoRA adapter repo. JSON validity is 99.90%, meeting the 98% target; transition accuracy is 98.98%, contact accuracy is 81.77%, object micro-F1 is 30.65%, next-action accuracy is 4.31%, and action/subtask metrics remain weak. v6 improves action macro-F1 and contact accuracy versus v5, but v5 remains stronger on JSON validity, subtask, next-action, transition, and object metrics."
274
  },
275
  {
276
  "area": "Cosmos3-Nano future-window package",
metrics/public_reader_map.json CHANGED
@@ -8,6 +8,11 @@
8
  "start_here": "PROJECT_BRIEF.md",
9
  "then_inspect": ["PROJECT_STATUS.md", "RESEARCH_TAKEAWAYS.md"]
10
  },
 
 
 
 
 
11
  {
12
  "reader_goal": "See the visual public dashboard",
13
  "start_here": "GitHub Pages dashboard or Hugging Face Space",
@@ -38,6 +43,11 @@
38
  "start_here": "Homepage radar section",
39
  "then_inspect": ["docs/data/single_episode_task_model_radar.json", "docs/data/episode128_task_model_radar.json"]
40
  },
 
 
 
 
 
41
  {
42
  "reader_goal": "Find all derived artifacts",
43
  "start_here": "ARTIFACT_GUIDE.md",
@@ -101,11 +111,11 @@
101
  "best_use": "Inspecting Qwen3-Omni and Cosmos3 artifacts."
102
  }
103
  ],
104
- "evidence_layers": [
105
  "Dataset/source boundary",
106
  "Data contract",
107
  "Task suite",
108
- "Results",
109
  "Foundation directions",
110
  "Public-release checks"
111
  ],
@@ -116,10 +126,15 @@
116
  "boundary": "Describes one public sample episode, not the full dataset distribution."
117
  },
118
  {
119
- "claim_type": "128-episode baseline comparison",
120
  "public_evidence": ["XPERIENCE10M_128_EPISODE_FEATURE_INDEX.md", "docs/data/xperience10m_128_episode_feature_index.json", "results/omni_finetune/*128*", "docs/data/omni_model_comparison.json"],
121
  "boundary": "Uses selected held-out episodes and derived public-safe summaries; official raw files remain gated upstream."
122
  },
 
 
 
 
 
123
  {
124
  "claim_type": "Foundation-model track quality",
125
  "public_evidence": ["Verified Qwen3-Omni and Cosmos3 result packages", "model cards"],
 
8
  "start_here": "PROJECT_BRIEF.md",
9
  "then_inspect": ["PROJECT_STATUS.md", "RESEARCH_TAKEAWAYS.md"]
10
  },
11
+ {
12
+ "reader_goal": "Understand the two evidence lines",
13
+ "start_here": "TWO_EVIDENCE_LINES.md",
14
+ "then_inspect": ["docs/data/two_evidence_lines.json", "docs/data/two_evidence_line_result_summary.json"]
15
+ },
16
  {
17
  "reader_goal": "See the visual public dashboard",
18
  "start_here": "GitHub Pages dashboard or Hugging Face Space",
 
43
  "start_here": "Homepage radar section",
44
  "then_inspect": ["docs/data/single_episode_task_model_radar.json", "docs/data/episode128_task_model_radar.json"]
45
  },
46
+ {
47
+ "reader_goal": "Read Qwen3-Omni v1-v6 correctly",
48
+ "start_here": "QWEN3_OMNI_RUN_LINEAGE.md",
49
+ "then_inspect": ["docs/data/qwen3_omni_run_lineage.json", "docs/data/qwen3_v5_v6_comparison.json"]
50
+ },
51
  {
52
  "reader_goal": "Find all derived artifacts",
53
  "start_here": "ARTIFACT_GUIDE.md",
 
111
  "best_use": "Inspecting Qwen3-Omni and Cosmos3 artifacts."
112
  }
113
  ],
114
+ "evidence_views": [
115
  "Dataset/source boundary",
116
  "Data contract",
117
  "Task suite",
118
+ "Results by Line 1 and Line 2",
119
  "Foundation directions",
120
  "Public-release checks"
121
  ],
 
126
  "boundary": "Describes one public sample episode, not the full dataset distribution."
127
  },
128
  {
129
+ "claim_type": "128-episode method comparison",
130
  "public_evidence": ["XPERIENCE10M_128_EPISODE_FEATURE_INDEX.md", "docs/data/xperience10m_128_episode_feature_index.json", "results/omni_finetune/*128*", "docs/data/omni_model_comparison.json"],
131
  "boundary": "Uses selected held-out episodes and derived public-safe summaries; official raw files remain gated upstream."
132
  },
133
+ {
134
+ "claim_type": "Qwen3-Omni v1-v6 lineage",
135
+ "public_evidence": ["QWEN3_OMNI_RUN_LINEAGE.md", "docs/data/qwen3_omni_run_lineage.json"],
136
+ "boundary": "v1-v4 are pipeline and ablation evidence, v5 is the pinned prior release, and v6 is the current public 20-task Qwen row."
137
+ },
138
  {
139
  "claim_type": "Foundation-model track quality",
140
  "public_evidence": ["Verified Qwen3-Omni and Cosmos3 result packages", "model cards"],
metrics/public_surface_qa.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Public Project Surface",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T11:08:07+00:00",
5
  "scope": "Repo README, GitHub Pages HTML, Hugging Face Space card, artifact dataset card, and model card.",
6
  "checks": [
7
  {
@@ -18,7 +18,7 @@
18
  "website_integrity": {
19
  "exists": true,
20
  "status": "pass",
21
- "generated_at_utc": "2026-06-21T11:07:26+00:00"
22
  },
23
  "rendered_site_check": {
24
  "exists": true,
@@ -28,27 +28,27 @@
28
  "task_surface_integrity": {
29
  "exists": true,
30
  "status": "pass",
31
- "generated_at_utc": "2026-06-21T11:04:16+00:00"
32
  },
33
  "source_alignment": {
34
  "exists": true,
35
  "status": "pass",
36
- "generated_at_utc": "2026-06-21T11:04:16+00:00"
37
  },
38
  "scale_up_status": {
39
  "exists": true,
40
  "status": "pass",
41
- "generated_at_utc": "2026-06-21T11:03:20+00:00"
42
  },
43
  "publication_package": {
44
  "exists": true,
45
  "status": "pass",
46
- "generated_at_utc": "2026-06-21T11:07:41+00:00"
47
  },
48
  "mirror_parity": {
49
  "exists": true,
50
  "status": "pass",
51
- "generated_at_utc": "2026-06-21T11:05:04+00:00"
52
  }
53
  },
54
  "failures": {}
@@ -98,7 +98,7 @@
98
  "Ropedia Xperience-10M Task Suite": 20,
99
  "Xperience-10M": 166,
100
  "20-task": 89,
101
- "Qwen3-Omni": 241,
102
  "128-episode pilot": 1
103
  }
104
  },
 
1
  {
2
  "title": "Ropedia Xperience-10M Public Project Surface",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T11:49:55+00:00",
5
  "scope": "Repo README, GitHub Pages HTML, Hugging Face Space card, artifact dataset card, and model card.",
6
  "checks": [
7
  {
 
18
  "website_integrity": {
19
  "exists": true,
20
  "status": "pass",
21
+ "generated_at_utc": "2026-06-21T11:49:24+00:00"
22
  },
23
  "rendered_site_check": {
24
  "exists": true,
 
28
  "task_surface_integrity": {
29
  "exists": true,
30
  "status": "pass",
31
+ "generated_at_utc": "2026-06-21T11:08:07+00:00"
32
  },
33
  "source_alignment": {
34
  "exists": true,
35
  "status": "pass",
36
+ "generated_at_utc": "2026-06-21T11:49:23+00:00"
37
  },
38
  "scale_up_status": {
39
  "exists": true,
40
  "status": "pass",
41
+ "generated_at_utc": "2026-06-21T11:08:09+00:00"
42
  },
43
  "publication_package": {
44
  "exists": true,
45
  "status": "pass",
46
+ "generated_at_utc": "2026-06-21T11:49:39+00:00"
47
  },
48
  "mirror_parity": {
49
  "exists": true,
50
  "status": "pass",
51
+ "generated_at_utc": "2026-06-21T11:28:34+00:00"
52
  }
53
  },
54
  "failures": {}
 
98
  "Ropedia Xperience-10M Task Suite": 20,
99
  "Xperience-10M": 166,
100
  "20-task": 89,
101
+ "Qwen3-Omni": 242,
102
  "128-episode pilot": 1
103
  }
104
  },
metrics/publication_audit.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T11:07:41+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
@@ -255,7 +255,7 @@
255
  "hf_artifact_bundle": {
256
  "root": "hf_publish/artifacts",
257
  "exists": true,
258
- "file_count": 3049,
259
  "text_file_count": 1283,
260
  "largest_file": {
261
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
@@ -266,7 +266,7 @@
266
  "hf_model_bundle": {
267
  "root": "hf_publish/model",
268
  "exists": true,
269
- "file_count": 3533,
270
  "text_file_count": 1455,
271
  "largest_file": {
272
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T11:49:39+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
 
255
  "hf_artifact_bundle": {
256
  "root": "hf_publish/artifacts",
257
  "exists": true,
258
+ "file_count": 4519,
259
  "text_file_count": 1283,
260
  "largest_file": {
261
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
 
266
  "hf_model_bundle": {
267
  "root": "hf_publish/model",
268
  "exists": true,
269
+ "file_count": 5281,
270
  "text_file_count": 1455,
271
  "largest_file": {
272
  "path": "results/omni_finetune/xperience10m_128ep_dense_multiscale_hierarchical_v1_20260608/dense_multiscale_windows.jsonl",
metrics/quality_gates.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T11:28:42+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
 
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T11:52:46+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
metrics/qwen3_omni_run_lineage.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "current_public_matrix_row": "qwen3_omni_v6_lora",
3
- "generated_at_utc": "2026-06-21T10:54:46+00:00",
4
  "interpretation_rule": "Do not confuse the Qwen run versions with the project evidence lines. The project evidence lines are one public sample episode and selected 128-episode artifacts. Qwen v1-v6 are only the Qwen3-Omni run lineage inside the selected-128 line. The 20-task matrix uses Qwen3-Omni v6 LoRA; v5 remains the pinned prior release; v1-v4 are lineage and ablation evidence.",
5
  "pinned_prior_release": "v5",
6
  "related_engineering_artifacts": [
 
1
  {
2
  "current_public_matrix_row": "qwen3_omni_v6_lora",
3
+ "generated_at_utc": "2026-06-21T11:47:45+00:00",
4
  "interpretation_rule": "Do not confuse the Qwen run versions with the project evidence lines. The project evidence lines are one public sample episode and selected 128-episode artifacts. Qwen v1-v6 are only the Qwen3-Omni run lineage inside the selected-128 line. The 20-task matrix uses Qwen3-Omni v6 LoRA; v5 remains the pinned prior release; v1-v4 are lineage and ablation evidence.",
5
  "pinned_prior_release": "v5",
6
  "related_engineering_artifacts": [
metrics/scope_claims_audit.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T11:08:09+00:00",
4
  "summary": {
5
  "qwen3_omni_verified_diagnostic_pilot": true,
6
  "dataset_manifest_num_episodes": 119,
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T11:49:59+00:00",
4
  "summary": {
5
  "qwen3_omni_verified_diagnostic_pilot": true,
6
  "dataset_manifest_num_episodes": 119,
metrics/source_alignment_audit.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T11:08:07+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
 
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T11:49:23+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
metrics/task_method_20_source_audit.json CHANGED
@@ -2,7 +2,7 @@
2
  "checked_json_metric_count": 180,
3
  "failure_count": 0,
4
  "failures": [],
5
- "generated_at_utc": "2026-06-21T11:07:42+00:00",
6
  "method_task_record_count": 180,
7
  "rule": "Every scored row that declares a JSON metric source must have the same numeric value under that row's metric_key.",
8
  "scored_method_task_count": 180,
 
2
  "checked_json_metric_count": 180,
3
  "failure_count": 0,
4
  "failures": [],
5
+ "generated_at_utc": "2026-06-21T11:49:23+00:00",
6
  "method_task_record_count": 180,
7
  "rule": "Every scored row that declares a JSON metric source must have the same numeric value under that row's metric_key.",
8
  "scored_method_task_count": 180,
metrics/task_surface_integrity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T11:08:07+00:00",
4
  "summary": {
5
  "original_walkthrough_task_count": 12,
6
  "expected_original_walkthrough_task_count": 12,
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T11:49:55+00:00",
4
  "summary": {
5
  "original_walkthrough_task_count": 12,
6
  "expected_original_walkthrough_task_count": 12,
metrics/two_evidence_line_result_summary.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "generated_at_utc": "2026-06-21T10:47:04+00:00",
3
  "interpretation_rule": "Use the 1-episode line for task construction and reproducibility claims. Use the 128-episode line for same-split metadata/raw baselines, Qwen3-Omni v6 LoRA diagnostics, and Cosmos3 diagnostics.",
4
  "lines": [
5
  {
@@ -337,7 +337,7 @@
337
  "selected_128_episode_surface": "Use for held-out comparison, metadata/raw-feature baselines, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, Cosmos3-Nano Future Window, and scale-up decisions.",
338
  "single_public_sample_episode": "Use for task construction, raw-file inspection, local reproducibility, and controlled Minimal-vs-Neural baseline behavior."
339
  },
340
- "reader_summary": "The suite has two public result lines. Line 1 is the fully inspectable one-episode task lab. Line 2 is the 128-episode comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Do not mix the two when reading scores.",
341
  "reading_order": [
342
  {
343
  "reason": "Line 1 answers task-lab and reproducibility questions; line 2 answers selected-128 comparison questions.",
 
1
  {
2
+ "generated_at_utc": "2026-06-21T11:49:06+00:00",
3
  "interpretation_rule": "Use the 1-episode line for task construction and reproducibility claims. Use the 128-episode line for same-split metadata/raw baselines, Qwen3-Omni v6 LoRA diagnostics, and Cosmos3 diagnostics.",
4
  "lines": [
5
  {
 
337
  "selected_128_episode_surface": "Use for held-out comparison, metadata/raw-feature baselines, Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, Cosmos3-Nano Future Window, and scale-up decisions.",
338
  "single_public_sample_episode": "Use for task construction, raw-file inspection, local reproducibility, and controlled Minimal-vs-Neural baseline behavior."
339
  },
340
+ "reader_summary": "The suite has two public evidence lines. Line 1 is the fully inspectable one-episode task lab. Line 2 is the 128-episode comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Do not mix the two when reading scores.",
341
  "reading_order": [
342
  {
343
  "reason": "Line 1 answers task-lab and reproducibility questions; line 2 answers selected-128 comparison questions.",
metrics/two_evidence_lines.json CHANGED
@@ -2,7 +2,7 @@
2
  "status": "current",
3
  "updated_utc": "2026-06-21T00:00:00Z",
4
  "interpretation_rule": "Use the 1-episode line for task construction and reproducibility claims. Use the 128-episode line for same-split metadata/raw baselines, Qwen3-Omni v6 LoRA diagnostics, and Cosmos3 diagnostics.",
5
- "reader_summary": "The suite has two public result lines. Line 1 is the fully inspectable one-episode task lab. Line 2 is the 128-episode comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Do not mix the two when reading scores.",
6
  "score_formula": "2 single-episode methods x 20 tasks = 40 records; 7 selected-128 methods x 20 tasks = 140 records; total public matrix = 180/180 scored records.",
7
  "lines": [
8
  {
 
2
  "status": "current",
3
  "updated_utc": "2026-06-21T00:00:00Z",
4
  "interpretation_rule": "Use the 1-episode line for task construction and reproducibility claims. Use the 128-episode line for same-split metadata/raw baselines, Qwen3-Omni v6 LoRA diagnostics, and Cosmos3 diagnostics.",
5
+ "reader_summary": "The suite has two public evidence lines. Line 1 is the fully inspectable one-episode task lab. Line 2 is the 128-episode comparison surface for aligned baselines, the Qwen3-Omni series, and the Cosmos3 series. Do not mix the two when reading scores.",
6
  "score_formula": "2 single-episode methods x 20 tasks = 40 records; 7 selected-128 methods x 20 tasks = 140 records; total public matrix = 180/180 scored records.",
7
  "lines": [
8
  {
metrics/website_integrity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T11:07:26+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
@@ -80,8 +80,8 @@
80
  "name": "project_overview_precedes_progress_ledger",
81
  "status": "pass",
82
  "reason": "The project overview should appear before the deeper progress ledger.",
83
- "overview_index": 118524,
84
- "evidence_index": 163802
85
  },
86
  {
87
  "name": "project_status_links_json",
@@ -159,9 +159,9 @@
159
  "name": "evaluation_protocol_between_overview_and_progress",
160
  "status": "pass",
161
  "reason": "The evaluation protocol should appear before the deeper evidence ledger.",
162
- "overview_index": 118524,
163
- "protocol_index": 159990,
164
- "evidence_index": 163802
165
  },
166
  {
167
  "name": "evaluation_protocol_links_json",
@@ -386,12 +386,12 @@
386
  },
387
  {
388
  "path": "data/project_status.json",
389
- "bytes": 23054,
390
  "top_level_type": "dict"
391
  },
392
  {
393
  "path": "data/public_reader_map.json",
394
- "bytes": 5990,
395
  "top_level_type": "dict"
396
  },
397
  {
@@ -536,12 +536,12 @@
536
  },
537
  {
538
  "path": "data/two_evidence_line_result_summary.json",
539
- "bytes": 17414,
540
  "top_level_type": "dict"
541
  },
542
  {
543
  "path": "data/two_evidence_lines.json",
544
- "bytes": 7349,
545
  "top_level_type": "dict"
546
  },
547
  {
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T11:49:24+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
 
80
  "name": "project_overview_precedes_progress_ledger",
81
  "status": "pass",
82
  "reason": "The project overview should appear before the deeper progress ledger.",
83
+ "overview_index": 118530,
84
+ "evidence_index": 163804
85
  },
86
  {
87
  "name": "project_status_links_json",
 
159
  "name": "evaluation_protocol_between_overview_and_progress",
160
  "status": "pass",
161
  "reason": "The evaluation protocol should appear before the deeper evidence ledger.",
162
+ "overview_index": 118530,
163
+ "protocol_index": 159992,
164
+ "evidence_index": 163804
165
  },
166
  {
167
  "name": "evaluation_protocol_links_json",
 
386
  },
387
  {
388
  "path": "data/project_status.json",
389
+ "bytes": 23255,
390
  "top_level_type": "dict"
391
  },
392
  {
393
  "path": "data/public_reader_map.json",
394
+ "bytes": 6757,
395
  "top_level_type": "dict"
396
  },
397
  {
 
536
  },
537
  {
538
  "path": "data/two_evidence_line_result_summary.json",
539
+ "bytes": 17416,
540
  "top_level_type": "dict"
541
  },
542
  {
543
  "path": "data/two_evidence_lines.json",
544
+ "bytes": 7351,
545
  "top_level_type": "dict"
546
  },
547
  {