cy0307 commited on
Commit
47429ce
·
verified ·
1 Parent(s): 176f74a

Add files using upload-large-folder tool

Browse files
Files changed (47) hide show
  1. FIGURE_INDEX.md +3 -3
  2. PROJECT_README.md +28 -29
  3. README.md +28 -29
  4. TASK_METHOD_20_RESULT_MATRIX.md +1 -1
  5. assets/charts/episode128_task_model_radar.svg +0 -0
  6. assets/charts/single_episode_task_model_radar.svg +231 -228
  7. assets/charts/unified_task_model_radar.svg +0 -0
  8. data/figure_index.json +15 -15
  9. data/mirror_parity.json +210 -210
  10. data/publication_audit.json +1 -1
  11. data/unified_task_model_radar.json +56 -13
  12. docs/assets/charts/episode128_task_model_radar.svg +0 -0
  13. docs/assets/charts/single_episode_task_model_radar.svg +231 -228
  14. docs/assets/charts/unified_task_model_radar.svg +0 -0
  15. docs/data/artifact_index.json +62 -62
  16. docs/data/episode128_task_model_radar.json +46 -11
  17. docs/data/figure_index.json +15 -15
  18. docs/data/mirror_parity.json +210 -210
  19. docs/data/public_surface_qa.json +9 -9
  20. docs/data/publication_audit.json +1 -1
  21. docs/data/quality_gates.json +1 -1
  22. docs/data/scope_claims_audit.json +1 -1
  23. docs/data/single_episode_task_model_radar.json +24 -6
  24. docs/data/source_alignment_audit.json +1 -1
  25. docs/data/task_method_20_result_matrix.json +10 -10
  26. docs/data/task_surface_integrity.json +1 -1
  27. docs/data/unified_task_model_radar.json +56 -13
  28. docs/data/website_integrity.json +16 -16
  29. docs/index.html +17 -17
  30. index.html +17 -17
  31. metrics/artifact_index.json +62 -62
  32. metrics/episode128_task_model_radar.json +46 -11
  33. metrics/figure_index.json +15 -15
  34. metrics/mirror_parity.json +210 -210
  35. metrics/public_surface_qa.json +9 -9
  36. metrics/publication_audit.json +1 -1
  37. metrics/quality_gates.json +1 -1
  38. metrics/scope_claims_audit.json +1 -1
  39. metrics/single_episode_task_model_radar.json +24 -6
  40. metrics/source_alignment_audit.json +1 -1
  41. metrics/task_method_20_result_matrix.json +10 -10
  42. metrics/task_surface_integrity.json +1 -1
  43. metrics/unified_task_model_radar.json +56 -13
  44. metrics/website_integrity.json +16 -16
  45. scripts/build_artifact_index.py +5 -5
  46. scripts/build_figure_index.py +2 -2
  47. scripts/build_unified_task_model_radar.py +401 -118
FIGURE_INDEX.md CHANGED
@@ -34,9 +34,9 @@ Public figures, diagrams, charts, and derived modality thumbnails. Raw Xperience
34
  | Research direction coverage chart | `docs/assets/charts/research_direction_coverage.svg` | 1180 x 700 | `scripts/generate_visualizations.py` | Four-track coverage map for Ropedia research directions. |
35
  | Research direction extension chart | `docs/assets/charts/research_direction_extension_tasks.svg` | 1420 x 920 | `scripts/generate_visualizations.py` | Four coded extension probes, one per Ropedia research direction. |
36
  | Unified 20-task provenance chart | `docs/assets/charts/tier2_task_suite.svg` | 1440 x 832 | `scripts/tier2_task_suite.py` | Historical provenance rows inside the unified 20-task suite with aligned minimal and neural baseline metrics. |
37
- | Unified 20-task model radar | `docs/assets/charts/unified_task_model_radar.svg` | 2400 x 1840 | `scripts/build_unified_task_model_radar.py` | Twenty-axis direction-aware comparison of minimal and neural MLP baselines, with 128-episode metadata, Qwen3, and Cosmos task-aligned overlay points and branch notes. |
38
- | Single-episode 20-task model radar | `docs/assets/charts/single_episode_task_model_radar.svg` | 2400 x 1840 | `scripts/build_unified_task_model_radar.py` | Twenty-axis split radar for the one public-sample episode, comparing Minimal and Neural MLP as two complete 20/20 scored polygons. |
39
- | 128-episode 20-task model radar | `docs/assets/charts/episode128_task_model_radar.svg` | 2400 x 1840 | `scripts/build_unified_task_model_radar.py` | Twenty-axis split radar for selected 128-episode methods: raw-feature simple/NN as complete scored polygons plus metadata, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano task-aligned overlays. |
40
  | Feature block chart | `docs/assets/charts/feature_blocks.svg` | 1100 x 760 | `scripts/generate_visualizations.py` | Feature allocation by modality block. |
41
  | Minimal task score chart | `docs/assets/charts/episode_task_scores.svg` | 1100 x 556 | `scripts/generate_visualizations.py` | Minimal baseline metric snapshot across the task suite. |
42
  | Cross-modal retrieval chart | `docs/assets/charts/cross_modal_retrieval.svg` | 1100 x 284 | `scripts/generate_visualizations.py` | Retrieval behavior chart for the cross-modal task. |
 
34
  | Research direction coverage chart | `docs/assets/charts/research_direction_coverage.svg` | 1180 x 700 | `scripts/generate_visualizations.py` | Four-track coverage map for Ropedia research directions. |
35
  | Research direction extension chart | `docs/assets/charts/research_direction_extension_tasks.svg` | 1420 x 920 | `scripts/generate_visualizations.py` | Four coded extension probes, one per Ropedia research direction. |
36
  | Unified 20-task provenance chart | `docs/assets/charts/tier2_task_suite.svg` | 1440 x 832 | `scripts/tier2_task_suite.py` | Historical provenance rows inside the unified 20-task suite with aligned minimal and neural baseline metrics. |
37
+ | Unified 20-task model radar | `docs/assets/charts/unified_task_model_radar.svg` | 2400 x 1900 | `scripts/build_unified_task_model_radar.py` | Grouped small-multiple 20-task radar board for all nine methods, separating single-episode, 128-episode metadata/text, 128-episode raw-feature, and foundation-model rows while preserving task keys and proxy notes. |
38
+ | Single-episode 20-task model radar | `docs/assets/charts/single_episode_task_model_radar.svg` | 2400 x 1900 | `scripts/build_unified_task_model_radar.py` | Twenty-axis split radar for the one public-sample episode, comparing Minimal and Neural MLP as two complete 20/20 scored polygons. |
39
+ | 128-episode 20-task model radar | `docs/assets/charts/episode128_task_model_radar.svg` | 2400 x 1900 | `scripts/build_unified_task_model_radar.py` | Grouped 20-task radar for selected 128-episode methods: metadata/text baselines, raw-feature simple/NN, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano with local legends and proxy notes. |
40
  | Feature block chart | `docs/assets/charts/feature_blocks.svg` | 1100 x 760 | `scripts/generate_visualizations.py` | Feature allocation by modality block. |
41
  | Minimal task score chart | `docs/assets/charts/episode_task_scores.svg` | 1100 x 556 | `scripts/generate_visualizations.py` | Minimal baseline metric snapshot across the task suite. |
42
  | Cross-modal retrieval chart | `docs/assets/charts/cross_modal_retrieval.svg` | 1100 x 284 | `scripts/generate_visualizations.py` | Retrieval behavior chart for the cross-modal task. |
PROJECT_README.md CHANGED
@@ -788,22 +788,21 @@ suite.
788
 
789
  ![Unified 20-task model radar](docs/assets/charts/unified_task_model_radar.svg)
790
 
791
- The unified radar compares all 20 task axes with two filled colors for the
792
- minimal and neural MLP baselines. Every method now has 20 explicit result
793
- records in the public matrix; numeric points appear only where the runner or
794
- verified package produced that task target. The 128-episode raw-feature
795
- simple/NN overlays are plotted on all 20 axes backed by the exported
796
- 4430-dimensional sensor NPZ blocks. Tasks 15 and 19 are marked as compact-proxy
797
- completions because the 128 export lacks raw interaction strings and paired
798
- video-view embeddings. The verified model-output probe package adds task-16
799
- action/object relation scores for Qwen3-Omni and Cosmos3-Super, plus a task-13
800
- long-horizon next-action score for Cosmos3-Nano derived from its existing
801
- held-out future-window predictions. Metadata-only baselines and model diagnostics
802
- now have scored records on all 20 axes; six compact-proxy scores stay
803
- explicitly marked instead of being blended into direct-target metrics.
804
  Cosmos3-Super forward-dynamics LoRA
805
- remains a separate artifact card because its camera-pose proxy MSE is not one of the 20
806
- task metrics. The machine-readable copies are
 
807
  [`docs/data/unified_task_model_radar.json`](docs/data/unified_task_model_radar.json)
808
  and
809
  [`docs/data/task_method_20_result_matrix.json`](docs/data/task_method_20_result_matrix.json);
@@ -822,13 +821,13 @@ For easier reading, the same source data is also split into two focused radars:
822
 
823
  ![128-episode 20-task model radar](docs/assets/charts/episode128_task_model_radar.svg)
824
 
825
- The single-episode radar isolates Minimal vs Neural MLP, both with 20/20 scored
826
- public-sample axes. The 128-episode radar isolates metadata/raw baselines,
827
- Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future Window:
828
- metadata and raw-feature simple/NN baselines are now complete 20/20
829
- multi-episode records, with documented compact proxy notes where the public
830
- export lacks the original raw target. The current matrix has 180/180 scored
831
- method-task records.
832
 
833
  The website raw sample browser includes a concise stream-to-feature ledger
834
  backed by [`docs/data/modality_atlas.json`](docs/data/modality_atlas.json) and
@@ -865,7 +864,7 @@ scripts/
865
  research_direction_extension_tasks.py # one extra data-backed probe per track
866
  tier2_task_suite.py # historical-name provenance builder for unified task rows
867
  build_unified_task_suite.py # builds TASK_SUITE_20.md and task_suite_20.json
868
- build_unified_task_model_radar.py # builds the unified 20-axis model comparison chart
869
  build_task_method_20_gap_audit.py # builds the explicit 180/180 scored-cell ledger
870
  task_walkthroughs.py # human-readable task-card and walkthrough-storyboard metadata
871
  generate_visualizations.py # refreshes SVG charts + summary JSON
@@ -908,9 +907,9 @@ docs/
908
  data/additional_development_directions.json # concrete non-backbone project directions
909
  data/summary_metrics.json # website-readable metrics bundle
910
  data/task_suite_20.json # unified 20-task suite bundle
911
- data/unified_task_model_radar.json # 20-task radar values and method overlays
912
- data/single_episode_task_model_radar.json # 1-episode split radar values
913
- data/episode128_task_model_radar.json # 128-episode split radar values
914
  data/task_method_20_result_matrix.json # 9-method x 20-task result matrix
915
  data/task_method_20_gap_audit.json # explicit 180/180 scored-cell ledger
916
  data/evidence_contract.json # machine-readable project scope
@@ -933,9 +932,9 @@ docs/
933
  assets/pipeline_diagram.png # verified episode pipeline graphic
934
  assets/qwen3_omni_lora_pipeline.png # Qwen3-Omni LoRA training-flow figure
935
  assets/task_architectures.png # verified task-head architecture map
936
- assets/charts/unified_task_model_radar.svg # 20-task minimal/NN/Qwen3-Omni/Cosmos3 radar
937
- assets/charts/single_episode_task_model_radar.svg # 1-episode split radar
938
- assets/charts/episode128_task_model_radar.svg # 128-episode split radar
939
  assets/charts/*.svg # regenerated visualizations
940
 
941
  notes/
 
788
 
789
  ![Unified 20-task model radar](docs/assets/charts/unified_task_model_radar.svg)
790
 
791
+ The unified radar is now a grouped small-multiple comparison board instead of a
792
+ nine-method overlay. It keeps all 20 task axes and all 9 method rows visible,
793
+ but separates the methods into single-episode, 128-episode metadata/text,
794
+ 128-episode raw-feature, and foundation-model panels. Every method has 20
795
+ explicit result records in the public matrix. Tasks 15 and 19 are marked as
796
+ compact-proxy completions where the 128 export lacks raw interaction strings or
797
+ paired video-view embeddings; those six proxy cells stay explicitly marked
798
+ instead of being blended into direct-target metrics. The SVG uses
799
+ `sqrt(normalized_score)` only for visual radius so small but real differences
800
+ are readable; raw metrics and exact linear normalized scores remain in JSON and
801
+ the table.
 
 
802
  Cosmos3-Super forward-dynamics LoRA
803
+ remains a separate artifact card because its camera-pose proxy MSE is not one of
804
+ the 20 task metrics.
805
+ The machine-readable copies are
806
  [`docs/data/unified_task_model_radar.json`](docs/data/unified_task_model_radar.json)
807
  and
808
  [`docs/data/task_method_20_result_matrix.json`](docs/data/task_method_20_result_matrix.json);
 
821
 
822
  ![128-episode 20-task model radar](docs/assets/charts/episode128_task_model_radar.svg)
823
 
824
+ The single-episode radar uses one enlarged panel for Minimal vs Neural MLP, both
825
+ with 20/20 scored public-sample axes. The 128-episode radar uses three grouped
826
+ panels for metadata/text baselines, raw-feature baselines, and foundation-model
827
+ rows: metadata and raw-feature simple/NN baselines are now complete 20/20
828
+ multi-episode records, and Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, and
829
+ Cosmos3-Nano Future Window each carry 20 scored task records. The current matrix
830
+ has 180/180 scored method-task records.
831
 
832
  The website raw sample browser includes a concise stream-to-feature ledger
833
  backed by [`docs/data/modality_atlas.json`](docs/data/modality_atlas.json) and
 
864
  research_direction_extension_tasks.py # one extra data-backed probe per track
865
  tier2_task_suite.py # historical-name provenance builder for unified task rows
866
  build_unified_task_suite.py # builds TASK_SUITE_20.md and task_suite_20.json
867
+ build_unified_task_model_radar.py # builds grouped 20-axis model comparison radars
868
  build_task_method_20_gap_audit.py # builds the explicit 180/180 scored-cell ledger
869
  task_walkthroughs.py # human-readable task-card and walkthrough-storyboard metadata
870
  generate_visualizations.py # refreshes SVG charts + summary JSON
 
907
  data/additional_development_directions.json # concrete non-backbone project directions
908
  data/summary_metrics.json # website-readable metrics bundle
909
  data/task_suite_20.json # unified 20-task suite bundle
910
+ data/unified_task_model_radar.json # 20-task radar values, groups, and sources
911
+ data/single_episode_task_model_radar.json # 1-episode grouped radar values
912
+ data/episode128_task_model_radar.json # 128-episode grouped radar values
913
  data/task_method_20_result_matrix.json # 9-method x 20-task result matrix
914
  data/task_method_20_gap_audit.json # explicit 180/180 scored-cell ledger
915
  data/evidence_contract.json # machine-readable project scope
 
932
  assets/pipeline_diagram.png # verified episode pipeline graphic
933
  assets/qwen3_omni_lora_pipeline.png # Qwen3-Omni LoRA training-flow figure
934
  assets/task_architectures.png # verified task-head architecture map
935
+ assets/charts/unified_task_model_radar.svg # 9-method grouped small-multiple radar board
936
+ assets/charts/single_episode_task_model_radar.svg # 1-episode enlarged radar panel
937
+ assets/charts/episode128_task_model_radar.svg # 128-episode grouped radar panels
938
  assets/charts/*.svg # regenerated visualizations
939
 
940
  notes/
README.md CHANGED
@@ -810,22 +810,21 @@ suite.
810
 
811
  ![Unified 20-task model radar](docs/assets/charts/unified_task_model_radar.svg)
812
 
813
- The unified radar compares all 20 task axes with two filled colors for the
814
- minimal and neural MLP baselines. Every method now has 20 explicit result
815
- records in the public matrix; numeric points appear only where the runner or
816
- verified package produced that task target. The 128-episode raw-feature
817
- simple/NN overlays are plotted on all 20 axes backed by the exported
818
- 4430-dimensional sensor NPZ blocks. Tasks 15 and 19 are marked as compact-proxy
819
- completions because the 128 export lacks raw interaction strings and paired
820
- video-view embeddings. The verified model-output probe package adds task-16
821
- action/object relation scores for Qwen3-Omni and Cosmos3-Super, plus a task-13
822
- long-horizon next-action score for Cosmos3-Nano derived from its existing
823
- held-out future-window predictions. Metadata-only baselines and model diagnostics
824
- now have scored records on all 20 axes; six compact-proxy scores stay
825
- explicitly marked instead of being blended into direct-target metrics.
826
  Cosmos3-Super forward-dynamics LoRA
827
- remains a separate artifact card because its camera-pose proxy MSE is not one of the 20
828
- task metrics. The machine-readable copies are
 
829
  [`docs/data/unified_task_model_radar.json`](docs/data/unified_task_model_radar.json)
830
  and
831
  [`docs/data/task_method_20_result_matrix.json`](docs/data/task_method_20_result_matrix.json);
@@ -844,13 +843,13 @@ For easier reading, the same source data is also split into two focused radars:
844
 
845
  ![128-episode 20-task model radar](docs/assets/charts/episode128_task_model_radar.svg)
846
 
847
- The single-episode radar isolates Minimal vs Neural MLP, both with 20/20 scored
848
- public-sample axes. The 128-episode radar isolates metadata/raw baselines,
849
- Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, and Cosmos3-Nano Future Window:
850
- metadata and raw-feature simple/NN baselines are now complete 20/20
851
- multi-episode records, with documented compact proxy notes where the public
852
- export lacks the original raw target. The current matrix has 180/180 scored
853
- method-task records.
854
 
855
  The website raw sample browser includes a concise stream-to-feature ledger
856
  backed by [`docs/data/modality_atlas.json`](docs/data/modality_atlas.json) and
@@ -887,7 +886,7 @@ scripts/
887
  research_direction_extension_tasks.py # one extra data-backed probe per track
888
  tier2_task_suite.py # historical-name provenance builder for unified task rows
889
  build_unified_task_suite.py # builds TASK_SUITE_20.md and task_suite_20.json
890
- build_unified_task_model_radar.py # builds the unified 20-axis model comparison chart
891
  build_task_method_20_gap_audit.py # builds the explicit 180/180 scored-cell ledger
892
  task_walkthroughs.py # human-readable task-card and walkthrough-storyboard metadata
893
  generate_visualizations.py # refreshes SVG charts + summary JSON
@@ -930,9 +929,9 @@ docs/
930
  data/additional_development_directions.json # concrete non-backbone project directions
931
  data/summary_metrics.json # website-readable metrics bundle
932
  data/task_suite_20.json # unified 20-task suite bundle
933
- data/unified_task_model_radar.json # 20-task radar values and method overlays
934
- data/single_episode_task_model_radar.json # 1-episode split radar values
935
- data/episode128_task_model_radar.json # 128-episode split radar values
936
  data/task_method_20_result_matrix.json # 9-method x 20-task result matrix
937
  data/task_method_20_gap_audit.json # explicit 180/180 scored-cell ledger
938
  data/evidence_contract.json # machine-readable project scope
@@ -955,9 +954,9 @@ docs/
955
  assets/pipeline_diagram.png # verified episode pipeline graphic
956
  assets/qwen3_omni_lora_pipeline.png # Qwen3-Omni LoRA training-flow figure
957
  assets/task_architectures.png # verified task-head architecture map
958
- assets/charts/unified_task_model_radar.svg # 20-task minimal/NN/Qwen3-Omni/Cosmos3 radar
959
- assets/charts/single_episode_task_model_radar.svg # 1-episode split radar
960
- assets/charts/episode128_task_model_radar.svg # 128-episode split radar
961
  assets/charts/*.svg # regenerated visualizations
962
 
963
  notes/
 
810
 
811
  ![Unified 20-task model radar](docs/assets/charts/unified_task_model_radar.svg)
812
 
813
+ The unified radar is now a grouped small-multiple comparison board instead of a
814
+ nine-method overlay. It keeps all 20 task axes and all 9 method rows visible,
815
+ but separates the methods into single-episode, 128-episode metadata/text,
816
+ 128-episode raw-feature, and foundation-model panels. Every method has 20
817
+ explicit result records in the public matrix. Tasks 15 and 19 are marked as
818
+ compact-proxy completions where the 128 export lacks raw interaction strings or
819
+ paired video-view embeddings; those six proxy cells stay explicitly marked
820
+ instead of being blended into direct-target metrics. The SVG uses
821
+ `sqrt(normalized_score)` only for visual radius so small but real differences
822
+ are readable; raw metrics and exact linear normalized scores remain in JSON and
823
+ the table.
 
 
824
  Cosmos3-Super forward-dynamics LoRA
825
+ remains a separate artifact card because its camera-pose proxy MSE is not one of
826
+ the 20 task metrics.
827
+ The machine-readable copies are
828
  [`docs/data/unified_task_model_radar.json`](docs/data/unified_task_model_radar.json)
829
  and
830
  [`docs/data/task_method_20_result_matrix.json`](docs/data/task_method_20_result_matrix.json);
 
843
 
844
  ![128-episode 20-task model radar](docs/assets/charts/episode128_task_model_radar.svg)
845
 
846
+ The single-episode radar uses one enlarged panel for Minimal vs Neural MLP, both
847
+ with 20/20 scored public-sample axes. The 128-episode radar uses three grouped
848
+ panels for metadata/text baselines, raw-feature baselines, and foundation-model
849
+ rows: metadata and raw-feature simple/NN baselines are now complete 20/20
850
+ multi-episode records, and Qwen3-Omni v6 LoRA, Cosmos3-Super Reasoner, and
851
+ Cosmos3-Nano Future Window each carry 20 scored task records. The current matrix
852
+ has 180/180 scored method-task records.
853
 
854
  The website raw sample browser includes a concise stream-to-feature ledger
855
  backed by [`docs/data/modality_atlas.json`](docs/data/modality_atlas.json) and
 
886
  research_direction_extension_tasks.py # one extra data-backed probe per track
887
  tier2_task_suite.py # historical-name provenance builder for unified task rows
888
  build_unified_task_suite.py # builds TASK_SUITE_20.md and task_suite_20.json
889
+ build_unified_task_model_radar.py # builds grouped 20-axis model comparison radars
890
  build_task_method_20_gap_audit.py # builds the explicit 180/180 scored-cell ledger
891
  task_walkthroughs.py # human-readable task-card and walkthrough-storyboard metadata
892
  generate_visualizations.py # refreshes SVG charts + summary JSON
 
929
  data/additional_development_directions.json # concrete non-backbone project directions
930
  data/summary_metrics.json # website-readable metrics bundle
931
  data/task_suite_20.json # unified 20-task suite bundle
932
+ data/unified_task_model_radar.json # 20-task radar values, groups, and sources
933
+ data/single_episode_task_model_radar.json # 1-episode grouped radar values
934
+ data/episode128_task_model_radar.json # 128-episode grouped radar values
935
  data/task_method_20_result_matrix.json # 9-method x 20-task result matrix
936
  data/task_method_20_gap_audit.json # explicit 180/180 scored-cell ledger
937
  data/evidence_contract.json # machine-readable project scope
 
954
  assets/pipeline_diagram.png # verified episode pipeline graphic
955
  assets/qwen3_omni_lora_pipeline.png # Qwen3-Omni LoRA training-flow figure
956
  assets/task_architectures.png # verified task-head architecture map
957
+ assets/charts/unified_task_model_radar.svg # 9-method grouped small-multiple radar board
958
+ assets/charts/single_episode_task_model_radar.svg # 1-episode enlarged radar panel
959
+ assets/charts/episode128_task_model_radar.svg # 128-episode grouped radar panels
960
  assets/charts/*.svg # regenerated visualizations
961
 
962
  notes/
TASK_METHOD_20_RESULT_MATRIX.md CHANGED
@@ -18,7 +18,7 @@ Legend: `score` = direct numeric task score and `proxy` = documented compact sub
18
 
19
  ## Compact Score Matrix
20
 
21
- Cells show `raw metric value`, then `direct/proxy; normalized radar value; metric key`. The raw metric is the value to cite; the normalized value is the 0-1 plotting value used by the radar.
22
 
23
  | # | Task | Min | NN | 128-S | 128-NN | 128-RS | 128-RN | Qwen3 | C3-S | C3-N |
24
  | ---: | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
 
18
 
19
  ## Compact Score Matrix
20
 
21
+ Cells show `raw metric value`, then `direct/proxy; normalized radar value; metric key`. The raw metric is the value to cite; the normalized value is the exact linear 0-1 score retained in JSON. The SVG radar uses sqrt(normalized score) only for visual radius, so low but real differences remain visible without changing the table values.
22
 
23
  | # | Task | Min | NN | 128-S | 128-NN | 128-RS | 128-RN | Qwen3 | C3-S | C3-N |
24
  | ---: | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
assets/charts/episode128_task_model_radar.svg CHANGED
assets/charts/single_episode_task_model_radar.svg CHANGED
assets/charts/unified_task_model_radar.svg CHANGED
data/figure_index.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Figure Index",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:19:00+00:00",
5
  "scope": "Public figures, diagrams, charts, and derived modality thumbnails. Raw Xperience-10M videos, annotations, RRD files, and Qwen weights are excluded.",
6
  "figure_count": 29,
7
  "figures": [
@@ -406,17 +406,17 @@
406
  "id": "unified_task_model_radar",
407
  "title": "Unified 20-task model radar",
408
  "path": "docs/assets/charts/unified_task_model_radar.svg",
409
- "role": "Twenty-axis direction-aware comparison of minimal and neural MLP baselines, with 128-episode metadata, Qwen3, and Cosmos task-aligned overlay points and branch notes.",
410
  "source_script": "scripts/build_unified_task_model_radar.py",
411
  "surface": "website unified task section, README, HF mirrors",
412
  "exists": true,
413
- "bytes": 57938,
414
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8",
415
  "dimensions": {
416
  "format": "SVG",
417
  "width": 2400,
418
- "height": 1840,
419
- "view_box": "0 0 2400 1840"
420
  },
421
  "source_script_exists": true
422
  },
@@ -428,13 +428,13 @@
428
  "source_script": "scripts/build_unified_task_model_radar.py",
429
  "surface": "website unified task section, README, HF mirrors",
430
  "exists": true,
431
- "bytes": 35232,
432
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e",
433
  "dimensions": {
434
  "format": "SVG",
435
  "width": 2400,
436
- "height": 1840,
437
- "view_box": "0 0 2400 1840"
438
  },
439
  "source_script_exists": true
440
  },
@@ -442,17 +442,17 @@
442
  "id": "episode128_task_model_radar",
443
  "title": "128-episode 20-task model radar",
444
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
445
- "role": "Twenty-axis split radar for selected 128-episode methods: raw-feature simple/NN as complete scored polygons plus metadata, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano task-aligned overlays.",
446
  "source_script": "scripts/build_unified_task_model_radar.py",
447
  "surface": "website unified task section, README, HF mirrors",
448
  "exists": true,
449
- "bytes": 51915,
450
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4",
451
  "dimensions": {
452
  "format": "SVG",
453
  "width": 2400,
454
- "height": 1840,
455
- "view_box": "0 0 2400 1840"
456
  },
457
  "source_script_exists": true
458
  },
 
1
  {
2
  "title": "Ropedia Xperience-10M Figure Index",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:17+00:00",
5
  "scope": "Public figures, diagrams, charts, and derived modality thumbnails. Raw Xperience-10M videos, annotations, RRD files, and Qwen weights are excluded.",
6
  "figure_count": 29,
7
  "figures": [
 
406
  "id": "unified_task_model_radar",
407
  "title": "Unified 20-task model radar",
408
  "path": "docs/assets/charts/unified_task_model_radar.svg",
409
+ "role": "Grouped small-multiple 20-task radar board for all nine methods, separating single-episode, 128-episode metadata/text, 128-episode raw-feature, and foundation-model rows while preserving task keys and proxy notes.",
410
  "source_script": "scripts/build_unified_task_model_radar.py",
411
  "surface": "website unified task section, README, HF mirrors",
412
  "exists": true,
413
+ "bytes": 98527,
414
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec",
415
  "dimensions": {
416
  "format": "SVG",
417
  "width": 2400,
418
+ "height": 1900,
419
+ "view_box": "0 0 2400 1900"
420
  },
421
  "source_script_exists": true
422
  },
 
428
  "source_script": "scripts/build_unified_task_model_radar.py",
429
  "surface": "website unified task section, README, HF mirrors",
430
  "exists": true,
431
+ "bytes": 36930,
432
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939",
433
  "dimensions": {
434
  "format": "SVG",
435
  "width": 2400,
436
+ "height": 1900,
437
+ "view_box": "0 0 2400 1900"
438
  },
439
  "source_script_exists": true
440
  },
 
442
  "id": "episode128_task_model_radar",
443
  "title": "128-episode 20-task model radar",
444
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
445
+ "role": "Grouped 20-task radar for selected 128-episode methods: metadata/text baselines, raw-feature simple/NN, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano with local legends and proxy notes.",
446
  "source_script": "scripts/build_unified_task_model_radar.py",
447
  "surface": "website unified task section, README, HF mirrors",
448
  "exists": true,
449
+ "bytes": 79370,
450
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9",
451
  "dimensions": {
452
  "format": "SVG",
453
  "width": 2400,
454
+ "height": 1900,
455
+ "view_box": "0 0 2400 1900"
456
  },
457
  "source_script_exists": true
458
  },
data/mirror_parity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T20:00:18+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
  "group_count": 1258,
@@ -138,45 +138,45 @@
138
  "local": {
139
  "path": "repo:docs/data/artifact_index.json",
140
  "exists": true,
141
- "bytes": 124341,
142
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
143
  },
144
  "mirrors": {
145
  "hf_space": {
146
  "path": "hf_space:data/artifact_index.json",
147
  "exists": true,
148
- "bytes": 124341,
149
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
150
  },
151
  "hf_artifacts_data": {
152
  "path": "hf_artifacts:data/artifact_index.json",
153
  "exists": true,
154
- "bytes": 124341,
155
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
156
  },
157
  "hf_artifacts": {
158
  "path": "hf_artifacts:docs/data/artifact_index.json",
159
  "exists": true,
160
- "bytes": 124341,
161
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
162
  },
163
  "hf_model_data": {
164
  "path": "hf_model:data/artifact_index.json",
165
  "exists": true,
166
- "bytes": 124341,
167
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
168
  },
169
  "hf_model_docs_data": {
170
  "path": "hf_model:docs/data/artifact_index.json",
171
  "exists": true,
172
- "bytes": 124341,
173
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
174
  },
175
  "hf_model": {
176
  "path": "hf_model:metrics/artifact_index.json",
177
  "exists": true,
178
- "bytes": 124341,
179
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
180
  }
181
  },
182
  "failures": []
@@ -334,45 +334,45 @@
334
  "local": {
335
  "path": "repo:docs/data/figure_index.json",
336
  "exists": true,
337
- "bytes": 19485,
338
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
339
  },
340
  "mirrors": {
341
  "hf_space": {
342
  "path": "hf_space:data/figure_index.json",
343
  "exists": true,
344
- "bytes": 19485,
345
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
346
  },
347
  "hf_artifacts_data": {
348
  "path": "hf_artifacts:data/figure_index.json",
349
  "exists": true,
350
- "bytes": 19485,
351
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
352
  },
353
  "hf_artifacts": {
354
  "path": "hf_artifacts:docs/data/figure_index.json",
355
  "exists": true,
356
- "bytes": 19485,
357
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
358
  },
359
  "hf_model_data": {
360
  "path": "hf_model:data/figure_index.json",
361
  "exists": true,
362
- "bytes": 19485,
363
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
364
  },
365
  "hf_model_docs_data": {
366
  "path": "hf_model:docs/data/figure_index.json",
367
  "exists": true,
368
- "bytes": 19485,
369
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
370
  },
371
  "hf_model": {
372
  "path": "hf_model:metrics/figure_index.json",
373
  "exists": true,
374
- "bytes": 19485,
375
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
376
  }
377
  },
378
  "failures": []
@@ -972,44 +972,44 @@
972
  "path": "repo:docs/data/publication_audit.json",
973
  "exists": true,
974
  "bytes": 10940,
975
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
976
  },
977
  "mirrors": {
978
  "hf_space": {
979
  "path": "hf_space:data/publication_audit.json",
980
  "exists": true,
981
  "bytes": 10940,
982
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
983
  },
984
  "hf_artifacts_data": {
985
  "path": "hf_artifacts:data/publication_audit.json",
986
  "exists": true,
987
  "bytes": 10940,
988
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
989
  },
990
  "hf_artifacts": {
991
  "path": "hf_artifacts:docs/data/publication_audit.json",
992
  "exists": true,
993
  "bytes": 10940,
994
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
995
  },
996
  "hf_model_data": {
997
  "path": "hf_model:data/publication_audit.json",
998
  "exists": true,
999
  "bytes": 10940,
1000
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
1001
  },
1002
  "hf_model_docs_data": {
1003
  "path": "hf_model:docs/data/publication_audit.json",
1004
  "exists": true,
1005
  "bytes": 10940,
1006
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
1007
  },
1008
  "hf_model": {
1009
  "path": "hf_model:metrics/publication_audit.json",
1010
  "exists": true,
1011
  "bytes": 10940,
1012
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
1013
  }
1014
  },
1015
  "failures": []
@@ -1021,44 +1021,44 @@
1021
  "path": "repo:docs/data/public_surface_qa.json",
1022
  "exists": true,
1023
  "bytes": 7693,
1024
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1025
  },
1026
  "mirrors": {
1027
  "hf_space": {
1028
  "path": "hf_space:data/public_surface_qa.json",
1029
  "exists": true,
1030
  "bytes": 7693,
1031
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1032
  },
1033
  "hf_artifacts_data": {
1034
  "path": "hf_artifacts:data/public_surface_qa.json",
1035
  "exists": true,
1036
  "bytes": 7693,
1037
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1038
  },
1039
  "hf_artifacts": {
1040
  "path": "hf_artifacts:docs/data/public_surface_qa.json",
1041
  "exists": true,
1042
  "bytes": 7693,
1043
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1044
  },
1045
  "hf_model_data": {
1046
  "path": "hf_model:data/public_surface_qa.json",
1047
  "exists": true,
1048
  "bytes": 7693,
1049
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1050
  },
1051
  "hf_model_docs_data": {
1052
  "path": "hf_model:docs/data/public_surface_qa.json",
1053
  "exists": true,
1054
  "bytes": 7693,
1055
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1056
  },
1057
  "hf_model": {
1058
  "path": "hf_model:metrics/public_surface_qa.json",
1059
  "exists": true,
1060
  "bytes": 7693,
1061
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1062
  }
1063
  },
1064
  "failures": []
@@ -1217,44 +1217,44 @@
1217
  "path": "repo:docs/data/quality_gates.json",
1218
  "exists": true,
1219
  "bytes": 8640,
1220
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1221
  },
1222
  "mirrors": {
1223
  "hf_space": {
1224
  "path": "hf_space:data/quality_gates.json",
1225
  "exists": true,
1226
  "bytes": 8640,
1227
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1228
  },
1229
  "hf_artifacts_data": {
1230
  "path": "hf_artifacts:data/quality_gates.json",
1231
  "exists": true,
1232
  "bytes": 8640,
1233
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1234
  },
1235
  "hf_artifacts": {
1236
  "path": "hf_artifacts:docs/data/quality_gates.json",
1237
  "exists": true,
1238
  "bytes": 8640,
1239
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1240
  },
1241
  "hf_model_data": {
1242
  "path": "hf_model:data/quality_gates.json",
1243
  "exists": true,
1244
  "bytes": 8640,
1245
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1246
  },
1247
  "hf_model_docs_data": {
1248
  "path": "hf_model:docs/data/quality_gates.json",
1249
  "exists": true,
1250
  "bytes": 8640,
1251
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1252
  },
1253
  "hf_model": {
1254
  "path": "hf_model:metrics/quality_gates.json",
1255
  "exists": true,
1256
  "bytes": 8640,
1257
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1258
  }
1259
  },
1260
  "failures": []
@@ -1658,44 +1658,44 @@
1658
  "path": "repo:docs/data/scope_claims_audit.json",
1659
  "exists": true,
1660
  "bytes": 21313,
1661
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1662
  },
1663
  "mirrors": {
1664
  "hf_space": {
1665
  "path": "hf_space:data/scope_claims_audit.json",
1666
  "exists": true,
1667
  "bytes": 21313,
1668
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1669
  },
1670
  "hf_artifacts_data": {
1671
  "path": "hf_artifacts:data/scope_claims_audit.json",
1672
  "exists": true,
1673
  "bytes": 21313,
1674
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1675
  },
1676
  "hf_artifacts": {
1677
  "path": "hf_artifacts:docs/data/scope_claims_audit.json",
1678
  "exists": true,
1679
  "bytes": 21313,
1680
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1681
  },
1682
  "hf_model_data": {
1683
  "path": "hf_model:data/scope_claims_audit.json",
1684
  "exists": true,
1685
  "bytes": 21313,
1686
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1687
  },
1688
  "hf_model_docs_data": {
1689
  "path": "hf_model:docs/data/scope_claims_audit.json",
1690
  "exists": true,
1691
  "bytes": 21313,
1692
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1693
  },
1694
  "hf_model": {
1695
  "path": "hf_model:metrics/scope_claims_audit.json",
1696
  "exists": true,
1697
  "bytes": 21313,
1698
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1699
  }
1700
  },
1701
  "failures": []
@@ -1756,44 +1756,44 @@
1756
  "path": "repo:docs/data/source_alignment_audit.json",
1757
  "exists": true,
1758
  "bytes": 4432,
1759
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1760
  },
1761
  "mirrors": {
1762
  "hf_space": {
1763
  "path": "hf_space:data/source_alignment_audit.json",
1764
  "exists": true,
1765
  "bytes": 4432,
1766
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1767
  },
1768
  "hf_artifacts_data": {
1769
  "path": "hf_artifacts:data/source_alignment_audit.json",
1770
  "exists": true,
1771
  "bytes": 4432,
1772
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1773
  },
1774
  "hf_artifacts": {
1775
  "path": "hf_artifacts:docs/data/source_alignment_audit.json",
1776
  "exists": true,
1777
  "bytes": 4432,
1778
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1779
  },
1780
  "hf_model_data": {
1781
  "path": "hf_model:data/source_alignment_audit.json",
1782
  "exists": true,
1783
  "bytes": 4432,
1784
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1785
  },
1786
  "hf_model_docs_data": {
1787
  "path": "hf_model:docs/data/source_alignment_audit.json",
1788
  "exists": true,
1789
  "bytes": 4432,
1790
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1791
  },
1792
  "hf_model": {
1793
  "path": "hf_model:metrics/source_alignment_audit.json",
1794
  "exists": true,
1795
  "bytes": 4432,
1796
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1797
  }
1798
  },
1799
  "failures": []
@@ -1853,45 +1853,45 @@
1853
  "local": {
1854
  "path": "repo:docs/data/single_episode_task_model_radar.json",
1855
  "exists": true,
1856
- "bytes": 51327,
1857
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1858
  },
1859
  "mirrors": {
1860
  "hf_space": {
1861
  "path": "hf_space:data/single_episode_task_model_radar.json",
1862
  "exists": true,
1863
- "bytes": 51327,
1864
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1865
  },
1866
  "hf_artifacts_data": {
1867
  "path": "hf_artifacts:data/single_episode_task_model_radar.json",
1868
  "exists": true,
1869
- "bytes": 51327,
1870
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1871
  },
1872
  "hf_artifacts": {
1873
  "path": "hf_artifacts:docs/data/single_episode_task_model_radar.json",
1874
  "exists": true,
1875
- "bytes": 51327,
1876
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1877
  },
1878
  "hf_model_data": {
1879
  "path": "hf_model:data/single_episode_task_model_radar.json",
1880
  "exists": true,
1881
- "bytes": 51327,
1882
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1883
  },
1884
  "hf_model_docs_data": {
1885
  "path": "hf_model:docs/data/single_episode_task_model_radar.json",
1886
  "exists": true,
1887
- "bytes": 51327,
1888
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1889
  },
1890
  "hf_model": {
1891
  "path": "hf_model:metrics/single_episode_task_model_radar.json",
1892
  "exists": true,
1893
- "bytes": 51327,
1894
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1895
  }
1896
  },
1897
  "failures": []
@@ -1902,45 +1902,45 @@
1902
  "local": {
1903
  "path": "repo:docs/data/episode128_task_model_radar.json",
1904
  "exists": true,
1905
- "bytes": 185212,
1906
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1907
  },
1908
  "mirrors": {
1909
  "hf_space": {
1910
  "path": "hf_space:data/episode128_task_model_radar.json",
1911
  "exists": true,
1912
- "bytes": 185212,
1913
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1914
  },
1915
  "hf_artifacts_data": {
1916
  "path": "hf_artifacts:data/episode128_task_model_radar.json",
1917
  "exists": true,
1918
- "bytes": 185212,
1919
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1920
  },
1921
  "hf_artifacts": {
1922
  "path": "hf_artifacts:docs/data/episode128_task_model_radar.json",
1923
  "exists": true,
1924
- "bytes": 185212,
1925
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1926
  },
1927
  "hf_model_data": {
1928
  "path": "hf_model:data/episode128_task_model_radar.json",
1929
  "exists": true,
1930
- "bytes": 185212,
1931
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1932
  },
1933
  "hf_model_docs_data": {
1934
  "path": "hf_model:docs/data/episode128_task_model_radar.json",
1935
  "exists": true,
1936
- "bytes": 185212,
1937
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1938
  },
1939
  "hf_model": {
1940
  "path": "hf_model:metrics/episode128_task_model_radar.json",
1941
  "exists": true,
1942
- "bytes": 185212,
1943
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1944
  }
1945
  },
1946
  "failures": []
@@ -2197,44 +2197,44 @@
2197
  "path": "repo:docs/data/task_surface_integrity.json",
2198
  "exists": true,
2199
  "bytes": 46246,
2200
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2201
  },
2202
  "mirrors": {
2203
  "hf_space": {
2204
  "path": "hf_space:data/task_surface_integrity.json",
2205
  "exists": true,
2206
  "bytes": 46246,
2207
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2208
  },
2209
  "hf_artifacts_data": {
2210
  "path": "hf_artifacts:data/task_surface_integrity.json",
2211
  "exists": true,
2212
  "bytes": 46246,
2213
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2214
  },
2215
  "hf_artifacts": {
2216
  "path": "hf_artifacts:docs/data/task_surface_integrity.json",
2217
  "exists": true,
2218
  "bytes": 46246,
2219
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2220
  },
2221
  "hf_model_data": {
2222
  "path": "hf_model:data/task_surface_integrity.json",
2223
  "exists": true,
2224
  "bytes": 46246,
2225
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2226
  },
2227
  "hf_model_docs_data": {
2228
  "path": "hf_model:docs/data/task_surface_integrity.json",
2229
  "exists": true,
2230
  "bytes": 46246,
2231
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2232
  },
2233
  "hf_model": {
2234
  "path": "hf_model:metrics/task_surface_integrity.json",
2235
  "exists": true,
2236
  "bytes": 46246,
2237
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2238
  }
2239
  },
2240
  "failures": []
@@ -2294,45 +2294,45 @@
2294
  "local": {
2295
  "path": "repo:docs/data/task_method_20_result_matrix.json",
2296
  "exists": true,
2297
- "bytes": 128509,
2298
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2299
  },
2300
  "mirrors": {
2301
  "hf_space": {
2302
  "path": "hf_space:data/task_method_20_result_matrix.json",
2303
  "exists": true,
2304
- "bytes": 128509,
2305
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2306
  },
2307
  "hf_artifacts_data": {
2308
  "path": "hf_artifacts:data/task_method_20_result_matrix.json",
2309
  "exists": true,
2310
- "bytes": 128509,
2311
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2312
  },
2313
  "hf_artifacts": {
2314
  "path": "hf_artifacts:docs/data/task_method_20_result_matrix.json",
2315
  "exists": true,
2316
- "bytes": 128509,
2317
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2318
  },
2319
  "hf_model_data": {
2320
  "path": "hf_model:data/task_method_20_result_matrix.json",
2321
  "exists": true,
2322
- "bytes": 128509,
2323
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2324
  },
2325
  "hf_model_docs_data": {
2326
  "path": "hf_model:docs/data/task_method_20_result_matrix.json",
2327
  "exists": true,
2328
- "bytes": 128509,
2329
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2330
  },
2331
  "hf_model": {
2332
  "path": "hf_model:metrics/task_method_20_result_matrix.json",
2333
  "exists": true,
2334
- "bytes": 128509,
2335
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2336
  }
2337
  },
2338
  "failures": []
@@ -2490,45 +2490,45 @@
2490
  "local": {
2491
  "path": "repo:docs/data/unified_task_model_radar.json",
2492
  "exists": true,
2493
- "bytes": 229035,
2494
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2495
  },
2496
  "mirrors": {
2497
  "hf_space": {
2498
  "path": "hf_space:data/unified_task_model_radar.json",
2499
  "exists": true,
2500
- "bytes": 229035,
2501
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2502
  },
2503
  "hf_artifacts_data": {
2504
  "path": "hf_artifacts:data/unified_task_model_radar.json",
2505
  "exists": true,
2506
- "bytes": 229035,
2507
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2508
  },
2509
  "hf_artifacts": {
2510
  "path": "hf_artifacts:docs/data/unified_task_model_radar.json",
2511
  "exists": true,
2512
- "bytes": 229035,
2513
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2514
  },
2515
  "hf_model_data": {
2516
  "path": "hf_model:data/unified_task_model_radar.json",
2517
  "exists": true,
2518
- "bytes": 229035,
2519
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2520
  },
2521
  "hf_model_docs_data": {
2522
  "path": "hf_model:docs/data/unified_task_model_radar.json",
2523
  "exists": true,
2524
- "bytes": 229035,
2525
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2526
  },
2527
  "hf_model": {
2528
  "path": "hf_model:metrics/unified_task_model_radar.json",
2529
  "exists": true,
2530
- "bytes": 229035,
2531
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2532
  }
2533
  },
2534
  "failures": []
@@ -2540,44 +2540,44 @@
2540
  "path": "repo:docs/data/website_integrity.json",
2541
  "exists": true,
2542
  "bytes": 20178,
2543
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2544
  },
2545
  "mirrors": {
2546
  "hf_space": {
2547
  "path": "hf_space:data/website_integrity.json",
2548
  "exists": true,
2549
  "bytes": 20178,
2550
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2551
  },
2552
  "hf_artifacts_data": {
2553
  "path": "hf_artifacts:data/website_integrity.json",
2554
  "exists": true,
2555
  "bytes": 20178,
2556
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2557
  },
2558
  "hf_artifacts": {
2559
  "path": "hf_artifacts:docs/data/website_integrity.json",
2560
  "exists": true,
2561
  "bytes": 20178,
2562
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2563
  },
2564
  "hf_model_data": {
2565
  "path": "hf_model:data/website_integrity.json",
2566
  "exists": true,
2567
  "bytes": 20178,
2568
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2569
  },
2570
  "hf_model_docs_data": {
2571
  "path": "hf_model:docs/data/website_integrity.json",
2572
  "exists": true,
2573
  "bytes": 20178,
2574
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2575
  },
2576
  "hf_model": {
2577
  "path": "hf_model:metrics/website_integrity.json",
2578
  "exists": true,
2579
  "bytes": 20178,
2580
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2581
  }
2582
  },
2583
  "failures": []
@@ -2760,33 +2760,33 @@
2760
  "local": {
2761
  "path": "repo:docs/assets/charts/single_episode_task_model_radar.svg",
2762
  "exists": true,
2763
- "bytes": 35232,
2764
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2765
  },
2766
  "mirrors": {
2767
  "hf_space": {
2768
  "path": "hf_space:assets/charts/single_episode_task_model_radar.svg",
2769
  "exists": true,
2770
- "bytes": 35232,
2771
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2772
  },
2773
  "hf_artifacts_docs": {
2774
  "path": "hf_artifacts:docs/assets/charts/single_episode_task_model_radar.svg",
2775
  "exists": true,
2776
- "bytes": 35232,
2777
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2778
  },
2779
  "hf_artifacts_card": {
2780
  "path": "hf_artifacts:assets/charts/single_episode_task_model_radar.svg",
2781
  "exists": true,
2782
- "bytes": 35232,
2783
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2784
  },
2785
  "hf_model": {
2786
  "path": "hf_model:assets/charts/single_episode_task_model_radar.svg",
2787
  "exists": true,
2788
- "bytes": 35232,
2789
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2790
  }
2791
  },
2792
  "failures": []
@@ -2797,33 +2797,33 @@
2797
  "local": {
2798
  "path": "repo:docs/assets/charts/episode128_task_model_radar.svg",
2799
  "exists": true,
2800
- "bytes": 51915,
2801
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2802
  },
2803
  "mirrors": {
2804
  "hf_space": {
2805
  "path": "hf_space:assets/charts/episode128_task_model_radar.svg",
2806
  "exists": true,
2807
- "bytes": 51915,
2808
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2809
  },
2810
  "hf_artifacts_docs": {
2811
  "path": "hf_artifacts:docs/assets/charts/episode128_task_model_radar.svg",
2812
  "exists": true,
2813
- "bytes": 51915,
2814
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2815
  },
2816
  "hf_artifacts_card": {
2817
  "path": "hf_artifacts:assets/charts/episode128_task_model_radar.svg",
2818
  "exists": true,
2819
- "bytes": 51915,
2820
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2821
  },
2822
  "hf_model": {
2823
  "path": "hf_model:assets/charts/episode128_task_model_radar.svg",
2824
  "exists": true,
2825
- "bytes": 51915,
2826
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2827
  }
2828
  },
2829
  "failures": []
@@ -2871,33 +2871,33 @@
2871
  "local": {
2872
  "path": "repo:docs/assets/charts/unified_task_model_radar.svg",
2873
  "exists": true,
2874
- "bytes": 57938,
2875
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2876
  },
2877
  "mirrors": {
2878
  "hf_space": {
2879
  "path": "hf_space:assets/charts/unified_task_model_radar.svg",
2880
  "exists": true,
2881
- "bytes": 57938,
2882
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2883
  },
2884
  "hf_artifacts_docs": {
2885
  "path": "hf_artifacts:docs/assets/charts/unified_task_model_radar.svg",
2886
  "exists": true,
2887
- "bytes": 57938,
2888
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2889
  },
2890
  "hf_artifacts_card": {
2891
  "path": "hf_artifacts:assets/charts/unified_task_model_radar.svg",
2892
  "exists": true,
2893
- "bytes": 57938,
2894
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2895
  },
2896
  "hf_model": {
2897
  "path": "hf_model:assets/charts/unified_task_model_radar.svg",
2898
  "exists": true,
2899
- "bytes": 57938,
2900
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2901
  }
2902
  },
2903
  "failures": []
@@ -5327,21 +5327,21 @@
5327
  "local": {
5328
  "path": "repo:scripts/build_artifact_index.py",
5329
  "exists": true,
5330
- "bytes": 68279,
5331
- "sha256": "69b43ad5d3dc5a6893c4592fa47fff6a7a87691728ec2c61b121ec262d00bf2a"
5332
  },
5333
  "mirrors": {
5334
  "hf_artifacts": {
5335
  "path": "hf_artifacts:scripts/build_artifact_index.py",
5336
  "exists": true,
5337
- "bytes": 68279,
5338
- "sha256": "69b43ad5d3dc5a6893c4592fa47fff6a7a87691728ec2c61b121ec262d00bf2a"
5339
  },
5340
  "hf_model": {
5341
  "path": "hf_model:scripts/build_artifact_index.py",
5342
  "exists": true,
5343
- "bytes": 68279,
5344
- "sha256": "69b43ad5d3dc5a6893c4592fa47fff6a7a87691728ec2c61b121ec262d00bf2a"
5345
  }
5346
  },
5347
  "failures": []
@@ -5402,21 +5402,21 @@
5402
  "local": {
5403
  "path": "repo:scripts/build_figure_index.py",
5404
  "exists": true,
5405
- "bytes": 16845,
5406
- "sha256": "3f91f7f13a3fb08ab57c2f0a6b320102e9d5ae19b102b71499edb5b8fd5a2cec"
5407
  },
5408
  "mirrors": {
5409
  "hf_artifacts": {
5410
  "path": "hf_artifacts:scripts/build_figure_index.py",
5411
  "exists": true,
5412
- "bytes": 16845,
5413
- "sha256": "3f91f7f13a3fb08ab57c2f0a6b320102e9d5ae19b102b71499edb5b8fd5a2cec"
5414
  },
5415
  "hf_model": {
5416
  "path": "hf_model:scripts/build_figure_index.py",
5417
  "exists": true,
5418
- "bytes": 16845,
5419
- "sha256": "3f91f7f13a3fb08ab57c2f0a6b320102e9d5ae19b102b71499edb5b8fd5a2cec"
5420
  }
5421
  },
5422
  "failures": []
@@ -5877,21 +5877,21 @@
5877
  "local": {
5878
  "path": "repo:scripts/build_unified_task_model_radar.py",
5879
  "exists": true,
5880
- "bytes": 69956,
5881
- "sha256": "3d137eced3c376ed674090b823ff9439cea0e8e8670ea9de02c3d3781305a0c9"
5882
  },
5883
  "mirrors": {
5884
  "hf_artifacts": {
5885
  "path": "hf_artifacts:scripts/build_unified_task_model_radar.py",
5886
  "exists": true,
5887
- "bytes": 69956,
5888
- "sha256": "3d137eced3c376ed674090b823ff9439cea0e8e8670ea9de02c3d3781305a0c9"
5889
  },
5890
  "hf_model": {
5891
  "path": "hf_model:scripts/build_unified_task_model_radar.py",
5892
  "exists": true,
5893
- "bytes": 69956,
5894
- "sha256": "3d137eced3c376ed674090b823ff9439cea0e8e8670ea9de02c3d3781305a0c9"
5895
  }
5896
  },
5897
  "failures": []
@@ -6331,39 +6331,39 @@
6331
  "local": {
6332
  "path": "repo:docs/index.html",
6333
  "exists": true,
6334
- "bytes": 348267,
6335
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6336
  },
6337
  "mirrors": {
6338
  "hf_space": {
6339
  "path": "hf_space:index.html",
6340
  "exists": true,
6341
- "bytes": 348267,
6342
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6343
  },
6344
  "hf_artifacts_root": {
6345
  "path": "hf_artifacts:index.html",
6346
  "exists": true,
6347
- "bytes": 348267,
6348
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6349
  },
6350
  "hf_artifacts_docs": {
6351
  "path": "hf_artifacts:docs/index.html",
6352
  "exists": true,
6353
- "bytes": 348267,
6354
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6355
  },
6356
  "hf_model": {
6357
  "path": "hf_model:index.html",
6358
  "exists": true,
6359
- "bytes": 348267,
6360
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6361
  },
6362
  "hf_model_docs": {
6363
  "path": "hf_model:docs/index.html",
6364
  "exists": true,
6365
- "bytes": 348267,
6366
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6367
  }
6368
  },
6369
  "failures": []
@@ -31027,27 +31027,27 @@
31027
  "local": {
31028
  "path": "repo:FIGURE_INDEX.md",
31029
  "exists": true,
31030
- "bytes": 7027,
31031
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
31032
  },
31033
  "mirrors": {
31034
  "hf_space": {
31035
  "path": "hf_space:FIGURE_INDEX.md",
31036
  "exists": true,
31037
- "bytes": 7027,
31038
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
31039
  },
31040
  "hf_artifacts": {
31041
  "path": "hf_artifacts:FIGURE_INDEX.md",
31042
  "exists": true,
31043
- "bytes": 7027,
31044
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
31045
  },
31046
  "hf_model": {
31047
  "path": "hf_model:FIGURE_INDEX.md",
31048
  "exists": true,
31049
- "bytes": 7027,
31050
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
31051
  }
31052
  },
31053
  "failures": []
@@ -31554,27 +31554,27 @@
31554
  "local": {
31555
  "path": "repo:TASK_METHOD_20_RESULT_MATRIX.md",
31556
  "exists": true,
31557
- "bytes": 14715,
31558
- "sha256": "acf665d5c07f15224b42d9da93bc2586279852bd60241868391ba7c0a462a24f"
31559
  },
31560
  "mirrors": {
31561
  "hf_space": {
31562
  "path": "hf_space:TASK_METHOD_20_RESULT_MATRIX.md",
31563
  "exists": true,
31564
- "bytes": 14715,
31565
- "sha256": "acf665d5c07f15224b42d9da93bc2586279852bd60241868391ba7c0a462a24f"
31566
  },
31567
  "hf_artifacts": {
31568
  "path": "hf_artifacts:TASK_METHOD_20_RESULT_MATRIX.md",
31569
  "exists": true,
31570
- "bytes": 14715,
31571
- "sha256": "acf665d5c07f15224b42d9da93bc2586279852bd60241868391ba7c0a462a24f"
31572
  },
31573
  "hf_model": {
31574
  "path": "hf_model:TASK_METHOD_20_RESULT_MATRIX.md",
31575
  "exists": true,
31576
- "bytes": 14715,
31577
- "sha256": "acf665d5c07f15224b42d9da93bc2586279852bd60241868391ba7c0a462a24f"
31578
  }
31579
  },
31580
  "failures": []
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T20:33:57+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
  "group_count": 1258,
 
138
  "local": {
139
  "path": "repo:docs/data/artifact_index.json",
140
  "exists": true,
141
+ "bytes": 124471,
142
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
143
  },
144
  "mirrors": {
145
  "hf_space": {
146
  "path": "hf_space:data/artifact_index.json",
147
  "exists": true,
148
+ "bytes": 124471,
149
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
150
  },
151
  "hf_artifacts_data": {
152
  "path": "hf_artifacts:data/artifact_index.json",
153
  "exists": true,
154
+ "bytes": 124471,
155
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
156
  },
157
  "hf_artifacts": {
158
  "path": "hf_artifacts:docs/data/artifact_index.json",
159
  "exists": true,
160
+ "bytes": 124471,
161
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
162
  },
163
  "hf_model_data": {
164
  "path": "hf_model:data/artifact_index.json",
165
  "exists": true,
166
+ "bytes": 124471,
167
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
168
  },
169
  "hf_model_docs_data": {
170
  "path": "hf_model:docs/data/artifact_index.json",
171
  "exists": true,
172
+ "bytes": 124471,
173
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
174
  },
175
  "hf_model": {
176
  "path": "hf_model:metrics/artifact_index.json",
177
  "exists": true,
178
+ "bytes": 124471,
179
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
180
  }
181
  },
182
  "failures": []
 
334
  "local": {
335
  "path": "repo:docs/data/figure_index.json",
336
  "exists": true,
337
+ "bytes": 19526,
338
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
339
  },
340
  "mirrors": {
341
  "hf_space": {
342
  "path": "hf_space:data/figure_index.json",
343
  "exists": true,
344
+ "bytes": 19526,
345
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
346
  },
347
  "hf_artifacts_data": {
348
  "path": "hf_artifacts:data/figure_index.json",
349
  "exists": true,
350
+ "bytes": 19526,
351
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
352
  },
353
  "hf_artifacts": {
354
  "path": "hf_artifacts:docs/data/figure_index.json",
355
  "exists": true,
356
+ "bytes": 19526,
357
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
358
  },
359
  "hf_model_data": {
360
  "path": "hf_model:data/figure_index.json",
361
  "exists": true,
362
+ "bytes": 19526,
363
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
364
  },
365
  "hf_model_docs_data": {
366
  "path": "hf_model:docs/data/figure_index.json",
367
  "exists": true,
368
+ "bytes": 19526,
369
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
370
  },
371
  "hf_model": {
372
  "path": "hf_model:metrics/figure_index.json",
373
  "exists": true,
374
+ "bytes": 19526,
375
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
376
  }
377
  },
378
  "failures": []
 
972
  "path": "repo:docs/data/publication_audit.json",
973
  "exists": true,
974
  "bytes": 10940,
975
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
976
  },
977
  "mirrors": {
978
  "hf_space": {
979
  "path": "hf_space:data/publication_audit.json",
980
  "exists": true,
981
  "bytes": 10940,
982
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
983
  },
984
  "hf_artifacts_data": {
985
  "path": "hf_artifacts:data/publication_audit.json",
986
  "exists": true,
987
  "bytes": 10940,
988
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
989
  },
990
  "hf_artifacts": {
991
  "path": "hf_artifacts:docs/data/publication_audit.json",
992
  "exists": true,
993
  "bytes": 10940,
994
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
995
  },
996
  "hf_model_data": {
997
  "path": "hf_model:data/publication_audit.json",
998
  "exists": true,
999
  "bytes": 10940,
1000
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
1001
  },
1002
  "hf_model_docs_data": {
1003
  "path": "hf_model:docs/data/publication_audit.json",
1004
  "exists": true,
1005
  "bytes": 10940,
1006
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
1007
  },
1008
  "hf_model": {
1009
  "path": "hf_model:metrics/publication_audit.json",
1010
  "exists": true,
1011
  "bytes": 10940,
1012
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
1013
  }
1014
  },
1015
  "failures": []
 
1021
  "path": "repo:docs/data/public_surface_qa.json",
1022
  "exists": true,
1023
  "bytes": 7693,
1024
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1025
  },
1026
  "mirrors": {
1027
  "hf_space": {
1028
  "path": "hf_space:data/public_surface_qa.json",
1029
  "exists": true,
1030
  "bytes": 7693,
1031
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1032
  },
1033
  "hf_artifacts_data": {
1034
  "path": "hf_artifacts:data/public_surface_qa.json",
1035
  "exists": true,
1036
  "bytes": 7693,
1037
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1038
  },
1039
  "hf_artifacts": {
1040
  "path": "hf_artifacts:docs/data/public_surface_qa.json",
1041
  "exists": true,
1042
  "bytes": 7693,
1043
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1044
  },
1045
  "hf_model_data": {
1046
  "path": "hf_model:data/public_surface_qa.json",
1047
  "exists": true,
1048
  "bytes": 7693,
1049
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1050
  },
1051
  "hf_model_docs_data": {
1052
  "path": "hf_model:docs/data/public_surface_qa.json",
1053
  "exists": true,
1054
  "bytes": 7693,
1055
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1056
  },
1057
  "hf_model": {
1058
  "path": "hf_model:metrics/public_surface_qa.json",
1059
  "exists": true,
1060
  "bytes": 7693,
1061
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1062
  }
1063
  },
1064
  "failures": []
 
1217
  "path": "repo:docs/data/quality_gates.json",
1218
  "exists": true,
1219
  "bytes": 8640,
1220
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1221
  },
1222
  "mirrors": {
1223
  "hf_space": {
1224
  "path": "hf_space:data/quality_gates.json",
1225
  "exists": true,
1226
  "bytes": 8640,
1227
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1228
  },
1229
  "hf_artifacts_data": {
1230
  "path": "hf_artifacts:data/quality_gates.json",
1231
  "exists": true,
1232
  "bytes": 8640,
1233
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1234
  },
1235
  "hf_artifacts": {
1236
  "path": "hf_artifacts:docs/data/quality_gates.json",
1237
  "exists": true,
1238
  "bytes": 8640,
1239
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1240
  },
1241
  "hf_model_data": {
1242
  "path": "hf_model:data/quality_gates.json",
1243
  "exists": true,
1244
  "bytes": 8640,
1245
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1246
  },
1247
  "hf_model_docs_data": {
1248
  "path": "hf_model:docs/data/quality_gates.json",
1249
  "exists": true,
1250
  "bytes": 8640,
1251
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1252
  },
1253
  "hf_model": {
1254
  "path": "hf_model:metrics/quality_gates.json",
1255
  "exists": true,
1256
  "bytes": 8640,
1257
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1258
  }
1259
  },
1260
  "failures": []
 
1658
  "path": "repo:docs/data/scope_claims_audit.json",
1659
  "exists": true,
1660
  "bytes": 21313,
1661
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1662
  },
1663
  "mirrors": {
1664
  "hf_space": {
1665
  "path": "hf_space:data/scope_claims_audit.json",
1666
  "exists": true,
1667
  "bytes": 21313,
1668
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1669
  },
1670
  "hf_artifacts_data": {
1671
  "path": "hf_artifacts:data/scope_claims_audit.json",
1672
  "exists": true,
1673
  "bytes": 21313,
1674
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1675
  },
1676
  "hf_artifacts": {
1677
  "path": "hf_artifacts:docs/data/scope_claims_audit.json",
1678
  "exists": true,
1679
  "bytes": 21313,
1680
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1681
  },
1682
  "hf_model_data": {
1683
  "path": "hf_model:data/scope_claims_audit.json",
1684
  "exists": true,
1685
  "bytes": 21313,
1686
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1687
  },
1688
  "hf_model_docs_data": {
1689
  "path": "hf_model:docs/data/scope_claims_audit.json",
1690
  "exists": true,
1691
  "bytes": 21313,
1692
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1693
  },
1694
  "hf_model": {
1695
  "path": "hf_model:metrics/scope_claims_audit.json",
1696
  "exists": true,
1697
  "bytes": 21313,
1698
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1699
  }
1700
  },
1701
  "failures": []
 
1756
  "path": "repo:docs/data/source_alignment_audit.json",
1757
  "exists": true,
1758
  "bytes": 4432,
1759
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1760
  },
1761
  "mirrors": {
1762
  "hf_space": {
1763
  "path": "hf_space:data/source_alignment_audit.json",
1764
  "exists": true,
1765
  "bytes": 4432,
1766
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1767
  },
1768
  "hf_artifacts_data": {
1769
  "path": "hf_artifacts:data/source_alignment_audit.json",
1770
  "exists": true,
1771
  "bytes": 4432,
1772
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1773
  },
1774
  "hf_artifacts": {
1775
  "path": "hf_artifacts:docs/data/source_alignment_audit.json",
1776
  "exists": true,
1777
  "bytes": 4432,
1778
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1779
  },
1780
  "hf_model_data": {
1781
  "path": "hf_model:data/source_alignment_audit.json",
1782
  "exists": true,
1783
  "bytes": 4432,
1784
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1785
  },
1786
  "hf_model_docs_data": {
1787
  "path": "hf_model:docs/data/source_alignment_audit.json",
1788
  "exists": true,
1789
  "bytes": 4432,
1790
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1791
  },
1792
  "hf_model": {
1793
  "path": "hf_model:metrics/source_alignment_audit.json",
1794
  "exists": true,
1795
  "bytes": 4432,
1796
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1797
  }
1798
  },
1799
  "failures": []
 
1853
  "local": {
1854
  "path": "repo:docs/data/single_episode_task_model_radar.json",
1855
  "exists": true,
1856
+ "bytes": 52866,
1857
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1858
  },
1859
  "mirrors": {
1860
  "hf_space": {
1861
  "path": "hf_space:data/single_episode_task_model_radar.json",
1862
  "exists": true,
1863
+ "bytes": 52866,
1864
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1865
  },
1866
  "hf_artifacts_data": {
1867
  "path": "hf_artifacts:data/single_episode_task_model_radar.json",
1868
  "exists": true,
1869
+ "bytes": 52866,
1870
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1871
  },
1872
  "hf_artifacts": {
1873
  "path": "hf_artifacts:docs/data/single_episode_task_model_radar.json",
1874
  "exists": true,
1875
+ "bytes": 52866,
1876
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1877
  },
1878
  "hf_model_data": {
1879
  "path": "hf_model:data/single_episode_task_model_radar.json",
1880
  "exists": true,
1881
+ "bytes": 52866,
1882
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1883
  },
1884
  "hf_model_docs_data": {
1885
  "path": "hf_model:docs/data/single_episode_task_model_radar.json",
1886
  "exists": true,
1887
+ "bytes": 52866,
1888
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1889
  },
1890
  "hf_model": {
1891
  "path": "hf_model:metrics/single_episode_task_model_radar.json",
1892
  "exists": true,
1893
+ "bytes": 52866,
1894
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1895
  }
1896
  },
1897
  "failures": []
 
1902
  "local": {
1903
  "path": "repo:docs/data/episode128_task_model_radar.json",
1904
  "exists": true,
1905
+ "bytes": 186997,
1906
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1907
  },
1908
  "mirrors": {
1909
  "hf_space": {
1910
  "path": "hf_space:data/episode128_task_model_radar.json",
1911
  "exists": true,
1912
+ "bytes": 186997,
1913
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1914
  },
1915
  "hf_artifacts_data": {
1916
  "path": "hf_artifacts:data/episode128_task_model_radar.json",
1917
  "exists": true,
1918
+ "bytes": 186997,
1919
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1920
  },
1921
  "hf_artifacts": {
1922
  "path": "hf_artifacts:docs/data/episode128_task_model_radar.json",
1923
  "exists": true,
1924
+ "bytes": 186997,
1925
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1926
  },
1927
  "hf_model_data": {
1928
  "path": "hf_model:data/episode128_task_model_radar.json",
1929
  "exists": true,
1930
+ "bytes": 186997,
1931
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1932
  },
1933
  "hf_model_docs_data": {
1934
  "path": "hf_model:docs/data/episode128_task_model_radar.json",
1935
  "exists": true,
1936
+ "bytes": 186997,
1937
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1938
  },
1939
  "hf_model": {
1940
  "path": "hf_model:metrics/episode128_task_model_radar.json",
1941
  "exists": true,
1942
+ "bytes": 186997,
1943
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1944
  }
1945
  },
1946
  "failures": []
 
2197
  "path": "repo:docs/data/task_surface_integrity.json",
2198
  "exists": true,
2199
  "bytes": 46246,
2200
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2201
  },
2202
  "mirrors": {
2203
  "hf_space": {
2204
  "path": "hf_space:data/task_surface_integrity.json",
2205
  "exists": true,
2206
  "bytes": 46246,
2207
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2208
  },
2209
  "hf_artifacts_data": {
2210
  "path": "hf_artifacts:data/task_surface_integrity.json",
2211
  "exists": true,
2212
  "bytes": 46246,
2213
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2214
  },
2215
  "hf_artifacts": {
2216
  "path": "hf_artifacts:docs/data/task_surface_integrity.json",
2217
  "exists": true,
2218
  "bytes": 46246,
2219
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2220
  },
2221
  "hf_model_data": {
2222
  "path": "hf_model:data/task_surface_integrity.json",
2223
  "exists": true,
2224
  "bytes": 46246,
2225
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2226
  },
2227
  "hf_model_docs_data": {
2228
  "path": "hf_model:docs/data/task_surface_integrity.json",
2229
  "exists": true,
2230
  "bytes": 46246,
2231
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2232
  },
2233
  "hf_model": {
2234
  "path": "hf_model:metrics/task_surface_integrity.json",
2235
  "exists": true,
2236
  "bytes": 46246,
2237
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2238
  }
2239
  },
2240
  "failures": []
 
2294
  "local": {
2295
  "path": "repo:docs/data/task_method_20_result_matrix.json",
2296
  "exists": true,
2297
+ "bytes": 128991,
2298
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2299
  },
2300
  "mirrors": {
2301
  "hf_space": {
2302
  "path": "hf_space:data/task_method_20_result_matrix.json",
2303
  "exists": true,
2304
+ "bytes": 128991,
2305
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2306
  },
2307
  "hf_artifacts_data": {
2308
  "path": "hf_artifacts:data/task_method_20_result_matrix.json",
2309
  "exists": true,
2310
+ "bytes": 128991,
2311
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2312
  },
2313
  "hf_artifacts": {
2314
  "path": "hf_artifacts:docs/data/task_method_20_result_matrix.json",
2315
  "exists": true,
2316
+ "bytes": 128991,
2317
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2318
  },
2319
  "hf_model_data": {
2320
  "path": "hf_model:data/task_method_20_result_matrix.json",
2321
  "exists": true,
2322
+ "bytes": 128991,
2323
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2324
  },
2325
  "hf_model_docs_data": {
2326
  "path": "hf_model:docs/data/task_method_20_result_matrix.json",
2327
  "exists": true,
2328
+ "bytes": 128991,
2329
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2330
  },
2331
  "hf_model": {
2332
  "path": "hf_model:metrics/task_method_20_result_matrix.json",
2333
  "exists": true,
2334
+ "bytes": 128991,
2335
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2336
  }
2337
  },
2338
  "failures": []
 
2490
  "local": {
2491
  "path": "repo:docs/data/unified_task_model_radar.json",
2492
  "exists": true,
2493
+ "bytes": 230938,
2494
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2495
  },
2496
  "mirrors": {
2497
  "hf_space": {
2498
  "path": "hf_space:data/unified_task_model_radar.json",
2499
  "exists": true,
2500
+ "bytes": 230938,
2501
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2502
  },
2503
  "hf_artifacts_data": {
2504
  "path": "hf_artifacts:data/unified_task_model_radar.json",
2505
  "exists": true,
2506
+ "bytes": 230938,
2507
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2508
  },
2509
  "hf_artifacts": {
2510
  "path": "hf_artifacts:docs/data/unified_task_model_radar.json",
2511
  "exists": true,
2512
+ "bytes": 230938,
2513
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2514
  },
2515
  "hf_model_data": {
2516
  "path": "hf_model:data/unified_task_model_radar.json",
2517
  "exists": true,
2518
+ "bytes": 230938,
2519
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2520
  },
2521
  "hf_model_docs_data": {
2522
  "path": "hf_model:docs/data/unified_task_model_radar.json",
2523
  "exists": true,
2524
+ "bytes": 230938,
2525
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2526
  },
2527
  "hf_model": {
2528
  "path": "hf_model:metrics/unified_task_model_radar.json",
2529
  "exists": true,
2530
+ "bytes": 230938,
2531
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2532
  }
2533
  },
2534
  "failures": []
 
2540
  "path": "repo:docs/data/website_integrity.json",
2541
  "exists": true,
2542
  "bytes": 20178,
2543
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2544
  },
2545
  "mirrors": {
2546
  "hf_space": {
2547
  "path": "hf_space:data/website_integrity.json",
2548
  "exists": true,
2549
  "bytes": 20178,
2550
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2551
  },
2552
  "hf_artifacts_data": {
2553
  "path": "hf_artifacts:data/website_integrity.json",
2554
  "exists": true,
2555
  "bytes": 20178,
2556
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2557
  },
2558
  "hf_artifacts": {
2559
  "path": "hf_artifacts:docs/data/website_integrity.json",
2560
  "exists": true,
2561
  "bytes": 20178,
2562
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2563
  },
2564
  "hf_model_data": {
2565
  "path": "hf_model:data/website_integrity.json",
2566
  "exists": true,
2567
  "bytes": 20178,
2568
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2569
  },
2570
  "hf_model_docs_data": {
2571
  "path": "hf_model:docs/data/website_integrity.json",
2572
  "exists": true,
2573
  "bytes": 20178,
2574
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2575
  },
2576
  "hf_model": {
2577
  "path": "hf_model:metrics/website_integrity.json",
2578
  "exists": true,
2579
  "bytes": 20178,
2580
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2581
  }
2582
  },
2583
  "failures": []
 
2760
  "local": {
2761
  "path": "repo:docs/assets/charts/single_episode_task_model_radar.svg",
2762
  "exists": true,
2763
+ "bytes": 36930,
2764
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2765
  },
2766
  "mirrors": {
2767
  "hf_space": {
2768
  "path": "hf_space:assets/charts/single_episode_task_model_radar.svg",
2769
  "exists": true,
2770
+ "bytes": 36930,
2771
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2772
  },
2773
  "hf_artifacts_docs": {
2774
  "path": "hf_artifacts:docs/assets/charts/single_episode_task_model_radar.svg",
2775
  "exists": true,
2776
+ "bytes": 36930,
2777
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2778
  },
2779
  "hf_artifacts_card": {
2780
  "path": "hf_artifacts:assets/charts/single_episode_task_model_radar.svg",
2781
  "exists": true,
2782
+ "bytes": 36930,
2783
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2784
  },
2785
  "hf_model": {
2786
  "path": "hf_model:assets/charts/single_episode_task_model_radar.svg",
2787
  "exists": true,
2788
+ "bytes": 36930,
2789
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2790
  }
2791
  },
2792
  "failures": []
 
2797
  "local": {
2798
  "path": "repo:docs/assets/charts/episode128_task_model_radar.svg",
2799
  "exists": true,
2800
+ "bytes": 79370,
2801
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2802
  },
2803
  "mirrors": {
2804
  "hf_space": {
2805
  "path": "hf_space:assets/charts/episode128_task_model_radar.svg",
2806
  "exists": true,
2807
+ "bytes": 79370,
2808
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2809
  },
2810
  "hf_artifacts_docs": {
2811
  "path": "hf_artifacts:docs/assets/charts/episode128_task_model_radar.svg",
2812
  "exists": true,
2813
+ "bytes": 79370,
2814
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2815
  },
2816
  "hf_artifacts_card": {
2817
  "path": "hf_artifacts:assets/charts/episode128_task_model_radar.svg",
2818
  "exists": true,
2819
+ "bytes": 79370,
2820
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2821
  },
2822
  "hf_model": {
2823
  "path": "hf_model:assets/charts/episode128_task_model_radar.svg",
2824
  "exists": true,
2825
+ "bytes": 79370,
2826
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2827
  }
2828
  },
2829
  "failures": []
 
2871
  "local": {
2872
  "path": "repo:docs/assets/charts/unified_task_model_radar.svg",
2873
  "exists": true,
2874
+ "bytes": 98527,
2875
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2876
  },
2877
  "mirrors": {
2878
  "hf_space": {
2879
  "path": "hf_space:assets/charts/unified_task_model_radar.svg",
2880
  "exists": true,
2881
+ "bytes": 98527,
2882
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2883
  },
2884
  "hf_artifacts_docs": {
2885
  "path": "hf_artifacts:docs/assets/charts/unified_task_model_radar.svg",
2886
  "exists": true,
2887
+ "bytes": 98527,
2888
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2889
  },
2890
  "hf_artifacts_card": {
2891
  "path": "hf_artifacts:assets/charts/unified_task_model_radar.svg",
2892
  "exists": true,
2893
+ "bytes": 98527,
2894
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2895
  },
2896
  "hf_model": {
2897
  "path": "hf_model:assets/charts/unified_task_model_radar.svg",
2898
  "exists": true,
2899
+ "bytes": 98527,
2900
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2901
  }
2902
  },
2903
  "failures": []
 
5327
  "local": {
5328
  "path": "repo:scripts/build_artifact_index.py",
5329
  "exists": true,
5330
+ "bytes": 68407,
5331
+ "sha256": "9f5fc1af7c4c23083975066691bf6d75aa99280961198e2c47be88754c142418"
5332
  },
5333
  "mirrors": {
5334
  "hf_artifacts": {
5335
  "path": "hf_artifacts:scripts/build_artifact_index.py",
5336
  "exists": true,
5337
+ "bytes": 68407,
5338
+ "sha256": "9f5fc1af7c4c23083975066691bf6d75aa99280961198e2c47be88754c142418"
5339
  },
5340
  "hf_model": {
5341
  "path": "hf_model:scripts/build_artifact_index.py",
5342
  "exists": true,
5343
+ "bytes": 68407,
5344
+ "sha256": "9f5fc1af7c4c23083975066691bf6d75aa99280961198e2c47be88754c142418"
5345
  }
5346
  },
5347
  "failures": []
 
5402
  "local": {
5403
  "path": "repo:scripts/build_figure_index.py",
5404
  "exists": true,
5405
+ "bytes": 16886,
5406
+ "sha256": "fc44ea60b6f491d290a56d4c1097fcfc37c024f8a9bc3b3db013252f45d96e64"
5407
  },
5408
  "mirrors": {
5409
  "hf_artifacts": {
5410
  "path": "hf_artifacts:scripts/build_figure_index.py",
5411
  "exists": true,
5412
+ "bytes": 16886,
5413
+ "sha256": "fc44ea60b6f491d290a56d4c1097fcfc37c024f8a9bc3b3db013252f45d96e64"
5414
  },
5415
  "hf_model": {
5416
  "path": "hf_model:scripts/build_figure_index.py",
5417
  "exists": true,
5418
+ "bytes": 16886,
5419
+ "sha256": "fc44ea60b6f491d290a56d4c1097fcfc37c024f8a9bc3b3db013252f45d96e64"
5420
  }
5421
  },
5422
  "failures": []
 
5877
  "local": {
5878
  "path": "repo:scripts/build_unified_task_model_radar.py",
5879
  "exists": true,
5880
+ "bytes": 78735,
5881
+ "sha256": "8e7110381d56e5ef0effb7dcc7935d1bb0ce2653289504a50d04f29571a4e6b7"
5882
  },
5883
  "mirrors": {
5884
  "hf_artifacts": {
5885
  "path": "hf_artifacts:scripts/build_unified_task_model_radar.py",
5886
  "exists": true,
5887
+ "bytes": 78735,
5888
+ "sha256": "8e7110381d56e5ef0effb7dcc7935d1bb0ce2653289504a50d04f29571a4e6b7"
5889
  },
5890
  "hf_model": {
5891
  "path": "hf_model:scripts/build_unified_task_model_radar.py",
5892
  "exists": true,
5893
+ "bytes": 78735,
5894
+ "sha256": "8e7110381d56e5ef0effb7dcc7935d1bb0ce2653289504a50d04f29571a4e6b7"
5895
  }
5896
  },
5897
  "failures": []
 
6331
  "local": {
6332
  "path": "repo:docs/index.html",
6333
  "exists": true,
6334
+ "bytes": 348384,
6335
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6336
  },
6337
  "mirrors": {
6338
  "hf_space": {
6339
  "path": "hf_space:index.html",
6340
  "exists": true,
6341
+ "bytes": 348384,
6342
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6343
  },
6344
  "hf_artifacts_root": {
6345
  "path": "hf_artifacts:index.html",
6346
  "exists": true,
6347
+ "bytes": 348384,
6348
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6349
  },
6350
  "hf_artifacts_docs": {
6351
  "path": "hf_artifacts:docs/index.html",
6352
  "exists": true,
6353
+ "bytes": 348384,
6354
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6355
  },
6356
  "hf_model": {
6357
  "path": "hf_model:index.html",
6358
  "exists": true,
6359
+ "bytes": 348384,
6360
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6361
  },
6362
  "hf_model_docs": {
6363
  "path": "hf_model:docs/index.html",
6364
  "exists": true,
6365
+ "bytes": 348384,
6366
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6367
  }
6368
  },
6369
  "failures": []
 
31027
  "local": {
31028
  "path": "repo:FIGURE_INDEX.md",
31029
  "exists": true,
31030
+ "bytes": 7068,
31031
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
31032
  },
31033
  "mirrors": {
31034
  "hf_space": {
31035
  "path": "hf_space:FIGURE_INDEX.md",
31036
  "exists": true,
31037
+ "bytes": 7068,
31038
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
31039
  },
31040
  "hf_artifacts": {
31041
  "path": "hf_artifacts:FIGURE_INDEX.md",
31042
  "exists": true,
31043
+ "bytes": 7068,
31044
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
31045
  },
31046
  "hf_model": {
31047
  "path": "hf_model:FIGURE_INDEX.md",
31048
  "exists": true,
31049
+ "bytes": 7068,
31050
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
31051
  }
31052
  },
31053
  "failures": []
 
31554
  "local": {
31555
  "path": "repo:TASK_METHOD_20_RESULT_MATRIX.md",
31556
  "exists": true,
31557
+ "bytes": 14862,
31558
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
31559
  },
31560
  "mirrors": {
31561
  "hf_space": {
31562
  "path": "hf_space:TASK_METHOD_20_RESULT_MATRIX.md",
31563
  "exists": true,
31564
+ "bytes": 14862,
31565
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
31566
  },
31567
  "hf_artifacts": {
31568
  "path": "hf_artifacts:TASK_METHOD_20_RESULT_MATRIX.md",
31569
  "exists": true,
31570
+ "bytes": 14862,
31571
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
31572
  },
31573
  "hf_model": {
31574
  "path": "hf_model:TASK_METHOD_20_RESULT_MATRIX.md",
31575
  "exists": true,
31576
+ "bytes": 14862,
31577
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
31578
  }
31579
  },
31580
  "failures": []
data/publication_audit.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T20:02:09+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T20:35:41+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
data/unified_task_model_radar.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Unified 20-Task Model Radar",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:20:34+00:00",
5
  "task_count": 20,
6
  "method_count": 9,
7
  "method_task_record_count": 180,
@@ -10,10 +10,53 @@
10
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
11
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
12
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
 
13
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
14
- "foundation_model_overlay": "Qwen3-Omni and Cosmos3 points are plotted only on task-aligned axes. Scoreless records mean the public result does not evaluate that task contract.",
15
- "metadata_128_overlay": "128-episode aligned baselines have 20 records. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists; raw interaction text and paired camera-view embeddings remain explicit gaps.",
16
- "raw_128_overlay": "128-episode raw-feature baselines use staged sensor NPZ features. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
17
  },
18
  "series": [
19
  {
@@ -25,7 +68,7 @@
25
  "scope": "1 public sample episode",
26
  "stroke_dasharray": null,
27
  "method_detail": "Single-episode simple heads over the public sample split.",
28
- "plotted_as": "filled polygon",
29
  "result_record_count": 20,
30
  "scored_task_count": 20,
31
  "covered_task_count": 20,
@@ -48,7 +91,7 @@
48
  "scope": "1 public sample episode",
49
  "stroke_dasharray": null,
50
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
51
- "plotted_as": "filled polygon",
52
  "result_record_count": 20,
53
  "scored_task_count": 20,
54
  "covered_task_count": 20,
@@ -71,7 +114,7 @@
71
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
72
  "stroke_dasharray": "9 6",
73
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
74
- "plotted_as": "colored point overlay",
75
  "result_record_count": 20,
76
  "scored_task_count": 20,
77
  "covered_task_count": 20,
@@ -95,7 +138,7 @@
95
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
96
  "stroke_dasharray": "3 6",
97
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
98
- "plotted_as": "colored point overlay",
99
  "result_record_count": 20,
100
  "scored_task_count": 20,
101
  "covered_task_count": 20,
@@ -119,7 +162,7 @@
119
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
120
  "stroke_dasharray": "8 4",
121
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
122
- "plotted_as": "colored point overlay",
123
  "result_record_count": 20,
124
  "scored_task_count": 20,
125
  "covered_task_count": 20,
@@ -143,7 +186,7 @@
143
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
144
  "stroke_dasharray": "2 5",
145
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
146
- "plotted_as": "colored point overlay",
147
  "result_record_count": 20,
148
  "scored_task_count": 20,
149
  "covered_task_count": 20,
@@ -167,7 +210,7 @@
167
  "scope": "128 selected episodes, held-out test",
168
  "stroke_dasharray": "7 7",
169
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
170
- "plotted_as": "colored point overlay",
171
  "result_record_count": 20,
172
  "scored_task_count": 20,
173
  "covered_task_count": 20,
@@ -190,7 +233,7 @@
190
  "scope": "128 selected episodes, held-out test",
191
  "stroke_dasharray": "4 7",
192
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
193
- "plotted_as": "colored point overlay",
194
  "result_record_count": 20,
195
  "scored_task_count": 20,
196
  "covered_task_count": 20,
@@ -213,7 +256,7 @@
213
  "scope": "128 selected episodes, held-out test",
214
  "stroke_dasharray": "2 7",
215
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
216
- "plotted_as": "colored point overlay",
217
  "result_record_count": 20,
218
  "scored_task_count": 20,
219
  "covered_task_count": 20,
 
1
  {
2
  "title": "Unified 20-Task Model Radar",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:16+00:00",
5
  "task_count": 20,
6
  "method_count": 9,
7
  "method_task_record_count": 180,
 
10
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
11
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
12
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
13
+ "radar_visual_radius": "SVG radar panels use sqrt(normalized_score) for radius so polygon area remains closer to the score and low-valued but real differences stay visible; the JSON and matrix retain exact linear normalized_score values",
14
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
15
+ "foundation_model_overlay": "Qwen3-Omni and Cosmos3 are grouped in the foundation-model radar panel. All current public model rows have 20 scored task records, with source paths retained for every metric.",
16
+ "metadata_128_overlay": "128-episode aligned baselines are grouped in the metadata/text radar panel. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists.",
17
+ "raw_128_overlay": "128-episode raw-feature baselines are grouped in the raw-feature radar panel. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
18
+ },
19
+ "chart_design": {
20
+ "mode": "grouped_small_multiples",
21
+ "method_count": 9,
22
+ "reason": "The public release has nine methods and 180 scored records; small-multiple radar panels avoid a nine-polygon overlay while keeping every method visible.",
23
+ "groups": [
24
+ {
25
+ "id": "single_episode",
26
+ "title": "Single-episode sample",
27
+ "series_ids": [
28
+ "minimal",
29
+ "neural_mlp"
30
+ ]
31
+ },
32
+ {
33
+ "id": "metadata_128",
34
+ "title": "128-episode metadata/text",
35
+ "series_ids": [
36
+ "metadata128_simple",
37
+ "metadata128_neural_mlp"
38
+ ]
39
+ },
40
+ {
41
+ "id": "raw_128",
42
+ "title": "128-episode raw features",
43
+ "series_ids": [
44
+ "raw128_simple",
45
+ "raw128_neural_mlp"
46
+ ]
47
+ },
48
+ {
49
+ "id": "foundation_models",
50
+ "title": "Foundation-model probes",
51
+ "series_ids": [
52
+ "qwen3_omni_v6_lora",
53
+ "cosmos3_super_reasoner",
54
+ "cosmos3_nano_future_window"
55
+ ]
56
+ }
57
+ ],
58
+ "visual_radius_transform": "sqrt(normalized_score)",
59
+ "exact_value_source": "docs/data/task_method_20_result_matrix.json"
60
  },
61
  "series": [
62
  {
 
68
  "scope": "1 public sample episode",
69
  "stroke_dasharray": null,
70
  "method_detail": "Single-episode simple heads over the public sample split.",
71
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
72
  "result_record_count": 20,
73
  "scored_task_count": 20,
74
  "covered_task_count": 20,
 
91
  "scope": "1 public sample episode",
92
  "stroke_dasharray": null,
93
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
94
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
95
  "result_record_count": 20,
96
  "scored_task_count": 20,
97
  "covered_task_count": 20,
 
114
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
115
  "stroke_dasharray": "9 6",
116
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
117
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
118
  "result_record_count": 20,
119
  "scored_task_count": 20,
120
  "covered_task_count": 20,
 
138
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
139
  "stroke_dasharray": "3 6",
140
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
141
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
142
  "result_record_count": 20,
143
  "scored_task_count": 20,
144
  "covered_task_count": 20,
 
162
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
163
  "stroke_dasharray": "8 4",
164
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
165
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
166
  "result_record_count": 20,
167
  "scored_task_count": 20,
168
  "covered_task_count": 20,
 
186
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
187
  "stroke_dasharray": "2 5",
188
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
189
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
190
  "result_record_count": 20,
191
  "scored_task_count": 20,
192
  "covered_task_count": 20,
 
210
  "scope": "128 selected episodes, held-out test",
211
  "stroke_dasharray": "7 7",
212
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
213
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
214
  "result_record_count": 20,
215
  "scored_task_count": 20,
216
  "covered_task_count": 20,
 
233
  "scope": "128 selected episodes, held-out test",
234
  "stroke_dasharray": "4 7",
235
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
236
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
237
  "result_record_count": 20,
238
  "scored_task_count": 20,
239
  "covered_task_count": 20,
 
256
  "scope": "128 selected episodes, held-out test",
257
  "stroke_dasharray": "2 7",
258
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
259
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
260
  "result_record_count": 20,
261
  "scored_task_count": 20,
262
  "covered_task_count": 20,
docs/assets/charts/episode128_task_model_radar.svg CHANGED
docs/assets/charts/single_episode_task_model_radar.svg CHANGED
docs/assets/charts/unified_task_model_radar.svg CHANGED
docs/data/artifact_index.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "title": "Ropedia Xperience-10M Task Suite Artifact Index",
3
- "generated_at_utc": "2026-06-21T15:19:00+00:00",
4
  "status": "pass",
5
  "artifact_count": 228,
6
  "missing": [],
@@ -92,8 +92,8 @@
92
  "surface": "repo_hf",
93
  "shows": "Defines terminology that can be confused across data scope, task metrics, model branches, and public mirrors.",
94
  "exists": true,
95
- "bytes": 10220,
96
- "sha256": "c3723a3dfaf902f4bc2527b78d11057ab8ba01c6b9715fa873c7dd819deddf81"
97
  },
98
  {
99
  "id": "glossary_json",
@@ -103,8 +103,8 @@
103
  "surface": "website_hf",
104
  "shows": "Machine-readable terminology layer for the website, artifact dataset, model mirror, and public QA checks.",
105
  "exists": true,
106
- "bytes": 17380,
107
- "sha256": "3ba746413eb4a7d2646333e3746e0b3f4039db61d03c159d2d059fecfe4b51ed"
108
  },
109
  {
110
  "id": "research_roadmap",
@@ -158,8 +158,8 @@
158
  "surface": "repo_hf",
159
  "shows": "Frames spatial intelligence, human-video world modeling, and vision-language-action as three pipeline tracks with explicit inputs, outputs, maturity, and next evidence gates.",
160
  "exists": true,
161
- "bytes": 8008,
162
- "sha256": "abd179631f1d204328fa5a2e018207eceeb7d46be12b7dab2826e70dfde97130"
163
  },
164
  {
165
  "id": "three_foundation_pipelines_json",
@@ -169,8 +169,8 @@
169
  "surface": "website_hf",
170
  "shows": "Machine-readable pipeline-track contract for the website and Hugging Face mirrors.",
171
  "exists": true,
172
- "bytes": 10312,
173
- "sha256": "9eb30d098f1b3787255a5506e4299f2cd4904ef989f2eaeaa2e5f77519c17378"
174
  },
175
  {
176
  "id": "spatial_intelligence_slide_diagram",
@@ -632,7 +632,7 @@
632
  "shows": "Machine-readable source-alignment pass/fail check for repo, website, and HF surfaces.",
633
  "exists": true,
634
  "bytes": 4432,
635
- "sha256": "5ab2ea4bfefe9f5bc7854f02b2e1e2b5206766a54447647191828da1a1a2077c"
636
  },
637
  {
638
  "id": "source_alignment_validator",
@@ -653,8 +653,8 @@
653
  "surface": "repo_hf",
654
  "shows": "Publishes prepared Space, artifact dataset, and model bundles, including an explicit model-binary upload batch.",
655
  "exists": true,
656
- "bytes": 25115,
657
- "sha256": "31866958ffaa3954f1bc4f920fbf80d5b41fd95228be50989dadb6e8cbe619e3"
658
  },
659
  {
660
  "id": "github_package_dockerfile",
@@ -697,8 +697,8 @@
697
  "surface": "website_hf",
698
  "shows": "Machine-readable protocol generated from committed task metrics for website and HF mirrors.",
699
  "exists": true,
700
- "bytes": 24047,
701
- "sha256": "d8f61b646a2f3f1e0af901dbdaff310ebfeea90622c93a34b9e35f34be98b896"
702
  },
703
  {
704
  "id": "evaluation_protocol_builder",
@@ -708,8 +708,8 @@
708
  "surface": "repo_hf",
709
  "shows": "Regenerates the protocol from committed summary metrics and task artifacts.",
710
  "exists": true,
711
- "bytes": 19825,
712
- "sha256": "aa9de1582f8fa79c1850e10e69fb125c0e3c1add433c7ebedc104c2efb42272e"
713
  },
714
  {
715
  "id": "task_suite_20",
@@ -730,8 +730,8 @@
730
  "surface": "website_hf",
731
  "shows": "Machine-readable unified 20-task index for the website, Hugging Face mirrors, and live verification.",
732
  "exists": true,
733
- "bytes": 34585,
734
- "sha256": "75145285cf71bc3bb9a10377a1921b60e85c4546dc8b858102b3c26e94c11a01"
735
  },
736
  {
737
  "id": "task_suite_20_builder",
@@ -741,8 +741,8 @@
741
  "surface": "repo_hf",
742
  "shows": "Regenerates the unified 20-task JSON and Markdown from the public-sample metrics plus the historical provenance result bundle.",
743
  "exists": true,
744
- "bytes": 12157,
745
- "sha256": "157265b5c025f279ce1eb56c52dd720ce0969b8426d5887030bfa179a3b565e0"
746
  },
747
  {
748
  "id": "unified_task_model_radar_json",
@@ -750,10 +750,10 @@
750
  "path": "docs/data/unified_task_model_radar.json",
751
  "kind": "website_data",
752
  "surface": "website_hf",
753
- "shows": "Stores normalized 20-axis radar values, raw task metrics, Qwen3-Omni/Cosmos3 overlay mappings, method-card caveats, proxy flags, and source artifacts.",
754
  "exists": true,
755
- "bytes": 228815,
756
- "sha256": "862376178e8b0d01b536f49a18b7934a373494f8b36080790f616438ec0e035e"
757
  },
758
  {
759
  "id": "single_episode_task_model_radar_json",
@@ -763,8 +763,8 @@
763
  "surface": "website_hf",
764
  "shows": "Machine-readable split radar for the one-episode Minimal and Neural MLP baselines, both scored on all 20 task contracts.",
765
  "exists": true,
766
- "bytes": 51107,
767
- "sha256": "5f2ebb41e8488446ea5c5cd2cb75bbedce688433feffe1412288de56b133bd5c"
768
  },
769
  {
770
  "id": "episode128_task_model_radar_json",
@@ -774,8 +774,8 @@
774
  "surface": "website_hf",
775
  "shows": "Machine-readable split radar for selected 128-episode metadata/raw baselines, Qwen3-Omni v6, Cosmos3-Super, and Cosmos3-Nano, now complete at 140/140 scored rows with proxy notes retained.",
776
  "exists": true,
777
- "bytes": 184992,
778
- "sha256": "385704db90443d74903f365e90b27538020f5574c96f296bbf63173f488a645d"
779
  },
780
  {
781
  "id": "task_method_20_result_matrix_json",
@@ -785,8 +785,8 @@
785
  "surface": "website_hf",
786
  "shows": "Machine-readable 9-method by 20-task matrix where every method has 20 records and the current release is complete at 180/180 scored rows.",
787
  "exists": true,
788
- "bytes": 128509,
789
- "sha256": "96082daa33771963ac40b7d719df00a76ec443508a3d3101cb6dd82d87965729"
790
  },
791
  {
792
  "id": "task_method_20_result_matrix",
@@ -796,8 +796,8 @@
796
  "surface": "repo_hf",
797
  "shows": "Reader-facing table that separates 20 records per method, direct numeric scores, documented compact-proxy scores, and source artifacts.",
798
  "exists": true,
799
- "bytes": 3563,
800
- "sha256": "84b21bd3359e6149b952a9b3b6991f7ba8ff412390a92983bd6d59f212d58316"
801
  },
802
  {
803
  "id": "task_method_20_gap_audit_json",
@@ -808,7 +808,7 @@
808
  "shows": "Machine-readable 180-record completion ledger with numeric scores, proxy flags, explicit status reasons, and source artifacts.",
809
  "exists": true,
810
  "bytes": 8500,
811
- "sha256": "5743354519dfae924d973ffca90a5d1ef4b0c316cf201b3ac443f6b5d4600de5"
812
  },
813
  {
814
  "id": "task_method_20_gap_audit",
@@ -819,7 +819,7 @@
819
  "shows": "Reader-facing ledger confirming 180/180 scored method-task cells and listing the six compact-proxy records separately.",
820
  "exists": true,
821
  "bytes": 3417,
822
- "sha256": "01f47c74db965887e55f17b674e1617e396dd01a51fdc7bf9a9c7612f5596b2c"
823
  },
824
  {
825
  "id": "task_method_20_source_audit_json",
@@ -830,7 +830,7 @@
830
  "shows": "Machine-readable check that scored JSON-backed matrix cells match their declared metric source values.",
831
  "exists": true,
832
  "bytes": 561,
833
- "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
834
  },
835
  {
836
  "id": "task_method_20_source_audit",
@@ -841,7 +841,7 @@
841
  "shows": "Reader-facing source-value audit for the 180-result matrix.",
842
  "exists": true,
843
  "bytes": 447,
844
- "sha256": "1a0583629368cee3abadc49d4a0220dead924326a04e82b3a22a9fc6d6b0d252"
845
  },
846
  {
847
  "id": "two_evidence_line_map_chart",
@@ -860,10 +860,10 @@
860
  "path": "docs/assets/charts/unified_task_model_radar.svg",
861
  "kind": "generated_figure",
862
  "surface": "website_hf",
863
- "shows": "Compares minimal and neural MLP baselines across all 20 tasks, with Qwen3-Omni and Cosmos3 task-aligned overlays.",
864
  "exists": true,
865
- "bytes": 57938,
866
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
867
  },
868
  {
869
  "id": "single_episode_task_model_radar_chart",
@@ -871,10 +871,10 @@
871
  "path": "docs/assets/charts/single_episode_task_model_radar.svg",
872
  "kind": "generated_figure",
873
  "surface": "website_hf",
874
- "shows": "Separates the one-episode Minimal and Neural MLP 20/20 scored baselines into a clean two-polygon radar.",
875
  "exists": true,
876
- "bytes": 35232,
877
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
878
  },
879
  {
880
  "id": "episode128_task_model_radar_chart",
@@ -882,10 +882,10 @@
882
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
883
  "kind": "generated_figure",
884
  "surface": "website_hf",
885
- "shows": "Separates the selected 128-episode methods: raw-feature simple/NN as complete 20/20 scored polygons plus metadata, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano task-aligned overlays.",
886
  "exists": true,
887
- "bytes": 51915,
888
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
889
  },
890
  {
891
  "id": "unified_task_model_radar_builder",
@@ -893,10 +893,10 @@
893
  "path": "scripts/build_unified_task_model_radar.py",
894
  "kind": "visualization_builder",
895
  "surface": "repo_hf",
896
- "shows": "Regenerates the direction-aware radar chart and machine-readable metric overlay JSON.",
897
  "exists": true,
898
- "bytes": 68610,
899
- "sha256": "96bc2df0de5a9e512d69961ddb13ea87b26ef01f1f943f5a78a6dc373400949d"
900
  },
901
  {
902
  "id": "task_method_20_gap_audit_builder",
@@ -1093,8 +1093,8 @@
1093
  "surface": "repo_hf",
1094
  "shows": "Catalogs public figures, charts, modality thumbnails, dimensions, hashes, roles, and source scripts.",
1095
  "exists": true,
1096
- "bytes": 7027,
1097
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
1098
  },
1099
  {
1100
  "id": "figure_index_json",
@@ -1104,8 +1104,8 @@
1104
  "surface": "website_hf",
1105
  "shows": "Machine-readable visual asset index for website and Hugging Face mirrors.",
1106
  "exists": true,
1107
- "bytes": 19485,
1108
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
1109
  },
1110
  {
1111
  "id": "figure_index_builder",
@@ -1115,8 +1115,8 @@
1115
  "surface": "repo_hf",
1116
  "shows": "Regenerates visual-asset hashes, dimensions, and source-script provenance.",
1117
  "exists": true,
1118
- "bytes": 16845,
1119
- "sha256": "3f91f7f13a3fb08ab57c2f0a6b320102e9d5ae19b102b71499edb5b8fd5a2cec"
1120
  },
1121
  {
1122
  "id": "brand_assets_json",
@@ -1182,7 +1182,7 @@
1182
  "shows": "Machine-readable release-check summary for validators, mirrors, and public project surfaces.",
1183
  "exists": true,
1184
  "bytes": 8640,
1185
- "sha256": "6e54f6828b8fef97e963a9a56bccc91162b8a632f6897743095e32407fa0db98"
1186
  },
1187
  {
1188
  "id": "public_surface_qa",
@@ -1226,7 +1226,7 @@
1226
  "volatile": true,
1227
  "shows": "Machine-readable report for SEO/social metadata, accessible tab semantics, public links, project links, and clear project presentation.",
1228
  "exists": true,
1229
- "bytes": 7691,
1230
  "hash_policy": "existence_and_size_only"
1231
  },
1232
  {
@@ -1318,8 +1318,8 @@
1318
  "surface": "repo",
1319
  "shows": "Fetches the published GitHub/HF URLs and compares live hashes and public-card markers against the release assets.",
1320
  "exists": true,
1321
- "bytes": 69151,
1322
- "sha256": "c4af8644d50dafe7d4249dd7c5b36bb19e996628ff6d8436fbb6e027da526c1f"
1323
  },
1324
  {
1325
  "id": "reproducibility_contract",
@@ -1351,8 +1351,8 @@
1351
  "surface": "repo_hf",
1352
  "shows": "Generates the selective artifact catalog from local files.",
1353
  "exists": true,
1354
- "bytes": 68279,
1355
- "sha256": "69b43ad5d3dc5a6893c4592fa47fff6a7a87691728ec2c61b121ec262d00bf2a"
1356
  },
1357
  {
1358
  "id": "publication_audit",
@@ -1363,7 +1363,7 @@
1363
  "volatile": true,
1364
  "shows": "Confirms public bundles exclude raw data, caches, heavy archives, and credential text.",
1365
  "exists": true,
1366
- "bytes": 10939,
1367
  "hash_policy": "existence_and_size_only"
1368
  },
1369
  {
@@ -1387,7 +1387,7 @@
1387
  "volatile": true,
1388
  "shows": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, website HTML, and validator files.",
1389
  "exists": true,
1390
- "bytes": 1420743,
1391
  "hash_policy": "existence_and_size_only"
1392
  },
1393
  {
@@ -1399,7 +1399,7 @@
1399
  "volatile": true,
1400
  "shows": "Confirms local website links, anchors, JSON data files, and referenced images resolve.",
1401
  "exists": true,
1402
- "bytes": 20760,
1403
  "hash_policy": "existence_and_size_only"
1404
  },
1405
  {
 
1
  {
2
  "title": "Ropedia Xperience-10M Task Suite Artifact Index",
3
+ "generated_at_utc": "2026-06-21T20:35:18+00:00",
4
  "status": "pass",
5
  "artifact_count": 228,
6
  "missing": [],
 
92
  "surface": "repo_hf",
93
  "shows": "Defines terminology that can be confused across data scope, task metrics, model branches, and public mirrors.",
94
  "exists": true,
95
+ "bytes": 10857,
96
+ "sha256": "b9b8f8695b6c211e849073e1634057f158877a61161eca05b1266769158ee83e"
97
  },
98
  {
99
  "id": "glossary_json",
 
103
  "surface": "website_hf",
104
  "shows": "Machine-readable terminology layer for the website, artifact dataset, model mirror, and public QA checks.",
105
  "exists": true,
106
+ "bytes": 18754,
107
+ "sha256": "b1dc42e1f42a7c19bb4b2ebd32a0862df28bec671eaa849b09a97f103675e9eb"
108
  },
109
  {
110
  "id": "research_roadmap",
 
158
  "surface": "repo_hf",
159
  "shows": "Frames spatial intelligence, human-video world modeling, and vision-language-action as three pipeline tracks with explicit inputs, outputs, maturity, and next evidence gates.",
160
  "exists": true,
161
+ "bytes": 11609,
162
+ "sha256": "69d4d8781015c807c58a720b3bf967ee872540a2fe2b7e7973105946c5a1cf11"
163
  },
164
  {
165
  "id": "three_foundation_pipelines_json",
 
169
  "surface": "website_hf",
170
  "shows": "Machine-readable pipeline-track contract for the website and Hugging Face mirrors.",
171
  "exists": true,
172
+ "bytes": 14465,
173
+ "sha256": "c8c9b7a9ee8d3ecfe8662cf0336a7ca2ade6d670c5567d2a50dfb67c7241defd"
174
  },
175
  {
176
  "id": "spatial_intelligence_slide_diagram",
 
632
  "shows": "Machine-readable source-alignment pass/fail check for repo, website, and HF surfaces.",
633
  "exists": true,
634
  "bytes": 4432,
635
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
636
  },
637
  {
638
  "id": "source_alignment_validator",
 
653
  "surface": "repo_hf",
654
  "shows": "Publishes prepared Space, artifact dataset, and model bundles, including an explicit model-binary upload batch.",
655
  "exists": true,
656
+ "bytes": 29935,
657
+ "sha256": "89ed1b25784cd42f19f2e4731eaa52e8655e8ed06cc2a12b6a2c0add027c5fc0"
658
  },
659
  {
660
  "id": "github_package_dockerfile",
 
697
  "surface": "website_hf",
698
  "shows": "Machine-readable protocol generated from committed task metrics for website and HF mirrors.",
699
  "exists": true,
700
+ "bytes": 24267,
701
+ "sha256": "83289b52e0193fa86d63fac7dd8af98d0612199b1e1a5f5ed37014baa9168656"
702
  },
703
  {
704
  "id": "evaluation_protocol_builder",
 
708
  "surface": "repo_hf",
709
  "shows": "Regenerates the protocol from committed summary metrics and task artifacts.",
710
  "exists": true,
711
+ "bytes": 19845,
712
+ "sha256": "b5d888fd67ba6b8150b24960aa0858db50f16a5c7b59a8ce274401e384f0a2a3"
713
  },
714
  {
715
  "id": "task_suite_20",
 
730
  "surface": "website_hf",
731
  "shows": "Machine-readable unified 20-task index for the website, Hugging Face mirrors, and live verification.",
732
  "exists": true,
733
+ "bytes": 34805,
734
+ "sha256": "5d7a3b5b57e64980291b3ce6a8741b0b822e94bd447ca95519d1da7040923476"
735
  },
736
  {
737
  "id": "task_suite_20_builder",
 
741
  "surface": "repo_hf",
742
  "shows": "Regenerates the unified 20-task JSON and Markdown from the public-sample metrics plus the historical provenance result bundle.",
743
  "exists": true,
744
+ "bytes": 12129,
745
+ "sha256": "4c2550c657bfa5bc37e21e78e7e27dd70daa3b178c8ffb3191e52f835221713a"
746
  },
747
  {
748
  "id": "unified_task_model_radar_json",
 
750
  "path": "docs/data/unified_task_model_radar.json",
751
  "kind": "website_data",
752
  "surface": "website_hf",
753
+ "shows": "Stores normalized 20-axis radar values, raw task metrics, grouped chart-design metadata, Qwen3-Omni/Cosmos3 source mappings, method-card caveats, proxy flags, and source artifacts.",
754
  "exists": true,
755
+ "bytes": 230938,
756
+ "sha256": "bd7b8fd8a169273c5cec2beab050a583307dcc64e5e3fefcd690484b4a0754bc"
757
  },
758
  {
759
  "id": "single_episode_task_model_radar_json",
 
763
  "surface": "website_hf",
764
  "shows": "Machine-readable split radar for the one-episode Minimal and Neural MLP baselines, both scored on all 20 task contracts.",
765
  "exists": true,
766
+ "bytes": 52256,
767
+ "sha256": "f282dc0c4c654fe6f2cd646612fd0942d267dec60c6143cc36688edbc27c13da"
768
  },
769
  {
770
  "id": "episode128_task_model_radar_json",
 
774
  "surface": "website_hf",
775
  "shows": "Machine-readable split radar for selected 128-episode metadata/raw baselines, Qwen3-Omni v6, Cosmos3-Super, and Cosmos3-Nano, now complete at 140/140 scored rows with proxy notes retained.",
776
  "exists": true,
777
+ "bytes": 186828,
778
+ "sha256": "f904cf9eb3160a50e4469989e44557e77d710e459f812ba7b194d0ed8d9746bd"
779
  },
780
  {
781
  "id": "task_method_20_result_matrix_json",
 
785
  "surface": "website_hf",
786
  "shows": "Machine-readable 9-method by 20-task matrix where every method has 20 records and the current release is complete at 180/180 scored rows.",
787
  "exists": true,
788
+ "bytes": 128991,
789
+ "sha256": "bacb549e64b6d1936b55fe7593cfbaeb1dbf5bfe824c8507ea75e493230212fe"
790
  },
791
  {
792
  "id": "task_method_20_result_matrix",
 
796
  "surface": "repo_hf",
797
  "shows": "Reader-facing table that separates 20 records per method, direct numeric scores, documented compact-proxy scores, and source artifacts.",
798
  "exists": true,
799
+ "bytes": 14862,
800
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
801
  },
802
  {
803
  "id": "task_method_20_gap_audit_json",
 
808
  "shows": "Machine-readable 180-record completion ledger with numeric scores, proxy flags, explicit status reasons, and source artifacts.",
809
  "exists": true,
810
  "bytes": 8500,
811
+ "sha256": "4f5cc7a29fe030a9fd5b97893ac67454ebc6287f5942aeedb2bfca71c411332d"
812
  },
813
  {
814
  "id": "task_method_20_gap_audit",
 
819
  "shows": "Reader-facing ledger confirming 180/180 scored method-task cells and listing the six compact-proxy records separately.",
820
  "exists": true,
821
  "bytes": 3417,
822
+ "sha256": "b094732dd98c904ea0ab30e741576a3d5eeb8ca07e7a494e459b11e31320ae4d"
823
  },
824
  {
825
  "id": "task_method_20_source_audit_json",
 
830
  "shows": "Machine-readable check that scored JSON-backed matrix cells match their declared metric source values.",
831
  "exists": true,
832
  "bytes": 561,
833
+ "sha256": "1bc6bb15ab45e73ada4d3c2d1ec326cffd9c6bc6d6d791c7468b3ed16881a463"
834
  },
835
  {
836
  "id": "task_method_20_source_audit",
 
841
  "shows": "Reader-facing source-value audit for the 180-result matrix.",
842
  "exists": true,
843
  "bytes": 447,
844
+ "sha256": "7f5e80d6dcd66be1c6b2c63144adb64d0cdad0008884ebd8f015388545e4b914"
845
  },
846
  {
847
  "id": "two_evidence_line_map_chart",
 
860
  "path": "docs/assets/charts/unified_task_model_radar.svg",
861
  "kind": "generated_figure",
862
  "surface": "website_hf",
863
+ "shows": "Groups all nine methods into small-multiple 20-task radar panels so single-episode, 128-episode metadata/text, 128-episode raw-feature, and foundation-model rows remain readable.",
864
  "exists": true,
865
+ "bytes": 98527,
866
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
867
  },
868
  {
869
  "id": "single_episode_task_model_radar_chart",
 
871
  "path": "docs/assets/charts/single_episode_task_model_radar.svg",
872
  "kind": "generated_figure",
873
  "surface": "website_hf",
874
+ "shows": "Shows the one-episode Minimal and Neural MLP 20/20 scored baselines in one enlarged radar panel with local legend and task key.",
875
  "exists": true,
876
+ "bytes": 36930,
877
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
878
  },
879
  {
880
  "id": "episode128_task_model_radar_chart",
 
882
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
883
  "kind": "generated_figure",
884
  "surface": "website_hf",
885
+ "shows": "Separates selected 128-episode methods into metadata/text, raw-feature, and foundation-model radar panels with all 140 result rows scored and proxy notes retained.",
886
  "exists": true,
887
+ "bytes": 79370,
888
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
889
  },
890
  {
891
  "id": "unified_task_model_radar_builder",
 
893
  "path": "scripts/build_unified_task_model_radar.py",
894
  "kind": "visualization_builder",
895
  "surface": "repo_hf",
896
+ "shows": "Regenerates grouped 20-task radar charts plus machine-readable metric, source, chart-design, and proxy metadata.",
897
  "exists": true,
898
+ "bytes": 79396,
899
+ "sha256": "78949d7030cf6995edbb1d46a35692c7fe835d102df619cdf8c9d8ea9c5318e2"
900
  },
901
  {
902
  "id": "task_method_20_gap_audit_builder",
 
1093
  "surface": "repo_hf",
1094
  "shows": "Catalogs public figures, charts, modality thumbnails, dimensions, hashes, roles, and source scripts.",
1095
  "exists": true,
1096
+ "bytes": 7068,
1097
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
1098
  },
1099
  {
1100
  "id": "figure_index_json",
 
1104
  "surface": "website_hf",
1105
  "shows": "Machine-readable visual asset index for website and Hugging Face mirrors.",
1106
  "exists": true,
1107
+ "bytes": 19526,
1108
+ "sha256": "601acd715891bb148b0a1706573a2fdb015ca3c5967f02a169577b483b0dcd43"
1109
  },
1110
  {
1111
  "id": "figure_index_builder",
 
1115
  "surface": "repo_hf",
1116
  "shows": "Regenerates visual-asset hashes, dimensions, and source-script provenance.",
1117
  "exists": true,
1118
+ "bytes": 16886,
1119
+ "sha256": "fc44ea60b6f491d290a56d4c1097fcfc37c024f8a9bc3b3db013252f45d96e64"
1120
  },
1121
  {
1122
  "id": "brand_assets_json",
 
1182
  "shows": "Machine-readable release-check summary for validators, mirrors, and public project surfaces.",
1183
  "exists": true,
1184
  "bytes": 8640,
1185
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1186
  },
1187
  {
1188
  "id": "public_surface_qa",
 
1226
  "volatile": true,
1227
  "shows": "Machine-readable report for SEO/social metadata, accessible tab semantics, public links, project links, and clear project presentation.",
1228
  "exists": true,
1229
+ "bytes": 7693,
1230
  "hash_policy": "existence_and_size_only"
1231
  },
1232
  {
 
1318
  "surface": "repo",
1319
  "shows": "Fetches the published GitHub/HF URLs and compares live hashes and public-card markers against the release assets.",
1320
  "exists": true,
1321
+ "bytes": 69123,
1322
+ "sha256": "a96d0a95ea1da54fb8b6a95b2c91d4a067eef8358d825c44d919e66d78ecb8f7"
1323
  },
1324
  {
1325
  "id": "reproducibility_contract",
 
1351
  "surface": "repo_hf",
1352
  "shows": "Generates the selective artifact catalog from local files.",
1353
  "exists": true,
1354
+ "bytes": 68407,
1355
+ "sha256": "9f5fc1af7c4c23083975066691bf6d75aa99280961198e2c47be88754c142418"
1356
  },
1357
  {
1358
  "id": "publication_audit",
 
1363
  "volatile": true,
1364
  "shows": "Confirms public bundles exclude raw data, caches, heavy archives, and credential text.",
1365
  "exists": true,
1366
+ "bytes": 10940,
1367
  "hash_policy": "existence_and_size_only"
1368
  },
1369
  {
 
1387
  "volatile": true,
1388
  "shows": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, website HTML, and validator files.",
1389
  "exists": true,
1390
+ "bytes": 1420751,
1391
  "hash_policy": "existence_and_size_only"
1392
  },
1393
  {
 
1399
  "volatile": true,
1400
  "shows": "Confirms local website links, anchors, JSON data files, and referenced images resolve.",
1401
  "exists": true,
1402
+ "bytes": 20178,
1403
  "hash_policy": "existence_and_size_only"
1404
  },
1405
  {
docs/data/episode128_task_model_radar.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "128-Episode 20-Task Radar",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:20:34+00:00",
5
  "description": "Selected 128-episode metadata/raw baselines plus verified Qwen3-Omni v6, Cosmos3-Super, and Cosmos3-Nano diagnostics. Every method has 20 records; numeric scores appear only where the public artifact produced that task target.",
6
  "task_count": 20,
7
  "method_count": 7,
@@ -11,10 +11,45 @@
11
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
12
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
13
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
 
14
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
15
- "foundation_model_overlay": "Qwen3-Omni and Cosmos3 points are plotted only on task-aligned axes. Scoreless records mean the public result does not evaluate that task contract.",
16
- "metadata_128_overlay": "128-episode aligned baselines have 20 records. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists; raw interaction text and paired camera-view embeddings remain explicit gaps.",
17
- "raw_128_overlay": "128-episode raw-feature baselines use staged sensor NPZ features. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
  },
19
  "source_unified_radar": "docs/data/unified_task_model_radar.json",
20
  "source_result_matrix": "docs/data/task_method_20_result_matrix.json",
@@ -28,7 +63,7 @@
28
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
29
  "stroke_dasharray": "9 6",
30
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
31
- "plotted_as": "colored point overlay",
32
  "result_record_count": 20,
33
  "scored_task_count": 20,
34
  "covered_task_count": 20,
@@ -52,7 +87,7 @@
52
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
53
  "stroke_dasharray": "3 6",
54
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
55
- "plotted_as": "colored point overlay",
56
  "result_record_count": 20,
57
  "scored_task_count": 20,
58
  "covered_task_count": 20,
@@ -76,7 +111,7 @@
76
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
77
  "stroke_dasharray": "8 4",
78
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
79
- "plotted_as": "colored point overlay",
80
  "result_record_count": 20,
81
  "scored_task_count": 20,
82
  "covered_task_count": 20,
@@ -100,7 +135,7 @@
100
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
101
  "stroke_dasharray": "2 5",
102
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
103
- "plotted_as": "colored point overlay",
104
  "result_record_count": 20,
105
  "scored_task_count": 20,
106
  "covered_task_count": 20,
@@ -124,7 +159,7 @@
124
  "scope": "128 selected episodes, held-out test",
125
  "stroke_dasharray": "7 7",
126
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
127
- "plotted_as": "colored point overlay",
128
  "result_record_count": 20,
129
  "scored_task_count": 20,
130
  "covered_task_count": 20,
@@ -147,7 +182,7 @@
147
  "scope": "128 selected episodes, held-out test",
148
  "stroke_dasharray": "4 7",
149
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
150
- "plotted_as": "colored point overlay",
151
  "result_record_count": 20,
152
  "scored_task_count": 20,
153
  "covered_task_count": 20,
@@ -170,7 +205,7 @@
170
  "scope": "128 selected episodes, held-out test",
171
  "stroke_dasharray": "2 7",
172
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
173
- "plotted_as": "colored point overlay",
174
  "result_record_count": 20,
175
  "scored_task_count": 20,
176
  "covered_task_count": 20,
 
1
  {
2
  "title": "128-Episode 20-Task Radar",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:16+00:00",
5
  "description": "Selected 128-episode metadata/raw baselines plus verified Qwen3-Omni v6, Cosmos3-Super, and Cosmos3-Nano diagnostics. Every method has 20 records; numeric scores appear only where the public artifact produced that task target.",
6
  "task_count": 20,
7
  "method_count": 7,
 
11
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
12
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
13
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
14
+ "radar_visual_radius": "SVG radar panels use sqrt(normalized_score) for radius so polygon area remains closer to the score and low-valued but real differences stay visible; the JSON and matrix retain exact linear normalized_score values",
15
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
16
+ "foundation_model_overlay": "Qwen3-Omni and Cosmos3 are grouped in the foundation-model radar panel. All current public model rows have 20 scored task records, with source paths retained for every metric.",
17
+ "metadata_128_overlay": "128-episode aligned baselines are grouped in the metadata/text radar panel. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists.",
18
+ "raw_128_overlay": "128-episode raw-feature baselines are grouped in the raw-feature radar panel. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
19
+ },
20
+ "chart_design": {
21
+ "mode": "grouped_small_multiples",
22
+ "method_count": 7,
23
+ "reason": "This split view has 7 methods and 140 method-task records; grouped radar panels keep related methods readable while retaining the unified source matrix.",
24
+ "groups": [
25
+ {
26
+ "id": "metadata_128",
27
+ "title": "128-episode metadata/text",
28
+ "series_ids": [
29
+ "metadata128_simple",
30
+ "metadata128_neural_mlp"
31
+ ]
32
+ },
33
+ {
34
+ "id": "raw_128",
35
+ "title": "128-episode raw features",
36
+ "series_ids": [
37
+ "raw128_simple",
38
+ "raw128_neural_mlp"
39
+ ]
40
+ },
41
+ {
42
+ "id": "foundation_models",
43
+ "title": "Foundation-model probes",
44
+ "series_ids": [
45
+ "qwen3_omni_v6_lora",
46
+ "cosmos3_super_reasoner",
47
+ "cosmos3_nano_future_window"
48
+ ]
49
+ }
50
+ ],
51
+ "visual_radius_transform": "sqrt(normalized_score)",
52
+ "exact_value_source": "docs/data/task_method_20_result_matrix.json"
53
  },
54
  "source_unified_radar": "docs/data/unified_task_model_radar.json",
55
  "source_result_matrix": "docs/data/task_method_20_result_matrix.json",
 
63
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
64
  "stroke_dasharray": "9 6",
65
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
66
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
67
  "result_record_count": 20,
68
  "scored_task_count": 20,
69
  "covered_task_count": 20,
 
87
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
88
  "stroke_dasharray": "3 6",
89
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
90
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
91
  "result_record_count": 20,
92
  "scored_task_count": 20,
93
  "covered_task_count": 20,
 
111
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
112
  "stroke_dasharray": "8 4",
113
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
114
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
115
  "result_record_count": 20,
116
  "scored_task_count": 20,
117
  "covered_task_count": 20,
 
135
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
136
  "stroke_dasharray": "2 5",
137
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
138
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
139
  "result_record_count": 20,
140
  "scored_task_count": 20,
141
  "covered_task_count": 20,
 
159
  "scope": "128 selected episodes, held-out test",
160
  "stroke_dasharray": "7 7",
161
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
162
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
163
  "result_record_count": 20,
164
  "scored_task_count": 20,
165
  "covered_task_count": 20,
 
182
  "scope": "128 selected episodes, held-out test",
183
  "stroke_dasharray": "4 7",
184
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
185
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
186
  "result_record_count": 20,
187
  "scored_task_count": 20,
188
  "covered_task_count": 20,
 
205
  "scope": "128 selected episodes, held-out test",
206
  "stroke_dasharray": "2 7",
207
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
208
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
209
  "result_record_count": 20,
210
  "scored_task_count": 20,
211
  "covered_task_count": 20,
docs/data/figure_index.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Figure Index",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:19:00+00:00",
5
  "scope": "Public figures, diagrams, charts, and derived modality thumbnails. Raw Xperience-10M videos, annotations, RRD files, and Qwen weights are excluded.",
6
  "figure_count": 29,
7
  "figures": [
@@ -406,17 +406,17 @@
406
  "id": "unified_task_model_radar",
407
  "title": "Unified 20-task model radar",
408
  "path": "docs/assets/charts/unified_task_model_radar.svg",
409
- "role": "Twenty-axis direction-aware comparison of minimal and neural MLP baselines, with 128-episode metadata, Qwen3, and Cosmos task-aligned overlay points and branch notes.",
410
  "source_script": "scripts/build_unified_task_model_radar.py",
411
  "surface": "website unified task section, README, HF mirrors",
412
  "exists": true,
413
- "bytes": 57938,
414
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8",
415
  "dimensions": {
416
  "format": "SVG",
417
  "width": 2400,
418
- "height": 1840,
419
- "view_box": "0 0 2400 1840"
420
  },
421
  "source_script_exists": true
422
  },
@@ -428,13 +428,13 @@
428
  "source_script": "scripts/build_unified_task_model_radar.py",
429
  "surface": "website unified task section, README, HF mirrors",
430
  "exists": true,
431
- "bytes": 35232,
432
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e",
433
  "dimensions": {
434
  "format": "SVG",
435
  "width": 2400,
436
- "height": 1840,
437
- "view_box": "0 0 2400 1840"
438
  },
439
  "source_script_exists": true
440
  },
@@ -442,17 +442,17 @@
442
  "id": "episode128_task_model_radar",
443
  "title": "128-episode 20-task model radar",
444
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
445
- "role": "Twenty-axis split radar for selected 128-episode methods: raw-feature simple/NN as complete scored polygons plus metadata, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano task-aligned overlays.",
446
  "source_script": "scripts/build_unified_task_model_radar.py",
447
  "surface": "website unified task section, README, HF mirrors",
448
  "exists": true,
449
- "bytes": 51915,
450
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4",
451
  "dimensions": {
452
  "format": "SVG",
453
  "width": 2400,
454
- "height": 1840,
455
- "view_box": "0 0 2400 1840"
456
  },
457
  "source_script_exists": true
458
  },
 
1
  {
2
  "title": "Ropedia Xperience-10M Figure Index",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:17+00:00",
5
  "scope": "Public figures, diagrams, charts, and derived modality thumbnails. Raw Xperience-10M videos, annotations, RRD files, and Qwen weights are excluded.",
6
  "figure_count": 29,
7
  "figures": [
 
406
  "id": "unified_task_model_radar",
407
  "title": "Unified 20-task model radar",
408
  "path": "docs/assets/charts/unified_task_model_radar.svg",
409
+ "role": "Grouped small-multiple 20-task radar board for all nine methods, separating single-episode, 128-episode metadata/text, 128-episode raw-feature, and foundation-model rows while preserving task keys and proxy notes.",
410
  "source_script": "scripts/build_unified_task_model_radar.py",
411
  "surface": "website unified task section, README, HF mirrors",
412
  "exists": true,
413
+ "bytes": 98527,
414
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec",
415
  "dimensions": {
416
  "format": "SVG",
417
  "width": 2400,
418
+ "height": 1900,
419
+ "view_box": "0 0 2400 1900"
420
  },
421
  "source_script_exists": true
422
  },
 
428
  "source_script": "scripts/build_unified_task_model_radar.py",
429
  "surface": "website unified task section, README, HF mirrors",
430
  "exists": true,
431
+ "bytes": 36930,
432
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939",
433
  "dimensions": {
434
  "format": "SVG",
435
  "width": 2400,
436
+ "height": 1900,
437
+ "view_box": "0 0 2400 1900"
438
  },
439
  "source_script_exists": true
440
  },
 
442
  "id": "episode128_task_model_radar",
443
  "title": "128-episode 20-task model radar",
444
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
445
+ "role": "Grouped 20-task radar for selected 128-episode methods: metadata/text baselines, raw-feature simple/NN, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano with local legends and proxy notes.",
446
  "source_script": "scripts/build_unified_task_model_radar.py",
447
  "surface": "website unified task section, README, HF mirrors",
448
  "exists": true,
449
+ "bytes": 79370,
450
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9",
451
  "dimensions": {
452
  "format": "SVG",
453
  "width": 2400,
454
+ "height": 1900,
455
+ "view_box": "0 0 2400 1900"
456
  },
457
  "source_script_exists": true
458
  },
docs/data/mirror_parity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T20:00:18+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
  "group_count": 1258,
@@ -138,45 +138,45 @@
138
  "local": {
139
  "path": "repo:docs/data/artifact_index.json",
140
  "exists": true,
141
- "bytes": 124341,
142
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
143
  },
144
  "mirrors": {
145
  "hf_space": {
146
  "path": "hf_space:data/artifact_index.json",
147
  "exists": true,
148
- "bytes": 124341,
149
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
150
  },
151
  "hf_artifacts_data": {
152
  "path": "hf_artifacts:data/artifact_index.json",
153
  "exists": true,
154
- "bytes": 124341,
155
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
156
  },
157
  "hf_artifacts": {
158
  "path": "hf_artifacts:docs/data/artifact_index.json",
159
  "exists": true,
160
- "bytes": 124341,
161
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
162
  },
163
  "hf_model_data": {
164
  "path": "hf_model:data/artifact_index.json",
165
  "exists": true,
166
- "bytes": 124341,
167
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
168
  },
169
  "hf_model_docs_data": {
170
  "path": "hf_model:docs/data/artifact_index.json",
171
  "exists": true,
172
- "bytes": 124341,
173
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
174
  },
175
  "hf_model": {
176
  "path": "hf_model:metrics/artifact_index.json",
177
  "exists": true,
178
- "bytes": 124341,
179
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
180
  }
181
  },
182
  "failures": []
@@ -334,45 +334,45 @@
334
  "local": {
335
  "path": "repo:docs/data/figure_index.json",
336
  "exists": true,
337
- "bytes": 19485,
338
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
339
  },
340
  "mirrors": {
341
  "hf_space": {
342
  "path": "hf_space:data/figure_index.json",
343
  "exists": true,
344
- "bytes": 19485,
345
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
346
  },
347
  "hf_artifacts_data": {
348
  "path": "hf_artifacts:data/figure_index.json",
349
  "exists": true,
350
- "bytes": 19485,
351
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
352
  },
353
  "hf_artifacts": {
354
  "path": "hf_artifacts:docs/data/figure_index.json",
355
  "exists": true,
356
- "bytes": 19485,
357
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
358
  },
359
  "hf_model_data": {
360
  "path": "hf_model:data/figure_index.json",
361
  "exists": true,
362
- "bytes": 19485,
363
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
364
  },
365
  "hf_model_docs_data": {
366
  "path": "hf_model:docs/data/figure_index.json",
367
  "exists": true,
368
- "bytes": 19485,
369
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
370
  },
371
  "hf_model": {
372
  "path": "hf_model:metrics/figure_index.json",
373
  "exists": true,
374
- "bytes": 19485,
375
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
376
  }
377
  },
378
  "failures": []
@@ -972,44 +972,44 @@
972
  "path": "repo:docs/data/publication_audit.json",
973
  "exists": true,
974
  "bytes": 10940,
975
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
976
  },
977
  "mirrors": {
978
  "hf_space": {
979
  "path": "hf_space:data/publication_audit.json",
980
  "exists": true,
981
  "bytes": 10940,
982
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
983
  },
984
  "hf_artifacts_data": {
985
  "path": "hf_artifacts:data/publication_audit.json",
986
  "exists": true,
987
  "bytes": 10940,
988
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
989
  },
990
  "hf_artifacts": {
991
  "path": "hf_artifacts:docs/data/publication_audit.json",
992
  "exists": true,
993
  "bytes": 10940,
994
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
995
  },
996
  "hf_model_data": {
997
  "path": "hf_model:data/publication_audit.json",
998
  "exists": true,
999
  "bytes": 10940,
1000
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
1001
  },
1002
  "hf_model_docs_data": {
1003
  "path": "hf_model:docs/data/publication_audit.json",
1004
  "exists": true,
1005
  "bytes": 10940,
1006
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
1007
  },
1008
  "hf_model": {
1009
  "path": "hf_model:metrics/publication_audit.json",
1010
  "exists": true,
1011
  "bytes": 10940,
1012
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
1013
  }
1014
  },
1015
  "failures": []
@@ -1021,44 +1021,44 @@
1021
  "path": "repo:docs/data/public_surface_qa.json",
1022
  "exists": true,
1023
  "bytes": 7693,
1024
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1025
  },
1026
  "mirrors": {
1027
  "hf_space": {
1028
  "path": "hf_space:data/public_surface_qa.json",
1029
  "exists": true,
1030
  "bytes": 7693,
1031
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1032
  },
1033
  "hf_artifacts_data": {
1034
  "path": "hf_artifacts:data/public_surface_qa.json",
1035
  "exists": true,
1036
  "bytes": 7693,
1037
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1038
  },
1039
  "hf_artifacts": {
1040
  "path": "hf_artifacts:docs/data/public_surface_qa.json",
1041
  "exists": true,
1042
  "bytes": 7693,
1043
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1044
  },
1045
  "hf_model_data": {
1046
  "path": "hf_model:data/public_surface_qa.json",
1047
  "exists": true,
1048
  "bytes": 7693,
1049
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1050
  },
1051
  "hf_model_docs_data": {
1052
  "path": "hf_model:docs/data/public_surface_qa.json",
1053
  "exists": true,
1054
  "bytes": 7693,
1055
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1056
  },
1057
  "hf_model": {
1058
  "path": "hf_model:metrics/public_surface_qa.json",
1059
  "exists": true,
1060
  "bytes": 7693,
1061
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1062
  }
1063
  },
1064
  "failures": []
@@ -1217,44 +1217,44 @@
1217
  "path": "repo:docs/data/quality_gates.json",
1218
  "exists": true,
1219
  "bytes": 8640,
1220
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1221
  },
1222
  "mirrors": {
1223
  "hf_space": {
1224
  "path": "hf_space:data/quality_gates.json",
1225
  "exists": true,
1226
  "bytes": 8640,
1227
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1228
  },
1229
  "hf_artifacts_data": {
1230
  "path": "hf_artifacts:data/quality_gates.json",
1231
  "exists": true,
1232
  "bytes": 8640,
1233
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1234
  },
1235
  "hf_artifacts": {
1236
  "path": "hf_artifacts:docs/data/quality_gates.json",
1237
  "exists": true,
1238
  "bytes": 8640,
1239
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1240
  },
1241
  "hf_model_data": {
1242
  "path": "hf_model:data/quality_gates.json",
1243
  "exists": true,
1244
  "bytes": 8640,
1245
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1246
  },
1247
  "hf_model_docs_data": {
1248
  "path": "hf_model:docs/data/quality_gates.json",
1249
  "exists": true,
1250
  "bytes": 8640,
1251
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1252
  },
1253
  "hf_model": {
1254
  "path": "hf_model:metrics/quality_gates.json",
1255
  "exists": true,
1256
  "bytes": 8640,
1257
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1258
  }
1259
  },
1260
  "failures": []
@@ -1658,44 +1658,44 @@
1658
  "path": "repo:docs/data/scope_claims_audit.json",
1659
  "exists": true,
1660
  "bytes": 21313,
1661
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1662
  },
1663
  "mirrors": {
1664
  "hf_space": {
1665
  "path": "hf_space:data/scope_claims_audit.json",
1666
  "exists": true,
1667
  "bytes": 21313,
1668
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1669
  },
1670
  "hf_artifacts_data": {
1671
  "path": "hf_artifacts:data/scope_claims_audit.json",
1672
  "exists": true,
1673
  "bytes": 21313,
1674
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1675
  },
1676
  "hf_artifacts": {
1677
  "path": "hf_artifacts:docs/data/scope_claims_audit.json",
1678
  "exists": true,
1679
  "bytes": 21313,
1680
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1681
  },
1682
  "hf_model_data": {
1683
  "path": "hf_model:data/scope_claims_audit.json",
1684
  "exists": true,
1685
  "bytes": 21313,
1686
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1687
  },
1688
  "hf_model_docs_data": {
1689
  "path": "hf_model:docs/data/scope_claims_audit.json",
1690
  "exists": true,
1691
  "bytes": 21313,
1692
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1693
  },
1694
  "hf_model": {
1695
  "path": "hf_model:metrics/scope_claims_audit.json",
1696
  "exists": true,
1697
  "bytes": 21313,
1698
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1699
  }
1700
  },
1701
  "failures": []
@@ -1756,44 +1756,44 @@
1756
  "path": "repo:docs/data/source_alignment_audit.json",
1757
  "exists": true,
1758
  "bytes": 4432,
1759
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1760
  },
1761
  "mirrors": {
1762
  "hf_space": {
1763
  "path": "hf_space:data/source_alignment_audit.json",
1764
  "exists": true,
1765
  "bytes": 4432,
1766
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1767
  },
1768
  "hf_artifacts_data": {
1769
  "path": "hf_artifacts:data/source_alignment_audit.json",
1770
  "exists": true,
1771
  "bytes": 4432,
1772
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1773
  },
1774
  "hf_artifacts": {
1775
  "path": "hf_artifacts:docs/data/source_alignment_audit.json",
1776
  "exists": true,
1777
  "bytes": 4432,
1778
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1779
  },
1780
  "hf_model_data": {
1781
  "path": "hf_model:data/source_alignment_audit.json",
1782
  "exists": true,
1783
  "bytes": 4432,
1784
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1785
  },
1786
  "hf_model_docs_data": {
1787
  "path": "hf_model:docs/data/source_alignment_audit.json",
1788
  "exists": true,
1789
  "bytes": 4432,
1790
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1791
  },
1792
  "hf_model": {
1793
  "path": "hf_model:metrics/source_alignment_audit.json",
1794
  "exists": true,
1795
  "bytes": 4432,
1796
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1797
  }
1798
  },
1799
  "failures": []
@@ -1853,45 +1853,45 @@
1853
  "local": {
1854
  "path": "repo:docs/data/single_episode_task_model_radar.json",
1855
  "exists": true,
1856
- "bytes": 51327,
1857
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1858
  },
1859
  "mirrors": {
1860
  "hf_space": {
1861
  "path": "hf_space:data/single_episode_task_model_radar.json",
1862
  "exists": true,
1863
- "bytes": 51327,
1864
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1865
  },
1866
  "hf_artifacts_data": {
1867
  "path": "hf_artifacts:data/single_episode_task_model_radar.json",
1868
  "exists": true,
1869
- "bytes": 51327,
1870
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1871
  },
1872
  "hf_artifacts": {
1873
  "path": "hf_artifacts:docs/data/single_episode_task_model_radar.json",
1874
  "exists": true,
1875
- "bytes": 51327,
1876
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1877
  },
1878
  "hf_model_data": {
1879
  "path": "hf_model:data/single_episode_task_model_radar.json",
1880
  "exists": true,
1881
- "bytes": 51327,
1882
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1883
  },
1884
  "hf_model_docs_data": {
1885
  "path": "hf_model:docs/data/single_episode_task_model_radar.json",
1886
  "exists": true,
1887
- "bytes": 51327,
1888
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1889
  },
1890
  "hf_model": {
1891
  "path": "hf_model:metrics/single_episode_task_model_radar.json",
1892
  "exists": true,
1893
- "bytes": 51327,
1894
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1895
  }
1896
  },
1897
  "failures": []
@@ -1902,45 +1902,45 @@
1902
  "local": {
1903
  "path": "repo:docs/data/episode128_task_model_radar.json",
1904
  "exists": true,
1905
- "bytes": 185212,
1906
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1907
  },
1908
  "mirrors": {
1909
  "hf_space": {
1910
  "path": "hf_space:data/episode128_task_model_radar.json",
1911
  "exists": true,
1912
- "bytes": 185212,
1913
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1914
  },
1915
  "hf_artifacts_data": {
1916
  "path": "hf_artifacts:data/episode128_task_model_radar.json",
1917
  "exists": true,
1918
- "bytes": 185212,
1919
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1920
  },
1921
  "hf_artifacts": {
1922
  "path": "hf_artifacts:docs/data/episode128_task_model_radar.json",
1923
  "exists": true,
1924
- "bytes": 185212,
1925
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1926
  },
1927
  "hf_model_data": {
1928
  "path": "hf_model:data/episode128_task_model_radar.json",
1929
  "exists": true,
1930
- "bytes": 185212,
1931
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1932
  },
1933
  "hf_model_docs_data": {
1934
  "path": "hf_model:docs/data/episode128_task_model_radar.json",
1935
  "exists": true,
1936
- "bytes": 185212,
1937
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1938
  },
1939
  "hf_model": {
1940
  "path": "hf_model:metrics/episode128_task_model_radar.json",
1941
  "exists": true,
1942
- "bytes": 185212,
1943
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1944
  }
1945
  },
1946
  "failures": []
@@ -2197,44 +2197,44 @@
2197
  "path": "repo:docs/data/task_surface_integrity.json",
2198
  "exists": true,
2199
  "bytes": 46246,
2200
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2201
  },
2202
  "mirrors": {
2203
  "hf_space": {
2204
  "path": "hf_space:data/task_surface_integrity.json",
2205
  "exists": true,
2206
  "bytes": 46246,
2207
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2208
  },
2209
  "hf_artifacts_data": {
2210
  "path": "hf_artifacts:data/task_surface_integrity.json",
2211
  "exists": true,
2212
  "bytes": 46246,
2213
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2214
  },
2215
  "hf_artifacts": {
2216
  "path": "hf_artifacts:docs/data/task_surface_integrity.json",
2217
  "exists": true,
2218
  "bytes": 46246,
2219
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2220
  },
2221
  "hf_model_data": {
2222
  "path": "hf_model:data/task_surface_integrity.json",
2223
  "exists": true,
2224
  "bytes": 46246,
2225
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2226
  },
2227
  "hf_model_docs_data": {
2228
  "path": "hf_model:docs/data/task_surface_integrity.json",
2229
  "exists": true,
2230
  "bytes": 46246,
2231
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2232
  },
2233
  "hf_model": {
2234
  "path": "hf_model:metrics/task_surface_integrity.json",
2235
  "exists": true,
2236
  "bytes": 46246,
2237
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2238
  }
2239
  },
2240
  "failures": []
@@ -2294,45 +2294,45 @@
2294
  "local": {
2295
  "path": "repo:docs/data/task_method_20_result_matrix.json",
2296
  "exists": true,
2297
- "bytes": 128509,
2298
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2299
  },
2300
  "mirrors": {
2301
  "hf_space": {
2302
  "path": "hf_space:data/task_method_20_result_matrix.json",
2303
  "exists": true,
2304
- "bytes": 128509,
2305
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2306
  },
2307
  "hf_artifacts_data": {
2308
  "path": "hf_artifacts:data/task_method_20_result_matrix.json",
2309
  "exists": true,
2310
- "bytes": 128509,
2311
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2312
  },
2313
  "hf_artifacts": {
2314
  "path": "hf_artifacts:docs/data/task_method_20_result_matrix.json",
2315
  "exists": true,
2316
- "bytes": 128509,
2317
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2318
  },
2319
  "hf_model_data": {
2320
  "path": "hf_model:data/task_method_20_result_matrix.json",
2321
  "exists": true,
2322
- "bytes": 128509,
2323
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2324
  },
2325
  "hf_model_docs_data": {
2326
  "path": "hf_model:docs/data/task_method_20_result_matrix.json",
2327
  "exists": true,
2328
- "bytes": 128509,
2329
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2330
  },
2331
  "hf_model": {
2332
  "path": "hf_model:metrics/task_method_20_result_matrix.json",
2333
  "exists": true,
2334
- "bytes": 128509,
2335
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2336
  }
2337
  },
2338
  "failures": []
@@ -2490,45 +2490,45 @@
2490
  "local": {
2491
  "path": "repo:docs/data/unified_task_model_radar.json",
2492
  "exists": true,
2493
- "bytes": 229035,
2494
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2495
  },
2496
  "mirrors": {
2497
  "hf_space": {
2498
  "path": "hf_space:data/unified_task_model_radar.json",
2499
  "exists": true,
2500
- "bytes": 229035,
2501
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2502
  },
2503
  "hf_artifacts_data": {
2504
  "path": "hf_artifacts:data/unified_task_model_radar.json",
2505
  "exists": true,
2506
- "bytes": 229035,
2507
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2508
  },
2509
  "hf_artifacts": {
2510
  "path": "hf_artifacts:docs/data/unified_task_model_radar.json",
2511
  "exists": true,
2512
- "bytes": 229035,
2513
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2514
  },
2515
  "hf_model_data": {
2516
  "path": "hf_model:data/unified_task_model_radar.json",
2517
  "exists": true,
2518
- "bytes": 229035,
2519
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2520
  },
2521
  "hf_model_docs_data": {
2522
  "path": "hf_model:docs/data/unified_task_model_radar.json",
2523
  "exists": true,
2524
- "bytes": 229035,
2525
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2526
  },
2527
  "hf_model": {
2528
  "path": "hf_model:metrics/unified_task_model_radar.json",
2529
  "exists": true,
2530
- "bytes": 229035,
2531
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2532
  }
2533
  },
2534
  "failures": []
@@ -2540,44 +2540,44 @@
2540
  "path": "repo:docs/data/website_integrity.json",
2541
  "exists": true,
2542
  "bytes": 20178,
2543
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2544
  },
2545
  "mirrors": {
2546
  "hf_space": {
2547
  "path": "hf_space:data/website_integrity.json",
2548
  "exists": true,
2549
  "bytes": 20178,
2550
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2551
  },
2552
  "hf_artifacts_data": {
2553
  "path": "hf_artifacts:data/website_integrity.json",
2554
  "exists": true,
2555
  "bytes": 20178,
2556
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2557
  },
2558
  "hf_artifacts": {
2559
  "path": "hf_artifacts:docs/data/website_integrity.json",
2560
  "exists": true,
2561
  "bytes": 20178,
2562
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2563
  },
2564
  "hf_model_data": {
2565
  "path": "hf_model:data/website_integrity.json",
2566
  "exists": true,
2567
  "bytes": 20178,
2568
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2569
  },
2570
  "hf_model_docs_data": {
2571
  "path": "hf_model:docs/data/website_integrity.json",
2572
  "exists": true,
2573
  "bytes": 20178,
2574
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2575
  },
2576
  "hf_model": {
2577
  "path": "hf_model:metrics/website_integrity.json",
2578
  "exists": true,
2579
  "bytes": 20178,
2580
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2581
  }
2582
  },
2583
  "failures": []
@@ -2760,33 +2760,33 @@
2760
  "local": {
2761
  "path": "repo:docs/assets/charts/single_episode_task_model_radar.svg",
2762
  "exists": true,
2763
- "bytes": 35232,
2764
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2765
  },
2766
  "mirrors": {
2767
  "hf_space": {
2768
  "path": "hf_space:assets/charts/single_episode_task_model_radar.svg",
2769
  "exists": true,
2770
- "bytes": 35232,
2771
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2772
  },
2773
  "hf_artifacts_docs": {
2774
  "path": "hf_artifacts:docs/assets/charts/single_episode_task_model_radar.svg",
2775
  "exists": true,
2776
- "bytes": 35232,
2777
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2778
  },
2779
  "hf_artifacts_card": {
2780
  "path": "hf_artifacts:assets/charts/single_episode_task_model_radar.svg",
2781
  "exists": true,
2782
- "bytes": 35232,
2783
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2784
  },
2785
  "hf_model": {
2786
  "path": "hf_model:assets/charts/single_episode_task_model_radar.svg",
2787
  "exists": true,
2788
- "bytes": 35232,
2789
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2790
  }
2791
  },
2792
  "failures": []
@@ -2797,33 +2797,33 @@
2797
  "local": {
2798
  "path": "repo:docs/assets/charts/episode128_task_model_radar.svg",
2799
  "exists": true,
2800
- "bytes": 51915,
2801
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2802
  },
2803
  "mirrors": {
2804
  "hf_space": {
2805
  "path": "hf_space:assets/charts/episode128_task_model_radar.svg",
2806
  "exists": true,
2807
- "bytes": 51915,
2808
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2809
  },
2810
  "hf_artifacts_docs": {
2811
  "path": "hf_artifacts:docs/assets/charts/episode128_task_model_radar.svg",
2812
  "exists": true,
2813
- "bytes": 51915,
2814
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2815
  },
2816
  "hf_artifacts_card": {
2817
  "path": "hf_artifacts:assets/charts/episode128_task_model_radar.svg",
2818
  "exists": true,
2819
- "bytes": 51915,
2820
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2821
  },
2822
  "hf_model": {
2823
  "path": "hf_model:assets/charts/episode128_task_model_radar.svg",
2824
  "exists": true,
2825
- "bytes": 51915,
2826
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2827
  }
2828
  },
2829
  "failures": []
@@ -2871,33 +2871,33 @@
2871
  "local": {
2872
  "path": "repo:docs/assets/charts/unified_task_model_radar.svg",
2873
  "exists": true,
2874
- "bytes": 57938,
2875
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2876
  },
2877
  "mirrors": {
2878
  "hf_space": {
2879
  "path": "hf_space:assets/charts/unified_task_model_radar.svg",
2880
  "exists": true,
2881
- "bytes": 57938,
2882
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2883
  },
2884
  "hf_artifacts_docs": {
2885
  "path": "hf_artifacts:docs/assets/charts/unified_task_model_radar.svg",
2886
  "exists": true,
2887
- "bytes": 57938,
2888
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2889
  },
2890
  "hf_artifacts_card": {
2891
  "path": "hf_artifacts:assets/charts/unified_task_model_radar.svg",
2892
  "exists": true,
2893
- "bytes": 57938,
2894
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2895
  },
2896
  "hf_model": {
2897
  "path": "hf_model:assets/charts/unified_task_model_radar.svg",
2898
  "exists": true,
2899
- "bytes": 57938,
2900
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2901
  }
2902
  },
2903
  "failures": []
@@ -5327,21 +5327,21 @@
5327
  "local": {
5328
  "path": "repo:scripts/build_artifact_index.py",
5329
  "exists": true,
5330
- "bytes": 68279,
5331
- "sha256": "69b43ad5d3dc5a6893c4592fa47fff6a7a87691728ec2c61b121ec262d00bf2a"
5332
  },
5333
  "mirrors": {
5334
  "hf_artifacts": {
5335
  "path": "hf_artifacts:scripts/build_artifact_index.py",
5336
  "exists": true,
5337
- "bytes": 68279,
5338
- "sha256": "69b43ad5d3dc5a6893c4592fa47fff6a7a87691728ec2c61b121ec262d00bf2a"
5339
  },
5340
  "hf_model": {
5341
  "path": "hf_model:scripts/build_artifact_index.py",
5342
  "exists": true,
5343
- "bytes": 68279,
5344
- "sha256": "69b43ad5d3dc5a6893c4592fa47fff6a7a87691728ec2c61b121ec262d00bf2a"
5345
  }
5346
  },
5347
  "failures": []
@@ -5402,21 +5402,21 @@
5402
  "local": {
5403
  "path": "repo:scripts/build_figure_index.py",
5404
  "exists": true,
5405
- "bytes": 16845,
5406
- "sha256": "3f91f7f13a3fb08ab57c2f0a6b320102e9d5ae19b102b71499edb5b8fd5a2cec"
5407
  },
5408
  "mirrors": {
5409
  "hf_artifacts": {
5410
  "path": "hf_artifacts:scripts/build_figure_index.py",
5411
  "exists": true,
5412
- "bytes": 16845,
5413
- "sha256": "3f91f7f13a3fb08ab57c2f0a6b320102e9d5ae19b102b71499edb5b8fd5a2cec"
5414
  },
5415
  "hf_model": {
5416
  "path": "hf_model:scripts/build_figure_index.py",
5417
  "exists": true,
5418
- "bytes": 16845,
5419
- "sha256": "3f91f7f13a3fb08ab57c2f0a6b320102e9d5ae19b102b71499edb5b8fd5a2cec"
5420
  }
5421
  },
5422
  "failures": []
@@ -5877,21 +5877,21 @@
5877
  "local": {
5878
  "path": "repo:scripts/build_unified_task_model_radar.py",
5879
  "exists": true,
5880
- "bytes": 69956,
5881
- "sha256": "3d137eced3c376ed674090b823ff9439cea0e8e8670ea9de02c3d3781305a0c9"
5882
  },
5883
  "mirrors": {
5884
  "hf_artifacts": {
5885
  "path": "hf_artifacts:scripts/build_unified_task_model_radar.py",
5886
  "exists": true,
5887
- "bytes": 69956,
5888
- "sha256": "3d137eced3c376ed674090b823ff9439cea0e8e8670ea9de02c3d3781305a0c9"
5889
  },
5890
  "hf_model": {
5891
  "path": "hf_model:scripts/build_unified_task_model_radar.py",
5892
  "exists": true,
5893
- "bytes": 69956,
5894
- "sha256": "3d137eced3c376ed674090b823ff9439cea0e8e8670ea9de02c3d3781305a0c9"
5895
  }
5896
  },
5897
  "failures": []
@@ -6331,39 +6331,39 @@
6331
  "local": {
6332
  "path": "repo:docs/index.html",
6333
  "exists": true,
6334
- "bytes": 348267,
6335
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6336
  },
6337
  "mirrors": {
6338
  "hf_space": {
6339
  "path": "hf_space:index.html",
6340
  "exists": true,
6341
- "bytes": 348267,
6342
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6343
  },
6344
  "hf_artifacts_root": {
6345
  "path": "hf_artifacts:index.html",
6346
  "exists": true,
6347
- "bytes": 348267,
6348
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6349
  },
6350
  "hf_artifacts_docs": {
6351
  "path": "hf_artifacts:docs/index.html",
6352
  "exists": true,
6353
- "bytes": 348267,
6354
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6355
  },
6356
  "hf_model": {
6357
  "path": "hf_model:index.html",
6358
  "exists": true,
6359
- "bytes": 348267,
6360
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6361
  },
6362
  "hf_model_docs": {
6363
  "path": "hf_model:docs/index.html",
6364
  "exists": true,
6365
- "bytes": 348267,
6366
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6367
  }
6368
  },
6369
  "failures": []
@@ -31027,27 +31027,27 @@
31027
  "local": {
31028
  "path": "repo:FIGURE_INDEX.md",
31029
  "exists": true,
31030
- "bytes": 7027,
31031
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
31032
  },
31033
  "mirrors": {
31034
  "hf_space": {
31035
  "path": "hf_space:FIGURE_INDEX.md",
31036
  "exists": true,
31037
- "bytes": 7027,
31038
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
31039
  },
31040
  "hf_artifacts": {
31041
  "path": "hf_artifacts:FIGURE_INDEX.md",
31042
  "exists": true,
31043
- "bytes": 7027,
31044
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
31045
  },
31046
  "hf_model": {
31047
  "path": "hf_model:FIGURE_INDEX.md",
31048
  "exists": true,
31049
- "bytes": 7027,
31050
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
31051
  }
31052
  },
31053
  "failures": []
@@ -31554,27 +31554,27 @@
31554
  "local": {
31555
  "path": "repo:TASK_METHOD_20_RESULT_MATRIX.md",
31556
  "exists": true,
31557
- "bytes": 14715,
31558
- "sha256": "acf665d5c07f15224b42d9da93bc2586279852bd60241868391ba7c0a462a24f"
31559
  },
31560
  "mirrors": {
31561
  "hf_space": {
31562
  "path": "hf_space:TASK_METHOD_20_RESULT_MATRIX.md",
31563
  "exists": true,
31564
- "bytes": 14715,
31565
- "sha256": "acf665d5c07f15224b42d9da93bc2586279852bd60241868391ba7c0a462a24f"
31566
  },
31567
  "hf_artifacts": {
31568
  "path": "hf_artifacts:TASK_METHOD_20_RESULT_MATRIX.md",
31569
  "exists": true,
31570
- "bytes": 14715,
31571
- "sha256": "acf665d5c07f15224b42d9da93bc2586279852bd60241868391ba7c0a462a24f"
31572
  },
31573
  "hf_model": {
31574
  "path": "hf_model:TASK_METHOD_20_RESULT_MATRIX.md",
31575
  "exists": true,
31576
- "bytes": 14715,
31577
- "sha256": "acf665d5c07f15224b42d9da93bc2586279852bd60241868391ba7c0a462a24f"
31578
  }
31579
  },
31580
  "failures": []
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T20:33:57+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
  "group_count": 1258,
 
138
  "local": {
139
  "path": "repo:docs/data/artifact_index.json",
140
  "exists": true,
141
+ "bytes": 124471,
142
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
143
  },
144
  "mirrors": {
145
  "hf_space": {
146
  "path": "hf_space:data/artifact_index.json",
147
  "exists": true,
148
+ "bytes": 124471,
149
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
150
  },
151
  "hf_artifacts_data": {
152
  "path": "hf_artifacts:data/artifact_index.json",
153
  "exists": true,
154
+ "bytes": 124471,
155
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
156
  },
157
  "hf_artifacts": {
158
  "path": "hf_artifacts:docs/data/artifact_index.json",
159
  "exists": true,
160
+ "bytes": 124471,
161
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
162
  },
163
  "hf_model_data": {
164
  "path": "hf_model:data/artifact_index.json",
165
  "exists": true,
166
+ "bytes": 124471,
167
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
168
  },
169
  "hf_model_docs_data": {
170
  "path": "hf_model:docs/data/artifact_index.json",
171
  "exists": true,
172
+ "bytes": 124471,
173
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
174
  },
175
  "hf_model": {
176
  "path": "hf_model:metrics/artifact_index.json",
177
  "exists": true,
178
+ "bytes": 124471,
179
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
180
  }
181
  },
182
  "failures": []
 
334
  "local": {
335
  "path": "repo:docs/data/figure_index.json",
336
  "exists": true,
337
+ "bytes": 19526,
338
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
339
  },
340
  "mirrors": {
341
  "hf_space": {
342
  "path": "hf_space:data/figure_index.json",
343
  "exists": true,
344
+ "bytes": 19526,
345
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
346
  },
347
  "hf_artifacts_data": {
348
  "path": "hf_artifacts:data/figure_index.json",
349
  "exists": true,
350
+ "bytes": 19526,
351
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
352
  },
353
  "hf_artifacts": {
354
  "path": "hf_artifacts:docs/data/figure_index.json",
355
  "exists": true,
356
+ "bytes": 19526,
357
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
358
  },
359
  "hf_model_data": {
360
  "path": "hf_model:data/figure_index.json",
361
  "exists": true,
362
+ "bytes": 19526,
363
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
364
  },
365
  "hf_model_docs_data": {
366
  "path": "hf_model:docs/data/figure_index.json",
367
  "exists": true,
368
+ "bytes": 19526,
369
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
370
  },
371
  "hf_model": {
372
  "path": "hf_model:metrics/figure_index.json",
373
  "exists": true,
374
+ "bytes": 19526,
375
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
376
  }
377
  },
378
  "failures": []
 
972
  "path": "repo:docs/data/publication_audit.json",
973
  "exists": true,
974
  "bytes": 10940,
975
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
976
  },
977
  "mirrors": {
978
  "hf_space": {
979
  "path": "hf_space:data/publication_audit.json",
980
  "exists": true,
981
  "bytes": 10940,
982
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
983
  },
984
  "hf_artifacts_data": {
985
  "path": "hf_artifacts:data/publication_audit.json",
986
  "exists": true,
987
  "bytes": 10940,
988
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
989
  },
990
  "hf_artifacts": {
991
  "path": "hf_artifacts:docs/data/publication_audit.json",
992
  "exists": true,
993
  "bytes": 10940,
994
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
995
  },
996
  "hf_model_data": {
997
  "path": "hf_model:data/publication_audit.json",
998
  "exists": true,
999
  "bytes": 10940,
1000
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
1001
  },
1002
  "hf_model_docs_data": {
1003
  "path": "hf_model:docs/data/publication_audit.json",
1004
  "exists": true,
1005
  "bytes": 10940,
1006
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
1007
  },
1008
  "hf_model": {
1009
  "path": "hf_model:metrics/publication_audit.json",
1010
  "exists": true,
1011
  "bytes": 10940,
1012
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
1013
  }
1014
  },
1015
  "failures": []
 
1021
  "path": "repo:docs/data/public_surface_qa.json",
1022
  "exists": true,
1023
  "bytes": 7693,
1024
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1025
  },
1026
  "mirrors": {
1027
  "hf_space": {
1028
  "path": "hf_space:data/public_surface_qa.json",
1029
  "exists": true,
1030
  "bytes": 7693,
1031
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1032
  },
1033
  "hf_artifacts_data": {
1034
  "path": "hf_artifacts:data/public_surface_qa.json",
1035
  "exists": true,
1036
  "bytes": 7693,
1037
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1038
  },
1039
  "hf_artifacts": {
1040
  "path": "hf_artifacts:docs/data/public_surface_qa.json",
1041
  "exists": true,
1042
  "bytes": 7693,
1043
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1044
  },
1045
  "hf_model_data": {
1046
  "path": "hf_model:data/public_surface_qa.json",
1047
  "exists": true,
1048
  "bytes": 7693,
1049
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1050
  },
1051
  "hf_model_docs_data": {
1052
  "path": "hf_model:docs/data/public_surface_qa.json",
1053
  "exists": true,
1054
  "bytes": 7693,
1055
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1056
  },
1057
  "hf_model": {
1058
  "path": "hf_model:metrics/public_surface_qa.json",
1059
  "exists": true,
1060
  "bytes": 7693,
1061
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1062
  }
1063
  },
1064
  "failures": []
 
1217
  "path": "repo:docs/data/quality_gates.json",
1218
  "exists": true,
1219
  "bytes": 8640,
1220
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1221
  },
1222
  "mirrors": {
1223
  "hf_space": {
1224
  "path": "hf_space:data/quality_gates.json",
1225
  "exists": true,
1226
  "bytes": 8640,
1227
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1228
  },
1229
  "hf_artifacts_data": {
1230
  "path": "hf_artifacts:data/quality_gates.json",
1231
  "exists": true,
1232
  "bytes": 8640,
1233
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1234
  },
1235
  "hf_artifacts": {
1236
  "path": "hf_artifacts:docs/data/quality_gates.json",
1237
  "exists": true,
1238
  "bytes": 8640,
1239
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1240
  },
1241
  "hf_model_data": {
1242
  "path": "hf_model:data/quality_gates.json",
1243
  "exists": true,
1244
  "bytes": 8640,
1245
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1246
  },
1247
  "hf_model_docs_data": {
1248
  "path": "hf_model:docs/data/quality_gates.json",
1249
  "exists": true,
1250
  "bytes": 8640,
1251
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1252
  },
1253
  "hf_model": {
1254
  "path": "hf_model:metrics/quality_gates.json",
1255
  "exists": true,
1256
  "bytes": 8640,
1257
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1258
  }
1259
  },
1260
  "failures": []
 
1658
  "path": "repo:docs/data/scope_claims_audit.json",
1659
  "exists": true,
1660
  "bytes": 21313,
1661
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1662
  },
1663
  "mirrors": {
1664
  "hf_space": {
1665
  "path": "hf_space:data/scope_claims_audit.json",
1666
  "exists": true,
1667
  "bytes": 21313,
1668
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1669
  },
1670
  "hf_artifacts_data": {
1671
  "path": "hf_artifacts:data/scope_claims_audit.json",
1672
  "exists": true,
1673
  "bytes": 21313,
1674
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1675
  },
1676
  "hf_artifacts": {
1677
  "path": "hf_artifacts:docs/data/scope_claims_audit.json",
1678
  "exists": true,
1679
  "bytes": 21313,
1680
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1681
  },
1682
  "hf_model_data": {
1683
  "path": "hf_model:data/scope_claims_audit.json",
1684
  "exists": true,
1685
  "bytes": 21313,
1686
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1687
  },
1688
  "hf_model_docs_data": {
1689
  "path": "hf_model:docs/data/scope_claims_audit.json",
1690
  "exists": true,
1691
  "bytes": 21313,
1692
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1693
  },
1694
  "hf_model": {
1695
  "path": "hf_model:metrics/scope_claims_audit.json",
1696
  "exists": true,
1697
  "bytes": 21313,
1698
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1699
  }
1700
  },
1701
  "failures": []
 
1756
  "path": "repo:docs/data/source_alignment_audit.json",
1757
  "exists": true,
1758
  "bytes": 4432,
1759
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1760
  },
1761
  "mirrors": {
1762
  "hf_space": {
1763
  "path": "hf_space:data/source_alignment_audit.json",
1764
  "exists": true,
1765
  "bytes": 4432,
1766
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1767
  },
1768
  "hf_artifacts_data": {
1769
  "path": "hf_artifacts:data/source_alignment_audit.json",
1770
  "exists": true,
1771
  "bytes": 4432,
1772
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1773
  },
1774
  "hf_artifacts": {
1775
  "path": "hf_artifacts:docs/data/source_alignment_audit.json",
1776
  "exists": true,
1777
  "bytes": 4432,
1778
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1779
  },
1780
  "hf_model_data": {
1781
  "path": "hf_model:data/source_alignment_audit.json",
1782
  "exists": true,
1783
  "bytes": 4432,
1784
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1785
  },
1786
  "hf_model_docs_data": {
1787
  "path": "hf_model:docs/data/source_alignment_audit.json",
1788
  "exists": true,
1789
  "bytes": 4432,
1790
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1791
  },
1792
  "hf_model": {
1793
  "path": "hf_model:metrics/source_alignment_audit.json",
1794
  "exists": true,
1795
  "bytes": 4432,
1796
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1797
  }
1798
  },
1799
  "failures": []
 
1853
  "local": {
1854
  "path": "repo:docs/data/single_episode_task_model_radar.json",
1855
  "exists": true,
1856
+ "bytes": 52866,
1857
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1858
  },
1859
  "mirrors": {
1860
  "hf_space": {
1861
  "path": "hf_space:data/single_episode_task_model_radar.json",
1862
  "exists": true,
1863
+ "bytes": 52866,
1864
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1865
  },
1866
  "hf_artifacts_data": {
1867
  "path": "hf_artifacts:data/single_episode_task_model_radar.json",
1868
  "exists": true,
1869
+ "bytes": 52866,
1870
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1871
  },
1872
  "hf_artifacts": {
1873
  "path": "hf_artifacts:docs/data/single_episode_task_model_radar.json",
1874
  "exists": true,
1875
+ "bytes": 52866,
1876
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1877
  },
1878
  "hf_model_data": {
1879
  "path": "hf_model:data/single_episode_task_model_radar.json",
1880
  "exists": true,
1881
+ "bytes": 52866,
1882
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1883
  },
1884
  "hf_model_docs_data": {
1885
  "path": "hf_model:docs/data/single_episode_task_model_radar.json",
1886
  "exists": true,
1887
+ "bytes": 52866,
1888
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1889
  },
1890
  "hf_model": {
1891
  "path": "hf_model:metrics/single_episode_task_model_radar.json",
1892
  "exists": true,
1893
+ "bytes": 52866,
1894
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1895
  }
1896
  },
1897
  "failures": []
 
1902
  "local": {
1903
  "path": "repo:docs/data/episode128_task_model_radar.json",
1904
  "exists": true,
1905
+ "bytes": 186997,
1906
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1907
  },
1908
  "mirrors": {
1909
  "hf_space": {
1910
  "path": "hf_space:data/episode128_task_model_radar.json",
1911
  "exists": true,
1912
+ "bytes": 186997,
1913
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1914
  },
1915
  "hf_artifacts_data": {
1916
  "path": "hf_artifacts:data/episode128_task_model_radar.json",
1917
  "exists": true,
1918
+ "bytes": 186997,
1919
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1920
  },
1921
  "hf_artifacts": {
1922
  "path": "hf_artifacts:docs/data/episode128_task_model_radar.json",
1923
  "exists": true,
1924
+ "bytes": 186997,
1925
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1926
  },
1927
  "hf_model_data": {
1928
  "path": "hf_model:data/episode128_task_model_radar.json",
1929
  "exists": true,
1930
+ "bytes": 186997,
1931
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1932
  },
1933
  "hf_model_docs_data": {
1934
  "path": "hf_model:docs/data/episode128_task_model_radar.json",
1935
  "exists": true,
1936
+ "bytes": 186997,
1937
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1938
  },
1939
  "hf_model": {
1940
  "path": "hf_model:metrics/episode128_task_model_radar.json",
1941
  "exists": true,
1942
+ "bytes": 186997,
1943
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1944
  }
1945
  },
1946
  "failures": []
 
2197
  "path": "repo:docs/data/task_surface_integrity.json",
2198
  "exists": true,
2199
  "bytes": 46246,
2200
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2201
  },
2202
  "mirrors": {
2203
  "hf_space": {
2204
  "path": "hf_space:data/task_surface_integrity.json",
2205
  "exists": true,
2206
  "bytes": 46246,
2207
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2208
  },
2209
  "hf_artifacts_data": {
2210
  "path": "hf_artifacts:data/task_surface_integrity.json",
2211
  "exists": true,
2212
  "bytes": 46246,
2213
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2214
  },
2215
  "hf_artifacts": {
2216
  "path": "hf_artifacts:docs/data/task_surface_integrity.json",
2217
  "exists": true,
2218
  "bytes": 46246,
2219
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2220
  },
2221
  "hf_model_data": {
2222
  "path": "hf_model:data/task_surface_integrity.json",
2223
  "exists": true,
2224
  "bytes": 46246,
2225
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2226
  },
2227
  "hf_model_docs_data": {
2228
  "path": "hf_model:docs/data/task_surface_integrity.json",
2229
  "exists": true,
2230
  "bytes": 46246,
2231
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2232
  },
2233
  "hf_model": {
2234
  "path": "hf_model:metrics/task_surface_integrity.json",
2235
  "exists": true,
2236
  "bytes": 46246,
2237
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2238
  }
2239
  },
2240
  "failures": []
 
2294
  "local": {
2295
  "path": "repo:docs/data/task_method_20_result_matrix.json",
2296
  "exists": true,
2297
+ "bytes": 128991,
2298
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2299
  },
2300
  "mirrors": {
2301
  "hf_space": {
2302
  "path": "hf_space:data/task_method_20_result_matrix.json",
2303
  "exists": true,
2304
+ "bytes": 128991,
2305
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2306
  },
2307
  "hf_artifacts_data": {
2308
  "path": "hf_artifacts:data/task_method_20_result_matrix.json",
2309
  "exists": true,
2310
+ "bytes": 128991,
2311
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2312
  },
2313
  "hf_artifacts": {
2314
  "path": "hf_artifacts:docs/data/task_method_20_result_matrix.json",
2315
  "exists": true,
2316
+ "bytes": 128991,
2317
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2318
  },
2319
  "hf_model_data": {
2320
  "path": "hf_model:data/task_method_20_result_matrix.json",
2321
  "exists": true,
2322
+ "bytes": 128991,
2323
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2324
  },
2325
  "hf_model_docs_data": {
2326
  "path": "hf_model:docs/data/task_method_20_result_matrix.json",
2327
  "exists": true,
2328
+ "bytes": 128991,
2329
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2330
  },
2331
  "hf_model": {
2332
  "path": "hf_model:metrics/task_method_20_result_matrix.json",
2333
  "exists": true,
2334
+ "bytes": 128991,
2335
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2336
  }
2337
  },
2338
  "failures": []
 
2490
  "local": {
2491
  "path": "repo:docs/data/unified_task_model_radar.json",
2492
  "exists": true,
2493
+ "bytes": 230938,
2494
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2495
  },
2496
  "mirrors": {
2497
  "hf_space": {
2498
  "path": "hf_space:data/unified_task_model_radar.json",
2499
  "exists": true,
2500
+ "bytes": 230938,
2501
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2502
  },
2503
  "hf_artifacts_data": {
2504
  "path": "hf_artifacts:data/unified_task_model_radar.json",
2505
  "exists": true,
2506
+ "bytes": 230938,
2507
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2508
  },
2509
  "hf_artifacts": {
2510
  "path": "hf_artifacts:docs/data/unified_task_model_radar.json",
2511
  "exists": true,
2512
+ "bytes": 230938,
2513
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2514
  },
2515
  "hf_model_data": {
2516
  "path": "hf_model:data/unified_task_model_radar.json",
2517
  "exists": true,
2518
+ "bytes": 230938,
2519
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2520
  },
2521
  "hf_model_docs_data": {
2522
  "path": "hf_model:docs/data/unified_task_model_radar.json",
2523
  "exists": true,
2524
+ "bytes": 230938,
2525
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2526
  },
2527
  "hf_model": {
2528
  "path": "hf_model:metrics/unified_task_model_radar.json",
2529
  "exists": true,
2530
+ "bytes": 230938,
2531
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2532
  }
2533
  },
2534
  "failures": []
 
2540
  "path": "repo:docs/data/website_integrity.json",
2541
  "exists": true,
2542
  "bytes": 20178,
2543
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2544
  },
2545
  "mirrors": {
2546
  "hf_space": {
2547
  "path": "hf_space:data/website_integrity.json",
2548
  "exists": true,
2549
  "bytes": 20178,
2550
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2551
  },
2552
  "hf_artifacts_data": {
2553
  "path": "hf_artifacts:data/website_integrity.json",
2554
  "exists": true,
2555
  "bytes": 20178,
2556
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2557
  },
2558
  "hf_artifacts": {
2559
  "path": "hf_artifacts:docs/data/website_integrity.json",
2560
  "exists": true,
2561
  "bytes": 20178,
2562
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2563
  },
2564
  "hf_model_data": {
2565
  "path": "hf_model:data/website_integrity.json",
2566
  "exists": true,
2567
  "bytes": 20178,
2568
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2569
  },
2570
  "hf_model_docs_data": {
2571
  "path": "hf_model:docs/data/website_integrity.json",
2572
  "exists": true,
2573
  "bytes": 20178,
2574
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2575
  },
2576
  "hf_model": {
2577
  "path": "hf_model:metrics/website_integrity.json",
2578
  "exists": true,
2579
  "bytes": 20178,
2580
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2581
  }
2582
  },
2583
  "failures": []
 
2760
  "local": {
2761
  "path": "repo:docs/assets/charts/single_episode_task_model_radar.svg",
2762
  "exists": true,
2763
+ "bytes": 36930,
2764
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2765
  },
2766
  "mirrors": {
2767
  "hf_space": {
2768
  "path": "hf_space:assets/charts/single_episode_task_model_radar.svg",
2769
  "exists": true,
2770
+ "bytes": 36930,
2771
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2772
  },
2773
  "hf_artifacts_docs": {
2774
  "path": "hf_artifacts:docs/assets/charts/single_episode_task_model_radar.svg",
2775
  "exists": true,
2776
+ "bytes": 36930,
2777
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2778
  },
2779
  "hf_artifacts_card": {
2780
  "path": "hf_artifacts:assets/charts/single_episode_task_model_radar.svg",
2781
  "exists": true,
2782
+ "bytes": 36930,
2783
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2784
  },
2785
  "hf_model": {
2786
  "path": "hf_model:assets/charts/single_episode_task_model_radar.svg",
2787
  "exists": true,
2788
+ "bytes": 36930,
2789
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2790
  }
2791
  },
2792
  "failures": []
 
2797
  "local": {
2798
  "path": "repo:docs/assets/charts/episode128_task_model_radar.svg",
2799
  "exists": true,
2800
+ "bytes": 79370,
2801
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2802
  },
2803
  "mirrors": {
2804
  "hf_space": {
2805
  "path": "hf_space:assets/charts/episode128_task_model_radar.svg",
2806
  "exists": true,
2807
+ "bytes": 79370,
2808
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2809
  },
2810
  "hf_artifacts_docs": {
2811
  "path": "hf_artifacts:docs/assets/charts/episode128_task_model_radar.svg",
2812
  "exists": true,
2813
+ "bytes": 79370,
2814
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2815
  },
2816
  "hf_artifacts_card": {
2817
  "path": "hf_artifacts:assets/charts/episode128_task_model_radar.svg",
2818
  "exists": true,
2819
+ "bytes": 79370,
2820
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2821
  },
2822
  "hf_model": {
2823
  "path": "hf_model:assets/charts/episode128_task_model_radar.svg",
2824
  "exists": true,
2825
+ "bytes": 79370,
2826
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2827
  }
2828
  },
2829
  "failures": []
 
2871
  "local": {
2872
  "path": "repo:docs/assets/charts/unified_task_model_radar.svg",
2873
  "exists": true,
2874
+ "bytes": 98527,
2875
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2876
  },
2877
  "mirrors": {
2878
  "hf_space": {
2879
  "path": "hf_space:assets/charts/unified_task_model_radar.svg",
2880
  "exists": true,
2881
+ "bytes": 98527,
2882
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2883
  },
2884
  "hf_artifacts_docs": {
2885
  "path": "hf_artifacts:docs/assets/charts/unified_task_model_radar.svg",
2886
  "exists": true,
2887
+ "bytes": 98527,
2888
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2889
  },
2890
  "hf_artifacts_card": {
2891
  "path": "hf_artifacts:assets/charts/unified_task_model_radar.svg",
2892
  "exists": true,
2893
+ "bytes": 98527,
2894
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2895
  },
2896
  "hf_model": {
2897
  "path": "hf_model:assets/charts/unified_task_model_radar.svg",
2898
  "exists": true,
2899
+ "bytes": 98527,
2900
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2901
  }
2902
  },
2903
  "failures": []
 
5327
  "local": {
5328
  "path": "repo:scripts/build_artifact_index.py",
5329
  "exists": true,
5330
+ "bytes": 68407,
5331
+ "sha256": "9f5fc1af7c4c23083975066691bf6d75aa99280961198e2c47be88754c142418"
5332
  },
5333
  "mirrors": {
5334
  "hf_artifacts": {
5335
  "path": "hf_artifacts:scripts/build_artifact_index.py",
5336
  "exists": true,
5337
+ "bytes": 68407,
5338
+ "sha256": "9f5fc1af7c4c23083975066691bf6d75aa99280961198e2c47be88754c142418"
5339
  },
5340
  "hf_model": {
5341
  "path": "hf_model:scripts/build_artifact_index.py",
5342
  "exists": true,
5343
+ "bytes": 68407,
5344
+ "sha256": "9f5fc1af7c4c23083975066691bf6d75aa99280961198e2c47be88754c142418"
5345
  }
5346
  },
5347
  "failures": []
 
5402
  "local": {
5403
  "path": "repo:scripts/build_figure_index.py",
5404
  "exists": true,
5405
+ "bytes": 16886,
5406
+ "sha256": "fc44ea60b6f491d290a56d4c1097fcfc37c024f8a9bc3b3db013252f45d96e64"
5407
  },
5408
  "mirrors": {
5409
  "hf_artifacts": {
5410
  "path": "hf_artifacts:scripts/build_figure_index.py",
5411
  "exists": true,
5412
+ "bytes": 16886,
5413
+ "sha256": "fc44ea60b6f491d290a56d4c1097fcfc37c024f8a9bc3b3db013252f45d96e64"
5414
  },
5415
  "hf_model": {
5416
  "path": "hf_model:scripts/build_figure_index.py",
5417
  "exists": true,
5418
+ "bytes": 16886,
5419
+ "sha256": "fc44ea60b6f491d290a56d4c1097fcfc37c024f8a9bc3b3db013252f45d96e64"
5420
  }
5421
  },
5422
  "failures": []
 
5877
  "local": {
5878
  "path": "repo:scripts/build_unified_task_model_radar.py",
5879
  "exists": true,
5880
+ "bytes": 78735,
5881
+ "sha256": "8e7110381d56e5ef0effb7dcc7935d1bb0ce2653289504a50d04f29571a4e6b7"
5882
  },
5883
  "mirrors": {
5884
  "hf_artifacts": {
5885
  "path": "hf_artifacts:scripts/build_unified_task_model_radar.py",
5886
  "exists": true,
5887
+ "bytes": 78735,
5888
+ "sha256": "8e7110381d56e5ef0effb7dcc7935d1bb0ce2653289504a50d04f29571a4e6b7"
5889
  },
5890
  "hf_model": {
5891
  "path": "hf_model:scripts/build_unified_task_model_radar.py",
5892
  "exists": true,
5893
+ "bytes": 78735,
5894
+ "sha256": "8e7110381d56e5ef0effb7dcc7935d1bb0ce2653289504a50d04f29571a4e6b7"
5895
  }
5896
  },
5897
  "failures": []
 
6331
  "local": {
6332
  "path": "repo:docs/index.html",
6333
  "exists": true,
6334
+ "bytes": 348384,
6335
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6336
  },
6337
  "mirrors": {
6338
  "hf_space": {
6339
  "path": "hf_space:index.html",
6340
  "exists": true,
6341
+ "bytes": 348384,
6342
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6343
  },
6344
  "hf_artifacts_root": {
6345
  "path": "hf_artifacts:index.html",
6346
  "exists": true,
6347
+ "bytes": 348384,
6348
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6349
  },
6350
  "hf_artifacts_docs": {
6351
  "path": "hf_artifacts:docs/index.html",
6352
  "exists": true,
6353
+ "bytes": 348384,
6354
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6355
  },
6356
  "hf_model": {
6357
  "path": "hf_model:index.html",
6358
  "exists": true,
6359
+ "bytes": 348384,
6360
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6361
  },
6362
  "hf_model_docs": {
6363
  "path": "hf_model:docs/index.html",
6364
  "exists": true,
6365
+ "bytes": 348384,
6366
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6367
  }
6368
  },
6369
  "failures": []
 
31027
  "local": {
31028
  "path": "repo:FIGURE_INDEX.md",
31029
  "exists": true,
31030
+ "bytes": 7068,
31031
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
31032
  },
31033
  "mirrors": {
31034
  "hf_space": {
31035
  "path": "hf_space:FIGURE_INDEX.md",
31036
  "exists": true,
31037
+ "bytes": 7068,
31038
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
31039
  },
31040
  "hf_artifacts": {
31041
  "path": "hf_artifacts:FIGURE_INDEX.md",
31042
  "exists": true,
31043
+ "bytes": 7068,
31044
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
31045
  },
31046
  "hf_model": {
31047
  "path": "hf_model:FIGURE_INDEX.md",
31048
  "exists": true,
31049
+ "bytes": 7068,
31050
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
31051
  }
31052
  },
31053
  "failures": []
 
31554
  "local": {
31555
  "path": "repo:TASK_METHOD_20_RESULT_MATRIX.md",
31556
  "exists": true,
31557
+ "bytes": 14862,
31558
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
31559
  },
31560
  "mirrors": {
31561
  "hf_space": {
31562
  "path": "hf_space:TASK_METHOD_20_RESULT_MATRIX.md",
31563
  "exists": true,
31564
+ "bytes": 14862,
31565
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
31566
  },
31567
  "hf_artifacts": {
31568
  "path": "hf_artifacts:TASK_METHOD_20_RESULT_MATRIX.md",
31569
  "exists": true,
31570
+ "bytes": 14862,
31571
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
31572
  },
31573
  "hf_model": {
31574
  "path": "hf_model:TASK_METHOD_20_RESULT_MATRIX.md",
31575
  "exists": true,
31576
+ "bytes": 14862,
31577
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
31578
  }
31579
  },
31580
  "failures": []
docs/data/public_surface_qa.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Public Project Surface",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T20:01:39+00:00",
5
  "scope": "Repo README, GitHub Pages HTML, Hugging Face Space card, artifact dataset card, and model card.",
6
  "checks": [
7
  {
@@ -18,7 +18,7 @@
18
  "website_integrity": {
19
  "exists": true,
20
  "status": "pass",
21
- "generated_at_utc": "2026-06-21T20:01:18+00:00"
22
  },
23
  "rendered_site_check": {
24
  "exists": true,
@@ -28,27 +28,27 @@
28
  "task_surface_integrity": {
29
  "exists": true,
30
  "status": "pass",
31
- "generated_at_utc": "2026-06-21T15:21:55+00:00"
32
  },
33
  "source_alignment": {
34
  "exists": true,
35
  "status": "pass",
36
- "generated_at_utc": "2026-06-21T15:21:55+00:00"
37
  },
38
  "scale_up_status": {
39
  "exists": true,
40
  "status": "pass",
41
- "generated_at_utc": "2026-06-21T15:23:13+00:00"
42
  },
43
  "publication_package": {
44
  "exists": true,
45
  "status": "pass",
46
- "generated_at_utc": "2026-06-21T19:57:53+00:00"
47
  },
48
  "mirror_parity": {
49
  "exists": true,
50
  "status": "pass",
51
- "generated_at_utc": "2026-06-21T20:00:18+00:00"
52
  }
53
  },
54
  "failures": {}
@@ -97,8 +97,8 @@
97
  "marker_counts": {
98
  "Ropedia Xperience-10M Task Suite": 22,
99
  "Xperience-10M": 173,
100
- "20-task": 110,
101
- "Qwen3-Omni": 246,
102
  "128-episode pilot": 1
103
  }
104
  },
 
1
  {
2
  "title": "Ropedia Xperience-10M Public Project Surface",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:18+00:00",
5
  "scope": "Repo README, GitHub Pages HTML, Hugging Face Space card, artifact dataset card, and model card.",
6
  "checks": [
7
  {
 
18
  "website_integrity": {
19
  "exists": true,
20
  "status": "pass",
21
+ "generated_at_utc": "2026-06-21T20:35:18+00:00"
22
  },
23
  "rendered_site_check": {
24
  "exists": true,
 
28
  "task_surface_integrity": {
29
  "exists": true,
30
  "status": "pass",
31
+ "generated_at_utc": "2026-06-21T20:30:48+00:00"
32
  },
33
  "source_alignment": {
34
  "exists": true,
35
  "status": "pass",
36
+ "generated_at_utc": "2026-06-21T20:30:47+00:00"
37
  },
38
  "scale_up_status": {
39
  "exists": true,
40
  "status": "pass",
41
+ "generated_at_utc": "2026-06-21T20:30:47+00:00"
42
  },
43
  "publication_package": {
44
  "exists": true,
45
  "status": "pass",
46
+ "generated_at_utc": "2026-06-21T20:32:45+00:00"
47
  },
48
  "mirror_parity": {
49
  "exists": true,
50
  "status": "pass",
51
+ "generated_at_utc": "2026-06-21T20:33:57+00:00"
52
  }
53
  },
54
  "failures": {}
 
97
  "marker_counts": {
98
  "Ropedia Xperience-10M Task Suite": 22,
99
  "Xperience-10M": 173,
100
+ "20-task": 107,
101
+ "Qwen3-Omni": 239,
102
  "128-episode pilot": 1
103
  }
104
  },
docs/data/publication_audit.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T20:02:09+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T20:35:41+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
docs/data/quality_gates.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:21:42+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
 
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:18+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
docs/data/scope_claims_audit.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T15:23:13+00:00",
4
  "summary": {
5
  "qwen3_omni_verified_diagnostic_pilot": true,
6
  "dataset_manifest_num_episodes": 119,
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T20:35:22+00:00",
4
  "summary": {
5
  "qwen3_omni_verified_diagnostic_pilot": true,
6
  "dataset_manifest_num_episodes": 119,
docs/data/single_episode_task_model_radar.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Single-Episode 20-Task Radar",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:20:34+00:00",
5
  "description": "Minimal and Neural MLP baselines on the one public sample episode, both scored on all 20 task contracts.",
6
  "task_count": 20,
7
  "method_count": 2,
@@ -11,10 +11,28 @@
11
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
12
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
13
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
 
14
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
15
- "foundation_model_overlay": "Qwen3-Omni and Cosmos3 points are plotted only on task-aligned axes. Scoreless records mean the public result does not evaluate that task contract.",
16
- "metadata_128_overlay": "128-episode aligned baselines have 20 records. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists; raw interaction text and paired camera-view embeddings remain explicit gaps.",
17
- "raw_128_overlay": "128-episode raw-feature baselines use staged sensor NPZ features. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
  },
19
  "source_unified_radar": "docs/data/unified_task_model_radar.json",
20
  "source_result_matrix": "docs/data/task_method_20_result_matrix.json",
@@ -28,7 +46,7 @@
28
  "scope": "1 public sample episode",
29
  "stroke_dasharray": null,
30
  "method_detail": "Single-episode simple heads over the public sample split.",
31
- "plotted_as": "filled polygon",
32
  "result_record_count": 20,
33
  "scored_task_count": 20,
34
  "covered_task_count": 20,
@@ -51,7 +69,7 @@
51
  "scope": "1 public sample episode",
52
  "stroke_dasharray": null,
53
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
54
- "plotted_as": "filled polygon",
55
  "result_record_count": 20,
56
  "scored_task_count": 20,
57
  "covered_task_count": 20,
 
1
  {
2
  "title": "Single-Episode 20-Task Radar",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:16+00:00",
5
  "description": "Minimal and Neural MLP baselines on the one public sample episode, both scored on all 20 task contracts.",
6
  "task_count": 20,
7
  "method_count": 2,
 
11
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
12
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
13
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
14
+ "radar_visual_radius": "SVG radar panels use sqrt(normalized_score) for radius so polygon area remains closer to the score and low-valued but real differences stay visible; the JSON and matrix retain exact linear normalized_score values",
15
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
16
+ "foundation_model_overlay": "Qwen3-Omni and Cosmos3 are grouped in the foundation-model radar panel. All current public model rows have 20 scored task records, with source paths retained for every metric.",
17
+ "metadata_128_overlay": "128-episode aligned baselines are grouped in the metadata/text radar panel. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists.",
18
+ "raw_128_overlay": "128-episode raw-feature baselines are grouped in the raw-feature radar panel. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
19
+ },
20
+ "chart_design": {
21
+ "mode": "grouped_small_multiples",
22
+ "method_count": 2,
23
+ "reason": "This split view has 2 methods and 40 method-task records; grouped radar panels keep related methods readable while retaining the unified source matrix.",
24
+ "groups": [
25
+ {
26
+ "id": "single_episode",
27
+ "title": "Single-episode sample",
28
+ "series_ids": [
29
+ "minimal",
30
+ "neural_mlp"
31
+ ]
32
+ }
33
+ ],
34
+ "visual_radius_transform": "sqrt(normalized_score)",
35
+ "exact_value_source": "docs/data/task_method_20_result_matrix.json"
36
  },
37
  "source_unified_radar": "docs/data/unified_task_model_radar.json",
38
  "source_result_matrix": "docs/data/task_method_20_result_matrix.json",
 
46
  "scope": "1 public sample episode",
47
  "stroke_dasharray": null,
48
  "method_detail": "Single-episode simple heads over the public sample split.",
49
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
50
  "result_record_count": 20,
51
  "scored_task_count": 20,
52
  "covered_task_count": 20,
 
69
  "scope": "1 public sample episode",
70
  "stroke_dasharray": null,
71
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
72
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
73
  "result_record_count": 20,
74
  "scored_task_count": 20,
75
  "covered_task_count": 20,
docs/data/source_alignment_audit.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:21:55+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
 
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:22+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
docs/data/task_method_20_result_matrix.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Task Method 20-Result Matrix",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:20:34+00:00",
5
  "task_count": 20,
6
  "method_count": 9,
7
  "method_task_record_count": 180,
@@ -16,7 +16,7 @@
16
  "scope": "1 public sample episode",
17
  "stroke_dasharray": null,
18
  "method_detail": "Single-episode simple heads over the public sample split.",
19
- "plotted_as": "filled polygon",
20
  "result_record_count": 20,
21
  "scored_task_count": 20,
22
  "covered_task_count": 20,
@@ -39,7 +39,7 @@
39
  "scope": "1 public sample episode",
40
  "stroke_dasharray": null,
41
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
42
- "plotted_as": "filled polygon",
43
  "result_record_count": 20,
44
  "scored_task_count": 20,
45
  "covered_task_count": 20,
@@ -62,7 +62,7 @@
62
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
63
  "stroke_dasharray": "9 6",
64
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
65
- "plotted_as": "colored point overlay",
66
  "result_record_count": 20,
67
  "scored_task_count": 20,
68
  "covered_task_count": 20,
@@ -86,7 +86,7 @@
86
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
87
  "stroke_dasharray": "3 6",
88
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
89
- "plotted_as": "colored point overlay",
90
  "result_record_count": 20,
91
  "scored_task_count": 20,
92
  "covered_task_count": 20,
@@ -110,7 +110,7 @@
110
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
111
  "stroke_dasharray": "8 4",
112
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
113
- "plotted_as": "colored point overlay",
114
  "result_record_count": 20,
115
  "scored_task_count": 20,
116
  "covered_task_count": 20,
@@ -134,7 +134,7 @@
134
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
135
  "stroke_dasharray": "2 5",
136
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
137
- "plotted_as": "colored point overlay",
138
  "result_record_count": 20,
139
  "scored_task_count": 20,
140
  "covered_task_count": 20,
@@ -158,7 +158,7 @@
158
  "scope": "128 selected episodes, held-out test",
159
  "stroke_dasharray": "7 7",
160
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
161
- "plotted_as": "colored point overlay",
162
  "result_record_count": 20,
163
  "scored_task_count": 20,
164
  "covered_task_count": 20,
@@ -181,7 +181,7 @@
181
  "scope": "128 selected episodes, held-out test",
182
  "stroke_dasharray": "4 7",
183
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
184
- "plotted_as": "colored point overlay",
185
  "result_record_count": 20,
186
  "scored_task_count": 20,
187
  "covered_task_count": 20,
@@ -204,7 +204,7 @@
204
  "scope": "128 selected episodes, held-out test",
205
  "stroke_dasharray": "2 7",
206
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
207
- "plotted_as": "colored point overlay",
208
  "result_record_count": 20,
209
  "scored_task_count": 20,
210
  "covered_task_count": 20,
 
1
  {
2
  "title": "Task Method 20-Result Matrix",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:16+00:00",
5
  "task_count": 20,
6
  "method_count": 9,
7
  "method_task_record_count": 180,
 
16
  "scope": "1 public sample episode",
17
  "stroke_dasharray": null,
18
  "method_detail": "Single-episode simple heads over the public sample split.",
19
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
20
  "result_record_count": 20,
21
  "scored_task_count": 20,
22
  "covered_task_count": 20,
 
39
  "scope": "1 public sample episode",
40
  "stroke_dasharray": null,
41
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
42
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
43
  "result_record_count": 20,
44
  "scored_task_count": 20,
45
  "covered_task_count": 20,
 
62
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
63
  "stroke_dasharray": "9 6",
64
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
65
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
66
  "result_record_count": 20,
67
  "scored_task_count": 20,
68
  "covered_task_count": 20,
 
86
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
87
  "stroke_dasharray": "3 6",
88
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
89
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
90
  "result_record_count": 20,
91
  "scored_task_count": 20,
92
  "covered_task_count": 20,
 
110
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
111
  "stroke_dasharray": "8 4",
112
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
113
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
114
  "result_record_count": 20,
115
  "scored_task_count": 20,
116
  "covered_task_count": 20,
 
134
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
135
  "stroke_dasharray": "2 5",
136
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
137
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
138
  "result_record_count": 20,
139
  "scored_task_count": 20,
140
  "covered_task_count": 20,
 
158
  "scope": "128 selected episodes, held-out test",
159
  "stroke_dasharray": "7 7",
160
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
161
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
162
  "result_record_count": 20,
163
  "scored_task_count": 20,
164
  "covered_task_count": 20,
 
181
  "scope": "128 selected episodes, held-out test",
182
  "stroke_dasharray": "4 7",
183
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
184
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
185
  "result_record_count": 20,
186
  "scored_task_count": 20,
187
  "covered_task_count": 20,
 
204
  "scope": "128 selected episodes, held-out test",
205
  "stroke_dasharray": "2 7",
206
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
207
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
208
  "result_record_count": 20,
209
  "scored_task_count": 20,
210
  "covered_task_count": 20,
docs/data/task_surface_integrity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T15:21:55+00:00",
4
  "summary": {
5
  "original_walkthrough_task_count": 12,
6
  "expected_original_walkthrough_task_count": 12,
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T20:35:22+00:00",
4
  "summary": {
5
  "original_walkthrough_task_count": 12,
6
  "expected_original_walkthrough_task_count": 12,
docs/data/unified_task_model_radar.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Unified 20-Task Model Radar",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:20:34+00:00",
5
  "task_count": 20,
6
  "method_count": 9,
7
  "method_task_record_count": 180,
@@ -10,10 +10,53 @@
10
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
11
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
12
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
 
13
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
14
- "foundation_model_overlay": "Qwen3-Omni and Cosmos3 points are plotted only on task-aligned axes. Scoreless records mean the public result does not evaluate that task contract.",
15
- "metadata_128_overlay": "128-episode aligned baselines have 20 records. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists; raw interaction text and paired camera-view embeddings remain explicit gaps.",
16
- "raw_128_overlay": "128-episode raw-feature baselines use staged sensor NPZ features. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
17
  },
18
  "series": [
19
  {
@@ -25,7 +68,7 @@
25
  "scope": "1 public sample episode",
26
  "stroke_dasharray": null,
27
  "method_detail": "Single-episode simple heads over the public sample split.",
28
- "plotted_as": "filled polygon",
29
  "result_record_count": 20,
30
  "scored_task_count": 20,
31
  "covered_task_count": 20,
@@ -48,7 +91,7 @@
48
  "scope": "1 public sample episode",
49
  "stroke_dasharray": null,
50
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
51
- "plotted_as": "filled polygon",
52
  "result_record_count": 20,
53
  "scored_task_count": 20,
54
  "covered_task_count": 20,
@@ -71,7 +114,7 @@
71
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
72
  "stroke_dasharray": "9 6",
73
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
74
- "plotted_as": "colored point overlay",
75
  "result_record_count": 20,
76
  "scored_task_count": 20,
77
  "covered_task_count": 20,
@@ -95,7 +138,7 @@
95
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
96
  "stroke_dasharray": "3 6",
97
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
98
- "plotted_as": "colored point overlay",
99
  "result_record_count": 20,
100
  "scored_task_count": 20,
101
  "covered_task_count": 20,
@@ -119,7 +162,7 @@
119
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
120
  "stroke_dasharray": "8 4",
121
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
122
- "plotted_as": "colored point overlay",
123
  "result_record_count": 20,
124
  "scored_task_count": 20,
125
  "covered_task_count": 20,
@@ -143,7 +186,7 @@
143
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
144
  "stroke_dasharray": "2 5",
145
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
146
- "plotted_as": "colored point overlay",
147
  "result_record_count": 20,
148
  "scored_task_count": 20,
149
  "covered_task_count": 20,
@@ -167,7 +210,7 @@
167
  "scope": "128 selected episodes, held-out test",
168
  "stroke_dasharray": "7 7",
169
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
170
- "plotted_as": "colored point overlay",
171
  "result_record_count": 20,
172
  "scored_task_count": 20,
173
  "covered_task_count": 20,
@@ -190,7 +233,7 @@
190
  "scope": "128 selected episodes, held-out test",
191
  "stroke_dasharray": "4 7",
192
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
193
- "plotted_as": "colored point overlay",
194
  "result_record_count": 20,
195
  "scored_task_count": 20,
196
  "covered_task_count": 20,
@@ -213,7 +256,7 @@
213
  "scope": "128 selected episodes, held-out test",
214
  "stroke_dasharray": "2 7",
215
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
216
- "plotted_as": "colored point overlay",
217
  "result_record_count": 20,
218
  "scored_task_count": 20,
219
  "covered_task_count": 20,
 
1
  {
2
  "title": "Unified 20-Task Model Radar",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:16+00:00",
5
  "task_count": 20,
6
  "method_count": 9,
7
  "method_task_record_count": 180,
 
10
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
11
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
12
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
13
+ "radar_visual_radius": "SVG radar panels use sqrt(normalized_score) for radius so polygon area remains closer to the score and low-valued but real differences stay visible; the JSON and matrix retain exact linear normalized_score values",
14
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
15
+ "foundation_model_overlay": "Qwen3-Omni and Cosmos3 are grouped in the foundation-model radar panel. All current public model rows have 20 scored task records, with source paths retained for every metric.",
16
+ "metadata_128_overlay": "128-episode aligned baselines are grouped in the metadata/text radar panel. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists.",
17
+ "raw_128_overlay": "128-episode raw-feature baselines are grouped in the raw-feature radar panel. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
18
+ },
19
+ "chart_design": {
20
+ "mode": "grouped_small_multiples",
21
+ "method_count": 9,
22
+ "reason": "The public release has nine methods and 180 scored records; small-multiple radar panels avoid a nine-polygon overlay while keeping every method visible.",
23
+ "groups": [
24
+ {
25
+ "id": "single_episode",
26
+ "title": "Single-episode sample",
27
+ "series_ids": [
28
+ "minimal",
29
+ "neural_mlp"
30
+ ]
31
+ },
32
+ {
33
+ "id": "metadata_128",
34
+ "title": "128-episode metadata/text",
35
+ "series_ids": [
36
+ "metadata128_simple",
37
+ "metadata128_neural_mlp"
38
+ ]
39
+ },
40
+ {
41
+ "id": "raw_128",
42
+ "title": "128-episode raw features",
43
+ "series_ids": [
44
+ "raw128_simple",
45
+ "raw128_neural_mlp"
46
+ ]
47
+ },
48
+ {
49
+ "id": "foundation_models",
50
+ "title": "Foundation-model probes",
51
+ "series_ids": [
52
+ "qwen3_omni_v6_lora",
53
+ "cosmos3_super_reasoner",
54
+ "cosmos3_nano_future_window"
55
+ ]
56
+ }
57
+ ],
58
+ "visual_radius_transform": "sqrt(normalized_score)",
59
+ "exact_value_source": "docs/data/task_method_20_result_matrix.json"
60
  },
61
  "series": [
62
  {
 
68
  "scope": "1 public sample episode",
69
  "stroke_dasharray": null,
70
  "method_detail": "Single-episode simple heads over the public sample split.",
71
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
72
  "result_record_count": 20,
73
  "scored_task_count": 20,
74
  "covered_task_count": 20,
 
91
  "scope": "1 public sample episode",
92
  "stroke_dasharray": null,
93
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
94
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
95
  "result_record_count": 20,
96
  "scored_task_count": 20,
97
  "covered_task_count": 20,
 
114
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
115
  "stroke_dasharray": "9 6",
116
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
117
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
118
  "result_record_count": 20,
119
  "scored_task_count": 20,
120
  "covered_task_count": 20,
 
138
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
139
  "stroke_dasharray": "3 6",
140
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
141
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
142
  "result_record_count": 20,
143
  "scored_task_count": 20,
144
  "covered_task_count": 20,
 
162
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
163
  "stroke_dasharray": "8 4",
164
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
165
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
166
  "result_record_count": 20,
167
  "scored_task_count": 20,
168
  "covered_task_count": 20,
 
186
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
187
  "stroke_dasharray": "2 5",
188
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
189
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
190
  "result_record_count": 20,
191
  "scored_task_count": 20,
192
  "covered_task_count": 20,
 
210
  "scope": "128 selected episodes, held-out test",
211
  "stroke_dasharray": "7 7",
212
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
213
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
214
  "result_record_count": 20,
215
  "scored_task_count": 20,
216
  "covered_task_count": 20,
 
233
  "scope": "128 selected episodes, held-out test",
234
  "stroke_dasharray": "4 7",
235
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
236
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
237
  "result_record_count": 20,
238
  "scored_task_count": 20,
239
  "covered_task_count": 20,
 
256
  "scope": "128 selected episodes, held-out test",
257
  "stroke_dasharray": "2 7",
258
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
259
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
260
  "result_record_count": 20,
261
  "scored_task_count": 20,
262
  "covered_task_count": 20,
docs/data/website_integrity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T20:01:18+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
@@ -80,8 +80,8 @@
80
  "name": "project_overview_precedes_progress_ledger",
81
  "status": "pass",
82
  "reason": "The project overview should appear before the deeper progress ledger.",
83
- "overview_index": 136413,
84
- "evidence_index": 187740
85
  },
86
  {
87
  "name": "project_status_links_json",
@@ -159,9 +159,9 @@
159
  "name": "evaluation_protocol_between_overview_and_progress",
160
  "status": "pass",
161
  "reason": "The evaluation protocol should appear before the deeper evidence ledger.",
162
- "overview_index": 136413,
163
- "protocol_index": 183928,
164
- "evidence_index": 187740
165
  },
166
  {
167
  "name": "evaluation_protocol_links_json",
@@ -187,7 +187,7 @@
187
  "status": "pass",
188
  "reason": "The Suite anchor should show the task-suite map before the radar/results surface.",
189
  "first_marker_index": 468,
190
- "second_marker_index": 1838
191
  },
192
  {
193
  "name": "raw_sample_stream_ledger_contains_seven_modalities",
@@ -315,7 +315,7 @@
315
  },
316
  {
317
  "path": "data/artifact_index.json",
318
- "bytes": 124341,
319
  "top_level_type": "dict"
320
  },
321
  {
@@ -330,7 +330,7 @@
330
  },
331
  {
332
  "path": "data/episode128_task_model_radar.json",
333
- "bytes": 185212,
334
  "top_level_type": "dict"
335
  },
336
  {
@@ -345,7 +345,7 @@
345
  },
346
  {
347
  "path": "data/figure_index.json",
348
- "bytes": 19485,
349
  "top_level_type": "dict"
350
  },
351
  {
@@ -495,7 +495,7 @@
495
  },
496
  {
497
  "path": "data/single_episode_task_model_radar.json",
498
- "bytes": 51327,
499
  "top_level_type": "dict"
500
  },
501
  {
@@ -515,7 +515,7 @@
515
  },
516
  {
517
  "path": "data/task_method_20_result_matrix.json",
518
- "bytes": 128509,
519
  "top_level_type": "dict"
520
  },
521
  {
@@ -565,7 +565,7 @@
565
  },
566
  {
567
  "path": "data/unified_task_model_radar.json",
568
- "bytes": 229035,
569
  "top_level_type": "dict"
570
  },
571
  {
@@ -610,7 +610,7 @@
610
  {
611
  "path": "assets/charts/episode128_task_model_radar.svg",
612
  "exists": true,
613
- "bytes": 51915,
614
  "format": "SVG",
615
  "has_viewbox": true
616
  },
@@ -666,7 +666,7 @@
666
  {
667
  "path": "assets/charts/single_episode_task_model_radar.svg",
668
  "exists": true,
669
- "bytes": 35232,
670
  "format": "SVG",
671
  "has_viewbox": true
672
  },
@@ -680,7 +680,7 @@
680
  {
681
  "path": "assets/charts/unified_task_model_radar.svg",
682
  "exists": true,
683
- "bytes": 57938,
684
  "format": "SVG",
685
  "has_viewbox": true
686
  },
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T20:35:18+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
 
80
  "name": "project_overview_precedes_progress_ledger",
81
  "status": "pass",
82
  "reason": "The project overview should appear before the deeper progress ledger.",
83
+ "overview_index": 136346,
84
+ "evidence_index": 187681
85
  },
86
  {
87
  "name": "project_status_links_json",
 
159
  "name": "evaluation_protocol_between_overview_and_progress",
160
  "status": "pass",
161
  "reason": "The evaluation protocol should appear before the deeper evidence ledger.",
162
+ "overview_index": 136346,
163
+ "protocol_index": 183869,
164
+ "evidence_index": 187681
165
  },
166
  {
167
  "name": "evaluation_protocol_links_json",
 
187
  "status": "pass",
188
  "reason": "The Suite anchor should show the task-suite map before the radar/results surface.",
189
  "first_marker_index": 468,
190
+ "second_marker_index": 2000
191
  },
192
  {
193
  "name": "raw_sample_stream_ledger_contains_seven_modalities",
 
315
  },
316
  {
317
  "path": "data/artifact_index.json",
318
+ "bytes": 124471,
319
  "top_level_type": "dict"
320
  },
321
  {
 
330
  },
331
  {
332
  "path": "data/episode128_task_model_radar.json",
333
+ "bytes": 186828,
334
  "top_level_type": "dict"
335
  },
336
  {
 
345
  },
346
  {
347
  "path": "data/figure_index.json",
348
+ "bytes": 19526,
349
  "top_level_type": "dict"
350
  },
351
  {
 
495
  },
496
  {
497
  "path": "data/single_episode_task_model_radar.json",
498
+ "bytes": 52256,
499
  "top_level_type": "dict"
500
  },
501
  {
 
515
  },
516
  {
517
  "path": "data/task_method_20_result_matrix.json",
518
+ "bytes": 128991,
519
  "top_level_type": "dict"
520
  },
521
  {
 
565
  },
566
  {
567
  "path": "data/unified_task_model_radar.json",
568
+ "bytes": 230938,
569
  "top_level_type": "dict"
570
  },
571
  {
 
610
  {
611
  "path": "assets/charts/episode128_task_model_radar.svg",
612
  "exists": true,
613
+ "bytes": 79370,
614
  "format": "SVG",
615
  "has_viewbox": true
616
  },
 
666
  {
667
  "path": "assets/charts/single_episode_task_model_radar.svg",
668
  "exists": true,
669
+ "bytes": 36930,
670
  "format": "SVG",
671
  "has_viewbox": true
672
  },
 
680
  {
681
  "path": "assets/charts/unified_task_model_radar.svg",
682
  "exists": true,
683
+ "bytes": 98527,
684
  "format": "SVG",
685
  "has_viewbox": true
686
  },
docs/index.html CHANGED
@@ -4654,15 +4654,15 @@
4654
  <div class="hero-radar-panel" aria-label="Unified 20-task radar comparison">
4655
  <div class="hero-radar-top">
4656
  <strong>home radar comparison</strong>
4657
- <span>two evidence lines / 180 scored records / 174 direct + 6 compact-proxy</span>
4658
  </div>
4659
  <div class="hero-radar-layout">
4660
  <a class="hero-radar-frame" href="#suite" aria-label="Open full unified 20-task model radar">
4661
- <img src="assets/charts/unified_task_model_radar.svg?v=xperience10m-20task-radar-v6" alt="Unified 20-task model radar with full task-name key, method legend, 20-record counts, score counts, and raw128 proxy notes">
4662
  </a>
4663
  <div class="hero-radar-copy">
4664
- <h2>Model comparison is split by evidence line.</h2>
4665
- <p>The full SVG names every task axis and keeps method records, source artifacts, and proxy notes attached to the same comparison view.</p>
4666
  <div class="hero-radar-stats" aria-label="Radar coverage summary">
4667
  <div class="hero-radar-stat"><strong>180</strong><span>method-task records</span></div>
4668
  <div class="hero-radar-stat"><strong>174</strong><span>direct scores</span></div>
@@ -4670,10 +4670,10 @@
4670
  <div class="hero-radar-stat"><strong>34,269</strong><span>128ep windows</span></div>
4671
  </div>
4672
  <div class="hero-method-list" aria-label="Method families shown in the radar">
4673
- <div class="hero-method" style="--method-color:#67e8d1"><strong>Line 1: Minimal + Neural MLP</strong><span>Single public-sample episode; 40/40 direct task scores.</span></div>
4674
- <div class="hero-method" style="--method-color:#f59e0b"><strong>Line 2: metadata + raw baselines</strong><span>Same selected-128 split; all four baseline rows have 20 records, with proxy notes where direct raw targets are absent.</span></div>
4675
- <div class="hero-method" style="--method-color:#9bb8ff"><strong>Line 2: Qwen3-Omni v6 LoRA</strong><span>One trainable selected-128 model row; 20/20 direct scores from verified LoRA and task-specific probe artifacts.</span></div>
4676
- <div class="hero-method" style="--method-color:#d8f4a5"><strong>Line 2: Cosmos3 series</strong><span>Cosmos3-Super Reasoner and Cosmos3-Nano Future Window are separate selected-128 method rows; 40/40 direct scores.</span></div>
4677
  </div>
4678
  <div class="hero-task-strip" aria-label="Radar task axis examples">
4679
  <span>01 Action Recognition</span>
@@ -5065,7 +5065,7 @@
5065
  <article class="split-radar-card">
5066
  <h3>1-Episode 20-Task Radar</h3>
5067
  <p>Minimal and Neural MLP baselines over the original public-sample episode, with 40/40 scored method-task records.</p>
5068
- <img src="assets/charts/single_episode_task_model_radar.svg?v=xperience10m-split-radar-v1" alt="Single-episode 20-task radar comparing Minimal and Neural MLP across all 20 scored task axes">
5069
  <div class="split-radar-links">
5070
  <a href="assets/charts/single_episode_task_model_radar.svg">Open SVG</a>
5071
  <a href="data/single_episode_task_model_radar.json">Open JSON</a>
@@ -5074,7 +5074,7 @@
5074
  <article class="split-radar-card">
5075
  <h3>128-Episode 20-Task Radar</h3>
5076
  <p>Metadata, raw-feature, Qwen3-Omni, and Cosmos3 methods on the aligned 128-episode surface, with all 140 rows scored and proxy/evidence notes kept explicit.</p>
5077
- <img src="assets/charts/episode128_task_model_radar.svg?v=xperience10m-split-radar-v1" alt="128-episode 20-task radar comparing raw-feature baselines, metadata baselines, Qwen3-Omni, and Cosmos3 series with explicit score counts">
5078
  <div class="split-radar-links">
5079
  <a href="assets/charts/episode128_task_model_radar.svg">Open SVG</a>
5080
  <a href="data/episode128_task_model_radar.json">Open JSON</a>
@@ -5777,23 +5777,23 @@
5777
  <div class="figure-brief">
5778
  <article class="figure-brief-card">
5779
  <h3>Unified plus split radars</h3>
5780
- <p>The unified radar keeps all 9 methods in one view. The split radars separate the 1-episode Minimal/NN baseline comparison from the 128-episode metadata/raw, Qwen3-Omni v6 LoRA, and Cosmos3-Super/Nano comparison.</p>
5781
  </article>
5782
  <article class="figure-brief-card">
5783
  <h3>Metric normalization</h3>
5784
- <p>Higher-is-better metrics are plotted directly on 0-1 axes. Lower-is-better metrics are converted to best/value within the task, while raw values, status reasons, sources, and the two raw128 compact proxy notes remain in the JSON mirrors.</p>
5785
  </article>
5786
  <article class="figure-brief-card">
5787
  <h3>Score/proxy audit</h3>
5788
  <p>The matrix has 180/180 scored method-task records: 174 direct scores and 6 compact-proxy scores. The audit records the source artifact, metric key, and proxy reason for each marked cell.</p>
5789
  </article>
5790
  </div>
5791
- <img class="chart radar-chart unified-radar-chart" src="assets/charts/unified_task_model_radar.svg?v=xperience10m-20task-radar-v6" alt="Unified 20-task radar comparing Minimal, Neural MLP, 128-episode metadata/raw baselines, Qwen3-Omni, and Cosmos3 with task names, method details, 20-record counts, score counts, and proxy notes">
5792
  <div class="split-radar-grid" aria-label="Split 20-task radar comparisons">
5793
  <article class="split-radar-card">
5794
  <h3>1-Episode 20-Task Radar</h3>
5795
- <p>Minimal and Neural MLP are both scored on all 20 public-sample task contracts, shown as two filled polygons without 128-episode overlays.</p>
5796
- <img src="assets/charts/single_episode_task_model_radar.svg?v=xperience10m-split-radar-v1" alt="Single-episode 20-task radar comparing Minimal and Neural MLP across all 20 scored task axes">
5797
  <div class="split-radar-links">
5798
  <a href="assets/charts/single_episode_task_model_radar.svg">Open SVG</a>
5799
  <a href="data/single_episode_task_model_radar.json">Open JSON</a>
@@ -5801,8 +5801,8 @@
5801
  </article>
5802
  <article class="split-radar-card">
5803
  <h3>128-Episode 20-Task Radar</h3>
5804
- <p>Seven aligned 128-episode methods cover all 20 axes: metadata simple/NN, raw-feature simple/NN, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano. Proxy axes stay labeled in the JSON.</p>
5805
- <img src="assets/charts/episode128_task_model_radar.svg?v=xperience10m-split-radar-v1" alt="128-episode 20-task radar comparing raw-feature baselines, metadata baselines, Qwen3-Omni, and Cosmos3 series with explicit score counts">
5806
  <div class="split-radar-links">
5807
  <a href="assets/charts/episode128_task_model_radar.svg">Open SVG</a>
5808
  <a href="data/episode128_task_model_radar.json">Open JSON</a>
 
4654
  <div class="hero-radar-panel" aria-label="Unified 20-task radar comparison">
4655
  <div class="hero-radar-top">
4656
  <strong>home radar comparison</strong>
4657
+ <span>4 grouped panels / 180 scored records / 174 direct + 6 compact-proxy</span>
4658
  </div>
4659
  <div class="hero-radar-layout">
4660
  <a class="hero-radar-frame" href="#suite" aria-label="Open full unified 20-task model radar">
4661
+ <img src="assets/charts/unified_task_model_radar.svg?v=xperience10m-20task-radar-v7" alt="Unified 20-task grouped radar board with method-family panels, task key, score counts, and raw128 proxy notes">
4662
  </a>
4663
  <div class="hero-radar-copy">
4664
+ <h2>Model comparison is grouped by method family.</h2>
4665
+ <p>The full SVG names every task axis, separates the nine methods into readable panels, and keeps source artifacts plus proxy notes attached to the same comparison view.</p>
4666
  <div class="hero-radar-stats" aria-label="Radar coverage summary">
4667
  <div class="hero-radar-stat"><strong>180</strong><span>method-task records</span></div>
4668
  <div class="hero-radar-stat"><strong>174</strong><span>direct scores</span></div>
 
4670
  <div class="hero-radar-stat"><strong>34,269</strong><span>128ep windows</span></div>
4671
  </div>
4672
  <div class="hero-method-list" aria-label="Method families shown in the radar">
4673
+ <div class="hero-method" style="--method-color:#67e8d1"><strong>Panel 1: Minimal + Neural MLP</strong><span>Single public-sample episode; 40/40 direct task scores.</span></div>
4674
+ <div class="hero-method" style="--method-color:#ffd166"><strong>Panel 2: 128ep metadata/text</strong><span>Aligned JSONL and staged-target baselines; 40/40 scored with proxy flags retained.</span></div>
4675
+ <div class="hero-method" style="--method-color:#22d3ee"><strong>Panel 3: 128ep raw features</strong><span>Sensor-block simple/NN heads; 40/40 scored with task 15/19 compact proxies marked.</span></div>
4676
+ <div class="hero-method" style="--method-color:#9bb8ff"><strong>Panel 4: Qwen3 + Cosmos3</strong><span>Qwen3-Omni v6 LoRA, Cosmos3-Super, and Cosmos3-Nano; 60/60 scored from verified artifacts.</span></div>
4677
  </div>
4678
  <div class="hero-task-strip" aria-label="Radar task axis examples">
4679
  <span>01 Action Recognition</span>
 
5065
  <article class="split-radar-card">
5066
  <h3>1-Episode 20-Task Radar</h3>
5067
  <p>Minimal and Neural MLP baselines over the original public-sample episode, with 40/40 scored method-task records.</p>
5068
+ <img src="assets/charts/single_episode_task_model_radar.svg?v=xperience10m-split-radar-v2" alt="Single-episode 20-task radar comparing Minimal and Neural MLP across all 20 scored task axes">
5069
  <div class="split-radar-links">
5070
  <a href="assets/charts/single_episode_task_model_radar.svg">Open SVG</a>
5071
  <a href="data/single_episode_task_model_radar.json">Open JSON</a>
 
5074
  <article class="split-radar-card">
5075
  <h3>128-Episode 20-Task Radar</h3>
5076
  <p>Metadata, raw-feature, Qwen3-Omni, and Cosmos3 methods on the aligned 128-episode surface, with all 140 rows scored and proxy/evidence notes kept explicit.</p>
5077
+ <img src="assets/charts/episode128_task_model_radar.svg?v=xperience10m-split-radar-v2" alt="128-episode grouped 20-task radar comparing metadata baselines, raw-feature baselines, Qwen3-Omni, and Cosmos3 series with explicit score counts">
5078
  <div class="split-radar-links">
5079
  <a href="assets/charts/episode128_task_model_radar.svg">Open SVG</a>
5080
  <a href="data/episode128_task_model_radar.json">Open JSON</a>
 
5777
  <div class="figure-brief">
5778
  <article class="figure-brief-card">
5779
  <h3>Unified plus split radars</h3>
5780
+ <p>The unified radar keeps all nine methods in one comparison board, but groups them into small-multiple panels so each method family can be read directly. The split radars separate the 1-episode Minimal/NN baseline comparison from the 128-episode metadata/raw, Qwen3-Omni v6 LoRA, and Cosmos3-Super/Nano comparison.</p>
5781
  </article>
5782
  <article class="figure-brief-card">
5783
  <h3>Metric normalization</h3>
5784
+ <p>Higher-is-better metrics are normalized to 0-1; lower-is-better metrics are converted to best/value within the task. The SVG uses sqrt(normalized score) only for visual radius, while raw values, linear normalized scores, status reasons, sources, and compact proxy notes remain in the JSON mirrors.</p>
5785
  </article>
5786
  <article class="figure-brief-card">
5787
  <h3>Score/proxy audit</h3>
5788
  <p>The matrix has 180/180 scored method-task records: 174 direct scores and 6 compact-proxy scores. The audit records the source artifact, metric key, and proxy reason for each marked cell.</p>
5789
  </article>
5790
  </div>
5791
+ <img class="chart radar-chart unified-radar-chart" src="assets/charts/unified_task_model_radar.svg?v=xperience10m-20task-radar-v7" alt="Unified grouped 20-task radar comparing Minimal, Neural MLP, 128-episode metadata/raw baselines, Qwen3-Omni, and Cosmos3 with task names, method details, 20-record counts, score counts, and proxy notes">
5792
  <div class="split-radar-grid" aria-label="Split 20-task radar comparisons">
5793
  <article class="split-radar-card">
5794
  <h3>1-Episode 20-Task Radar</h3>
5795
+ <p>Minimal and Neural MLP are both scored on all 20 public-sample task contracts in one enlarged panel without 128-episode methods competing for attention.</p>
5796
+ <img src="assets/charts/single_episode_task_model_radar.svg?v=xperience10m-split-radar-v2" alt="Single-episode 20-task radar comparing Minimal and Neural MLP across all 20 scored task axes">
5797
  <div class="split-radar-links">
5798
  <a href="assets/charts/single_episode_task_model_radar.svg">Open SVG</a>
5799
  <a href="data/single_episode_task_model_radar.json">Open JSON</a>
 
5801
  </article>
5802
  <article class="split-radar-card">
5803
  <h3>128-Episode 20-Task Radar</h3>
5804
+ <p>Seven aligned 128-episode methods cover all 20 axes across metadata/text, raw-feature, and foundation-model panels. Proxy axes stay labeled in the SVG and JSON.</p>
5805
+ <img src="assets/charts/episode128_task_model_radar.svg?v=xperience10m-split-radar-v2" alt="128-episode grouped 20-task radar comparing raw-feature baselines, metadata baselines, Qwen3-Omni, and Cosmos3 series with explicit score counts">
5806
  <div class="split-radar-links">
5807
  <a href="assets/charts/episode128_task_model_radar.svg">Open SVG</a>
5808
  <a href="data/episode128_task_model_radar.json">Open JSON</a>
index.html CHANGED
@@ -4654,15 +4654,15 @@
4654
  <div class="hero-radar-panel" aria-label="Unified 20-task radar comparison">
4655
  <div class="hero-radar-top">
4656
  <strong>home radar comparison</strong>
4657
- <span>two evidence lines / 180 scored records / 174 direct + 6 compact-proxy</span>
4658
  </div>
4659
  <div class="hero-radar-layout">
4660
  <a class="hero-radar-frame" href="#suite" aria-label="Open full unified 20-task model radar">
4661
- <img src="assets/charts/unified_task_model_radar.svg?v=xperience10m-20task-radar-v6" alt="Unified 20-task model radar with full task-name key, method legend, 20-record counts, score counts, and raw128 proxy notes">
4662
  </a>
4663
  <div class="hero-radar-copy">
4664
- <h2>Model comparison is split by evidence line.</h2>
4665
- <p>The full SVG names every task axis and keeps method records, source artifacts, and proxy notes attached to the same comparison view.</p>
4666
  <div class="hero-radar-stats" aria-label="Radar coverage summary">
4667
  <div class="hero-radar-stat"><strong>180</strong><span>method-task records</span></div>
4668
  <div class="hero-radar-stat"><strong>174</strong><span>direct scores</span></div>
@@ -4670,10 +4670,10 @@
4670
  <div class="hero-radar-stat"><strong>34,269</strong><span>128ep windows</span></div>
4671
  </div>
4672
  <div class="hero-method-list" aria-label="Method families shown in the radar">
4673
- <div class="hero-method" style="--method-color:#67e8d1"><strong>Line 1: Minimal + Neural MLP</strong><span>Single public-sample episode; 40/40 direct task scores.</span></div>
4674
- <div class="hero-method" style="--method-color:#f59e0b"><strong>Line 2: metadata + raw baselines</strong><span>Same selected-128 split; all four baseline rows have 20 records, with proxy notes where direct raw targets are absent.</span></div>
4675
- <div class="hero-method" style="--method-color:#9bb8ff"><strong>Line 2: Qwen3-Omni v6 LoRA</strong><span>One trainable selected-128 model row; 20/20 direct scores from verified LoRA and task-specific probe artifacts.</span></div>
4676
- <div class="hero-method" style="--method-color:#d8f4a5"><strong>Line 2: Cosmos3 series</strong><span>Cosmos3-Super Reasoner and Cosmos3-Nano Future Window are separate selected-128 method rows; 40/40 direct scores.</span></div>
4677
  </div>
4678
  <div class="hero-task-strip" aria-label="Radar task axis examples">
4679
  <span>01 Action Recognition</span>
@@ -5065,7 +5065,7 @@
5065
  <article class="split-radar-card">
5066
  <h3>1-Episode 20-Task Radar</h3>
5067
  <p>Minimal and Neural MLP baselines over the original public-sample episode, with 40/40 scored method-task records.</p>
5068
- <img src="assets/charts/single_episode_task_model_radar.svg?v=xperience10m-split-radar-v1" alt="Single-episode 20-task radar comparing Minimal and Neural MLP across all 20 scored task axes">
5069
  <div class="split-radar-links">
5070
  <a href="assets/charts/single_episode_task_model_radar.svg">Open SVG</a>
5071
  <a href="data/single_episode_task_model_radar.json">Open JSON</a>
@@ -5074,7 +5074,7 @@
5074
  <article class="split-radar-card">
5075
  <h3>128-Episode 20-Task Radar</h3>
5076
  <p>Metadata, raw-feature, Qwen3-Omni, and Cosmos3 methods on the aligned 128-episode surface, with all 140 rows scored and proxy/evidence notes kept explicit.</p>
5077
- <img src="assets/charts/episode128_task_model_radar.svg?v=xperience10m-split-radar-v1" alt="128-episode 20-task radar comparing raw-feature baselines, metadata baselines, Qwen3-Omni, and Cosmos3 series with explicit score counts">
5078
  <div class="split-radar-links">
5079
  <a href="assets/charts/episode128_task_model_radar.svg">Open SVG</a>
5080
  <a href="data/episode128_task_model_radar.json">Open JSON</a>
@@ -5777,23 +5777,23 @@
5777
  <div class="figure-brief">
5778
  <article class="figure-brief-card">
5779
  <h3>Unified plus split radars</h3>
5780
- <p>The unified radar keeps all 9 methods in one view. The split radars separate the 1-episode Minimal/NN baseline comparison from the 128-episode metadata/raw, Qwen3-Omni v6 LoRA, and Cosmos3-Super/Nano comparison.</p>
5781
  </article>
5782
  <article class="figure-brief-card">
5783
  <h3>Metric normalization</h3>
5784
- <p>Higher-is-better metrics are plotted directly on 0-1 axes. Lower-is-better metrics are converted to best/value within the task, while raw values, status reasons, sources, and the two raw128 compact proxy notes remain in the JSON mirrors.</p>
5785
  </article>
5786
  <article class="figure-brief-card">
5787
  <h3>Score/proxy audit</h3>
5788
  <p>The matrix has 180/180 scored method-task records: 174 direct scores and 6 compact-proxy scores. The audit records the source artifact, metric key, and proxy reason for each marked cell.</p>
5789
  </article>
5790
  </div>
5791
- <img class="chart radar-chart unified-radar-chart" src="assets/charts/unified_task_model_radar.svg?v=xperience10m-20task-radar-v6" alt="Unified 20-task radar comparing Minimal, Neural MLP, 128-episode metadata/raw baselines, Qwen3-Omni, and Cosmos3 with task names, method details, 20-record counts, score counts, and proxy notes">
5792
  <div class="split-radar-grid" aria-label="Split 20-task radar comparisons">
5793
  <article class="split-radar-card">
5794
  <h3>1-Episode 20-Task Radar</h3>
5795
- <p>Minimal and Neural MLP are both scored on all 20 public-sample task contracts, shown as two filled polygons without 128-episode overlays.</p>
5796
- <img src="assets/charts/single_episode_task_model_radar.svg?v=xperience10m-split-radar-v1" alt="Single-episode 20-task radar comparing Minimal and Neural MLP across all 20 scored task axes">
5797
  <div class="split-radar-links">
5798
  <a href="assets/charts/single_episode_task_model_radar.svg">Open SVG</a>
5799
  <a href="data/single_episode_task_model_radar.json">Open JSON</a>
@@ -5801,8 +5801,8 @@
5801
  </article>
5802
  <article class="split-radar-card">
5803
  <h3>128-Episode 20-Task Radar</h3>
5804
- <p>Seven aligned 128-episode methods cover all 20 axes: metadata simple/NN, raw-feature simple/NN, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano. Proxy axes stay labeled in the JSON.</p>
5805
- <img src="assets/charts/episode128_task_model_radar.svg?v=xperience10m-split-radar-v1" alt="128-episode 20-task radar comparing raw-feature baselines, metadata baselines, Qwen3-Omni, and Cosmos3 series with explicit score counts">
5806
  <div class="split-radar-links">
5807
  <a href="assets/charts/episode128_task_model_radar.svg">Open SVG</a>
5808
  <a href="data/episode128_task_model_radar.json">Open JSON</a>
 
4654
  <div class="hero-radar-panel" aria-label="Unified 20-task radar comparison">
4655
  <div class="hero-radar-top">
4656
  <strong>home radar comparison</strong>
4657
+ <span>4 grouped panels / 180 scored records / 174 direct + 6 compact-proxy</span>
4658
  </div>
4659
  <div class="hero-radar-layout">
4660
  <a class="hero-radar-frame" href="#suite" aria-label="Open full unified 20-task model radar">
4661
+ <img src="assets/charts/unified_task_model_radar.svg?v=xperience10m-20task-radar-v7" alt="Unified 20-task grouped radar board with method-family panels, task key, score counts, and raw128 proxy notes">
4662
  </a>
4663
  <div class="hero-radar-copy">
4664
+ <h2>Model comparison is grouped by method family.</h2>
4665
+ <p>The full SVG names every task axis, separates the nine methods into readable panels, and keeps source artifacts plus proxy notes attached to the same comparison view.</p>
4666
  <div class="hero-radar-stats" aria-label="Radar coverage summary">
4667
  <div class="hero-radar-stat"><strong>180</strong><span>method-task records</span></div>
4668
  <div class="hero-radar-stat"><strong>174</strong><span>direct scores</span></div>
 
4670
  <div class="hero-radar-stat"><strong>34,269</strong><span>128ep windows</span></div>
4671
  </div>
4672
  <div class="hero-method-list" aria-label="Method families shown in the radar">
4673
+ <div class="hero-method" style="--method-color:#67e8d1"><strong>Panel 1: Minimal + Neural MLP</strong><span>Single public-sample episode; 40/40 direct task scores.</span></div>
4674
+ <div class="hero-method" style="--method-color:#ffd166"><strong>Panel 2: 128ep metadata/text</strong><span>Aligned JSONL and staged-target baselines; 40/40 scored with proxy flags retained.</span></div>
4675
+ <div class="hero-method" style="--method-color:#22d3ee"><strong>Panel 3: 128ep raw features</strong><span>Sensor-block simple/NN heads; 40/40 scored with task 15/19 compact proxies marked.</span></div>
4676
+ <div class="hero-method" style="--method-color:#9bb8ff"><strong>Panel 4: Qwen3 + Cosmos3</strong><span>Qwen3-Omni v6 LoRA, Cosmos3-Super, and Cosmos3-Nano; 60/60 scored from verified artifacts.</span></div>
4677
  </div>
4678
  <div class="hero-task-strip" aria-label="Radar task axis examples">
4679
  <span>01 Action Recognition</span>
 
5065
  <article class="split-radar-card">
5066
  <h3>1-Episode 20-Task Radar</h3>
5067
  <p>Minimal and Neural MLP baselines over the original public-sample episode, with 40/40 scored method-task records.</p>
5068
+ <img src="assets/charts/single_episode_task_model_radar.svg?v=xperience10m-split-radar-v2" alt="Single-episode 20-task radar comparing Minimal and Neural MLP across all 20 scored task axes">
5069
  <div class="split-radar-links">
5070
  <a href="assets/charts/single_episode_task_model_radar.svg">Open SVG</a>
5071
  <a href="data/single_episode_task_model_radar.json">Open JSON</a>
 
5074
  <article class="split-radar-card">
5075
  <h3>128-Episode 20-Task Radar</h3>
5076
  <p>Metadata, raw-feature, Qwen3-Omni, and Cosmos3 methods on the aligned 128-episode surface, with all 140 rows scored and proxy/evidence notes kept explicit.</p>
5077
+ <img src="assets/charts/episode128_task_model_radar.svg?v=xperience10m-split-radar-v2" alt="128-episode grouped 20-task radar comparing metadata baselines, raw-feature baselines, Qwen3-Omni, and Cosmos3 series with explicit score counts">
5078
  <div class="split-radar-links">
5079
  <a href="assets/charts/episode128_task_model_radar.svg">Open SVG</a>
5080
  <a href="data/episode128_task_model_radar.json">Open JSON</a>
 
5777
  <div class="figure-brief">
5778
  <article class="figure-brief-card">
5779
  <h3>Unified plus split radars</h3>
5780
+ <p>The unified radar keeps all nine methods in one comparison board, but groups them into small-multiple panels so each method family can be read directly. The split radars separate the 1-episode Minimal/NN baseline comparison from the 128-episode metadata/raw, Qwen3-Omni v6 LoRA, and Cosmos3-Super/Nano comparison.</p>
5781
  </article>
5782
  <article class="figure-brief-card">
5783
  <h3>Metric normalization</h3>
5784
+ <p>Higher-is-better metrics are normalized to 0-1; lower-is-better metrics are converted to best/value within the task. The SVG uses sqrt(normalized score) only for visual radius, while raw values, linear normalized scores, status reasons, sources, and compact proxy notes remain in the JSON mirrors.</p>
5785
  </article>
5786
  <article class="figure-brief-card">
5787
  <h3>Score/proxy audit</h3>
5788
  <p>The matrix has 180/180 scored method-task records: 174 direct scores and 6 compact-proxy scores. The audit records the source artifact, metric key, and proxy reason for each marked cell.</p>
5789
  </article>
5790
  </div>
5791
+ <img class="chart radar-chart unified-radar-chart" src="assets/charts/unified_task_model_radar.svg?v=xperience10m-20task-radar-v7" alt="Unified grouped 20-task radar comparing Minimal, Neural MLP, 128-episode metadata/raw baselines, Qwen3-Omni, and Cosmos3 with task names, method details, 20-record counts, score counts, and proxy notes">
5792
  <div class="split-radar-grid" aria-label="Split 20-task radar comparisons">
5793
  <article class="split-radar-card">
5794
  <h3>1-Episode 20-Task Radar</h3>
5795
+ <p>Minimal and Neural MLP are both scored on all 20 public-sample task contracts in one enlarged panel without 128-episode methods competing for attention.</p>
5796
+ <img src="assets/charts/single_episode_task_model_radar.svg?v=xperience10m-split-radar-v2" alt="Single-episode 20-task radar comparing Minimal and Neural MLP across all 20 scored task axes">
5797
  <div class="split-radar-links">
5798
  <a href="assets/charts/single_episode_task_model_radar.svg">Open SVG</a>
5799
  <a href="data/single_episode_task_model_radar.json">Open JSON</a>
 
5801
  </article>
5802
  <article class="split-radar-card">
5803
  <h3>128-Episode 20-Task Radar</h3>
5804
+ <p>Seven aligned 128-episode methods cover all 20 axes across metadata/text, raw-feature, and foundation-model panels. Proxy axes stay labeled in the SVG and JSON.</p>
5805
+ <img src="assets/charts/episode128_task_model_radar.svg?v=xperience10m-split-radar-v2" alt="128-episode grouped 20-task radar comparing raw-feature baselines, metadata baselines, Qwen3-Omni, and Cosmos3 series with explicit score counts">
5806
  <div class="split-radar-links">
5807
  <a href="assets/charts/episode128_task_model_radar.svg">Open SVG</a>
5808
  <a href="data/episode128_task_model_radar.json">Open JSON</a>
metrics/artifact_index.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "title": "Ropedia Xperience-10M Task Suite Artifact Index",
3
- "generated_at_utc": "2026-06-21T15:19:00+00:00",
4
  "status": "pass",
5
  "artifact_count": 228,
6
  "missing": [],
@@ -92,8 +92,8 @@
92
  "surface": "repo_hf",
93
  "shows": "Defines terminology that can be confused across data scope, task metrics, model branches, and public mirrors.",
94
  "exists": true,
95
- "bytes": 10220,
96
- "sha256": "c3723a3dfaf902f4bc2527b78d11057ab8ba01c6b9715fa873c7dd819deddf81"
97
  },
98
  {
99
  "id": "glossary_json",
@@ -103,8 +103,8 @@
103
  "surface": "website_hf",
104
  "shows": "Machine-readable terminology layer for the website, artifact dataset, model mirror, and public QA checks.",
105
  "exists": true,
106
- "bytes": 17380,
107
- "sha256": "3ba746413eb4a7d2646333e3746e0b3f4039db61d03c159d2d059fecfe4b51ed"
108
  },
109
  {
110
  "id": "research_roadmap",
@@ -158,8 +158,8 @@
158
  "surface": "repo_hf",
159
  "shows": "Frames spatial intelligence, human-video world modeling, and vision-language-action as three pipeline tracks with explicit inputs, outputs, maturity, and next evidence gates.",
160
  "exists": true,
161
- "bytes": 8008,
162
- "sha256": "abd179631f1d204328fa5a2e018207eceeb7d46be12b7dab2826e70dfde97130"
163
  },
164
  {
165
  "id": "three_foundation_pipelines_json",
@@ -169,8 +169,8 @@
169
  "surface": "website_hf",
170
  "shows": "Machine-readable pipeline-track contract for the website and Hugging Face mirrors.",
171
  "exists": true,
172
- "bytes": 10312,
173
- "sha256": "9eb30d098f1b3787255a5506e4299f2cd4904ef989f2eaeaa2e5f77519c17378"
174
  },
175
  {
176
  "id": "spatial_intelligence_slide_diagram",
@@ -632,7 +632,7 @@
632
  "shows": "Machine-readable source-alignment pass/fail check for repo, website, and HF surfaces.",
633
  "exists": true,
634
  "bytes": 4432,
635
- "sha256": "5ab2ea4bfefe9f5bc7854f02b2e1e2b5206766a54447647191828da1a1a2077c"
636
  },
637
  {
638
  "id": "source_alignment_validator",
@@ -653,8 +653,8 @@
653
  "surface": "repo_hf",
654
  "shows": "Publishes prepared Space, artifact dataset, and model bundles, including an explicit model-binary upload batch.",
655
  "exists": true,
656
- "bytes": 25115,
657
- "sha256": "31866958ffaa3954f1bc4f920fbf80d5b41fd95228be50989dadb6e8cbe619e3"
658
  },
659
  {
660
  "id": "github_package_dockerfile",
@@ -697,8 +697,8 @@
697
  "surface": "website_hf",
698
  "shows": "Machine-readable protocol generated from committed task metrics for website and HF mirrors.",
699
  "exists": true,
700
- "bytes": 24047,
701
- "sha256": "d8f61b646a2f3f1e0af901dbdaff310ebfeea90622c93a34b9e35f34be98b896"
702
  },
703
  {
704
  "id": "evaluation_protocol_builder",
@@ -708,8 +708,8 @@
708
  "surface": "repo_hf",
709
  "shows": "Regenerates the protocol from committed summary metrics and task artifacts.",
710
  "exists": true,
711
- "bytes": 19825,
712
- "sha256": "aa9de1582f8fa79c1850e10e69fb125c0e3c1add433c7ebedc104c2efb42272e"
713
  },
714
  {
715
  "id": "task_suite_20",
@@ -730,8 +730,8 @@
730
  "surface": "website_hf",
731
  "shows": "Machine-readable unified 20-task index for the website, Hugging Face mirrors, and live verification.",
732
  "exists": true,
733
- "bytes": 34585,
734
- "sha256": "75145285cf71bc3bb9a10377a1921b60e85c4546dc8b858102b3c26e94c11a01"
735
  },
736
  {
737
  "id": "task_suite_20_builder",
@@ -741,8 +741,8 @@
741
  "surface": "repo_hf",
742
  "shows": "Regenerates the unified 20-task JSON and Markdown from the public-sample metrics plus the historical provenance result bundle.",
743
  "exists": true,
744
- "bytes": 12157,
745
- "sha256": "157265b5c025f279ce1eb56c52dd720ce0969b8426d5887030bfa179a3b565e0"
746
  },
747
  {
748
  "id": "unified_task_model_radar_json",
@@ -750,10 +750,10 @@
750
  "path": "docs/data/unified_task_model_radar.json",
751
  "kind": "website_data",
752
  "surface": "website_hf",
753
- "shows": "Stores normalized 20-axis radar values, raw task metrics, Qwen3-Omni/Cosmos3 overlay mappings, method-card caveats, proxy flags, and source artifacts.",
754
  "exists": true,
755
- "bytes": 228815,
756
- "sha256": "862376178e8b0d01b536f49a18b7934a373494f8b36080790f616438ec0e035e"
757
  },
758
  {
759
  "id": "single_episode_task_model_radar_json",
@@ -763,8 +763,8 @@
763
  "surface": "website_hf",
764
  "shows": "Machine-readable split radar for the one-episode Minimal and Neural MLP baselines, both scored on all 20 task contracts.",
765
  "exists": true,
766
- "bytes": 51107,
767
- "sha256": "5f2ebb41e8488446ea5c5cd2cb75bbedce688433feffe1412288de56b133bd5c"
768
  },
769
  {
770
  "id": "episode128_task_model_radar_json",
@@ -774,8 +774,8 @@
774
  "surface": "website_hf",
775
  "shows": "Machine-readable split radar for selected 128-episode metadata/raw baselines, Qwen3-Omni v6, Cosmos3-Super, and Cosmos3-Nano, now complete at 140/140 scored rows with proxy notes retained.",
776
  "exists": true,
777
- "bytes": 184992,
778
- "sha256": "385704db90443d74903f365e90b27538020f5574c96f296bbf63173f488a645d"
779
  },
780
  {
781
  "id": "task_method_20_result_matrix_json",
@@ -785,8 +785,8 @@
785
  "surface": "website_hf",
786
  "shows": "Machine-readable 9-method by 20-task matrix where every method has 20 records and the current release is complete at 180/180 scored rows.",
787
  "exists": true,
788
- "bytes": 128509,
789
- "sha256": "96082daa33771963ac40b7d719df00a76ec443508a3d3101cb6dd82d87965729"
790
  },
791
  {
792
  "id": "task_method_20_result_matrix",
@@ -796,8 +796,8 @@
796
  "surface": "repo_hf",
797
  "shows": "Reader-facing table that separates 20 records per method, direct numeric scores, documented compact-proxy scores, and source artifacts.",
798
  "exists": true,
799
- "bytes": 3563,
800
- "sha256": "84b21bd3359e6149b952a9b3b6991f7ba8ff412390a92983bd6d59f212d58316"
801
  },
802
  {
803
  "id": "task_method_20_gap_audit_json",
@@ -808,7 +808,7 @@
808
  "shows": "Machine-readable 180-record completion ledger with numeric scores, proxy flags, explicit status reasons, and source artifacts.",
809
  "exists": true,
810
  "bytes": 8500,
811
- "sha256": "5743354519dfae924d973ffca90a5d1ef4b0c316cf201b3ac443f6b5d4600de5"
812
  },
813
  {
814
  "id": "task_method_20_gap_audit",
@@ -819,7 +819,7 @@
819
  "shows": "Reader-facing ledger confirming 180/180 scored method-task cells and listing the six compact-proxy records separately.",
820
  "exists": true,
821
  "bytes": 3417,
822
- "sha256": "01f47c74db965887e55f17b674e1617e396dd01a51fdc7bf9a9c7612f5596b2c"
823
  },
824
  {
825
  "id": "task_method_20_source_audit_json",
@@ -830,7 +830,7 @@
830
  "shows": "Machine-readable check that scored JSON-backed matrix cells match their declared metric source values.",
831
  "exists": true,
832
  "bytes": 561,
833
- "sha256": "bb50000bb681f1ed12c4089bd3541f14b8e121ea5a1fe06e1a6a559cb8960e0b"
834
  },
835
  {
836
  "id": "task_method_20_source_audit",
@@ -841,7 +841,7 @@
841
  "shows": "Reader-facing source-value audit for the 180-result matrix.",
842
  "exists": true,
843
  "bytes": 447,
844
- "sha256": "1a0583629368cee3abadc49d4a0220dead924326a04e82b3a22a9fc6d6b0d252"
845
  },
846
  {
847
  "id": "two_evidence_line_map_chart",
@@ -860,10 +860,10 @@
860
  "path": "docs/assets/charts/unified_task_model_radar.svg",
861
  "kind": "generated_figure",
862
  "surface": "website_hf",
863
- "shows": "Compares minimal and neural MLP baselines across all 20 tasks, with Qwen3-Omni and Cosmos3 task-aligned overlays.",
864
  "exists": true,
865
- "bytes": 57938,
866
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
867
  },
868
  {
869
  "id": "single_episode_task_model_radar_chart",
@@ -871,10 +871,10 @@
871
  "path": "docs/assets/charts/single_episode_task_model_radar.svg",
872
  "kind": "generated_figure",
873
  "surface": "website_hf",
874
- "shows": "Separates the one-episode Minimal and Neural MLP 20/20 scored baselines into a clean two-polygon radar.",
875
  "exists": true,
876
- "bytes": 35232,
877
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
878
  },
879
  {
880
  "id": "episode128_task_model_radar_chart",
@@ -882,10 +882,10 @@
882
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
883
  "kind": "generated_figure",
884
  "surface": "website_hf",
885
- "shows": "Separates the selected 128-episode methods: raw-feature simple/NN as complete 20/20 scored polygons plus metadata, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano task-aligned overlays.",
886
  "exists": true,
887
- "bytes": 51915,
888
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
889
  },
890
  {
891
  "id": "unified_task_model_radar_builder",
@@ -893,10 +893,10 @@
893
  "path": "scripts/build_unified_task_model_radar.py",
894
  "kind": "visualization_builder",
895
  "surface": "repo_hf",
896
- "shows": "Regenerates the direction-aware radar chart and machine-readable metric overlay JSON.",
897
  "exists": true,
898
- "bytes": 68610,
899
- "sha256": "96bc2df0de5a9e512d69961ddb13ea87b26ef01f1f943f5a78a6dc373400949d"
900
  },
901
  {
902
  "id": "task_method_20_gap_audit_builder",
@@ -1093,8 +1093,8 @@
1093
  "surface": "repo_hf",
1094
  "shows": "Catalogs public figures, charts, modality thumbnails, dimensions, hashes, roles, and source scripts.",
1095
  "exists": true,
1096
- "bytes": 7027,
1097
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
1098
  },
1099
  {
1100
  "id": "figure_index_json",
@@ -1104,8 +1104,8 @@
1104
  "surface": "website_hf",
1105
  "shows": "Machine-readable visual asset index for website and Hugging Face mirrors.",
1106
  "exists": true,
1107
- "bytes": 19485,
1108
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
1109
  },
1110
  {
1111
  "id": "figure_index_builder",
@@ -1115,8 +1115,8 @@
1115
  "surface": "repo_hf",
1116
  "shows": "Regenerates visual-asset hashes, dimensions, and source-script provenance.",
1117
  "exists": true,
1118
- "bytes": 16845,
1119
- "sha256": "3f91f7f13a3fb08ab57c2f0a6b320102e9d5ae19b102b71499edb5b8fd5a2cec"
1120
  },
1121
  {
1122
  "id": "brand_assets_json",
@@ -1182,7 +1182,7 @@
1182
  "shows": "Machine-readable release-check summary for validators, mirrors, and public project surfaces.",
1183
  "exists": true,
1184
  "bytes": 8640,
1185
- "sha256": "6e54f6828b8fef97e963a9a56bccc91162b8a632f6897743095e32407fa0db98"
1186
  },
1187
  {
1188
  "id": "public_surface_qa",
@@ -1226,7 +1226,7 @@
1226
  "volatile": true,
1227
  "shows": "Machine-readable report for SEO/social metadata, accessible tab semantics, public links, project links, and clear project presentation.",
1228
  "exists": true,
1229
- "bytes": 7691,
1230
  "hash_policy": "existence_and_size_only"
1231
  },
1232
  {
@@ -1318,8 +1318,8 @@
1318
  "surface": "repo",
1319
  "shows": "Fetches the published GitHub/HF URLs and compares live hashes and public-card markers against the release assets.",
1320
  "exists": true,
1321
- "bytes": 69151,
1322
- "sha256": "c4af8644d50dafe7d4249dd7c5b36bb19e996628ff6d8436fbb6e027da526c1f"
1323
  },
1324
  {
1325
  "id": "reproducibility_contract",
@@ -1351,8 +1351,8 @@
1351
  "surface": "repo_hf",
1352
  "shows": "Generates the selective artifact catalog from local files.",
1353
  "exists": true,
1354
- "bytes": 68279,
1355
- "sha256": "69b43ad5d3dc5a6893c4592fa47fff6a7a87691728ec2c61b121ec262d00bf2a"
1356
  },
1357
  {
1358
  "id": "publication_audit",
@@ -1363,7 +1363,7 @@
1363
  "volatile": true,
1364
  "shows": "Confirms public bundles exclude raw data, caches, heavy archives, and credential text.",
1365
  "exists": true,
1366
- "bytes": 10939,
1367
  "hash_policy": "existence_and_size_only"
1368
  },
1369
  {
@@ -1387,7 +1387,7 @@
1387
  "volatile": true,
1388
  "shows": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, website HTML, and validator files.",
1389
  "exists": true,
1390
- "bytes": 1420743,
1391
  "hash_policy": "existence_and_size_only"
1392
  },
1393
  {
@@ -1399,7 +1399,7 @@
1399
  "volatile": true,
1400
  "shows": "Confirms local website links, anchors, JSON data files, and referenced images resolve.",
1401
  "exists": true,
1402
- "bytes": 20760,
1403
  "hash_policy": "existence_and_size_only"
1404
  },
1405
  {
 
1
  {
2
  "title": "Ropedia Xperience-10M Task Suite Artifact Index",
3
+ "generated_at_utc": "2026-06-21T20:35:18+00:00",
4
  "status": "pass",
5
  "artifact_count": 228,
6
  "missing": [],
 
92
  "surface": "repo_hf",
93
  "shows": "Defines terminology that can be confused across data scope, task metrics, model branches, and public mirrors.",
94
  "exists": true,
95
+ "bytes": 10857,
96
+ "sha256": "b9b8f8695b6c211e849073e1634057f158877a61161eca05b1266769158ee83e"
97
  },
98
  {
99
  "id": "glossary_json",
 
103
  "surface": "website_hf",
104
  "shows": "Machine-readable terminology layer for the website, artifact dataset, model mirror, and public QA checks.",
105
  "exists": true,
106
+ "bytes": 18754,
107
+ "sha256": "b1dc42e1f42a7c19bb4b2ebd32a0862df28bec671eaa849b09a97f103675e9eb"
108
  },
109
  {
110
  "id": "research_roadmap",
 
158
  "surface": "repo_hf",
159
  "shows": "Frames spatial intelligence, human-video world modeling, and vision-language-action as three pipeline tracks with explicit inputs, outputs, maturity, and next evidence gates.",
160
  "exists": true,
161
+ "bytes": 11609,
162
+ "sha256": "69d4d8781015c807c58a720b3bf967ee872540a2fe2b7e7973105946c5a1cf11"
163
  },
164
  {
165
  "id": "three_foundation_pipelines_json",
 
169
  "surface": "website_hf",
170
  "shows": "Machine-readable pipeline-track contract for the website and Hugging Face mirrors.",
171
  "exists": true,
172
+ "bytes": 14465,
173
+ "sha256": "c8c9b7a9ee8d3ecfe8662cf0336a7ca2ade6d670c5567d2a50dfb67c7241defd"
174
  },
175
  {
176
  "id": "spatial_intelligence_slide_diagram",
 
632
  "shows": "Machine-readable source-alignment pass/fail check for repo, website, and HF surfaces.",
633
  "exists": true,
634
  "bytes": 4432,
635
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
636
  },
637
  {
638
  "id": "source_alignment_validator",
 
653
  "surface": "repo_hf",
654
  "shows": "Publishes prepared Space, artifact dataset, and model bundles, including an explicit model-binary upload batch.",
655
  "exists": true,
656
+ "bytes": 29935,
657
+ "sha256": "89ed1b25784cd42f19f2e4731eaa52e8655e8ed06cc2a12b6a2c0add027c5fc0"
658
  },
659
  {
660
  "id": "github_package_dockerfile",
 
697
  "surface": "website_hf",
698
  "shows": "Machine-readable protocol generated from committed task metrics for website and HF mirrors.",
699
  "exists": true,
700
+ "bytes": 24267,
701
+ "sha256": "83289b52e0193fa86d63fac7dd8af98d0612199b1e1a5f5ed37014baa9168656"
702
  },
703
  {
704
  "id": "evaluation_protocol_builder",
 
708
  "surface": "repo_hf",
709
  "shows": "Regenerates the protocol from committed summary metrics and task artifacts.",
710
  "exists": true,
711
+ "bytes": 19845,
712
+ "sha256": "b5d888fd67ba6b8150b24960aa0858db50f16a5c7b59a8ce274401e384f0a2a3"
713
  },
714
  {
715
  "id": "task_suite_20",
 
730
  "surface": "website_hf",
731
  "shows": "Machine-readable unified 20-task index for the website, Hugging Face mirrors, and live verification.",
732
  "exists": true,
733
+ "bytes": 34805,
734
+ "sha256": "5d7a3b5b57e64980291b3ce6a8741b0b822e94bd447ca95519d1da7040923476"
735
  },
736
  {
737
  "id": "task_suite_20_builder",
 
741
  "surface": "repo_hf",
742
  "shows": "Regenerates the unified 20-task JSON and Markdown from the public-sample metrics plus the historical provenance result bundle.",
743
  "exists": true,
744
+ "bytes": 12129,
745
+ "sha256": "4c2550c657bfa5bc37e21e78e7e27dd70daa3b178c8ffb3191e52f835221713a"
746
  },
747
  {
748
  "id": "unified_task_model_radar_json",
 
750
  "path": "docs/data/unified_task_model_radar.json",
751
  "kind": "website_data",
752
  "surface": "website_hf",
753
+ "shows": "Stores normalized 20-axis radar values, raw task metrics, grouped chart-design metadata, Qwen3-Omni/Cosmos3 source mappings, method-card caveats, proxy flags, and source artifacts.",
754
  "exists": true,
755
+ "bytes": 230938,
756
+ "sha256": "bd7b8fd8a169273c5cec2beab050a583307dcc64e5e3fefcd690484b4a0754bc"
757
  },
758
  {
759
  "id": "single_episode_task_model_radar_json",
 
763
  "surface": "website_hf",
764
  "shows": "Machine-readable split radar for the one-episode Minimal and Neural MLP baselines, both scored on all 20 task contracts.",
765
  "exists": true,
766
+ "bytes": 52256,
767
+ "sha256": "f282dc0c4c654fe6f2cd646612fd0942d267dec60c6143cc36688edbc27c13da"
768
  },
769
  {
770
  "id": "episode128_task_model_radar_json",
 
774
  "surface": "website_hf",
775
  "shows": "Machine-readable split radar for selected 128-episode metadata/raw baselines, Qwen3-Omni v6, Cosmos3-Super, and Cosmos3-Nano, now complete at 140/140 scored rows with proxy notes retained.",
776
  "exists": true,
777
+ "bytes": 186828,
778
+ "sha256": "f904cf9eb3160a50e4469989e44557e77d710e459f812ba7b194d0ed8d9746bd"
779
  },
780
  {
781
  "id": "task_method_20_result_matrix_json",
 
785
  "surface": "website_hf",
786
  "shows": "Machine-readable 9-method by 20-task matrix where every method has 20 records and the current release is complete at 180/180 scored rows.",
787
  "exists": true,
788
+ "bytes": 128991,
789
+ "sha256": "bacb549e64b6d1936b55fe7593cfbaeb1dbf5bfe824c8507ea75e493230212fe"
790
  },
791
  {
792
  "id": "task_method_20_result_matrix",
 
796
  "surface": "repo_hf",
797
  "shows": "Reader-facing table that separates 20 records per method, direct numeric scores, documented compact-proxy scores, and source artifacts.",
798
  "exists": true,
799
+ "bytes": 14862,
800
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
801
  },
802
  {
803
  "id": "task_method_20_gap_audit_json",
 
808
  "shows": "Machine-readable 180-record completion ledger with numeric scores, proxy flags, explicit status reasons, and source artifacts.",
809
  "exists": true,
810
  "bytes": 8500,
811
+ "sha256": "4f5cc7a29fe030a9fd5b97893ac67454ebc6287f5942aeedb2bfca71c411332d"
812
  },
813
  {
814
  "id": "task_method_20_gap_audit",
 
819
  "shows": "Reader-facing ledger confirming 180/180 scored method-task cells and listing the six compact-proxy records separately.",
820
  "exists": true,
821
  "bytes": 3417,
822
+ "sha256": "b094732dd98c904ea0ab30e741576a3d5eeb8ca07e7a494e459b11e31320ae4d"
823
  },
824
  {
825
  "id": "task_method_20_source_audit_json",
 
830
  "shows": "Machine-readable check that scored JSON-backed matrix cells match their declared metric source values.",
831
  "exists": true,
832
  "bytes": 561,
833
+ "sha256": "1bc6bb15ab45e73ada4d3c2d1ec326cffd9c6bc6d6d791c7468b3ed16881a463"
834
  },
835
  {
836
  "id": "task_method_20_source_audit",
 
841
  "shows": "Reader-facing source-value audit for the 180-result matrix.",
842
  "exists": true,
843
  "bytes": 447,
844
+ "sha256": "7f5e80d6dcd66be1c6b2c63144adb64d0cdad0008884ebd8f015388545e4b914"
845
  },
846
  {
847
  "id": "two_evidence_line_map_chart",
 
860
  "path": "docs/assets/charts/unified_task_model_radar.svg",
861
  "kind": "generated_figure",
862
  "surface": "website_hf",
863
+ "shows": "Groups all nine methods into small-multiple 20-task radar panels so single-episode, 128-episode metadata/text, 128-episode raw-feature, and foundation-model rows remain readable.",
864
  "exists": true,
865
+ "bytes": 98527,
866
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
867
  },
868
  {
869
  "id": "single_episode_task_model_radar_chart",
 
871
  "path": "docs/assets/charts/single_episode_task_model_radar.svg",
872
  "kind": "generated_figure",
873
  "surface": "website_hf",
874
+ "shows": "Shows the one-episode Minimal and Neural MLP 20/20 scored baselines in one enlarged radar panel with local legend and task key.",
875
  "exists": true,
876
+ "bytes": 36930,
877
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
878
  },
879
  {
880
  "id": "episode128_task_model_radar_chart",
 
882
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
883
  "kind": "generated_figure",
884
  "surface": "website_hf",
885
+ "shows": "Separates selected 128-episode methods into metadata/text, raw-feature, and foundation-model radar panels with all 140 result rows scored and proxy notes retained.",
886
  "exists": true,
887
+ "bytes": 79370,
888
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
889
  },
890
  {
891
  "id": "unified_task_model_radar_builder",
 
893
  "path": "scripts/build_unified_task_model_radar.py",
894
  "kind": "visualization_builder",
895
  "surface": "repo_hf",
896
+ "shows": "Regenerates grouped 20-task radar charts plus machine-readable metric, source, chart-design, and proxy metadata.",
897
  "exists": true,
898
+ "bytes": 79396,
899
+ "sha256": "78949d7030cf6995edbb1d46a35692c7fe835d102df619cdf8c9d8ea9c5318e2"
900
  },
901
  {
902
  "id": "task_method_20_gap_audit_builder",
 
1093
  "surface": "repo_hf",
1094
  "shows": "Catalogs public figures, charts, modality thumbnails, dimensions, hashes, roles, and source scripts.",
1095
  "exists": true,
1096
+ "bytes": 7068,
1097
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
1098
  },
1099
  {
1100
  "id": "figure_index_json",
 
1104
  "surface": "website_hf",
1105
  "shows": "Machine-readable visual asset index for website and Hugging Face mirrors.",
1106
  "exists": true,
1107
+ "bytes": 19526,
1108
+ "sha256": "601acd715891bb148b0a1706573a2fdb015ca3c5967f02a169577b483b0dcd43"
1109
  },
1110
  {
1111
  "id": "figure_index_builder",
 
1115
  "surface": "repo_hf",
1116
  "shows": "Regenerates visual-asset hashes, dimensions, and source-script provenance.",
1117
  "exists": true,
1118
+ "bytes": 16886,
1119
+ "sha256": "fc44ea60b6f491d290a56d4c1097fcfc37c024f8a9bc3b3db013252f45d96e64"
1120
  },
1121
  {
1122
  "id": "brand_assets_json",
 
1182
  "shows": "Machine-readable release-check summary for validators, mirrors, and public project surfaces.",
1183
  "exists": true,
1184
  "bytes": 8640,
1185
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1186
  },
1187
  {
1188
  "id": "public_surface_qa",
 
1226
  "volatile": true,
1227
  "shows": "Machine-readable report for SEO/social metadata, accessible tab semantics, public links, project links, and clear project presentation.",
1228
  "exists": true,
1229
+ "bytes": 7693,
1230
  "hash_policy": "existence_and_size_only"
1231
  },
1232
  {
 
1318
  "surface": "repo",
1319
  "shows": "Fetches the published GitHub/HF URLs and compares live hashes and public-card markers against the release assets.",
1320
  "exists": true,
1321
+ "bytes": 69123,
1322
+ "sha256": "a96d0a95ea1da54fb8b6a95b2c91d4a067eef8358d825c44d919e66d78ecb8f7"
1323
  },
1324
  {
1325
  "id": "reproducibility_contract",
 
1351
  "surface": "repo_hf",
1352
  "shows": "Generates the selective artifact catalog from local files.",
1353
  "exists": true,
1354
+ "bytes": 68407,
1355
+ "sha256": "9f5fc1af7c4c23083975066691bf6d75aa99280961198e2c47be88754c142418"
1356
  },
1357
  {
1358
  "id": "publication_audit",
 
1363
  "volatile": true,
1364
  "shows": "Confirms public bundles exclude raw data, caches, heavy archives, and credential text.",
1365
  "exists": true,
1366
+ "bytes": 10940,
1367
  "hash_policy": "existence_and_size_only"
1368
  },
1369
  {
 
1387
  "volatile": true,
1388
  "shows": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, website HTML, and validator files.",
1389
  "exists": true,
1390
+ "bytes": 1420751,
1391
  "hash_policy": "existence_and_size_only"
1392
  },
1393
  {
 
1399
  "volatile": true,
1400
  "shows": "Confirms local website links, anchors, JSON data files, and referenced images resolve.",
1401
  "exists": true,
1402
+ "bytes": 20178,
1403
  "hash_policy": "existence_and_size_only"
1404
  },
1405
  {
metrics/episode128_task_model_radar.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "128-Episode 20-Task Radar",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:20:34+00:00",
5
  "description": "Selected 128-episode metadata/raw baselines plus verified Qwen3-Omni v6, Cosmos3-Super, and Cosmos3-Nano diagnostics. Every method has 20 records; numeric scores appear only where the public artifact produced that task target.",
6
  "task_count": 20,
7
  "method_count": 7,
@@ -11,10 +11,45 @@
11
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
12
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
13
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
 
14
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
15
- "foundation_model_overlay": "Qwen3-Omni and Cosmos3 points are plotted only on task-aligned axes. Scoreless records mean the public result does not evaluate that task contract.",
16
- "metadata_128_overlay": "128-episode aligned baselines have 20 records. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists; raw interaction text and paired camera-view embeddings remain explicit gaps.",
17
- "raw_128_overlay": "128-episode raw-feature baselines use staged sensor NPZ features. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
  },
19
  "source_unified_radar": "docs/data/unified_task_model_radar.json",
20
  "source_result_matrix": "docs/data/task_method_20_result_matrix.json",
@@ -28,7 +63,7 @@
28
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
29
  "stroke_dasharray": "9 6",
30
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
31
- "plotted_as": "colored point overlay",
32
  "result_record_count": 20,
33
  "scored_task_count": 20,
34
  "covered_task_count": 20,
@@ -52,7 +87,7 @@
52
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
53
  "stroke_dasharray": "3 6",
54
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
55
- "plotted_as": "colored point overlay",
56
  "result_record_count": 20,
57
  "scored_task_count": 20,
58
  "covered_task_count": 20,
@@ -76,7 +111,7 @@
76
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
77
  "stroke_dasharray": "8 4",
78
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
79
- "plotted_as": "colored point overlay",
80
  "result_record_count": 20,
81
  "scored_task_count": 20,
82
  "covered_task_count": 20,
@@ -100,7 +135,7 @@
100
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
101
  "stroke_dasharray": "2 5",
102
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
103
- "plotted_as": "colored point overlay",
104
  "result_record_count": 20,
105
  "scored_task_count": 20,
106
  "covered_task_count": 20,
@@ -124,7 +159,7 @@
124
  "scope": "128 selected episodes, held-out test",
125
  "stroke_dasharray": "7 7",
126
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
127
- "plotted_as": "colored point overlay",
128
  "result_record_count": 20,
129
  "scored_task_count": 20,
130
  "covered_task_count": 20,
@@ -147,7 +182,7 @@
147
  "scope": "128 selected episodes, held-out test",
148
  "stroke_dasharray": "4 7",
149
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
150
- "plotted_as": "colored point overlay",
151
  "result_record_count": 20,
152
  "scored_task_count": 20,
153
  "covered_task_count": 20,
@@ -170,7 +205,7 @@
170
  "scope": "128 selected episodes, held-out test",
171
  "stroke_dasharray": "2 7",
172
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
173
- "plotted_as": "colored point overlay",
174
  "result_record_count": 20,
175
  "scored_task_count": 20,
176
  "covered_task_count": 20,
 
1
  {
2
  "title": "128-Episode 20-Task Radar",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:16+00:00",
5
  "description": "Selected 128-episode metadata/raw baselines plus verified Qwen3-Omni v6, Cosmos3-Super, and Cosmos3-Nano diagnostics. Every method has 20 records; numeric scores appear only where the public artifact produced that task target.",
6
  "task_count": 20,
7
  "method_count": 7,
 
11
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
12
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
13
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
14
+ "radar_visual_radius": "SVG radar panels use sqrt(normalized_score) for radius so polygon area remains closer to the score and low-valued but real differences stay visible; the JSON and matrix retain exact linear normalized_score values",
15
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
16
+ "foundation_model_overlay": "Qwen3-Omni and Cosmos3 are grouped in the foundation-model radar panel. All current public model rows have 20 scored task records, with source paths retained for every metric.",
17
+ "metadata_128_overlay": "128-episode aligned baselines are grouped in the metadata/text radar panel. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists.",
18
+ "raw_128_overlay": "128-episode raw-feature baselines are grouped in the raw-feature radar panel. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
19
+ },
20
+ "chart_design": {
21
+ "mode": "grouped_small_multiples",
22
+ "method_count": 7,
23
+ "reason": "This split view has 7 methods and 140 method-task records; grouped radar panels keep related methods readable while retaining the unified source matrix.",
24
+ "groups": [
25
+ {
26
+ "id": "metadata_128",
27
+ "title": "128-episode metadata/text",
28
+ "series_ids": [
29
+ "metadata128_simple",
30
+ "metadata128_neural_mlp"
31
+ ]
32
+ },
33
+ {
34
+ "id": "raw_128",
35
+ "title": "128-episode raw features",
36
+ "series_ids": [
37
+ "raw128_simple",
38
+ "raw128_neural_mlp"
39
+ ]
40
+ },
41
+ {
42
+ "id": "foundation_models",
43
+ "title": "Foundation-model probes",
44
+ "series_ids": [
45
+ "qwen3_omni_v6_lora",
46
+ "cosmos3_super_reasoner",
47
+ "cosmos3_nano_future_window"
48
+ ]
49
+ }
50
+ ],
51
+ "visual_radius_transform": "sqrt(normalized_score)",
52
+ "exact_value_source": "docs/data/task_method_20_result_matrix.json"
53
  },
54
  "source_unified_radar": "docs/data/unified_task_model_radar.json",
55
  "source_result_matrix": "docs/data/task_method_20_result_matrix.json",
 
63
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
64
  "stroke_dasharray": "9 6",
65
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
66
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
67
  "result_record_count": 20,
68
  "scored_task_count": 20,
69
  "covered_task_count": 20,
 
87
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
88
  "stroke_dasharray": "3 6",
89
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
90
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
91
  "result_record_count": 20,
92
  "scored_task_count": 20,
93
  "covered_task_count": 20,
 
111
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
112
  "stroke_dasharray": "8 4",
113
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
114
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
115
  "result_record_count": 20,
116
  "scored_task_count": 20,
117
  "covered_task_count": 20,
 
135
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
136
  "stroke_dasharray": "2 5",
137
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
138
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
139
  "result_record_count": 20,
140
  "scored_task_count": 20,
141
  "covered_task_count": 20,
 
159
  "scope": "128 selected episodes, held-out test",
160
  "stroke_dasharray": "7 7",
161
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
162
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
163
  "result_record_count": 20,
164
  "scored_task_count": 20,
165
  "covered_task_count": 20,
 
182
  "scope": "128 selected episodes, held-out test",
183
  "stroke_dasharray": "4 7",
184
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
185
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
186
  "result_record_count": 20,
187
  "scored_task_count": 20,
188
  "covered_task_count": 20,
 
205
  "scope": "128 selected episodes, held-out test",
206
  "stroke_dasharray": "2 7",
207
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
208
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
209
  "result_record_count": 20,
210
  "scored_task_count": 20,
211
  "covered_task_count": 20,
metrics/figure_index.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Figure Index",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:19:00+00:00",
5
  "scope": "Public figures, diagrams, charts, and derived modality thumbnails. Raw Xperience-10M videos, annotations, RRD files, and Qwen weights are excluded.",
6
  "figure_count": 29,
7
  "figures": [
@@ -406,17 +406,17 @@
406
  "id": "unified_task_model_radar",
407
  "title": "Unified 20-task model radar",
408
  "path": "docs/assets/charts/unified_task_model_radar.svg",
409
- "role": "Twenty-axis direction-aware comparison of minimal and neural MLP baselines, with 128-episode metadata, Qwen3, and Cosmos task-aligned overlay points and branch notes.",
410
  "source_script": "scripts/build_unified_task_model_radar.py",
411
  "surface": "website unified task section, README, HF mirrors",
412
  "exists": true,
413
- "bytes": 57938,
414
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8",
415
  "dimensions": {
416
  "format": "SVG",
417
  "width": 2400,
418
- "height": 1840,
419
- "view_box": "0 0 2400 1840"
420
  },
421
  "source_script_exists": true
422
  },
@@ -428,13 +428,13 @@
428
  "source_script": "scripts/build_unified_task_model_radar.py",
429
  "surface": "website unified task section, README, HF mirrors",
430
  "exists": true,
431
- "bytes": 35232,
432
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e",
433
  "dimensions": {
434
  "format": "SVG",
435
  "width": 2400,
436
- "height": 1840,
437
- "view_box": "0 0 2400 1840"
438
  },
439
  "source_script_exists": true
440
  },
@@ -442,17 +442,17 @@
442
  "id": "episode128_task_model_radar",
443
  "title": "128-episode 20-task model radar",
444
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
445
- "role": "Twenty-axis split radar for selected 128-episode methods: raw-feature simple/NN as complete scored polygons plus metadata, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano task-aligned overlays.",
446
  "source_script": "scripts/build_unified_task_model_radar.py",
447
  "surface": "website unified task section, README, HF mirrors",
448
  "exists": true,
449
- "bytes": 51915,
450
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4",
451
  "dimensions": {
452
  "format": "SVG",
453
  "width": 2400,
454
- "height": 1840,
455
- "view_box": "0 0 2400 1840"
456
  },
457
  "source_script_exists": true
458
  },
 
1
  {
2
  "title": "Ropedia Xperience-10M Figure Index",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:17+00:00",
5
  "scope": "Public figures, diagrams, charts, and derived modality thumbnails. Raw Xperience-10M videos, annotations, RRD files, and Qwen weights are excluded.",
6
  "figure_count": 29,
7
  "figures": [
 
406
  "id": "unified_task_model_radar",
407
  "title": "Unified 20-task model radar",
408
  "path": "docs/assets/charts/unified_task_model_radar.svg",
409
+ "role": "Grouped small-multiple 20-task radar board for all nine methods, separating single-episode, 128-episode metadata/text, 128-episode raw-feature, and foundation-model rows while preserving task keys and proxy notes.",
410
  "source_script": "scripts/build_unified_task_model_radar.py",
411
  "surface": "website unified task section, README, HF mirrors",
412
  "exists": true,
413
+ "bytes": 98527,
414
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec",
415
  "dimensions": {
416
  "format": "SVG",
417
  "width": 2400,
418
+ "height": 1900,
419
+ "view_box": "0 0 2400 1900"
420
  },
421
  "source_script_exists": true
422
  },
 
428
  "source_script": "scripts/build_unified_task_model_radar.py",
429
  "surface": "website unified task section, README, HF mirrors",
430
  "exists": true,
431
+ "bytes": 36930,
432
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939",
433
  "dimensions": {
434
  "format": "SVG",
435
  "width": 2400,
436
+ "height": 1900,
437
+ "view_box": "0 0 2400 1900"
438
  },
439
  "source_script_exists": true
440
  },
 
442
  "id": "episode128_task_model_radar",
443
  "title": "128-episode 20-task model radar",
444
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
445
+ "role": "Grouped 20-task radar for selected 128-episode methods: metadata/text baselines, raw-feature simple/NN, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano with local legends and proxy notes.",
446
  "source_script": "scripts/build_unified_task_model_radar.py",
447
  "surface": "website unified task section, README, HF mirrors",
448
  "exists": true,
449
+ "bytes": 79370,
450
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9",
451
  "dimensions": {
452
  "format": "SVG",
453
  "width": 2400,
454
+ "height": 1900,
455
+ "view_box": "0 0 2400 1900"
456
  },
457
  "source_script_exists": true
458
  },
metrics/mirror_parity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T20:00:18+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
  "group_count": 1258,
@@ -138,45 +138,45 @@
138
  "local": {
139
  "path": "repo:docs/data/artifact_index.json",
140
  "exists": true,
141
- "bytes": 124341,
142
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
143
  },
144
  "mirrors": {
145
  "hf_space": {
146
  "path": "hf_space:data/artifact_index.json",
147
  "exists": true,
148
- "bytes": 124341,
149
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
150
  },
151
  "hf_artifacts_data": {
152
  "path": "hf_artifacts:data/artifact_index.json",
153
  "exists": true,
154
- "bytes": 124341,
155
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
156
  },
157
  "hf_artifacts": {
158
  "path": "hf_artifacts:docs/data/artifact_index.json",
159
  "exists": true,
160
- "bytes": 124341,
161
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
162
  },
163
  "hf_model_data": {
164
  "path": "hf_model:data/artifact_index.json",
165
  "exists": true,
166
- "bytes": 124341,
167
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
168
  },
169
  "hf_model_docs_data": {
170
  "path": "hf_model:docs/data/artifact_index.json",
171
  "exists": true,
172
- "bytes": 124341,
173
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
174
  },
175
  "hf_model": {
176
  "path": "hf_model:metrics/artifact_index.json",
177
  "exists": true,
178
- "bytes": 124341,
179
- "sha256": "d2887200f777fa80235644383d26cd3a171c313d9c4641e06974fb705f02ee5d"
180
  }
181
  },
182
  "failures": []
@@ -334,45 +334,45 @@
334
  "local": {
335
  "path": "repo:docs/data/figure_index.json",
336
  "exists": true,
337
- "bytes": 19485,
338
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
339
  },
340
  "mirrors": {
341
  "hf_space": {
342
  "path": "hf_space:data/figure_index.json",
343
  "exists": true,
344
- "bytes": 19485,
345
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
346
  },
347
  "hf_artifacts_data": {
348
  "path": "hf_artifacts:data/figure_index.json",
349
  "exists": true,
350
- "bytes": 19485,
351
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
352
  },
353
  "hf_artifacts": {
354
  "path": "hf_artifacts:docs/data/figure_index.json",
355
  "exists": true,
356
- "bytes": 19485,
357
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
358
  },
359
  "hf_model_data": {
360
  "path": "hf_model:data/figure_index.json",
361
  "exists": true,
362
- "bytes": 19485,
363
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
364
  },
365
  "hf_model_docs_data": {
366
  "path": "hf_model:docs/data/figure_index.json",
367
  "exists": true,
368
- "bytes": 19485,
369
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
370
  },
371
  "hf_model": {
372
  "path": "hf_model:metrics/figure_index.json",
373
  "exists": true,
374
- "bytes": 19485,
375
- "sha256": "4f225bf08f00fbe843999d6bd2b3d5f5d6c17f2ff67e1f6a85eee9094c6bb6a3"
376
  }
377
  },
378
  "failures": []
@@ -972,44 +972,44 @@
972
  "path": "repo:docs/data/publication_audit.json",
973
  "exists": true,
974
  "bytes": 10940,
975
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
976
  },
977
  "mirrors": {
978
  "hf_space": {
979
  "path": "hf_space:data/publication_audit.json",
980
  "exists": true,
981
  "bytes": 10940,
982
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
983
  },
984
  "hf_artifacts_data": {
985
  "path": "hf_artifacts:data/publication_audit.json",
986
  "exists": true,
987
  "bytes": 10940,
988
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
989
  },
990
  "hf_artifacts": {
991
  "path": "hf_artifacts:docs/data/publication_audit.json",
992
  "exists": true,
993
  "bytes": 10940,
994
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
995
  },
996
  "hf_model_data": {
997
  "path": "hf_model:data/publication_audit.json",
998
  "exists": true,
999
  "bytes": 10940,
1000
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
1001
  },
1002
  "hf_model_docs_data": {
1003
  "path": "hf_model:docs/data/publication_audit.json",
1004
  "exists": true,
1005
  "bytes": 10940,
1006
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
1007
  },
1008
  "hf_model": {
1009
  "path": "hf_model:metrics/publication_audit.json",
1010
  "exists": true,
1011
  "bytes": 10940,
1012
- "sha256": "656414245ff90b4f44a303ae2e848d5e893a8a861f1e4102d19a0dd979012997"
1013
  }
1014
  },
1015
  "failures": []
@@ -1021,44 +1021,44 @@
1021
  "path": "repo:docs/data/public_surface_qa.json",
1022
  "exists": true,
1023
  "bytes": 7693,
1024
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1025
  },
1026
  "mirrors": {
1027
  "hf_space": {
1028
  "path": "hf_space:data/public_surface_qa.json",
1029
  "exists": true,
1030
  "bytes": 7693,
1031
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1032
  },
1033
  "hf_artifacts_data": {
1034
  "path": "hf_artifacts:data/public_surface_qa.json",
1035
  "exists": true,
1036
  "bytes": 7693,
1037
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1038
  },
1039
  "hf_artifacts": {
1040
  "path": "hf_artifacts:docs/data/public_surface_qa.json",
1041
  "exists": true,
1042
  "bytes": 7693,
1043
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1044
  },
1045
  "hf_model_data": {
1046
  "path": "hf_model:data/public_surface_qa.json",
1047
  "exists": true,
1048
  "bytes": 7693,
1049
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1050
  },
1051
  "hf_model_docs_data": {
1052
  "path": "hf_model:docs/data/public_surface_qa.json",
1053
  "exists": true,
1054
  "bytes": 7693,
1055
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1056
  },
1057
  "hf_model": {
1058
  "path": "hf_model:metrics/public_surface_qa.json",
1059
  "exists": true,
1060
  "bytes": 7693,
1061
- "sha256": "7dc38f37df3efac5c7ce1213267dd7693515df5806be778138ea5f8de70b0111"
1062
  }
1063
  },
1064
  "failures": []
@@ -1217,44 +1217,44 @@
1217
  "path": "repo:docs/data/quality_gates.json",
1218
  "exists": true,
1219
  "bytes": 8640,
1220
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1221
  },
1222
  "mirrors": {
1223
  "hf_space": {
1224
  "path": "hf_space:data/quality_gates.json",
1225
  "exists": true,
1226
  "bytes": 8640,
1227
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1228
  },
1229
  "hf_artifacts_data": {
1230
  "path": "hf_artifacts:data/quality_gates.json",
1231
  "exists": true,
1232
  "bytes": 8640,
1233
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1234
  },
1235
  "hf_artifacts": {
1236
  "path": "hf_artifacts:docs/data/quality_gates.json",
1237
  "exists": true,
1238
  "bytes": 8640,
1239
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1240
  },
1241
  "hf_model_data": {
1242
  "path": "hf_model:data/quality_gates.json",
1243
  "exists": true,
1244
  "bytes": 8640,
1245
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1246
  },
1247
  "hf_model_docs_data": {
1248
  "path": "hf_model:docs/data/quality_gates.json",
1249
  "exists": true,
1250
  "bytes": 8640,
1251
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1252
  },
1253
  "hf_model": {
1254
  "path": "hf_model:metrics/quality_gates.json",
1255
  "exists": true,
1256
  "bytes": 8640,
1257
- "sha256": "271e4f6363b8e3738274c0be42b403ae7f449effe87e34daec6477e048194ff0"
1258
  }
1259
  },
1260
  "failures": []
@@ -1658,44 +1658,44 @@
1658
  "path": "repo:docs/data/scope_claims_audit.json",
1659
  "exists": true,
1660
  "bytes": 21313,
1661
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1662
  },
1663
  "mirrors": {
1664
  "hf_space": {
1665
  "path": "hf_space:data/scope_claims_audit.json",
1666
  "exists": true,
1667
  "bytes": 21313,
1668
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1669
  },
1670
  "hf_artifacts_data": {
1671
  "path": "hf_artifacts:data/scope_claims_audit.json",
1672
  "exists": true,
1673
  "bytes": 21313,
1674
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1675
  },
1676
  "hf_artifacts": {
1677
  "path": "hf_artifacts:docs/data/scope_claims_audit.json",
1678
  "exists": true,
1679
  "bytes": 21313,
1680
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1681
  },
1682
  "hf_model_data": {
1683
  "path": "hf_model:data/scope_claims_audit.json",
1684
  "exists": true,
1685
  "bytes": 21313,
1686
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1687
  },
1688
  "hf_model_docs_data": {
1689
  "path": "hf_model:docs/data/scope_claims_audit.json",
1690
  "exists": true,
1691
  "bytes": 21313,
1692
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1693
  },
1694
  "hf_model": {
1695
  "path": "hf_model:metrics/scope_claims_audit.json",
1696
  "exists": true,
1697
  "bytes": 21313,
1698
- "sha256": "a03133115a089d88dc3ce87f831674603d8b252138790693d8def878b9264da0"
1699
  }
1700
  },
1701
  "failures": []
@@ -1756,44 +1756,44 @@
1756
  "path": "repo:docs/data/source_alignment_audit.json",
1757
  "exists": true,
1758
  "bytes": 4432,
1759
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1760
  },
1761
  "mirrors": {
1762
  "hf_space": {
1763
  "path": "hf_space:data/source_alignment_audit.json",
1764
  "exists": true,
1765
  "bytes": 4432,
1766
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1767
  },
1768
  "hf_artifacts_data": {
1769
  "path": "hf_artifacts:data/source_alignment_audit.json",
1770
  "exists": true,
1771
  "bytes": 4432,
1772
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1773
  },
1774
  "hf_artifacts": {
1775
  "path": "hf_artifacts:docs/data/source_alignment_audit.json",
1776
  "exists": true,
1777
  "bytes": 4432,
1778
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1779
  },
1780
  "hf_model_data": {
1781
  "path": "hf_model:data/source_alignment_audit.json",
1782
  "exists": true,
1783
  "bytes": 4432,
1784
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1785
  },
1786
  "hf_model_docs_data": {
1787
  "path": "hf_model:docs/data/source_alignment_audit.json",
1788
  "exists": true,
1789
  "bytes": 4432,
1790
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1791
  },
1792
  "hf_model": {
1793
  "path": "hf_model:metrics/source_alignment_audit.json",
1794
  "exists": true,
1795
  "bytes": 4432,
1796
- "sha256": "6173b5fb5e1c69983f5a2b83288d521968de16810f8e01bd2717b99a67f4285c"
1797
  }
1798
  },
1799
  "failures": []
@@ -1853,45 +1853,45 @@
1853
  "local": {
1854
  "path": "repo:docs/data/single_episode_task_model_radar.json",
1855
  "exists": true,
1856
- "bytes": 51327,
1857
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1858
  },
1859
  "mirrors": {
1860
  "hf_space": {
1861
  "path": "hf_space:data/single_episode_task_model_radar.json",
1862
  "exists": true,
1863
- "bytes": 51327,
1864
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1865
  },
1866
  "hf_artifacts_data": {
1867
  "path": "hf_artifacts:data/single_episode_task_model_radar.json",
1868
  "exists": true,
1869
- "bytes": 51327,
1870
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1871
  },
1872
  "hf_artifacts": {
1873
  "path": "hf_artifacts:docs/data/single_episode_task_model_radar.json",
1874
  "exists": true,
1875
- "bytes": 51327,
1876
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1877
  },
1878
  "hf_model_data": {
1879
  "path": "hf_model:data/single_episode_task_model_radar.json",
1880
  "exists": true,
1881
- "bytes": 51327,
1882
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1883
  },
1884
  "hf_model_docs_data": {
1885
  "path": "hf_model:docs/data/single_episode_task_model_radar.json",
1886
  "exists": true,
1887
- "bytes": 51327,
1888
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1889
  },
1890
  "hf_model": {
1891
  "path": "hf_model:metrics/single_episode_task_model_radar.json",
1892
  "exists": true,
1893
- "bytes": 51327,
1894
- "sha256": "791793dc1dc1655dcae8803300def7f9e7e2c84b5af6321cbb1561c76bff2e7c"
1895
  }
1896
  },
1897
  "failures": []
@@ -1902,45 +1902,45 @@
1902
  "local": {
1903
  "path": "repo:docs/data/episode128_task_model_radar.json",
1904
  "exists": true,
1905
- "bytes": 185212,
1906
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1907
  },
1908
  "mirrors": {
1909
  "hf_space": {
1910
  "path": "hf_space:data/episode128_task_model_radar.json",
1911
  "exists": true,
1912
- "bytes": 185212,
1913
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1914
  },
1915
  "hf_artifacts_data": {
1916
  "path": "hf_artifacts:data/episode128_task_model_radar.json",
1917
  "exists": true,
1918
- "bytes": 185212,
1919
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1920
  },
1921
  "hf_artifacts": {
1922
  "path": "hf_artifacts:docs/data/episode128_task_model_radar.json",
1923
  "exists": true,
1924
- "bytes": 185212,
1925
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1926
  },
1927
  "hf_model_data": {
1928
  "path": "hf_model:data/episode128_task_model_radar.json",
1929
  "exists": true,
1930
- "bytes": 185212,
1931
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1932
  },
1933
  "hf_model_docs_data": {
1934
  "path": "hf_model:docs/data/episode128_task_model_radar.json",
1935
  "exists": true,
1936
- "bytes": 185212,
1937
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1938
  },
1939
  "hf_model": {
1940
  "path": "hf_model:metrics/episode128_task_model_radar.json",
1941
  "exists": true,
1942
- "bytes": 185212,
1943
- "sha256": "f3007f5799e2bb92ab9891bcdfd1f0f63dfef6452198d44e2b3f8d38a5493859"
1944
  }
1945
  },
1946
  "failures": []
@@ -2197,44 +2197,44 @@
2197
  "path": "repo:docs/data/task_surface_integrity.json",
2198
  "exists": true,
2199
  "bytes": 46246,
2200
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2201
  },
2202
  "mirrors": {
2203
  "hf_space": {
2204
  "path": "hf_space:data/task_surface_integrity.json",
2205
  "exists": true,
2206
  "bytes": 46246,
2207
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2208
  },
2209
  "hf_artifacts_data": {
2210
  "path": "hf_artifacts:data/task_surface_integrity.json",
2211
  "exists": true,
2212
  "bytes": 46246,
2213
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2214
  },
2215
  "hf_artifacts": {
2216
  "path": "hf_artifacts:docs/data/task_surface_integrity.json",
2217
  "exists": true,
2218
  "bytes": 46246,
2219
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2220
  },
2221
  "hf_model_data": {
2222
  "path": "hf_model:data/task_surface_integrity.json",
2223
  "exists": true,
2224
  "bytes": 46246,
2225
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2226
  },
2227
  "hf_model_docs_data": {
2228
  "path": "hf_model:docs/data/task_surface_integrity.json",
2229
  "exists": true,
2230
  "bytes": 46246,
2231
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2232
  },
2233
  "hf_model": {
2234
  "path": "hf_model:metrics/task_surface_integrity.json",
2235
  "exists": true,
2236
  "bytes": 46246,
2237
- "sha256": "d4437d2ae8900c75122057be8739b4812da61b91f7495a97271844e9ad33d539"
2238
  }
2239
  },
2240
  "failures": []
@@ -2294,45 +2294,45 @@
2294
  "local": {
2295
  "path": "repo:docs/data/task_method_20_result_matrix.json",
2296
  "exists": true,
2297
- "bytes": 128509,
2298
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2299
  },
2300
  "mirrors": {
2301
  "hf_space": {
2302
  "path": "hf_space:data/task_method_20_result_matrix.json",
2303
  "exists": true,
2304
- "bytes": 128509,
2305
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2306
  },
2307
  "hf_artifacts_data": {
2308
  "path": "hf_artifacts:data/task_method_20_result_matrix.json",
2309
  "exists": true,
2310
- "bytes": 128509,
2311
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2312
  },
2313
  "hf_artifacts": {
2314
  "path": "hf_artifacts:docs/data/task_method_20_result_matrix.json",
2315
  "exists": true,
2316
- "bytes": 128509,
2317
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2318
  },
2319
  "hf_model_data": {
2320
  "path": "hf_model:data/task_method_20_result_matrix.json",
2321
  "exists": true,
2322
- "bytes": 128509,
2323
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2324
  },
2325
  "hf_model_docs_data": {
2326
  "path": "hf_model:docs/data/task_method_20_result_matrix.json",
2327
  "exists": true,
2328
- "bytes": 128509,
2329
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2330
  },
2331
  "hf_model": {
2332
  "path": "hf_model:metrics/task_method_20_result_matrix.json",
2333
  "exists": true,
2334
- "bytes": 128509,
2335
- "sha256": "d5002c1a36f0c4dc15f184de2480fb123aee5a80ae1f6c5a964c449c1c97cd54"
2336
  }
2337
  },
2338
  "failures": []
@@ -2490,45 +2490,45 @@
2490
  "local": {
2491
  "path": "repo:docs/data/unified_task_model_radar.json",
2492
  "exists": true,
2493
- "bytes": 229035,
2494
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2495
  },
2496
  "mirrors": {
2497
  "hf_space": {
2498
  "path": "hf_space:data/unified_task_model_radar.json",
2499
  "exists": true,
2500
- "bytes": 229035,
2501
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2502
  },
2503
  "hf_artifacts_data": {
2504
  "path": "hf_artifacts:data/unified_task_model_radar.json",
2505
  "exists": true,
2506
- "bytes": 229035,
2507
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2508
  },
2509
  "hf_artifacts": {
2510
  "path": "hf_artifacts:docs/data/unified_task_model_radar.json",
2511
  "exists": true,
2512
- "bytes": 229035,
2513
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2514
  },
2515
  "hf_model_data": {
2516
  "path": "hf_model:data/unified_task_model_radar.json",
2517
  "exists": true,
2518
- "bytes": 229035,
2519
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2520
  },
2521
  "hf_model_docs_data": {
2522
  "path": "hf_model:docs/data/unified_task_model_radar.json",
2523
  "exists": true,
2524
- "bytes": 229035,
2525
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2526
  },
2527
  "hf_model": {
2528
  "path": "hf_model:metrics/unified_task_model_radar.json",
2529
  "exists": true,
2530
- "bytes": 229035,
2531
- "sha256": "c810377ee743c0ad251eb77b1549d4e096c581fa861da46b9e0fd1f2dce46a22"
2532
  }
2533
  },
2534
  "failures": []
@@ -2540,44 +2540,44 @@
2540
  "path": "repo:docs/data/website_integrity.json",
2541
  "exists": true,
2542
  "bytes": 20178,
2543
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2544
  },
2545
  "mirrors": {
2546
  "hf_space": {
2547
  "path": "hf_space:data/website_integrity.json",
2548
  "exists": true,
2549
  "bytes": 20178,
2550
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2551
  },
2552
  "hf_artifacts_data": {
2553
  "path": "hf_artifacts:data/website_integrity.json",
2554
  "exists": true,
2555
  "bytes": 20178,
2556
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2557
  },
2558
  "hf_artifacts": {
2559
  "path": "hf_artifacts:docs/data/website_integrity.json",
2560
  "exists": true,
2561
  "bytes": 20178,
2562
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2563
  },
2564
  "hf_model_data": {
2565
  "path": "hf_model:data/website_integrity.json",
2566
  "exists": true,
2567
  "bytes": 20178,
2568
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2569
  },
2570
  "hf_model_docs_data": {
2571
  "path": "hf_model:docs/data/website_integrity.json",
2572
  "exists": true,
2573
  "bytes": 20178,
2574
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2575
  },
2576
  "hf_model": {
2577
  "path": "hf_model:metrics/website_integrity.json",
2578
  "exists": true,
2579
  "bytes": 20178,
2580
- "sha256": "d9e72c5e628c9ed808a7a4648e29fbca9a455ff2ae466abde9cabd27f2f87e6a"
2581
  }
2582
  },
2583
  "failures": []
@@ -2760,33 +2760,33 @@
2760
  "local": {
2761
  "path": "repo:docs/assets/charts/single_episode_task_model_radar.svg",
2762
  "exists": true,
2763
- "bytes": 35232,
2764
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2765
  },
2766
  "mirrors": {
2767
  "hf_space": {
2768
  "path": "hf_space:assets/charts/single_episode_task_model_radar.svg",
2769
  "exists": true,
2770
- "bytes": 35232,
2771
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2772
  },
2773
  "hf_artifacts_docs": {
2774
  "path": "hf_artifacts:docs/assets/charts/single_episode_task_model_radar.svg",
2775
  "exists": true,
2776
- "bytes": 35232,
2777
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2778
  },
2779
  "hf_artifacts_card": {
2780
  "path": "hf_artifacts:assets/charts/single_episode_task_model_radar.svg",
2781
  "exists": true,
2782
- "bytes": 35232,
2783
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2784
  },
2785
  "hf_model": {
2786
  "path": "hf_model:assets/charts/single_episode_task_model_radar.svg",
2787
  "exists": true,
2788
- "bytes": 35232,
2789
- "sha256": "87b52a7dead40358f1778dda43ade4d2e875ac98e507e01ca007084363e5977e"
2790
  }
2791
  },
2792
  "failures": []
@@ -2797,33 +2797,33 @@
2797
  "local": {
2798
  "path": "repo:docs/assets/charts/episode128_task_model_radar.svg",
2799
  "exists": true,
2800
- "bytes": 51915,
2801
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2802
  },
2803
  "mirrors": {
2804
  "hf_space": {
2805
  "path": "hf_space:assets/charts/episode128_task_model_radar.svg",
2806
  "exists": true,
2807
- "bytes": 51915,
2808
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2809
  },
2810
  "hf_artifacts_docs": {
2811
  "path": "hf_artifacts:docs/assets/charts/episode128_task_model_radar.svg",
2812
  "exists": true,
2813
- "bytes": 51915,
2814
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2815
  },
2816
  "hf_artifacts_card": {
2817
  "path": "hf_artifacts:assets/charts/episode128_task_model_radar.svg",
2818
  "exists": true,
2819
- "bytes": 51915,
2820
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2821
  },
2822
  "hf_model": {
2823
  "path": "hf_model:assets/charts/episode128_task_model_radar.svg",
2824
  "exists": true,
2825
- "bytes": 51915,
2826
- "sha256": "047ea4b05a04f6734e2afcf792863559dc8f3091eae88a97ff90e8b038a423f4"
2827
  }
2828
  },
2829
  "failures": []
@@ -2871,33 +2871,33 @@
2871
  "local": {
2872
  "path": "repo:docs/assets/charts/unified_task_model_radar.svg",
2873
  "exists": true,
2874
- "bytes": 57938,
2875
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2876
  },
2877
  "mirrors": {
2878
  "hf_space": {
2879
  "path": "hf_space:assets/charts/unified_task_model_radar.svg",
2880
  "exists": true,
2881
- "bytes": 57938,
2882
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2883
  },
2884
  "hf_artifacts_docs": {
2885
  "path": "hf_artifacts:docs/assets/charts/unified_task_model_radar.svg",
2886
  "exists": true,
2887
- "bytes": 57938,
2888
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2889
  },
2890
  "hf_artifacts_card": {
2891
  "path": "hf_artifacts:assets/charts/unified_task_model_radar.svg",
2892
  "exists": true,
2893
- "bytes": 57938,
2894
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2895
  },
2896
  "hf_model": {
2897
  "path": "hf_model:assets/charts/unified_task_model_radar.svg",
2898
  "exists": true,
2899
- "bytes": 57938,
2900
- "sha256": "bb83b80b47fe679ebdce2c99378a4548120f1c8cc2d725b88e409d8c386dcbf8"
2901
  }
2902
  },
2903
  "failures": []
@@ -5327,21 +5327,21 @@
5327
  "local": {
5328
  "path": "repo:scripts/build_artifact_index.py",
5329
  "exists": true,
5330
- "bytes": 68279,
5331
- "sha256": "69b43ad5d3dc5a6893c4592fa47fff6a7a87691728ec2c61b121ec262d00bf2a"
5332
  },
5333
  "mirrors": {
5334
  "hf_artifacts": {
5335
  "path": "hf_artifacts:scripts/build_artifact_index.py",
5336
  "exists": true,
5337
- "bytes": 68279,
5338
- "sha256": "69b43ad5d3dc5a6893c4592fa47fff6a7a87691728ec2c61b121ec262d00bf2a"
5339
  },
5340
  "hf_model": {
5341
  "path": "hf_model:scripts/build_artifact_index.py",
5342
  "exists": true,
5343
- "bytes": 68279,
5344
- "sha256": "69b43ad5d3dc5a6893c4592fa47fff6a7a87691728ec2c61b121ec262d00bf2a"
5345
  }
5346
  },
5347
  "failures": []
@@ -5402,21 +5402,21 @@
5402
  "local": {
5403
  "path": "repo:scripts/build_figure_index.py",
5404
  "exists": true,
5405
- "bytes": 16845,
5406
- "sha256": "3f91f7f13a3fb08ab57c2f0a6b320102e9d5ae19b102b71499edb5b8fd5a2cec"
5407
  },
5408
  "mirrors": {
5409
  "hf_artifacts": {
5410
  "path": "hf_artifacts:scripts/build_figure_index.py",
5411
  "exists": true,
5412
- "bytes": 16845,
5413
- "sha256": "3f91f7f13a3fb08ab57c2f0a6b320102e9d5ae19b102b71499edb5b8fd5a2cec"
5414
  },
5415
  "hf_model": {
5416
  "path": "hf_model:scripts/build_figure_index.py",
5417
  "exists": true,
5418
- "bytes": 16845,
5419
- "sha256": "3f91f7f13a3fb08ab57c2f0a6b320102e9d5ae19b102b71499edb5b8fd5a2cec"
5420
  }
5421
  },
5422
  "failures": []
@@ -5877,21 +5877,21 @@
5877
  "local": {
5878
  "path": "repo:scripts/build_unified_task_model_radar.py",
5879
  "exists": true,
5880
- "bytes": 69956,
5881
- "sha256": "3d137eced3c376ed674090b823ff9439cea0e8e8670ea9de02c3d3781305a0c9"
5882
  },
5883
  "mirrors": {
5884
  "hf_artifacts": {
5885
  "path": "hf_artifacts:scripts/build_unified_task_model_radar.py",
5886
  "exists": true,
5887
- "bytes": 69956,
5888
- "sha256": "3d137eced3c376ed674090b823ff9439cea0e8e8670ea9de02c3d3781305a0c9"
5889
  },
5890
  "hf_model": {
5891
  "path": "hf_model:scripts/build_unified_task_model_radar.py",
5892
  "exists": true,
5893
- "bytes": 69956,
5894
- "sha256": "3d137eced3c376ed674090b823ff9439cea0e8e8670ea9de02c3d3781305a0c9"
5895
  }
5896
  },
5897
  "failures": []
@@ -6331,39 +6331,39 @@
6331
  "local": {
6332
  "path": "repo:docs/index.html",
6333
  "exists": true,
6334
- "bytes": 348267,
6335
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6336
  },
6337
  "mirrors": {
6338
  "hf_space": {
6339
  "path": "hf_space:index.html",
6340
  "exists": true,
6341
- "bytes": 348267,
6342
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6343
  },
6344
  "hf_artifacts_root": {
6345
  "path": "hf_artifacts:index.html",
6346
  "exists": true,
6347
- "bytes": 348267,
6348
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6349
  },
6350
  "hf_artifacts_docs": {
6351
  "path": "hf_artifacts:docs/index.html",
6352
  "exists": true,
6353
- "bytes": 348267,
6354
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6355
  },
6356
  "hf_model": {
6357
  "path": "hf_model:index.html",
6358
  "exists": true,
6359
- "bytes": 348267,
6360
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6361
  },
6362
  "hf_model_docs": {
6363
  "path": "hf_model:docs/index.html",
6364
  "exists": true,
6365
- "bytes": 348267,
6366
- "sha256": "8e6ac665f1a0e810667379733013a906d3f13a390bceb4b41594769aa773626d"
6367
  }
6368
  },
6369
  "failures": []
@@ -31027,27 +31027,27 @@
31027
  "local": {
31028
  "path": "repo:FIGURE_INDEX.md",
31029
  "exists": true,
31030
- "bytes": 7027,
31031
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
31032
  },
31033
  "mirrors": {
31034
  "hf_space": {
31035
  "path": "hf_space:FIGURE_INDEX.md",
31036
  "exists": true,
31037
- "bytes": 7027,
31038
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
31039
  },
31040
  "hf_artifacts": {
31041
  "path": "hf_artifacts:FIGURE_INDEX.md",
31042
  "exists": true,
31043
- "bytes": 7027,
31044
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
31045
  },
31046
  "hf_model": {
31047
  "path": "hf_model:FIGURE_INDEX.md",
31048
  "exists": true,
31049
- "bytes": 7027,
31050
- "sha256": "b7b507c35cd3cba2765586e9703a447c8025c89658c3daa390df67db4211d0fc"
31051
  }
31052
  },
31053
  "failures": []
@@ -31554,27 +31554,27 @@
31554
  "local": {
31555
  "path": "repo:TASK_METHOD_20_RESULT_MATRIX.md",
31556
  "exists": true,
31557
- "bytes": 14715,
31558
- "sha256": "acf665d5c07f15224b42d9da93bc2586279852bd60241868391ba7c0a462a24f"
31559
  },
31560
  "mirrors": {
31561
  "hf_space": {
31562
  "path": "hf_space:TASK_METHOD_20_RESULT_MATRIX.md",
31563
  "exists": true,
31564
- "bytes": 14715,
31565
- "sha256": "acf665d5c07f15224b42d9da93bc2586279852bd60241868391ba7c0a462a24f"
31566
  },
31567
  "hf_artifacts": {
31568
  "path": "hf_artifacts:TASK_METHOD_20_RESULT_MATRIX.md",
31569
  "exists": true,
31570
- "bytes": 14715,
31571
- "sha256": "acf665d5c07f15224b42d9da93bc2586279852bd60241868391ba7c0a462a24f"
31572
  },
31573
  "hf_model": {
31574
  "path": "hf_model:TASK_METHOD_20_RESULT_MATRIX.md",
31575
  "exists": true,
31576
- "bytes": 14715,
31577
- "sha256": "acf665d5c07f15224b42d9da93bc2586279852bd60241868391ba7c0a462a24f"
31578
  }
31579
  },
31580
  "failures": []
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T20:33:57+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
  "group_count": 1258,
 
138
  "local": {
139
  "path": "repo:docs/data/artifact_index.json",
140
  "exists": true,
141
+ "bytes": 124471,
142
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
143
  },
144
  "mirrors": {
145
  "hf_space": {
146
  "path": "hf_space:data/artifact_index.json",
147
  "exists": true,
148
+ "bytes": 124471,
149
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
150
  },
151
  "hf_artifacts_data": {
152
  "path": "hf_artifacts:data/artifact_index.json",
153
  "exists": true,
154
+ "bytes": 124471,
155
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
156
  },
157
  "hf_artifacts": {
158
  "path": "hf_artifacts:docs/data/artifact_index.json",
159
  "exists": true,
160
+ "bytes": 124471,
161
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
162
  },
163
  "hf_model_data": {
164
  "path": "hf_model:data/artifact_index.json",
165
  "exists": true,
166
+ "bytes": 124471,
167
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
168
  },
169
  "hf_model_docs_data": {
170
  "path": "hf_model:docs/data/artifact_index.json",
171
  "exists": true,
172
+ "bytes": 124471,
173
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
174
  },
175
  "hf_model": {
176
  "path": "hf_model:metrics/artifact_index.json",
177
  "exists": true,
178
+ "bytes": 124471,
179
+ "sha256": "8e9954b1de5bbd935a1ed0c6753f737d9a2a325c0bd85377087751ce198afeeb"
180
  }
181
  },
182
  "failures": []
 
334
  "local": {
335
  "path": "repo:docs/data/figure_index.json",
336
  "exists": true,
337
+ "bytes": 19526,
338
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
339
  },
340
  "mirrors": {
341
  "hf_space": {
342
  "path": "hf_space:data/figure_index.json",
343
  "exists": true,
344
+ "bytes": 19526,
345
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
346
  },
347
  "hf_artifacts_data": {
348
  "path": "hf_artifacts:data/figure_index.json",
349
  "exists": true,
350
+ "bytes": 19526,
351
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
352
  },
353
  "hf_artifacts": {
354
  "path": "hf_artifacts:docs/data/figure_index.json",
355
  "exists": true,
356
+ "bytes": 19526,
357
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
358
  },
359
  "hf_model_data": {
360
  "path": "hf_model:data/figure_index.json",
361
  "exists": true,
362
+ "bytes": 19526,
363
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
364
  },
365
  "hf_model_docs_data": {
366
  "path": "hf_model:docs/data/figure_index.json",
367
  "exists": true,
368
+ "bytes": 19526,
369
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
370
  },
371
  "hf_model": {
372
  "path": "hf_model:metrics/figure_index.json",
373
  "exists": true,
374
+ "bytes": 19526,
375
+ "sha256": "2216a4c718a2ad7aba2a6f3f30b2d699c77dea6a867219df40f2e7a83a467eb4"
376
  }
377
  },
378
  "failures": []
 
972
  "path": "repo:docs/data/publication_audit.json",
973
  "exists": true,
974
  "bytes": 10940,
975
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
976
  },
977
  "mirrors": {
978
  "hf_space": {
979
  "path": "hf_space:data/publication_audit.json",
980
  "exists": true,
981
  "bytes": 10940,
982
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
983
  },
984
  "hf_artifacts_data": {
985
  "path": "hf_artifacts:data/publication_audit.json",
986
  "exists": true,
987
  "bytes": 10940,
988
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
989
  },
990
  "hf_artifacts": {
991
  "path": "hf_artifacts:docs/data/publication_audit.json",
992
  "exists": true,
993
  "bytes": 10940,
994
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
995
  },
996
  "hf_model_data": {
997
  "path": "hf_model:data/publication_audit.json",
998
  "exists": true,
999
  "bytes": 10940,
1000
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
1001
  },
1002
  "hf_model_docs_data": {
1003
  "path": "hf_model:docs/data/publication_audit.json",
1004
  "exists": true,
1005
  "bytes": 10940,
1006
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
1007
  },
1008
  "hf_model": {
1009
  "path": "hf_model:metrics/publication_audit.json",
1010
  "exists": true,
1011
  "bytes": 10940,
1012
+ "sha256": "f5a4c25846e1a8d730f4dc540c1c4fb56bd204c7de3d426b7759e72529009a06"
1013
  }
1014
  },
1015
  "failures": []
 
1021
  "path": "repo:docs/data/public_surface_qa.json",
1022
  "exists": true,
1023
  "bytes": 7693,
1024
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1025
  },
1026
  "mirrors": {
1027
  "hf_space": {
1028
  "path": "hf_space:data/public_surface_qa.json",
1029
  "exists": true,
1030
  "bytes": 7693,
1031
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1032
  },
1033
  "hf_artifacts_data": {
1034
  "path": "hf_artifacts:data/public_surface_qa.json",
1035
  "exists": true,
1036
  "bytes": 7693,
1037
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1038
  },
1039
  "hf_artifacts": {
1040
  "path": "hf_artifacts:docs/data/public_surface_qa.json",
1041
  "exists": true,
1042
  "bytes": 7693,
1043
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1044
  },
1045
  "hf_model_data": {
1046
  "path": "hf_model:data/public_surface_qa.json",
1047
  "exists": true,
1048
  "bytes": 7693,
1049
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1050
  },
1051
  "hf_model_docs_data": {
1052
  "path": "hf_model:docs/data/public_surface_qa.json",
1053
  "exists": true,
1054
  "bytes": 7693,
1055
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1056
  },
1057
  "hf_model": {
1058
  "path": "hf_model:metrics/public_surface_qa.json",
1059
  "exists": true,
1060
  "bytes": 7693,
1061
+ "sha256": "c3c9cab63c92026e5cf5265d5113d0bc42beec3664a71880c8471cc3305a8b04"
1062
  }
1063
  },
1064
  "failures": []
 
1217
  "path": "repo:docs/data/quality_gates.json",
1218
  "exists": true,
1219
  "bytes": 8640,
1220
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1221
  },
1222
  "mirrors": {
1223
  "hf_space": {
1224
  "path": "hf_space:data/quality_gates.json",
1225
  "exists": true,
1226
  "bytes": 8640,
1227
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1228
  },
1229
  "hf_artifacts_data": {
1230
  "path": "hf_artifacts:data/quality_gates.json",
1231
  "exists": true,
1232
  "bytes": 8640,
1233
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1234
  },
1235
  "hf_artifacts": {
1236
  "path": "hf_artifacts:docs/data/quality_gates.json",
1237
  "exists": true,
1238
  "bytes": 8640,
1239
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1240
  },
1241
  "hf_model_data": {
1242
  "path": "hf_model:data/quality_gates.json",
1243
  "exists": true,
1244
  "bytes": 8640,
1245
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1246
  },
1247
  "hf_model_docs_data": {
1248
  "path": "hf_model:docs/data/quality_gates.json",
1249
  "exists": true,
1250
  "bytes": 8640,
1251
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1252
  },
1253
  "hf_model": {
1254
  "path": "hf_model:metrics/quality_gates.json",
1255
  "exists": true,
1256
  "bytes": 8640,
1257
+ "sha256": "87c26049df761996430f1ca6ef25ee53400387f907f356dcea6b3fcc7d6dac82"
1258
  }
1259
  },
1260
  "failures": []
 
1658
  "path": "repo:docs/data/scope_claims_audit.json",
1659
  "exists": true,
1660
  "bytes": 21313,
1661
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1662
  },
1663
  "mirrors": {
1664
  "hf_space": {
1665
  "path": "hf_space:data/scope_claims_audit.json",
1666
  "exists": true,
1667
  "bytes": 21313,
1668
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1669
  },
1670
  "hf_artifacts_data": {
1671
  "path": "hf_artifacts:data/scope_claims_audit.json",
1672
  "exists": true,
1673
  "bytes": 21313,
1674
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1675
  },
1676
  "hf_artifacts": {
1677
  "path": "hf_artifacts:docs/data/scope_claims_audit.json",
1678
  "exists": true,
1679
  "bytes": 21313,
1680
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1681
  },
1682
  "hf_model_data": {
1683
  "path": "hf_model:data/scope_claims_audit.json",
1684
  "exists": true,
1685
  "bytes": 21313,
1686
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1687
  },
1688
  "hf_model_docs_data": {
1689
  "path": "hf_model:docs/data/scope_claims_audit.json",
1690
  "exists": true,
1691
  "bytes": 21313,
1692
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1693
  },
1694
  "hf_model": {
1695
  "path": "hf_model:metrics/scope_claims_audit.json",
1696
  "exists": true,
1697
  "bytes": 21313,
1698
+ "sha256": "b997454e0e9bb382da0afcdec319086081213d176546306e13a7d47d42e8957c"
1699
  }
1700
  },
1701
  "failures": []
 
1756
  "path": "repo:docs/data/source_alignment_audit.json",
1757
  "exists": true,
1758
  "bytes": 4432,
1759
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1760
  },
1761
  "mirrors": {
1762
  "hf_space": {
1763
  "path": "hf_space:data/source_alignment_audit.json",
1764
  "exists": true,
1765
  "bytes": 4432,
1766
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1767
  },
1768
  "hf_artifacts_data": {
1769
  "path": "hf_artifacts:data/source_alignment_audit.json",
1770
  "exists": true,
1771
  "bytes": 4432,
1772
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1773
  },
1774
  "hf_artifacts": {
1775
  "path": "hf_artifacts:docs/data/source_alignment_audit.json",
1776
  "exists": true,
1777
  "bytes": 4432,
1778
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1779
  },
1780
  "hf_model_data": {
1781
  "path": "hf_model:data/source_alignment_audit.json",
1782
  "exists": true,
1783
  "bytes": 4432,
1784
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1785
  },
1786
  "hf_model_docs_data": {
1787
  "path": "hf_model:docs/data/source_alignment_audit.json",
1788
  "exists": true,
1789
  "bytes": 4432,
1790
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1791
  },
1792
  "hf_model": {
1793
  "path": "hf_model:metrics/source_alignment_audit.json",
1794
  "exists": true,
1795
  "bytes": 4432,
1796
+ "sha256": "2f3d03a1c56ebb1b220574e565a05d35705a10a22d1cb7ecdcb1452630574bc9"
1797
  }
1798
  },
1799
  "failures": []
 
1853
  "local": {
1854
  "path": "repo:docs/data/single_episode_task_model_radar.json",
1855
  "exists": true,
1856
+ "bytes": 52866,
1857
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1858
  },
1859
  "mirrors": {
1860
  "hf_space": {
1861
  "path": "hf_space:data/single_episode_task_model_radar.json",
1862
  "exists": true,
1863
+ "bytes": 52866,
1864
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1865
  },
1866
  "hf_artifacts_data": {
1867
  "path": "hf_artifacts:data/single_episode_task_model_radar.json",
1868
  "exists": true,
1869
+ "bytes": 52866,
1870
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1871
  },
1872
  "hf_artifacts": {
1873
  "path": "hf_artifacts:docs/data/single_episode_task_model_radar.json",
1874
  "exists": true,
1875
+ "bytes": 52866,
1876
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1877
  },
1878
  "hf_model_data": {
1879
  "path": "hf_model:data/single_episode_task_model_radar.json",
1880
  "exists": true,
1881
+ "bytes": 52866,
1882
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1883
  },
1884
  "hf_model_docs_data": {
1885
  "path": "hf_model:docs/data/single_episode_task_model_radar.json",
1886
  "exists": true,
1887
+ "bytes": 52866,
1888
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1889
  },
1890
  "hf_model": {
1891
  "path": "hf_model:metrics/single_episode_task_model_radar.json",
1892
  "exists": true,
1893
+ "bytes": 52866,
1894
+ "sha256": "804273420658eae853df3d1c1f45f83040fb09551c74c203bb833d73b6c2a4c1"
1895
  }
1896
  },
1897
  "failures": []
 
1902
  "local": {
1903
  "path": "repo:docs/data/episode128_task_model_radar.json",
1904
  "exists": true,
1905
+ "bytes": 186997,
1906
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1907
  },
1908
  "mirrors": {
1909
  "hf_space": {
1910
  "path": "hf_space:data/episode128_task_model_radar.json",
1911
  "exists": true,
1912
+ "bytes": 186997,
1913
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1914
  },
1915
  "hf_artifacts_data": {
1916
  "path": "hf_artifacts:data/episode128_task_model_radar.json",
1917
  "exists": true,
1918
+ "bytes": 186997,
1919
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1920
  },
1921
  "hf_artifacts": {
1922
  "path": "hf_artifacts:docs/data/episode128_task_model_radar.json",
1923
  "exists": true,
1924
+ "bytes": 186997,
1925
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1926
  },
1927
  "hf_model_data": {
1928
  "path": "hf_model:data/episode128_task_model_radar.json",
1929
  "exists": true,
1930
+ "bytes": 186997,
1931
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1932
  },
1933
  "hf_model_docs_data": {
1934
  "path": "hf_model:docs/data/episode128_task_model_radar.json",
1935
  "exists": true,
1936
+ "bytes": 186997,
1937
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1938
  },
1939
  "hf_model": {
1940
  "path": "hf_model:metrics/episode128_task_model_radar.json",
1941
  "exists": true,
1942
+ "bytes": 186997,
1943
+ "sha256": "d15869d177f5a9cc37e20d0cab62fdc1be61d596975fee13a5d1d61108c7176a"
1944
  }
1945
  },
1946
  "failures": []
 
2197
  "path": "repo:docs/data/task_surface_integrity.json",
2198
  "exists": true,
2199
  "bytes": 46246,
2200
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2201
  },
2202
  "mirrors": {
2203
  "hf_space": {
2204
  "path": "hf_space:data/task_surface_integrity.json",
2205
  "exists": true,
2206
  "bytes": 46246,
2207
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2208
  },
2209
  "hf_artifacts_data": {
2210
  "path": "hf_artifacts:data/task_surface_integrity.json",
2211
  "exists": true,
2212
  "bytes": 46246,
2213
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2214
  },
2215
  "hf_artifacts": {
2216
  "path": "hf_artifacts:docs/data/task_surface_integrity.json",
2217
  "exists": true,
2218
  "bytes": 46246,
2219
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2220
  },
2221
  "hf_model_data": {
2222
  "path": "hf_model:data/task_surface_integrity.json",
2223
  "exists": true,
2224
  "bytes": 46246,
2225
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2226
  },
2227
  "hf_model_docs_data": {
2228
  "path": "hf_model:docs/data/task_surface_integrity.json",
2229
  "exists": true,
2230
  "bytes": 46246,
2231
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2232
  },
2233
  "hf_model": {
2234
  "path": "hf_model:metrics/task_surface_integrity.json",
2235
  "exists": true,
2236
  "bytes": 46246,
2237
+ "sha256": "e85510c4a7df9e1fd43651d18291b51b1ad2c3ab0b5042d4edc2a598a1929556"
2238
  }
2239
  },
2240
  "failures": []
 
2294
  "local": {
2295
  "path": "repo:docs/data/task_method_20_result_matrix.json",
2296
  "exists": true,
2297
+ "bytes": 128991,
2298
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2299
  },
2300
  "mirrors": {
2301
  "hf_space": {
2302
  "path": "hf_space:data/task_method_20_result_matrix.json",
2303
  "exists": true,
2304
+ "bytes": 128991,
2305
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2306
  },
2307
  "hf_artifacts_data": {
2308
  "path": "hf_artifacts:data/task_method_20_result_matrix.json",
2309
  "exists": true,
2310
+ "bytes": 128991,
2311
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2312
  },
2313
  "hf_artifacts": {
2314
  "path": "hf_artifacts:docs/data/task_method_20_result_matrix.json",
2315
  "exists": true,
2316
+ "bytes": 128991,
2317
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2318
  },
2319
  "hf_model_data": {
2320
  "path": "hf_model:data/task_method_20_result_matrix.json",
2321
  "exists": true,
2322
+ "bytes": 128991,
2323
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2324
  },
2325
  "hf_model_docs_data": {
2326
  "path": "hf_model:docs/data/task_method_20_result_matrix.json",
2327
  "exists": true,
2328
+ "bytes": 128991,
2329
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2330
  },
2331
  "hf_model": {
2332
  "path": "hf_model:metrics/task_method_20_result_matrix.json",
2333
  "exists": true,
2334
+ "bytes": 128991,
2335
+ "sha256": "c070f4430cff2583e150cb66ed75b5759ee985429af7aa22a8a83c14a5a8841f"
2336
  }
2337
  },
2338
  "failures": []
 
2490
  "local": {
2491
  "path": "repo:docs/data/unified_task_model_radar.json",
2492
  "exists": true,
2493
+ "bytes": 230938,
2494
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2495
  },
2496
  "mirrors": {
2497
  "hf_space": {
2498
  "path": "hf_space:data/unified_task_model_radar.json",
2499
  "exists": true,
2500
+ "bytes": 230938,
2501
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2502
  },
2503
  "hf_artifacts_data": {
2504
  "path": "hf_artifacts:data/unified_task_model_radar.json",
2505
  "exists": true,
2506
+ "bytes": 230938,
2507
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2508
  },
2509
  "hf_artifacts": {
2510
  "path": "hf_artifacts:docs/data/unified_task_model_radar.json",
2511
  "exists": true,
2512
+ "bytes": 230938,
2513
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2514
  },
2515
  "hf_model_data": {
2516
  "path": "hf_model:data/unified_task_model_radar.json",
2517
  "exists": true,
2518
+ "bytes": 230938,
2519
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2520
  },
2521
  "hf_model_docs_data": {
2522
  "path": "hf_model:docs/data/unified_task_model_radar.json",
2523
  "exists": true,
2524
+ "bytes": 230938,
2525
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2526
  },
2527
  "hf_model": {
2528
  "path": "hf_model:metrics/unified_task_model_radar.json",
2529
  "exists": true,
2530
+ "bytes": 230938,
2531
+ "sha256": "e9523c3f97d6493058030ad05c3a8a80c51a11727fbf8eb936c405ac986531c1"
2532
  }
2533
  },
2534
  "failures": []
 
2540
  "path": "repo:docs/data/website_integrity.json",
2541
  "exists": true,
2542
  "bytes": 20178,
2543
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2544
  },
2545
  "mirrors": {
2546
  "hf_space": {
2547
  "path": "hf_space:data/website_integrity.json",
2548
  "exists": true,
2549
  "bytes": 20178,
2550
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2551
  },
2552
  "hf_artifacts_data": {
2553
  "path": "hf_artifacts:data/website_integrity.json",
2554
  "exists": true,
2555
  "bytes": 20178,
2556
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2557
  },
2558
  "hf_artifacts": {
2559
  "path": "hf_artifacts:docs/data/website_integrity.json",
2560
  "exists": true,
2561
  "bytes": 20178,
2562
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2563
  },
2564
  "hf_model_data": {
2565
  "path": "hf_model:data/website_integrity.json",
2566
  "exists": true,
2567
  "bytes": 20178,
2568
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2569
  },
2570
  "hf_model_docs_data": {
2571
  "path": "hf_model:docs/data/website_integrity.json",
2572
  "exists": true,
2573
  "bytes": 20178,
2574
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2575
  },
2576
  "hf_model": {
2577
  "path": "hf_model:metrics/website_integrity.json",
2578
  "exists": true,
2579
  "bytes": 20178,
2580
+ "sha256": "b24c99ae7b5f9ae9fd48f04f39728979aabec7d79eaad1a0ed4437e02a7cb218"
2581
  }
2582
  },
2583
  "failures": []
 
2760
  "local": {
2761
  "path": "repo:docs/assets/charts/single_episode_task_model_radar.svg",
2762
  "exists": true,
2763
+ "bytes": 36930,
2764
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2765
  },
2766
  "mirrors": {
2767
  "hf_space": {
2768
  "path": "hf_space:assets/charts/single_episode_task_model_radar.svg",
2769
  "exists": true,
2770
+ "bytes": 36930,
2771
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2772
  },
2773
  "hf_artifacts_docs": {
2774
  "path": "hf_artifacts:docs/assets/charts/single_episode_task_model_radar.svg",
2775
  "exists": true,
2776
+ "bytes": 36930,
2777
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2778
  },
2779
  "hf_artifacts_card": {
2780
  "path": "hf_artifacts:assets/charts/single_episode_task_model_radar.svg",
2781
  "exists": true,
2782
+ "bytes": 36930,
2783
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2784
  },
2785
  "hf_model": {
2786
  "path": "hf_model:assets/charts/single_episode_task_model_radar.svg",
2787
  "exists": true,
2788
+ "bytes": 36930,
2789
+ "sha256": "96e609b0577e66db0ee8c63939c11b1fb28018285a1d259362de0bff415cc939"
2790
  }
2791
  },
2792
  "failures": []
 
2797
  "local": {
2798
  "path": "repo:docs/assets/charts/episode128_task_model_radar.svg",
2799
  "exists": true,
2800
+ "bytes": 79370,
2801
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2802
  },
2803
  "mirrors": {
2804
  "hf_space": {
2805
  "path": "hf_space:assets/charts/episode128_task_model_radar.svg",
2806
  "exists": true,
2807
+ "bytes": 79370,
2808
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2809
  },
2810
  "hf_artifacts_docs": {
2811
  "path": "hf_artifacts:docs/assets/charts/episode128_task_model_radar.svg",
2812
  "exists": true,
2813
+ "bytes": 79370,
2814
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2815
  },
2816
  "hf_artifacts_card": {
2817
  "path": "hf_artifacts:assets/charts/episode128_task_model_radar.svg",
2818
  "exists": true,
2819
+ "bytes": 79370,
2820
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2821
  },
2822
  "hf_model": {
2823
  "path": "hf_model:assets/charts/episode128_task_model_radar.svg",
2824
  "exists": true,
2825
+ "bytes": 79370,
2826
+ "sha256": "5151c8aca22bd4aeda60b143b1164c1d1b9eb4babbeabf6da598701ccbbbf5c9"
2827
  }
2828
  },
2829
  "failures": []
 
2871
  "local": {
2872
  "path": "repo:docs/assets/charts/unified_task_model_radar.svg",
2873
  "exists": true,
2874
+ "bytes": 98527,
2875
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2876
  },
2877
  "mirrors": {
2878
  "hf_space": {
2879
  "path": "hf_space:assets/charts/unified_task_model_radar.svg",
2880
  "exists": true,
2881
+ "bytes": 98527,
2882
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2883
  },
2884
  "hf_artifacts_docs": {
2885
  "path": "hf_artifacts:docs/assets/charts/unified_task_model_radar.svg",
2886
  "exists": true,
2887
+ "bytes": 98527,
2888
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2889
  },
2890
  "hf_artifacts_card": {
2891
  "path": "hf_artifacts:assets/charts/unified_task_model_radar.svg",
2892
  "exists": true,
2893
+ "bytes": 98527,
2894
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2895
  },
2896
  "hf_model": {
2897
  "path": "hf_model:assets/charts/unified_task_model_radar.svg",
2898
  "exists": true,
2899
+ "bytes": 98527,
2900
+ "sha256": "5b034b22d2a772a57e7db50f300cb70d00bd31ac89d0c039c16ac8c23a5137ec"
2901
  }
2902
  },
2903
  "failures": []
 
5327
  "local": {
5328
  "path": "repo:scripts/build_artifact_index.py",
5329
  "exists": true,
5330
+ "bytes": 68407,
5331
+ "sha256": "9f5fc1af7c4c23083975066691bf6d75aa99280961198e2c47be88754c142418"
5332
  },
5333
  "mirrors": {
5334
  "hf_artifacts": {
5335
  "path": "hf_artifacts:scripts/build_artifact_index.py",
5336
  "exists": true,
5337
+ "bytes": 68407,
5338
+ "sha256": "9f5fc1af7c4c23083975066691bf6d75aa99280961198e2c47be88754c142418"
5339
  },
5340
  "hf_model": {
5341
  "path": "hf_model:scripts/build_artifact_index.py",
5342
  "exists": true,
5343
+ "bytes": 68407,
5344
+ "sha256": "9f5fc1af7c4c23083975066691bf6d75aa99280961198e2c47be88754c142418"
5345
  }
5346
  },
5347
  "failures": []
 
5402
  "local": {
5403
  "path": "repo:scripts/build_figure_index.py",
5404
  "exists": true,
5405
+ "bytes": 16886,
5406
+ "sha256": "fc44ea60b6f491d290a56d4c1097fcfc37c024f8a9bc3b3db013252f45d96e64"
5407
  },
5408
  "mirrors": {
5409
  "hf_artifacts": {
5410
  "path": "hf_artifacts:scripts/build_figure_index.py",
5411
  "exists": true,
5412
+ "bytes": 16886,
5413
+ "sha256": "fc44ea60b6f491d290a56d4c1097fcfc37c024f8a9bc3b3db013252f45d96e64"
5414
  },
5415
  "hf_model": {
5416
  "path": "hf_model:scripts/build_figure_index.py",
5417
  "exists": true,
5418
+ "bytes": 16886,
5419
+ "sha256": "fc44ea60b6f491d290a56d4c1097fcfc37c024f8a9bc3b3db013252f45d96e64"
5420
  }
5421
  },
5422
  "failures": []
 
5877
  "local": {
5878
  "path": "repo:scripts/build_unified_task_model_radar.py",
5879
  "exists": true,
5880
+ "bytes": 78735,
5881
+ "sha256": "8e7110381d56e5ef0effb7dcc7935d1bb0ce2653289504a50d04f29571a4e6b7"
5882
  },
5883
  "mirrors": {
5884
  "hf_artifacts": {
5885
  "path": "hf_artifacts:scripts/build_unified_task_model_radar.py",
5886
  "exists": true,
5887
+ "bytes": 78735,
5888
+ "sha256": "8e7110381d56e5ef0effb7dcc7935d1bb0ce2653289504a50d04f29571a4e6b7"
5889
  },
5890
  "hf_model": {
5891
  "path": "hf_model:scripts/build_unified_task_model_radar.py",
5892
  "exists": true,
5893
+ "bytes": 78735,
5894
+ "sha256": "8e7110381d56e5ef0effb7dcc7935d1bb0ce2653289504a50d04f29571a4e6b7"
5895
  }
5896
  },
5897
  "failures": []
 
6331
  "local": {
6332
  "path": "repo:docs/index.html",
6333
  "exists": true,
6334
+ "bytes": 348384,
6335
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6336
  },
6337
  "mirrors": {
6338
  "hf_space": {
6339
  "path": "hf_space:index.html",
6340
  "exists": true,
6341
+ "bytes": 348384,
6342
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6343
  },
6344
  "hf_artifacts_root": {
6345
  "path": "hf_artifacts:index.html",
6346
  "exists": true,
6347
+ "bytes": 348384,
6348
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6349
  },
6350
  "hf_artifacts_docs": {
6351
  "path": "hf_artifacts:docs/index.html",
6352
  "exists": true,
6353
+ "bytes": 348384,
6354
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6355
  },
6356
  "hf_model": {
6357
  "path": "hf_model:index.html",
6358
  "exists": true,
6359
+ "bytes": 348384,
6360
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6361
  },
6362
  "hf_model_docs": {
6363
  "path": "hf_model:docs/index.html",
6364
  "exists": true,
6365
+ "bytes": 348384,
6366
+ "sha256": "6f6b2c6b40e8d38210fd2e0c04750732afb5179996a663185f2525442c9c957a"
6367
  }
6368
  },
6369
  "failures": []
 
31027
  "local": {
31028
  "path": "repo:FIGURE_INDEX.md",
31029
  "exists": true,
31030
+ "bytes": 7068,
31031
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
31032
  },
31033
  "mirrors": {
31034
  "hf_space": {
31035
  "path": "hf_space:FIGURE_INDEX.md",
31036
  "exists": true,
31037
+ "bytes": 7068,
31038
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
31039
  },
31040
  "hf_artifacts": {
31041
  "path": "hf_artifacts:FIGURE_INDEX.md",
31042
  "exists": true,
31043
+ "bytes": 7068,
31044
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
31045
  },
31046
  "hf_model": {
31047
  "path": "hf_model:FIGURE_INDEX.md",
31048
  "exists": true,
31049
+ "bytes": 7068,
31050
+ "sha256": "cc64c0bb070e7eb0035ba590a6d83ed07fcc68fb56081668caebf40b49b9900f"
31051
  }
31052
  },
31053
  "failures": []
 
31554
  "local": {
31555
  "path": "repo:TASK_METHOD_20_RESULT_MATRIX.md",
31556
  "exists": true,
31557
+ "bytes": 14862,
31558
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
31559
  },
31560
  "mirrors": {
31561
  "hf_space": {
31562
  "path": "hf_space:TASK_METHOD_20_RESULT_MATRIX.md",
31563
  "exists": true,
31564
+ "bytes": 14862,
31565
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
31566
  },
31567
  "hf_artifacts": {
31568
  "path": "hf_artifacts:TASK_METHOD_20_RESULT_MATRIX.md",
31569
  "exists": true,
31570
+ "bytes": 14862,
31571
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
31572
  },
31573
  "hf_model": {
31574
  "path": "hf_model:TASK_METHOD_20_RESULT_MATRIX.md",
31575
  "exists": true,
31576
+ "bytes": 14862,
31577
+ "sha256": "a6f769d40ed22a2d63dad88029be393d55ed9af3082ab61fc1ad8b314aa871a6"
31578
  }
31579
  },
31580
  "failures": []
metrics/public_surface_qa.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Public Project Surface",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T20:01:39+00:00",
5
  "scope": "Repo README, GitHub Pages HTML, Hugging Face Space card, artifact dataset card, and model card.",
6
  "checks": [
7
  {
@@ -18,7 +18,7 @@
18
  "website_integrity": {
19
  "exists": true,
20
  "status": "pass",
21
- "generated_at_utc": "2026-06-21T20:01:18+00:00"
22
  },
23
  "rendered_site_check": {
24
  "exists": true,
@@ -28,27 +28,27 @@
28
  "task_surface_integrity": {
29
  "exists": true,
30
  "status": "pass",
31
- "generated_at_utc": "2026-06-21T15:21:55+00:00"
32
  },
33
  "source_alignment": {
34
  "exists": true,
35
  "status": "pass",
36
- "generated_at_utc": "2026-06-21T15:21:55+00:00"
37
  },
38
  "scale_up_status": {
39
  "exists": true,
40
  "status": "pass",
41
- "generated_at_utc": "2026-06-21T15:23:13+00:00"
42
  },
43
  "publication_package": {
44
  "exists": true,
45
  "status": "pass",
46
- "generated_at_utc": "2026-06-21T19:57:53+00:00"
47
  },
48
  "mirror_parity": {
49
  "exists": true,
50
  "status": "pass",
51
- "generated_at_utc": "2026-06-21T20:00:18+00:00"
52
  }
53
  },
54
  "failures": {}
@@ -97,8 +97,8 @@
97
  "marker_counts": {
98
  "Ropedia Xperience-10M Task Suite": 22,
99
  "Xperience-10M": 173,
100
- "20-task": 110,
101
- "Qwen3-Omni": 246,
102
  "128-episode pilot": 1
103
  }
104
  },
 
1
  {
2
  "title": "Ropedia Xperience-10M Public Project Surface",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:18+00:00",
5
  "scope": "Repo README, GitHub Pages HTML, Hugging Face Space card, artifact dataset card, and model card.",
6
  "checks": [
7
  {
 
18
  "website_integrity": {
19
  "exists": true,
20
  "status": "pass",
21
+ "generated_at_utc": "2026-06-21T20:35:18+00:00"
22
  },
23
  "rendered_site_check": {
24
  "exists": true,
 
28
  "task_surface_integrity": {
29
  "exists": true,
30
  "status": "pass",
31
+ "generated_at_utc": "2026-06-21T20:30:48+00:00"
32
  },
33
  "source_alignment": {
34
  "exists": true,
35
  "status": "pass",
36
+ "generated_at_utc": "2026-06-21T20:30:47+00:00"
37
  },
38
  "scale_up_status": {
39
  "exists": true,
40
  "status": "pass",
41
+ "generated_at_utc": "2026-06-21T20:30:47+00:00"
42
  },
43
  "publication_package": {
44
  "exists": true,
45
  "status": "pass",
46
+ "generated_at_utc": "2026-06-21T20:32:45+00:00"
47
  },
48
  "mirror_parity": {
49
  "exists": true,
50
  "status": "pass",
51
+ "generated_at_utc": "2026-06-21T20:33:57+00:00"
52
  }
53
  },
54
  "failures": {}
 
97
  "marker_counts": {
98
  "Ropedia Xperience-10M Task Suite": 22,
99
  "Xperience-10M": 173,
100
+ "20-task": 107,
101
+ "Qwen3-Omni": 239,
102
  "128-episode pilot": 1
103
  }
104
  },
metrics/publication_audit.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T20:02:09+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T20:35:41+00:00",
4
  "checks": [
5
  {
6
  "name": "required_publication_assets_present",
metrics/quality_gates.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:21:42+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
 
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:18+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
metrics/scope_claims_audit.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T15:23:13+00:00",
4
  "summary": {
5
  "qwen3_omni_verified_diagnostic_pilot": true,
6
  "dataset_manifest_num_episodes": 119,
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T20:35:22+00:00",
4
  "summary": {
5
  "qwen3_omni_verified_diagnostic_pilot": true,
6
  "dataset_manifest_num_episodes": 119,
metrics/single_episode_task_model_radar.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Single-Episode 20-Task Radar",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:20:34+00:00",
5
  "description": "Minimal and Neural MLP baselines on the one public sample episode, both scored on all 20 task contracts.",
6
  "task_count": 20,
7
  "method_count": 2,
@@ -11,10 +11,28 @@
11
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
12
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
13
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
 
14
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
15
- "foundation_model_overlay": "Qwen3-Omni and Cosmos3 points are plotted only on task-aligned axes. Scoreless records mean the public result does not evaluate that task contract.",
16
- "metadata_128_overlay": "128-episode aligned baselines have 20 records. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists; raw interaction text and paired camera-view embeddings remain explicit gaps.",
17
- "raw_128_overlay": "128-episode raw-feature baselines use staged sensor NPZ features. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
  },
19
  "source_unified_radar": "docs/data/unified_task_model_radar.json",
20
  "source_result_matrix": "docs/data/task_method_20_result_matrix.json",
@@ -28,7 +46,7 @@
28
  "scope": "1 public sample episode",
29
  "stroke_dasharray": null,
30
  "method_detail": "Single-episode simple heads over the public sample split.",
31
- "plotted_as": "filled polygon",
32
  "result_record_count": 20,
33
  "scored_task_count": 20,
34
  "covered_task_count": 20,
@@ -51,7 +69,7 @@
51
  "scope": "1 public sample episode",
52
  "stroke_dasharray": null,
53
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
54
- "plotted_as": "filled polygon",
55
  "result_record_count": 20,
56
  "scored_task_count": 20,
57
  "covered_task_count": 20,
 
1
  {
2
  "title": "Single-Episode 20-Task Radar",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:16+00:00",
5
  "description": "Minimal and Neural MLP baselines on the one public sample episode, both scored on all 20 task contracts.",
6
  "task_count": 20,
7
  "method_count": 2,
 
11
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
12
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
13
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
14
+ "radar_visual_radius": "SVG radar panels use sqrt(normalized_score) for radius so polygon area remains closer to the score and low-valued but real differences stay visible; the JSON and matrix retain exact linear normalized_score values",
15
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
16
+ "foundation_model_overlay": "Qwen3-Omni and Cosmos3 are grouped in the foundation-model radar panel. All current public model rows have 20 scored task records, with source paths retained for every metric.",
17
+ "metadata_128_overlay": "128-episode aligned baselines are grouped in the metadata/text radar panel. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists.",
18
+ "raw_128_overlay": "128-episode raw-feature baselines are grouped in the raw-feature radar panel. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
19
+ },
20
+ "chart_design": {
21
+ "mode": "grouped_small_multiples",
22
+ "method_count": 2,
23
+ "reason": "This split view has 2 methods and 40 method-task records; grouped radar panels keep related methods readable while retaining the unified source matrix.",
24
+ "groups": [
25
+ {
26
+ "id": "single_episode",
27
+ "title": "Single-episode sample",
28
+ "series_ids": [
29
+ "minimal",
30
+ "neural_mlp"
31
+ ]
32
+ }
33
+ ],
34
+ "visual_radius_transform": "sqrt(normalized_score)",
35
+ "exact_value_source": "docs/data/task_method_20_result_matrix.json"
36
  },
37
  "source_unified_radar": "docs/data/unified_task_model_radar.json",
38
  "source_result_matrix": "docs/data/task_method_20_result_matrix.json",
 
46
  "scope": "1 public sample episode",
47
  "stroke_dasharray": null,
48
  "method_detail": "Single-episode simple heads over the public sample split.",
49
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
50
  "result_record_count": 20,
51
  "scored_task_count": 20,
52
  "covered_task_count": 20,
 
69
  "scope": "1 public sample episode",
70
  "stroke_dasharray": null,
71
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
72
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
73
  "result_record_count": 20,
74
  "scored_task_count": 20,
75
  "covered_task_count": 20,
metrics/source_alignment_audit.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:21:55+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
 
1
  {
2
  "title": "Ropedia Xperience-10M Source Alignment Note",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:22+00:00",
5
  "alignment_json": "docs/data/xperience10m_dataset_card_alignment.json",
6
  "alignment_summary": {
7
  "full_dataset_repo": "ropedia-ai/xperience-10m",
metrics/task_method_20_result_matrix.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Task Method 20-Result Matrix",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:20:34+00:00",
5
  "task_count": 20,
6
  "method_count": 9,
7
  "method_task_record_count": 180,
@@ -16,7 +16,7 @@
16
  "scope": "1 public sample episode",
17
  "stroke_dasharray": null,
18
  "method_detail": "Single-episode simple heads over the public sample split.",
19
- "plotted_as": "filled polygon",
20
  "result_record_count": 20,
21
  "scored_task_count": 20,
22
  "covered_task_count": 20,
@@ -39,7 +39,7 @@
39
  "scope": "1 public sample episode",
40
  "stroke_dasharray": null,
41
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
42
- "plotted_as": "filled polygon",
43
  "result_record_count": 20,
44
  "scored_task_count": 20,
45
  "covered_task_count": 20,
@@ -62,7 +62,7 @@
62
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
63
  "stroke_dasharray": "9 6",
64
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
65
- "plotted_as": "colored point overlay",
66
  "result_record_count": 20,
67
  "scored_task_count": 20,
68
  "covered_task_count": 20,
@@ -86,7 +86,7 @@
86
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
87
  "stroke_dasharray": "3 6",
88
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
89
- "plotted_as": "colored point overlay",
90
  "result_record_count": 20,
91
  "scored_task_count": 20,
92
  "covered_task_count": 20,
@@ -110,7 +110,7 @@
110
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
111
  "stroke_dasharray": "8 4",
112
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
113
- "plotted_as": "colored point overlay",
114
  "result_record_count": 20,
115
  "scored_task_count": 20,
116
  "covered_task_count": 20,
@@ -134,7 +134,7 @@
134
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
135
  "stroke_dasharray": "2 5",
136
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
137
- "plotted_as": "colored point overlay",
138
  "result_record_count": 20,
139
  "scored_task_count": 20,
140
  "covered_task_count": 20,
@@ -158,7 +158,7 @@
158
  "scope": "128 selected episodes, held-out test",
159
  "stroke_dasharray": "7 7",
160
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
161
- "plotted_as": "colored point overlay",
162
  "result_record_count": 20,
163
  "scored_task_count": 20,
164
  "covered_task_count": 20,
@@ -181,7 +181,7 @@
181
  "scope": "128 selected episodes, held-out test",
182
  "stroke_dasharray": "4 7",
183
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
184
- "plotted_as": "colored point overlay",
185
  "result_record_count": 20,
186
  "scored_task_count": 20,
187
  "covered_task_count": 20,
@@ -204,7 +204,7 @@
204
  "scope": "128 selected episodes, held-out test",
205
  "stroke_dasharray": "2 7",
206
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
207
- "plotted_as": "colored point overlay",
208
  "result_record_count": 20,
209
  "scored_task_count": 20,
210
  "covered_task_count": 20,
 
1
  {
2
  "title": "Task Method 20-Result Matrix",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:16+00:00",
5
  "task_count": 20,
6
  "method_count": 9,
7
  "method_task_record_count": 180,
 
16
  "scope": "1 public sample episode",
17
  "stroke_dasharray": null,
18
  "method_detail": "Single-episode simple heads over the public sample split.",
19
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
20
  "result_record_count": 20,
21
  "scored_task_count": 20,
22
  "covered_task_count": 20,
 
39
  "scope": "1 public sample episode",
40
  "stroke_dasharray": null,
41
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
42
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
43
  "result_record_count": 20,
44
  "scored_task_count": 20,
45
  "covered_task_count": 20,
 
62
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
63
  "stroke_dasharray": "9 6",
64
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
65
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
66
  "result_record_count": 20,
67
  "scored_task_count": 20,
68
  "covered_task_count": 20,
 
86
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
87
  "stroke_dasharray": "3 6",
88
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
89
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
90
  "result_record_count": 20,
91
  "scored_task_count": 20,
92
  "covered_task_count": 20,
 
110
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
111
  "stroke_dasharray": "8 4",
112
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
113
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
114
  "result_record_count": 20,
115
  "scored_task_count": 20,
116
  "covered_task_count": 20,
 
134
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
135
  "stroke_dasharray": "2 5",
136
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
137
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
138
  "result_record_count": 20,
139
  "scored_task_count": 20,
140
  "covered_task_count": 20,
 
158
  "scope": "128 selected episodes, held-out test",
159
  "stroke_dasharray": "7 7",
160
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
161
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
162
  "result_record_count": 20,
163
  "scored_task_count": 20,
164
  "covered_task_count": 20,
 
181
  "scope": "128 selected episodes, held-out test",
182
  "stroke_dasharray": "4 7",
183
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
184
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
185
  "result_record_count": 20,
186
  "scored_task_count": 20,
187
  "covered_task_count": 20,
 
204
  "scope": "128 selected episodes, held-out test",
205
  "stroke_dasharray": "2 7",
206
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
207
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
208
  "result_record_count": 20,
209
  "scored_task_count": 20,
210
  "covered_task_count": 20,
metrics/task_surface_integrity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T15:21:55+00:00",
4
  "summary": {
5
  "original_walkthrough_task_count": 12,
6
  "expected_original_walkthrough_task_count": 12,
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T20:35:22+00:00",
4
  "summary": {
5
  "original_walkthrough_task_count": 12,
6
  "expected_original_walkthrough_task_count": 12,
metrics/unified_task_model_radar.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Unified 20-Task Model Radar",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T15:20:34+00:00",
5
  "task_count": 20,
6
  "method_count": 9,
7
  "method_task_record_count": 180,
@@ -10,10 +10,53 @@
10
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
11
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
12
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
 
13
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
14
- "foundation_model_overlay": "Qwen3-Omni and Cosmos3 points are plotted only on task-aligned axes. Scoreless records mean the public result does not evaluate that task contract.",
15
- "metadata_128_overlay": "128-episode aligned baselines have 20 records. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists; raw interaction text and paired camera-view embeddings remain explicit gaps.",
16
- "raw_128_overlay": "128-episode raw-feature baselines use staged sensor NPZ features. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
17
  },
18
  "series": [
19
  {
@@ -25,7 +68,7 @@
25
  "scope": "1 public sample episode",
26
  "stroke_dasharray": null,
27
  "method_detail": "Single-episode simple heads over the public sample split.",
28
- "plotted_as": "filled polygon",
29
  "result_record_count": 20,
30
  "scored_task_count": 20,
31
  "covered_task_count": 20,
@@ -48,7 +91,7 @@
48
  "scope": "1 public sample episode",
49
  "stroke_dasharray": null,
50
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
51
- "plotted_as": "filled polygon",
52
  "result_record_count": 20,
53
  "scored_task_count": 20,
54
  "covered_task_count": 20,
@@ -71,7 +114,7 @@
71
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
72
  "stroke_dasharray": "9 6",
73
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
74
- "plotted_as": "colored point overlay",
75
  "result_record_count": 20,
76
  "scored_task_count": 20,
77
  "covered_task_count": 20,
@@ -95,7 +138,7 @@
95
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
96
  "stroke_dasharray": "3 6",
97
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
98
- "plotted_as": "colored point overlay",
99
  "result_record_count": 20,
100
  "scored_task_count": 20,
101
  "covered_task_count": 20,
@@ -119,7 +162,7 @@
119
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
120
  "stroke_dasharray": "8 4",
121
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
122
- "plotted_as": "colored point overlay",
123
  "result_record_count": 20,
124
  "scored_task_count": 20,
125
  "covered_task_count": 20,
@@ -143,7 +186,7 @@
143
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
144
  "stroke_dasharray": "2 5",
145
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
146
- "plotted_as": "colored point overlay",
147
  "result_record_count": 20,
148
  "scored_task_count": 20,
149
  "covered_task_count": 20,
@@ -167,7 +210,7 @@
167
  "scope": "128 selected episodes, held-out test",
168
  "stroke_dasharray": "7 7",
169
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
170
- "plotted_as": "colored point overlay",
171
  "result_record_count": 20,
172
  "scored_task_count": 20,
173
  "covered_task_count": 20,
@@ -190,7 +233,7 @@
190
  "scope": "128 selected episodes, held-out test",
191
  "stroke_dasharray": "4 7",
192
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
193
- "plotted_as": "colored point overlay",
194
  "result_record_count": 20,
195
  "scored_task_count": 20,
196
  "covered_task_count": 20,
@@ -213,7 +256,7 @@
213
  "scope": "128 selected episodes, held-out test",
214
  "stroke_dasharray": "2 7",
215
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
216
- "plotted_as": "colored point overlay",
217
  "result_record_count": 20,
218
  "scored_task_count": 20,
219
  "covered_task_count": 20,
 
1
  {
2
  "title": "Unified 20-Task Model Radar",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T20:35:16+00:00",
5
  "task_count": 20,
6
  "method_count": 9,
7
  "method_task_record_count": 180,
 
10
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
11
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
12
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
13
+ "radar_visual_radius": "SVG radar panels use sqrt(normalized_score) for radius so polygon area remains closer to the score and low-valued but real differences stay visible; the JSON and matrix retain exact linear normalized_score values",
14
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
15
+ "foundation_model_overlay": "Qwen3-Omni and Cosmos3 are grouped in the foundation-model radar panel. All current public model rows have 20 scored task records, with source paths retained for every metric.",
16
+ "metadata_128_overlay": "128-episode aligned baselines are grouped in the metadata/text radar panel. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists.",
17
+ "raw_128_overlay": "128-episode raw-feature baselines are grouped in the raw-feature radar panel. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export."
18
+ },
19
+ "chart_design": {
20
+ "mode": "grouped_small_multiples",
21
+ "method_count": 9,
22
+ "reason": "The public release has nine methods and 180 scored records; small-multiple radar panels avoid a nine-polygon overlay while keeping every method visible.",
23
+ "groups": [
24
+ {
25
+ "id": "single_episode",
26
+ "title": "Single-episode sample",
27
+ "series_ids": [
28
+ "minimal",
29
+ "neural_mlp"
30
+ ]
31
+ },
32
+ {
33
+ "id": "metadata_128",
34
+ "title": "128-episode metadata/text",
35
+ "series_ids": [
36
+ "metadata128_simple",
37
+ "metadata128_neural_mlp"
38
+ ]
39
+ },
40
+ {
41
+ "id": "raw_128",
42
+ "title": "128-episode raw features",
43
+ "series_ids": [
44
+ "raw128_simple",
45
+ "raw128_neural_mlp"
46
+ ]
47
+ },
48
+ {
49
+ "id": "foundation_models",
50
+ "title": "Foundation-model probes",
51
+ "series_ids": [
52
+ "qwen3_omni_v6_lora",
53
+ "cosmos3_super_reasoner",
54
+ "cosmos3_nano_future_window"
55
+ ]
56
+ }
57
+ ],
58
+ "visual_radius_transform": "sqrt(normalized_score)",
59
+ "exact_value_source": "docs/data/task_method_20_result_matrix.json"
60
  },
61
  "series": [
62
  {
 
68
  "scope": "1 public sample episode",
69
  "stroke_dasharray": null,
70
  "method_detail": "Single-episode simple heads over the public sample split.",
71
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
72
  "result_record_count": 20,
73
  "scored_task_count": 20,
74
  "covered_task_count": 20,
 
91
  "scope": "1 public sample episode",
92
  "stroke_dasharray": null,
93
  "method_detail": "Single-episode compact PyTorch MLP heads on the same 20 task contracts.",
94
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
95
  "result_record_count": 20,
96
  "scored_task_count": 20,
97
  "covered_task_count": 20,
 
114
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
115
  "stroke_dasharray": "9 6",
116
  "method_detail": "128-episode aligned simple baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
117
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
118
  "result_record_count": 20,
119
  "scored_task_count": 20,
120
  "covered_task_count": 20,
 
138
  "scope": "128 selected episodes, JSONL metadata/text plus staged sensor-block targets where available",
139
  "stroke_dasharray": "3 6",
140
  "method_detail": "128-episode aligned MLP baselines: JSONL metadata/text tasks plus staged sensor-block tasks where the processed target exists.",
141
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
142
  "result_record_count": 20,
143
  "scored_task_count": 20,
144
  "covered_task_count": 20,
 
162
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
163
  "stroke_dasharray": "8 4",
164
  "method_detail": "128-episode 4430-dim sensor NPZ simple heads; tasks 15/19 use compact proxies.",
165
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
166
  "result_record_count": 20,
167
  "scored_task_count": 20,
168
  "covered_task_count": 20,
 
186
  "scope": "128 selected episodes, staged 4430-dim sensor NPZ features; 2 compact proxy axes",
187
  "stroke_dasharray": "2 5",
188
  "method_detail": "128-episode 4430-dim sensor NPZ MLP heads; tasks 15/19 use compact proxies.",
189
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
190
  "result_record_count": 20,
191
  "scored_task_count": 20,
192
  "covered_task_count": 20,
 
210
  "scope": "128 selected episodes, held-out test",
211
  "stroke_dasharray": "7 7",
212
  "method_detail": "Verified held-out Qwen3-Omni v6 LoRA metrics, plus task 16 and any completed private-GPU future/retrieval/sensor-target probes scored from task-specific JSON.",
213
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
214
  "result_record_count": 20,
215
  "scored_task_count": 20,
216
  "covered_task_count": 20,
 
233
  "scope": "128 selected episodes, held-out test",
234
  "stroke_dasharray": "4 7",
235
  "method_detail": "Verified Cosmos3-Super base-weight Reasoner JSON-task evaluation, plus task 5/8/9/10/11/12/13/14/16/17/18/19/20 probes where public metrics exist.",
236
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
237
  "result_record_count": 20,
238
  "scored_task_count": 20,
239
  "covered_task_count": 20,
 
256
  "scope": "128 selected episodes, held-out test",
257
  "stroke_dasharray": "2 7",
258
  "method_detail": "Verified Cosmos3-Nano future-window compatibility metrics, plus model-output probes for tasks 2/5/7/8/10/11/12/13/14/15/16/17/18/19 and a derived task-20 boundary timing probe scored from held-out future-window artifacts.",
259
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
260
  "result_record_count": 20,
261
  "scored_task_count": 20,
262
  "covered_task_count": 20,
metrics/website_integrity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T20:01:18+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
@@ -80,8 +80,8 @@
80
  "name": "project_overview_precedes_progress_ledger",
81
  "status": "pass",
82
  "reason": "The project overview should appear before the deeper progress ledger.",
83
- "overview_index": 136413,
84
- "evidence_index": 187740
85
  },
86
  {
87
  "name": "project_status_links_json",
@@ -159,9 +159,9 @@
159
  "name": "evaluation_protocol_between_overview_and_progress",
160
  "status": "pass",
161
  "reason": "The evaluation protocol should appear before the deeper evidence ledger.",
162
- "overview_index": 136413,
163
- "protocol_index": 183928,
164
- "evidence_index": 187740
165
  },
166
  {
167
  "name": "evaluation_protocol_links_json",
@@ -187,7 +187,7 @@
187
  "status": "pass",
188
  "reason": "The Suite anchor should show the task-suite map before the radar/results surface.",
189
  "first_marker_index": 468,
190
- "second_marker_index": 1838
191
  },
192
  {
193
  "name": "raw_sample_stream_ledger_contains_seven_modalities",
@@ -315,7 +315,7 @@
315
  },
316
  {
317
  "path": "data/artifact_index.json",
318
- "bytes": 124341,
319
  "top_level_type": "dict"
320
  },
321
  {
@@ -330,7 +330,7 @@
330
  },
331
  {
332
  "path": "data/episode128_task_model_radar.json",
333
- "bytes": 185212,
334
  "top_level_type": "dict"
335
  },
336
  {
@@ -345,7 +345,7 @@
345
  },
346
  {
347
  "path": "data/figure_index.json",
348
- "bytes": 19485,
349
  "top_level_type": "dict"
350
  },
351
  {
@@ -495,7 +495,7 @@
495
  },
496
  {
497
  "path": "data/single_episode_task_model_radar.json",
498
- "bytes": 51327,
499
  "top_level_type": "dict"
500
  },
501
  {
@@ -515,7 +515,7 @@
515
  },
516
  {
517
  "path": "data/task_method_20_result_matrix.json",
518
- "bytes": 128509,
519
  "top_level_type": "dict"
520
  },
521
  {
@@ -565,7 +565,7 @@
565
  },
566
  {
567
  "path": "data/unified_task_model_radar.json",
568
- "bytes": 229035,
569
  "top_level_type": "dict"
570
  },
571
  {
@@ -610,7 +610,7 @@
610
  {
611
  "path": "assets/charts/episode128_task_model_radar.svg",
612
  "exists": true,
613
- "bytes": 51915,
614
  "format": "SVG",
615
  "has_viewbox": true
616
  },
@@ -666,7 +666,7 @@
666
  {
667
  "path": "assets/charts/single_episode_task_model_radar.svg",
668
  "exists": true,
669
- "bytes": 35232,
670
  "format": "SVG",
671
  "has_viewbox": true
672
  },
@@ -680,7 +680,7 @@
680
  {
681
  "path": "assets/charts/unified_task_model_radar.svg",
682
  "exists": true,
683
- "bytes": 57938,
684
  "format": "SVG",
685
  "has_viewbox": true
686
  },
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T20:35:18+00:00",
4
  "docs_root": "docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
 
80
  "name": "project_overview_precedes_progress_ledger",
81
  "status": "pass",
82
  "reason": "The project overview should appear before the deeper progress ledger.",
83
+ "overview_index": 136346,
84
+ "evidence_index": 187681
85
  },
86
  {
87
  "name": "project_status_links_json",
 
159
  "name": "evaluation_protocol_between_overview_and_progress",
160
  "status": "pass",
161
  "reason": "The evaluation protocol should appear before the deeper evidence ledger.",
162
+ "overview_index": 136346,
163
+ "protocol_index": 183869,
164
+ "evidence_index": 187681
165
  },
166
  {
167
  "name": "evaluation_protocol_links_json",
 
187
  "status": "pass",
188
  "reason": "The Suite anchor should show the task-suite map before the radar/results surface.",
189
  "first_marker_index": 468,
190
+ "second_marker_index": 2000
191
  },
192
  {
193
  "name": "raw_sample_stream_ledger_contains_seven_modalities",
 
315
  },
316
  {
317
  "path": "data/artifact_index.json",
318
+ "bytes": 124471,
319
  "top_level_type": "dict"
320
  },
321
  {
 
330
  },
331
  {
332
  "path": "data/episode128_task_model_radar.json",
333
+ "bytes": 186828,
334
  "top_level_type": "dict"
335
  },
336
  {
 
345
  },
346
  {
347
  "path": "data/figure_index.json",
348
+ "bytes": 19526,
349
  "top_level_type": "dict"
350
  },
351
  {
 
495
  },
496
  {
497
  "path": "data/single_episode_task_model_radar.json",
498
+ "bytes": 52256,
499
  "top_level_type": "dict"
500
  },
501
  {
 
515
  },
516
  {
517
  "path": "data/task_method_20_result_matrix.json",
518
+ "bytes": 128991,
519
  "top_level_type": "dict"
520
  },
521
  {
 
565
  },
566
  {
567
  "path": "data/unified_task_model_radar.json",
568
+ "bytes": 230938,
569
  "top_level_type": "dict"
570
  },
571
  {
 
610
  {
611
  "path": "assets/charts/episode128_task_model_radar.svg",
612
  "exists": true,
613
+ "bytes": 79370,
614
  "format": "SVG",
615
  "has_viewbox": true
616
  },
 
666
  {
667
  "path": "assets/charts/single_episode_task_model_radar.svg",
668
  "exists": true,
669
+ "bytes": 36930,
670
  "format": "SVG",
671
  "has_viewbox": true
672
  },
 
680
  {
681
  "path": "assets/charts/unified_task_model_radar.svg",
682
  "exists": true,
683
+ "bytes": 98527,
684
  "format": "SVG",
685
  "has_viewbox": true
686
  },
scripts/build_artifact_index.py CHANGED
@@ -539,7 +539,7 @@ ARTIFACTS = [
539
  "path": "docs/data/unified_task_model_radar.json",
540
  "kind": "website_data",
541
  "surface": "website_hf",
542
- "shows": "Stores normalized 20-axis radar values, raw task metrics, Qwen3-Omni/Cosmos3 overlay mappings, method-card caveats, proxy flags, and source artifacts.",
543
  },
544
  {
545
  "id": "single_episode_task_model_radar_json",
@@ -619,7 +619,7 @@ ARTIFACTS = [
619
  "path": "docs/assets/charts/unified_task_model_radar.svg",
620
  "kind": "generated_figure",
621
  "surface": "website_hf",
622
- "shows": "Compares minimal and neural MLP baselines across all 20 tasks, with Qwen3-Omni and Cosmos3 task-aligned overlays.",
623
  },
624
  {
625
  "id": "single_episode_task_model_radar_chart",
@@ -627,7 +627,7 @@ ARTIFACTS = [
627
  "path": "docs/assets/charts/single_episode_task_model_radar.svg",
628
  "kind": "generated_figure",
629
  "surface": "website_hf",
630
- "shows": "Separates the one-episode Minimal and Neural MLP 20/20 scored baselines into a clean two-polygon radar.",
631
  },
632
  {
633
  "id": "episode128_task_model_radar_chart",
@@ -635,7 +635,7 @@ ARTIFACTS = [
635
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
636
  "kind": "generated_figure",
637
  "surface": "website_hf",
638
- "shows": "Separates the selected 128-episode methods: raw-feature simple/NN as complete 20/20 scored polygons plus metadata, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano task-aligned overlays.",
639
  },
640
  {
641
  "id": "unified_task_model_radar_builder",
@@ -643,7 +643,7 @@ ARTIFACTS = [
643
  "path": "scripts/build_unified_task_model_radar.py",
644
  "kind": "visualization_builder",
645
  "surface": "repo_hf",
646
- "shows": "Regenerates the direction-aware radar chart and machine-readable metric overlay JSON.",
647
  },
648
  {
649
  "id": "task_method_20_gap_audit_builder",
 
539
  "path": "docs/data/unified_task_model_radar.json",
540
  "kind": "website_data",
541
  "surface": "website_hf",
542
+ "shows": "Stores normalized 20-axis radar values, raw task metrics, grouped chart-design metadata, Qwen3-Omni/Cosmos3 source mappings, method-card caveats, proxy flags, and source artifacts.",
543
  },
544
  {
545
  "id": "single_episode_task_model_radar_json",
 
619
  "path": "docs/assets/charts/unified_task_model_radar.svg",
620
  "kind": "generated_figure",
621
  "surface": "website_hf",
622
+ "shows": "Groups all nine methods into small-multiple 20-task radar panels so single-episode, 128-episode metadata/text, 128-episode raw-feature, and foundation-model rows remain readable.",
623
  },
624
  {
625
  "id": "single_episode_task_model_radar_chart",
 
627
  "path": "docs/assets/charts/single_episode_task_model_radar.svg",
628
  "kind": "generated_figure",
629
  "surface": "website_hf",
630
+ "shows": "Shows the one-episode Minimal and Neural MLP 20/20 scored baselines in one enlarged radar panel with local legend and task key.",
631
  },
632
  {
633
  "id": "episode128_task_model_radar_chart",
 
635
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
636
  "kind": "generated_figure",
637
  "surface": "website_hf",
638
+ "shows": "Separates selected 128-episode methods into metadata/text, raw-feature, and foundation-model radar panels with all 140 result rows scored and proxy notes retained.",
639
  },
640
  {
641
  "id": "unified_task_model_radar_builder",
 
643
  "path": "scripts/build_unified_task_model_radar.py",
644
  "kind": "visualization_builder",
645
  "surface": "repo_hf",
646
+ "shows": "Regenerates grouped 20-task radar charts plus machine-readable metric, source, chart-design, and proxy metadata.",
647
  },
648
  {
649
  "id": "task_method_20_gap_audit_builder",
scripts/build_figure_index.py CHANGED
@@ -206,7 +206,7 @@ FIGURES = [
206
  "id": "unified_task_model_radar",
207
  "title": "Unified 20-task model radar",
208
  "path": "docs/assets/charts/unified_task_model_radar.svg",
209
- "role": "Twenty-axis direction-aware comparison of minimal and neural MLP baselines, with 128-episode metadata, Qwen3, and Cosmos task-aligned overlay points and branch notes.",
210
  "source_script": "scripts/build_unified_task_model_radar.py",
211
  "surface": "website unified task section, README, HF mirrors",
212
  },
@@ -222,7 +222,7 @@ FIGURES = [
222
  "id": "episode128_task_model_radar",
223
  "title": "128-episode 20-task model radar",
224
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
225
- "role": "Twenty-axis split radar for selected 128-episode methods: raw-feature simple/NN as complete scored polygons plus metadata, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano task-aligned overlays.",
226
  "source_script": "scripts/build_unified_task_model_radar.py",
227
  "surface": "website unified task section, README, HF mirrors",
228
  },
 
206
  "id": "unified_task_model_radar",
207
  "title": "Unified 20-task model radar",
208
  "path": "docs/assets/charts/unified_task_model_radar.svg",
209
+ "role": "Grouped small-multiple 20-task radar board for all nine methods, separating single-episode, 128-episode metadata/text, 128-episode raw-feature, and foundation-model rows while preserving task keys and proxy notes.",
210
  "source_script": "scripts/build_unified_task_model_radar.py",
211
  "surface": "website unified task section, README, HF mirrors",
212
  },
 
222
  "id": "episode128_task_model_radar",
223
  "title": "128-episode 20-task model radar",
224
  "path": "docs/assets/charts/episode128_task_model_radar.svg",
225
+ "role": "Grouped 20-task radar for selected 128-episode methods: metadata/text baselines, raw-feature simple/NN, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano with local legends and proxy notes.",
226
  "source_script": "scripts/build_unified_task_model_radar.py",
227
  "surface": "website unified task section, README, HF mirrors",
228
  },
scripts/build_unified_task_model_radar.py CHANGED
@@ -503,6 +503,33 @@ EPISODE128_SERIES = (
503
  "cosmos3_nano_future_window",
504
  )
505
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
506
  STATUS_LABELS = {
507
  "scored": "scored",
508
  "proxy_scored": "proxy scored",
@@ -760,7 +787,7 @@ def render_matrix_markdown(payload: dict[str, Any]) -> str:
760
  "",
761
  "## Compact Score Matrix",
762
  "",
763
- "Cells show `raw metric value`, then `direct/proxy; normalized radar value; metric key`. The raw metric is the value to cite; the normalized value is the 0-1 plotting value used by the radar.",
764
  "",
765
  "| # | Task | " + " | ".join(spec["short_label"] for spec in SERIES.values()) + " |",
766
  "| ---: | --- | " + " | ".join("---" for _ in SERIES) + " |",
@@ -814,6 +841,21 @@ def filtered_radar_payload(
814
  for row in payload["task_method_result_matrix"]
815
  if row.get("series_id") in selected
816
  ]
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
817
  return {
818
  "title": title,
819
  "status": payload["status"],
@@ -824,6 +866,7 @@ def filtered_radar_payload(
824
  "method_task_record_count": sum(record.get("result_record_count", 0) for record in series),
825
  "scored_method_task_count": sum(record.get("scored_task_count", 0) for record in series),
826
  "normalization_policy": payload["normalization_policy"],
 
827
  "source_unified_radar": "docs/data/unified_task_model_radar.json",
828
  "source_result_matrix": "docs/data/task_method_20_result_matrix.json",
829
  "series": series,
@@ -1016,7 +1059,7 @@ def build_payload() -> dict[str, Any]:
1016
  "id": series_id,
1017
  **spec,
1018
  "method_detail": METHOD_DETAILS.get(series_id, spec["scope"]),
1019
- "plotted_as": "filled polygon" if spec["kind"].startswith("full_20_task_baseline") else "colored point overlay",
1020
  "result_record_count": len(tasks),
1021
  "scored_task_count": covered,
1022
  "covered_task_count": covered,
@@ -1049,10 +1092,26 @@ def build_payload() -> dict[str, Any]:
1049
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
1050
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
1051
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
 
1052
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
1053
- "foundation_model_overlay": "Qwen3-Omni and Cosmos3 points are plotted only on task-aligned axes. Scoreless records mean the public result does not evaluate that task contract.",
1054
- "metadata_128_overlay": "128-episode aligned baselines have 20 records. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists; raw interaction text and paired camera-view embeddings remain explicit gaps.",
1055
- "raw_128_overlay": "128-episode raw-feature baselines use staged sensor NPZ features. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export.",
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1056
  },
1057
  "series": series_records,
1058
  "tasks": tasks,
@@ -1128,6 +1187,266 @@ def build_payload() -> dict[str, Any]:
1128
  return payload
1129
 
1130
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1131
  def render_svg(
1132
  payload: dict[str, Any],
1133
  *,
@@ -1139,29 +1458,27 @@ def render_svg(
1139
  chip_specs: list[tuple[str, str]] | None = None,
1140
  reading_rules: tuple[str, str, str] | None = None,
1141
  ) -> str:
1142
- width, height = 2400, 1840
1143
- cx, cy, radius = 650, 860, 355
1144
  tasks = payload["tasks"]
1145
- n = len(tasks)
1146
- angles = [-math.pi / 2 + 2 * math.pi * i / n for i in range(n)]
1147
  if series_ids is None:
1148
  series_ids = tuple(record["id"] for record in payload["series"])
1149
- polygon_series_set = set(polygon_series_ids)
1150
- series_records = [record for record in payload["series"] if record["id"] in set(series_ids)]
1151
  parts = [
1152
  f'<svg xmlns="http://www.w3.org/2000/svg" width="{width}" height="{height}" viewBox="0 0 {width} {height}">',
1153
  "<defs>",
1154
- '<filter id="softGlow"><feGaussianBlur stdDeviation="5" result="blur"/><feMerge><feMergeNode in="blur"/><feMergeNode in="SourceGraphic"/></feMerge></filter>',
1155
- '<pattern id="dots" width="22" height="22" patternUnits="userSpaceOnUse"><circle cx="2" cy="2" r="1.15" fill="#ccffa0" opacity="0.16"/></pattern>',
1156
  "</defs>",
1157
  '<rect width="100%" height="100%" fill="#020502"/>',
1158
- '<rect width="100%" height="100%" fill="url(#dots)" opacity="0.45"/>',
1159
- '<rect x="28" y="28" width="2344" height="1784" rx="18" fill="#061006" fill-opacity="0.88" stroke="#ccffa0" stroke-opacity="0.22"/>',
1160
- svg_text(70, 86, title or payload.get("title", "20-Task Model Radar"), size=36, weight=800),
1161
  svg_text(
1162
  70,
1163
  122,
1164
- subtitle or "Task names, methods, coverage, and metric normalization in one comparison view.",
1165
  size=18,
1166
  fill="#dce8d7",
1167
  weight=650,
@@ -1170,7 +1487,7 @@ def render_svg(
1170
  70,
1171
  150,
1172
  context_line
1173
- or "Filled areas show complete scored baselines; colored points show partial branches on task-aligned axes.",
1174
  size=15,
1175
  fill="#a5afa2",
1176
  weight=560,
@@ -1182,111 +1499,77 @@ def render_svg(
1182
  ("20 task axes", "#ccffa0"),
1183
  (f"{payload['method_task_record_count']} method-task records", "#67e8d1"),
1184
  (f"{payload['scored_method_task_count']} scored records", "#22d3ee"),
1185
- ("40/40 raw128 pass", "#f59e0b"),
1186
- ("2 compact proxy records", "#f472b6"),
1187
  ]
1188
  chip_x = 70
1189
  for label, color in chip_specs:
1190
- chip_w = 168 if len(label) < 15 else 250
1191
- parts.append(f'<rect x="{chip_x}" y="174" width="{chip_w}" height="34" rx="17" fill="{color}" fill-opacity="0.10" stroke="{color}" stroke-opacity="0.38"/>')
1192
- parts.append(svg_text(chip_x + 16, 197, label, size=13, fill=color, weight=760))
1193
  chip_x += chip_w + 12
1194
 
1195
- parts.append('<rect x="54" y="235" width="1190" height="1190" rx="14" fill="#020502" fill-opacity="0.42" stroke="#ccffa0" stroke-opacity="0.14"/>')
1196
- parts.append(svg_text(84, 276, "Normalized task scores", size=23, weight=800))
1197
- parts.append(svg_text(84, 302, "Each axis is one task. Longer radius means better after metric-direction normalization.", size=13, fill="#a5afa2", weight=560))
1198
-
1199
- for level in range(1, 6):
1200
- r = radius * level / 5
1201
- ring = [point(cx, cy, r, angle) for angle in angles]
1202
- parts.append(polyline(ring, fill="none", stroke="#ccffa0", opacity=0, stroke_width=1.1))
1203
- parts[-1] = parts[-1].replace('fill="none" fill-opacity="0.000"', 'fill="none"').replace('stroke-opacity="0.92"', 'stroke-opacity="0.15"')
1204
- parts.append(svg_text(cx + 8, cy - r + 4, f"{level / 5:.1f}", size=11, fill="#a5afa2", weight=600, opacity=0.75))
1205
-
1206
- for task, angle in zip(tasks, angles):
1207
- x, y = point(cx, cy, radius, angle)
1208
- parts.append(f'<line x1="{cx:.1f}" y1="{cy:.1f}" x2="{x:.1f}" y2="{y:.1f}" stroke="#ccffa0" stroke-opacity="0.12" stroke-width="1"/>')
1209
- lx, ly = point(cx, cy, radius + 82, angle)
1210
- parts.append(f'<circle cx="{lx:.1f}" cy="{ly:.1f}" r="15.5" fill="#ccffa0" fill-opacity="0.12" stroke="#ccffa0" stroke-opacity="0.34"/>')
1211
- parts.append(svg_text(lx, ly + 4, f"{task['task_number']:02d}", size=11, fill="#ccffa0", anchor="middle", weight=800, opacity=0.98))
1212
-
1213
- for series_id in series_ids:
1214
- if series_id not in polygon_series_set:
1215
- continue
1216
- spec = SERIES[series_id]
1217
- points = []
1218
- for task, angle in zip(tasks, angles):
1219
- score = task["values"].get(series_id, {}).get("normalized_score")
1220
- points.append(point(cx, cy, radius * float(score or 0.0), angle))
1221
- parts.append(polyline(points, fill=spec["color"], stroke=spec["color"], opacity=0.18 if series_id in {"minimal", "raw128_simple"} else 0.16, stroke_width=4.2, dash=spec.get("stroke_dasharray")))
1222
- for x, y in points:
1223
- parts.append(f'<circle cx="{x:.1f}" cy="{y:.1f}" r="4.0" fill="{spec["color"]}" stroke="#020502" stroke-width="1.1"/>')
1224
-
1225
- for series_id in series_ids:
1226
- if series_id in polygon_series_set:
1227
- continue
1228
- spec = SERIES[series_id]
1229
- for task, angle in zip(tasks, angles):
1230
- score = task["values"].get(series_id, {}).get("normalized_score")
1231
- if score is None:
1232
- continue
1233
- x, y = point(cx, cy, radius * float(score), angle)
1234
- radius_px = 6.5 if series_id.startswith(("metadata128", "raw128")) else 8.0
1235
- parts.append(
1236
- f'<circle cx="{x:.1f}" cy="{y:.1f}" r="{radius_px:.1f}" fill="{spec["color"]}" fill-opacity="0.92" '
1237
- f'stroke="#020502" stroke-width="2.0"/>'
1238
  )
1239
-
1240
- legend_x, legend_y = 1315, 178
1241
- parts.append(f'<rect x="{legend_x - 30}" y="{legend_y - 38}" width="1000" height="560" rx="14" fill="#020502" fill-opacity="0.58" stroke="#ccffa0" stroke-opacity="0.20"/>')
1242
- parts.append(svg_text(legend_x, legend_y, "Methods compared", size=25, weight=800))
1243
- parts.append(svg_text(legend_x, legend_y + 30, "Each method has 20 records; scores, proxy flags, and sources stay in the JSON matrix.", size=13, fill="#a5afa2", weight=560))
1244
-
1245
- cursor = legend_y + 74
1246
- for record in series_records:
1247
- color = record["color"]
1248
- parts.append(f'<line x1="{legend_x}" y1="{cursor - 7}" x2="{legend_x + 50}" y2="{cursor - 7}" stroke="{color}" stroke-width="7" stroke-linecap="round"/>')
1249
- if record["id"] not in polygon_series_set:
1250
- parts.append(f'<circle cx="{legend_x + 25}" cy="{cursor - 7}" r="7" fill="{color}" stroke="#020502" stroke-width="2"/>')
1251
- parts.append(svg_text(legend_x + 66, cursor - 12, record["label"], size=15, weight=800))
1252
- parts.append(svg_text(legend_x + 392, cursor - 12, f"20 records / {record['scored_task_count']} scored", size=13, fill=color, weight=800))
1253
- detail_lines = split_text(METHOD_DETAILS.get(record["id"], record["scope"]), 78)[:2]
1254
- parts.extend(svg_text_lines(legend_x + 66, cursor + 8, detail_lines, size=11, fill="#a5afa2", weight=560, line_height=15))
1255
- cursor += 50
1256
-
1257
- key_x, key_y = 1315, 780
1258
- parts.append(f'<rect x="{key_x - 30}" y="{key_y - 44}" width="1000" height="680" rx="14" fill="#020502" fill-opacity="0.58" stroke="#ccffa0" stroke-opacity="0.20"/>')
1259
- parts.append(svg_text(key_x, key_y, "Task axis key", size=25, weight=800))
1260
- parts.append(svg_text(key_x, key_y + 30, "Full task names are listed here so the polygon remains readable at homepage scale.", size=13, fill="#a5afa2", weight=560))
1261
- for idx, task in enumerate(tasks):
1262
- col = 0 if idx < 10 else 1
1263
- row = idx if idx < 10 else idx - 10
1264
- x0 = key_x + col * 500
1265
- y0 = key_y + 74 + row * 54
1266
- proxy = task["task_id"] in PROXY_TASK_IDS
1267
- badge_fill = "#f472b6" if proxy else "#ccffa0"
1268
- parts.append(f'<rect x="{x0}" y="{y0 - 16}" width="36" height="26" rx="6" fill="{badge_fill}" fill-opacity="0.14" stroke="{badge_fill}" stroke-opacity="0.40"/>')
1269
- parts.append(svg_text(x0 + 18, y0 + 2, f"{task['task_number']:02d}", size=11, fill=badge_fill, anchor="middle", weight=800))
1270
- name_lines = split_text(str(task["label"]), 42)[:2]
1271
- parts.extend(svg_text_lines(x0 + 48, y0 - 3, name_lines, size=12, fill="#f4f8ef", weight=760, line_height=14))
1272
- metric_label = f"{task.get('metric_name') or task.get('metric_key')} / {'lower better' if task.get('metric_direction') == 'lower' else 'higher better'}"
1273
- if proxy:
1274
- metric_label += " / raw128 proxy"
1275
- parts.append(svg_text(x0 + 48, y0 + 29, metric_label, size=10, fill="#a5afa2", weight=560))
1276
-
1277
- table_y = 1680
1278
- if reading_rules is None:
1279
- reading_rules = (
1280
- "Every method has 20 task records and the current public matrix scores all 180 rows.",
1281
- "Raw128 completion: 18 direct task targets plus 2 compact proxies. Task 15 predicts the dominant caption/object/interaction hash bin; task 19 retrieves depth/audio sync from camera pose.",
1282
- "Proxy flags, raw metric values, and source artifacts stay attached in docs/data/task_method_20_result_matrix.json.",
1283
  )
1284
- parts.append(f'<rect x="70" y="{table_y - 38}" width="2260" height="120" rx="12" fill="#020502" fill-opacity="0.58" stroke="#ccffa0" stroke-opacity="0.16"/>')
1285
- parts.append(svg_text(100, table_y - 10, "Reading rules", size=16, fill="#ccffa0", weight=800))
1286
- parts.append(svg_text(220, table_y - 10, reading_rules[0], size=14, fill="#dce8d7", weight=650))
1287
- parts.append(svg_text(220, table_y + 18, reading_rules[1], size=13, fill="#a5afa2", weight=560))
1288
- parts.append(svg_text(220, table_y + 44, reading_rules[2], size=13, fill="#a5afa2", weight=560))
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1289
 
 
 
1290
  parts.append("</svg>")
1291
  return "\n".join(parts) + "\n"
1292
 
@@ -1357,8 +1640,8 @@ def main() -> int:
1357
  series_ids=EPISODE128_SERIES,
1358
  polygon_series_ids=("raw128_simple", "raw128_neural_mlp"),
1359
  title="128-Episode 20-Task Radar",
1360
- subtitle="Selected 96/16/16 episode split; raw-feature heads score all 20 axes.",
1361
- context_line="Raw128 baselines are filled polygons; metadata, Qwen3, and Cosmos branches now all carry scored task rows.",
1362
  chip_specs=[
1363
  ("20 task axes", "#ccffa0"),
1364
  ("140 method-task records", "#67e8d1"),
@@ -1368,7 +1651,7 @@ def main() -> int:
1368
  ],
1369
  reading_rules=(
1370
  "Every 128-episode method has 20 result records and all 140 rows are scored in this split radar.",
1371
- "Raw128 Simple and Raw128 NN are complete 20/20 scored multi-episode baselines; tasks 15/19 are documented compact proxies.",
1372
  "Qwen3-Omni and Cosmos3 rows use verified held-out outputs or derived probe artifacts; source paths stay in the matrix JSON.",
1373
  ),
1374
  ),
 
503
  "cosmos3_nano_future_window",
504
  )
505
 
506
+ RADAR_GROUP_SPECS = (
507
+ {
508
+ "id": "single_episode",
509
+ "title": "Single-episode sample",
510
+ "subtitle": "Public-sample simple and neural task heads.",
511
+ "series_ids": ("minimal", "neural_mlp"),
512
+ },
513
+ {
514
+ "id": "metadata_128",
515
+ "title": "128-episode metadata/text",
516
+ "subtitle": "Aligned JSONL metadata/text plus staged target blocks.",
517
+ "series_ids": ("metadata128_simple", "metadata128_neural_mlp"),
518
+ },
519
+ {
520
+ "id": "raw_128",
521
+ "title": "128-episode raw features",
522
+ "subtitle": "4430-dim sensor-block heads; proxy axes are flagged.",
523
+ "series_ids": ("raw128_simple", "raw128_neural_mlp"),
524
+ },
525
+ {
526
+ "id": "foundation_models",
527
+ "title": "Foundation-model probes",
528
+ "subtitle": "Verified Qwen3 and Cosmos task-specific outputs.",
529
+ "series_ids": ("qwen3_omni_v6_lora", "cosmos3_super_reasoner", "cosmos3_nano_future_window"),
530
+ },
531
+ )
532
+
533
  STATUS_LABELS = {
534
  "scored": "scored",
535
  "proxy_scored": "proxy scored",
 
787
  "",
788
  "## Compact Score Matrix",
789
  "",
790
+ "Cells show `raw metric value`, then `direct/proxy; normalized radar value; metric key`. The raw metric is the value to cite; the normalized value is the exact linear 0-1 score retained in JSON. The SVG radar uses sqrt(normalized score) only for visual radius, so low but real differences remain visible without changing the table values.",
791
  "",
792
  "| # | Task | " + " | ".join(spec["short_label"] for spec in SERIES.values()) + " |",
793
  "| ---: | --- | " + " | ".join("---" for _ in SERIES) + " |",
 
841
  for row in payload["task_method_result_matrix"]
842
  if row.get("series_id") in selected
843
  ]
844
+ selected_groups = radar_groups_for_series(series_ids)
845
+ chart_design = json.loads(json.dumps(payload.get("chart_design", {})))
846
+ chart_design["method_count"] = len(series)
847
+ chart_design["reason"] = (
848
+ f"This split view has {len(series)} methods and {sum(record.get('result_record_count', 0) for record in series)} "
849
+ "method-task records; grouped radar panels keep related methods readable while retaining the unified source matrix."
850
+ )
851
+ chart_design["groups"] = [
852
+ {
853
+ "id": group["id"],
854
+ "title": group["title"],
855
+ "series_ids": list(group["series_ids"]),
856
+ }
857
+ for group in selected_groups
858
+ ]
859
  return {
860
  "title": title,
861
  "status": payload["status"],
 
866
  "method_task_record_count": sum(record.get("result_record_count", 0) for record in series),
867
  "scored_method_task_count": sum(record.get("scored_task_count", 0) for record in series),
868
  "normalization_policy": payload["normalization_policy"],
869
+ "chart_design": chart_design,
870
  "source_unified_radar": "docs/data/unified_task_model_radar.json",
871
  "source_result_matrix": "docs/data/task_method_20_result_matrix.json",
872
  "series": series,
 
1059
  "id": series_id,
1060
  **spec,
1061
  "method_detail": METHOD_DETAILS.get(series_id, spec["scope"]),
1062
+ "plotted_as": "grouped small-multiple radar panel with direct legend and coverage badges",
1063
  "result_record_count": len(tasks),
1064
  "scored_task_count": covered,
1065
  "covered_task_count": covered,
 
1092
  "higher_is_better": "bounded metrics are plotted directly on 0-1 axes after clipping to [0, 1]",
1093
  "lower_is_better": "lower-error metrics are converted to best_observed_value / raw_value within the same task",
1094
  "raw_values": "raw metric values, metric keys, and sources are retained in this JSON; the SVG is an overview, not a replacement for the metric table",
1095
+ "radar_visual_radius": "SVG radar panels use sqrt(normalized_score) for radius so polygon area remains closer to the score and low-valued but real differences stay visible; the JSON and matrix retain exact linear normalized_score values",
1096
  "result_record_policy": "every method has 20 task records; the current public release has 180/180 scored rows with proxy flags and reasons retained where compact substitute targets are used",
1097
+ "foundation_model_overlay": "Qwen3-Omni and Cosmos3 are grouped in the foundation-model radar panel. All current public model rows have 20 scored task records, with source paths retained for every metric.",
1098
+ "metadata_128_overlay": "128-episode aligned baselines are grouped in the metadata/text radar panel. Numeric scores come from JSONL metadata/text tasks plus staged sensor-block targets when the processed target exists.",
1099
+ "raw_128_overlay": "128-episode raw-feature baselines are grouped in the raw-feature radar panel. Eighteen axes use direct task targets; interaction text and camera-view sync are completed with documented compact proxies because raw interaction strings and paired video-view embeddings are absent from the 128 export.",
1100
+ },
1101
+ "chart_design": {
1102
+ "mode": "grouped_small_multiples",
1103
+ "method_count": len(SERIES),
1104
+ "reason": "The public release has nine methods and 180 scored records; small-multiple radar panels avoid a nine-polygon overlay while keeping every method visible.",
1105
+ "groups": [
1106
+ {
1107
+ "id": group["id"],
1108
+ "title": group["title"],
1109
+ "series_ids": list(group["series_ids"]),
1110
+ }
1111
+ for group in RADAR_GROUP_SPECS
1112
+ ],
1113
+ "visual_radius_transform": "sqrt(normalized_score)",
1114
+ "exact_value_source": "docs/data/task_method_20_result_matrix.json",
1115
  },
1116
  "series": series_records,
1117
  "tasks": tasks,
 
1187
  return payload
1188
 
1189
 
1190
+ def svg_shape(
1191
+ tag: str,
1192
+ points: list[tuple[float, float]],
1193
+ *,
1194
+ fill: str,
1195
+ fill_opacity: float,
1196
+ stroke: str,
1197
+ stroke_opacity: float = 0.92,
1198
+ stroke_width: float = 2.0,
1199
+ dash: str | None = None,
1200
+ ) -> str:
1201
+ coords = " ".join(f"{x:.1f},{y:.1f}" for x, y in points)
1202
+ dash_attr = f' stroke-dasharray="{dash}"' if dash else ""
1203
+ return (
1204
+ f'<{tag} points="{coords}" fill="{fill}" fill-opacity="{fill_opacity:.3f}" '
1205
+ f'stroke="{stroke}" stroke-opacity="{stroke_opacity:.3f}" stroke-width="{stroke_width:.1f}" '
1206
+ f'stroke-linejoin="round" stroke-linecap="round"{dash_attr}/>'
1207
+ )
1208
+
1209
+
1210
+ def radar_radius(score: float | None, radius: float) -> float | None:
1211
+ if score is None:
1212
+ return None
1213
+ return radius * math.sqrt(clamp01(float(score)))
1214
+
1215
+
1216
+ def radar_groups_for_series(series_ids: tuple[str, ...]) -> list[dict[str, Any]]:
1217
+ selected = set(series_ids)
1218
+ groups: list[dict[str, Any]] = []
1219
+ assigned: set[str] = set()
1220
+ for group in RADAR_GROUP_SPECS:
1221
+ present = tuple(series_id for series_id in group["series_ids"] if series_id in selected)
1222
+ if not present:
1223
+ continue
1224
+ groups.append({**group, "series_ids": present})
1225
+ assigned.update(present)
1226
+ remaining = tuple(series_id for series_id in series_ids if series_id not in assigned)
1227
+ if remaining:
1228
+ groups.append(
1229
+ {
1230
+ "id": "other_methods",
1231
+ "title": "Other methods",
1232
+ "subtitle": "Additional method rows retained from the matrix.",
1233
+ "series_ids": remaining,
1234
+ }
1235
+ )
1236
+ return groups
1237
+
1238
+
1239
+ def draw_radar_grid(
1240
+ parts: list[str],
1241
+ *,
1242
+ cx: float,
1243
+ cy: float,
1244
+ radius: float,
1245
+ tasks: list[dict[str, Any]],
1246
+ angles: list[float],
1247
+ label_size: int,
1248
+ ) -> None:
1249
+ for value in (0.05, 0.25, 0.50, 0.75, 1.0):
1250
+ ring_radius = radius * math.sqrt(value)
1251
+ ring = [point(cx, cy, ring_radius, angle) for angle in angles]
1252
+ parts.append(
1253
+ svg_shape(
1254
+ "polygon",
1255
+ ring,
1256
+ fill="none",
1257
+ fill_opacity=0,
1258
+ stroke="#ccffa0",
1259
+ stroke_opacity=0.16,
1260
+ stroke_width=1.0,
1261
+ )
1262
+ )
1263
+ parts.append(svg_text(cx + 8, cy - ring_radius + 4, f"{value:.2g}", size=max(9, label_size - 2), fill="#a5afa2", weight=620, opacity=0.72))
1264
+ for task, angle in zip(tasks, angles):
1265
+ x, y = point(cx, cy, radius, angle)
1266
+ parts.append(f'<line x1="{cx:.1f}" y1="{cy:.1f}" x2="{x:.1f}" y2="{y:.1f}" stroke="#ccffa0" stroke-opacity="0.11" stroke-width="1"/>')
1267
+ lx, ly = point(cx, cy, radius + 28, angle)
1268
+ proxy = task["task_id"] in PROXY_TASK_IDS
1269
+ color = "#f472b6" if proxy else "#ccffa0"
1270
+ parts.append(f'<circle cx="{lx:.1f}" cy="{ly:.1f}" r="{label_size + 2:.1f}" fill="{color}" fill-opacity="0.12" stroke="{color}" stroke-opacity="0.34"/>')
1271
+ parts.append(svg_text(lx, ly + label_size * 0.33, f"{task['task_number']:02d}", size=label_size, fill=color, anchor="middle", weight=850, opacity=0.98))
1272
+
1273
+
1274
+ def draw_radar_series(
1275
+ parts: list[str],
1276
+ *,
1277
+ cx: float,
1278
+ cy: float,
1279
+ radius: float,
1280
+ tasks: list[dict[str, Any]],
1281
+ angles: list[float],
1282
+ series_id: str,
1283
+ stroke_width: float,
1284
+ fill_opacity: float,
1285
+ ) -> None:
1286
+ spec = SERIES[series_id]
1287
+ valid_points: list[tuple[float, float]] = []
1288
+ scored_count = 0
1289
+ for task, angle in zip(tasks, angles):
1290
+ value = task["values"].get(series_id, {})
1291
+ score = value.get("normalized_score")
1292
+ plotted_radius = radar_radius(score, radius)
1293
+ if plotted_radius is None:
1294
+ continue
1295
+ scored_count += 1
1296
+ valid_points.append(point(cx, cy, plotted_radius, angle))
1297
+ if len(valid_points) >= 3 and scored_count == len(tasks):
1298
+ parts.append(
1299
+ svg_shape(
1300
+ "polygon",
1301
+ valid_points,
1302
+ fill=spec["color"],
1303
+ fill_opacity=fill_opacity,
1304
+ stroke=spec["color"],
1305
+ stroke_width=stroke_width,
1306
+ dash=spec.get("stroke_dasharray"),
1307
+ )
1308
+ )
1309
+ elif len(valid_points) >= 2:
1310
+ parts.append(
1311
+ svg_shape(
1312
+ "polyline",
1313
+ valid_points,
1314
+ fill="none",
1315
+ fill_opacity=0,
1316
+ stroke=spec["color"],
1317
+ stroke_width=stroke_width,
1318
+ dash=spec.get("stroke_dasharray"),
1319
+ )
1320
+ )
1321
+ for task, angle in zip(tasks, angles):
1322
+ value = task["values"].get(series_id, {})
1323
+ plotted_radius = radar_radius(value.get("normalized_score"), radius)
1324
+ if plotted_radius is None:
1325
+ continue
1326
+ px, py = point(cx, cy, plotted_radius, angle)
1327
+ proxy = value.get("status") == "proxy_scored"
1328
+ parts.append(
1329
+ f'<circle cx="{px:.1f}" cy="{py:.1f}" r="{5.5 if proxy else 4.4:.1f}" '
1330
+ f'fill="{spec["color"]}" fill-opacity="0.95" stroke="{"#f4f8ef" if proxy else "#020502"}" '
1331
+ f'stroke-width="{2.1 if proxy else 1.3:.1f}"/>'
1332
+ )
1333
+
1334
+
1335
+ def draw_radar_panel(
1336
+ parts: list[str],
1337
+ *,
1338
+ x: float,
1339
+ y: float,
1340
+ width: float,
1341
+ height: float,
1342
+ group: dict[str, Any],
1343
+ payload: dict[str, Any],
1344
+ series_record_by_id: dict[str, dict[str, Any]],
1345
+ large: bool = False,
1346
+ ) -> None:
1347
+ tasks = payload["tasks"]
1348
+ angles = [-math.pi / 2 + 2 * math.pi * i / len(tasks) for i in range(len(tasks))]
1349
+ panel_bg = "#071007"
1350
+ parts.append(f'<rect x="{x:.1f}" y="{y:.1f}" width="{width:.1f}" height="{height:.1f}" rx="18" fill="{panel_bg}" fill-opacity="0.90" stroke="#ccffa0" stroke-opacity="0.22"/>')
1351
+ parts.append(svg_text(x + 28, y + 44, str(group["title"]), size=26 if large else 20, weight=850))
1352
+ parts.append(svg_text(x + 28, y + 74, str(group["subtitle"]), size=14 if large else 12, fill="#a5afa2", weight=600))
1353
+
1354
+ if large:
1355
+ cx = x + width * 0.39
1356
+ cy = y + height * 0.56
1357
+ radius = min(width * 0.20, height * 0.34)
1358
+ legend_x = x + width * 0.68
1359
+ legend_y = y + 160
1360
+ label_size = 12
1361
+ else:
1362
+ cx = x + width * 0.38
1363
+ cy = y + height * 0.57
1364
+ radius = min(width * 0.18, height * 0.30)
1365
+ legend_x = x + width * 0.67
1366
+ legend_y = y + 122
1367
+ label_size = 8
1368
+
1369
+ draw_radar_grid(parts, cx=cx, cy=cy, radius=radius, tasks=tasks, angles=angles, label_size=label_size)
1370
+
1371
+ series_ids = tuple(group["series_ids"])
1372
+ fill_opacity = 0.065 if len(series_ids) <= 2 else 0.040
1373
+ for idx, series_id in enumerate(series_ids):
1374
+ draw_radar_series(
1375
+ parts,
1376
+ cx=cx,
1377
+ cy=cy,
1378
+ radius=radius,
1379
+ tasks=tasks,
1380
+ angles=angles,
1381
+ series_id=series_id,
1382
+ stroke_width=4.3 if large else 3.2,
1383
+ fill_opacity=max(0.026, fill_opacity - idx * 0.010),
1384
+ )
1385
+
1386
+ parts.append(svg_text(legend_x, legend_y - 34, "Methods", size=17 if large else 14, fill="#ccffa0", weight=850))
1387
+ for idx, series_id in enumerate(series_ids):
1388
+ record = series_record_by_id[series_id]
1389
+ color = record["color"]
1390
+ row_y = legend_y + idx * (92 if large else 74)
1391
+ parts.append(f'<line x1="{legend_x:.1f}" y1="{row_y:.1f}" x2="{legend_x + 58:.1f}" y2="{row_y:.1f}" stroke="{color}" stroke-width="{6 if large else 5}" stroke-linecap="round" stroke-dasharray="{record.get("stroke_dasharray") or ""}"/>')
1392
+ parts.append(f'<circle cx="{legend_x + 29:.1f}" cy="{row_y:.1f}" r="{6 if large else 5}" fill="{color}" stroke="#020502" stroke-width="1.5"/>')
1393
+ parts.append(svg_text(legend_x + 74, row_y + 5, record["label"], size=15 if large else 12, weight=850))
1394
+ coverage = f"{record['scored_task_count']}/20 scored"
1395
+ proxy = record.get("proxy_scored_task_count", 0)
1396
+ if proxy:
1397
+ coverage += f" · {proxy} proxy"
1398
+ parts.append(svg_text(legend_x + 74, row_y + (28 if large else 22), coverage, size=12 if large else 10, fill=color, weight=800))
1399
+ detail = split_text(METHOD_DETAILS.get(series_id, record["scope"]), 50 if large else 44)[:2]
1400
+ parts.extend(svg_text_lines(legend_x + 74, row_y + (49 if large else 40), detail, size=10 if large else 8, fill="#a5afa2", weight=560, line_height=13 if large else 10))
1401
+
1402
+ parts.append(svg_text(x + 28, y + height - 30, "Radius = sqrt(normalized score); exact raw and normalized values are in the matrix.", size=11 if large else 9, fill="#a5afa2", weight=600, opacity=0.88))
1403
+
1404
+
1405
+ def draw_task_key(parts: list[str], *, x: float, y: float, width: float, tasks: list[dict[str, Any]], compact: bool = False) -> None:
1406
+ height = 292 if not compact else 250
1407
+ parts.append(f'<rect x="{x:.1f}" y="{y:.1f}" width="{width:.1f}" height="{height:.1f}" rx="16" fill="#020502" fill-opacity="0.62" stroke="#ccffa0" stroke-opacity="0.18"/>')
1408
+ parts.append(svg_text(x + 28, y + 42, "20-task axis key", size=20, weight=850))
1409
+ parts.append(svg_text(x + 250, y + 42, "Task numbers stay on the radar; full names and proxy axes stay here.", size=13, fill="#a5afa2", weight=600))
1410
+ col_count = 4
1411
+ col_w = (width - 56) / col_count
1412
+ row_h = 42 if not compact else 36
1413
+ for idx, task in enumerate(tasks):
1414
+ col = idx // 5
1415
+ row = idx % 5
1416
+ x0 = x + 28 + col * col_w
1417
+ y0 = y + 84 + row * row_h
1418
+ proxy = task["task_id"] in PROXY_TASK_IDS
1419
+ color = "#f472b6" if proxy else "#ccffa0"
1420
+ parts.append(f'<rect x="{x0:.1f}" y="{y0 - 17:.1f}" width="35" height="25" rx="6" fill="{color}" fill-opacity="0.13" stroke="{color}" stroke-opacity="0.40"/>')
1421
+ parts.append(svg_text(x0 + 17.5, y0 + 1, f"{task['task_number']:02d}", size=10, fill=color, anchor="middle", weight=850))
1422
+ task_name = str(task["label"])
1423
+ if len(task_name) > 34:
1424
+ task_name = task_name[:31].rstrip() + "..."
1425
+ parts.append(svg_text(x0 + 46, y0 - 2, task_name, size=11 if not compact else 10, fill="#f4f8ef", weight=800))
1426
+ metric = str(task.get("metric_name") or task.get("metric_key") or "")
1427
+ direction = "lower" if task.get("metric_direction") == "lower" else "higher"
1428
+ metric_text = f"{metric}; {direction} better"
1429
+ if proxy:
1430
+ metric_text += "; proxy axis"
1431
+ if len(metric_text) > 43:
1432
+ metric_text = metric_text[:40].rstrip() + "..."
1433
+ parts.append(svg_text(x0 + 46, y0 + 16, metric_text, size=9, fill="#a5afa2", weight=560))
1434
+
1435
+
1436
+ def draw_reading_rules(parts: list[str], *, y: float, reading_rules: tuple[str, str, str] | None) -> None:
1437
+ if reading_rules is None:
1438
+ reading_rules = (
1439
+ "Use the panels for shape and coverage; use docs/data/task_method_20_result_matrix.json for exact ranks, raw values, direct/proxy flags, and sources.",
1440
+ "The old nine-method overlay was replaced by grouped small multiples so each radar compares only related methods.",
1441
+ "SVG radius uses sqrt(normalized_score) for readable area; JSON normalized_score remains linear and unchanged.",
1442
+ )
1443
+ parts.append(f'<rect x="70" y="{y:.1f}" width="2260" height="118" rx="14" fill="#020502" fill-opacity="0.62" stroke="#ccffa0" stroke-opacity="0.16"/>')
1444
+ parts.append(svg_text(100, y + 33, "Reading rules", size=16, fill="#ccffa0", weight=850))
1445
+ parts.append(svg_text(230, y + 33, reading_rules[0], size=13, fill="#dce8d7", weight=650))
1446
+ parts.append(svg_text(230, y + 61, reading_rules[1], size=12, fill="#a5afa2", weight=560))
1447
+ parts.append(svg_text(230, y + 87, reading_rules[2], size=12, fill="#a5afa2", weight=560))
1448
+
1449
+
1450
  def render_svg(
1451
  payload: dict[str, Any],
1452
  *,
 
1458
  chip_specs: list[tuple[str, str]] | None = None,
1459
  reading_rules: tuple[str, str, str] | None = None,
1460
  ) -> str:
1461
+ del polygon_series_ids
1462
+ width, height = 2400, 1900
1463
  tasks = payload["tasks"]
 
 
1464
  if series_ids is None:
1465
  series_ids = tuple(record["id"] for record in payload["series"])
1466
+ groups = radar_groups_for_series(series_ids)
1467
+ series_record_by_id = {record["id"]: record for record in payload["series"]}
1468
  parts = [
1469
  f'<svg xmlns="http://www.w3.org/2000/svg" width="{width}" height="{height}" viewBox="0 0 {width} {height}">',
1470
  "<defs>",
1471
+ '<filter id="softGlow"><feGaussianBlur stdDeviation="4" result="blur"/><feMerge><feMergeNode in="blur"/><feMergeNode in="SourceGraphic"/></feMerge></filter>',
1472
+ '<pattern id="dots" width="22" height="22" patternUnits="userSpaceOnUse"><circle cx="2" cy="2" r="1.15" fill="#ccffa0" opacity="0.12"/></pattern>',
1473
  "</defs>",
1474
  '<rect width="100%" height="100%" fill="#020502"/>',
1475
+ '<rect width="100%" height="100%" fill="url(#dots)" opacity="0.40"/>',
1476
+ '<rect x="28" y="28" width="2344" height="1844" rx="22" fill="#061006" fill-opacity="0.90" stroke="#ccffa0" stroke-opacity="0.22"/>',
1477
+ svg_text(70, 86, title or payload.get("title", "20-Task Model Radar"), size=36, weight=850),
1478
  svg_text(
1479
  70,
1480
  122,
1481
+ subtitle or "Grouped small-multiple radars for the nine-method, 180-result comparison.",
1482
  size=18,
1483
  fill="#dce8d7",
1484
  weight=650,
 
1487
  70,
1488
  150,
1489
  context_line
1490
+ or "Related methods are compared in separate panels to avoid the unreadable nine-polygon overlay.",
1491
  size=15,
1492
  fill="#a5afa2",
1493
  weight=560,
 
1499
  ("20 task axes", "#ccffa0"),
1500
  (f"{payload['method_task_record_count']} method-task records", "#67e8d1"),
1501
  (f"{payload['scored_method_task_count']} scored records", "#22d3ee"),
1502
+ ("grouped small multiples", "#f59e0b"),
1503
+ ("sqrt visual radius", "#f472b6"),
1504
  ]
1505
  chip_x = 70
1506
  for label, color in chip_specs:
1507
+ chip_w = max(128, min(280, 18 + len(label) * 8.3))
1508
+ parts.append(f'<rect x="{chip_x:.1f}" y="174" width="{chip_w:.1f}" height="34" rx="17" fill="{color}" fill-opacity="0.10" stroke="{color}" stroke-opacity="0.38"/>')
1509
+ parts.append(svg_text(chip_x + 16, 197, label, size=13, fill=color, weight=780))
1510
  chip_x += chip_w + 12
1511
 
1512
+ if len(groups) == 1:
1513
+ draw_radar_panel(
1514
+ parts,
1515
+ x=70,
1516
+ y=242,
1517
+ width=2260,
1518
+ height=1040,
1519
+ group=groups[0],
1520
+ payload=payload,
1521
+ series_record_by_id=series_record_by_id,
1522
+ large=True,
1523
+ )
1524
+ key_y = 1322
1525
+ elif len(groups) == 3:
1526
+ panel_w, panel_h = 1100, 545
1527
+ start_x, start_y = 70, 248
1528
+ gap_x, gap_y = 30, 34
1529
+ for idx, group in enumerate(groups[:2]):
1530
+ draw_radar_panel(
1531
+ parts,
1532
+ x=start_x + idx * (panel_w + gap_x),
1533
+ y=start_y,
1534
+ width=panel_w,
1535
+ height=panel_h,
1536
+ group=group,
1537
+ payload=payload,
1538
+ series_record_by_id=series_record_by_id,
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1539
  )
1540
+ draw_radar_panel(
1541
+ parts,
1542
+ x=start_x,
1543
+ y=start_y + panel_h + gap_y,
1544
+ width=panel_w * 2 + gap_x,
1545
+ height=panel_h,
1546
+ group=groups[2],
1547
+ payload=payload,
1548
+ series_record_by_id=series_record_by_id,
1549
+ large=True,
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1550
  )
1551
+ key_y = 1438
1552
+ else:
1553
+ panel_w, panel_h = 1100, 545
1554
+ start_x, start_y = 70, 248
1555
+ gap_x, gap_y = 30, 34
1556
+ for idx, group in enumerate(groups):
1557
+ col = idx % 2
1558
+ row = idx // 2
1559
+ draw_radar_panel(
1560
+ parts,
1561
+ x=start_x + col * (panel_w + gap_x),
1562
+ y=start_y + row * (panel_h + gap_y),
1563
+ width=panel_w,
1564
+ height=panel_h,
1565
+ group=group,
1566
+ payload=payload,
1567
+ series_record_by_id=series_record_by_id,
1568
+ )
1569
+ key_y = 1438
1570
 
1571
+ draw_task_key(parts, x=70, y=key_y, width=2260, tasks=tasks, compact=len(groups) == 1)
1572
+ draw_reading_rules(parts, y=1750 if len(groups) > 1 else 1632, reading_rules=reading_rules)
1573
  parts.append("</svg>")
1574
  return "\n".join(parts) + "\n"
1575
 
 
1640
  series_ids=EPISODE128_SERIES,
1641
  polygon_series_ids=("raw128_simple", "raw128_neural_mlp"),
1642
  title="128-Episode 20-Task Radar",
1643
+ subtitle="Selected 96/16/16 episode split; all seven 128-episode rows score all 20 axes.",
1644
+ context_line="Metadata, raw-feature, and foundation-model methods are separated into grouped radar panels instead of one crowded overlay.",
1645
  chip_specs=[
1646
  ("20 task axes", "#ccffa0"),
1647
  ("140 method-task records", "#67e8d1"),
 
1651
  ],
1652
  reading_rules=(
1653
  "Every 128-episode method has 20 result records and all 140 rows are scored in this split radar.",
1654
+ "Raw128 Simple and Raw128 NN are complete 20/20 scored multi-episode baselines; tasks 15/19 are documented compact proxies and are marked in the task key.",
1655
  "Qwen3-Omni and Cosmos3 rows use verified held-out outputs or derived probe artifacts; source paths stay in the matrix JSON.",
1656
  ),
1657
  ),