Publish Ropedia Xperience-10M task baseline cards

Browse files

Files changed (12) hide show

ARTIFACT_GUIDE.md +1 -1
EVIDENCE_CONTRACT.md +3 -2
README.md +7 -6
metrics/artifact_index.json +8 -8
metrics/evidence_contract.json +1 -1
metrics/mirror_parity.json +81 -52
metrics/publication_audit.json +6 -1
metrics/scope_claims_audit.json +1 -1
metrics/website_integrity.json +2 -2
scripts/build_artifact_index.py +1 -1
scripts/validate_mirror_parity.py +22 -0
scripts/validate_publication_package.py +16 -0

ARTIFACT_GUIDE.md CHANGED Viewed

@@ -24,7 +24,7 @@ The project intentionally separates four layers:
 | [`EVIDENCE_CONTRACT.md`](EVIDENCE_CONTRACT.md) | Defines which claims are verified and which are explicitly not claimed. |
 | [`REPRODUCIBILITY.md`](REPRODUCIBILITY.md) | Defines public reproduction commands, expected outputs, and unreproducible boundaries. |
 | [`metrics/artifact_index.json`](metrics/artifact_index.json) | Lists reviewer-critical files with existence, size, and stable hashes. |
-| [`metrics/mirror_parity.json`](metrics/mirror_parity.json) | Confirms prepared HF Space, artifact, and model mirrors match the repo for critical files. |
 | [`metrics/publication_audit.json`](metrics/publication_audit.json) | Confirms public bundles exclude raw data, Python caches, heavy archives, and token strings. |
 | [`metrics/scope_claims_audit.json`](metrics/scope_claims_audit.json) | Confirms historical `32ep` smoke-run identifiers are not presented as real 32-episode results. |
 | [`metrics/website_integrity.json`](metrics/website_integrity.json) | Confirms local site links, anchors, JSON bundles, and referenced images resolve. |

 | [`EVIDENCE_CONTRACT.md`](EVIDENCE_CONTRACT.md) | Defines which claims are verified and which are explicitly not claimed. |
 | [`REPRODUCIBILITY.md`](REPRODUCIBILITY.md) | Defines public reproduction commands, expected outputs, and unreproducible boundaries. |
 | [`metrics/artifact_index.json`](metrics/artifact_index.json) | Lists reviewer-critical files with existence, size, and stable hashes. |
+| [`metrics/mirror_parity.json`](metrics/mirror_parity.json) | Confirms prepared HF Space, artifact, and model mirrors match the repo for critical data, figures, website HTML, and validator scripts. |
 | [`metrics/publication_audit.json`](metrics/publication_audit.json) | Confirms public bundles exclude raw data, Python caches, heavy archives, and token strings. |
 | [`metrics/scope_claims_audit.json`](metrics/scope_claims_audit.json) | Confirms historical `32ep` smoke-run identifiers are not presented as real 32-episode results. |
 | [`metrics/website_integrity.json`](metrics/website_integrity.json) | Confirms local site links, anchors, JSON bundles, and referenced images resolve. |

EVIDENCE_CONTRACT.md CHANGED Viewed

@@ -15,7 +15,7 @@ local artifact that a reader can inspect before trusting the dashboard.
 | Qwen3-Omni infrastructure has passed technical smoke checks. | Companion GitHub repo: `results/omni_finetune/RUN_REPORT.md`, `results/omni_finetune/dataset_manifest.json`, `results/omni_finetune/metrics_eval.json` | Smoke-only evidence | One episode, 128 train windows; not a 32-episode pilot |
 | The real 32-episode LoRA pilot is blocked on gated data access, not on repo presentation. | Companion GitHub repo: `results/omni_finetune/DATA_BLOCKER_REPORT.md`, `results/omni_finetune/A100_HF_RELAY_STATUS.md`, `results/omni_finetune/source_discovery.json` | Blocker documented | No 32-episode metric should be claimed until the gate passes |
 | Historical `32ep` path strings are not treated as 32-episode results. | `scripts/validate_scope_claims.py`, `metrics/scope_claims_audit.json` | Verified pass | Classifies old run/path identifiers and fails if public presentation claims real 32-episode metrics |
-| Prepared GitHub/Hugging Face mirrors carry matching critical files. | `scripts/validate_mirror_parity.py`, `metrics/mirror_parity.json` | Verified pass | Compares prepared Space, artifact dataset, and model bundles before upload; live URLs are checked after publishing |
 | The public GitHub and Hugging Face bundles are publication-clean. | `scripts/validate_publication_package.py`, `metrics/publication_audit.json` | Verified pass | Checks public files and HF bundles, not arbitrary ignored local scratch outputs |
 | The public website has checked local references. | `scripts/validate_website_integrity.py`, `metrics/website_integrity.json` | Verified pass | Checks local links, anchors, JSON data, and referenced images; external URLs are not fetched |
 | The core proof artifacts are indexed and grouped for fast review. | `ARTIFACT_GUIDE.md`, `scripts/build_artifact_index.py`, `metrics/artifact_index.json` | Verified guide and index | Selective source-of-truth catalog, not a complete inventory of every output file |
@@ -43,7 +43,8 @@ local artifact that a reader can inspect before trusting the dashboard.
 8. Inspect `metrics/scope_claims_audit.json` before interpreting historical
    `32ep` strings in Qwen3-Omni smoke artifacts.
 9. Inspect `metrics/mirror_parity.json` before assuming the GitHub and
-   Hugging Face mirrors contain the same critical files.
 10. Inspect the companion GitHub repo's
    `results/omni_finetune/DATA_BLOCKER_REPORT.md` before interpreting any
    Qwen3-Omni artifact.

 | Qwen3-Omni infrastructure has passed technical smoke checks. | Companion GitHub repo: `results/omni_finetune/RUN_REPORT.md`, `results/omni_finetune/dataset_manifest.json`, `results/omni_finetune/metrics_eval.json` | Smoke-only evidence | One episode, 128 train windows; not a 32-episode pilot |
 | The real 32-episode LoRA pilot is blocked on gated data access, not on repo presentation. | Companion GitHub repo: `results/omni_finetune/DATA_BLOCKER_REPORT.md`, `results/omni_finetune/A100_HF_RELAY_STATUS.md`, `results/omni_finetune/source_discovery.json` | Blocker documented | No 32-episode metric should be claimed until the gate passes |
 | Historical `32ep` path strings are not treated as 32-episode results. | `scripts/validate_scope_claims.py`, `metrics/scope_claims_audit.json` | Verified pass | Classifies old run/path identifiers and fails if public presentation claims real 32-episode metrics |
+| Prepared GitHub/Hugging Face mirrors carry matching critical files. | `scripts/validate_mirror_parity.py`, `metrics/mirror_parity.json` | Verified pass | Compares prepared data files, visual assets, website HTML, and validator scripts before upload; live URLs are checked after publishing |
 | The public GitHub and Hugging Face bundles are publication-clean. | `scripts/validate_publication_package.py`, `metrics/publication_audit.json` | Verified pass | Checks public files and HF bundles, not arbitrary ignored local scratch outputs |
 | The public website has checked local references. | `scripts/validate_website_integrity.py`, `metrics/website_integrity.json` | Verified pass | Checks local links, anchors, JSON data, and referenced images; external URLs are not fetched |
 | The core proof artifacts are indexed and grouped for fast review. | `ARTIFACT_GUIDE.md`, `scripts/build_artifact_index.py`, `metrics/artifact_index.json` | Verified guide and index | Selective source-of-truth catalog, not a complete inventory of every output file |
 8. Inspect `metrics/scope_claims_audit.json` before interpreting historical
    `32ep` strings in Qwen3-Omni smoke artifacts.
 9. Inspect `metrics/mirror_parity.json` before assuming the GitHub and
+   Hugging Face mirrors contain the same critical data, visual, HTML, and
+   validator files.
 10. Inspect the companion GitHub repo's
    `results/omni_finetune/DATA_BLOCKER_REPORT.md` before interpreting any
    Qwen3-Omni artifact.

README.md CHANGED Viewed

@@ -62,14 +62,15 @@ and metrics for the 12-task Xperience-10M episode suite, plus four lightweight
 direction-extension probes. It is meant to be read like a model audit, not
 advertised as a robot foundation model.
-![12-task suite with sample modalities](assets/task_suite_infographic.png?v=xperience10m-modalities-v9-large-atlas)
 The source Xperience-10M sample spans video, audio, depth, pose, motion
 capture, inertial sensing, and language annotation. The committed minimal and
 neural task heads use the current 8,378-d feature manifest; audio is documented
 in the figures but is not yet extracted into a model input feature block.
-The companion dashboard and this model card mirror the responsive modality atlas
-metadata in `metrics/modality_atlas.json`, with standalone derived thumbnails in
 `assets/modalities/`.
 The committed heads are intentionally small:
@@ -110,7 +111,7 @@ Source-of-truth artifact index mirror: `metrics/artifact_index.json`.
 | Feature contract | `artifacts/**/feature_manifest.json` | audio documented but not featurized |
 | Qwen3-Omni | companion blocker and relay reports | smoke-only until 32 valid episodes are available |
 | Scope claims guard | `metrics/scope_claims_audit.json` and `scripts/validate_scope_claims.py` | historical `32ep` path strings are provenance, not 32-episode results |
-| Mirror parity | `metrics/mirror_parity.json` and `scripts/validate_mirror_parity.py` | prepared repo/HF mirrors carry matching critical files |
 | Publication hygiene | `metrics/publication_audit.json` and validator script mirror | public bundles contain no raw data, generated caches, heavy archives, or token strings |
 | Website integrity | `metrics/website_integrity.json` and validator script mirror | local links, anchors, JSON bundles, and referenced images only |
 | Artifact index | `metrics/artifact_index.json` and `scripts/build_artifact_index.py` | compact catalog of the reviewer-critical proof artifacts |
@@ -142,10 +143,10 @@ transfers them to H20 for manifest building, training, and evaluation.
 | `artifacts/episode_task_suite/research_direction_extensions/` | adds one coded extension probe per research direction |
 | `artifacts/episode_task_suite/task_walkthroughs/` | explains every task with case study, input, process modules, output, and limitation |
 | `assets/task_architectures.png` | shows the shared pipeline and all 12 heads |
-| `assets/task_suite_infographic.png` | presents the 12 heads with public-sample modality thumbnails and verified metrics |
 | `assets/modalities/`, `metrics/modality_atlas.json` | responsive modality-card thumbnails and metadata for sample inspection |
 | `metrics/artifact_index.json` | indexes proof artifacts with existence, size, and stable-file hashes |
-| `metrics/mirror_parity.json` | verifies prepared repo/HF mirrors have matching critical files before upload |
 | `metrics/scope_claims_audit.json` | verifies historical `32ep` smoke-run identifiers are not presented as real 32-episode results |
 | `metrics/publication_audit.json` | records the latest public-bundle hygiene check |
 | `metrics/website_integrity.json` | records the latest local website link, anchor, JSON, and image integrity check |

 direction-extension probes. It is meant to be read like a model audit, not
 advertised as a robot foundation model.
+![12-task suite with sample modalities](assets/task_suite_infographic.png?v=xperience10m-taskfirst-v10)
 The source Xperience-10M sample spans video, audio, depth, pose, motion
 capture, inertial sensing, and language annotation. The committed minimal and
 neural task heads use the current 8,378-d feature manifest; audio is documented
 in the figures but is not yet extracted into a model input feature block.
+The companion dashboard and this model card start with the task-first 12-head
+map, then mirror the responsive modality atlas metadata in
+`metrics/modality_atlas.json`, with standalone derived thumbnails in
 `assets/modalities/`.
 The committed heads are intentionally small:
 | Feature contract | `artifacts/**/feature_manifest.json` | audio documented but not featurized |
 | Qwen3-Omni | companion blocker and relay reports | smoke-only until 32 valid episodes are available |
 | Scope claims guard | `metrics/scope_claims_audit.json` and `scripts/validate_scope_claims.py` | historical `32ep` path strings are provenance, not 32-episode results |
+| Mirror parity | `metrics/mirror_parity.json` and `scripts/validate_mirror_parity.py` | prepared repo/HF mirrors carry matching critical data, figures, website HTML, and validator files |
 | Publication hygiene | `metrics/publication_audit.json` and validator script mirror | public bundles contain no raw data, generated caches, heavy archives, or token strings |
 | Website integrity | `metrics/website_integrity.json` and validator script mirror | local links, anchors, JSON bundles, and referenced images only |
 | Artifact index | `metrics/artifact_index.json` and `scripts/build_artifact_index.py` | compact catalog of the reviewer-critical proof artifacts |
 | `artifacts/episode_task_suite/research_direction_extensions/` | adds one coded extension probe per research direction |
 | `artifacts/episode_task_suite/task_walkthroughs/` | explains every task with case study, input, process modules, output, and limitation |
 | `assets/task_architectures.png` | shows the shared pipeline and all 12 heads |
+| `assets/task_suite_infographic.png` | presents the shared processing contract, 12 heads, verified metrics, and public-sample modality thumbnails |
 | `assets/modalities/`, `metrics/modality_atlas.json` | responsive modality-card thumbnails and metadata for sample inspection |
 | `metrics/artifact_index.json` | indexes proof artifacts with existence, size, and stable-file hashes |
+| `metrics/mirror_parity.json` | verifies prepared repo/HF mirrors have matching critical data, figures, website HTML, and validator files before upload |
 | `metrics/scope_claims_audit.json` | verifies historical `32ep` smoke-run identifiers are not presented as real 32-episode results |
 | `metrics/publication_audit.json` | records the latest public-bundle hygiene check |
 | `metrics/website_integrity.json` | records the latest local website link, anchor, JSON, and image integrity check |

metrics/artifact_index.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "title": "Ropedia Xperience-10M Task Suite Artifact Index",
-  "generated_at_utc": "2026-06-01T04:49:01+00:00",
   "status": "pass",
   "artifact_count": 29,
   "missing": [],
@@ -35,8 +35,8 @@
       "surface": "repo",
       "proves": "Defines what is verified, what is smoke-only, and what must not be inferred.",
       "exists": true,
-      "bytes": 6440,
-      "sha256": "a89e2316e19ebacbb1150879c070279f8f6f659030a945fc398eb08280c60cc0"
     },
     {
       "id": "reviewer_packet",
@@ -57,8 +57,8 @@
       "surface": "repo_hf",
       "proves": "Gives the human-readable map from proof boundary to data, tasks, platform mirrors, and scale-up status.",
       "exists": true,
-      "bytes": 6438,
-      "sha256": "01d2e37bf25a5884e116ba7de80cc460d69523c563e540a650646a58e365713f"
     },
     {
       "id": "reproducibility_contract",
@@ -90,8 +90,8 @@
       "surface": "repo_hf",
       "proves": "Generates the selective proof-artifact catalog from local files.",
       "exists": true,
-      "bytes": 11565,
-      "sha256": "d57875b1e42a58c02aa2f7da481f7b2190b82414113827883dd8b332c33552f3"
     },
     {
       "id": "publication_audit",
@@ -124,7 +124,7 @@
       "kind": "mirror_parity",
       "surface": "website_hf",
       "volatile": true,
-      "proves": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, and validator files.",
       "exists": true,
       "bytes": 41465,
       "hash_policy": "existence_and_size_only"

 {
   "title": "Ropedia Xperience-10M Task Suite Artifact Index",
+  "generated_at_utc": "2026-06-01T05:07:04+00:00",
   "status": "pass",
   "artifact_count": 29,
   "missing": [],
       "surface": "repo",
       "proves": "Defines what is verified, what is smoke-only, and what must not be inferred.",
       "exists": true,
+      "bytes": 6497,
+      "sha256": "417835c2f838f1d4c4bca9f07c708ce04611e7212017e58421956818a4ca4b45"
     },
     {
       "id": "reviewer_packet",
       "surface": "repo_hf",
       "proves": "Gives the human-readable map from proof boundary to data, tasks, platform mirrors, and scale-up status.",
       "exists": true,
+      "bytes": 6483,
+      "sha256": "cc211ec1175eed43bc5d8c9ce1ec412982524af1998316b392e77e5d7ddc99ee"
     },
     {
       "id": "reproducibility_contract",
       "surface": "repo_hf",
       "proves": "Generates the selective proof-artifact catalog from local files.",
       "exists": true,
+      "bytes": 11579,
+      "sha256": "874a3813fb3a19d79be9ea4c0177f5922adf9e667760f927dd49163784eb6b48"
     },
     {
       "id": "publication_audit",
       "kind": "mirror_parity",
       "surface": "website_hf",
       "volatile": true,
+      "proves": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, website HTML, and validator files.",
       "exists": true,
       "bytes": 41465,
       "hash_policy": "existence_and_size_only"

metrics/evidence_contract.json CHANGED Viewed

@@ -110,7 +110,7 @@
     },
     {
       "id": "mirror_parity",
-      "claim": "Prepared GitHub and Hugging Face mirrors carry matching critical files.",
       "status": "verified",
       "evidence": [
         "scripts/validate_mirror_parity.py",

     },
     {
       "id": "mirror_parity",
+      "claim": "Prepared GitHub and Hugging Face mirrors carry matching critical data, visual, HTML, and validator files.",
       "status": "verified",
       "evidence": [
         "scripts/validate_mirror_parity.py",

metrics/mirror_parity.json CHANGED Viewed

@@ -1,9 +1,9 @@
 {
   "status": "pass",
-  "generated_at_utc": "2026-06-01T04:49:44+00:00",
   "hf_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish",
   "summary": {
-    "group_count": 28,
     "failure_count": 0,
     "failures_by_surface": {}
   },
@@ -19,6 +19,10 @@
     {
       "name": "repo_hf_validator_script_parity",
       "status": "pass"
     }
   ],
   "groups": [
@@ -28,27 +32,27 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/artifact_index.json",
         "exists": true,
-        "bytes": 12902,
-        "sha256": "0a6fb26c150942a0807fc38a092bc85f8dd63cc96943d6c2fb8a1df2d727b7ed"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/artifact_index.json",
           "exists": true,
-          "bytes": 12902,
-          "sha256": "0a6fb26c150942a0807fc38a092bc85f8dd63cc96943d6c2fb8a1df2d727b7ed"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/artifact_index.json",
           "exists": true,
-          "bytes": 12902,
-          "sha256": "0a6fb26c150942a0807fc38a092bc85f8dd63cc96943d6c2fb8a1df2d727b7ed"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/artifact_index.json",
           "exists": true,
-          "bytes": 12902,
-          "sha256": "0a6fb26c150942a0807fc38a092bc85f8dd63cc96943d6c2fb8a1df2d727b7ed"
         }
       },
       "failures": []
@@ -59,27 +63,27 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/evidence_contract.json",
         "exists": true,
-        "bytes": 7148,
-        "sha256": "7be0e996d5acec81b26eba19919ff92f951241c22189086c484b055c7f988bed"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/evidence_contract.json",
           "exists": true,
-          "bytes": 7148,
-          "sha256": "7be0e996d5acec81b26eba19919ff92f951241c22189086c484b055c7f988bed"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/evidence_contract.json",
           "exists": true,
-          "bytes": 7148,
-          "sha256": "7be0e996d5acec81b26eba19919ff92f951241c22189086c484b055c7f988bed"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/evidence_contract.json",
           "exists": true,
-          "bytes": 7148,
-          "sha256": "7be0e996d5acec81b26eba19919ff92f951241c22189086c484b055c7f988bed"
         }
       },
       "failures": []
@@ -152,27 +156,27 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/publication_audit.json",
         "exists": true,
-        "bytes": 4105,
-        "sha256": "ce4addc653c34287da1f529f526362fb791ad2a07d0e6610f617c4c8e1cf9597"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/publication_audit.json",
           "exists": true,
-          "bytes": 4105,
-          "sha256": "ce4addc653c34287da1f529f526362fb791ad2a07d0e6610f617c4c8e1cf9597"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/publication_audit.json",
           "exists": true,
-          "bytes": 4105,
-          "sha256": "ce4addc653c34287da1f529f526362fb791ad2a07d0e6610f617c4c8e1cf9597"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/publication_audit.json",
           "exists": true,
-          "bytes": 4105,
-          "sha256": "ce4addc653c34287da1f529f526362fb791ad2a07d0e6610f617c4c8e1cf9597"
         }
       },
       "failures": []
@@ -308,26 +312,26 @@
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/scope_claims_audit.json",
         "exists": true,
         "bytes": 19964,
-        "sha256": "a1a90b04b8bd11e751a34a9ca27676dbe543b6a3bf2454807abf861a91ce33b4"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
-          "sha256": "a1a90b04b8bd11e751a34a9ca27676dbe543b6a3bf2454807abf861a91ce33b4"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
-          "sha256": "a1a90b04b8bd11e751a34a9ca27676dbe543b6a3bf2454807abf861a91ce33b4"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
-          "sha256": "a1a90b04b8bd11e751a34a9ca27676dbe543b6a3bf2454807abf861a91ce33b4"
         }
       },
       "failures": []
@@ -401,26 +405,26 @@
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/website_integrity.json",
         "exists": true,
         "bytes": 5936,
-        "sha256": "0ba08b7d03c5513520d2900d57cd383f24e228a5d9d55b6a89e8d3419594c55f"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/website_integrity.json",
           "exists": true,
           "bytes": 5936,
-          "sha256": "0ba08b7d03c5513520d2900d57cd383f24e228a5d9d55b6a89e8d3419594c55f"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/website_integrity.json",
           "exists": true,
           "bytes": 5936,
-          "sha256": "0ba08b7d03c5513520d2900d57cd383f24e228a5d9d55b6a89e8d3419594c55f"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/website_integrity.json",
           "exists": true,
           "bytes": 5936,
-          "sha256": "0ba08b7d03c5513520d2900d57cd383f24e228a5d9d55b6a89e8d3419594c55f"
         }
       },
       "failures": []
@@ -801,21 +805,21 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/build_artifact_index.py",
         "exists": true,
-        "bytes": 11565,
-        "sha256": "d57875b1e42a58c02aa2f7da481f7b2190b82414113827883dd8b332c33552f3"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/build_artifact_index.py",
           "exists": true,
-          "bytes": 11565,
-          "sha256": "d57875b1e42a58c02aa2f7da481f7b2190b82414113827883dd8b332c33552f3"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/build_artifact_index.py",
           "exists": true,
-          "bytes": 11565,
-          "sha256": "d57875b1e42a58c02aa2f7da481f7b2190b82414113827883dd8b332c33552f3"
         }
       },
       "failures": []
@@ -826,21 +830,21 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/validate_mirror_parity.py",
         "exists": true,
-        "bytes": 6971,
-        "sha256": "d0e0a1514a6c8548120f8bcb68827a648252a198197acab9186b72725fe9d39b"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/validate_mirror_parity.py",
           "exists": true,
-          "bytes": 6971,
-          "sha256": "d0e0a1514a6c8548120f8bcb68827a648252a198197acab9186b72725fe9d39b"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/validate_mirror_parity.py",
           "exists": true,
-          "bytes": 6971,
-          "sha256": "d0e0a1514a6c8548120f8bcb68827a648252a198197acab9186b72725fe9d39b"
         }
       },
       "failures": []
@@ -851,21 +855,21 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/validate_publication_package.py",
         "exists": true,
-        "bytes": 8960,
-        "sha256": "129e1276a60abe1330de5190622097a0e19198d133d434425317123f0a390c82"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/validate_publication_package.py",
           "exists": true,
-          "bytes": 8960,
-          "sha256": "129e1276a60abe1330de5190622097a0e19198d133d434425317123f0a390c82"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/validate_publication_package.py",
           "exists": true,
-          "bytes": 8960,
-          "sha256": "129e1276a60abe1330de5190622097a0e19198d133d434425317123f0a390c82"
         }
       },
       "failures": []
@@ -919,6 +923,31 @@
         }
       },
       "failures": []
     }
   ],
   "failures": []

 {
   "status": "pass",
+  "generated_at_utc": "2026-06-01T05:08:43+00:00",
   "hf_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish",
   "summary": {
+    "group_count": 29,
     "failure_count": 0,
     "failures_by_surface": {}
   },
     {
       "name": "repo_hf_validator_script_parity",
       "status": "pass"
+    },
+    {
+      "name": "repo_hf_website_html_parity",
+      "status": "pass"
     }
   ],
   "groups": [
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/artifact_index.json",
         "exists": true,
+        "bytes": 12916,
+        "sha256": "977e2d8d0ec9e42bee1fb7b43b9460b42a6d8a6d6e9a452389901b8d56d69372"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/artifact_index.json",
           "exists": true,
+          "bytes": 12916,
+          "sha256": "977e2d8d0ec9e42bee1fb7b43b9460b42a6d8a6d6e9a452389901b8d56d69372"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/artifact_index.json",
           "exists": true,
+          "bytes": 12916,
+          "sha256": "977e2d8d0ec9e42bee1fb7b43b9460b42a6d8a6d6e9a452389901b8d56d69372"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/artifact_index.json",
           "exists": true,
+          "bytes": 12916,
+          "sha256": "977e2d8d0ec9e42bee1fb7b43b9460b42a6d8a6d6e9a452389901b8d56d69372"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/evidence_contract.json",
         "exists": true,
+        "bytes": 7182,
+        "sha256": "42a75b0f87eec02dd5b5fedffe6eb3d0cdc8d9f12156887680686f1900ac2bfa"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/evidence_contract.json",
           "exists": true,
+          "bytes": 7182,
+          "sha256": "42a75b0f87eec02dd5b5fedffe6eb3d0cdc8d9f12156887680686f1900ac2bfa"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/evidence_contract.json",
           "exists": true,
+          "bytes": 7182,
+          "sha256": "42a75b0f87eec02dd5b5fedffe6eb3d0cdc8d9f12156887680686f1900ac2bfa"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/evidence_contract.json",
           "exists": true,
+          "bytes": 7182,
+          "sha256": "42a75b0f87eec02dd5b5fedffe6eb3d0cdc8d9f12156887680686f1900ac2bfa"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/publication_audit.json",
         "exists": true,
+        "bytes": 4214,
+        "sha256": "3d1a2d861c96d445541519494abfcfca1da13cb593094a8c660ad40a036ab218"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/publication_audit.json",
           "exists": true,
+          "bytes": 4214,
+          "sha256": "3d1a2d861c96d445541519494abfcfca1da13cb593094a8c660ad40a036ab218"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/publication_audit.json",
           "exists": true,
+          "bytes": 4214,
+          "sha256": "3d1a2d861c96d445541519494abfcfca1da13cb593094a8c660ad40a036ab218"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/publication_audit.json",
           "exists": true,
+          "bytes": 4214,
+          "sha256": "3d1a2d861c96d445541519494abfcfca1da13cb593094a8c660ad40a036ab218"
         }
       },
       "failures": []
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/scope_claims_audit.json",
         "exists": true,
         "bytes": 19964,
+        "sha256": "5520aa2b2c41ed9394283e8bf08be0ec1926b2851a952ba8a8a56a1f85a058eb"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
+          "sha256": "5520aa2b2c41ed9394283e8bf08be0ec1926b2851a952ba8a8a56a1f85a058eb"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
+          "sha256": "5520aa2b2c41ed9394283e8bf08be0ec1926b2851a952ba8a8a56a1f85a058eb"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
+          "sha256": "5520aa2b2c41ed9394283e8bf08be0ec1926b2851a952ba8a8a56a1f85a058eb"
         }
       },
       "failures": []
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/website_integrity.json",
         "exists": true,
         "bytes": 5936,
+        "sha256": "b9c324a59e447a11bc6aeb5130736788981b9a1b529a80c988378e3f05f924b1"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/website_integrity.json",
           "exists": true,
           "bytes": 5936,
+          "sha256": "b9c324a59e447a11bc6aeb5130736788981b9a1b529a80c988378e3f05f924b1"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/website_integrity.json",
           "exists": true,
           "bytes": 5936,
+          "sha256": "b9c324a59e447a11bc6aeb5130736788981b9a1b529a80c988378e3f05f924b1"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/website_integrity.json",
           "exists": true,
           "bytes": 5936,
+          "sha256": "b9c324a59e447a11bc6aeb5130736788981b9a1b529a80c988378e3f05f924b1"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/build_artifact_index.py",
         "exists": true,
+        "bytes": 11579,
+        "sha256": "874a3813fb3a19d79be9ea4c0177f5922adf9e667760f927dd49163784eb6b48"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/build_artifact_index.py",
           "exists": true,
+          "bytes": 11579,
+          "sha256": "874a3813fb3a19d79be9ea4c0177f5922adf9e667760f927dd49163784eb6b48"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/build_artifact_index.py",
           "exists": true,
+          "bytes": 11579,
+          "sha256": "874a3813fb3a19d79be9ea4c0177f5922adf9e667760f927dd49163784eb6b48"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/validate_mirror_parity.py",
         "exists": true,
+        "bytes": 7617,
+        "sha256": "0a74954e50fbf7bff661c9499244fc9be704764b701431fc2035ab4cc29d43d0"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/validate_mirror_parity.py",
           "exists": true,
+          "bytes": 7617,
+          "sha256": "0a74954e50fbf7bff661c9499244fc9be704764b701431fc2035ab4cc29d43d0"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/validate_mirror_parity.py",
           "exists": true,
+          "bytes": 7617,
+          "sha256": "0a74954e50fbf7bff661c9499244fc9be704764b701431fc2035ab4cc29d43d0"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/validate_publication_package.py",
         "exists": true,
+        "bytes": 9772,
+        "sha256": "1a915bdd68a6c63941339282a8f747e4cafa08c24e5cdb3dbe105bf6ac3ea144"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/validate_publication_package.py",
           "exists": true,
+          "bytes": 9772,
+          "sha256": "1a915bdd68a6c63941339282a8f747e4cafa08c24e5cdb3dbe105bf6ac3ea144"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/validate_publication_package.py",
           "exists": true,
+          "bytes": 9772,
+          "sha256": "1a915bdd68a6c63941339282a8f747e4cafa08c24e5cdb3dbe105bf6ac3ea144"
         }
       },
       "failures": []
         }
       },
       "failures": []
+    },
+    {
+      "name": "website/index.html",
+      "status": "pass",
+      "local": {
+        "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/index.html",
+        "exists": true,
+        "bytes": 89653,
+        "sha256": "f4d2b412d24bb29e977e8b82bb531fdb352cc7a1b81a2141ac63a0328bab654b"
+      },
+      "mirrors": {
+        "hf_space": {
+          "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/index.html",
+          "exists": true,
+          "bytes": 89653,
+          "sha256": "f4d2b412d24bb29e977e8b82bb531fdb352cc7a1b81a2141ac63a0328bab654b"
+        },
+        "hf_artifacts_docs": {
+          "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/index.html",
+          "exists": true,
+          "bytes": 89653,
+          "sha256": "f4d2b412d24bb29e977e8b82bb531fdb352cc7a1b81a2141ac63a0328bab654b"
+        }
+      },
+      "failures": []
     }
   ],
   "failures": []

metrics/publication_audit.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "status": "pass",
-  "generated_at_utc": "2026-06-01T04:49:16+00:00",
   "checks": [
     {
       "name": "required_publication_assets_present",
@@ -26,6 +26,11 @@
       "name": "no_hf_tokens_in_public_text",
       "status": "pass",
       "count": 0
     }
   ],
   "required_assets": {

 {
   "status": "pass",
+  "generated_at_utc": "2026-06-01T05:07:53+00:00",
   "checks": [
     {
       "name": "required_publication_assets_present",
       "name": "no_hf_tokens_in_public_text",
       "status": "pass",
       "count": 0
+    },
+    {
+      "name": "no_stale_task_suite_presentation_copy",
+      "status": "pass",
+      "count": 0
     }
   ],
   "required_assets": {

metrics/scope_claims_audit.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "status": "pass",
-  "generated_at_utc": "2026-06-01T04:43:33+00:00",
   "summary": {
     "qwen3_omni_32_episode_claim": false,
     "dataset_manifest_num_episodes": 1,

 {
   "status": "pass",
+  "generated_at_utc": "2026-06-01T05:06:13+00:00",
   "summary": {
     "qwen3_omni_32_episode_claim": false,
     "dataset_manifest_num_episodes": 1,

metrics/website_integrity.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "status": "pass",
-  "generated_at_utc": "2026-06-01T04:48:47+00:00",
   "docs_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs",
   "site_base": "/ropedia-xperience-10m-task-suite/",
   "summary": {
@@ -56,7 +56,7 @@
     },
     {
       "path": "data/evidence_contract.json",
-      "bytes": 7148,
       "top_level_type": "dict"
     },
     {

 {
   "status": "pass",
+  "generated_at_utc": "2026-06-01T05:06:42+00:00",
   "docs_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs",
   "site_base": "/ropedia-xperience-10m-task-suite/",
   "summary": {
     },
     {
       "path": "data/evidence_contract.json",
+      "bytes": 7182,
       "top_level_type": "dict"
     },
     {

scripts/build_artifact_index.py CHANGED Viewed

@@ -90,7 +90,7 @@ ARTIFACTS = [
         "kind": "mirror_parity",
         "surface": "website_hf",
         "volatile": True,
-        "proves": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, and validator files.",
     },
     {
         "id": "website_integrity",

         "kind": "mirror_parity",
         "surface": "website_hf",
         "volatile": True,
+        "proves": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, website HTML, and validator files.",
     },
     {
         "id": "website_integrity",

scripts/validate_mirror_parity.py CHANGED Viewed

@@ -56,6 +56,10 @@ SCRIPT_FILES = [
     "validate_website_integrity.py",
 ]
 def sha256(path: Path) -> str:
     digest = hashlib.sha256()
@@ -150,6 +154,18 @@ def build_report(hf_root: Path) -> dict:
             )
         )
     failures = [
         {"group": group["name"], **failure}
         for group in groups
@@ -187,6 +203,12 @@ def build_report(hf_root: Path) -> dict:
                 if not any(failure["group"].startswith("scripts/") for failure in failures)
                 else "fail",
             },
         ],
         "groups": groups,
         "failures": failures,

     "validate_website_integrity.py",
 ]
+WEBSITE_FILES = [
+    "index.html",
+]
 def sha256(path: Path) -> str:
     digest = hashlib.sha256()
             )
         )
+    for filename in WEBSITE_FILES:
+        groups.append(
+            parity_group(
+                f"website/{filename}",
+                ROOT / "docs" / filename,
+                {
+                    "hf_space": hf_root / "space" / filename,
+                    "hf_artifacts_docs": hf_root / "artifacts/docs" / filename,
+                },
+            )
+        )
     failures = [
         {"group": group["name"], **failure}
         for group in groups
                 if not any(failure["group"].startswith("scripts/") for failure in failures)
                 else "fail",
             },
+            {
+                "name": "repo_hf_website_html_parity",
+                "status": "pass"
+                if not any(failure["group"].startswith("website/") for failure in failures)
+                else "fail",
+            },
         ],
         "groups": groups,
         "failures": failures,

scripts/validate_publication_package.py CHANGED Viewed

@@ -42,6 +42,10 @@ TEXT_SUFFIXES = {
     ".yml",
 }
 TOKEN_PATTERN = re.compile(r"hf_[A-Za-z0-9]{20,}")
 def rel(path: Path, base: Path) -> str:
@@ -114,6 +118,13 @@ def scan(root: Path, *, paths: list[Path] | None = None) -> dict:
                 continue
             if TOKEN_PATTERN.search(text):
                 violations.append({"kind": "possible_hf_token", "path": path_rel})
     return {
         "root": str(root),
@@ -222,6 +233,11 @@ def build_report(hf_root: Path) -> dict:
             "status": "pass" if not any(v["kind"] == "possible_hf_token" for v in violations) else "fail",
             "count": sum(1 for v in violations if v["kind"] == "possible_hf_token"),
         },
     ]
     status = "pass" if all(check["status"] == "pass" for check in checks) else "fail"
     return {

     ".yml",
 }
 TOKEN_PATTERN = re.compile(r"hf_[A-Za-z0-9]{20,}")
+STALE_PRESENTATION_STRINGS = {
+    "xperience10m-" + "modalities-v9-large-atlas": "old task-suite infographic cache key",
+    "Start with the large native " + "modality atlas": "old suite-section hierarchy copy",
+}
 def rel(path: Path, base: Path) -> str:
                 continue
             if TOKEN_PATTERN.search(text):
                 violations.append({"kind": "possible_hf_token", "path": path_rel})
+            for needle, reason in STALE_PRESENTATION_STRINGS.items():
+                if needle in text:
+                    violations.append({
+                        "kind": "stale_presentation_copy",
+                        "path": path_rel,
+                        "detail": reason,
+                    })
     return {
         "root": str(root),
             "status": "pass" if not any(v["kind"] == "possible_hf_token" for v in violations) else "fail",
             "count": sum(1 for v in violations if v["kind"] == "possible_hf_token"),
         },
+        {
+            "name": "no_stale_task_suite_presentation_copy",
+            "status": "pass" if not any(v["kind"] == "stale_presentation_copy" for v in violations) else "fail",
+            "count": sum(1 for v in violations if v["kind"] == "stale_presentation_copy"),
+        },
     ]
     status = "pass" if all(check["status"] == "pass" for check in checks) else "fail"
     return {