Publish Ropedia Xperience-10M task baseline cards

Files changed (11)

ARTIFACT_GUIDE.md +1 -1
EVIDENCE_CONTRACT.md +1 -1
README.md +3 -3
assets/task_suite_infographic.png +2 -2
metrics/artifact_index.json +9 -9
metrics/evidence_contract.json +1 -1
metrics/mirror_parity.json +51 -51
metrics/publication_audit.json +49 -2
metrics/scope_claims_audit.json +1 -1
metrics/website_integrity.json +8 -8
scripts/validate_publication_package.py +75 -0

ARTIFACT_GUIDE.md CHANGED

@@ -25,7 +25,7 @@ The project intentionally separates four layers:
 | [`REPRODUCIBILITY.md`](REPRODUCIBILITY.md) | Defines public reproduction commands, expected outputs, and unreproducible boundaries. |
 | [`metrics/artifact_index.json`](metrics/artifact_index.json) | Lists reviewer-critical files with existence, size, and stable hashes. |
 | [`metrics/mirror_parity.json`](metrics/mirror_parity.json) | Confirms prepared HF Space, artifact, and model mirrors match the repo for critical data, figures, website HTML, and validator scripts. |
-| [`metrics/publication_audit.json`](metrics/publication_audit.json) | Confirms public bundles exclude raw data, Python caches, heavy archives, and token strings. |
 | [`metrics/scope_claims_audit.json`](metrics/scope_claims_audit.json) | Confirms historical `32ep` smoke-run identifiers are not presented as real 32-episode results. |
 | [`metrics/website_integrity.json`](metrics/website_integrity.json) | Confirms local site links, anchors, JSON bundles, and referenced images resolve. |
 | [`metrics/reviewer_packet.json`](metrics/reviewer_packet.json) | Gives the shortest machine-readable reviewer route. |

 | [`REPRODUCIBILITY.md`](REPRODUCIBILITY.md) | Defines public reproduction commands, expected outputs, and unreproducible boundaries. |
 | [`metrics/artifact_index.json`](metrics/artifact_index.json) | Lists reviewer-critical files with existence, size, and stable hashes. |
 | [`metrics/mirror_parity.json`](metrics/mirror_parity.json) | Confirms prepared HF Space, artifact, and model mirrors match the repo for critical data, figures, website HTML, and validator scripts. |
+| [`metrics/publication_audit.json`](metrics/publication_audit.json) | Confirms public bundles exclude raw data, Python caches, heavy archives, token strings, and stale public-card figure references. |
 | [`metrics/scope_claims_audit.json`](metrics/scope_claims_audit.json) | Confirms historical `32ep` smoke-run identifiers are not presented as real 32-episode results. |
 | [`metrics/website_integrity.json`](metrics/website_integrity.json) | Confirms local site links, anchors, JSON bundles, and referenced images resolve. |
 | [`metrics/reviewer_packet.json`](metrics/reviewer_packet.json) | Gives the shortest machine-readable reviewer route. |

EVIDENCE_CONTRACT.md CHANGED

@@ -16,7 +16,7 @@ local artifact that a reader can inspect before trusting the dashboard.
 | The real 32-episode LoRA pilot is blocked on gated data access, not on repo presentation. | Companion GitHub repo: `results/omni_finetune/DATA_BLOCKER_REPORT.md`, `results/omni_finetune/A100_HF_RELAY_STATUS.md`, `results/omni_finetune/source_discovery.json` | Blocker documented | No 32-episode metric should be claimed until the gate passes |
 | Historical `32ep` path strings are not treated as 32-episode results. | `scripts/validate_scope_claims.py`, `metrics/scope_claims_audit.json` | Verified pass | Classifies old run/path identifiers and fails if public presentation claims real 32-episode metrics |
 | Prepared GitHub/Hugging Face mirrors carry matching critical files. | `scripts/validate_mirror_parity.py`, `metrics/mirror_parity.json` | Verified pass | Compares prepared data files, visual assets, website HTML, and validator scripts before upload; live URLs are checked after publishing |
-| The public GitHub and Hugging Face bundles are publication-clean. | `scripts/validate_publication_package.py`, `metrics/publication_audit.json` | Verified pass | Checks public files and HF bundles, not arbitrary ignored local scratch outputs |
 | The public website has checked local references. | `scripts/validate_website_integrity.py`, `metrics/website_integrity.json` | Verified pass | Checks local links, anchors, JSON data, and referenced images; external URLs are not fetched |
 | The core proof artifacts are indexed and grouped for fast review. | `ARTIFACT_GUIDE.md`, `scripts/build_artifact_index.py`, `metrics/artifact_index.json` | Verified guide and index | Selective source-of-truth catalog, not a complete inventory of every output file |
 | The public reproduction path is documented. | `REPRODUCIBILITY.md`, `metrics/reproducibility_matrix.json`, `notes/reproducibility_audit.md` | Verified documentation and prior exact-match audit | Publicly reproduces the single-episode pipeline, not the gated 32-episode Qwen3-Omni pilot |

 | The real 32-episode LoRA pilot is blocked on gated data access, not on repo presentation. | Companion GitHub repo: `results/omni_finetune/DATA_BLOCKER_REPORT.md`, `results/omni_finetune/A100_HF_RELAY_STATUS.md`, `results/omni_finetune/source_discovery.json` | Blocker documented | No 32-episode metric should be claimed until the gate passes |
 | Historical `32ep` path strings are not treated as 32-episode results. | `scripts/validate_scope_claims.py`, `metrics/scope_claims_audit.json` | Verified pass | Classifies old run/path identifiers and fails if public presentation claims real 32-episode metrics |
 | Prepared GitHub/Hugging Face mirrors carry matching critical files. | `scripts/validate_mirror_parity.py`, `metrics/mirror_parity.json` | Verified pass | Compares prepared data files, visual assets, website HTML, and validator scripts before upload; live URLs are checked after publishing |
+| The public GitHub and Hugging Face bundles are publication-clean. | `scripts/validate_publication_package.py`, `metrics/publication_audit.json` | Verified pass | Checks public files, HF bundles, and public-card freshness; ignored local scratch outputs are excluded |
 | The public website has checked local references. | `scripts/validate_website_integrity.py`, `metrics/website_integrity.json` | Verified pass | Checks local links, anchors, JSON data, and referenced images; external URLs are not fetched |
 | The core proof artifacts are indexed and grouped for fast review. | `ARTIFACT_GUIDE.md`, `scripts/build_artifact_index.py`, `metrics/artifact_index.json` | Verified guide and index | Selective source-of-truth catalog, not a complete inventory of every output file |
 | The public reproduction path is documented. | `REPRODUCIBILITY.md`, `metrics/reproducibility_matrix.json`, `notes/reproducibility_audit.md` | Verified documentation and prior exact-match audit | Publicly reproduces the single-episode pipeline, not the gated 32-episode Qwen3-Omni pilot |
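
The mirror-parity row above describes comparing prepared files across mirrors before upload. A minimal sketch of such a comparison follows. The names `file_record` and `check_group` are hypothetical, and the record layout is only modelled on the `path`/`exists`/`bytes`/`sha256` fields visible in `metrics/mirror_parity.json`; this is not the code of `scripts/validate_mirror_parity.py`.

```python
import hashlib
from pathlib import Path


def file_record(path: Path) -> dict:
    """Describe one file by existence, size, and SHA-256 digest."""
    if not path.is_file():
        return {"path": str(path), "exists": False}
    data = path.read_bytes()
    return {
        "path": str(path),
        "exists": True,
        "bytes": len(data),
        "sha256": hashlib.sha256(data).hexdigest(),
    }


def check_group(local: Path, mirrors: dict[str, Path]) -> dict:
    """Compare a local file with its mirror copies and list mismatches."""
    local_rec = file_record(local)
    mirror_recs = {name: file_record(p) for name, p in mirrors.items()}
    failures = []
    if not local_rec["exists"]:
        failures.append("local: missing")
    for name, rec in mirror_recs.items():
        if not rec["exists"]:
            failures.append(f"{name}: missing")
        elif local_rec["exists"] and rec["sha256"] != local_rec["sha256"]:
            failures.append(f"{name}: sha256 mismatch")
    return {"local": local_rec, "mirrors": mirror_recs, "failures": failures}
```

An empty `failures` list for every group would correspond to an overall `"status": "pass"` under this assumed layout.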

README.md CHANGED

@@ -62,7 +62,7 @@ and metrics for the 12-task Xperience-10M episode suite, plus four lightweight
 direction-extension probes. It is meant to be read like a model audit, not
 advertised as a robot foundation model.
-![12-task suite with sample modalities](assets/task_suite_infographic.png?v=xperience10m-taskfirst-v10)
 The source Xperience-10M sample spans video, audio, depth, pose, motion
 capture, inertial sensing, and language annotation. The committed minimal and
@@ -112,7 +112,7 @@ Source-of-truth artifact index mirror: `metrics/artifact_index.json`.
 | Qwen3-Omni | companion blocker and relay reports | smoke-only until 32 valid episodes are available |
 | Scope claims guard | `metrics/scope_claims_audit.json` and `scripts/validate_scope_claims.py` | historical `32ep` path strings are provenance, not 32-episode results |
 | Mirror parity | `metrics/mirror_parity.json` and `scripts/validate_mirror_parity.py` | prepared repo/HF mirrors carry matching critical data, figures, website HTML, and validator files |
-| Publication hygiene | `metrics/publication_audit.json` and validator script mirror | public bundles contain no raw data, generated caches, heavy archives, or token strings |
 | Website integrity | `metrics/website_integrity.json` and validator script mirror | local links, anchors, JSON bundles, and referenced images only |
 | Artifact index | `metrics/artifact_index.json` and `scripts/build_artifact_index.py` | compact catalog of the reviewer-critical proof artifacts |
 | Artifact guide | `ARTIFACT_GUIDE.md` | human-readable map of proof boundary, task evidence, mirrors, and scale-up status |
@@ -148,7 +148,7 @@ transfers them to H20 for manifest building, training, and evaluation.
 | `metrics/artifact_index.json` | indexes proof artifacts with existence, size, and stable-file hashes |
 | `metrics/mirror_parity.json` | verifies prepared repo/HF mirrors have matching critical data, figures, website HTML, and validator files before upload |
 | `metrics/scope_claims_audit.json` | verifies historical `32ep` smoke-run identifiers are not presented as real 32-episode results |
-| `metrics/publication_audit.json` | records the latest public-bundle hygiene check |
 | `metrics/website_integrity.json` | records the latest local website link, anchor, JSON, and image integrity check |
 | `metrics/project_manifest.json` | mirrors the public URL and citation metadata bundle |

 direction-extension probes. It is meant to be read like a model audit, not
 advertised as a robot foundation model.
+![12-task suite with sample modalities](assets/task_suite_infographic.png?v=xperience10m-taskfirst-v11-modality-spread)
 The source Xperience-10M sample spans video, audio, depth, pose, motion
 capture, inertial sensing, and language annotation. The committed minimal and
 | Qwen3-Omni | companion blocker and relay reports | smoke-only until 32 valid episodes are available |
 | Scope claims guard | `metrics/scope_claims_audit.json` and `scripts/validate_scope_claims.py` | historical `32ep` path strings are provenance, not 32-episode results |
 | Mirror parity | `metrics/mirror_parity.json` and `scripts/validate_mirror_parity.py` | prepared repo/HF mirrors carry matching critical data, figures, website HTML, and validator files |
+| Publication hygiene | `metrics/publication_audit.json` and validator script mirror | public bundles contain no raw data, generated caches, heavy archives, token strings, or stale public-card figure references |
 | Website integrity | `metrics/website_integrity.json` and validator script mirror | local links, anchors, JSON bundles, and referenced images only |
 | Artifact index | `metrics/artifact_index.json` and `scripts/build_artifact_index.py` | compact catalog of the reviewer-critical proof artifacts |
 | Artifact guide | `ARTIFACT_GUIDE.md` | human-readable map of proof boundary, task evidence, mirrors, and scale-up status |
 | `metrics/artifact_index.json` | indexes proof artifacts with existence, size, and stable-file hashes |
 | `metrics/mirror_parity.json` | verifies prepared repo/HF mirrors have matching critical data, figures, website HTML, and validator files before upload |
 | `metrics/scope_claims_audit.json` | verifies historical `32ep` smoke-run identifiers are not presented as real 32-episode results |
+| `metrics/publication_audit.json` | records the latest public-bundle hygiene and public-card freshness check |
 | `metrics/website_integrity.json` | records the latest local website link, anchor, JSON, and image integrity check |
 | `metrics/project_manifest.json` | mirrors the public URL and citation metadata bundle |
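
The README hunk bumps the `?v=` tag on the infographic reference, and the new "stale public-card figure references" wording suggests the validator now flags outdated tags. The sketch below shows one way such a check could work. The function name `find_stale_figure_refs` and the idea of passing in an expected-tag mapping are assumptions for illustration; the actual logic added to `scripts/validate_publication_package.py` is not shown in this section.

```python
import re

# Matches markdown images such as ![alt](assets/fig.png?v=tag); the ?v= part is optional.
IMAGE_REF = re.compile(r"!\[[^\]]*\]\(([^)\s?]+)(?:\?v=([^)\s]+))?\)")


def find_stale_figure_refs(markdown: str, expected: dict[str, str]) -> list[str]:
    """Return messages for figure references whose version tag is not the expected one."""
    problems = []
    for match in IMAGE_REF.finditer(markdown):
        path, tag = match.group(1), match.group(2)
        if path in expected and tag != expected[path]:
            problems.append(f"{path}: found tag {tag!r}, expected {expected[path]!r}")
    return problems
```

Run against the old README line, this sketch would report the `xperience10m-taskfirst-v10` tag as stale once the expected tag is `xperience10m-taskfirst-v11-modality-spread`; the new line would produce no messages.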

assets/task_suite_infographic.png CHANGED

Git LFS Details (previous version)

SHA256: 7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf
Pointer size: 132 Bytes
Size of remote file: 2.33 MB

Git LFS Details (updated version)

SHA256: 0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5
Pointer size: 132 Bytes
Size of remote file: 2.32 MB
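
The SHA256 and pointer size listed here follow from the Git LFS pointer file format, which stores a `version` line, an `oid sha256:` line, and a `size` line in place of the binary. The sketch below renders such a pointer and checks a blob against it. The helper names `build_pointer` and `matches_pointer` are hypothetical, and the byte size 2322389 is taken from the updated `metrics/artifact_index.json` entry for this image.

```python
import hashlib

LFS_SPEC = "https://git-lfs.github.com/spec/v1"


def build_pointer(oid_hex: str, size: int) -> str:
    """Render a Git LFS pointer file for an object of the given digest and size."""
    return f"version {LFS_SPEC}\noid sha256:{oid_hex}\nsize {size}\n"


def matches_pointer(blob: bytes, pointer: str) -> bool:
    """Check that a blob has the size and SHA-256 digest recorded in a pointer."""
    fields = dict(line.split(" ", 1) for line in pointer.splitlines() if line)
    return (
        fields["oid"] == "sha256:" + hashlib.sha256(blob).hexdigest()
        and int(fields["size"]) == len(blob)
    )


pointer = build_pointer(
    "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5", 2322389
)
print(len(pointer.encode("utf-8")))  # 132, matching the pointer size listed here
```

Both versions of the image have seven-digit byte sizes, which is why the pointer size stays at 132 bytes across the change.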

metrics/artifact_index.json CHANGED

@@ -1,6 +1,6 @@
 {
   "title": "Ropedia Xperience-10M Task Suite Artifact Index",
-  "generated_at_utc": "2026-06-01T05:07:04+00:00",
   "status": "pass",
   "artifact_count": 29,
   "missing": [],
@@ -35,8 +35,8 @@
       "surface": "repo",
       "proves": "Defines what is verified, what is smoke-only, and what must not be inferred.",
       "exists": true,
-      "bytes": 6497,
-      "sha256": "417835c2f838f1d4c4bca9f07c708ce04611e7212017e58421956818a4ca4b45"
     },
     {
       "id": "reviewer_packet",
@@ -57,8 +57,8 @@
       "surface": "repo_hf",
       "proves": "Gives the human-readable map from proof boundary to data, tasks, platform mirrors, and scale-up status.",
       "exists": true,
-      "bytes": 6483,
-      "sha256": "cc211ec1175eed43bc5d8c9ce1ec412982524af1998316b392e77e5d7ddc99ee"
     },
     {
       "id": "reproducibility_contract",
@@ -102,7 +102,7 @@
       "volatile": true,
       "proves": "Confirms public bundles pass raw-data, cache, archive, and token-string checks.",
       "exists": true,
-      "bytes": 4105,
       "hash_policy": "existence_and_size_only"
     },
     {
@@ -126,7 +126,7 @@
       "volatile": true,
       "proves": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, website HTML, and validator files.",
       "exists": true,
-      "bytes": 41465,
       "hash_policy": "existence_and_size_only"
     },
     {
@@ -259,8 +259,8 @@
       "surface": "website_hf",
       "proves": "Presents the task suite and sample modality thumbnails with metrics generated from committed files.",
       "exists": true,
-      "bytes": 2331622,
-      "sha256": "7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf"
     },
     {
       "id": "modality_atlas",

 {
   "title": "Ropedia Xperience-10M Task Suite Artifact Index",
+  "generated_at_utc": "2026-06-01T05:58:15+00:00",
   "status": "pass",
   "artifact_count": 29,
   "missing": [],
       "surface": "repo",
       "proves": "Defines what is verified, what is smoke-only, and what must not be inferred.",
       "exists": true,
+      "bytes": 6520,
+      "sha256": "d6b8d74a53b49778d38bff6f6857f79d481d451f938c6a4177a50374f541d219"
     },
     {
       "id": "reviewer_packet",
       "surface": "repo_hf",
       "proves": "Gives the human-readable map from proof boundary to data, tasks, platform mirrors, and scale-up status.",
       "exists": true,
+      "bytes": 6520,
+      "sha256": "0a8740e19d56c9c7e1c3964d3abf838a8e33af140128a4fb95a69bdca0b45173"
     },
     {
       "id": "reproducibility_contract",
       "volatile": true,
       "proves": "Confirms public bundles pass raw-data, cache, archive, and token-string checks.",
       "exists": true,
+      "bytes": 5292,
       "hash_policy": "existence_and_size_only"
     },
     {
       "volatile": true,
       "proves": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, website HTML, and validator files.",
       "exists": true,
+      "bytes": 42567,
       "hash_policy": "existence_and_size_only"
     },
     {
       "surface": "website_hf",
       "proves": "Presents the task suite and sample modality thumbnails with metrics generated from committed files.",
       "exists": true,
+      "bytes": 2322389,
+      "sha256": "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5"
     },
     {
       "id": "modality_atlas",
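
The index entries in this hunk record both `bytes` and `sha256` for stable files, but only `bytes` plus `"hash_policy": "existence_and_size_only"` for entries marked `"volatile": true`. A minimal sketch of how an entry of that shape might be built is shown below. The function `index_entry` is hypothetical and is not the code of `scripts/build_artifact_index.py`.

```python
import hashlib
from pathlib import Path


def index_entry(artifact_id: str, path: Path, volatile: bool = False) -> dict:
    """Build one index entry; volatile files record size only, stable files add a hash."""
    entry = {"id": artifact_id, "exists": path.is_file()}
    if volatile:
        entry["volatile"] = True
    if entry["exists"]:
        data = path.read_bytes()
        entry["bytes"] = len(data)
        if volatile:
            entry["hash_policy"] = "existence_and_size_only"
        else:
            entry["sha256"] = hashlib.sha256(data).hexdigest()
    return entry
```

A timestamped audit file changes its digest on every regeneration, which is a plausible reason to record only existence and size for it.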

metrics/evidence_contract.json CHANGED

@@ -126,7 +126,7 @@
         "scripts/validate_publication_package.py",
         "docs/data/publication_audit.json"
       ],
-      "boundary": "checks public files and HF bundles, not arbitrary ignored local scratch outputs"
     },
     {
       "id": "website_integrity",

         "scripts/validate_publication_package.py",
         "docs/data/publication_audit.json"
       ],
+      "boundary": "checks public files, HF bundles, and public-card freshness; ignored local scratch outputs are excluded"
     },
     {
       "id": "website_integrity",

metrics/mirror_parity.json CHANGED

@@ -1,6 +1,6 @@
 {
   "status": "pass",
-  "generated_at_utc": "2026-06-01T05:08:43+00:00",
   "hf_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish",
   "summary": {
     "group_count": 29,
@@ -33,26 +33,26 @@
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/artifact_index.json",
         "exists": true,
         "bytes": 12916,
-        "sha256": "977e2d8d0ec9e42bee1fb7b43b9460b42a6d8a6d6e9a452389901b8d56d69372"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/artifact_index.json",
           "exists": true,
           "bytes": 12916,
-          "sha256": "977e2d8d0ec9e42bee1fb7b43b9460b42a6d8a6d6e9a452389901b8d56d69372"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/artifact_index.json",
           "exists": true,
           "bytes": 12916,
-          "sha256": "977e2d8d0ec9e42bee1fb7b43b9460b42a6d8a6d6e9a452389901b8d56d69372"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/artifact_index.json",
           "exists": true,
           "bytes": 12916,
-          "sha256": "977e2d8d0ec9e42bee1fb7b43b9460b42a6d8a6d6e9a452389901b8d56d69372"
         }
       },
       "failures": []
@@ -63,27 +63,27 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/evidence_contract.json",
         "exists": true,
-        "bytes": 7182,
-        "sha256": "42a75b0f87eec02dd5b5fedffe6eb3d0cdc8d9f12156887680686f1900ac2bfa"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/evidence_contract.json",
           "exists": true,
-          "bytes": 7182,
-          "sha256": "42a75b0f87eec02dd5b5fedffe6eb3d0cdc8d9f12156887680686f1900ac2bfa"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/evidence_contract.json",
           "exists": true,
-          "bytes": 7182,
-          "sha256": "42a75b0f87eec02dd5b5fedffe6eb3d0cdc8d9f12156887680686f1900ac2bfa"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/evidence_contract.json",
           "exists": true,
-          "bytes": 7182,
-          "sha256": "42a75b0f87eec02dd5b5fedffe6eb3d0cdc8d9f12156887680686f1900ac2bfa"
         }
       },
       "failures": []
@@ -156,27 +156,27 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/publication_audit.json",
         "exists": true,
-        "bytes": 4214,
-        "sha256": "3d1a2d861c96d445541519494abfcfca1da13cb593094a8c660ad40a036ab218"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/publication_audit.json",
           "exists": true,
-          "bytes": 4214,
-          "sha256": "3d1a2d861c96d445541519494abfcfca1da13cb593094a8c660ad40a036ab218"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/publication_audit.json",
           "exists": true,
-          "bytes": 4214,
-          "sha256": "3d1a2d861c96d445541519494abfcfca1da13cb593094a8c660ad40a036ab218"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/publication_audit.json",
           "exists": true,
-          "bytes": 4214,
-          "sha256": "3d1a2d861c96d445541519494abfcfca1da13cb593094a8c660ad40a036ab218"
         }
       },
       "failures": []
@@ -312,26 +312,26 @@
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/scope_claims_audit.json",
         "exists": true,
         "bytes": 19964,
-        "sha256": "5520aa2b2c41ed9394283e8bf08be0ec1926b2851a952ba8a8a56a1f85a058eb"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
-          "sha256": "5520aa2b2c41ed9394283e8bf08be0ec1926b2851a952ba8a8a56a1f85a058eb"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
-          "sha256": "5520aa2b2c41ed9394283e8bf08be0ec1926b2851a952ba8a8a56a1f85a058eb"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
-          "sha256": "5520aa2b2c41ed9394283e8bf08be0ec1926b2851a952ba8a8a56a1f85a058eb"
         }
       },
       "failures": []
@@ -405,26 +405,26 @@
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/website_integrity.json",
         "exists": true,
         "bytes": 5936,
-        "sha256": "b9c324a59e447a11bc6aeb5130736788981b9a1b529a80c988378e3f05f924b1"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/website_integrity.json",
           "exists": true,
           "bytes": 5936,
-          "sha256": "b9c324a59e447a11bc6aeb5130736788981b9a1b529a80c988378e3f05f924b1"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/website_integrity.json",
           "exists": true,
           "bytes": 5936,
-          "sha256": "b9c324a59e447a11bc6aeb5130736788981b9a1b529a80c988378e3f05f924b1"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/website_integrity.json",
           "exists": true,
           "bytes": 5936,
-          "sha256": "b9c324a59e447a11bc6aeb5130736788981b9a1b529a80c988378e3f05f924b1"
         }
       },
       "failures": []
@@ -435,33 +435,33 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/assets/task_suite_infographic.png",
         "exists": true,
-        "bytes": 2331622,
-        "sha256": "7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/assets/task_suite_infographic.png",
           "exists": true,
-          "bytes": 2331622,
-          "sha256": "7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf"
         },
         "hf_artifacts_docs": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/assets/task_suite_infographic.png",
           "exists": true,
-          "bytes": 2331622,
-          "sha256": "7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf"
         },
         "hf_artifacts_card": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/assets/task_suite_infographic.png",
           "exists": true,
-          "bytes": 2331622,
-          "sha256": "7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/assets/task_suite_infographic.png",
           "exists": true,
-          "bytes": 2331622,
-          "sha256": "7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf"
         }
       },
       "failures": []
@@ -855,21 +855,21 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/validate_publication_package.py",
         "exists": true,
-        "bytes": 9772,
-        "sha256": "1a915bdd68a6c63941339282a8f747e4cafa08c24e5cdb3dbe105bf6ac3ea144"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/validate_publication_package.py",
           "exists": true,
-          "bytes": 9772,
-          "sha256": "1a915bdd68a6c63941339282a8f747e4cafa08c24e5cdb3dbe105bf6ac3ea144"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/validate_publication_package.py",
           "exists": true,
-          "bytes": 9772,
-          "sha256": "1a915bdd68a6c63941339282a8f747e4cafa08c24e5cdb3dbe105bf6ac3ea144"
         }
       },
       "failures": []
@@ -930,21 +930,21 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/index.html",
         "exists": true,
-        "bytes": 89653,
-        "sha256": "f4d2b412d24bb29e977e8b82bb531fdb352cc7a1b81a2141ac63a0328bab654b"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/index.html",
           "exists": true,
-          "bytes": 89653,
-          "sha256": "f4d2b412d24bb29e977e8b82bb531fdb352cc7a1b81a2141ac63a0328bab654b"
         },
         "hf_artifacts_docs": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/index.html",
           "exists": true,
-          "bytes": 89653,
-          "sha256": "f4d2b412d24bb29e977e8b82bb531fdb352cc7a1b81a2141ac63a0328bab654b"
         }
       },
       "failures": []

 {
   "status": "pass",
+  "generated_at_utc": "2026-06-01T05:59:09+00:00",
   "hf_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish",
   "summary": {
     "group_count": 29,
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/artifact_index.json",
         "exists": true,
         "bytes": 12916,
+        "sha256": "0d12a2e36db90b4c6ae4205aab074e6c6b00083fa6c4cc18ea51342f5f1a05df"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/artifact_index.json",
           "exists": true,
           "bytes": 12916,
+          "sha256": "0d12a2e36db90b4c6ae4205aab074e6c6b00083fa6c4cc18ea51342f5f1a05df"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/artifact_index.json",
           "exists": true,
           "bytes": 12916,
+          "sha256": "0d12a2e36db90b4c6ae4205aab074e6c6b00083fa6c4cc18ea51342f5f1a05df"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/artifact_index.json",
           "exists": true,
           "bytes": 12916,
+          "sha256": "0d12a2e36db90b4c6ae4205aab074e6c6b00083fa6c4cc18ea51342f5f1a05df"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/evidence_contract.json",
         "exists": true,
+        "bytes": 7205,
+        "sha256": "78f2a0db49dc00f52f79dc4d3448d01a47bd28019458bd61000304e119a018e8"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/evidence_contract.json",
           "exists": true,
+          "bytes": 7205,
+          "sha256": "78f2a0db49dc00f52f79dc4d3448d01a47bd28019458bd61000304e119a018e8"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/evidence_contract.json",
           "exists": true,
+          "bytes": 7205,
+          "sha256": "78f2a0db49dc00f52f79dc4d3448d01a47bd28019458bd61000304e119a018e8"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/evidence_contract.json",
           "exists": true,
+          "bytes": 7205,
+          "sha256": "78f2a0db49dc00f52f79dc4d3448d01a47bd28019458bd61000304e119a018e8"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/publication_audit.json",
         "exists": true,
+        "bytes": 5292,
+        "sha256": "c273917a673c41fed0b1498ffef2d1e64ebd9691e7b870cbcd15b194785397ed"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/publication_audit.json",
           "exists": true,
+          "bytes": 5292,
+          "sha256": "c273917a673c41fed0b1498ffef2d1e64ebd9691e7b870cbcd15b194785397ed"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/publication_audit.json",
           "exists": true,
+          "bytes": 5292,
+          "sha256": "c273917a673c41fed0b1498ffef2d1e64ebd9691e7b870cbcd15b194785397ed"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/publication_audit.json",
           "exists": true,
+          "bytes": 5292,
+          "sha256": "c273917a673c41fed0b1498ffef2d1e64ebd9691e7b870cbcd15b194785397ed"
         }
       },
       "failures": []
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/scope_claims_audit.json",
         "exists": true,
         "bytes": 19964,
+        "sha256": "a74d89124ddbecd1af7a6054fcaa0d905c639dec56a9fd455140817220fd91a1"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
+          "sha256": "a74d89124ddbecd1af7a6054fcaa0d905c639dec56a9fd455140817220fd91a1"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
+          "sha256": "a74d89124ddbecd1af7a6054fcaa0d905c639dec56a9fd455140817220fd91a1"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
+          "sha256": "a74d89124ddbecd1af7a6054fcaa0d905c639dec56a9fd455140817220fd91a1"
         }
       },
       "failures": []
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/website_integrity.json",
         "exists": true,
         "bytes": 5936,
+        "sha256": "0b9a1d8d3bbf953f86aed99fe22882dbee4ed2813f2cae91cbf15a2002752cc2"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/website_integrity.json",
           "exists": true,
           "bytes": 5936,
+          "sha256": "0b9a1d8d3bbf953f86aed99fe22882dbee4ed2813f2cae91cbf15a2002752cc2"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/website_integrity.json",
           "exists": true,
           "bytes": 5936,
+          "sha256": "0b9a1d8d3bbf953f86aed99fe22882dbee4ed2813f2cae91cbf15a2002752cc2"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/website_integrity.json",
           "exists": true,
           "bytes": 5936,
+          "sha256": "0b9a1d8d3bbf953f86aed99fe22882dbee4ed2813f2cae91cbf15a2002752cc2"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/assets/task_suite_infographic.png",
         "exists": true,
+        "bytes": 2322389,
+        "sha256": "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/assets/task_suite_infographic.png",
           "exists": true,
+          "bytes": 2322389,
+          "sha256": "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5"
         },
         "hf_artifacts_docs": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/assets/task_suite_infographic.png",
           "exists": true,
+          "bytes": 2322389,
+          "sha256": "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5"
         },
         "hf_artifacts_card": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/assets/task_suite_infographic.png",
           "exists": true,
+          "bytes": 2322389,
+          "sha256": "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/assets/task_suite_infographic.png",
           "exists": true,
+          "bytes": 2322389,
+          "sha256": "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/validate_publication_package.py",
         "exists": true,
+        "bytes": 12444,
+        "sha256": "f8fc86b66a1fde0755004897dd307eb5c80f84bdaf917158b43c423ff6e7e9e7"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/validate_publication_package.py",
           "exists": true,
+          "bytes": 12444,
+          "sha256": "f8fc86b66a1fde0755004897dd307eb5c80f84bdaf917158b43c423ff6e7e9e7"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/validate_publication_package.py",
           "exists": true,
+          "bytes": 12444,
+          "sha256": "f8fc86b66a1fde0755004897dd307eb5c80f84bdaf917158b43c423ff6e7e9e7"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/index.html",
         "exists": true,
+        "bytes": 89772,
+        "sha256": "3544638ab8dc809e126f347d942b4f7303674edd79858cd039a5b18b95500fcb"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/index.html",
           "exists": true,
+          "bytes": 89772,
+          "sha256": "3544638ab8dc809e126f347d942b4f7303674edd79858cd039a5b18b95500fcb"
         },
         "hf_artifacts_docs": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/index.html",
           "exists": true,
+          "bytes": 89772,
+          "sha256": "3544638ab8dc809e126f347d942b4f7303674edd79858cd039a5b18b95500fcb"
         }
       },
       "failures": []

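The parity records above pair a local file with each mirror and compare `bytes` and `sha256`. The script that produces them is not part of this diff, so the following is only a minimal sketch of how such a record could be computed. The helper names `fingerprint` and `parity_record` are hypothetical, and `failures` is assumed to hold the names of mismatching mirrors.

```python
import hashlib
import tempfile
from pathlib import Path


def fingerprint(path: Path) -> dict:
    """Return existence, size, and SHA-256 for one file (hypothetical helper)."""
    if not path.exists():
        return {"path": str(path), "exists": False}
    data = path.read_bytes()
    return {
        "path": str(path),
        "exists": True,
        "bytes": len(data),
        "sha256": hashlib.sha256(data).hexdigest(),
    }


def parity_record(local: Path, mirrors: dict[str, Path]) -> dict:
    """Compare every mirror with the local copy and collect mismatching mirror names."""
    local_fp = fingerprint(local)
    mirror_fps = {name: fingerprint(path) for name, path in mirrors.items()}
    failures = [
        name
        for name, fp in mirror_fps.items()
        if not local_fp["exists"]
        or not fp["exists"]
        or fp["bytes"] != local_fp["bytes"]
        or fp["sha256"] != local_fp["sha256"]
    ]
    return {"local": local_fp, "mirrors": mirror_fps, "failures": failures}


# Demo with throwaway files: one mirror matches, one has the same size but different content.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "local.json").write_text('{"status": "pass"}', encoding="utf-8")
    (root / "space.json").write_text('{"status": "pass"}', encoding="utf-8")
    (root / "model.json").write_text('{"status": "fail"}', encoding="utf-8")
    record = parity_record(
        root / "local.json",
        {"hf_space": root / "space.json", "hf_model": root / "model.json"},
    )

print(record["failures"])  # prints ['hf_model']
```

The demo files have identical byte counts, which illustrates why the hash is compared in addition to the size.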
metrics/publication_audit.json CHANGED

@@ -1,6 +1,6 @@
 {
   "status": "pass",
-  "generated_at_utc": "2026-06-01T05:07:53+00:00",
   "checks": [
     {
       "name": "required_publication_assets_present",
@@ -31,6 +31,11 @@
       "name": "no_stale_task_suite_presentation_copy",
       "status": "pass",
       "count": 0
     }
   ],
   "required_assets": {
@@ -80,6 +85,48 @@
     "scripts/validate_website_integrity.py": true,
     "scripts/omni/train_qwen3_omni_lora.py": true
   },
   "scans": {
     "github_repo": {
       "root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy",
@@ -99,7 +146,7 @@
       "text_file_count": 38,
       "largest_file": {
         "path": "assets/task_suite_infographic.png",
-        "bytes": 2331622
       },
       "violations": []
     },

 {
   "status": "pass",
+  "generated_at_utc": "2026-06-01T05:58:39+00:00",
   "checks": [
     {
       "name": "required_publication_assets_present",
       "name": "no_stale_task_suite_presentation_copy",
       "status": "pass",
       "count": 0
+    },
+    {
+      "name": "public_cards_reference_taskfirst_figure",
+      "status": "pass",
+      "failures": []
     }
   ],
   "required_assets": {
     "scripts/validate_website_integrity.py": true,
     "scripts/omni/train_qwen3_omni_lora.py": true
   },
+  "public_card_freshness": [
+    {
+      "surface": "github_repo",
+      "path": "README.md",
+      "exists": true,
+      "required_marker_count": 3,
+      "missing_markers": [],
+      "status": "pass"
+    },
+    {
+      "surface": "hf_space_bundle",
+      "path": "README.md",
+      "exists": true,
+      "required_marker_count": 4,
+      "missing_markers": [],
+      "status": "pass"
+    },
+    {
+      "surface": "hf_artifact_bundle",
+      "path": "README.md",
+      "exists": true,
+      "required_marker_count": 3,
+      "missing_markers": [],
+      "status": "pass"
+    },
+    {
+      "surface": "hf_artifact_bundle",
+      "path": "PROJECT_README.md",
+      "exists": true,
+      "required_marker_count": 3,
+      "missing_markers": [],
+      "status": "pass"
+    },
+    {
+      "surface": "hf_model_bundle",
+      "path": "README.md",
+      "exists": true,
+      "required_marker_count": 4,
+      "missing_markers": [],
+      "status": "pass"
+    }
+  ],
   "scans": {
     "github_repo": {
       "root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy",
       "text_file_count": 38,
       "largest_file": {
         "path": "assets/task_suite_infographic.png",
+        "bytes": 2322389
       },
       "violations": []
     },
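A reviewer consuming this report may want to confirm that the top-level `status` agrees with the individual `checks`. The sketch below assumes only the fields visible in this diff (`status`, `checks`, `name`). The helper names and the trimmed sample report are illustrative, not part of the repository.

```python
def failing_checks(report: dict) -> list[str]:
    """Names of checks whose status is anything other than 'pass'."""
    return [
        check["name"]
        for check in report.get("checks", [])
        if check.get("status") != "pass"
    ]


def status_is_consistent(report: dict) -> bool:
    """True when the top-level status matches what the checks imply."""
    expected = "pass" if not failing_checks(report) else "fail"
    return report.get("status") == expected


# Trimmed sample modelled on the report above; not the full file.
sample_report = {
    "status": "pass",
    "checks": [
        {"name": "required_publication_assets_present", "status": "pass"},
        {"name": "no_stale_task_suite_presentation_copy", "status": "pass", "count": 0},
        {"name": "public_cards_reference_taskfirst_figure", "status": "pass", "failures": []},
    ],
}

print(failing_checks(sample_report))        # prints []
print(status_is_consistent(sample_report))  # prints True
```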

metrics/scope_claims_audit.json CHANGED

@@ -1,6 +1,6 @@
 {
   "status": "pass",
-  "generated_at_utc": "2026-06-01T05:06:13+00:00",
   "summary": {
     "qwen3_omni_32_episode_claim": false,
     "dataset_manifest_num_episodes": 1,

 {
   "status": "pass",
+  "generated_at_utc": "2026-06-01T05:46:47+00:00",
   "summary": {
     "qwen3_omni_32_episode_claim": false,
     "dataset_manifest_num_episodes": 1,

metrics/website_integrity.json CHANGED

@@ -1,6 +1,6 @@
 {
   "status": "pass",
-  "generated_at_utc": "2026-06-01T05:06:42+00:00",
   "docs_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs",
   "site_base": "/ropedia-xperience-10m-task-suite/",
   "summary": {
@@ -25,7 +25,7 @@
       "status": "pass",
       "reason": "The Suite anchor should show the full 12-task map before the modality atlas.",
       "first_marker_index": 380,
-      "second_marker_index": 669
     },
     {
       "name": "suite_modality_atlas_contains_seven_cards",
@@ -51,17 +51,17 @@
   "json_files": [
     {
       "path": "data/artifact_index.json",
-      "bytes": 12902,
       "top_level_type": "dict"
     },
     {
       "path": "data/evidence_contract.json",
-      "bytes": 7182,
       "top_level_type": "dict"
     },
     {
       "path": "data/mirror_parity.json",
-      "bytes": 41465,
       "top_level_type": "dict"
     },
     {
@@ -76,7 +76,7 @@
     },
     {
       "path": "data/publication_audit.json",
-      "bytes": 4105,
       "top_level_type": "dict"
     },
     {
@@ -252,9 +252,9 @@
     {
       "path": "assets/task_suite_infographic.png",
       "exists": true,
-      "bytes": 2331622,
       "width": 1800,
-      "height": 5700,
       "format": "PNG"
     }
   ]

 {
   "status": "pass",
+  "generated_at_utc": "2026-06-01T05:57:48+00:00",
   "docs_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs",
   "site_base": "/ropedia-xperience-10m-task-suite/",
   "summary": {
       "status": "pass",
       "reason": "The Suite anchor should show the full 12-task map before the modality atlas.",
       "first_marker_index": 380,
+      "second_marker_index": 696
     },
     {
       "name": "suite_modality_atlas_contains_seven_cards",
   "json_files": [
     {
       "path": "data/artifact_index.json",
+      "bytes": 12916,
       "top_level_type": "dict"
     },
     {
       "path": "data/evidence_contract.json",
+      "bytes": 7205,
       "top_level_type": "dict"
     },
     {
       "path": "data/mirror_parity.json",
+      "bytes": 42567,
       "top_level_type": "dict"
     },
     {
     },
     {
       "path": "data/publication_audit.json",
+      "bytes": 5292,
       "top_level_type": "dict"
     },
     {
     {
       "path": "assets/task_suite_infographic.png",
       "exists": true,
+      "bytes": 2322389,
       "width": 1800,
+      "height": 5850,
       "format": "PNG"
     }
   ]
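The image record above stores `width`, `height`, and `format` for the infographic. How `validate_website_integrity.py` obtains them is not shown in this diff, and it may well use an imaging library. As one dependency-free possibility, the dimensions can be read from the PNG `IHDR` chunk, which directly follows the 8-byte signature. The function name `png_dimensions` is hypothetical.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_dimensions(data: bytes) -> tuple[int, int]:
    """Read (width, height) from the first 24 bytes of a PNG file."""
    # Layout: 8-byte signature, 4-byte chunk length, 4-byte chunk type "IHDR",
    # then width and height as big-endian unsigned 32-bit integers.
    if len(data) < 24 or data[:8] != PNG_SIGNATURE or data[12:16] != b"IHDR":
        raise ValueError("not a PNG header")
    width, height = struct.unpack(">II", data[16:24])
    return width, height


# Synthetic header carrying the dimensions recorded in the report above.
header = PNG_SIGNATURE + struct.pack(">I", 13) + b"IHDR" + struct.pack(">II", 1800, 5850)
dims = png_dimensions(header)
print(dims)  # prints (1800, 5850)
```

For a real file, passing the first 24 bytes read in binary mode is sufficient.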

scripts/validate_publication_package.py CHANGED

@@ -44,8 +44,58 @@ TEXT_SUFFIXES = {
 TOKEN_PATTERN = re.compile(r"hf_[A-Za-z0-9]{20,}")
 STALE_PRESENTATION_STRINGS = {
     "xperience10m-" + "modalities-v9-large-atlas": "old task-suite infographic cache key",
     "Start with the large native " + "modality atlas": "old suite-section hierarchy copy",
 }
 def rel(path: Path, base: Path) -> str:
@@ -187,6 +237,24 @@ def required_assets(root: Path) -> dict[str, bool]:
     return {item: (root / item).exists() for item in required}
 def build_report(hf_root: Path) -> dict:
     roots = {
         "github_repo": ROOT,
@@ -199,6 +267,7 @@ def build_report(hf_root: Path) -> dict:
         public_paths = git_public_paths(path) if name == "github_repo" else None
         scans[name] = scan(path, paths=public_paths)
     assets = required_assets(ROOT)
     missing_assets = [path for path, present in assets.items() if not present]
     violations = [
         {"root": name, **violation}
@@ -238,6 +307,11 @@ def build_report(hf_root: Path) -> dict:
             "status": "pass" if not any(v["kind"] == "stale_presentation_copy" for v in violations) else "fail",
             "count": sum(1 for v in violations if v["kind"] == "stale_presentation_copy"),
         },
     ]
     status = "pass" if all(check["status"] == "pass" for check in checks) else "fail"
     return {
@@ -245,6 +319,7 @@ def build_report(hf_root: Path) -> dict:
         "generated_at_utc": datetime.now(timezone.utc).isoformat(timespec="seconds"),
         "checks": checks,
         "required_assets": assets,
         "scans": scans,
         "violations": violations,
     }

 TOKEN_PATTERN = re.compile(r"hf_[A-Za-z0-9]{20,}")
 STALE_PRESENTATION_STRINGS = {
     "xperience10m-" + "modalities-v9-large-atlas": "old task-suite infographic cache key",
+    "xperience10m-" + "taskfirst-v10": "older task-suite infographic cache key",
     "Start with the large native " + "modality atlas": "old suite-section hierarchy copy",
 }
+CARD_FRESHNESS_EXPECTATIONS = [
+    {
+        "surface": "github_repo",
+        "relative_path": "README.md",
+        "required": [
+            "xperience10m-taskfirst-v11-modality-spread",
+            "all 12 task families before the",
+            "Public-sample modality thumbnails remain enlarged below",
+        ],
+    },
+    {
+        "surface": "hf_space_bundle",
+        "relative_path": "README.md",
+        "required": [
+            "xperience10m-taskfirst-v11-modality-spread",
+            "task-first 12-task infographic",
+            "native responsive modality atlas",
+            "website HTML",
+        ],
+    },
+    {
+        "surface": "hf_artifact_bundle",
+        "relative_path": "README.md",
+        "required": [
+            "xperience10m-taskfirst-v11-modality-spread",
+            "task-first 12-task map",
+            "including critical website HTML",
+        ],
+    },
+    {
+        "surface": "hf_artifact_bundle",
+        "relative_path": "PROJECT_README.md",
+        "required": [
+            "xperience10m-taskfirst-v11-modality-spread",
+            "all 12 task families before the",
+            "Public-sample modality thumbnails remain enlarged below",
+        ],
+    },
+    {
+        "surface": "hf_model_bundle",
+        "relative_path": "README.md",
+        "required": [
+            "xperience10m-taskfirst-v11-modality-spread",
+            "task-first 12-head",
+            "responsive modality atlas",
+            "website HTML",
+        ],
+    },
+]
 def rel(path: Path, base: Path) -> str:
     return {item: (root / item).exists() for item in required}
+def public_card_freshness(roots: dict[str, Path]) -> list[dict]:
+    records = []
+    for item in CARD_FRESHNESS_EXPECTATIONS:
+        surface = item["surface"]
+        path = roots[surface] / item["relative_path"]
+        text = path.read_text(encoding="utf-8", errors="ignore") if path.exists() else ""
+        missing = [marker for marker in item["required"] if marker not in text]
+        records.append({
+            "surface": surface,
+            "path": item["relative_path"],
+            "exists": path.exists(),
+            "required_marker_count": len(item["required"]),
+            "missing_markers": missing,
+            "status": "pass" if path.exists() and not missing else "fail",
+        })
+    return records
 def build_report(hf_root: Path) -> dict:
     roots = {
         "github_repo": ROOT,
         public_paths = git_public_paths(path) if name == "github_repo" else None
         scans[name] = scan(path, paths=public_paths)
     assets = required_assets(ROOT)
+    card_freshness = public_card_freshness(roots)
     missing_assets = [path for path, present in assets.items() if not present]
     violations = [
         {"root": name, **violation}
             "status": "pass" if not any(v["kind"] == "stale_presentation_copy" for v in violations) else "fail",
             "count": sum(1 for v in violations if v["kind"] == "stale_presentation_copy"),
         },
+        {
+            "name": "public_cards_reference_taskfirst_figure",
+            "status": "pass" if all(item["status"] == "pass" for item in card_freshness) else "fail",
+            "failures": [item for item in card_freshness if item["status"] != "pass"],
+        },
     ]
     status = "pass" if all(check["status"] == "pass" for check in checks) else "fail"
     return {
         "generated_at_utc": datetime.now(timezone.utc).isoformat(timespec="seconds"),
         "checks": checks,
         "required_assets": assets,
+        "public_card_freshness": card_freshness,
         "scans": scans,
         "violations": violations,
     }
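The new `public_card_freshness` check is a plain, case-sensitive substring test: a card passes only if the file exists and contains every required marker. The sketch below is a self-contained restatement of that logic against a temporary directory. The function name `card_freshness`, the reduced expectation list, and the demo card text are illustrative assumptions; the two marker strings are taken from `CARD_FRESHNESS_EXPECTATIONS`.

```python
import tempfile
from pathlib import Path


def card_freshness(roots: dict[str, Path], expectations: list[dict]) -> list[dict]:
    """Report which required markers are missing from each public card."""
    records = []
    for item in expectations:
        path = roots[item["surface"]] / item["relative_path"]
        exists = path.exists()
        text = path.read_text(encoding="utf-8", errors="ignore") if exists else ""
        missing = [marker for marker in item["required"] if marker not in text]
        records.append({
            "surface": item["surface"],
            "path": item["relative_path"],
            "exists": exists,
            "required_marker_count": len(item["required"]),
            "missing_markers": missing,
            "status": "pass" if exists and not missing else "fail",
        })
    return records


# Reduced expectation list for the demo; the real list covers five cards.
expectations = [{
    "surface": "hf_space_bundle",
    "relative_path": "README.md",
    "required": ["xperience10m-taskfirst-v11-modality-spread", "website HTML"],
}]

with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    card = root / "README.md"
    # First version of the card carries only the cache key.
    card.write_text("cache key: xperience10m-taskfirst-v11-modality-spread\n", encoding="utf-8")
    stale = card_freshness({"hf_space_bundle": root}, expectations)
    # Adding the second marker makes the card pass.
    card.write_text(card.read_text(encoding="utf-8") + "Mirrors include website HTML.\n", encoding="utf-8")
    fresh = card_freshness({"hf_space_bundle": root}, expectations)

print(stale[0]["status"], stale[0]["missing_markers"])  # prints fail ['website HTML']
print(fresh[0]["status"], fresh[0]["missing_markers"])  # prints pass []
```

Because a missing file yields empty text, every marker is reported as missing in that case, which keeps the failure record informative without a separate code path.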