Commit ae4f6df by cy0307 (verified) · 1 parent: 03b872c

Publish Ropedia Xperience-10M task baseline cards

ARTIFACT_GUIDE.md CHANGED
@@ -25,7 +25,7 @@ The project intentionally separates four layers:
25
  | [`REPRODUCIBILITY.md`](REPRODUCIBILITY.md) | Defines public reproduction commands, expected outputs, and unreproducible boundaries. |
26
  | [`metrics/artifact_index.json`](metrics/artifact_index.json) | Lists reviewer-critical files with existence, size, and stable hashes. |
27
  | [`metrics/mirror_parity.json`](metrics/mirror_parity.json) | Confirms prepared HF Space, artifact, and model mirrors match the repo for critical data, figures, website HTML, and validator scripts. |
28
- | [`metrics/publication_audit.json`](metrics/publication_audit.json) | Confirms public bundles exclude raw data, Python caches, heavy archives, and token strings. |
29
  | [`metrics/scope_claims_audit.json`](metrics/scope_claims_audit.json) | Confirms historical `32ep` smoke-run identifiers are not presented as real 32-episode results. |
30
  | [`metrics/website_integrity.json`](metrics/website_integrity.json) | Confirms local site links, anchors, JSON bundles, and referenced images resolve. |
31
  | [`metrics/reviewer_packet.json`](metrics/reviewer_packet.json) | Gives the shortest machine-readable reviewer route. |
 
25
  | [`REPRODUCIBILITY.md`](REPRODUCIBILITY.md) | Defines public reproduction commands, expected outputs, and unreproducible boundaries. |
26
  | [`metrics/artifact_index.json`](metrics/artifact_index.json) | Lists reviewer-critical files with existence, size, and stable hashes. |
27
  | [`metrics/mirror_parity.json`](metrics/mirror_parity.json) | Confirms prepared HF Space, artifact, and model mirrors match the repo for critical data, figures, website HTML, and validator scripts. |
28
+ | [`metrics/publication_audit.json`](metrics/publication_audit.json) | Confirms public bundles exclude raw data, Python caches, heavy archives, token strings, and stale public-card figure references. |
29
  | [`metrics/scope_claims_audit.json`](metrics/scope_claims_audit.json) | Confirms historical `32ep` smoke-run identifiers are not presented as real 32-episode results. |
30
  | [`metrics/website_integrity.json`](metrics/website_integrity.json) | Confirms local site links, anchors, JSON bundles, and referenced images resolve. |
31
  | [`metrics/reviewer_packet.json`](metrics/reviewer_packet.json) | Gives the shortest machine-readable reviewer route. |
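
The `metrics/artifact_index.json` row above says the index records existence, size, and stable hashes. The following is a minimal sketch of how one such record could be built, not the project's actual `scripts/build_artifact_index.py`. The field names (`exists`, `bytes`, `sha256`, `volatile`, `hash_policy`) and the `existence_and_size_only` value are copied from the index diff in this commit; the function name `index_entry` and its signature are hypothetical.

```python
import hashlib
from pathlib import Path


def index_entry(path: Path, volatile: bool = False) -> dict:
    """Build one index record for a file (hypothetical helper).

    Stable files get a content hash. Volatile files record only
    existence and size, mirroring the "existence_and_size_only" policy.
    """
    entry = {"exists": path.is_file()}
    if volatile:
        entry["volatile"] = True
    if not entry["exists"]:
        return entry
    entry["bytes"] = path.stat().st_size
    if volatile:
        entry["hash_policy"] = "existence_and_size_only"
    else:
        entry["sha256"] = hashlib.sha256(path.read_bytes()).hexdigest()
    return entry
```

Skipping the hash for volatile files is presumably what keeps the index stable when an audit JSON is regenerated with a new `generated_at_utc` timestamp; that reading is an inference from the diff, not something the commit states.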
EVIDENCE_CONTRACT.md CHANGED
@@ -16,7 +16,7 @@ local artifact that a reader can inspect before trusting the dashboard.
16
  | The real 32-episode LoRA pilot is blocked on gated data access, not on repo presentation. | Companion GitHub repo: `results/omni_finetune/DATA_BLOCKER_REPORT.md`, `results/omni_finetune/A100_HF_RELAY_STATUS.md`, `results/omni_finetune/source_discovery.json` | Blocker documented | No 32-episode metric should be claimed until the gate passes |
17
  | Historical `32ep` path strings are not treated as 32-episode results. | `scripts/validate_scope_claims.py`, `metrics/scope_claims_audit.json` | Verified pass | Classifies old run/path identifiers and fails if public presentation claims real 32-episode metrics |
18
  | Prepared GitHub/Hugging Face mirrors carry matching critical files. | `scripts/validate_mirror_parity.py`, `metrics/mirror_parity.json` | Verified pass | Compares prepared data files, visual assets, website HTML, and validator scripts before upload; live URLs are checked after publishing |
19
- | The public GitHub and Hugging Face bundles are publication-clean. | `scripts/validate_publication_package.py`, `metrics/publication_audit.json` | Verified pass | Checks public files and HF bundles, not arbitrary ignored local scratch outputs |
20
  | The public website has checked local references. | `scripts/validate_website_integrity.py`, `metrics/website_integrity.json` | Verified pass | Checks local links, anchors, JSON data, and referenced images; external URLs are not fetched |
21
  | The core proof artifacts are indexed and grouped for fast review. | `ARTIFACT_GUIDE.md`, `scripts/build_artifact_index.py`, `metrics/artifact_index.json` | Verified guide and index | Selective source-of-truth catalog, not a complete inventory of every output file |
22
  | The public reproduction path is documented. | `REPRODUCIBILITY.md`, `metrics/reproducibility_matrix.json`, `notes/reproducibility_audit.md` | Verified documentation and prior exact-match audit | Publicly reproduces the single-episode pipeline, not the gated 32-episode Qwen3-Omni pilot |
 
16
  | The real 32-episode LoRA pilot is blocked on gated data access, not on repo presentation. | Companion GitHub repo: `results/omni_finetune/DATA_BLOCKER_REPORT.md`, `results/omni_finetune/A100_HF_RELAY_STATUS.md`, `results/omni_finetune/source_discovery.json` | Blocker documented | No 32-episode metric should be claimed until the gate passes |
17
  | Historical `32ep` path strings are not treated as 32-episode results. | `scripts/validate_scope_claims.py`, `metrics/scope_claims_audit.json` | Verified pass | Classifies old run/path identifiers and fails if public presentation claims real 32-episode metrics |
18
  | Prepared GitHub/Hugging Face mirrors carry matching critical files. | `scripts/validate_mirror_parity.py`, `metrics/mirror_parity.json` | Verified pass | Compares prepared data files, visual assets, website HTML, and validator scripts before upload; live URLs are checked after publishing |
19
+ | The public GitHub and Hugging Face bundles are publication-clean. | `scripts/validate_publication_package.py`, `metrics/publication_audit.json` | Verified pass | Checks public files, HF bundles, and public-card freshness; ignored local scratch outputs are excluded |
20
  | The public website has checked local references. | `scripts/validate_website_integrity.py`, `metrics/website_integrity.json` | Verified pass | Checks local links, anchors, JSON data, and referenced images; external URLs are not fetched |
21
  | The core proof artifacts are indexed and grouped for fast review. | `ARTIFACT_GUIDE.md`, `scripts/build_artifact_index.py`, `metrics/artifact_index.json` | Verified guide and index | Selective source-of-truth catalog, not a complete inventory of every output file |
22
  | The public reproduction path is documented. | `REPRODUCIBILITY.md`, `metrics/reproducibility_matrix.json`, `notes/reproducibility_audit.md` | Verified documentation and prior exact-match audit | Publicly reproduces the single-episode pipeline, not the gated 32-episode Qwen3-Omni pilot |
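
The mirror-parity row above describes comparing prepared files across mirrors before upload. A minimal sketch of that comparison follows, assuming parity means identical SHA-256 digests. The record keys (`local`, `mirrors`, `failures`, `path`, `exists`, `bytes`, `sha256`) follow `metrics/mirror_parity.json` as shown in this commit; the helper names `describe` and `parity_group` are hypothetical and do not come from `scripts/validate_mirror_parity.py`.

```python
import hashlib
from pathlib import Path


def describe(path: Path) -> dict:
    """Record path, existence, size, and SHA-256 for one file."""
    record = {"path": str(path), "exists": path.is_file()}
    if record["exists"]:
        data = path.read_bytes()
        record["bytes"] = len(data)
        record["sha256"] = hashlib.sha256(data).hexdigest()
    return record


def parity_group(local: Path, mirrors: dict[str, Path]) -> dict:
    """Compare one local file against its named mirror copies."""
    local_record = describe(local)
    mirror_records = {name: describe(path) for name, path in mirrors.items()}
    # A mirror fails if it is missing or its digest differs from the local one.
    failures = [
        name
        for name, record in mirror_records.items()
        if not record["exists"]
        or record.get("sha256") != local_record.get("sha256")
    ]
    return {"local": local_record, "mirrors": mirror_records, "failures": failures}
```

An empty `failures` list for every group would correspond to the `"status": "pass"` summary; how the real validator aggregates groups is not shown in this diff.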
README.md CHANGED
@@ -62,7 +62,7 @@ and metrics for the 12-task Xperience-10M episode suite, plus four lightweight
62
  direction-extension probes. It is meant to be read like a model audit, not
63
  advertised as a robot foundation model.
64
 
65
- ![12-task suite with sample modalities](assets/task_suite_infographic.png?v=xperience10m-taskfirst-v10)
66
 
67
  The source Xperience-10M sample spans video, audio, depth, pose, motion
68
  capture, inertial sensing, and language annotation. The committed minimal and
@@ -112,7 +112,7 @@ Source-of-truth artifact index mirror: `metrics/artifact_index.json`.
112
  | Qwen3-Omni | companion blocker and relay reports | smoke-only until 32 valid episodes are available |
113
  | Scope claims guard | `metrics/scope_claims_audit.json` and `scripts/validate_scope_claims.py` | historical `32ep` path strings are provenance, not 32-episode results |
114
  | Mirror parity | `metrics/mirror_parity.json` and `scripts/validate_mirror_parity.py` | prepared repo/HF mirrors carry matching critical data, figures, website HTML, and validator files |
115
- | Publication hygiene | `metrics/publication_audit.json` and validator script mirror | public bundles contain no raw data, generated caches, heavy archives, or token strings |
116
  | Website integrity | `metrics/website_integrity.json` and validator script mirror | local links, anchors, JSON bundles, and referenced images only |
117
  | Artifact index | `metrics/artifact_index.json` and `scripts/build_artifact_index.py` | compact catalog of the reviewer-critical proof artifacts |
118
  | Artifact guide | `ARTIFACT_GUIDE.md` | human-readable map of proof boundary, task evidence, mirrors, and scale-up status |
@@ -148,7 +148,7 @@ transfers them to H20 for manifest building, training, and evaluation.
148
  | `metrics/artifact_index.json` | indexes proof artifacts with existence, size, and stable-file hashes |
149
  | `metrics/mirror_parity.json` | verifies prepared repo/HF mirrors have matching critical data, figures, website HTML, and validator files before upload |
150
  | `metrics/scope_claims_audit.json` | verifies historical `32ep` smoke-run identifiers are not presented as real 32-episode results |
151
- | `metrics/publication_audit.json` | records the latest public-bundle hygiene check |
152
  | `metrics/website_integrity.json` | records the latest local website link, anchor, JSON, and image integrity check |
153
  | `metrics/project_manifest.json` | mirrors the public URL and citation metadata bundle |
154
 
 
62
  direction-extension probes. It is meant to be read like a model audit, not
63
  advertised as a robot foundation model.
64
 
65
+ ![12-task suite with sample modalities](assets/task_suite_infographic.png?v=xperience10m-taskfirst-v11-modality-spread)
66
 
67
  The source Xperience-10M sample spans video, audio, depth, pose, motion
68
  capture, inertial sensing, and language annotation. The committed minimal and
 
112
  | Qwen3-Omni | companion blocker and relay reports | smoke-only until 32 valid episodes are available |
113
  | Scope claims guard | `metrics/scope_claims_audit.json` and `scripts/validate_scope_claims.py` | historical `32ep` path strings are provenance, not 32-episode results |
114
  | Mirror parity | `metrics/mirror_parity.json` and `scripts/validate_mirror_parity.py` | prepared repo/HF mirrors carry matching critical data, figures, website HTML, and validator files |
115
+ | Publication hygiene | `metrics/publication_audit.json` and validator script mirror | public bundles contain no raw data, generated caches, heavy archives, token strings, or stale public-card figure references |
116
  | Website integrity | `metrics/website_integrity.json` and validator script mirror | local links, anchors, JSON bundles, and referenced images only |
117
  | Artifact index | `metrics/artifact_index.json` and `scripts/build_artifact_index.py` | compact catalog of the reviewer-critical proof artifacts |
118
  | Artifact guide | `ARTIFACT_GUIDE.md` | human-readable map of proof boundary, task evidence, mirrors, and scale-up status |
 
148
  | `metrics/artifact_index.json` | indexes proof artifacts with existence, size, and stable-file hashes |
149
  | `metrics/mirror_parity.json` | verifies prepared repo/HF mirrors have matching critical data, figures, website HTML, and validator files before upload |
150
  | `metrics/scope_claims_audit.json` | verifies historical `32ep` smoke-run identifiers are not presented as real 32-episode results |
151
+ | `metrics/publication_audit.json` | records the latest public-bundle hygiene and public-card freshness check |
152
  | `metrics/website_integrity.json` | records the latest local website link, anchor, JSON, and image integrity check |
153
  | `metrics/project_manifest.json` | mirrors the public URL and citation metadata bundle |
154
 
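
The README hunks above bump the infographic's `?v=` cache token and extend the publication audit to cover stale public-card figure references. One way such a freshness check could work is sketched below, assuming "stale" means a Markdown image link whose `?v=` token differs from an expected value. The regular expression, the function name `stale_figure_refs`, and the `expected_tokens` mapping are all illustrative assumptions; the real logic in `scripts/validate_publication_package.py` is not part of this diff.

```python
import re

# Matches Markdown image links such as ![alt](path/to/file.png?v=token).
# Group 1 is the path, group 2 the optional ?v= token.
IMAGE_REF = re.compile(r"!\[[^\]]*\]\(([^)\s?]+)(?:\?v=([^)\s]+))?\)")


def stale_figure_refs(markdown: str, expected_tokens: dict[str, str]) -> list[str]:
    """Return image paths whose ?v= token differs from the expected one."""
    stale = []
    for path, token in IMAGE_REF.findall(markdown):
        expected = expected_tokens.get(path)
        # Paths without an expected token are not checked by this sketch.
        if expected is not None and token != expected:
            stale.append(path)
    return stale
```

With `{"assets/task_suite_infographic.png": "xperience10m-taskfirst-v11-modality-spread"}` as the expected mapping, the removed README line would be reported as stale and the added line would not.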
assets/task_suite_infographic.png CHANGED

Git LFS Details (previous version)

  • SHA256: 7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf
  • Pointer size: 132 Bytes
  • Size of remote file: 2.33 MB

Git LFS Details (this commit)

  • SHA256: 0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5
  • Pointer size: 132 Bytes
  • Size of remote file: 2.32 MB
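
A reader who downloads the infographic can check it against the digest and size recorded for it. The sketch below uses only the Python standard library; the function names `sha256_of` and `matches_recorded_details` are hypothetical, and the local file path in the usage note is an assumption about where the file was saved.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_recorded_details(path: Path, expected_sha256: str, expected_bytes: int) -> bool:
    """True when both the file size and the digest match the recorded values."""
    return path.stat().st_size == expected_bytes and sha256_of(path) == expected_sha256
```

For the file in this commit, the call would be `matches_recorded_details(Path("assets/task_suite_infographic.png"), "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5", 2322389)`, using the byte count recorded in `metrics/artifact_index.json`.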
metrics/artifact_index.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "title": "Ropedia Xperience-10M Task Suite Artifact Index",
3
- "generated_at_utc": "2026-06-01T05:07:04+00:00",
4
  "status": "pass",
5
  "artifact_count": 29,
6
  "missing": [],
@@ -35,8 +35,8 @@
35
  "surface": "repo",
36
  "proves": "Defines what is verified, what is smoke-only, and what must not be inferred.",
37
  "exists": true,
38
- "bytes": 6497,
39
- "sha256": "417835c2f838f1d4c4bca9f07c708ce04611e7212017e58421956818a4ca4b45"
40
  },
41
  {
42
  "id": "reviewer_packet",
@@ -57,8 +57,8 @@
57
  "surface": "repo_hf",
58
  "proves": "Gives the human-readable map from proof boundary to data, tasks, platform mirrors, and scale-up status.",
59
  "exists": true,
60
- "bytes": 6483,
61
- "sha256": "cc211ec1175eed43bc5d8c9ce1ec412982524af1998316b392e77e5d7ddc99ee"
62
  },
63
  {
64
  "id": "reproducibility_contract",
@@ -102,7 +102,7 @@
102
  "volatile": true,
103
  "proves": "Confirms public bundles pass raw-data, cache, archive, and token-string checks.",
104
  "exists": true,
105
- "bytes": 4105,
106
  "hash_policy": "existence_and_size_only"
107
  },
108
  {
@@ -126,7 +126,7 @@
126
  "volatile": true,
127
  "proves": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, website HTML, and validator files.",
128
  "exists": true,
129
- "bytes": 41465,
130
  "hash_policy": "existence_and_size_only"
131
  },
132
  {
@@ -259,8 +259,8 @@
259
  "surface": "website_hf",
260
  "proves": "Presents the task suite and sample modality thumbnails with metrics generated from committed files.",
261
  "exists": true,
262
- "bytes": 2331622,
263
- "sha256": "7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf"
264
  },
265
  {
266
  "id": "modality_atlas",
 
1
  {
2
  "title": "Ropedia Xperience-10M Task Suite Artifact Index",
3
+ "generated_at_utc": "2026-06-01T05:58:15+00:00",
4
  "status": "pass",
5
  "artifact_count": 29,
6
  "missing": [],
 
35
  "surface": "repo",
36
  "proves": "Defines what is verified, what is smoke-only, and what must not be inferred.",
37
  "exists": true,
38
+ "bytes": 6520,
39
+ "sha256": "d6b8d74a53b49778d38bff6f6857f79d481d451f938c6a4177a50374f541d219"
40
  },
41
  {
42
  "id": "reviewer_packet",
 
57
  "surface": "repo_hf",
58
  "proves": "Gives the human-readable map from proof boundary to data, tasks, platform mirrors, and scale-up status.",
59
  "exists": true,
60
+ "bytes": 6520,
61
+ "sha256": "0a8740e19d56c9c7e1c3964d3abf838a8e33af140128a4fb95a69bdca0b45173"
62
  },
63
  {
64
  "id": "reproducibility_contract",
 
102
  "volatile": true,
103
  "proves": "Confirms public bundles pass raw-data, cache, archive, and token-string checks.",
104
  "exists": true,
105
+ "bytes": 5292,
106
  "hash_policy": "existence_and_size_only"
107
  },
108
  {
 
126
  "volatile": true,
127
  "proves": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, website HTML, and validator files.",
128
  "exists": true,
129
+ "bytes": 42567,
130
  "hash_policy": "existence_and_size_only"
131
  },
132
  {
 
259
  "surface": "website_hf",
260
  "proves": "Presents the task suite and sample modality thumbnails with metrics generated from committed files.",
261
  "exists": true,
262
+ "bytes": 2322389,
263
+ "sha256": "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5"
264
  },
265
  {
266
  "id": "modality_atlas",
metrics/evidence_contract.json CHANGED
@@ -126,7 +126,7 @@
126
  "scripts/validate_publication_package.py",
127
  "docs/data/publication_audit.json"
128
  ],
129
- "boundary": "checks public files and HF bundles, not arbitrary ignored local scratch outputs"
130
  },
131
  {
132
  "id": "website_integrity",
 
126
  "scripts/validate_publication_package.py",
127
  "docs/data/publication_audit.json"
128
  ],
129
+ "boundary": "checks public files, HF bundles, and public-card freshness; ignored local scratch outputs are excluded"
130
  },
131
  {
132
  "id": "website_integrity",
metrics/mirror_parity.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-01T05:08:43+00:00",
4
  "hf_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish",
5
  "summary": {
6
  "group_count": 29,
@@ -33,26 +33,26 @@
33
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/artifact_index.json",
34
  "exists": true,
35
  "bytes": 12916,
36
- "sha256": "977e2d8d0ec9e42bee1fb7b43b9460b42a6d8a6d6e9a452389901b8d56d69372"
37
  },
38
  "mirrors": {
39
  "hf_space": {
40
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/artifact_index.json",
41
  "exists": true,
42
  "bytes": 12916,
43
- "sha256": "977e2d8d0ec9e42bee1fb7b43b9460b42a6d8a6d6e9a452389901b8d56d69372"
44
  },
45
  "hf_artifacts": {
46
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/artifact_index.json",
47
  "exists": true,
48
  "bytes": 12916,
49
- "sha256": "977e2d8d0ec9e42bee1fb7b43b9460b42a6d8a6d6e9a452389901b8d56d69372"
50
  },
51
  "hf_model": {
52
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/artifact_index.json",
53
  "exists": true,
54
  "bytes": 12916,
55
- "sha256": "977e2d8d0ec9e42bee1fb7b43b9460b42a6d8a6d6e9a452389901b8d56d69372"
56
  }
57
  },
58
  "failures": []
@@ -63,27 +63,27 @@
63
  "local": {
64
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/evidence_contract.json",
65
  "exists": true,
66
- "bytes": 7182,
67
- "sha256": "42a75b0f87eec02dd5b5fedffe6eb3d0cdc8d9f12156887680686f1900ac2bfa"
68
  },
69
  "mirrors": {
70
  "hf_space": {
71
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/evidence_contract.json",
72
  "exists": true,
73
- "bytes": 7182,
74
- "sha256": "42a75b0f87eec02dd5b5fedffe6eb3d0cdc8d9f12156887680686f1900ac2bfa"
75
  },
76
  "hf_artifacts": {
77
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/evidence_contract.json",
78
  "exists": true,
79
- "bytes": 7182,
80
- "sha256": "42a75b0f87eec02dd5b5fedffe6eb3d0cdc8d9f12156887680686f1900ac2bfa"
81
  },
82
  "hf_model": {
83
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/evidence_contract.json",
84
  "exists": true,
85
- "bytes": 7182,
86
- "sha256": "42a75b0f87eec02dd5b5fedffe6eb3d0cdc8d9f12156887680686f1900ac2bfa"
87
  }
88
  },
89
  "failures": []
@@ -156,27 +156,27 @@
156
  "local": {
157
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/publication_audit.json",
158
  "exists": true,
159
- "bytes": 4214,
160
- "sha256": "3d1a2d861c96d445541519494abfcfca1da13cb593094a8c660ad40a036ab218"
161
  },
162
  "mirrors": {
163
  "hf_space": {
164
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/publication_audit.json",
165
  "exists": true,
166
- "bytes": 4214,
167
- "sha256": "3d1a2d861c96d445541519494abfcfca1da13cb593094a8c660ad40a036ab218"
168
  },
169
  "hf_artifacts": {
170
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/publication_audit.json",
171
  "exists": true,
172
- "bytes": 4214,
173
- "sha256": "3d1a2d861c96d445541519494abfcfca1da13cb593094a8c660ad40a036ab218"
174
  },
175
  "hf_model": {
176
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/publication_audit.json",
177
  "exists": true,
178
- "bytes": 4214,
179
- "sha256": "3d1a2d861c96d445541519494abfcfca1da13cb593094a8c660ad40a036ab218"
180
  }
181
  },
182
  "failures": []
@@ -312,26 +312,26 @@
312
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/scope_claims_audit.json",
313
  "exists": true,
314
  "bytes": 19964,
315
- "sha256": "5520aa2b2c41ed9394283e8bf08be0ec1926b2851a952ba8a8a56a1f85a058eb"
316
  },
317
  "mirrors": {
318
  "hf_space": {
319
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/scope_claims_audit.json",
320
  "exists": true,
321
  "bytes": 19964,
322
- "sha256": "5520aa2b2c41ed9394283e8bf08be0ec1926b2851a952ba8a8a56a1f85a058eb"
323
  },
324
  "hf_artifacts": {
325
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/scope_claims_audit.json",
326
  "exists": true,
327
  "bytes": 19964,
328
- "sha256": "5520aa2b2c41ed9394283e8bf08be0ec1926b2851a952ba8a8a56a1f85a058eb"
329
  },
330
  "hf_model": {
331
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/scope_claims_audit.json",
332
  "exists": true,
333
  "bytes": 19964,
334
- "sha256": "5520aa2b2c41ed9394283e8bf08be0ec1926b2851a952ba8a8a56a1f85a058eb"
335
  }
336
  },
337
  "failures": []
@@ -405,26 +405,26 @@
405
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/website_integrity.json",
406
  "exists": true,
407
  "bytes": 5936,
408
- "sha256": "b9c324a59e447a11bc6aeb5130736788981b9a1b529a80c988378e3f05f924b1"
409
  },
410
  "mirrors": {
411
  "hf_space": {
412
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/website_integrity.json",
413
  "exists": true,
414
  "bytes": 5936,
415
- "sha256": "b9c324a59e447a11bc6aeb5130736788981b9a1b529a80c988378e3f05f924b1"
416
  },
417
  "hf_artifacts": {
418
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/website_integrity.json",
419
  "exists": true,
420
  "bytes": 5936,
421
- "sha256": "b9c324a59e447a11bc6aeb5130736788981b9a1b529a80c988378e3f05f924b1"
422
  },
423
  "hf_model": {
424
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/website_integrity.json",
425
  "exists": true,
426
  "bytes": 5936,
427
- "sha256": "b9c324a59e447a11bc6aeb5130736788981b9a1b529a80c988378e3f05f924b1"
428
  }
429
  },
430
  "failures": []
@@ -435,33 +435,33 @@
435
  "local": {
436
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/assets/task_suite_infographic.png",
437
  "exists": true,
438
- "bytes": 2331622,
439
- "sha256": "7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf"
440
  },
441
  "mirrors": {
442
  "hf_space": {
443
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/assets/task_suite_infographic.png",
444
  "exists": true,
445
- "bytes": 2331622,
446
- "sha256": "7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf"
447
  },
448
  "hf_artifacts_docs": {
449
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/assets/task_suite_infographic.png",
450
  "exists": true,
451
- "bytes": 2331622,
452
- "sha256": "7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf"
453
  },
454
  "hf_artifacts_card": {
455
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/assets/task_suite_infographic.png",
456
  "exists": true,
457
- "bytes": 2331622,
458
- "sha256": "7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf"
459
  },
460
  "hf_model": {
461
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/assets/task_suite_infographic.png",
462
  "exists": true,
463
- "bytes": 2331622,
464
- "sha256": "7ffca73ef3cb775dfd8b0563e943502421eac55989b3c6dd05101e7f070a3bdf"
465
  }
466
  },
467
  "failures": []
@@ -855,21 +855,21 @@
855
  "local": {
856
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/validate_publication_package.py",
857
  "exists": true,
858
- "bytes": 9772,
859
- "sha256": "1a915bdd68a6c63941339282a8f747e4cafa08c24e5cdb3dbe105bf6ac3ea144"
860
  },
861
  "mirrors": {
862
  "hf_artifacts": {
863
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/validate_publication_package.py",
864
  "exists": true,
865
- "bytes": 9772,
866
- "sha256": "1a915bdd68a6c63941339282a8f747e4cafa08c24e5cdb3dbe105bf6ac3ea144"
867
  },
868
  "hf_model": {
869
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/validate_publication_package.py",
870
  "exists": true,
871
- "bytes": 9772,
872
- "sha256": "1a915bdd68a6c63941339282a8f747e4cafa08c24e5cdb3dbe105bf6ac3ea144"
873
  }
874
  },
875
  "failures": []
@@ -930,21 +930,21 @@
930
  "local": {
931
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/index.html",
932
  "exists": true,
933
- "bytes": 89653,
934
- "sha256": "f4d2b412d24bb29e977e8b82bb531fdb352cc7a1b81a2141ac63a0328bab654b"
935
  },
936
  "mirrors": {
937
  "hf_space": {
938
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/index.html",
939
  "exists": true,
940
- "bytes": 89653,
941
- "sha256": "f4d2b412d24bb29e977e8b82bb531fdb352cc7a1b81a2141ac63a0328bab654b"
942
  },
943
  "hf_artifacts_docs": {
944
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/index.html",
945
  "exists": true,
946
- "bytes": 89653,
947
- "sha256": "f4d2b412d24bb29e977e8b82bb531fdb352cc7a1b81a2141ac63a0328bab654b"
948
  }
949
  },
950
  "failures": []
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-01T05:59:09+00:00",
4
  "hf_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish",
5
  "summary": {
6
  "group_count": 29,
 
33
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/artifact_index.json",
34
  "exists": true,
35
  "bytes": 12916,
36
+ "sha256": "0d12a2e36db90b4c6ae4205aab074e6c6b00083fa6c4cc18ea51342f5f1a05df"
37
  },
38
  "mirrors": {
39
  "hf_space": {
40
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/artifact_index.json",
41
  "exists": true,
42
  "bytes": 12916,
43
+ "sha256": "0d12a2e36db90b4c6ae4205aab074e6c6b00083fa6c4cc18ea51342f5f1a05df"
44
  },
45
  "hf_artifacts": {
46
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/artifact_index.json",
47
  "exists": true,
48
  "bytes": 12916,
49
+ "sha256": "0d12a2e36db90b4c6ae4205aab074e6c6b00083fa6c4cc18ea51342f5f1a05df"
50
  },
51
  "hf_model": {
52
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/artifact_index.json",
53
  "exists": true,
54
  "bytes": 12916,
55
+ "sha256": "0d12a2e36db90b4c6ae4205aab074e6c6b00083fa6c4cc18ea51342f5f1a05df"
56
  }
57
  },
58
  "failures": []
 
63
  "local": {
64
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/evidence_contract.json",
65
  "exists": true,
66
+ "bytes": 7205,
67
+ "sha256": "78f2a0db49dc00f52f79dc4d3448d01a47bd28019458bd61000304e119a018e8"
68
  },
69
  "mirrors": {
70
  "hf_space": {
71
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/evidence_contract.json",
72
  "exists": true,
73
+ "bytes": 7205,
74
+ "sha256": "78f2a0db49dc00f52f79dc4d3448d01a47bd28019458bd61000304e119a018e8"
75
  },
76
  "hf_artifacts": {
77
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/evidence_contract.json",
78
  "exists": true,
79
+ "bytes": 7205,
80
+ "sha256": "78f2a0db49dc00f52f79dc4d3448d01a47bd28019458bd61000304e119a018e8"
81
  },
82
  "hf_model": {
83
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/evidence_contract.json",
84
  "exists": true,
85
+ "bytes": 7205,
86
+ "sha256": "78f2a0db49dc00f52f79dc4d3448d01a47bd28019458bd61000304e119a018e8"
87
  }
88
  },
89
  "failures": []
 
156
  "local": {
157
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/publication_audit.json",
158
  "exists": true,
159
+ "bytes": 5292,
160
+ "sha256": "c273917a673c41fed0b1498ffef2d1e64ebd9691e7b870cbcd15b194785397ed"
161
  },
162
  "mirrors": {
163
  "hf_space": {
164
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/publication_audit.json",
165
  "exists": true,
166
+ "bytes": 5292,
167
+ "sha256": "c273917a673c41fed0b1498ffef2d1e64ebd9691e7b870cbcd15b194785397ed"
168
  },
169
  "hf_artifacts": {
170
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/publication_audit.json",
171
  "exists": true,
172
+ "bytes": 5292,
173
+ "sha256": "c273917a673c41fed0b1498ffef2d1e64ebd9691e7b870cbcd15b194785397ed"
174
  },
175
  "hf_model": {
176
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/publication_audit.json",
177
  "exists": true,
178
+ "bytes": 5292,
179
+ "sha256": "c273917a673c41fed0b1498ffef2d1e64ebd9691e7b870cbcd15b194785397ed"
180
  }
181
  },
182
  "failures": []
 
312
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/scope_claims_audit.json",
313
  "exists": true,
314
  "bytes": 19964,
315
+ "sha256": "a74d89124ddbecd1af7a6054fcaa0d905c639dec56a9fd455140817220fd91a1"
316
  },
317
  "mirrors": {
318
  "hf_space": {
319
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/scope_claims_audit.json",
320
  "exists": true,
321
  "bytes": 19964,
322
+ "sha256": "a74d89124ddbecd1af7a6054fcaa0d905c639dec56a9fd455140817220fd91a1"
323
  },
324
  "hf_artifacts": {
325
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/scope_claims_audit.json",
326
  "exists": true,
327
  "bytes": 19964,
328
+ "sha256": "a74d89124ddbecd1af7a6054fcaa0d905c639dec56a9fd455140817220fd91a1"
329
  },
330
  "hf_model": {
331
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/scope_claims_audit.json",
332
  "exists": true,
333
  "bytes": 19964,
334
+ "sha256": "a74d89124ddbecd1af7a6054fcaa0d905c639dec56a9fd455140817220fd91a1"
335
  }
336
  },
337
  "failures": []
 
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/website_integrity.json",
  "exists": true,
  "bytes": 5936,
+ "sha256": "0b9a1d8d3bbf953f86aed99fe22882dbee4ed2813f2cae91cbf15a2002752cc2"
  },
  "mirrors": {
  "hf_space": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/website_integrity.json",
  "exists": true,
  "bytes": 5936,
+ "sha256": "0b9a1d8d3bbf953f86aed99fe22882dbee4ed2813f2cae91cbf15a2002752cc2"
  },
  "hf_artifacts": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/website_integrity.json",
  "exists": true,
  "bytes": 5936,
+ "sha256": "0b9a1d8d3bbf953f86aed99fe22882dbee4ed2813f2cae91cbf15a2002752cc2"
  },
  "hf_model": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/website_integrity.json",
  "exists": true,
  "bytes": 5936,
+ "sha256": "0b9a1d8d3bbf953f86aed99fe22882dbee4ed2813f2cae91cbf15a2002752cc2"
  }
  },
  "failures": []
 
  "local": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/assets/task_suite_infographic.png",
  "exists": true,
+ "bytes": 2322389,
+ "sha256": "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5"
  },
  "mirrors": {
  "hf_space": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/assets/task_suite_infographic.png",
  "exists": true,
+ "bytes": 2322389,
+ "sha256": "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5"
  },
  "hf_artifacts_docs": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/assets/task_suite_infographic.png",
  "exists": true,
+ "bytes": 2322389,
+ "sha256": "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5"
  },
  "hf_artifacts_card": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/assets/task_suite_infographic.png",
  "exists": true,
+ "bytes": 2322389,
+ "sha256": "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5"
  },
  "hf_model": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/assets/task_suite_infographic.png",
  "exists": true,
+ "bytes": 2322389,
+ "sha256": "0e1d1b1eb165f6c52a49f94e4e93b7b4e9421d9efa015d7a094d70c48800e9b5"
  }
  },
  "failures": []
 
  "local": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/validate_publication_package.py",
  "exists": true,
+ "bytes": 12444,
+ "sha256": "f8fc86b66a1fde0755004897dd307eb5c80f84bdaf917158b43c423ff6e7e9e7"
  },
  "mirrors": {
  "hf_artifacts": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/validate_publication_package.py",
  "exists": true,
+ "bytes": 12444,
+ "sha256": "f8fc86b66a1fde0755004897dd307eb5c80f84bdaf917158b43c423ff6e7e9e7"
  },
  "hf_model": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/validate_publication_package.py",
  "exists": true,
+ "bytes": 12444,
+ "sha256": "f8fc86b66a1fde0755004897dd307eb5c80f84bdaf917158b43c423ff6e7e9e7"
  }
  },
  "failures": []
 
  "local": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/index.html",
  "exists": true,
+ "bytes": 89772,
+ "sha256": "3544638ab8dc809e126f347d942b4f7303674edd79858cd039a5b18b95500fcb"
  },
  "mirrors": {
  "hf_space": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/index.html",
  "exists": true,
+ "bytes": 89772,
+ "sha256": "3544638ab8dc809e126f347d942b4f7303674edd79858cd039a5b18b95500fcb"
  },
  "hf_artifacts_docs": {
  "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/index.html",
  "exists": true,
+ "bytes": 89772,
+ "sha256": "3544638ab8dc809e126f347d942b4f7303674edd79858cd039a5b18b95500fcb"
  }
  },
  "failures": []
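The parity entries above pair one `local` record with several `mirrors` records, each carrying `exists`, `bytes`, and `sha256`, plus a `failures` list. The script that produces them is not part of this diff, so the following is only a minimal sketch of how such an entry could be computed. The helper names `file_record` and `parity_entry` are hypothetical, and the rule that a mirror fails when it is missing or its hash differs from the local hash is an assumption.

```python
import hashlib
from pathlib import Path


def file_record(path: Path) -> dict:
    """Describe one file with the same keys the parity entries use."""
    if not path.exists():
        return {"path": str(path), "exists": False, "bytes": None, "sha256": None}
    data = path.read_bytes()
    return {
        "path": str(path),
        "exists": True,
        "bytes": len(data),
        "sha256": hashlib.sha256(data).hexdigest(),
    }


def parity_entry(local: Path, mirrors: dict[str, Path]) -> dict:
    """Compare a local file against named mirror copies (hypothetical helper)."""
    local_rec = file_record(local)
    mirror_recs = {name: file_record(p) for name, p in mirrors.items()}
    # Assumed rule: a mirror fails if it is absent or its hash differs.
    failures = [
        name
        for name, rec in mirror_recs.items()
        if not rec["exists"] or rec["sha256"] != local_rec["sha256"]
    ]
    if not local_rec["exists"]:
        failures.insert(0, "local")
    return {"local": local_rec, "mirrors": mirror_recs, "failures": failures}
```

Under these assumptions, an empty `failures` list means every mirror is byte-identical to the local file, which matches the shape of the entries shown in this diff.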
metrics/publication_audit.json CHANGED
@@ -1,6 +1,6 @@
  {
  "status": "pass",
- "generated_at_utc": "2026-06-01T05:07:53+00:00",
+ "generated_at_utc": "2026-06-01T05:58:39+00:00",
  "checks": [
  {
  "name": "required_publication_assets_present",
@@ -31,6 +31,11 @@
  "name": "no_stale_task_suite_presentation_copy",
  "status": "pass",
  "count": 0
+ },
+ {
+ "name": "public_cards_reference_taskfirst_figure",
+ "status": "pass",
+ "failures": []
  }
  ],
  "required_assets": {
@@ -80,6 +85,48 @@
  "scripts/validate_website_integrity.py": true,
  "scripts/omni/train_qwen3_omni_lora.py": true
  },
+ "public_card_freshness": [
+ {
+ "surface": "github_repo",
+ "path": "README.md",
+ "exists": true,
+ "required_marker_count": 3,
+ "missing_markers": [],
+ "status": "pass"
+ },
+ {
+ "surface": "hf_space_bundle",
+ "path": "README.md",
+ "exists": true,
+ "required_marker_count": 4,
+ "missing_markers": [],
+ "status": "pass"
+ },
+ {
+ "surface": "hf_artifact_bundle",
+ "path": "README.md",
+ "exists": true,
+ "required_marker_count": 3,
+ "missing_markers": [],
+ "status": "pass"
+ },
+ {
+ "surface": "hf_artifact_bundle",
+ "path": "PROJECT_README.md",
+ "exists": true,
+ "required_marker_count": 3,
+ "missing_markers": [],
+ "status": "pass"
+ },
+ {
+ "surface": "hf_model_bundle",
+ "path": "README.md",
+ "exists": true,
+ "required_marker_count": 4,
+ "missing_markers": [],
+ "status": "pass"
+ }
+ ],
  "scans": {
  "github_repo": {
  "root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy",
@@ -99,7 +146,7 @@
  "text_file_count": 38,
  "largest_file": {
  "path": "assets/task_suite_infographic.png",
- "bytes": 2331622
+ "bytes": 2322389
  },
  "violations": []
  },
metrics/scope_claims_audit.json CHANGED
@@ -1,6 +1,6 @@
  {
  "status": "pass",
- "generated_at_utc": "2026-06-01T05:06:13+00:00",
+ "generated_at_utc": "2026-06-01T05:46:47+00:00",
  "summary": {
  "qwen3_omni_32_episode_claim": false,
  "dataset_manifest_num_episodes": 1,
@@ -1,6 +1,6 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-01T05:06:42+00:00",
4
  "docs_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
@@ -25,7 +25,7 @@
25
  "status": "pass",
26
  "reason": "The Suite anchor should show the full 12-task map before the modality atlas.",
27
  "first_marker_index": 380,
28
- "second_marker_index": 669
29
  },
30
  {
31
  "name": "suite_modality_atlas_contains_seven_cards",
@@ -51,17 +51,17 @@
51
  "json_files": [
52
  {
53
  "path": "data/artifact_index.json",
54
- "bytes": 12902,
55
  "top_level_type": "dict"
56
  },
57
  {
58
  "path": "data/evidence_contract.json",
59
- "bytes": 7182,
60
  "top_level_type": "dict"
61
  },
62
  {
63
  "path": "data/mirror_parity.json",
64
- "bytes": 41465,
65
  "top_level_type": "dict"
66
  },
67
  {
@@ -76,7 +76,7 @@
76
  },
77
  {
78
  "path": "data/publication_audit.json",
79
- "bytes": 4105,
80
  "top_level_type": "dict"
81
  },
82
  {
@@ -252,9 +252,9 @@
252
  {
253
  "path": "assets/task_suite_infographic.png",
254
  "exists": true,
255
- "bytes": 2331622,
256
  "width": 1800,
257
- "height": 5700,
258
  "format": "PNG"
259
  }
260
  ]
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-01T05:57:48+00:00",
4
  "docs_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs",
5
  "site_base": "/ropedia-xperience-10m-task-suite/",
6
  "summary": {
 
25
  "status": "pass",
26
  "reason": "The Suite anchor should show the full 12-task map before the modality atlas.",
27
  "first_marker_index": 380,
28
+ "second_marker_index": 696
29
  },
30
  {
31
  "name": "suite_modality_atlas_contains_seven_cards",
 
51
  "json_files": [
52
  {
53
  "path": "data/artifact_index.json",
54
+ "bytes": 12916,
55
  "top_level_type": "dict"
56
  },
57
  {
58
  "path": "data/evidence_contract.json",
59
+ "bytes": 7205,
60
  "top_level_type": "dict"
61
  },
62
  {
63
  "path": "data/mirror_parity.json",
64
+ "bytes": 42567,
65
  "top_level_type": "dict"
66
  },
67
  {
 
76
  },
77
  {
78
  "path": "data/publication_audit.json",
79
+ "bytes": 5292,
80
  "top_level_type": "dict"
81
  },
82
  {
 
252
  {
253
  "path": "assets/task_suite_infographic.png",
254
  "exists": true,
255
+ "bytes": 2322389,
256
  "width": 1800,
257
+ "height": 5850,
258
  "format": "PNG"
259
  }
260
  ]
scripts/validate_publication_package.py CHANGED
@@ -44,8 +44,58 @@ TEXT_SUFFIXES = {
  TOKEN_PATTERN = re.compile(r"hf_[A-Za-z0-9]{20,}")
  STALE_PRESENTATION_STRINGS = {
      "xperience10m-" + "modalities-v9-large-atlas": "old task-suite infographic cache key",
+     "xperience10m-" + "taskfirst-v10": "older task-suite infographic cache key",
      "Start with the large native " + "modality atlas": "old suite-section hierarchy copy",
  }
+ CARD_FRESHNESS_EXPECTATIONS = [
+     {
+         "surface": "github_repo",
+         "relative_path": "README.md",
+         "required": [
+             "xperience10m-taskfirst-v11-modality-spread",
+             "all 12 task families before the",
+             "Public-sample modality thumbnails remain enlarged below",
+         ],
+     },
+     {
+         "surface": "hf_space_bundle",
+         "relative_path": "README.md",
+         "required": [
+             "xperience10m-taskfirst-v11-modality-spread",
+             "task-first 12-task infographic",
+             "native responsive modality atlas",
+             "website HTML",
+         ],
+     },
+     {
+         "surface": "hf_artifact_bundle",
+         "relative_path": "README.md",
+         "required": [
+             "xperience10m-taskfirst-v11-modality-spread",
+             "task-first 12-task map",
+             "including critical website HTML",
+         ],
+     },
+     {
+         "surface": "hf_artifact_bundle",
+         "relative_path": "PROJECT_README.md",
+         "required": [
+             "xperience10m-taskfirst-v11-modality-spread",
+             "all 12 task families before the",
+             "Public-sample modality thumbnails remain enlarged below",
+         ],
+     },
+     {
+         "surface": "hf_model_bundle",
+         "relative_path": "README.md",
+         "required": [
+             "xperience10m-taskfirst-v11-modality-spread",
+             "task-first 12-head",
+             "responsive modality atlas",
+             "website HTML",
+         ],
+     },
+ ]


  def rel(path: Path, base: Path) -> str:
@@ -187,6 +237,24 @@ def required_assets(root: Path) -> dict[str, bool]:
      return {item: (root / item).exists() for item in required}


+ def public_card_freshness(roots: dict[str, Path]) -> list[dict]:
+     records = []
+     for item in CARD_FRESHNESS_EXPECTATIONS:
+         surface = item["surface"]
+         path = roots[surface] / item["relative_path"]
+         text = path.read_text(encoding="utf-8", errors="ignore") if path.exists() else ""
+         missing = [marker for marker in item["required"] if marker not in text]
+         records.append({
+             "surface": surface,
+             "path": item["relative_path"],
+             "exists": path.exists(),
+             "required_marker_count": len(item["required"]),
+             "missing_markers": missing,
+             "status": "pass" if path.exists() and not missing else "fail",
+         })
+     return records
+
+
  def build_report(hf_root: Path) -> dict:
      roots = {
          "github_repo": ROOT,
@@ -199,6 +267,7 @@ def build_report(hf_root: Path) -> dict:
          public_paths = git_public_paths(path) if name == "github_repo" else None
          scans[name] = scan(path, paths=public_paths)
      assets = required_assets(ROOT)
+     card_freshness = public_card_freshness(roots)
      missing_assets = [path for path, present in assets.items() if not present]
      violations = [
          {"root": name, **violation}
@@ -238,6 +307,11 @@ def build_report(hf_root: Path) -> dict:
              "status": "pass" if not any(v["kind"] == "stale_presentation_copy" for v in violations) else "fail",
              "count": sum(1 for v in violations if v["kind"] == "stale_presentation_copy"),
          },
+         {
+             "name": "public_cards_reference_taskfirst_figure",
+             "status": "pass" if all(item["status"] == "pass" for item in card_freshness) else "fail",
+             "failures": [item for item in card_freshness if item["status"] != "pass"],
+         },
      ]
      status = "pass" if all(check["status"] == "pass" for check in checks) else "fail"
      return {
@@ -245,6 +319,7 @@ def build_report(hf_root: Path) -> dict:
          "generated_at_utc": datetime.now(timezone.utc).isoformat(timespec="seconds"),
          "checks": checks,
          "required_assets": assets,
+         "public_card_freshness": card_freshness,
          "scans": scans,
          "violations": violations,
      }
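The freshness check added in this diff reads each public card and lists the required marker strings it lacks, and a missing file counts as a failure. The standalone sketch below exercises that same rule on temporary files, so the three outcomes (fresh, stale, absent) can be seen without the repository layout. The function name `check_markers` and the marker strings `example-figure-v2` and `example-section-copy` are placeholders invented for illustration, not values from the project.

```python
import tempfile
from pathlib import Path


def check_markers(path: Path, required: list[str]) -> dict:
    """Apply the marker rule from the diff to a single file."""
    text = path.read_text(encoding="utf-8", errors="ignore") if path.exists() else ""
    missing = [marker for marker in required if marker not in text]
    return {
        "exists": path.exists(),
        "required_marker_count": len(required),
        "missing_markers": missing,
        "status": "pass" if path.exists() and not missing else "fail",
    }


def demo() -> list[dict]:
    """Return records for a fresh card, a stale card, and an absent card."""
    with tempfile.TemporaryDirectory() as tmp:
        card = Path(tmp) / "README.md"
        # Placeholder card text; real cards carry the project's own markers.
        card.write_text("Figure cache key: example-figure-v2\n", encoding="utf-8")
        fresh = check_markers(card, ["example-figure-v2"])
        stale = check_markers(card, ["example-figure-v2", "example-section-copy"])
        absent = check_markers(Path(tmp) / "MISSING.md", ["example-figure-v2"])
    return [fresh, stale, absent]


if __name__ == "__main__":
    for record in demo():
        print(record)
```

Because the check is plain substring matching, it confirms only that a marker appears somewhere in the card, not that it appears in the intended place.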