cy0307 commited on
Commit
f45f1a0
·
verified ·
1 Parent(s): 07da339

Add files using upload-large-folder tool

Browse files
QWEN3_OMNI_RUN_LINEAGE.md ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Qwen3-Omni v1-v6 Run Lineage
2
+
3
+ Generated: `2026-06-21T09:58:19+00:00`.
4
+
5
+ Scope: Verified public-safe Qwen3-Omni LoRA/eval packages over the selected Xperience-10M 128-episode surface.
6
+
7
+ Interpretation rule: Do not confuse the Qwen run versions with the project-level public result layers. The 20-task matrix uses Qwen3-Omni v6 LoRA; v5 remains the pinned prior release; v1-v4 are lineage and ablation evidence.
8
+
9
+ ## Compact Lineage
10
+
11
+ | Version | What changed | Eval samples | JSON validity | Action macro-F1 | Subtask acc. | Contact acc. | Public role |
12
+ | --- | --- | --- | --- | --- | --- | --- | --- |
13
+ | v1 | Selected-128 validation-aware LoRA baseline | 448 | 0.8750 | 0.0027 | 0.0067 | 0.6451 | superseded lineage evidence, not the current 20-task Qwen row |
14
+ | v2 | Structured-JSON reuse full-8-GPU LoRA | 448 | 0.9978 | 0.0024 | 0.0022 | 0.7188 | superseded lineage evidence, not the current 20-task Qwen row |
15
+ | v3 | Strict-label prompt evaluation | 448 | 1.0000 | 0.0022 | 0.0022 | 0.7210 | superseded prompt/eval lineage evidence |
16
+ | v4 | Four-epoch structured-JSON LoRA | 448 | 1.0000 | 0.0019 | 0.0000 | 0.7299 | superseded lineage evidence, not the current 20-task Qwen row |
17
+ | v5 | Multiscale cap96 LoRA | 4032 | 1.0000 | 0.0023 | 0.0112 | 0.7865 | pinned prior release row and comparison baseline |
18
+ | v6 | Rank64 lr5e-5 multiscale LoRA | 4032 | 0.9990 | 0.0029 | 0.0037 | 0.8177 | current public 20-task Qwen3-Omni v6 LoRA row |
19
+
20
+ ## Run IDs And Packages
21
+
22
+ | Version | Train run | Eval run | Role | Package |
23
+ | --- | --- | --- | --- | --- |
24
+ | v1 | xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_lora | xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_eval | First verified 96/16/16 selected-episode Qwen3-Omni LoRA package; establishes dataset, training, eval, and packaging plumbing. | results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_eval |
25
+ | v2 | xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora | xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full | Reuses the selected-128 split with a stricter structured JSON answer contract and full 8-GPU LoRA training. | results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full |
26
+ | v3 | xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora | xperience10m_qwen3_omni_128ep_structured_json_v3_strict_label_prompt_reuse_lora_eval_test_full | Strict-label prompt/eval pass over the v2 adapter; improves JSON validity without introducing a new adapter training run. | results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v3_strict_label_prompt_reuse_lora_eval_test_full |
27
+ | v4 | xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora | xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full | Four-epoch full-8-GPU LoRA run on the same selected split; useful for overfit/metric tradeoff analysis. | results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full |
28
+ | v5 | xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora | xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full | Dense/multiscale selected-128 run with 4,032 held-out predictions; kept as the pinned prior release because several metrics remain stronger than v6. | results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full |
29
+ | v6 | xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora | xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full | Current verified Qwen3-Omni row: rank64/lr5e-5 multiscale LoRA plus task-specific probe artifacts used for the 20/20 Qwen matrix coverage. | results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full |
30
+
31
+ ## Related Engineering Artifacts
32
+
33
+ | Artifact | Path | Role |
34
+ | --- | --- | --- |
35
+ | Full-parameter gates | results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md | Feasibility and short-train gates; not a public 20-task matrix method row. |
36
+ | Alternate fullsplit v6 package | results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_fullsplit_fast8gpu_lora_fsdp_full_train_noval_tail_logits_fullstatesave_v6_eval_test_full | Verified alternate no-validation/fullsplit artifact retained for audit, not the current matrix row. |
data/mirror_parity.json CHANGED
@@ -1,9 +1,9 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T10:09:43+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
- "group_count": 1253,
7
  "failure_count": 0,
8
  "failures_by_surface": {}
9
  },
@@ -1063,6 +1063,55 @@
1063
  },
1064
  "failures": []
1065
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1066
  {
1067
  "name": "data/qwen3_v5_v6_comparison.json",
1068
  "status": "pass",
@@ -1119,44 +1168,44 @@
1119
  "path": "repo:docs/data/quality_gates.json",
1120
  "exists": true,
1121
  "bytes": 8640,
1122
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1123
  },
1124
  "mirrors": {
1125
  "hf_space": {
1126
  "path": "hf_space:data/quality_gates.json",
1127
  "exists": true,
1128
  "bytes": 8640,
1129
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1130
  },
1131
  "hf_artifacts_data": {
1132
  "path": "hf_artifacts:data/quality_gates.json",
1133
  "exists": true,
1134
  "bytes": 8640,
1135
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1136
  },
1137
  "hf_artifacts": {
1138
  "path": "hf_artifacts:docs/data/quality_gates.json",
1139
  "exists": true,
1140
  "bytes": 8640,
1141
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1142
  },
1143
  "hf_model_data": {
1144
  "path": "hf_model:data/quality_gates.json",
1145
  "exists": true,
1146
  "bytes": 8640,
1147
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1148
  },
1149
  "hf_model_docs_data": {
1150
  "path": "hf_model:docs/data/quality_gates.json",
1151
  "exists": true,
1152
  "bytes": 8640,
1153
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1154
  },
1155
  "hf_model": {
1156
  "path": "hf_model:metrics/quality_gates.json",
1157
  "exists": true,
1158
  "bytes": 8640,
1159
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1160
  }
1161
  },
1162
  "failures": []
@@ -5348,6 +5397,31 @@
5348
  },
5349
  "failures": []
5350
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5351
  {
5352
  "name": "scripts/build_public_surface_qa.py",
5353
  "status": "pass",
@@ -5829,21 +5903,21 @@
5829
  "local": {
5830
  "path": "repo:scripts/validate_mirror_parity.py",
5831
  "exists": true,
5832
- "bytes": 36363,
5833
- "sha256": "24cfad669a2f77794e30acf75c0df5560991f2aab110d751f73e4f0739b2b06f"
5834
  },
5835
  "mirrors": {
5836
  "hf_artifacts": {
5837
  "path": "hf_artifacts:scripts/validate_mirror_parity.py",
5838
  "exists": true,
5839
- "bytes": 36363,
5840
- "sha256": "24cfad669a2f77794e30acf75c0df5560991f2aab110d751f73e4f0739b2b06f"
5841
  },
5842
  "hf_model": {
5843
  "path": "hf_model:scripts/validate_mirror_parity.py",
5844
  "exists": true,
5845
- "bytes": 36363,
5846
- "sha256": "24cfad669a2f77794e30acf75c0df5560991f2aab110d751f73e4f0739b2b06f"
5847
  }
5848
  },
5849
  "failures": []
@@ -6004,21 +6078,21 @@
6004
  "local": {
6005
  "path": "repo:scripts/sync_hf_publish_mirrors.py",
6006
  "exists": true,
6007
- "bytes": 28655,
6008
- "sha256": "75ec8d1c324317c8f8e5c4a483e7aef43e95c30ee3b581c0108d1de059cd68f0"
6009
  },
6010
  "mirrors": {
6011
  "hf_artifacts": {
6012
  "path": "hf_artifacts:scripts/sync_hf_publish_mirrors.py",
6013
  "exists": true,
6014
- "bytes": 28655,
6015
- "sha256": "75ec8d1c324317c8f8e5c4a483e7aef43e95c30ee3b581c0108d1de059cd68f0"
6016
  },
6017
  "hf_model": {
6018
  "path": "hf_model:scripts/sync_hf_publish_mirrors.py",
6019
  "exists": true,
6020
- "bytes": 28655,
6021
- "sha256": "75ec8d1c324317c8f8e5c4a483e7aef43e95c30ee3b581c0108d1de059cd68f0"
6022
  }
6023
  },
6024
  "failures": []
@@ -30843,26 +30917,26 @@
30843
  "path": "repo:QUALITY_GATES.md",
30844
  "exists": true,
30845
  "bytes": 5184,
30846
- "sha256": "01bc15b23784df0668624b819ead739058628378f38435f41655a1491b1e8465"
30847
  },
30848
  "mirrors": {
30849
  "hf_space": {
30850
  "path": "hf_space:QUALITY_GATES.md",
30851
  "exists": true,
30852
  "bytes": 5184,
30853
- "sha256": "01bc15b23784df0668624b819ead739058628378f38435f41655a1491b1e8465"
30854
  },
30855
  "hf_artifacts": {
30856
  "path": "hf_artifacts:QUALITY_GATES.md",
30857
  "exists": true,
30858
  "bytes": 5184,
30859
- "sha256": "01bc15b23784df0668624b819ead739058628378f38435f41655a1491b1e8465"
30860
  },
30861
  "hf_model": {
30862
  "path": "hf_model:QUALITY_GATES.md",
30863
  "exists": true,
30864
  "bytes": 5184,
30865
- "sha256": "01bc15b23784df0668624b819ead739058628378f38435f41655a1491b1e8465"
30866
  }
30867
  },
30868
  "failures": []
@@ -31084,6 +31158,37 @@
31084
  },
31085
  "failures": []
31086
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
31087
  {
31088
  "name": "docs/PROJECT_BRIEF.md",
31089
  "status": "pass",
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T10:25:58+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
+ "group_count": 1256,
7
  "failure_count": 0,
8
  "failures_by_surface": {}
9
  },
 
1063
  },
1064
  "failures": []
1065
  },
1066
+ {
1067
+ "name": "data/qwen3_omni_run_lineage.json",
1068
+ "status": "pass",
1069
+ "local": {
1070
+ "path": "repo:docs/data/qwen3_omni_run_lineage.json",
1071
+ "exists": true,
1072
+ "bytes": 9211,
1073
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1074
+ },
1075
+ "mirrors": {
1076
+ "hf_space": {
1077
+ "path": "hf_space:data/qwen3_omni_run_lineage.json",
1078
+ "exists": true,
1079
+ "bytes": 9211,
1080
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1081
+ },
1082
+ "hf_artifacts_data": {
1083
+ "path": "hf_artifacts:data/qwen3_omni_run_lineage.json",
1084
+ "exists": true,
1085
+ "bytes": 9211,
1086
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1087
+ },
1088
+ "hf_artifacts": {
1089
+ "path": "hf_artifacts:docs/data/qwen3_omni_run_lineage.json",
1090
+ "exists": true,
1091
+ "bytes": 9211,
1092
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1093
+ },
1094
+ "hf_model_data": {
1095
+ "path": "hf_model:data/qwen3_omni_run_lineage.json",
1096
+ "exists": true,
1097
+ "bytes": 9211,
1098
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1099
+ },
1100
+ "hf_model_docs_data": {
1101
+ "path": "hf_model:docs/data/qwen3_omni_run_lineage.json",
1102
+ "exists": true,
1103
+ "bytes": 9211,
1104
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1105
+ },
1106
+ "hf_model": {
1107
+ "path": "hf_model:metrics/qwen3_omni_run_lineage.json",
1108
+ "exists": true,
1109
+ "bytes": 9211,
1110
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1111
+ }
1112
+ },
1113
+ "failures": []
1114
+ },
1115
  {
1116
  "name": "data/qwen3_v5_v6_comparison.json",
1117
  "status": "pass",
 
1168
  "path": "repo:docs/data/quality_gates.json",
1169
  "exists": true,
1170
  "bytes": 8640,
1171
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1172
  },
1173
  "mirrors": {
1174
  "hf_space": {
1175
  "path": "hf_space:data/quality_gates.json",
1176
  "exists": true,
1177
  "bytes": 8640,
1178
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1179
  },
1180
  "hf_artifacts_data": {
1181
  "path": "hf_artifacts:data/quality_gates.json",
1182
  "exists": true,
1183
  "bytes": 8640,
1184
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1185
  },
1186
  "hf_artifacts": {
1187
  "path": "hf_artifacts:docs/data/quality_gates.json",
1188
  "exists": true,
1189
  "bytes": 8640,
1190
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1191
  },
1192
  "hf_model_data": {
1193
  "path": "hf_model:data/quality_gates.json",
1194
  "exists": true,
1195
  "bytes": 8640,
1196
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1197
  },
1198
  "hf_model_docs_data": {
1199
  "path": "hf_model:docs/data/quality_gates.json",
1200
  "exists": true,
1201
  "bytes": 8640,
1202
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1203
  },
1204
  "hf_model": {
1205
  "path": "hf_model:metrics/quality_gates.json",
1206
  "exists": true,
1207
  "bytes": 8640,
1208
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1209
  }
1210
  },
1211
  "failures": []
 
5397
  },
5398
  "failures": []
5399
  },
5400
+ {
5401
+ "name": "scripts/build_qwen3_omni_run_lineage.py",
5402
+ "status": "pass",
5403
+ "local": {
5404
+ "path": "repo:scripts/build_qwen3_omni_run_lineage.py",
5405
+ "exists": true,
5406
+ "bytes": 7934,
5407
+ "sha256": "76b68613e3470d3a00642a516e9fb1272de94503d22877c845af105e8d651a6e"
5408
+ },
5409
+ "mirrors": {
5410
+ "hf_artifacts": {
5411
+ "path": "hf_artifacts:scripts/build_qwen3_omni_run_lineage.py",
5412
+ "exists": true,
5413
+ "bytes": 7934,
5414
+ "sha256": "76b68613e3470d3a00642a516e9fb1272de94503d22877c845af105e8d651a6e"
5415
+ },
5416
+ "hf_model": {
5417
+ "path": "hf_model:scripts/build_qwen3_omni_run_lineage.py",
5418
+ "exists": true,
5419
+ "bytes": 7934,
5420
+ "sha256": "76b68613e3470d3a00642a516e9fb1272de94503d22877c845af105e8d651a6e"
5421
+ }
5422
+ },
5423
+ "failures": []
5424
+ },
5425
  {
5426
  "name": "scripts/build_public_surface_qa.py",
5427
  "status": "pass",
 
5903
  "local": {
5904
  "path": "repo:scripts/validate_mirror_parity.py",
5905
  "exists": true,
5906
+ "bytes": 36470,
5907
+ "sha256": "0617ffc2fd6ed22b4202c4ec532e50796923e74b4ccd7e1fb3e1ff301995bb2c"
5908
  },
5909
  "mirrors": {
5910
  "hf_artifacts": {
5911
  "path": "hf_artifacts:scripts/validate_mirror_parity.py",
5912
  "exists": true,
5913
+ "bytes": 36470,
5914
+ "sha256": "0617ffc2fd6ed22b4202c4ec532e50796923e74b4ccd7e1fb3e1ff301995bb2c"
5915
  },
5916
  "hf_model": {
5917
  "path": "hf_model:scripts/validate_mirror_parity.py",
5918
  "exists": true,
5919
+ "bytes": 36470,
5920
+ "sha256": "0617ffc2fd6ed22b4202c4ec532e50796923e74b4ccd7e1fb3e1ff301995bb2c"
5921
  }
5922
  },
5923
  "failures": []
 
6078
  "local": {
6079
  "path": "repo:scripts/sync_hf_publish_mirrors.py",
6080
  "exists": true,
6081
+ "bytes": 29242,
6082
+ "sha256": "60566f5c6922d02c249a33d6c57b9474453369de038a30c9285febc7bd0347fa"
6083
  },
6084
  "mirrors": {
6085
  "hf_artifacts": {
6086
  "path": "hf_artifacts:scripts/sync_hf_publish_mirrors.py",
6087
  "exists": true,
6088
+ "bytes": 29242,
6089
+ "sha256": "60566f5c6922d02c249a33d6c57b9474453369de038a30c9285febc7bd0347fa"
6090
  },
6091
  "hf_model": {
6092
  "path": "hf_model:scripts/sync_hf_publish_mirrors.py",
6093
  "exists": true,
6094
+ "bytes": 29242,
6095
+ "sha256": "60566f5c6922d02c249a33d6c57b9474453369de038a30c9285febc7bd0347fa"
6096
  }
6097
  },
6098
  "failures": []
 
30917
  "path": "repo:QUALITY_GATES.md",
30918
  "exists": true,
30919
  "bytes": 5184,
30920
+ "sha256": "4931d4457c4c5b0978fdf31861b6e3e2da6e24368398cf1756120a32cbff98f0"
30921
  },
30922
  "mirrors": {
30923
  "hf_space": {
30924
  "path": "hf_space:QUALITY_GATES.md",
30925
  "exists": true,
30926
  "bytes": 5184,
30927
+ "sha256": "4931d4457c4c5b0978fdf31861b6e3e2da6e24368398cf1756120a32cbff98f0"
30928
  },
30929
  "hf_artifacts": {
30930
  "path": "hf_artifacts:QUALITY_GATES.md",
30931
  "exists": true,
30932
  "bytes": 5184,
30933
+ "sha256": "4931d4457c4c5b0978fdf31861b6e3e2da6e24368398cf1756120a32cbff98f0"
30934
  },
30935
  "hf_model": {
30936
  "path": "hf_model:QUALITY_GATES.md",
30937
  "exists": true,
30938
  "bytes": 5184,
30939
+ "sha256": "4931d4457c4c5b0978fdf31861b6e3e2da6e24368398cf1756120a32cbff98f0"
30940
  }
30941
  },
30942
  "failures": []
 
31158
  },
31159
  "failures": []
31160
  },
31161
+ {
31162
+ "name": "docs/QWEN3_OMNI_RUN_LINEAGE.md",
31163
+ "status": "pass",
31164
+ "local": {
31165
+ "path": "repo:QWEN3_OMNI_RUN_LINEAGE.md",
31166
+ "exists": true,
31167
+ "bytes": 4595,
31168
+ "sha256": "dfb1b8e931cc2dd7a15e8e3489047b5ee198495932961680b86875c169f92a68"
31169
+ },
31170
+ "mirrors": {
31171
+ "hf_space": {
31172
+ "path": "hf_space:QWEN3_OMNI_RUN_LINEAGE.md",
31173
+ "exists": true,
31174
+ "bytes": 4595,
31175
+ "sha256": "dfb1b8e931cc2dd7a15e8e3489047b5ee198495932961680b86875c169f92a68"
31176
+ },
31177
+ "hf_artifacts": {
31178
+ "path": "hf_artifacts:QWEN3_OMNI_RUN_LINEAGE.md",
31179
+ "exists": true,
31180
+ "bytes": 4595,
31181
+ "sha256": "dfb1b8e931cc2dd7a15e8e3489047b5ee198495932961680b86875c169f92a68"
31182
+ },
31183
+ "hf_model": {
31184
+ "path": "hf_model:QWEN3_OMNI_RUN_LINEAGE.md",
31185
+ "exists": true,
31186
+ "bytes": 4595,
31187
+ "sha256": "dfb1b8e931cc2dd7a15e8e3489047b5ee198495932961680b86875c169f92a68"
31188
+ }
31189
+ },
31190
+ "failures": []
31191
+ },
31192
  {
31193
  "name": "docs/PROJECT_BRIEF.md",
31194
  "status": "pass",
data/quality_gates.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T10:10:44+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
 
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T10:25:40+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
data/qwen3_omni_run_lineage.json ADDED
@@ -0,0 +1,161 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "current_public_matrix_row": "qwen3_omni_v6_lora",
3
+ "generated_at_utc": "2026-06-21T09:58:19+00:00",
4
+ "interpretation_rule": "Do not confuse the Qwen run versions with the project-level public result layers. The 20-task matrix uses Qwen3-Omni v6 LoRA; v5 remains the pinned prior release; v1-v4 are lineage and ablation evidence.",
5
+ "pinned_prior_release": "v5",
6
+ "related_engineering_artifacts": [
7
+ {
8
+ "name": "Full-parameter gates",
9
+ "path": "results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md",
10
+ "role": "Feasibility and short-train gates; not a public 20-task matrix method row."
11
+ },
12
+ {
13
+ "name": "Alternate fullsplit v6 package",
14
+ "path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_fullsplit_fast8gpu_lora_fsdp_full_train_noval_tail_logits_fullstatesave_v6_eval_test_full",
15
+ "role": "Verified alternate no-validation/fullsplit artifact retained for audit, not the current matrix row."
16
+ }
17
+ ],
18
+ "runs": [
19
+ {
20
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
21
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605",
22
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_eval",
23
+ "eval_samples": 448,
24
+ "metrics": {
25
+ "action_macro_f1": 0.0026621494447581404,
26
+ "contact_accuracy": 0.6450892857142857,
27
+ "json_validity_rate": 0.875,
28
+ "next_action_accuracy": 0.024553571428571428,
29
+ "object_micro_f1": 0.22299431459254582,
30
+ "subtask_accuracy": 0.006696428571428571,
31
+ "transition_accuracy": 0.8504464285714286
32
+ },
33
+ "package": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_eval",
34
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_eval",
35
+ "public_matrix_role": "superseded lineage evidence, not the current 20-task Qwen row",
36
+ "role": "First verified 96/16/16 selected-episode Qwen3-Omni LoRA package; establishes dataset, training, eval, and packaging plumbing.",
37
+ "status": "verified",
38
+ "title": "Selected-128 validation-aware LoRA baseline",
39
+ "train_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_lora",
40
+ "version": "v1"
41
+ },
42
+ {
43
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
44
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605",
45
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
46
+ "eval_samples": 448,
47
+ "metrics": {
48
+ "action_macro_f1": 0.0024331644885523347,
49
+ "contact_accuracy": 0.71875,
50
+ "json_validity_rate": 0.9977678571428571,
51
+ "next_action_accuracy": 0.029017857142857144,
52
+ "object_micro_f1": 0.30160427807486634,
53
+ "subtask_accuracy": 0.002232142857142857,
54
+ "transition_accuracy": 0.9709821428571429
55
+ },
56
+ "package": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
57
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
58
+ "public_matrix_role": "superseded lineage evidence, not the current 20-task Qwen row",
59
+ "role": "Reuses the selected-128 split with a stricter structured JSON answer contract and full 8-GPU LoRA training.",
60
+ "status": "verified",
61
+ "title": "Structured-JSON reuse full-8-GPU LoRA",
62
+ "train_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora",
63
+ "version": "v2"
64
+ },
65
+ {
66
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
67
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605",
68
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v3_strict_label_prompt_reuse_lora_eval_test_full",
69
+ "eval_samples": 448,
70
+ "metrics": {
71
+ "action_macro_f1": 0.0021983997167007384,
72
+ "contact_accuracy": 0.7209821428571429,
73
+ "json_validity_rate": 1.0,
74
+ "next_action_accuracy": 0.03125,
75
+ "object_micro_f1": 0.30688228657389993,
76
+ "subtask_accuracy": 0.002232142857142857,
77
+ "transition_accuracy": 0.9732142857142857
78
+ },
79
+ "package": "xperience10m_qwen3_omni_128ep_structured_json_v3_strict_label_prompt_reuse_lora_eval_test_full",
80
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v3_strict_label_prompt_reuse_lora_eval_test_full",
81
+ "public_matrix_role": "superseded prompt/eval lineage evidence",
82
+ "role": "Strict-label prompt/eval pass over the v2 adapter; improves JSON validity without introducing a new adapter training run.",
83
+ "status": "verified",
84
+ "title": "Strict-label prompt evaluation",
85
+ "train_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora",
86
+ "version": "v3"
87
+ },
88
+ {
89
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
90
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605",
91
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full",
92
+ "eval_samples": 448,
93
+ "metrics": {
94
+ "action_macro_f1": 0.0018678269676001454,
95
+ "contact_accuracy": 0.7299107142857143,
96
+ "json_validity_rate": 1.0,
97
+ "next_action_accuracy": 0.033482142857142856,
98
+ "object_micro_f1": 0.31099781500364165,
99
+ "subtask_accuracy": 0.0,
100
+ "transition_accuracy": 0.9732142857142857
101
+ },
102
+ "package": "xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full",
103
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full",
104
+ "public_matrix_role": "superseded lineage evidence, not the current 20-task Qwen row",
105
+ "role": "Four-epoch full-8-GPU LoRA run on the same selected split; useful for overfit/metric tradeoff analysis.",
106
+ "status": "verified",
107
+ "title": "Four-epoch structured-JSON LoRA",
108
+ "train_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora",
109
+ "version": "v4"
110
+ },
111
+ {
112
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
113
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora",
114
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full",
115
+ "eval_samples": 4032,
116
+ "metrics": {
117
+ "action_macro_f1": 0.002289711036077459,
118
+ "contact_accuracy": 0.7864583333333334,
119
+ "json_validity_rate": 1.0,
120
+ "next_action_accuracy": 0.053618594823032224,
121
+ "object_micro_f1": 0.31614599936244814,
122
+ "subtask_accuracy": 0.011194029850746268,
123
+ "transition_accuracy": 0.9908234126984127
124
+ },
125
+ "package": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full",
126
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full",
127
+ "public_matrix_role": "pinned prior release row and comparison baseline",
128
+ "role": "Dense/multiscale selected-128 run with 4,032 held-out predictions; kept as the pinned prior release because several metrics remain stronger than v6.",
129
+ "status": "verified",
130
+ "title": "Multiscale cap96 LoRA",
131
+ "train_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora",
132
+ "version": "v5"
133
+ },
134
+ {
135
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
136
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora",
137
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
138
+ "eval_samples": 4032,
139
+ "metrics": {
140
+ "action_macro_f1": 0.0028830723979596335,
141
+ "contact_accuracy": 0.8177083333333334,
142
+ "json_validity_rate": 0.9990079365079365,
143
+ "next_action_accuracy": 0.04305335446381405,
144
+ "object_micro_f1": 0.3064982378331287,
145
+ "subtask_accuracy": 0.0037313432835820895,
146
+ "transition_accuracy": 0.9898313492063492
147
+ },
148
+ "package": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
149
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
150
+ "public_matrix_role": "current public 20-task Qwen3-Omni v6 LoRA row",
151
+ "role": "Current verified Qwen3-Omni row: rank64/lr5e-5 multiscale LoRA plus task-specific probe artifacts used for the 20/20 Qwen matrix coverage.",
152
+ "status": "verified",
153
+ "title": "Rank64 lr5e-5 multiscale LoRA",
154
+ "train_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora",
155
+ "version": "v6"
156
+ }
157
+ ],
158
+ "scope": "Verified public-safe Qwen3-Omni LoRA/eval packages over the selected Xperience-10M 128-episode surface.",
159
+ "status": "pass",
160
+ "title": "Qwen3-Omni v1-v6 Run Lineage"
161
+ }
docs/data/mirror_parity.json CHANGED
@@ -1,9 +1,9 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T10:09:43+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
- "group_count": 1253,
7
  "failure_count": 0,
8
  "failures_by_surface": {}
9
  },
@@ -1063,6 +1063,55 @@
1063
  },
1064
  "failures": []
1065
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1066
  {
1067
  "name": "data/qwen3_v5_v6_comparison.json",
1068
  "status": "pass",
@@ -1119,44 +1168,44 @@
1119
  "path": "repo:docs/data/quality_gates.json",
1120
  "exists": true,
1121
  "bytes": 8640,
1122
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1123
  },
1124
  "mirrors": {
1125
  "hf_space": {
1126
  "path": "hf_space:data/quality_gates.json",
1127
  "exists": true,
1128
  "bytes": 8640,
1129
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1130
  },
1131
  "hf_artifacts_data": {
1132
  "path": "hf_artifacts:data/quality_gates.json",
1133
  "exists": true,
1134
  "bytes": 8640,
1135
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1136
  },
1137
  "hf_artifacts": {
1138
  "path": "hf_artifacts:docs/data/quality_gates.json",
1139
  "exists": true,
1140
  "bytes": 8640,
1141
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1142
  },
1143
  "hf_model_data": {
1144
  "path": "hf_model:data/quality_gates.json",
1145
  "exists": true,
1146
  "bytes": 8640,
1147
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1148
  },
1149
  "hf_model_docs_data": {
1150
  "path": "hf_model:docs/data/quality_gates.json",
1151
  "exists": true,
1152
  "bytes": 8640,
1153
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1154
  },
1155
  "hf_model": {
1156
  "path": "hf_model:metrics/quality_gates.json",
1157
  "exists": true,
1158
  "bytes": 8640,
1159
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1160
  }
1161
  },
1162
  "failures": []
@@ -5348,6 +5397,31 @@
5348
  },
5349
  "failures": []
5350
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5351
  {
5352
  "name": "scripts/build_public_surface_qa.py",
5353
  "status": "pass",
@@ -5829,21 +5903,21 @@
5829
  "local": {
5830
  "path": "repo:scripts/validate_mirror_parity.py",
5831
  "exists": true,
5832
- "bytes": 36363,
5833
- "sha256": "24cfad669a2f77794e30acf75c0df5560991f2aab110d751f73e4f0739b2b06f"
5834
  },
5835
  "mirrors": {
5836
  "hf_artifacts": {
5837
  "path": "hf_artifacts:scripts/validate_mirror_parity.py",
5838
  "exists": true,
5839
- "bytes": 36363,
5840
- "sha256": "24cfad669a2f77794e30acf75c0df5560991f2aab110d751f73e4f0739b2b06f"
5841
  },
5842
  "hf_model": {
5843
  "path": "hf_model:scripts/validate_mirror_parity.py",
5844
  "exists": true,
5845
- "bytes": 36363,
5846
- "sha256": "24cfad669a2f77794e30acf75c0df5560991f2aab110d751f73e4f0739b2b06f"
5847
  }
5848
  },
5849
  "failures": []
@@ -6004,21 +6078,21 @@
6004
  "local": {
6005
  "path": "repo:scripts/sync_hf_publish_mirrors.py",
6006
  "exists": true,
6007
- "bytes": 28655,
6008
- "sha256": "75ec8d1c324317c8f8e5c4a483e7aef43e95c30ee3b581c0108d1de059cd68f0"
6009
  },
6010
  "mirrors": {
6011
  "hf_artifacts": {
6012
  "path": "hf_artifacts:scripts/sync_hf_publish_mirrors.py",
6013
  "exists": true,
6014
- "bytes": 28655,
6015
- "sha256": "75ec8d1c324317c8f8e5c4a483e7aef43e95c30ee3b581c0108d1de059cd68f0"
6016
  },
6017
  "hf_model": {
6018
  "path": "hf_model:scripts/sync_hf_publish_mirrors.py",
6019
  "exists": true,
6020
- "bytes": 28655,
6021
- "sha256": "75ec8d1c324317c8f8e5c4a483e7aef43e95c30ee3b581c0108d1de059cd68f0"
6022
  }
6023
  },
6024
  "failures": []
@@ -30843,26 +30917,26 @@
30843
  "path": "repo:QUALITY_GATES.md",
30844
  "exists": true,
30845
  "bytes": 5184,
30846
- "sha256": "01bc15b23784df0668624b819ead739058628378f38435f41655a1491b1e8465"
30847
  },
30848
  "mirrors": {
30849
  "hf_space": {
30850
  "path": "hf_space:QUALITY_GATES.md",
30851
  "exists": true,
30852
  "bytes": 5184,
30853
- "sha256": "01bc15b23784df0668624b819ead739058628378f38435f41655a1491b1e8465"
30854
  },
30855
  "hf_artifacts": {
30856
  "path": "hf_artifacts:QUALITY_GATES.md",
30857
  "exists": true,
30858
  "bytes": 5184,
30859
- "sha256": "01bc15b23784df0668624b819ead739058628378f38435f41655a1491b1e8465"
30860
  },
30861
  "hf_model": {
30862
  "path": "hf_model:QUALITY_GATES.md",
30863
  "exists": true,
30864
  "bytes": 5184,
30865
- "sha256": "01bc15b23784df0668624b819ead739058628378f38435f41655a1491b1e8465"
30866
  }
30867
  },
30868
  "failures": []
@@ -31084,6 +31158,37 @@
31084
  },
31085
  "failures": []
31086
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
31087
  {
31088
  "name": "docs/PROJECT_BRIEF.md",
31089
  "status": "pass",
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T10:25:58+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
+ "group_count": 1256,
7
  "failure_count": 0,
8
  "failures_by_surface": {}
9
  },
 
1063
  },
1064
  "failures": []
1065
  },
1066
+ {
1067
+ "name": "data/qwen3_omni_run_lineage.json",
1068
+ "status": "pass",
1069
+ "local": {
1070
+ "path": "repo:docs/data/qwen3_omni_run_lineage.json",
1071
+ "exists": true,
1072
+ "bytes": 9211,
1073
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1074
+ },
1075
+ "mirrors": {
1076
+ "hf_space": {
1077
+ "path": "hf_space:data/qwen3_omni_run_lineage.json",
1078
+ "exists": true,
1079
+ "bytes": 9211,
1080
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1081
+ },
1082
+ "hf_artifacts_data": {
1083
+ "path": "hf_artifacts:data/qwen3_omni_run_lineage.json",
1084
+ "exists": true,
1085
+ "bytes": 9211,
1086
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1087
+ },
1088
+ "hf_artifacts": {
1089
+ "path": "hf_artifacts:docs/data/qwen3_omni_run_lineage.json",
1090
+ "exists": true,
1091
+ "bytes": 9211,
1092
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1093
+ },
1094
+ "hf_model_data": {
1095
+ "path": "hf_model:data/qwen3_omni_run_lineage.json",
1096
+ "exists": true,
1097
+ "bytes": 9211,
1098
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1099
+ },
1100
+ "hf_model_docs_data": {
1101
+ "path": "hf_model:docs/data/qwen3_omni_run_lineage.json",
1102
+ "exists": true,
1103
+ "bytes": 9211,
1104
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1105
+ },
1106
+ "hf_model": {
1107
+ "path": "hf_model:metrics/qwen3_omni_run_lineage.json",
1108
+ "exists": true,
1109
+ "bytes": 9211,
1110
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1111
+ }
1112
+ },
1113
+ "failures": []
1114
+ },
1115
  {
1116
  "name": "data/qwen3_v5_v6_comparison.json",
1117
  "status": "pass",
 
1168
  "path": "repo:docs/data/quality_gates.json",
1169
  "exists": true,
1170
  "bytes": 8640,
1171
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1172
  },
1173
  "mirrors": {
1174
  "hf_space": {
1175
  "path": "hf_space:data/quality_gates.json",
1176
  "exists": true,
1177
  "bytes": 8640,
1178
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1179
  },
1180
  "hf_artifacts_data": {
1181
  "path": "hf_artifacts:data/quality_gates.json",
1182
  "exists": true,
1183
  "bytes": 8640,
1184
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1185
  },
1186
  "hf_artifacts": {
1187
  "path": "hf_artifacts:docs/data/quality_gates.json",
1188
  "exists": true,
1189
  "bytes": 8640,
1190
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1191
  },
1192
  "hf_model_data": {
1193
  "path": "hf_model:data/quality_gates.json",
1194
  "exists": true,
1195
  "bytes": 8640,
1196
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1197
  },
1198
  "hf_model_docs_data": {
1199
  "path": "hf_model:docs/data/quality_gates.json",
1200
  "exists": true,
1201
  "bytes": 8640,
1202
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1203
  },
1204
  "hf_model": {
1205
  "path": "hf_model:metrics/quality_gates.json",
1206
  "exists": true,
1207
  "bytes": 8640,
1208
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1209
  }
1210
  },
1211
  "failures": []
 
5397
  },
5398
  "failures": []
5399
  },
5400
+ {
5401
+ "name": "scripts/build_qwen3_omni_run_lineage.py",
5402
+ "status": "pass",
5403
+ "local": {
5404
+ "path": "repo:scripts/build_qwen3_omni_run_lineage.py",
5405
+ "exists": true,
5406
+ "bytes": 7934,
5407
+ "sha256": "76b68613e3470d3a00642a516e9fb1272de94503d22877c845af105e8d651a6e"
5408
+ },
5409
+ "mirrors": {
5410
+ "hf_artifacts": {
5411
+ "path": "hf_artifacts:scripts/build_qwen3_omni_run_lineage.py",
5412
+ "exists": true,
5413
+ "bytes": 7934,
5414
+ "sha256": "76b68613e3470d3a00642a516e9fb1272de94503d22877c845af105e8d651a6e"
5415
+ },
5416
+ "hf_model": {
5417
+ "path": "hf_model:scripts/build_qwen3_omni_run_lineage.py",
5418
+ "exists": true,
5419
+ "bytes": 7934,
5420
+ "sha256": "76b68613e3470d3a00642a516e9fb1272de94503d22877c845af105e8d651a6e"
5421
+ }
5422
+ },
5423
+ "failures": []
5424
+ },
5425
  {
5426
  "name": "scripts/build_public_surface_qa.py",
5427
  "status": "pass",
 
5903
  "local": {
5904
  "path": "repo:scripts/validate_mirror_parity.py",
5905
  "exists": true,
5906
+ "bytes": 36470,
5907
+ "sha256": "0617ffc2fd6ed22b4202c4ec532e50796923e74b4ccd7e1fb3e1ff301995bb2c"
5908
  },
5909
  "mirrors": {
5910
  "hf_artifacts": {
5911
  "path": "hf_artifacts:scripts/validate_mirror_parity.py",
5912
  "exists": true,
5913
+ "bytes": 36470,
5914
+ "sha256": "0617ffc2fd6ed22b4202c4ec532e50796923e74b4ccd7e1fb3e1ff301995bb2c"
5915
  },
5916
  "hf_model": {
5917
  "path": "hf_model:scripts/validate_mirror_parity.py",
5918
  "exists": true,
5919
+ "bytes": 36470,
5920
+ "sha256": "0617ffc2fd6ed22b4202c4ec532e50796923e74b4ccd7e1fb3e1ff301995bb2c"
5921
  }
5922
  },
5923
  "failures": []
 
6078
  "local": {
6079
  "path": "repo:scripts/sync_hf_publish_mirrors.py",
6080
  "exists": true,
6081
+ "bytes": 29242,
6082
+ "sha256": "60566f5c6922d02c249a33d6c57b9474453369de038a30c9285febc7bd0347fa"
6083
  },
6084
  "mirrors": {
6085
  "hf_artifacts": {
6086
  "path": "hf_artifacts:scripts/sync_hf_publish_mirrors.py",
6087
  "exists": true,
6088
+ "bytes": 29242,
6089
+ "sha256": "60566f5c6922d02c249a33d6c57b9474453369de038a30c9285febc7bd0347fa"
6090
  },
6091
  "hf_model": {
6092
  "path": "hf_model:scripts/sync_hf_publish_mirrors.py",
6093
  "exists": true,
6094
+ "bytes": 29242,
6095
+ "sha256": "60566f5c6922d02c249a33d6c57b9474453369de038a30c9285febc7bd0347fa"
6096
  }
6097
  },
6098
  "failures": []
 
30917
  "path": "repo:QUALITY_GATES.md",
30918
  "exists": true,
30919
  "bytes": 5184,
30920
+ "sha256": "4931d4457c4c5b0978fdf31861b6e3e2da6e24368398cf1756120a32cbff98f0"
30921
  },
30922
  "mirrors": {
30923
  "hf_space": {
30924
  "path": "hf_space:QUALITY_GATES.md",
30925
  "exists": true,
30926
  "bytes": 5184,
30927
+ "sha256": "4931d4457c4c5b0978fdf31861b6e3e2da6e24368398cf1756120a32cbff98f0"
30928
  },
30929
  "hf_artifacts": {
30930
  "path": "hf_artifacts:QUALITY_GATES.md",
30931
  "exists": true,
30932
  "bytes": 5184,
30933
+ "sha256": "4931d4457c4c5b0978fdf31861b6e3e2da6e24368398cf1756120a32cbff98f0"
30934
  },
30935
  "hf_model": {
30936
  "path": "hf_model:QUALITY_GATES.md",
30937
  "exists": true,
30938
  "bytes": 5184,
30939
+ "sha256": "4931d4457c4c5b0978fdf31861b6e3e2da6e24368398cf1756120a32cbff98f0"
30940
  }
30941
  },
30942
  "failures": []
 
31158
  },
31159
  "failures": []
31160
  },
31161
+ {
31162
+ "name": "docs/QWEN3_OMNI_RUN_LINEAGE.md",
31163
+ "status": "pass",
31164
+ "local": {
31165
+ "path": "repo:QWEN3_OMNI_RUN_LINEAGE.md",
31166
+ "exists": true,
31167
+ "bytes": 4595,
31168
+ "sha256": "dfb1b8e931cc2dd7a15e8e3489047b5ee198495932961680b86875c169f92a68"
31169
+ },
31170
+ "mirrors": {
31171
+ "hf_space": {
31172
+ "path": "hf_space:QWEN3_OMNI_RUN_LINEAGE.md",
31173
+ "exists": true,
31174
+ "bytes": 4595,
31175
+ "sha256": "dfb1b8e931cc2dd7a15e8e3489047b5ee198495932961680b86875c169f92a68"
31176
+ },
31177
+ "hf_artifacts": {
31178
+ "path": "hf_artifacts:QWEN3_OMNI_RUN_LINEAGE.md",
31179
+ "exists": true,
31180
+ "bytes": 4595,
31181
+ "sha256": "dfb1b8e931cc2dd7a15e8e3489047b5ee198495932961680b86875c169f92a68"
31182
+ },
31183
+ "hf_model": {
31184
+ "path": "hf_model:QWEN3_OMNI_RUN_LINEAGE.md",
31185
+ "exists": true,
31186
+ "bytes": 4595,
31187
+ "sha256": "dfb1b8e931cc2dd7a15e8e3489047b5ee198495932961680b86875c169f92a68"
31188
+ }
31189
+ },
31190
+ "failures": []
31191
+ },
31192
  {
31193
  "name": "docs/PROJECT_BRIEF.md",
31194
  "status": "pass",
docs/data/quality_gates.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T10:10:44+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
 
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T10:25:40+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
docs/data/qwen3_omni_run_lineage.json ADDED
@@ -0,0 +1,161 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "current_public_matrix_row": "qwen3_omni_v6_lora",
3
+ "generated_at_utc": "2026-06-21T09:58:19+00:00",
4
+ "interpretation_rule": "Do not confuse the Qwen run versions with the project-level public result layers. The 20-task matrix uses Qwen3-Omni v6 LoRA; v5 remains the pinned prior release; v1-v4 are lineage and ablation evidence.",
5
+ "pinned_prior_release": "v5",
6
+ "related_engineering_artifacts": [
7
+ {
8
+ "name": "Full-parameter gates",
9
+ "path": "results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md",
10
+ "role": "Feasibility and short-train gates; not a public 20-task matrix method row."
11
+ },
12
+ {
13
+ "name": "Alternate fullsplit v6 package",
14
+ "path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_fullsplit_fast8gpu_lora_fsdp_full_train_noval_tail_logits_fullstatesave_v6_eval_test_full",
15
+ "role": "Verified alternate no-validation/fullsplit artifact retained for audit, not the current matrix row."
16
+ }
17
+ ],
18
+ "runs": [
19
+ {
20
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
21
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605",
22
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_eval",
23
+ "eval_samples": 448,
24
+ "metrics": {
25
+ "action_macro_f1": 0.0026621494447581404,
26
+ "contact_accuracy": 0.6450892857142857,
27
+ "json_validity_rate": 0.875,
28
+ "next_action_accuracy": 0.024553571428571428,
29
+ "object_micro_f1": 0.22299431459254582,
30
+ "subtask_accuracy": 0.006696428571428571,
31
+ "transition_accuracy": 0.8504464285714286
32
+ },
33
+ "package": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_eval",
34
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_eval",
35
+ "public_matrix_role": "superseded lineage evidence, not the current 20-task Qwen row",
36
+ "role": "First verified 96/16/16 selected-episode Qwen3-Omni LoRA package; establishes dataset, training, eval, and packaging plumbing.",
37
+ "status": "verified",
38
+ "title": "Selected-128 validation-aware LoRA baseline",
39
+ "train_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_lora",
40
+ "version": "v1"
41
+ },
42
+ {
43
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
44
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605",
45
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
46
+ "eval_samples": 448,
47
+ "metrics": {
48
+ "action_macro_f1": 0.0024331644885523347,
49
+ "contact_accuracy": 0.71875,
50
+ "json_validity_rate": 0.9977678571428571,
51
+ "next_action_accuracy": 0.029017857142857144,
52
+ "object_micro_f1": 0.30160427807486634,
53
+ "subtask_accuracy": 0.002232142857142857,
54
+ "transition_accuracy": 0.9709821428571429
55
+ },
56
+ "package": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
57
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
58
+ "public_matrix_role": "superseded lineage evidence, not the current 20-task Qwen row",
59
+ "role": "Reuses the selected-128 split with a stricter structured JSON answer contract and full 8-GPU LoRA training.",
60
+ "status": "verified",
61
+ "title": "Structured-JSON reuse full-8-GPU LoRA",
62
+ "train_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora",
63
+ "version": "v2"
64
+ },
65
+ {
66
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
67
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605",
68
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v3_strict_label_prompt_reuse_lora_eval_test_full",
69
+ "eval_samples": 448,
70
+ "metrics": {
71
+ "action_macro_f1": 0.0021983997167007384,
72
+ "contact_accuracy": 0.7209821428571429,
73
+ "json_validity_rate": 1.0,
74
+ "next_action_accuracy": 0.03125,
75
+ "object_micro_f1": 0.30688228657389993,
76
+ "subtask_accuracy": 0.002232142857142857,
77
+ "transition_accuracy": 0.9732142857142857
78
+ },
79
+ "package": "xperience10m_qwen3_omni_128ep_structured_json_v3_strict_label_prompt_reuse_lora_eval_test_full",
80
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v3_strict_label_prompt_reuse_lora_eval_test_full",
81
+ "public_matrix_role": "superseded prompt/eval lineage evidence",
82
+ "role": "Strict-label prompt/eval pass over the v2 adapter; improves JSON validity without introducing a new adapter training run.",
83
+ "status": "verified",
84
+ "title": "Strict-label prompt evaluation",
85
+ "train_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora",
86
+ "version": "v3"
87
+ },
88
+ {
89
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
90
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605",
91
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full",
92
+ "eval_samples": 448,
93
+ "metrics": {
94
+ "action_macro_f1": 0.0018678269676001454,
95
+ "contact_accuracy": 0.7299107142857143,
96
+ "json_validity_rate": 1.0,
97
+ "next_action_accuracy": 0.033482142857142856,
98
+ "object_micro_f1": 0.31099781500364165,
99
+ "subtask_accuracy": 0.0,
100
+ "transition_accuracy": 0.9732142857142857
101
+ },
102
+ "package": "xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full",
103
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full",
104
+ "public_matrix_role": "superseded lineage evidence, not the current 20-task Qwen row",
105
+ "role": "Four-epoch full-8-GPU LoRA run on the same selected split; useful for overfit/metric tradeoff analysis.",
106
+ "status": "verified",
107
+ "title": "Four-epoch structured-JSON LoRA",
108
+ "train_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora",
109
+ "version": "v4"
110
+ },
111
+ {
112
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
113
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora",
114
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full",
115
+ "eval_samples": 4032,
116
+ "metrics": {
117
+ "action_macro_f1": 0.002289711036077459,
118
+ "contact_accuracy": 0.7864583333333334,
119
+ "json_validity_rate": 1.0,
120
+ "next_action_accuracy": 0.053618594823032224,
121
+ "object_micro_f1": 0.31614599936244814,
122
+ "subtask_accuracy": 0.011194029850746268,
123
+ "transition_accuracy": 0.9908234126984127
124
+ },
125
+ "package": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full",
126
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full",
127
+ "public_matrix_role": "pinned prior release row and comparison baseline",
128
+ "role": "Dense/multiscale selected-128 run with 4,032 held-out predictions; kept as the pinned prior release because several metrics remain stronger than v6.",
129
+ "status": "verified",
130
+ "title": "Multiscale cap96 LoRA",
131
+ "train_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora",
132
+ "version": "v5"
133
+ },
134
+ {
135
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
136
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora",
137
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
138
+ "eval_samples": 4032,
139
+ "metrics": {
140
+ "action_macro_f1": 0.0028830723979596335,
141
+ "contact_accuracy": 0.8177083333333334,
142
+ "json_validity_rate": 0.9990079365079365,
143
+ "next_action_accuracy": 0.04305335446381405,
144
+ "object_micro_f1": 0.3064982378331287,
145
+ "subtask_accuracy": 0.0037313432835820895,
146
+ "transition_accuracy": 0.9898313492063492
147
+ },
148
+ "package": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
149
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
150
+ "public_matrix_role": "current public 20-task Qwen3-Omni v6 LoRA row",
151
+ "role": "Current verified Qwen3-Omni row: rank64/lr5e-5 multiscale LoRA plus task-specific probe artifacts used for the 20/20 Qwen matrix coverage.",
152
+ "status": "verified",
153
+ "title": "Rank64 lr5e-5 multiscale LoRA",
154
+ "train_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora",
155
+ "version": "v6"
156
+ }
157
+ ],
158
+ "scope": "Verified public-safe Qwen3-Omni LoRA/eval packages over the selected Xperience-10M 128-episode surface.",
159
+ "status": "pass",
160
+ "title": "Qwen3-Omni v1-v6 Run Lineage"
161
+ }
metrics/mirror_parity.json CHANGED
@@ -1,9 +1,9 @@
1
  {
2
  "status": "pass",
3
- "generated_at_utc": "2026-06-21T10:09:43+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
- "group_count": 1253,
7
  "failure_count": 0,
8
  "failures_by_surface": {}
9
  },
@@ -1063,6 +1063,55 @@
1063
  },
1064
  "failures": []
1065
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1066
  {
1067
  "name": "data/qwen3_v5_v6_comparison.json",
1068
  "status": "pass",
@@ -1119,44 +1168,44 @@
1119
  "path": "repo:docs/data/quality_gates.json",
1120
  "exists": true,
1121
  "bytes": 8640,
1122
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1123
  },
1124
  "mirrors": {
1125
  "hf_space": {
1126
  "path": "hf_space:data/quality_gates.json",
1127
  "exists": true,
1128
  "bytes": 8640,
1129
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1130
  },
1131
  "hf_artifacts_data": {
1132
  "path": "hf_artifacts:data/quality_gates.json",
1133
  "exists": true,
1134
  "bytes": 8640,
1135
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1136
  },
1137
  "hf_artifacts": {
1138
  "path": "hf_artifacts:docs/data/quality_gates.json",
1139
  "exists": true,
1140
  "bytes": 8640,
1141
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1142
  },
1143
  "hf_model_data": {
1144
  "path": "hf_model:data/quality_gates.json",
1145
  "exists": true,
1146
  "bytes": 8640,
1147
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1148
  },
1149
  "hf_model_docs_data": {
1150
  "path": "hf_model:docs/data/quality_gates.json",
1151
  "exists": true,
1152
  "bytes": 8640,
1153
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1154
  },
1155
  "hf_model": {
1156
  "path": "hf_model:metrics/quality_gates.json",
1157
  "exists": true,
1158
  "bytes": 8640,
1159
- "sha256": "6a2890d168d4f09510a4c59cdf9319e6e03d1e5ef2a50ac57bff644565673bf9"
1160
  }
1161
  },
1162
  "failures": []
@@ -5348,6 +5397,31 @@
5348
  },
5349
  "failures": []
5350
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5351
  {
5352
  "name": "scripts/build_public_surface_qa.py",
5353
  "status": "pass",
@@ -5829,21 +5903,21 @@
5829
  "local": {
5830
  "path": "repo:scripts/validate_mirror_parity.py",
5831
  "exists": true,
5832
- "bytes": 36363,
5833
- "sha256": "24cfad669a2f77794e30acf75c0df5560991f2aab110d751f73e4f0739b2b06f"
5834
  },
5835
  "mirrors": {
5836
  "hf_artifacts": {
5837
  "path": "hf_artifacts:scripts/validate_mirror_parity.py",
5838
  "exists": true,
5839
- "bytes": 36363,
5840
- "sha256": "24cfad669a2f77794e30acf75c0df5560991f2aab110d751f73e4f0739b2b06f"
5841
  },
5842
  "hf_model": {
5843
  "path": "hf_model:scripts/validate_mirror_parity.py",
5844
  "exists": true,
5845
- "bytes": 36363,
5846
- "sha256": "24cfad669a2f77794e30acf75c0df5560991f2aab110d751f73e4f0739b2b06f"
5847
  }
5848
  },
5849
  "failures": []
@@ -6004,21 +6078,21 @@
6004
  "local": {
6005
  "path": "repo:scripts/sync_hf_publish_mirrors.py",
6006
  "exists": true,
6007
- "bytes": 28655,
6008
- "sha256": "75ec8d1c324317c8f8e5c4a483e7aef43e95c30ee3b581c0108d1de059cd68f0"
6009
  },
6010
  "mirrors": {
6011
  "hf_artifacts": {
6012
  "path": "hf_artifacts:scripts/sync_hf_publish_mirrors.py",
6013
  "exists": true,
6014
- "bytes": 28655,
6015
- "sha256": "75ec8d1c324317c8f8e5c4a483e7aef43e95c30ee3b581c0108d1de059cd68f0"
6016
  },
6017
  "hf_model": {
6018
  "path": "hf_model:scripts/sync_hf_publish_mirrors.py",
6019
  "exists": true,
6020
- "bytes": 28655,
6021
- "sha256": "75ec8d1c324317c8f8e5c4a483e7aef43e95c30ee3b581c0108d1de059cd68f0"
6022
  }
6023
  },
6024
  "failures": []
@@ -30843,26 +30917,26 @@
30843
  "path": "repo:QUALITY_GATES.md",
30844
  "exists": true,
30845
  "bytes": 5184,
30846
- "sha256": "01bc15b23784df0668624b819ead739058628378f38435f41655a1491b1e8465"
30847
  },
30848
  "mirrors": {
30849
  "hf_space": {
30850
  "path": "hf_space:QUALITY_GATES.md",
30851
  "exists": true,
30852
  "bytes": 5184,
30853
- "sha256": "01bc15b23784df0668624b819ead739058628378f38435f41655a1491b1e8465"
30854
  },
30855
  "hf_artifacts": {
30856
  "path": "hf_artifacts:QUALITY_GATES.md",
30857
  "exists": true,
30858
  "bytes": 5184,
30859
- "sha256": "01bc15b23784df0668624b819ead739058628378f38435f41655a1491b1e8465"
30860
  },
30861
  "hf_model": {
30862
  "path": "hf_model:QUALITY_GATES.md",
30863
  "exists": true,
30864
  "bytes": 5184,
30865
- "sha256": "01bc15b23784df0668624b819ead739058628378f38435f41655a1491b1e8465"
30866
  }
30867
  },
30868
  "failures": []
@@ -31084,6 +31158,37 @@
31084
  },
31085
  "failures": []
31086
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
31087
  {
31088
  "name": "docs/PROJECT_BRIEF.md",
31089
  "status": "pass",
 
1
  {
2
  "status": "pass",
3
+ "generated_at_utc": "2026-06-21T10:25:58+00:00",
4
  "hf_root": "hf_publish",
5
  "summary": {
6
+ "group_count": 1256,
7
  "failure_count": 0,
8
  "failures_by_surface": {}
9
  },
 
1063
  },
1064
  "failures": []
1065
  },
1066
+ {
1067
+ "name": "data/qwen3_omni_run_lineage.json",
1068
+ "status": "pass",
1069
+ "local": {
1070
+ "path": "repo:docs/data/qwen3_omni_run_lineage.json",
1071
+ "exists": true,
1072
+ "bytes": 9211,
1073
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1074
+ },
1075
+ "mirrors": {
1076
+ "hf_space": {
1077
+ "path": "hf_space:data/qwen3_omni_run_lineage.json",
1078
+ "exists": true,
1079
+ "bytes": 9211,
1080
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1081
+ },
1082
+ "hf_artifacts_data": {
1083
+ "path": "hf_artifacts:data/qwen3_omni_run_lineage.json",
1084
+ "exists": true,
1085
+ "bytes": 9211,
1086
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1087
+ },
1088
+ "hf_artifacts": {
1089
+ "path": "hf_artifacts:docs/data/qwen3_omni_run_lineage.json",
1090
+ "exists": true,
1091
+ "bytes": 9211,
1092
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1093
+ },
1094
+ "hf_model_data": {
1095
+ "path": "hf_model:data/qwen3_omni_run_lineage.json",
1096
+ "exists": true,
1097
+ "bytes": 9211,
1098
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1099
+ },
1100
+ "hf_model_docs_data": {
1101
+ "path": "hf_model:docs/data/qwen3_omni_run_lineage.json",
1102
+ "exists": true,
1103
+ "bytes": 9211,
1104
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1105
+ },
1106
+ "hf_model": {
1107
+ "path": "hf_model:metrics/qwen3_omni_run_lineage.json",
1108
+ "exists": true,
1109
+ "bytes": 9211,
1110
+ "sha256": "98e322fa0bec38aab93b924f4f1fcd13eee848bf9b6791fea7adcfb0db3c2953"
1111
+ }
1112
+ },
1113
+ "failures": []
1114
+ },
1115
  {
1116
  "name": "data/qwen3_v5_v6_comparison.json",
1117
  "status": "pass",
 
1168
  "path": "repo:docs/data/quality_gates.json",
1169
  "exists": true,
1170
  "bytes": 8640,
1171
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1172
  },
1173
  "mirrors": {
1174
  "hf_space": {
1175
  "path": "hf_space:data/quality_gates.json",
1176
  "exists": true,
1177
  "bytes": 8640,
1178
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1179
  },
1180
  "hf_artifacts_data": {
1181
  "path": "hf_artifacts:data/quality_gates.json",
1182
  "exists": true,
1183
  "bytes": 8640,
1184
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1185
  },
1186
  "hf_artifacts": {
1187
  "path": "hf_artifacts:docs/data/quality_gates.json",
1188
  "exists": true,
1189
  "bytes": 8640,
1190
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1191
  },
1192
  "hf_model_data": {
1193
  "path": "hf_model:data/quality_gates.json",
1194
  "exists": true,
1195
  "bytes": 8640,
1196
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1197
  },
1198
  "hf_model_docs_data": {
1199
  "path": "hf_model:docs/data/quality_gates.json",
1200
  "exists": true,
1201
  "bytes": 8640,
1202
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1203
  },
1204
  "hf_model": {
1205
  "path": "hf_model:metrics/quality_gates.json",
1206
  "exists": true,
1207
  "bytes": 8640,
1208
+ "sha256": "3cb0aca2dca01448cb9bc5cbb519a91bc6397c08a1eaaa84c031e773221e5a0a"
1209
  }
1210
  },
1211
  "failures": []
 
5397
  },
5398
  "failures": []
5399
  },
5400
+ {
5401
+ "name": "scripts/build_qwen3_omni_run_lineage.py",
5402
+ "status": "pass",
5403
+ "local": {
5404
+ "path": "repo:scripts/build_qwen3_omni_run_lineage.py",
5405
+ "exists": true,
5406
+ "bytes": 7934,
5407
+ "sha256": "76b68613e3470d3a00642a516e9fb1272de94503d22877c845af105e8d651a6e"
5408
+ },
5409
+ "mirrors": {
5410
+ "hf_artifacts": {
5411
+ "path": "hf_artifacts:scripts/build_qwen3_omni_run_lineage.py",
5412
+ "exists": true,
5413
+ "bytes": 7934,
5414
+ "sha256": "76b68613e3470d3a00642a516e9fb1272de94503d22877c845af105e8d651a6e"
5415
+ },
5416
+ "hf_model": {
5417
+ "path": "hf_model:scripts/build_qwen3_omni_run_lineage.py",
5418
+ "exists": true,
5419
+ "bytes": 7934,
5420
+ "sha256": "76b68613e3470d3a00642a516e9fb1272de94503d22877c845af105e8d651a6e"
5421
+ }
5422
+ },
5423
+ "failures": []
5424
+ },
5425
  {
5426
  "name": "scripts/build_public_surface_qa.py",
5427
  "status": "pass",
 
5903
  "local": {
5904
  "path": "repo:scripts/validate_mirror_parity.py",
5905
  "exists": true,
5906
+ "bytes": 36470,
5907
+ "sha256": "0617ffc2fd6ed22b4202c4ec532e50796923e74b4ccd7e1fb3e1ff301995bb2c"
5908
  },
5909
  "mirrors": {
5910
  "hf_artifacts": {
5911
  "path": "hf_artifacts:scripts/validate_mirror_parity.py",
5912
  "exists": true,
5913
+ "bytes": 36470,
5914
+ "sha256": "0617ffc2fd6ed22b4202c4ec532e50796923e74b4ccd7e1fb3e1ff301995bb2c"
5915
  },
5916
  "hf_model": {
5917
  "path": "hf_model:scripts/validate_mirror_parity.py",
5918
  "exists": true,
5919
+ "bytes": 36470,
5920
+ "sha256": "0617ffc2fd6ed22b4202c4ec532e50796923e74b4ccd7e1fb3e1ff301995bb2c"
5921
  }
5922
  },
5923
  "failures": []
 
6078
  "local": {
6079
  "path": "repo:scripts/sync_hf_publish_mirrors.py",
6080
  "exists": true,
6081
+ "bytes": 29242,
6082
+ "sha256": "60566f5c6922d02c249a33d6c57b9474453369de038a30c9285febc7bd0347fa"
6083
  },
6084
  "mirrors": {
6085
  "hf_artifacts": {
6086
  "path": "hf_artifacts:scripts/sync_hf_publish_mirrors.py",
6087
  "exists": true,
6088
+ "bytes": 29242,
6089
+ "sha256": "60566f5c6922d02c249a33d6c57b9474453369de038a30c9285febc7bd0347fa"
6090
  },
6091
  "hf_model": {
6092
  "path": "hf_model:scripts/sync_hf_publish_mirrors.py",
6093
  "exists": true,
6094
+ "bytes": 29242,
6095
+ "sha256": "60566f5c6922d02c249a33d6c57b9474453369de038a30c9285febc7bd0347fa"
6096
  }
6097
  },
6098
  "failures": []
 
30917
  "path": "repo:QUALITY_GATES.md",
30918
  "exists": true,
30919
  "bytes": 5184,
30920
+ "sha256": "4931d4457c4c5b0978fdf31861b6e3e2da6e24368398cf1756120a32cbff98f0"
30921
  },
30922
  "mirrors": {
30923
  "hf_space": {
30924
  "path": "hf_space:QUALITY_GATES.md",
30925
  "exists": true,
30926
  "bytes": 5184,
30927
+ "sha256": "4931d4457c4c5b0978fdf31861b6e3e2da6e24368398cf1756120a32cbff98f0"
30928
  },
30929
  "hf_artifacts": {
30930
  "path": "hf_artifacts:QUALITY_GATES.md",
30931
  "exists": true,
30932
  "bytes": 5184,
30933
+ "sha256": "4931d4457c4c5b0978fdf31861b6e3e2da6e24368398cf1756120a32cbff98f0"
30934
  },
30935
  "hf_model": {
30936
  "path": "hf_model:QUALITY_GATES.md",
30937
  "exists": true,
30938
  "bytes": 5184,
30939
+ "sha256": "4931d4457c4c5b0978fdf31861b6e3e2da6e24368398cf1756120a32cbff98f0"
30940
  }
30941
  },
30942
  "failures": []
 
31158
  },
31159
  "failures": []
31160
  },
31161
+ {
31162
+ "name": "docs/QWEN3_OMNI_RUN_LINEAGE.md",
31163
+ "status": "pass",
31164
+ "local": {
31165
+ "path": "repo:QWEN3_OMNI_RUN_LINEAGE.md",
31166
+ "exists": true,
31167
+ "bytes": 4595,
31168
+ "sha256": "dfb1b8e931cc2dd7a15e8e3489047b5ee198495932961680b86875c169f92a68"
31169
+ },
31170
+ "mirrors": {
31171
+ "hf_space": {
31172
+ "path": "hf_space:QWEN3_OMNI_RUN_LINEAGE.md",
31173
+ "exists": true,
31174
+ "bytes": 4595,
31175
+ "sha256": "dfb1b8e931cc2dd7a15e8e3489047b5ee198495932961680b86875c169f92a68"
31176
+ },
31177
+ "hf_artifacts": {
31178
+ "path": "hf_artifacts:QWEN3_OMNI_RUN_LINEAGE.md",
31179
+ "exists": true,
31180
+ "bytes": 4595,
31181
+ "sha256": "dfb1b8e931cc2dd7a15e8e3489047b5ee198495932961680b86875c169f92a68"
31182
+ },
31183
+ "hf_model": {
31184
+ "path": "hf_model:QWEN3_OMNI_RUN_LINEAGE.md",
31185
+ "exists": true,
31186
+ "bytes": 4595,
31187
+ "sha256": "dfb1b8e931cc2dd7a15e8e3489047b5ee198495932961680b86875c169f92a68"
31188
+ }
31189
+ },
31190
+ "failures": []
31191
+ },
31192
  {
31193
  "name": "docs/PROJECT_BRIEF.md",
31194
  "status": "pass",
metrics/quality_gates.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
- "generated_at_utc": "2026-06-21T10:10:44+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
 
1
  {
2
  "title": "Ropedia Xperience-10M Release Checks",
3
  "status": "pass",
4
+ "generated_at_utc": "2026-06-21T10:25:40+00:00",
5
  "rule": "A release is current when the automated reports pass and the live GitHub/Hugging Face mirrors are verified after publishing.",
6
  "automated_gates": [
7
  {
metrics/qwen3_omni_run_lineage.json ADDED
@@ -0,0 +1,161 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "current_public_matrix_row": "qwen3_omni_v6_lora",
3
+ "generated_at_utc": "2026-06-21T09:58:19+00:00",
4
+ "interpretation_rule": "Do not confuse the Qwen run versions with the project-level public result layers. The 20-task matrix uses Qwen3-Omni v6 LoRA; v5 remains the pinned prior release; v1-v4 are lineage and ablation evidence.",
5
+ "pinned_prior_release": "v5",
6
+ "related_engineering_artifacts": [
7
+ {
8
+ "name": "Full-parameter gates",
9
+ "path": "results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md",
10
+ "role": "Feasibility and short-train gates; not a public 20-task matrix method row."
11
+ },
12
+ {
13
+ "name": "Alternate fullsplit v6 package",
14
+ "path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_fullsplit_fast8gpu_lora_fsdp_full_train_noval_tail_logits_fullstatesave_v6_eval_test_full",
15
+ "role": "Verified alternate no-validation/fullsplit artifact retained for audit, not the current matrix row."
16
+ }
17
+ ],
18
+ "runs": [
19
+ {
20
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
21
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605",
22
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_eval",
23
+ "eval_samples": 448,
24
+ "metrics": {
25
+ "action_macro_f1": 0.0026621494447581404,
26
+ "contact_accuracy": 0.6450892857142857,
27
+ "json_validity_rate": 0.875,
28
+ "next_action_accuracy": 0.024553571428571428,
29
+ "object_micro_f1": 0.22299431459254582,
30
+ "subtask_accuracy": 0.006696428571428571,
31
+ "transition_accuracy": 0.8504464285714286
32
+ },
33
+ "package": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_eval",
34
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_eval",
35
+ "public_matrix_role": "superseded lineage evidence, not the current 20-task Qwen row",
36
+ "role": "First verified 96/16/16 selected-episode Qwen3-Omni LoRA package; establishes dataset, training, eval, and packaging plumbing.",
37
+ "status": "verified",
38
+ "title": "Selected-128 validation-aware LoRA baseline",
39
+ "train_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_lora",
40
+ "version": "v1"
41
+ },
42
+ {
43
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
44
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605",
45
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
46
+ "eval_samples": 448,
47
+ "metrics": {
48
+ "action_macro_f1": 0.0024331644885523347,
49
+ "contact_accuracy": 0.71875,
50
+ "json_validity_rate": 0.9977678571428571,
51
+ "next_action_accuracy": 0.029017857142857144,
52
+ "object_micro_f1": 0.30160427807486634,
53
+ "subtask_accuracy": 0.002232142857142857,
54
+ "transition_accuracy": 0.9709821428571429
55
+ },
56
+ "package": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
57
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
58
+ "public_matrix_role": "superseded lineage evidence, not the current 20-task Qwen row",
59
+ "role": "Reuses the selected-128 split with a stricter structured JSON answer contract and full 8-GPU LoRA training.",
60
+ "status": "verified",
61
+ "title": "Structured-JSON reuse full-8-GPU LoRA",
62
+ "train_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora",
63
+ "version": "v2"
64
+ },
65
+ {
66
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
67
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605",
68
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v3_strict_label_prompt_reuse_lora_eval_test_full",
69
+ "eval_samples": 448,
70
+ "metrics": {
71
+ "action_macro_f1": 0.0021983997167007384,
72
+ "contact_accuracy": 0.7209821428571429,
73
+ "json_validity_rate": 1.0,
74
+ "next_action_accuracy": 0.03125,
75
+ "object_micro_f1": 0.30688228657389993,
76
+ "subtask_accuracy": 0.002232142857142857,
77
+ "transition_accuracy": 0.9732142857142857
78
+ },
79
+ "package": "xperience10m_qwen3_omni_128ep_structured_json_v3_strict_label_prompt_reuse_lora_eval_test_full",
80
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v3_strict_label_prompt_reuse_lora_eval_test_full",
81
+ "public_matrix_role": "superseded prompt/eval lineage evidence",
82
+ "role": "Strict-label prompt/eval pass over the v2 adapter; improves JSON validity without introducing a new adapter training run.",
83
+ "status": "verified",
84
+ "title": "Strict-label prompt evaluation",
85
+ "train_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora",
86
+ "version": "v3"
87
+ },
88
+ {
89
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
90
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605",
91
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full",
92
+ "eval_samples": 448,
93
+ "metrics": {
94
+ "action_macro_f1": 0.0018678269676001454,
95
+ "contact_accuracy": 0.7299107142857143,
96
+ "json_validity_rate": 1.0,
97
+ "next_action_accuracy": 0.033482142857142856,
98
+ "object_micro_f1": 0.31099781500364165,
99
+ "subtask_accuracy": 0.0,
100
+ "transition_accuracy": 0.9732142857142857
101
+ },
102
+ "package": "xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full",
103
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full",
104
+ "public_matrix_role": "superseded lineage evidence, not the current 20-task Qwen row",
105
+ "role": "Four-epoch full-8-GPU LoRA run on the same selected split; useful for overfit/metric tradeoff analysis.",
106
+ "status": "verified",
107
+ "title": "Four-epoch structured-JSON LoRA",
108
+ "train_run_id": "xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora",
109
+ "version": "v4"
110
+ },
111
+ {
112
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
113
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora",
114
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full",
115
+ "eval_samples": 4032,
116
+ "metrics": {
117
+ "action_macro_f1": 0.002289711036077459,
118
+ "contact_accuracy": 0.7864583333333334,
119
+ "json_validity_rate": 1.0,
120
+ "next_action_accuracy": 0.053618594823032224,
121
+ "object_micro_f1": 0.31614599936244814,
122
+ "subtask_accuracy": 0.011194029850746268,
123
+ "transition_accuracy": 0.9908234126984127
124
+ },
125
+ "package": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full",
126
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full",
127
+ "public_matrix_role": "pinned prior release row and comparison baseline",
128
+ "role": "Dense/multiscale selected-128 run with 4,032 held-out predictions; kept as the pinned prior release because several metrics remain stronger than v6.",
129
+ "status": "verified",
130
+ "title": "Multiscale cap96 LoRA",
131
+ "train_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora",
132
+ "version": "v5"
133
+ },
134
+ {
135
+ "dataset_contract": "xperience10m_episode_json_qa_v1",
136
+ "dataset_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora",
137
+ "eval_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
138
+ "eval_samples": 4032,
139
+ "metrics": {
140
+ "action_macro_f1": 0.0028830723979596335,
141
+ "contact_accuracy": 0.8177083333333334,
142
+ "json_validity_rate": 0.9990079365079365,
143
+ "next_action_accuracy": 0.04305335446381405,
144
+ "object_micro_f1": 0.3064982378331287,
145
+ "subtask_accuracy": 0.0037313432835820895,
146
+ "transition_accuracy": 0.9898313492063492
147
+ },
148
+ "package": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
149
+ "package_path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
150
+ "public_matrix_role": "current public 20-task Qwen3-Omni v6 LoRA row",
151
+ "role": "Current verified Qwen3-Omni row: rank64/lr5e-5 multiscale LoRA plus task-specific probe artifacts used for the 20/20 Qwen matrix coverage.",
152
+ "status": "verified",
153
+ "title": "Rank64 lr5e-5 multiscale LoRA",
154
+ "train_run_id": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora",
155
+ "version": "v6"
156
+ }
157
+ ],
158
+ "scope": "Verified public-safe Qwen3-Omni LoRA/eval packages over the selected Xperience-10M 128-episode surface.",
159
+ "status": "pass",
160
+ "title": "Qwen3-Omni v1-v6 Run Lineage"
161
+ }
scripts/build_qwen3_omni_run_lineage.py ADDED
@@ -0,0 +1,204 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ #!/usr/bin/env python3
2
+ """Build the public Qwen3-Omni v1-v6 run-lineage summary."""
3
+
4
+ from __future__ import annotations
5
+
6
+ import json
7
+ from datetime import datetime, timezone
8
+ from pathlib import Path
9
+
10
+
11
+ ROOT = Path(__file__).resolve().parents[1]
12
+ VERIFIED = ROOT / "results/omni_finetune/verified_public"
13
+ OUTPUT_JSON = ROOT / "docs/data/qwen3_omni_run_lineage.json"
14
+ OUTPUT_MD = ROOT / "QWEN3_OMNI_RUN_LINEAGE.md"
15
+
16
+ RUNS = [
17
+ {
18
+ "version": "v1",
19
+ "title": "Selected-128 validation-aware LoRA baseline",
20
+ "package": "xperience10m_qwen3_omni_128ep_96train_16val_16test_valmon_20260605_eval",
21
+ "role": "First verified 96/16/16 selected-episode Qwen3-Omni LoRA package; establishes dataset, training, eval, and packaging plumbing.",
22
+ "public_matrix_role": "superseded lineage evidence, not the current 20-task Qwen row",
23
+ },
24
+ {
25
+ "version": "v2",
26
+ "title": "Structured-JSON reuse full-8-GPU LoRA",
27
+ "package": "xperience10m_qwen3_omni_128ep_structured_json_v2_reuse_full8gpu_lora_eval_test_full",
28
+ "role": "Reuses the selected-128 split with a stricter structured JSON answer contract and full 8-GPU LoRA training.",
29
+ "public_matrix_role": "superseded lineage evidence, not the current 20-task Qwen row",
30
+ },
31
+ {
32
+ "version": "v3",
33
+ "title": "Strict-label prompt evaluation",
34
+ "package": "xperience10m_qwen3_omni_128ep_structured_json_v3_strict_label_prompt_reuse_lora_eval_test_full",
35
+ "role": "Strict-label prompt/eval pass over the v2 adapter; improves JSON validity without introducing a new adapter training run.",
36
+ "public_matrix_role": "superseded prompt/eval lineage evidence",
37
+ },
38
+ {
39
+ "version": "v4",
40
+ "title": "Four-epoch structured-JSON LoRA",
41
+ "package": "xperience10m_qwen3_omni_128ep_structured_json_v4_4epoch_full8gpu_lora_eval_test_full",
42
+ "role": "Four-epoch full-8-GPU LoRA run on the same selected split; useful for overfit/metric tradeoff analysis.",
43
+ "public_matrix_role": "superseded lineage evidence, not the current 20-task Qwen row",
44
+ },
45
+ {
46
+ "version": "v5",
47
+ "title": "Multiscale cap96 LoRA",
48
+ "package": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v5_full8gpu_lora_eval_test_full",
49
+ "role": "Dense/multiscale selected-128 run with 4,032 held-out predictions; kept as the pinned prior release because several metrics remain stronger than v6.",
50
+ "public_matrix_role": "pinned prior release row and comparison baseline",
51
+ },
52
+ {
53
+ "version": "v6",
54
+ "title": "Rank64 lr5e-5 multiscale LoRA",
55
+ "package": "xperience10m_qwen3_omni_128ep_multiscale_cap96_v6_rank64_lr5e5_full8gpu_lora_eval_test_full",
56
+ "role": "Current verified Qwen3-Omni row: rank64/lr5e-5 multiscale LoRA plus task-specific probe artifacts used for the 20/20 Qwen matrix coverage.",
57
+ "public_matrix_role": "current public 20-task Qwen3-Omni v6 LoRA row",
58
+ },
59
+ ]
60
+
61
+ METRIC_KEYS = [
62
+ "json_validity_rate",
63
+ "action_macro_f1",
64
+ "subtask_accuracy",
65
+ "transition_accuracy",
66
+ "next_action_accuracy",
67
+ "contact_accuracy",
68
+ "object_micro_f1",
69
+ ]
70
+
71
+
72
+ def read_json(path: Path) -> dict:
73
+ return json.loads(path.read_text(encoding="utf-8"))
74
+
75
+
76
+ def metric_value(metrics: dict, key: str):
77
+ return metrics.get(key)
78
+
79
+
80
+ def fmt(value) -> str:
81
+ if value is None:
82
+ return ""
83
+ if isinstance(value, float):
84
+ return f"{value:.4f}"
85
+ return str(value)
86
+
87
+
88
+ def markdown_table(headers: list[str], rows: list[list[str]]) -> str:
89
+ out = [
90
+ "| " + " | ".join(headers) + " |",
91
+ "| " + " | ".join("---" for _ in headers) + " |",
92
+ ]
93
+ for row in rows:
94
+ out.append("| " + " | ".join(str(cell).replace("|", "\\|").replace("\n", " ") for cell in row) + " |")
95
+ return "\n".join(out)
96
+
97
+
98
+ def build_payload() -> dict:
99
+ rows = []
100
+ for spec in RUNS:
101
+ package_dir = VERIFIED / spec["package"]
102
+ summary = read_json(package_dir / "verified_result_summary.json")
103
+ metrics = read_json(package_dir / "eval/metrics.json")
104
+ row = {
105
+ **spec,
106
+ "status": summary.get("status", "verified"),
107
+ "package_path": str(package_dir.relative_to(ROOT)),
108
+ "dataset_run_id": summary.get("dataset_run_id"),
109
+ "train_run_id": summary.get("train_run_id"),
110
+ "eval_run_id": summary.get("eval_run_id"),
111
+ "dataset_contract": summary.get("dataset_contract"),
112
+ "eval_samples": metrics.get("num_samples") or metrics.get("eval_samples"),
113
+ "metrics": {key: metric_value(metrics, key) for key in METRIC_KEYS},
114
+ }
115
+ rows.append(row)
116
+ return {
117
+ "title": "Qwen3-Omni v1-v6 Run Lineage",
118
+ "status": "pass",
119
+ "generated_at_utc": datetime.now(timezone.utc).isoformat(timespec="seconds"),
120
+ "scope": "Verified public-safe Qwen3-Omni LoRA/eval packages over the selected Xperience-10M 128-episode surface.",
121
+ "interpretation_rule": (
122
+ "Do not confuse the Qwen run versions with the project-level public result layers. "
123
+ "The 20-task matrix uses Qwen3-Omni v6 LoRA; v5 remains the pinned prior release; "
124
+ "v1-v4 are lineage and ablation evidence."
125
+ ),
126
+ "current_public_matrix_row": "qwen3_omni_v6_lora",
127
+ "pinned_prior_release": "v5",
128
+ "runs": rows,
129
+ "related_engineering_artifacts": [
130
+ {
131
+ "name": "Full-parameter gates",
132
+ "path": "results/omni_finetune/QWEN3_FULL_PARAMETER_GATES_20260609.md",
133
+ "role": "Feasibility and short-train gates; not a public 20-task matrix method row.",
134
+ },
135
+ {
136
+ "name": "Alternate fullsplit v6 package",
137
+ "path": "results/omni_finetune/verified_public/xperience10m_qwen3_omni_128ep_fullsplit_fast8gpu_lora_fsdp_full_train_noval_tail_logits_fullstatesave_v6_eval_test_full",
138
+ "role": "Verified alternate no-validation/fullsplit artifact retained for audit, not the current matrix row.",
139
+ },
140
+ ],
141
+ }
142
+
143
+
144
+ def write_outputs(payload: dict) -> None:
145
+ OUTPUT_JSON.parent.mkdir(parents=True, exist_ok=True)
146
+ OUTPUT_JSON.write_text(json.dumps(payload, indent=2, sort_keys=True) + "\n", encoding="utf-8")
147
+
148
+ rows = []
149
+ for run in payload["runs"]:
150
+ m = run["metrics"]
151
+ rows.append(
152
+ [
153
+ run["version"],
154
+ run["title"],
155
+ run["eval_samples"],
156
+ fmt(m["json_validity_rate"]),
157
+ fmt(m["action_macro_f1"]),
158
+ fmt(m["subtask_accuracy"]),
159
+ fmt(m["contact_accuracy"]),
160
+ run["public_matrix_role"],
161
+ ]
162
+ )
163
+ detail_rows = [
164
+ [
165
+ run["version"],
166
+ run["train_run_id"],
167
+ run["eval_run_id"],
168
+ run["role"],
169
+ run["package_path"],
170
+ ]
171
+ for run in payload["runs"]
172
+ ]
173
+ text = f"""# Qwen3-Omni v1-v6 Run Lineage
174
+
175
+ Generated: `{payload['generated_at_utc']}`.
176
+
177
+ Scope: {payload['scope']}
178
+
179
+ Interpretation rule: {payload['interpretation_rule']}
180
+
181
+ ## Compact Lineage
182
+
183
+ {markdown_table(['Version', 'What changed', 'Eval samples', 'JSON validity', 'Action macro-F1', 'Subtask acc.', 'Contact acc.', 'Public role'], rows)}
184
+
185
+ ## Run IDs And Packages
186
+
187
+ {markdown_table(['Version', 'Train run', 'Eval run', 'Role', 'Package'], detail_rows)}
188
+
189
+ ## Related Engineering Artifacts
190
+
191
+ {markdown_table(['Artifact', 'Path', 'Role'], [[row['name'], row['path'], row['role']] for row in payload['related_engineering_artifacts']])}
192
+ """
193
+ OUTPUT_MD.write_text(text, encoding="utf-8")
194
+
195
+
196
+ def main() -> int:
197
+ write_outputs(build_payload())
198
+ print(f"Wrote {OUTPUT_JSON.relative_to(ROOT)}")
199
+ print(f"Wrote {OUTPUT_MD.relative_to(ROOT)}")
200
+ return 0
201
+
202
+
203
+ if __name__ == "__main__":
204
+ raise SystemExit(main())
scripts/sync_hf_publish_mirrors.py CHANGED
@@ -53,7 +53,8 @@ stride, feature manifest, chronological split, and minimal/neural head pattern.
53
  The historical `tier2_task_suite` path is retained only for stable artifact
54
  links to tasks 13-20. Results are organized as two evidence lines:
55
  one public sample episode for task construction and reproducibility, and
56
- 128 selected episodes for same-split baselines plus Qwen3/Cosmos comparison.
 
57
  The line map is published as `docs/assets/charts/two_evidence_line_map.svg`
58
  with machine-readable summaries in `docs/data/two_evidence_lines.json` and
59
  `docs/data/two_evidence_line_result_summary.json`.
@@ -120,6 +121,11 @@ QWEN_COMPARISON_ROW = (
120
  "| Compare Qwen3 v5/v6 diagnostic branches | "
121
  "`docs/data/qwen3_v5_v6_comparison.json` |"
122
  )
 
 
 
 
 
123
  QWEN_ARTIFACT_OLD_BULLET = """- A current verified Qwen3-Omni strict-label v3 held-out package for the
124
  selected 96/16/16 episode split, with 100.00% JSON validity and weak
125
  action/subtask quality documented as the next error-analysis target."""
@@ -460,12 +466,19 @@ def ensure_current_qwen_card_links(hf_root: Path, *, dry_run: bool) -> list[str]
460
  if QWEN_COMPARISON_ROW not in text:
461
  anchor = "| Compare current versions and model groups | `docs/data/omni_model_comparison.json` |"
462
  text = text.replace(anchor, anchor + "\n" + QWEN_COMPARISON_ROW, 1)
 
 
463
  if QWEN_ARTIFACT_OLD_BULLET in text:
464
  text = text.replace(QWEN_ARTIFACT_OLD_BULLET, QWEN_ARTIFACT_CURRENT_BULLET, 1)
465
  if "99.90% JSON validity" not in text:
466
  text = text.rstrip() + "\n\n" + QWEN_ARTIFACT_CURRENT_BULLET + "\n"
467
  if QWEN_COMPARISON_MARKER not in text:
468
  text = text.rstrip() + f"\n\nQwen v5/v6 comparison: `{QWEN_COMPARISON_MARKER}`.\n"
 
 
 
 
 
469
  if text != original:
470
  updated.append("artifacts/README.md")
471
  if not dry_run:
 
53
  The historical `tier2_task_suite` path is retained only for stable artifact
54
  links to tasks 13-20. Results are organized as two evidence lines:
55
  one public sample episode for task construction and reproducibility, and
56
+ 128 selected episodes for same-split baselines plus Qwen3-Omni and Cosmos3
57
+ comparison.
58
  The line map is published as `docs/assets/charts/two_evidence_line_map.svg`
59
  with machine-readable summaries in `docs/data/two_evidence_lines.json` and
60
  `docs/data/two_evidence_line_result_summary.json`.
 
121
  "| Compare Qwen3 v5/v6 diagnostic branches | "
122
  "`docs/data/qwen3_v5_v6_comparison.json` |"
123
  )
124
+ QWEN_LINEAGE_MARKER = "docs/data/qwen3_omni_run_lineage.json"
125
+ QWEN_LINEAGE_ROW = (
126
+ "| Explain Qwen3-Omni v1-v6 run lineage | "
127
+ "`QWEN3_OMNI_RUN_LINEAGE.md`, `docs/data/qwen3_omni_run_lineage.json` |"
128
+ )
129
  QWEN_ARTIFACT_OLD_BULLET = """- A current verified Qwen3-Omni strict-label v3 held-out package for the
130
  selected 96/16/16 episode split, with 100.00% JSON validity and weak
131
  action/subtask quality documented as the next error-analysis target."""
 
466
  if QWEN_COMPARISON_ROW not in text:
467
  anchor = "| Compare current versions and model groups | `docs/data/omni_model_comparison.json` |"
468
  text = text.replace(anchor, anchor + "\n" + QWEN_COMPARISON_ROW, 1)
469
+ if QWEN_LINEAGE_ROW not in text:
470
+ text = text.replace(QWEN_COMPARISON_ROW, QWEN_COMPARISON_ROW + "\n" + QWEN_LINEAGE_ROW, 1)
471
  if QWEN_ARTIFACT_OLD_BULLET in text:
472
  text = text.replace(QWEN_ARTIFACT_OLD_BULLET, QWEN_ARTIFACT_CURRENT_BULLET, 1)
473
  if "99.90% JSON validity" not in text:
474
  text = text.rstrip() + "\n\n" + QWEN_ARTIFACT_CURRENT_BULLET + "\n"
475
  if QWEN_COMPARISON_MARKER not in text:
476
  text = text.rstrip() + f"\n\nQwen v5/v6 comparison: `{QWEN_COMPARISON_MARKER}`.\n"
477
+ if QWEN_LINEAGE_MARKER not in text:
478
+ text = text.rstrip() + (
479
+ f"\n\nQwen3-Omni v1-v6 lineage: `QWEN3_OMNI_RUN_LINEAGE.md` "
480
+ f"and `{QWEN_LINEAGE_MARKER}`.\n"
481
+ )
482
  if text != original:
483
  updated.append("artifacts/README.md")
484
  if not dry_run:
scripts/validate_mirror_parity.py CHANGED
@@ -62,6 +62,7 @@ DATA_FILES = [
62
  "publication_audit.json",
63
  "public_surface_qa.json",
64
  "qwen3_full_parameter_gates.json",
 
65
  "qwen3_v5_v6_comparison.json",
66
  "quality_gates.json",
67
  "raw_sample_files.json",
@@ -189,6 +190,7 @@ SCRIPT_FILES = [
189
  "build_evaluation_protocol.py",
190
  "build_figure_index.py",
191
  "build_quality_gates.py",
 
192
  "build_public_surface_qa.py",
193
  "build_rendered_site_check.py",
194
  "build_interactive_research_roadmap.py",
@@ -324,6 +326,7 @@ DOC_FILES = [
324
  "THREE_FOUNDATION_PIPELINES.md",
325
  "TWO_EVIDENCE_LINES.md",
326
  "TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
 
327
  "PROJECT_BRIEF.md",
328
  "PUBLIC_READER_MAP.md",
329
  "RENDERED_SITE_CHECK.md",
 
62
  "publication_audit.json",
63
  "public_surface_qa.json",
64
  "qwen3_full_parameter_gates.json",
65
+ "qwen3_omni_run_lineage.json",
66
  "qwen3_v5_v6_comparison.json",
67
  "quality_gates.json",
68
  "raw_sample_files.json",
 
190
  "build_evaluation_protocol.py",
191
  "build_figure_index.py",
192
  "build_quality_gates.py",
193
+ "build_qwen3_omni_run_lineage.py",
194
  "build_public_surface_qa.py",
195
  "build_rendered_site_check.py",
196
  "build_interactive_research_roadmap.py",
 
326
  "THREE_FOUNDATION_PIPELINES.md",
327
  "TWO_EVIDENCE_LINES.md",
328
  "TWO_EVIDENCE_LINE_RESULT_SUMMARY.md",
329
+ "QWEN3_OMNI_RUN_LINEAGE.md",
330
  "PROJECT_BRIEF.md",
331
  "PUBLIC_READER_MAP.md",
332
  "RENDERED_SITE_CHECK.md",