ropedia-xperience-10m-task-baselines / results /omni_finetune /MULTI_EPISODE_ACCESS_STATUS.md
cy0307's picture
Publish Ropedia Xperience-10M task baseline cards
b7a466b verified
|
Raw
History Blame
1.48 kB

Multi-Episode Access Status

Current status: access to the gated full ropedia-ai/xperience-10m dataset is still pending approval from the dataset authors.

This file records the public data-access status and pilot requirements. It does not include local-machine aliases, private paths, SSH hosts, or token locations.

Selection Plan

Item Value
Dataset ropedia-ai/xperience-10m
Target 32 complete leaf episodes
Strategy stratified round-robin across top-level session UUIDs
Candidate scan first 64 top-level session UUIDs
Valid candidates 680
Selected sessions 32
Minimum episode size 0.25 GB
Estimated bytes 72,031,620,552
Excluded file visualization.rrd

Current Stage

The current Qwen3-Omni artifacts come from the locally available sample data. The 32-episode held-out model-quality run starts after the selected episodes are available locally.

A real 32-episode pilot can be claimed only after:

  • at least 32 valid episodes are available locally,
  • the manifest builder confirms complete held-out episode splits,
  • training finishes with recorded metadata and progress logs,
  • evaluation runs on held-out test episodes,
  • predictions, metrics, confusion matrices, and a run report are committed.

The reader-facing data access summary is:

results/omni_finetune/DATA_ACCESS_STATUS.md

The machine-generated discovery report remains:

results/omni_finetune/DATA_BLOCKER_REPORT.md