Robotics
PyTorch
Cosmos
xperience10m_task_baseline_suite
embodied-ai
multimodal
xperience-10m
baseline
evaluation
qwen3-omni
Instructions to use cy0307/ropedia-xperience-10m-task-baselines with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Cosmos
How to use cy0307/ropedia-xperience-10m-task-baselines with Cosmos:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
| # Xperience-10M HF Metadata Audit | |
| Metadata-only analysis of the gated Hugging Face dataset. No MP4, HDF5, RRD, or model files were downloaded. | |
| ## Access and Source | |
| - Repo: `ropedia-ai/xperience-10m` | |
| - Repo SHA: `ce943cf271a758b60240084892d05cf6dc12dd90` | |
| - Last modified: `2026-04-21T05:03:45+00:00` | |
| - Gated mode: `manual` | |
| - Pretty name: `Xperience-10M` | |
| - License field: `other` | |
| - HF size category: `1M<n<10M` | |
| - Tags: `egocentric, first-person, multimodal, 3d, 4d, embodied-ai, robotics, human-motion, mocap, imu, audio, depth, captions, video` | |
| ## Current Hub File Metadata | |
| | Measure | Value | | |
| | --- | --- | | |
| | Files listed by API | 85,257 | | |
| | Total bytes from file metadata | 25.52 TiB (28,057,584,187,079 bytes) | | |
| | Bytes excluding visualization.rrd | 24.63 TiB (27,083,292,060,675 bytes) | | |
| | visualization.rrd bytes | 907.38 GiB (974,292,126,404 bytes) | | |
| | Top-level session folders | 804 | | |
| | Episode-like folders | 12,103 | | |
| ## File Composition | |
| | File type | Count | | |
| | --- | --- | | |
| | .hdf5 | 12,103 | | |
| | .md | 1 | | |
| | .mp4 | 72,612 | | |
| | .rrd | 541 | | |
| ## Episode Completeness | |
| | Measure | Value | | |
| | --- | --- | | |
| | annotation.hdf5 files | 12,103 | | |
| | MP4 files | 72,612 | | |
| | visualization.rrd files | 541 | | |
| | Complete episodes: annotation + all six MP4 views | 12,102 (99.9917%) | | |
| | Degraded-valid episodes: annotation + fisheye_cam0 | 12,102 (99.9917%) | | |
| | Sessions with complete episodes | 802 | | |
| | Video-count histogram per episode | {"0": 1, "6": 12102} | | |
| ## Episode Size Distribution | |
| | Statistic | Training bytes per complete episode, excluding visualization.rrd | | |
| | --- | --- | | |
| | Min | 7.78 MiB | | |
| | P25 | 2.13 GiB | | |
| | Median | 2.20 GiB | | |
| | P75 | 2.25 GiB | | |
| | Mean | 2.08 GiB | | |
| | Max | 2.53 GiB | | |
| ## Annotation File Size Distribution | |
| | Statistic | annotation.hdf5 size | | |
| | --- | --- | | |
| | Min | 6.38 MiB | | |
| | P25 | 1.74 GiB | | |
| | Median | 1.83 GiB | | |
| | P75 | 1.85 GiB | | |
| | Mean | 1.70 GiB | | |
| | Max | 1.86 GiB | | |
| ## Pilot Scale Estimates | |
| | Pilot | Episodes | Max windows at 256/episode | Storage estimate | | |
| | --- | --- | --- | --- | | |
| | 32-episode smallest one-per-session | 32 | 8192 | 35.35 GiB | | |
| | 32-episode median-sized estimate | 32 | 8192 | 70.51 GiB | | |
| | 32-episode mean-sized estimate | 32 | 8192 | 66.69 GiB | | |
| | 100-episode pilot | 100 | 25600 | roughly 220.34 GiB at median episode size | | |
| | 500-episode pilot | 500 | 128000 | roughly 1.08 TiB at median episode size | | |
| | All complete visible HF episodes | 12102 | 3098112 | 24.63 TiB | | |
| ## Incomplete Episode Records | |
| [ | |
| { | |
| "episode_path": "dc3f4139-f499-4de7-b057-e25b7dfb2d83/ep1", | |
| "episode_id": "ep1", | |
| "top_level_session": "dc3f4139-f499-4de7-b057-e25b7dfb2d83", | |
| "file_count": 1, | |
| "total_bytes": 1418232696, | |
| "training_bytes_excluding_visualization_rrd": 1418232696, | |
| "has_annotation": true, | |
| "has_fisheye_cam0": false, | |
| "video_count": 0, | |
| "has_all_six_videos": false, | |
| "is_degraded_valid": false, | |
| "is_complete": false, | |
| "has_visualization_rrd": false, | |
| "missing_required_files": [ | |
| "fisheye_cam0.mp4", | |
| "fisheye_cam1.mp4", | |
| "fisheye_cam2.mp4", | |
| "fisheye_cam3.mp4", | |
| "stereo_left.mp4", | |
| "stereo_right.mp4" | |
| ] | |
| } | |
| ] | |
| ## Download and Compute Recommendation | |
| - This metadata listing check can run on any machine with Hugging Face access. | |
| - If the training host cannot reach Hugging Face, download on an HF-reachable machine, then transfer staged episode folders to the training host. | |
| - For training downloads, include `annotation.hdf5` plus the six MP4 streams; exclude `visualization.rrd` unless Rerun visualization is specifically needed. | |
| - For the first real training pilot, prefer 32 complete episodes from different top-level sessions and avoid selecting only the tiny outlier episodes. | |
| - The training host is used after staged data exists: manifest validation, preprocessing, LoRA training, and held-out evaluation. | |