Robotics
PyTorch
Cosmos
xperience10m_task_baseline_suite
embodied-ai
multimodal
xperience-10m
baseline
evaluation
qwen3-omni
Instructions to use cy0307/ropedia-xperience-10m-task-baselines with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Cosmos
How to use cy0307/ropedia-xperience-10m-task-baselines with Cosmos:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
| <svg xmlns="http://www.w3.org/2000/svg" width="1400" height="760" viewBox="0 0 1400 760"> | |
| <defs><pattern id="dotgrid" width="18" height="18" patternUnits="userSpaceOnUse"><circle cx="2" cy="2" r="1.2" fill="#ccffa0" opacity="0.20"/></pattern><marker id="arrow" viewBox="0 0 10 10" refX="8" refY="5" markerWidth="7" markerHeight="7" orient="auto-start-reverse"><path d="M 0 0 L 10 5 L 0 10 z" fill="#ccffa0" fill-opacity="0.72"/></marker></defs> | |
| <rect width="100%" height="100%" fill="#020502"/> | |
| <rect x="0" y="0" width="1400" height="760" fill="#020502"/> | |
| <rect x="0" y="0" width="1400" height="760" fill="url(#dotgrid)" opacity="0.55"/> | |
| <circle cx="1120" cy="132" r="170" fill="#ccffa0" opacity="0.10"/> | |
| <text x="60" y="58" font-family="Inter Tight, Arial, sans-serif" font-size="32" font-weight="800" fill="#f4f8ef">Verified Ropedia Xperience-10M Pipeline</text> | |
| <text x="60" y="88" font-family="Space Grotesk, Arial, sans-serif" font-size="16" fill="#a5afa2">Generated from committed scripts, the unified 20-task index, and traceable metrics.</text> | |
| <line x1="310" y1="176" x2="365" y2="176" stroke="#ccffa0" stroke-opacity="0.54" stroke-width="3" marker-end="url(#arrow)"/> | |
| <line x1="615" y1="176" x2="670" y2="176" stroke="#ccffa0" stroke-opacity="0.54" stroke-width="3" marker-end="url(#arrow)"/> | |
| <line x1="920" y1="176" x2="975" y2="176" stroke="#ccffa0" stroke-opacity="0.54" stroke-width="3" marker-end="url(#arrow)"/> | |
| <line x1="215" y1="242" x2="240" y2="380" stroke="#ccffa0" stroke-opacity="0.54" stroke-width="3" marker-end="url(#arrow)"/> | |
| <line x1="1095" y1="242" x2="700" y2="380" stroke="#ccffa0" stroke-opacity="0.54" stroke-width="3" marker-end="url(#arrow)"/> | |
| <line x1="420" y1="464" x2="520" y2="464" stroke="#ccffa0" stroke-opacity="0.54" stroke-width="3" marker-end="url(#arrow)"/> | |
| <line x1="880" y1="464" x2="980" y2="464" stroke="#ccffa0" stroke-opacity="0.54" stroke-width="3" marker-end="url(#arrow)"/> | |
| <rect x="60" y="110" width="250" height="132" rx="8" fill="#061006" stroke="#ccffa0" stroke-opacity="0.26" stroke-width="2"/> | |
| <rect x="60" y="110" width="8" height="132" rx="4" fill="#9bdfff"/> | |
| <text x="84" y="144" font-family="Inter Tight, Arial, sans-serif" font-size="18" font-weight="800" fill="#f4f8ef">1. Raw public sample</text> | |
| <text x="84" y="176" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">annotation.hdf5</text> | |
| <text x="84" y="198" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">6 MP4 videos with audio</text> | |
| <text x="84" y="220" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">5,821 aligned frames</text> | |
| <rect x="365" y="110" width="250" height="132" rx="8" fill="#061006" stroke="#ccffa0" stroke-opacity="0.26" stroke-width="2"/> | |
| <rect x="365" y="110" width="8" height="132" rx="4" fill="#7ae5c3"/> | |
| <text x="389" y="144" font-family="Inter Tight, Arial, sans-serif" font-size="18" font-weight="800" fill="#f4f8ef">2. HOMIE loader</text> | |
| <text x="389" y="176" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">video, depth, pose</text> | |
| <text x="389" y="198" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">mocap, IMU, language</text> | |
| <text x="389" y="220" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">audio features</text> | |
| <rect x="670" y="110" width="250" height="132" rx="8" fill="#061006" stroke="#ccffa0" stroke-opacity="0.26" stroke-width="2"/> | |
| <rect x="670" y="110" width="8" height="132" rx="4" fill="#ccffa0"/> | |
| <text x="694" y="144" font-family="Inter Tight, Arial, sans-serif" font-size="18" font-weight="800" fill="#f4f8ef">3. Window builder</text> | |
| <text x="694" y="176" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">20-frame windows</text> | |
| <text x="694" y="198" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">5-frame stride</text> | |
| <text x="694" y="220" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">1,161 windows</text> | |
| <rect x="975" y="110" width="300" height="132" rx="8" fill="#061006" stroke="#ccffa0" stroke-opacity="0.26" stroke-width="2"/> | |
| <rect x="975" y="110" width="8" height="132" rx="4" fill="#d8f4a5"/> | |
| <text x="999" y="144" font-family="Inter Tight, Arial, sans-serif" font-size="18" font-weight="800" fill="#f4f8ef">4. Feature vector</text> | |
| <text x="999" y="176" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">8,546 dimensions</text> | |
| <text x="999" y="198" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">18 named blocks</text> | |
| <text x="999" y="220" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">audio represented</text> | |
| <text x="999" y="242" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">stored manifest</text> | |
| <rect x="60" y="380" width="360" height="168" rx="8" fill="#061006" stroke="#ccffa0" stroke-opacity="0.26" stroke-width="2"/> | |
| <rect x="60" y="380" width="8" height="168" rx="4" fill="#9bdfff"/> | |
| <text x="84" y="414" font-family="Inter Tight, Arial, sans-serif" font-size="18" font-weight="800" fill="#f4f8ef">5. Baseline models</text> | |
| <text x="84" y="446" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">motion-only action/subtask</text> | |
| <text x="84" y="468" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">current all-feature action/subtask</text> | |
| <text x="84" y="490" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">numpy softmax classifier</text> | |
| <text x="84" y="512" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">metrics and predictions</text> | |
| <rect x="520" y="380" width="360" height="168" rx="8" fill="#061006" stroke="#ccffa0" stroke-opacity="0.26" stroke-width="2"/> | |
| <rect x="520" y="380" width="8" height="168" rx="4" fill="#7ae5c3"/> | |
| <text x="544" y="414" font-family="Inter Tight, Arial, sans-serif" font-size="18" font-weight="800" fill="#f4f8ef">6. Ropedia Xperience-10M suite</text> | |
| <text x="544" y="446" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">20 unified task contracts</text> | |
| <text x="544" y="468" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">chronological split</text> | |
| <text x="544" y="490" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">retrieval, forecast, alignment</text> | |
| <text x="544" y="512" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">per-task artifacts</text> | |
| <rect x="980" y="380" width="300" height="168" rx="8" fill="#061006" stroke="#ccffa0" stroke-opacity="0.26" stroke-width="2"/> | |
| <rect x="980" y="380" width="8" height="168" rx="4" fill="#ccffa0"/> | |
| <text x="1004" y="414" font-family="Inter Tight, Arial, sans-serif" font-size="18" font-weight="800" fill="#f4f8ef">7. Published artifacts</text> | |
| <text x="1004" y="446" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">results/**/*.json/csv/npz</text> | |
| <text x="1004" y="468" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">docs/data/summary_metrics.json</text> | |
| <text x="1004" y="490" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">GitHub Pages dashboard</text> | |
| <text x="1004" y="512" font-family="Space Grotesk, Arial, sans-serif" font-size="14" fill="#dce8d7">reproducibility check</text> | |
| <rect x="60" y="620" width="1220" height="96" rx="8" fill="#071207" stroke="#ccffa0" stroke-opacity="0.24"/> | |
| <text x="84" y="650" font-family="Space Grotesk, Arial, sans-serif" font-size="15" fill="#dce8d7">Reproduction check: rerunning scripts in a temporary local workspace reproduced committed metrics exactly.</text> | |
| <text x="84" y="674" font-family="Space Grotesk, Arial, sans-serif" font-size="15" fill="#dce8d7">Modality check: sample covers video, audio, depth, pose/SLAM, mocap, IMU, and language annotation.</text> | |
| <text x="84" y="698" font-family="Space Grotesk, Arial, sans-serif" font-size="15" fill="#dce8d7">Feature check: current manifest has synchronized video, audio, depth, pose, mocap, IMU, and language groups.</text> | |
| <text x="84" y="722" font-family="Space Grotesk, Arial, sans-serif" font-size="15" fill="#dce8d7">Scope check: this validates one public sample episode, not cross-episode generalization.</text> | |
| </svg> |