Figure Index
This file is generated by scripts/build_figure_index.py. It catalogs
the public visual assets used by the repo, website, and Hugging Face mirrors.
Current status: pass
Public figures, diagrams, charts, and derived modality thumbnails. Raw Xperience-10M videos, annotations, RRD files, and Qwen weights are excluded.
Figures
| Figure | Path | Size | Source script | Role |
|---|---|---|---|---|
| Project logo mark | docs/assets/brand/xperience10m-logo-mark-512.png |
512 x 512 | scripts/build_brand_assets.py |
Primary X-shaped multimodal camera mark used for the website header, README, HF cards, and brand identity. |
| Project logo social card | docs/assets/brand/xperience10m-logo-social-card.png |
1200 x 630 | scripts/build_brand_assets.py |
Large preview image for README, Hugging Face cards, and Open Graph/Twitter social sharing. |
| Project favicon | docs/assets/brand/xperience10m-logo-favicon-64.png |
64 x 64 | scripts/build_brand_assets.py |
Small dark-tile logo for browser tabs and compact navigation. |
| Original task-suite infographic | docs/assets/task_suite_infographic.png |
1800 x 7600 | scripts/render_task_suite_infographic.py |
Primary visual map of the original task families, verified metrics, and sample modalities; the unified public suite is now documented as 20 tasks. |
| Episode-to-task pipeline diagram | docs/assets/pipeline_diagram.png |
1800 x 1120 | scripts/generate_visualizations.py |
End-to-end data processing and evaluation pipeline overview. |
| Qwen3-Omni LoRA training pipeline | docs/assets/qwen3_omni_lora_pipeline.png |
1536 x 1024 | docs/assets/qwen3_omni_lora_pipeline.prompt.md |
Detailed raw-data-to-adapter flow for staged Xperience-10M Qwen3-Omni LoRA training. |
| Spatial intelligence slide diagram | docs/assets/foundation-pipelines/spatial-intelligence-pipeline.png |
2560 x 1920 | scripts/render_foundation_pipeline_diagrams.py |
High-resolution slide diagram for the spatial intelligence pipeline track. |
| Human-video world model slide diagram | docs/assets/foundation-pipelines/human-video-world-model-pipeline.png |
2560 x 1920 | scripts/render_foundation_pipeline_diagrams.py |
High-resolution slide diagram for the human-video world-model pipeline track. |
| Vision-language-action slide diagram | docs/assets/foundation-pipelines/vision-language-action-pipeline.png |
2560 x 1920 | scripts/render_foundation_pipeline_diagrams.py |
High-resolution slide diagram for the VLA/action-policy pipeline track. |
| Minimal and neural task architecture map | docs/assets/task_architectures.png |
1800 x 2450 | scripts/render_overview_figures.py |
Minimal and neural heads for the original task contracts and shared feature contracts. |
| Video modality thumbnail | docs/assets/modalities/video.jpg |
880 x 520 | scripts/export_modality_atlas_assets.py |
Derived thumbnail for synchronized camera streams. |
| Audio modality thumbnail | docs/assets/modalities/audio.png |
880 x 520 | scripts/export_modality_atlas_assets.py |
Derived waveform thumbnail for the MP4 AAC stream. |
| Depth modality thumbnail | docs/assets/modalities/depth.jpg |
880 x 520 | scripts/export_modality_atlas_assets.py |
Derived depth and confidence thumbnail. |
| Pose / SLAM modality thumbnail | docs/assets/modalities/pose_slam.png |
880 x 520 | scripts/export_modality_atlas_assets.py |
Derived camera trajectory and sparse map thumbnail. |
| Motion capture modality thumbnail | docs/assets/modalities/motion_capture.png |
880 x 520 | scripts/export_modality_atlas_assets.py |
Derived body and hand motion-capture thumbnail. |
| Inertial modality thumbnail | docs/assets/modalities/inertial.png |
880 x 520 | scripts/export_modality_atlas_assets.py |
Derived accelerometer and gyroscope trace thumbnail. |
| Language modality thumbnail | docs/assets/modalities/language.png |
880 x 520 | scripts/export_modality_atlas_assets.py |
Derived object-tag and caption thumbnail. |
| Model macro-F1 comparison chart | docs/assets/charts/model_macro_f1.svg |
1100 x 284 | scripts/generate_visualizations.py |
Minimal-vs-neural classification score comparison. |
| Neural MLP task score chart | docs/assets/charts/episode_task_scores_neural_mlp.svg |
1100 x 556 | scripts/generate_visualizations.py |
Neural MLP metric snapshot across the task suite. |
| Minimal-vs-neural task score chart | docs/assets/charts/episode_task_scores_minimal_vs_neural.svg |
1100 x 964 | scripts/generate_visualizations.py |
Side-by-side baseline comparison over the same window contracts. |
| Research direction coverage chart | docs/assets/charts/research_direction_coverage.svg |
1180 x 700 | scripts/generate_visualizations.py |
Four-track coverage map for Ropedia research directions. |
| Research direction extension chart | docs/assets/charts/research_direction_extension_tasks.svg |
1420 x 920 | scripts/generate_visualizations.py |
Four coded extension probes, one per Ropedia research direction. |
| Tasks 13-20 baseline chart | docs/assets/charts/tier2_task_suite.svg |
1440 x 832 | scripts/tier2_task_suite.py |
Eight additional sample-supported tasks in the unified 20-task suite with aligned minimal and neural baseline metrics. |
| Unified 20-task model radar | docs/assets/charts/unified_task_model_radar.svg |
2400 x 1840 | scripts/build_unified_task_model_radar.py |
Twenty-axis direction-aware comparison of minimal and neural MLP baselines, with 128-episode metadata, Qwen3, and Cosmos task-aligned overlay points and branch notes. |
| Single-episode 20-task model radar | docs/assets/charts/single_episode_task_model_radar.svg |
2400 x 1840 | scripts/build_unified_task_model_radar.py |
Twenty-axis split radar for the one public-sample episode, comparing Minimal and Neural MLP as two complete 20/20 scored polygons. |
| 128-episode 20-task model radar | docs/assets/charts/episode128_task_model_radar.svg |
2400 x 1840 | scripts/build_unified_task_model_radar.py |
Twenty-axis split radar for selected 128-episode methods: raw-feature simple/NN as complete scored polygons plus metadata, Qwen3-Omni, Cosmos3-Super, and Cosmos3-Nano task-aligned overlays. |
| Feature block chart | docs/assets/charts/feature_blocks.svg |
1100 x 760 | scripts/generate_visualizations.py |
Feature allocation by modality block. |
| Minimal task score chart | docs/assets/charts/episode_task_scores.svg |
1100 x 556 | scripts/generate_visualizations.py |
Minimal baseline metric snapshot across the task suite. |
| Cross-modal retrieval chart | docs/assets/charts/cross_modal_retrieval.svg |
1100 x 284 | scripts/generate_visualizations.py |
Retrieval behavior chart for the cross-modal task. |
Use and Scope
- These figures are derived presentation artifacts or small thumbnails.
- The index records file hashes and dimensions for reproducibility checks.
- Raw Xperience-10M MP4/HDF5/RRD files and full model weights are not redistributed.