cy0307's picture
Publish Ropedia Xperience-10M task baseline cards
fc9e8cf verified
|
Raw
History Blame
4.95 kB

Figure Index

This file is generated by scripts/build_figure_index.py. It catalogs the public visual assets used by the repo, website, and Hugging Face mirrors.

Current status: pass

Public figures, diagrams, charts, and derived modality thumbnails. Raw Xperience-10M videos, annotations, RRD files, and Qwen weights are excluded.

Figures

Figure Path Size Source script Role
Project logo mark docs/assets/brand/xperience10m-logo-mark-512.png 512 x 512 scripts/build_brand_assets.py Primary X-shaped multimodal camera mark used for the website header, README, HF cards, and brand identity.
Project logo social card docs/assets/brand/xperience10m-logo-social-card.png 1200 x 630 scripts/build_brand_assets.py Large preview image for README, Hugging Face cards, and Open Graph/Twitter social sharing.
Project favicon docs/assets/brand/xperience10m-logo-favicon-64.png 64 x 64 scripts/build_brand_assets.py Small dark-tile logo for browser tabs and compact navigation.
12-task suite infographic docs/assets/task_suite_infographic.png 1800 x 6600 scripts/render_task_suite_infographic.py Primary visual map of the task suite, verified metrics, and sample modalities.
Episode-to-task pipeline diagram docs/assets/pipeline_diagram.png 1800 x 1120 scripts/generate_visualizations.py End-to-end data processing and evaluation pipeline overview.
Qwen3-Omni LoRA training pipeline docs/assets/qwen3_omni_lora_pipeline.png 1536 x 1024 docs/assets/qwen3_omni_lora_pipeline.prompt.md Detailed raw-data-to-adapter flow for selected Xperience-10M Qwen3-Omni LoRA training.
Minimal and neural task architecture map docs/assets/task_architectures.png 1800 x 2450 scripts/render_overview_figures.py All 12 task heads and shared feature contracts.
Video modality thumbnail docs/assets/modalities/video.jpg 880 x 520 scripts/export_modality_atlas_assets.py Derived thumbnail for synchronized camera streams.
Audio modality thumbnail docs/assets/modalities/audio.png 880 x 520 scripts/export_modality_atlas_assets.py Derived waveform thumbnail for the MP4 AAC stream.
Depth modality thumbnail docs/assets/modalities/depth.jpg 880 x 520 scripts/export_modality_atlas_assets.py Derived depth and confidence thumbnail.
Pose / SLAM modality thumbnail docs/assets/modalities/pose_slam.png 880 x 520 scripts/export_modality_atlas_assets.py Derived camera trajectory and sparse map thumbnail.
Motion capture modality thumbnail docs/assets/modalities/motion_capture.png 880 x 520 scripts/export_modality_atlas_assets.py Derived body and hand motion-capture thumbnail.
Inertial modality thumbnail docs/assets/modalities/inertial.png 880 x 520 scripts/export_modality_atlas_assets.py Derived accelerometer and gyroscope trace thumbnail.
Language modality thumbnail docs/assets/modalities/language.png 880 x 520 scripts/export_modality_atlas_assets.py Derived object-tag and caption thumbnail.
Model macro-F1 comparison chart docs/assets/charts/model_macro_f1.svg 1100 x 284 scripts/generate_visualizations.py Minimal-vs-neural classification score comparison.
Neural MLP task score chart docs/assets/charts/episode_task_scores_neural_mlp.svg 1100 x 556 scripts/generate_visualizations.py Neural MLP metric snapshot across the task suite.
Minimal-vs-neural task score chart docs/assets/charts/episode_task_scores_minimal_vs_neural.svg 1100 x 964 scripts/generate_visualizations.py Side-by-side baseline comparison over the same window contracts.
Research direction coverage chart docs/assets/charts/research_direction_coverage.svg 1180 x 700 scripts/generate_visualizations.py Four-track coverage map for Ropedia research directions.
Research direction extension chart docs/assets/charts/research_direction_extension_tasks.svg 1420 x 920 scripts/generate_visualizations.py Four coded extension probes, one per Ropedia research direction.
Feature block chart docs/assets/charts/feature_blocks.svg 1100 x 760 scripts/generate_visualizations.py Feature allocation by modality block.
Minimal task score chart docs/assets/charts/episode_task_scores.svg 1100 x 556 scripts/generate_visualizations.py Minimal baseline metric snapshot across the task suite.
Cross-modal retrieval chart docs/assets/charts/cross_modal_retrieval.svg 1100 x 284 scripts/generate_visualizations.py Retrieval behavior chart for the cross-modal task.

Use and Scope

  • These figures are derived presentation artifacts or small thumbnails.
  • The index records file hashes and dimensions for reproducibility checks.
  • Raw Xperience-10M MP4/HDF5/RRD files and full model weights are not redistributed.