ropedia-xperience-10m-task-baselines / TASK_METHOD_20_GAP_AUDIT.md
cy0307's picture
Add files using upload-large-folder tool
17c38d5 verified
|
Raw
History Blame
7.3 kB

Task Method 20-Result Gap Audit

Generated: 2026-06-20T04:24:42+00:00

This audit is the explicit gap ledger for the 9-method x 20-task result matrix. It keeps missing cells visible while preserving the rule that a numeric score requires a real task target and source artifact.

Score Summary

Method Coverage

Method ID Scored Scoreless Proxy Status counts
Minimal minimal 20/20 0 0 scored: 20
Neural MLP neural_mlp 20/20 0 0 scored: 20
128ep Aligned Simple metadata128_simple 19/20 1 0 scored: 19, unsupported_without_required_target: 1
128ep Aligned NN metadata128_neural_mlp 19/20 1 0 not_supported_by_metadata_only_package: 1, scored: 19
128ep Raw Simple raw128_simple 20/20 0 2 proxy_scored: 2, scored: 18
128ep Raw NN raw128_neural_mlp 20/20 0 2 proxy_scored: 2, scored: 18
Qwen3-Omni v6 LoRA qwen3_omni_v6_lora 20/20 0 0 scored: 20
Cosmos3-Super Reasoner cosmos3_super_reasoner 10/20 10 0 not_evaluated_in_verified_package: 10, scored: 10
Cosmos3-Nano Future Window cosmos3_nano_future_window 11/20 9 0 not_evaluated_in_verified_package: 9, scored: 11

Gap Classes

Status Count Next step
not_evaluated_in_verified_package 19 Generate verified model outputs for this task contract and score them against the held-out labels.
not_supported_by_metadata_only_package 1 Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
unsupported_without_required_target 1 Export the missing target field for this 128-episode method, then rerun the same train/validation/test split.

Scoreless Records

Task Task label Method Status Required evidence
02 Procedure Step Recognition Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
05 Hand Trajectory Forecasting Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
05 Hand Trajectory Forecasting Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
07 Object Relevance Prediction Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
08 Language Grounding Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
09 Cross-Modal Retrieval Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
10 Cross-Modal Reconstruction Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
11 Temporal Order Verification Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
11 Temporal Order Verification Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
12 Multimodal Synchronization Detection Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
12 Multimodal Synchronization Detection Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
14 Long-Horizon Next-Subtask Forecasting Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
15 Interaction Text Prediction Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
15 Interaction Text Prediction Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
17 Future Object-Set Forecasting Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
18 IMU-to-Hand Pose Reconstruction Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
18 IMU-to-Hand Pose Reconstruction Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
19 Camera-View Synchronization Retrieval 128ep Aligned Simple unsupported Export the missing target field for this 128-episode method, then rerun the same train/validation/test split.
19 Camera-View Synchronization Retrieval 128ep Aligned NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
19 Camera-View Synchronization Retrieval Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
19 Camera-View Synchronization Retrieval Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.

Proxy Records

Task Task label Method Metric Proxy note
15 Interaction Text Prediction 128ep Raw Simple macro_f1 documented compact proxy completion for this raw128 task axis
15 Interaction Text Prediction 128ep Raw NN macro_f1 documented compact proxy completion for this raw128 task axis
19 Camera-View Synchronization Retrieval 128ep Raw Simple mrr documented compact proxy completion for this raw128 task axis
19 Camera-View Synchronization Retrieval 128ep Raw NN mrr documented compact proxy completion for this raw128 task axis

Immediate Actions