ropedia-xperience-10m-task-baselines / TASK_METHOD_20_GAP_AUDIT.md
cy0307's picture
Add files using upload-large-folder tool
322d85b verified
|
Raw
History Blame
15 kB

Task Method 20-Result Gap Audit

Generated: 2026-06-18T06:34:16+00:00

This audit is the explicit gap ledger for the 9-method x 20-task result matrix. It keeps missing cells visible while preserving the rule that a numeric score requires a real task target and source artifact.

Score Summary

Method Coverage

Method ID Scored Scoreless Proxy Status counts
Minimal minimal 20/20 0 0 scored: 20
Neural MLP neural_mlp 20/20 0 0 scored: 20
128ep Metadata Simple metadata128_simple 8/20 12 0 not_supported_by_metadata_only_package: 8, scored: 8, unsupported_without_required_target: 4
128ep Metadata NN metadata128_neural_mlp 6/20 14 0 not_supported_by_metadata_only_package: 14, scored: 6
128ep Raw Simple raw128_simple 20/20 0 2 proxy_scored: 2, scored: 18
128ep Raw NN raw128_neural_mlp 20/20 0 2 proxy_scored: 2, scored: 18
Qwen3-Omni v6 LoRA qwen3_omni_v6_lora 14/20 6 0 not_evaluated_in_verified_package: 6, scored: 14
Cosmos3-Super Reasoner cosmos3_super_reasoner 7/20 13 0 not_evaluated_in_verified_package: 13, scored: 7
Cosmos3-Nano Future Window cosmos3_nano_future_window 5/20 15 0 not_evaluated_in_verified_package: 15, scored: 5

Gap Classes

Status Count Next step
not_evaluated_in_verified_package 34 Generate verified model outputs for this task contract and score them against the held-out labels.
not_supported_by_metadata_only_package 22 Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
unsupported_without_required_target 4 Export the missing target field for this 128-episode method, then rerun the same train/validation/test split.

Scoreless Records

Task Task label Method Status Required evidence
02 Procedure Step Recognition Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
05 Hand Trajectory Forecasting 128ep Metadata Simple unsupported Export the missing target field for this 128-episode method, then rerun the same train/validation/test split.
05 Hand Trajectory Forecasting 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
05 Hand Trajectory Forecasting Qwen3-Omni v6 LoRA not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
05 Hand Trajectory Forecasting Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
05 Hand Trajectory Forecasting Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
07 Object Relevance Prediction Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
08 Language Grounding 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
08 Language Grounding Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
08 Language Grounding Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
09 Cross-Modal Retrieval 128ep Metadata Simple unsupported Export the missing target field for this 128-episode method, then rerun the same train/validation/test split.
09 Cross-Modal Retrieval 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
09 Cross-Modal Retrieval Qwen3-Omni v6 LoRA not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
09 Cross-Modal Retrieval Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
10 Cross-Modal Reconstruction 128ep Metadata Simple unsupported Export the missing target field for this 128-episode method, then rerun the same train/validation/test split.
10 Cross-Modal Reconstruction 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
10 Cross-Modal Reconstruction Qwen3-Omni v6 LoRA not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
10 Cross-Modal Reconstruction Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
10 Cross-Modal Reconstruction Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
11 Temporal Order Verification 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
11 Temporal Order Verification Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
11 Temporal Order Verification Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
12 Multimodal Synchronization Detection 128ep Metadata Simple unsupported Export the missing target field for this 128-episode method, then rerun the same train/validation/test split.
12 Multimodal Synchronization Detection 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
12 Multimodal Synchronization Detection Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
12 Multimodal Synchronization Detection Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
13 Long-Horizon Next-Action Forecasting 128ep Metadata Simple not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
13 Long-Horizon Next-Action Forecasting 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
13 Long-Horizon Next-Action Forecasting Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
13 Long-Horizon Next-Action Forecasting Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
14 Long-Horizon Next-Subtask Forecasting 128ep Metadata Simple not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
14 Long-Horizon Next-Subtask Forecasting 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
14 Long-Horizon Next-Subtask Forecasting Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
14 Long-Horizon Next-Subtask Forecasting Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
15 Interaction Text Prediction 128ep Metadata Simple not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
15 Interaction Text Prediction 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
15 Interaction Text Prediction Qwen3-Omni v6 LoRA not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
15 Interaction Text Prediction Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
15 Interaction Text Prediction Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
16 Action-Object Relation Prediction 128ep Metadata Simple not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
16 Action-Object Relation Prediction 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
16 Action-Object Relation Prediction Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
17 Future Object-Set Forecasting 128ep Metadata Simple not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
17 Future Object-Set Forecasting 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
17 Future Object-Set Forecasting Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
17 Future Object-Set Forecasting Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
18 IMU-to-Hand Pose Reconstruction 128ep Metadata Simple not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
18 IMU-to-Hand Pose Reconstruction 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
18 IMU-to-Hand Pose Reconstruction Qwen3-Omni v6 LoRA not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
18 IMU-to-Hand Pose Reconstruction Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
18 IMU-to-Hand Pose Reconstruction Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
19 Camera-View Synchronization Retrieval 128ep Metadata Simple not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
19 Camera-View Synchronization Retrieval 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
19 Camera-View Synchronization Retrieval Qwen3-Omni v6 LoRA not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
19 Camera-View Synchronization Retrieval Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
19 Camera-View Synchronization Retrieval Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
20 Time-to-Next-Transition Regression 128ep Metadata Simple not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
20 Time-to-Next-Transition Regression 128ep Metadata NN not supported Run the task with raw sensor-feature blocks or add a task-specific metadata target builder before assigning a numeric score.
20 Time-to-Next-Transition Regression Cosmos3-Super Reasoner not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.
20 Time-to-Next-Transition Regression Cosmos3-Nano Future Window not evaluated Generate verified model outputs for this task contract and score them against the held-out labels.

Proxy Records

Task Task label Method Metric Proxy note
15 Interaction Text Prediction 128ep Raw Simple macro_f1 documented compact proxy completion for this raw128 task axis
15 Interaction Text Prediction 128ep Raw NN macro_f1 documented compact proxy completion for this raw128 task axis
19 Camera-View Synchronization Retrieval 128ep Raw Simple mrr documented compact proxy completion for this raw128 task axis
19 Camera-View Synchronization Retrieval 128ep Raw NN mrr documented compact proxy completion for this raw128 task axis

Immediate Actions