File size: 7,167 Bytes
4c17660
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
2026-02-11 07:40:09,452 INFO    MainThread:1284257 [wandb_setup.py:_flush():77] Current SDK version is 0.17.9
2026-02-11 07:40:09,452 INFO    MainThread:1284257 [wandb_setup.py:_flush():77] Configure stats pid to 1284257
2026-02-11 07:40:09,452 INFO    MainThread:1284257 [wandb_setup.py:_flush():77] Loading settings from /home/x_fahkh/.config/wandb/settings
2026-02-11 07:40:09,452 INFO    MainThread:1284257 [wandb_setup.py:_flush():77] Loading settings from /proj/cvl/users/x_fahkh2/WorldModels/WorldMem_Repro/wandb/settings
2026-02-11 07:40:09,452 INFO    MainThread:1284257 [wandb_setup.py:_flush():77] Loading settings from environment variables: {}
2026-02-11 07:40:09,452 INFO    MainThread:1284257 [wandb_setup.py:_flush():77] Applying setup settings: {'_disable_service': False}
2026-02-11 07:40:09,452 WARNING MainThread:1284257 [wandb_setup.py:_flush():77] Could not find program at -m main
2026-02-11 07:40:09,452 INFO    MainThread:1284257 [wandb_setup.py:_flush():77] Inferring run settings from compute environment: {'program_relpath': None, 'program': '-m main'}
2026-02-11 07:40:09,452 INFO    MainThread:1284257 [wandb_setup.py:_flush():77] Applying login settings: {}
2026-02-11 07:40:09,452 INFO    MainThread:1284257 [wandb_init.py:_log_setup():524] Logging user logs to /proj/cvl/users/x_fahkh2/WorldModels/WorldMem_Repro/reproduce_official_set/wandb/run-20260211_074009-ku1rnk9p/logs/debug.log
2026-02-11 07:40:09,452 INFO    MainThread:1284257 [wandb_init.py:_log_setup():525] Logging internal logs to /proj/cvl/users/x_fahkh2/WorldModels/WorldMem_Repro/reproduce_official_set/wandb/run-20260211_074009-ku1rnk9p/logs/debug-internal.log
2026-02-11 07:40:09,452 INFO    MainThread:1284257 [wandb_init.py:init():608] calling init triggers
2026-02-11 07:40:09,453 INFO    MainThread:1284257 [wandb_init.py:init():615] wandb.init called with sweep_config: {}
config: {'experiment': {'debug': '${debug}', 'tasks': ['training'], 'num_nodes': 1, 'training': {'precision': '16-mixed', 'compile': False, 'lr': 2e-05, 'batch_size': 4, 'max_epochs': -1, 'max_steps': 120000, 'max_time': None, 'data': {'num_workers': 4, 'shuffle': True}, 'optim': {'accumulate_grad_batches': 1, 'gradient_clip_val': 1.0}, 'checkpointing': {'every_n_train_steps': 2500, 'every_n_epochs': None, 'train_time_interval': None, 'enable_version_counter': False}}, 'validation': {'precision': '16-mixed', 'compile': False, 'batch_size': 4, 'val_every_n_step': 2500, 'val_every_n_epoch': None, 'limit_batch': 1, 'inference_mode': False, 'data': {'num_workers': 4, 'shuffle': False}}, 'test': {'precision': '16-mixed', 'compile': False, 'batch_size': 1, 'limit_batch': 1, 'inference_mode': False, 'data': {'num_workers': 4, 'shuffle': False}}, 'logging': {'metrics': None}, '_name': 'exp_video'}, 'dataset': {'debug': '${debug}', 'metadata': 'data/${dataset.name}/metadata.json', 'data_mean': 0.5, 'data_std': 0.5, 'save_dir': '/proj/cvl/users/x_fahkh2/WorldModels/WorldMem_Repro/worldmem_minecraft_dataset', 'n_frames': 8, 'context_length': 1, 'resolution': 128, 'observation_shape': [3, '${dataset.resolution}', '${dataset.resolution}'], 'external_cond_dim': 0, 'validation_multiplier': 1, 'frame_skip': 1, 'action_cond_dim': 25, '_name': 'video_minecraft', 'n_frames_valid': 700, 'angle_range': 110, 'pos_range': 2, 'memory_condition_length': 8, 'customized_validation': True, 'add_timestamp_embedding': True, 'wo_updown': True}, 'algorithm': {'debug': '${debug}', 'lr': '${experiment.training.lr}', 'x_shape': '${dataset.observation_shape}', 'frame_stack': 1, 'frame_skip': '${dataset.frame_skip}', 'data_mean': '${dataset.data_mean}', 'data_std': '${dataset.data_std}', 'external_cond_dim': 0, 'context_frames': 600, 'weight_decay': 0.002, 'warmup_steps': 1000, 'optimizer_beta': [0.9, 0.99], 'uncertainty_scale': 1, 'guidance_scale': 0.0, 'chunk_size': 1, 'scheduling_matrix': 'autoregressive', 'noise_level': 'random_all', 'causal': True, 'diffusion': {'objective': 'pred_v', 'beta_schedule': 'sigmoid', 'schedule_fn_kwargs': {}, 'clip_noise': 20.0, 'use_snr': False, 'use_cum_snr': False, 'use_fused_snr': True, 'snr_clip': 5.0, 'cum_snr_decay': 0.96, 'timesteps': 1000, 'sampling_timesteps': 20, 'ddim_sampling_eta': 0.0, 'stabilization_level': 15, 'architecture': {'network_size': 64, 'attn_heads': 4, 'attn_dim_head': 64, 'dim_mults': [1, 2, 4, 8], 'resolution': '${dataset.resolution}', 'attn_resolutions': [16, 32, 64, 128], 'use_init_temporal_attn': True, 'use_linear_attn': True, 'time_emb_type': 'rotary'}}, 'n_frames': '${dataset.n_frames}', 'metadata': '${dataset.metadata}', 'action_cond_dim': 25, 'use_plucker': True, '_name': 'df_video_worldmemminecraft', 'n_tokens': 8, 'memory_condition_length': 8, 'relative_embedding': True, 'log_video': True, 'add_timestamp_embedding': True, 'metrics': ['lpips', 'psnr']}, 'debug': False, 'wandb': {'entity': 'turlin', 'project': 'worldmem', 'mode': 'online'}, 'resume': None, 'load': None, 'name': 'train', 'diffusion_model_path': '/proj/cvl/users/x_fahkh2/WorldModels/WorldMem_Repro/checkpoints/diffusion_only.ckpt', 'vae_path': '/proj/cvl/users/x_fahkh2/WorldModels/WorldMem_Repro/checkpoints/vae_only.ckpt', 'customized_load': True, 'seperate_load': True, 'zero_init_gate': True, 'output_dir': '/proj/cvl/users/x_fahkh2/WorldModels/WorldMem_Repro/reproduce_official_set'}
2026-02-11 07:40:09,453 INFO    MainThread:1284257 [wandb_init.py:init():658] starting backend
2026-02-11 07:40:09,453 INFO    MainThread:1284257 [wandb_init.py:init():662] setting up manager
2026-02-11 07:40:09,468 INFO    MainThread:1284257 [backend.py:_multiprocessing_setup():105] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
2026-02-11 07:40:09,475 INFO    MainThread:1284257 [wandb_init.py:init():670] backend started and connected
2026-02-11 07:40:09,505 INFO    MainThread:1284257 [wandb_init.py:init():768] updated telemetry
2026-02-11 07:40:09,565 INFO    MainThread:1284257 [wandb_init.py:init():801] communicating run to backend with 90.0 second timeout
2026-02-11 07:40:10,115 ERROR   MainThread:1284257 [wandb_init.py:init():827] encountered error: It appears that you do not have permission to access the requested resource. Please reach out to the project owner to grant you access. If you have the correct permissions, verify that there are no issues with your networking setup.(Error 403: Forbidden)
2026-02-11 07:40:10,115 ERROR   MainThread:1284257 [wandb_init.py:init():1239] error in wandb.init()
Traceback (most recent call last):
  File "/proj/cvl/users/x_fahkh2/envs/worldmem/lib/python3.10/site-packages/wandb/sdk/wandb_init.py", line 1229, in init
    return wi.init()
  File "/proj/cvl/users/x_fahkh2/envs/worldmem/lib/python3.10/site-packages/wandb/sdk/wandb_init.py", line 833, in init
    raise error
wandb.errors.CommError: It appears that you do not have permission to access the requested resource. Please reach out to the project owner to grant you access. If you have the correct permissions, verify that there are no issues with your networking setup.(Error 403: Forbidden)
2026-02-11 07:40:12,653 WARNING MsgRouterThr:1284257 [router.py:message_loop():77] message_loop has been closed