| `torch_dtype` is deprecated! Use `dtype` instead! |
| Loading /workspace/stage2/output/final... |
|
Loading weights: 100%|██████████| 399/399 [00:01<00:00, 272.55it/s] |
| The following generation flags are not valid and may be ignored: ['temperature', 'top_p', 'top_k']. Set `TRANSFORMERS_VERBOSITY=info` for more details. |
| Both `max_new_tokens` (=12) and `max_length`(=40960) seem to have been set. `max_new_tokens` will take precedence. Please refer to the documentation for more information. (https://huggingface.co/docs/transformers/main/en/main_classes/text_generation) |
| Model: 8.20B params, vocab=152696 |
| Reco samples in val: 204874 |
| Eval items: 1000 |
| [32/1000] 1s elapsed, ETA 17s |
| [64/1000] 1s elapsed, ETA 13s |
| [96/1000] 1s elapsed, ETA 12s |
| [128/1000] 2s elapsed, ETA 11s |
| [160/1000] 2s elapsed, ETA 10s |
| [192/1000] 2s elapsed, ETA 10s |
| [224/1000] 3s elapsed, ETA 9s |
| [256/1000] 3s elapsed, ETA 9s |
| [288/1000] 3s elapsed, ETA 8s |
| [320/1000] 4s elapsed, ETA 8s |
| [352/1000] 4s elapsed, ETA 7s |
| [384/1000] 4s elapsed, ETA 7s |
| [416/1000] 5s elapsed, ETA 7s |
| [448/1000] 5s elapsed, ETA 6s |
| [480/1000] 5s elapsed, ETA 6s |
| [512/1000] 6s elapsed, ETA 5s |
| [544/1000] 6s elapsed, ETA 5s |
| [576/1000] 6s elapsed, ETA 5s |
| [608/1000] 7s elapsed, ETA 4s |
| [640/1000] 7s elapsed, ETA 4s |
| [672/1000] 7s elapsed, ETA 4s |
| [704/1000] 8s elapsed, ETA 3s |
| [736/1000] 8s elapsed, ETA 3s |
| [768/1000] 8s elapsed, ETA 3s |
| [800/1000] 9s elapsed, ETA 2s |
| [832/1000] 9s elapsed, ETA 2s |
| [864/1000] 9s elapsed, ETA 1s |
| [896/1000] 10s elapsed, ETA 1s |
| [928/1000] 10s elapsed, ETA 1s |
| [960/1000] 10s elapsed, ETA 0s |
| [992/1000] 11s elapsed, ETA 0s |
| [1000/1000] 11s elapsed, ETA 0s |
|
|
| === Results (n=1000, time=11s) === |
| valid_format: 100.0% |
| level_A: 6.9% |
| level_AB: 0.1% |
| level_ABC: 0.0% |
| exact (Hit@1): 0.00% |
|
|
| === By type === |
| copurchase_backward  n= 357  valid=100.0%  A= 3.9%  exact= 0.00% |
| copurchase_forward   n= 361  valid=100.0%  A= 7.5%  exact= 0.00% |
| seq_last_2           n= 124  valid=100.0%  A= 7.3%  exact= 0.00% |
| seq_last_3           n= 104  valid=100.0%  A=11.5%  exact= 0.00% |
| seq_last_5           n=  54  valid=100.0%  A=13.0%  exact= 0.00% |
|
|