2025-11-17 09:23:23,174 - root - INFO - Config loaded: {'seed': 42, 'data_path': 'data/dolly/train.jsonl', 'max_prompt_length': 256, 'max_length': 512, 'student_type': None, 'teacher_type': 'qwen2', 'student_path': None, 'teacher_path': 'models/qwen1.5-1.8b', 'num_epochs': 5, 'device': 'cuda', 'learning_rate': '1e-6', 'warmup_percentage': 0.05, 'batch_size': 8, 'gradient_accumulation_steps': 1, 'eval_repeat': 1, 'eval_data_path': 'data/dolly/valid.jsonl', 'eval_batch_size': 8, 'user': 'mrtuandao', 'repo': 'weighted-CTKD', 'wandb_project': 'weighted-ctkd'}
2025-11-17 09:23:24,072 - root - INFO - Wandb initialized with run name: train_teacher_train_teacher_qwen1.5-1.8b_20251117_092323
2025-11-17 09:23:24,073 - root - INFO - Using device: cuda
2025-11-17 09:23:26,335 - weighted_ctkd.kd_dataset - INFO - Start loading data from data/dolly/train.jsonl
2025-11-17 09:23:29,831 - weighted_ctkd.kd_dataset - INFO - Start loading data from data/dolly/valid.jsonl
2025-11-17 09:23:30,001 - root - INFO - Epoch 1/5
2025-11-17 09:23:30,415 - absl - INFO - Using default tokenizer.
2025-11-17 09:23:34,137 - root - INFO - Step 1/7150 train rougeL: 0.2695166607914909
2025-11-17 09:23:34,389 - root - INFO - Step 1/7150 loss: 2.684575319290161, total_norm: inf
2025-11-17 09:24:17,564 - absl - INFO - Using default tokenizer.
2025-11-17 09:24:21,921 - root - INFO - Step 101/7150 train rougeL: 0.1814889559974096
2025-11-17 09:24:22,247 - root - INFO - Step 101/7150 loss: 1.390326976776123, total_norm: 9.108480453491211
2025-11-17 09:25:05,518 - absl - INFO - Using default tokenizer.
2025-11-17 09:25:10,037 - root - INFO - Step 201/7150 train rougeL: 0.13310543577201747
2025-11-17 09:25:10,363 - root - INFO - Step 201/7150 loss: 1.7122552394866943, total_norm: 10.320128440856934
2025-11-17 09:25:53,642 - absl - INFO - Using default tokenizer.
2025-11-17 09:25:57,885 - root - INFO - Step 301/7150 train rougeL: 0.14500941944338083
2025-11-17 09:25:58,211 - root - INFO - Step 301/7150 loss: 1.7587337493896484, total_norm: 10.128713607788086
2025-11-17 09:26:41,361 - absl - INFO - Using default tokenizer.
2025-11-17 09:26:45,548 - root - INFO - Step 401/7150 train rougeL: 0.130055569738767
2025-11-17 09:26:45,873 - root - INFO - Step 401/7150 loss: 1.553663730621338, total_norm: 9.712050437927246
2025-11-17 09:27:29,063 - root - INFO - Step 501/7150 finished
2025-11-17 09:27:29,181 - absl - INFO - Using default tokenizer.
2025-11-17 09:27:33,995 - absl - INFO - Using default tokenizer.
2025-11-17 09:27:38,536 - absl - INFO - Using default tokenizer.
2025-11-17 09:27:43,160 - absl - INFO - Using default tokenizer.
2025-11-17 09:27:47,803 - absl - INFO - Using default tokenizer.
2025-11-17 09:27:52,339 - absl - INFO - Using default tokenizer.
2025-11-17 09:27:56,896 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:01,372 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:06,016 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:10,516 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:15,177 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:19,717 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:24,146 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:28,559 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:33,109 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:37,581 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:42,030 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:46,609 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:51,092 - absl - INFO - Using default tokenizer.
2025-11-17 09:28:55,622 - absl - INFO - Using default tokenizer.
2025-11-17 09:29:00,214 - absl - INFO - Using default tokenizer.
2025-11-17 09:29:04,719 - absl - INFO - Using default tokenizer.
2025-11-17 09:31:27,893 - absl - INFO - Using default tokenizer.
2025-11-17 09:31:32,415 - absl - INFO - Using default tokenizer.
2025-11-17 09:31:36,914 - absl - INFO - Using default tokenizer.
2025-11-17 09:31:41,504 - absl - INFO - Using default tokenizer.
2025-11-17 09:31:46,022 - absl - INFO - Using default tokenizer.
2025-11-17 09:31:50,494 - absl - INFO - Using default tokenizer.
2025-11-17 09:31:55,094 - absl - INFO - Using default tokenizer.
2025-11-17 09:31:59,610 - absl - INFO - Using default tokenizer.
2025-11-17 09:32:04,109 - absl - INFO - Using default tokenizer.
2025-11-17 09:32:08,695 - absl - INFO - Using default tokenizer.
2025-11-17 09:32:13,135 - absl - INFO - Using default tokenizer.
2025-11-17 09:32:17,519 - root - INFO - Epoch 1/5 eval loss: 1.6519081989924114, eval rougeL: 0.13946933698656294
2025-11-17 09:32:17,631 - absl - INFO - Using default tokenizer.
2025-11-17 09:32:22,193 - root - INFO - Step 501/7150 train rougeL: 0.18411816370219858
2025-11-17 09:32:22,519 - root - INFO - Step 501/7150 loss: 1.7445898056030273, total_norm: 8.362028121948242
2025-11-17 09:33:05,779 - absl - INFO - Using default tokenizer.
2025-11-17 09:33:10,101 - root - INFO - Step 601/7150 train rougeL: 0.19568898904713544
2025-11-17 09:33:10,427 - root - INFO - Step 601/7150 loss: 1.5907386541366577, total_norm: 8.246379852294922
2025-11-17 09:33:53,681 - absl - INFO - Using default tokenizer.
2025-11-17 09:33:57,926 - root - INFO - Step 701/7150 train rougeL: 0.12426825185379815
2025-11-17 09:33:58,251 - root - INFO - Step 701/7150 loss: 1.5074223279953003, total_norm: 10.695302963256836
2025-11-17 09:34:41,499 - absl - INFO - Using default tokenizer.
2025-11-17 09:34:45,785 - root - INFO - Step 801/7150 train rougeL: 0.14440483420485709
2025-11-17 09:34:46,110 - root - INFO - Step 801/7150 loss: 1.3891932964324951, total_norm: 9.557195663452148
2025-11-17 09:35:29,390 - absl - INFO - Using default tokenizer.
2025-11-17 09:35:33,809 - root - INFO - Step 901/7150 train rougeL: 0.07753096926356544
2025-11-17 09:35:34,134 - root - INFO - Step 901/7150 loss: 1.5375957489013672, total_norm: 11.225637435913086
2025-11-17 09:36:17,285 - root - INFO - Step 1001/7150 finished
2025-11-17 09:36:17,400 - absl - INFO - Using default tokenizer.
2025-11-17 09:36:22,102 - absl - INFO - Using default tokenizer.
2025-11-17 09:36:26,644 - absl - INFO - Using default tokenizer.
2025-11-17 09:36:31,130 - absl - INFO - Using default tokenizer.
2025-11-17 09:36:35,749 - absl - INFO - Using default tokenizer.
2025-11-17 09:36:40,277 - absl - INFO - Using default tokenizer.
2025-11-17 09:36:44,784 - absl - INFO - Using default tokenizer.
2025-11-17 09:36:49,338 - absl - INFO - Using default tokenizer.
2025-11-17 09:36:53,916 - absl - INFO - Using default tokenizer.
2025-11-17 09:36:58,457 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:03,230 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:07,750 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:12,281 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:16,850 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:21,397 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:26,021 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:30,580 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:35,035 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:39,634 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:44,219 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:48,726 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:53,341 - absl - INFO - Using default tokenizer.
2025-11-17 09:37:57,887 - absl - INFO - Using default tokenizer.
2025-11-17 09:38:02,405 - absl - INFO - Using default tokenizer.
2025-11-17 09:38:07,078 - absl - INFO - Using default tokenizer.
2025-11-17 09:38:11,639 - absl - INFO - Using default tokenizer.
2025-11-17 09:40:33,276 - absl - INFO - Using default tokenizer.
2025-11-17 09:40:37,821 - absl - INFO - Using default tokenizer.
2025-11-17 09:40:42,292 - absl - INFO - Using default tokenizer.
2025-11-17 09:40:46,883 - absl - INFO - Using default tokenizer.
2025-11-17 09:40:51,366 - absl - INFO - Using default tokenizer.
2025-11-17 09:40:55,851 - absl - INFO - Using default tokenizer.
2025-11-17 09:41:00,420 - absl - INFO - Using default tokenizer.
2025-11-17 09:41:04,810 - root - INFO - Epoch 1/5 eval loss: 1.6416344207430642, eval rougeL: 0.12899227842891858
2025-11-17 09:41:04,923 - absl - INFO - Using default tokenizer.
2025-11-17 09:41:09,438 - root - INFO - Step 1001/7150 train rougeL: 0.15306381692413795
2025-11-17 09:41:09,764 - root - INFO - Step 1001/7150 loss: 1.3160507678985596, total_norm: 7.292008876800537
2025-11-17 09:41:53,037 - absl - INFO - Using default tokenizer.
2025-11-17 09:41:57,494 - root - INFO - Step 1101/7150 train rougeL: 0.10340424263642492
2025-11-17 09:41:57,819 - root - INFO - Step 1101/7150 loss: 1.904419183731079, total_norm: 8.607535362243652
2025-11-17 09:42:41,091 - absl - INFO - Using default tokenizer.
2025-11-17 09:42:45,583 - root - INFO - Step 1201/7150 train rougeL: 0.1308097746355616
2025-11-17 09:42:45,908 - root - INFO - Step 1201/7150 loss: 1.6532994508743286, total_norm: 8.67361068725586
2025-11-17 09:43:29,207 - absl - INFO - Using default tokenizer.
2025-11-17 09:43:33,676 - root - INFO - Step 1301/7150 train rougeL: 0.2525205312757198
2025-11-17 09:43:34,001 - root - INFO - Step 1301/7150 loss: 1.2103170156478882, total_norm: 9.047572135925293
2025-11-17 09:44:17,261 - absl - INFO - Using default tokenizer.
2025-11-17 09:44:21,633 - root - INFO - Step 1401/7150 train rougeL: 0.15473034239630698
2025-11-17 09:44:21,959 - root - INFO - Step 1401/7150 loss: 1.7244929075241089, total_norm: 8.697896957397461
2025-11-17 09:44:34,453 - root - INFO - Epoch 1/5 finished
2025-11-17 09:44:34,570 - absl - INFO - Using default tokenizer.
2025-11-17 09:44:39,383 - absl - INFO - Using default tokenizer.
2025-11-17 09:44:44,024 - absl - INFO - Using default tokenizer.
2025-11-17 09:44:48,493 - absl - INFO - Using default tokenizer.
2025-11-17 09:44:52,970 - absl - INFO - Using default tokenizer.
2025-11-17 09:44:57,572 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:02,076 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:06,580 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:11,206 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:15,695 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:20,105 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:24,654 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:28,942 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:33,291 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:37,591 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:41,885 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:46,454 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:50,916 - absl - INFO - Using default tokenizer.
2025-11-17 09:45:55,436 - absl - INFO - Using default tokenizer.
2025-11-17 09:46:00,029 - absl - INFO - Using default tokenizer.
2025-11-17 09:46:04,545 - absl - INFO - Using default tokenizer.
2025-11-17 09:46:09,097 - absl - INFO - Using default tokenizer.
2025-11-17 09:46:13,646 - absl - INFO - Using default tokenizer.
2025-11-17 09:46:18,110 - absl - INFO - Using default tokenizer.
2025-11-17 09:46:22,668 - absl - INFO - Using default tokenizer.
2025-11-17 09:46:27,179 - absl - INFO - Using default tokenizer.
2025-11-17 09:48:48,678 - absl - INFO - Using default tokenizer.
2025-11-17 09:48:53,315 - absl - INFO - Using default tokenizer.
2025-11-17 09:48:57,919 - absl - INFO - Using default tokenizer.
2025-11-17 09:49:02,504 - absl - INFO - Using default tokenizer.
2025-11-17 09:49:07,077 - absl - INFO - Using default tokenizer.
2025-11-17 09:49:11,781 - absl - INFO - Using default tokenizer.
2025-11-17 09:49:16,274 - absl - INFO - Using default tokenizer.
2025-11-17 09:49:20,652 - root - INFO - Epoch 1/5 eval loss: 1.63703125242203, eval rougeL: 0.12554697716065366
2025-11-17 09:49:30,534 - root - INFO - Epoch 2/5
2025-11-17 09:50:01,136 - root - INFO - Step 1501/7150 finished
2025-11-17 09:50:01,252 - absl - INFO - Using default tokenizer.
2025-11-17 09:50:06,266 - absl - INFO - Using default tokenizer.
2025-11-17 09:50:10,962 - absl - INFO - Using default tokenizer.
2025-11-17 09:50:15,646 - absl - INFO - Using default tokenizer.
2025-11-17 09:50:20,244 - absl - INFO - Using default tokenizer.
2025-11-17 09:50:25,007 - absl - INFO - Using default tokenizer.
2025-11-17 09:50:29,482 - absl - INFO - Using default tokenizer.
2025-11-17 09:50:33,873 - absl - INFO - Using default tokenizer.
2025-11-17 09:50:38,470 - absl - INFO - Using default tokenizer.
2025-11-17 09:50:43,014 - absl - INFO - Using default tokenizer.
2025-11-17 09:50:47,669 - absl - INFO - Using default tokenizer.
2025-11-17 09:50:52,283 - absl - INFO - Using default tokenizer.
2025-11-17 09:50:56,913 - absl - INFO - Using default tokenizer.
2025-11-17 09:51:01,590 - absl - INFO - Using default tokenizer.
2025-11-17 09:51:06,193 - absl - INFO - Using default tokenizer.
2025-11-17 09:51:10,814 - absl - INFO - Using default tokenizer.
2025-11-17 09:51:15,447 - absl - INFO - Using default tokenizer.
2025-11-17 09:51:20,008 - absl - INFO - Using default tokenizer.
2025-11-17 09:51:24,659 - absl - INFO - Using default tokenizer.
2025-11-17 09:51:29,357 - absl - INFO - Using default tokenizer.
2025-11-17 09:53:50,393 - absl - INFO - Using default tokenizer.
2025-11-17 09:53:55,008 - absl - INFO - Using default tokenizer.
2025-11-17 09:53:59,746 - absl - INFO - Using default tokenizer.
2025-11-17 09:54:04,487 - absl - INFO - Using default tokenizer.
2025-11-17 09:54:09,055 - absl - INFO - Using default tokenizer.
2025-11-17 09:54:13,750 - absl - INFO - Using default tokenizer.
2025-11-17 09:54:18,354 - absl - INFO - Using default tokenizer.
2025-11-17 09:54:22,852 - absl - INFO - Using default tokenizer.
2025-11-17 09:54:27,337 - absl - INFO - Using default tokenizer.
2025-11-17 09:54:31,939 - absl - INFO - Using default tokenizer.
2025-11-17 09:54:36,456 - absl - INFO - Using default tokenizer.
2025-11-17 09:54:40,965 - absl - INFO - Using default tokenizer.
2025-11-17 09:54:45,531 - absl - INFO - Using default tokenizer.
2025-11-17 09:54:49,921 - root - INFO - Epoch 2/5 eval loss: 1.6420218130898854, eval rougeL: 0.12702835309426896
2025-11-17 09:54:50,034 - absl - INFO - Using default tokenizer.
2025-11-17 09:54:54,423 - root - INFO - Step 1501/7150 train rougeL: 0.12314224151257665
2025-11-17 09:54:54,748 - root - INFO - Step 1501/7150 loss: 1.8820037841796875, total_norm: 7.781169891357422
2025-11-17 09:55:38,038 - absl - INFO - Using default tokenizer.
2025-11-17 09:55:42,581 - root - INFO - Step 1601/7150 train rougeL: 0.1932124457807039
2025-11-17 09:55:42,906 - root - INFO - Step 1601/7150 loss: 1.3279756307601929, total_norm: 8.1652250289917
2025-11-17 09:56:26,208 - absl - INFO - Using default tokenizer.
2025-11-17 09:56:30,764 - root - INFO - Step 1701/7150 train rougeL: 0.16474791024744465
2025-11-17 09:56:31,090 - root - INFO - Step 1701/7150 loss: 1.5774627923965454, total_norm: 7.514491081237793
2025-11-17 09:57:14,423 - absl - INFO - Using default tokenizer.
2025-11-17 09:57:18,856 - root - INFO - Step 1801/7150 train rougeL: 0.13908046997072102
2025-11-17 09:57:19,182 - root - INFO - Step 1801/7150 loss: 1.4007459878921509, total_norm: 9.450364112854004
2025-11-17 09:58:02,482 - absl - INFO - Using default tokenizer.
2025-11-17 09:58:06,864 - root - INFO - Step 1901/7150 train rougeL: 0.13224568495410882
2025-11-17 09:58:07,190 - root - INFO - Step 1901/7150 loss: 1.2375394105911255, total_norm: 8.681380271911621
2025-11-17 09:58:50,369 - root - INFO - Step 2001/7150 finished
2025-11-17 09:58:50,485 - absl - INFO - Using default tokenizer.
2025-11-17 09:58:55,203 - absl - INFO - Using default tokenizer.
2025-11-17 09:58:59,758 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:04,311 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:08,844 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:13,519 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:18,056 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:22,563 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:27,268 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:31,792 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:36,281 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:40,907 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:45,409 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:49,942 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:54,523 - absl - INFO - Using default tokenizer.
2025-11-17 09:59:59,030 - absl - INFO - Using default tokenizer.
2025-11-17 10:00:03,672 - absl - INFO - Using default tokenizer.
2025-11-17 10:00:08,246 - absl - INFO - Using default tokenizer.
2025-11-17 10:00:12,746 - absl - INFO - Using default tokenizer.
2025-11-17 10:00:17,527 - absl - INFO - Using default tokenizer.
2025-11-17 10:00:22,044 - absl - INFO - Using default tokenizer.
2025-11-17 10:00:26,497 - absl - INFO - Using default tokenizer.
2025-11-17 10:02:48,498 - absl - INFO - Using default tokenizer.
2025-11-17 10:02:53,020 - absl - INFO - Using default tokenizer.
2025-11-17 10:02:57,630 - absl - INFO - Using default tokenizer.
2025-11-17 10:03:02,393 - absl - INFO - Using default tokenizer.
2025-11-17 10:03:06,892 - absl - INFO - Using default tokenizer.
2025-11-17 10:03:11,463 - absl - INFO - Using default tokenizer.
2025-11-17 10:03:15,983 - absl - INFO - Using default tokenizer.
2025-11-17 10:03:20,434 - absl - INFO - Using default tokenizer.
2025-11-17 10:03:25,001 - absl - INFO - Using default tokenizer.
2025-11-17 10:03:29,491 - absl - INFO - Using default tokenizer.
2025-11-17 10:03:33,930 - absl - INFO - Using default tokenizer.
2025-11-17 10:03:38,432 - root - INFO - Epoch 2/5 eval loss: 1.6414309569767542, eval rougeL: 0.12392493572247207
2025-11-17 10:03:38,545 - absl - INFO - Using default tokenizer.
2025-11-17 10:03:42,916 - root - INFO - Step 2001/7150 train rougeL: 0.12277606377251042
2025-11-17 10:03:43,242 - root - INFO - Step 2001/7150 loss: 1.4930633306503296, total_norm: 10.01546859741211
2025-11-17 10:04:26,543 - absl - INFO - Using default tokenizer.
2025-11-17 10:04:30,883 - root - INFO - Step 2101/7150 train rougeL: 0.16905727988391578
2025-11-17 10:04:31,209 - root - INFO - Step 2101/7150 loss: 1.27573561668396, total_norm: 7.830188274383545
2025-11-17 10:05:14,471 - absl - INFO - Using default tokenizer.
2025-11-17 10:05:18,818 - root - INFO - Step 2201/7150 train rougeL: 0.2625067607078729
2025-11-17 10:05:19,143 - root - INFO - Step 2201/7150 loss: 0.9901683926582336, total_norm: 7.19817590713501
2025-11-17 10:06:02,418 - absl - INFO - Using default tokenizer.
2025-11-17 10:06:06,773 - root - INFO - Step 2301/7150 train rougeL: 0.195401787116215
2025-11-17 10:06:07,098 - root - INFO - Step 2301/7150 loss: 1.3955148458480835, total_norm: 8.128800392150879
2025-11-17 10:06:50,362 - absl - INFO - Using default tokenizer.
2025-11-17 10:06:54,602 - root - INFO - Step 2401/7150 train rougeL: 0.1433318121417271
2025-11-17 10:06:54,927 - root - INFO - Step 2401/7150 loss: 1.7501530647277832, total_norm: 8.100083351135254
2025-11-17 10:07:38,029 - root - INFO - Step 2501/7150 finished
2025-11-17 10:07:38,145 - absl - INFO - Using default tokenizer.
2025-11-17 10:07:42,830 - absl - INFO - Using default tokenizer.
2025-11-17 10:07:47,260 - absl - INFO - Using default tokenizer.
2025-11-17 10:07:51,908 - absl - INFO - Using default tokenizer.
2025-11-17 10:07:56,480 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:01,028 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:05,697 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:10,359 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:14,918 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:19,504 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:23,893 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:28,277 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:32,641 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:37,058 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:41,490 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:45,874 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:50,262 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:54,837 - absl - INFO - Using default tokenizer.
2025-11-17 10:08:59,214 - absl - INFO - Using default tokenizer.
2025-11-17 10:09:03,770 - absl - INFO - Using default tokenizer.
2025-11-17 10:09:08,132 - absl - INFO - Using default tokenizer.
2025-11-17 10:09:12,484 - absl - INFO - Using default tokenizer.
2025-11-17 10:09:16,826 - absl - INFO - Using default tokenizer.
2025-11-17 10:09:21,150 - absl - INFO - Using default tokenizer.
2025-11-17 10:09:25,484 - absl - INFO - Using default tokenizer.
2025-11-17 10:09:29,848 - absl - INFO - Using default tokenizer.
2025-11-17 10:11:45,503 - absl - INFO - Using default tokenizer. 2025-11-17 10:11:49,852 - absl - INFO - Using default tokenizer. 2025-11-17 10:11:54,249 - absl - INFO - Using default tokenizer. 2025-11-17 10:11:58,619 - absl - INFO - Using default tokenizer. 2025-11-17 10:12:02,963 - absl - INFO - Using default tokenizer. 2025-11-17 10:12:07,342 - absl - INFO - Using default tokenizer. 2025-11-17 10:12:11,662 - absl - INFO - Using default tokenizer. 2025-11-17 10:12:15,862 - root - INFO - Epoch 2/5 eval loss: 1.6396445017012338, eval rougeL: 0.12217254180782963 2025-11-17 10:12:15,974 - absl - INFO - Using default tokenizer. 2025-11-17 10:12:20,180 - root - INFO - Step 2501/7150 train rougeL: 0.10662774189082413 2025-11-17 10:12:20,506 - root - INFO - Step 2501/7150 loss: 1.6259877681732178, total_norm: 9.325039863586426 2025-11-17 10:13:03,698 - absl - INFO - Using default tokenizer. 2025-11-17 10:13:07,923 - root - INFO - Step 2601/7150 train rougeL: 0.09531017424187235 2025-11-17 10:13:08,248 - root - INFO - Step 2601/7150 loss: 1.5560225248336792, total_norm: 9.481316566467285 2025-11-17 10:13:51,473 - absl - INFO - Using default tokenizer. 2025-11-17 10:13:55,690 - root - INFO - Step 2701/7150 train rougeL: 0.14330683857162124 2025-11-17 10:13:56,016 - root - INFO - Step 2701/7150 loss: 1.7089084386825562, total_norm: 7.112304210662842 2025-11-17 10:14:39,218 - absl - INFO - Using default tokenizer. 2025-11-17 10:14:43,384 - root - INFO - Step 2801/7150 train rougeL: 0.10039210438135035 2025-11-17 10:14:43,709 - root - INFO - Step 2801/7150 loss: 1.1360270977020264, total_norm: 10.12359619140625 2025-11-17 10:15:09,212 - root - INFO - Epoch 2/5 finished 2025-11-17 10:15:09,328 - absl - INFO - Using default tokenizer. 2025-11-17 10:15:13,646 - absl - INFO - Using default tokenizer. 2025-11-17 10:15:17,883 - absl - INFO - Using default tokenizer. 2025-11-17 10:15:22,333 - absl - INFO - Using default tokenizer. 
2025-11-17 10:15:26,566 - absl - INFO - Using default tokenizer. 2025-11-17 10:15:30,838 - absl - INFO - Using default tokenizer. 2025-11-17 10:15:35,101 - absl - INFO - Using default tokenizer. 2025-11-17 10:15:39,353 - absl - INFO - Using default tokenizer. 2025-11-17 10:15:43,670 - absl - INFO - Using default tokenizer. 2025-11-17 10:15:47,952 - absl - INFO - Using default tokenizer. 2025-11-17 10:15:52,264 - absl - INFO - Using default tokenizer. 2025-11-17 10:15:56,562 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:00,832 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:05,131 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:09,400 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:13,726 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:18,041 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:22,385 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:26,724 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:31,023 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:35,377 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:40,003 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:44,324 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:48,672 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:53,069 - absl - INFO - Using default tokenizer. 2025-11-17 10:16:57,388 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:01,817 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:06,257 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:10,682 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:15,030 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:19,414 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:23,818 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:28,191 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:32,579 - absl - INFO - Using default tokenizer. 
2025-11-17 10:17:36,973 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:41,381 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:45,771 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:50,157 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:54,540 - absl - INFO - Using default tokenizer. 2025-11-17 10:17:59,110 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:03,489 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:07,822 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:12,176 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:16,541 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:20,893 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:25,258 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:29,669 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:34,028 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:38,369 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:42,708 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:47,033 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:51,404 - absl - INFO - Using default tokenizer. 2025-11-17 10:18:55,811 - absl - INFO - Using default tokenizer. 2025-11-17 10:19:00,176 - absl - INFO - Using default tokenizer. 2025-11-17 10:19:04,613 - absl - INFO - Using default tokenizer. 2025-11-17 10:19:08,981 - absl - INFO - Using default tokenizer. 2025-11-17 10:19:13,350 - absl - INFO - Using default tokenizer. 2025-11-17 10:19:17,891 - absl - INFO - Using default tokenizer. 2025-11-17 10:19:22,198 - absl - INFO - Using default tokenizer. 2025-11-17 10:19:26,559 - absl - INFO - Using default tokenizer. 2025-11-17 10:19:30,911 - absl - INFO - Using default tokenizer. 2025-11-17 10:19:35,295 - absl - INFO - Using default tokenizer. 2025-11-17 10:19:39,641 - absl - INFO - Using default tokenizer. 
2025-11-17 10:19:43,877 - root - INFO - Epoch 2/5 eval loss: 1.6362242774357871, eval rougeL: 0.12231369640628474 2025-11-17 10:19:53,652 - root - INFO - Epoch 3/5 2025-11-17 10:20:11,263 - absl - INFO - Using default tokenizer. 2025-11-17 10:20:15,577 - root - INFO - Step 2901/7150 train rougeL: 0.15897589078440516 2025-11-17 10:20:15,902 - root - INFO - Step 2901/7150 loss: 1.6558233499526978, total_norm: 8.313885688781738 2025-11-17 10:20:59,019 - root - INFO - Step 3001/7150 finished 2025-11-17 10:20:59,134 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:03,487 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:07,808 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:12,146 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:16,529 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:20,980 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:25,436 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:29,788 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:34,205 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:38,547 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:43,055 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:47,414 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:51,719 - absl - INFO - Using default tokenizer. 2025-11-17 10:21:56,075 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:00,649 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:05,193 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:09,697 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:14,169 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:18,642 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:23,132 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:27,465 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:31,752 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:36,008 - absl - INFO - Using default tokenizer. 
2025-11-17 10:22:40,298 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:44,600 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:48,913 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:53,220 - absl - INFO - Using default tokenizer. 2025-11-17 10:22:57,725 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:02,458 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:07,033 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:11,613 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:16,052 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:20,561 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:24,939 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:29,300 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:33,707 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:38,092 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:42,455 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:46,808 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:51,178 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:55,543 - absl - INFO - Using default tokenizer. 2025-11-17 10:23:59,872 - absl - INFO - Using default tokenizer. 2025-11-17 10:24:04,209 - absl - INFO - Using default tokenizer. 2025-11-17 10:24:08,560 - absl - INFO - Using default tokenizer. 2025-11-17 10:24:12,869 - absl - INFO - Using default tokenizer. 2025-11-17 10:24:17,199 - absl - INFO - Using default tokenizer. 2025-11-17 10:24:21,660 - absl - INFO - Using default tokenizer. 2025-11-17 10:24:25,974 - absl - INFO - Using default tokenizer. 2025-11-17 10:24:30,318 - absl - INFO - Using default tokenizer. 2025-11-17 10:24:34,658 - absl - INFO - Using default tokenizer. 2025-11-17 10:24:38,966 - absl - INFO - Using default tokenizer. 2025-11-17 10:24:43,299 - absl - INFO - Using default tokenizer. 2025-11-17 10:24:47,603 - absl - INFO - Using default tokenizer. 
2025-11-17 10:24:51,917 - absl - INFO - Using default tokenizer. 2025-11-17 10:24:56,249 - absl - INFO - Using default tokenizer. 2025-11-17 10:25:00,548 - absl - INFO - Using default tokenizer. 2025-11-17 10:25:04,882 - absl - INFO - Using default tokenizer. 2025-11-17 10:25:09,171 - absl - INFO - Using default tokenizer. 2025-11-17 10:25:13,479 - absl - INFO - Using default tokenizer. 2025-11-17 10:25:17,783 - absl - INFO - Using default tokenizer. 2025-11-17 10:25:22,096 - absl - INFO - Using default tokenizer. 2025-11-17 10:25:26,424 - absl - INFO - Using default tokenizer. 2025-11-17 10:25:30,671 - absl - INFO - Using default tokenizer. 2025-11-17 10:25:34,846 - root - INFO - Epoch 3/5 eval loss: 1.65362552801768, eval rougeL: 0.12596918722834954 2025-11-17 10:25:34,958 - absl - INFO - Using default tokenizer. 2025-11-17 10:25:39,289 - root - INFO - Step 3001/7150 train rougeL: 0.19326760517745264 2025-11-17 10:25:39,614 - root - INFO - Step 3001/7150 loss: 1.5034375190734863, total_norm: 7.960014343261719 2025-11-17 10:26:22,750 - absl - INFO - Using default tokenizer. 2025-11-17 10:26:26,919 - root - INFO - Step 3101/7150 train rougeL: 0.12763325732899017 2025-11-17 10:26:27,244 - root - INFO - Step 3101/7150 loss: 1.4708831310272217, total_norm: 9.13034439086914 2025-11-17 10:27:10,360 - absl - INFO - Using default tokenizer. 2025-11-17 10:27:14,561 - root - INFO - Step 3201/7150 train rougeL: 0.12888426048023352 2025-11-17 10:27:14,886 - root - INFO - Step 3201/7150 loss: 1.2688593864440918, total_norm: 8.532910346984863 2025-11-17 10:27:58,042 - absl - INFO - Using default tokenizer. 2025-11-17 10:28:02,354 - root - INFO - Step 3301/7150 train rougeL: 0.10986793311717052 2025-11-17 10:28:02,680 - root - INFO - Step 3301/7150 loss: 1.3775675296783447, total_norm: 7.811343193054199 2025-11-17 10:28:45,830 - absl - INFO - Using default tokenizer. 
2025-11-17 10:28:50,154 - root - INFO - Step 3401/7150 train rougeL: 0.10495329130225253 2025-11-17 10:28:50,479 - root - INFO - Step 3401/7150 loss: 1.20743727684021, total_norm: 10.27733325958252 2025-11-17 10:29:33,475 - root - INFO - Step 3501/7150 finished 2025-11-17 10:29:33,590 - absl - INFO - Using default tokenizer. 2025-11-17 10:29:37,948 - absl - INFO - Using default tokenizer. 2025-11-17 10:29:42,295 - absl - INFO - Using default tokenizer. 2025-11-17 10:29:46,619 - absl - INFO - Using default tokenizer. 2025-11-17 10:29:50,974 - absl - INFO - Using default tokenizer. 2025-11-17 10:29:55,315 - absl - INFO - Using default tokenizer. 2025-11-17 10:29:59,636 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:04,001 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:08,398 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:12,747 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:17,310 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:21,684 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:26,016 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:30,369 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:34,684 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:39,038 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:43,367 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:47,710 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:52,075 - absl - INFO - Using default tokenizer. 2025-11-17 10:30:56,417 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:00,770 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:05,110 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:09,436 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:13,808 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:18,145 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:22,491 - absl - INFO - Using default tokenizer. 
2025-11-17 10:31:26,818 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:31,140 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:35,674 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:40,023 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:44,347 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:48,684 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:52,996 - absl - INFO - Using default tokenizer. 2025-11-17 10:31:57,341 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:01,666 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:06,009 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:10,351 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:14,674 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:18,991 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:23,312 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:27,753 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:32,172 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:36,603 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:41,063 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:45,487 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:49,941 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:54,557 - absl - INFO - Using default tokenizer. 2025-11-17 10:32:58,984 - absl - INFO - Using default tokenizer. 2025-11-17 10:33:03,411 - absl - INFO - Using default tokenizer. 2025-11-17 10:33:07,869 - absl - INFO - Using default tokenizer. 2025-11-17 10:33:12,374 - absl - INFO - Using default tokenizer. 2025-11-17 10:33:16,943 - absl - INFO - Using default tokenizer. 2025-11-17 10:33:21,609 - absl - INFO - Using default tokenizer. 2025-11-17 10:33:26,177 - absl - INFO - Using default tokenizer. 2025-11-17 10:33:30,762 - absl - INFO - Using default tokenizer. 2025-11-17 10:33:35,333 - absl - INFO - Using default tokenizer. 
2025-11-17 10:33:39,857 - absl - INFO - Using default tokenizer. 2025-11-17 10:33:44,299 - absl - INFO - Using default tokenizer. 2025-11-17 10:33:48,792 - absl - INFO - Using default tokenizer. 2025-11-17 10:33:53,233 - absl - INFO - Using default tokenizer. 2025-11-17 10:33:57,800 - absl - INFO - Using default tokenizer. 2025-11-17 10:34:02,286 - absl - INFO - Using default tokenizer. 2025-11-17 10:34:06,683 - absl - INFO - Using default tokenizer. 2025-11-17 10:34:11,108 - root - INFO - Epoch 3/5 eval loss: 1.6567365385237194, eval rougeL: 0.12643298401221784 2025-11-17 10:34:11,220 - absl - INFO - Using default tokenizer. 2025-11-17 10:34:15,655 - root - INFO - Step 3501/7150 train rougeL: 0.12792752496494356 2025-11-17 10:34:15,980 - root - INFO - Step 3501/7150 loss: 1.5544320344924927, total_norm: 10.188572883605957 2025-11-17 10:34:59,125 - absl - INFO - Using default tokenizer. 2025-11-17 10:35:03,403 - root - INFO - Step 3601/7150 train rougeL: 0.12198955462667235 2025-11-17 10:35:03,729 - root - INFO - Step 3601/7150 loss: 1.3073434829711914, total_norm: 9.680294036865234 2025-11-17 10:35:46,950 - absl - INFO - Using default tokenizer. 2025-11-17 10:35:51,155 - root - INFO - Step 3701/7150 train rougeL: 0.09020803253463353 2025-11-17 10:35:51,480 - root - INFO - Step 3701/7150 loss: 1.194284439086914, total_norm: 9.768637657165527 2025-11-17 10:36:34,618 - absl - INFO - Using default tokenizer. 2025-11-17 10:36:38,847 - root - INFO - Step 3801/7150 train rougeL: 0.12248498258051387 2025-11-17 10:36:39,173 - root - INFO - Step 3801/7150 loss: 1.2301397323608398, total_norm: 9.589608192443848 2025-11-17 10:37:22,384 - absl - INFO - Using default tokenizer. 
2025-11-17 10:37:26,927 - root - INFO - Step 3901/7150 train rougeL: 0.1316850744566755 2025-11-17 10:37:27,253 - root - INFO - Step 3901/7150 loss: 1.474929928779602, total_norm: 9.06521987915039 2025-11-17 10:38:10,380 - root - INFO - Step 4001/7150 finished 2025-11-17 10:38:10,495 - absl - INFO - Using default tokenizer. 2025-11-17 10:38:15,142 - absl - INFO - Using default tokenizer. 2025-11-17 10:38:19,594 - absl - INFO - Using default tokenizer. 2025-11-17 10:38:24,080 - absl - INFO - Using default tokenizer. 2025-11-17 10:38:28,764 - absl - INFO - Using default tokenizer. 2025-11-17 10:38:33,357 - absl - INFO - Using default tokenizer. 2025-11-17 10:38:38,008 - absl - INFO - Using default tokenizer. 2025-11-17 10:38:42,695 - absl - INFO - Using default tokenizer. 2025-11-17 10:38:47,362 - absl - INFO - Using default tokenizer. 2025-11-17 10:38:52,015 - absl - INFO - Using default tokenizer. 2025-11-17 10:38:56,875 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:01,446 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:06,040 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:10,799 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:15,390 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:19,974 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:24,722 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:29,324 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:33,917 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:38,607 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:43,199 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:47,753 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:52,408 - absl - INFO - Using default tokenizer. 2025-11-17 10:39:57,017 - absl - INFO - Using default tokenizer. 2025-11-17 10:40:01,609 - absl - INFO - Using default tokenizer. 2025-11-17 10:40:06,294 - absl - INFO - Using default tokenizer. 
2025-11-17 10:40:10,870 - absl - INFO - Using default tokenizer. 2025-11-17 10:40:15,470 - absl - INFO - Using default tokenizer. 2025-11-17 10:40:20,288 - absl - INFO - Using default tokenizer. 2025-11-17 10:40:24,792 - absl - INFO - Using default tokenizer. 2025-11-17 10:40:29,307 - absl - INFO - Using default tokenizer. 2025-11-17 10:40:33,903 - absl - INFO - Using default tokenizer. 2025-11-17 10:40:38,410 - absl - INFO - Using default tokenizer. 2025-11-17 10:40:42,954 - absl - INFO - Using default tokenizer. 2025-11-17 10:40:47,528 - absl - INFO - Using default tokenizer. 2025-11-17 10:40:52,036 - absl - INFO - Using default tokenizer. 2025-11-17 10:40:56,619 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:01,129 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:05,609 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:10,221 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:14,756 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:19,276 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:23,897 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:28,457 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:32,977 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:37,625 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:42,381 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:46,921 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:51,546 - absl - INFO - Using default tokenizer. 2025-11-17 10:41:56,097 - absl - INFO - Using default tokenizer. 2025-11-17 10:42:00,692 - absl - INFO - Using default tokenizer. 2025-11-17 10:42:05,334 - absl - INFO - Using default tokenizer. 2025-11-17 10:42:09,892 - absl - INFO - Using default tokenizer. 2025-11-17 10:42:14,481 - absl - INFO - Using default tokenizer. 2025-11-17 10:42:19,110 - absl - INFO - Using default tokenizer. 2025-11-17 10:42:23,673 - absl - INFO - Using default tokenizer. 
2025-11-17 10:42:28,355 - absl - INFO - Using default tokenizer. 2025-11-17 10:42:32,914 - absl - INFO - Using default tokenizer. 2025-11-17 10:42:37,395 - absl - INFO - Using default tokenizer. 2025-11-17 10:42:41,995 - absl - INFO - Using default tokenizer. 2025-11-17 10:42:46,533 - absl - INFO - Using default tokenizer. 2025-11-17 10:42:51,042 - absl - INFO - Using default tokenizer. 2025-11-17 10:42:55,633 - absl - INFO - Using default tokenizer. 2025-11-17 10:43:00,017 - root - INFO - Epoch 3/5 eval loss: 1.6557310176274134, eval rougeL: 0.12576593720645066 2025-11-17 10:43:00,130 - absl - INFO - Using default tokenizer. 2025-11-17 10:43:04,644 - root - INFO - Step 4001/7150 train rougeL: 0.09341172184943963 2025-11-17 10:43:04,969 - root - INFO - Step 4001/7150 loss: 1.670107126235962, total_norm: 12.115103721618652 2025-11-17 10:43:48,264 - absl - INFO - Using default tokenizer. 2025-11-17 10:43:52,703 - root - INFO - Step 4101/7150 train rougeL: 0.1297813312222549 2025-11-17 10:43:53,027 - root - INFO - Step 4101/7150 loss: 1.4866772890090942, total_norm: 9.36847972869873 2025-11-17 10:44:36,328 - absl - INFO - Using default tokenizer. 2025-11-17 10:44:40,952 - root - INFO - Step 4201/7150 train rougeL: 0.14583939450690894 2025-11-17 10:44:41,277 - root - INFO - Step 4201/7150 loss: 1.3981727361679077, total_norm: 10.50740909576416 2025-11-17 10:45:19,931 - root - INFO - Epoch 3/5 finished 2025-11-17 10:45:20,048 - absl - INFO - Using default tokenizer. 2025-11-17 10:45:24,689 - absl - INFO - Using default tokenizer. 2025-11-17 10:45:29,346 - absl - INFO - Using default tokenizer. 2025-11-17 10:45:34,069 - absl - INFO - Using default tokenizer. 2025-11-17 10:45:38,746 - absl - INFO - Using default tokenizer. 2025-11-17 10:45:43,425 - absl - INFO - Using default tokenizer. 2025-11-17 10:45:48,201 - absl - INFO - Using default tokenizer. 2025-11-17 10:45:52,815 - absl - INFO - Using default tokenizer. 
2025-11-17 10:45:57,489 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:02,205 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:06,916 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:11,817 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:16,646 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:21,346 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:26,197 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:30,797 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:35,449 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:40,039 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:44,532 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:49,169 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:53,660 - absl - INFO - Using default tokenizer. 2025-11-17 10:46:58,151 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:02,776 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:07,241 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:11,790 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:16,466 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:20,943 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:25,443 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:30,119 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:34,601 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:39,115 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:43,746 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:48,448 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:52,916 - absl - INFO - Using default tokenizer. 2025-11-17 10:47:57,500 - absl - INFO - Using default tokenizer. 2025-11-17 10:48:02,005 - absl - INFO - Using default tokenizer. 2025-11-17 10:48:06,538 - absl - INFO - Using default tokenizer. 2025-11-17 10:48:11,142 - absl - INFO - Using default tokenizer. 
2025-11-17 10:48:15,607 - absl - INFO - Using default tokenizer. 2025-11-17 10:48:20,115 - absl - INFO - Using default tokenizer. 2025-11-17 10:48:24,735 - absl - INFO - Using default tokenizer. 2025-11-17 10:48:29,197 - absl - INFO - Using default tokenizer. 2025-11-17 10:48:33,755 - absl - INFO - Using default tokenizer. 2025-11-17 10:48:38,340 - absl - INFO - Using default tokenizer. 2025-11-17 10:48:42,834 - absl - INFO - Using default tokenizer. 2025-11-17 10:48:47,434 - absl - INFO - Using default tokenizer. 2025-11-17 10:48:52,172 - absl - INFO - Using default tokenizer. 2025-11-17 10:48:57,018 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:01,849 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:06,451 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:11,156 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:15,664 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:20,200 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:24,646 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:29,590 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:34,534 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:39,386 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:44,221 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:49,233 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:54,073 - absl - INFO - Using default tokenizer. 2025-11-17 10:49:58,902 - absl - INFO - Using default tokenizer. 2025-11-17 10:50:03,712 - absl - INFO - Using default tokenizer. 2025-11-17 10:50:08,211 - absl - INFO - Using default tokenizer. 2025-11-17 10:50:12,553 - root - INFO - Epoch 3/5 eval loss: 1.653796877179827, eval rougeL: 0.1272040227159578 2025-11-17 10:50:22,353 - root - INFO - Epoch 4/5 2025-11-17 10:50:26,852 - absl - INFO - Using default tokenizer. 
2025-11-17 10:50:31,364 - root - INFO - Step 4301/7150 train rougeL: 0.12061293185318069 2025-11-17 10:50:31,689 - root - INFO - Step 4301/7150 loss: 1.484779953956604, total_norm: 8.997773170471191 2025-11-17 10:51:15,053 - absl - INFO - Using default tokenizer. 2025-11-17 10:51:19,455 - root - INFO - Step 4401/7150 train rougeL: 0.1343199929023872 2025-11-17 10:51:19,782 - root - INFO - Step 4401/7150 loss: 1.2528334856033325, total_norm: 8.899775505065918