Instructions to use Arthur-Tsai/ht-stmini-cls-v6_ftis_noPretrain-msm-pos with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Arthur-Tsai/ht-stmini-cls-v6_ftis_noPretrain-msm-pos with Transformers:
# Load model directly from transformers import HiTrans model = HiTrans.from_pretrained("Arthur-Tsai/ht-stmini-cls-v6_ftis_noPretrain-msm-pos", dtype="auto") - Notebooks
- Google Colab
- Kaggle
End of training
Browse files- README.md +214 -158
- model.safetensors +1 -1
- runs/0-by=2006-psr=1.0/events.out.tfevents.1747568036.yara2.19173.1 +3 -0
README.md
CHANGED
|
@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 16 |
|
| 17 |
This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
|
| 18 |
It achieves the following results on the evaluation set:
|
| 19 |
-
- Loss:
|
| 20 |
-
- Accuracy: 0.
|
| 21 |
-
- Macro F1: 0.
|
| 22 |
|
| 23 |
## Model description
|
| 24 |
|
|
@@ -44,166 +44,222 @@ The following hyperparameters were used during training:
|
|
| 44 |
- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 45 |
- lr_scheduler_type: linear
|
| 46 |
- lr_scheduler_warmup_steps: 6733
|
| 47 |
-
- training_steps:
|
| 48 |
|
| 49 |
### Training results
|
| 50 |
|
| 51 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Macro F1 |
|
| 52 |
|:-------------:|:--------:|:-----:|:---------------:|:--------:|:--------:|
|
| 53 |
-
|
|
| 54 |
-
|
|
| 55 |
-
| 7.
|
| 56 |
-
|
|
| 57 |
-
| 5.
|
| 58 |
-
| 4.
|
| 59 |
-
|
|
| 60 |
-
| 3.
|
| 61 |
-
| 3.
|
| 62 |
-
| 2.
|
| 63 |
-
| 2.
|
| 64 |
-
| 2.
|
| 65 |
-
| 2.
|
| 66 |
-
| 2.
|
| 67 |
-
| 2.
|
| 68 |
-
| 2.
|
| 69 |
-
|
|
| 70 |
-
|
|
| 71 |
-
|
|
| 72 |
-
|
|
| 73 |
-
| 1.
|
| 74 |
-
| 1.
|
| 75 |
-
| 1.
|
| 76 |
-
| 1.
|
| 77 |
-
| 1.
|
| 78 |
-
| 1.
|
| 79 |
-
| 1.
|
| 80 |
-
| 1.
|
| 81 |
-
| 1.
|
| 82 |
-
| 1.
|
| 83 |
-
| 1.
|
| 84 |
-
| 1.
|
| 85 |
-
| 1.
|
| 86 |
-
|
|
| 87 |
-
|
|
| 88 |
-
|
|
| 89 |
-
| 0.
|
| 90 |
-
| 0.
|
| 91 |
-
| 0.
|
| 92 |
-
| 0.
|
| 93 |
-
| 0.
|
| 94 |
-
| 0.
|
| 95 |
-
| 0.
|
| 96 |
-
| 0.
|
| 97 |
-
| 0.
|
| 98 |
-
| 0.
|
| 99 |
-
| 0.
|
| 100 |
-
| 0.
|
| 101 |
-
| 0.
|
| 102 |
-
| 0.
|
| 103 |
-
| 0.
|
| 104 |
-
| 0.
|
| 105 |
-
| 0.
|
| 106 |
-
| 0.
|
| 107 |
-
| 0.
|
| 108 |
-
| 0.
|
| 109 |
-
| 0.
|
| 110 |
-
| 0.
|
| 111 |
-
| 0.
|
| 112 |
-
| 0.
|
| 113 |
-
| 0.
|
| 114 |
-
| 0.
|
| 115 |
-
| 0.
|
| 116 |
-
| 0.
|
| 117 |
-
| 0.
|
| 118 |
-
| 0.
|
| 119 |
-
| 0.
|
| 120 |
-
| 0.
|
| 121 |
-
| 0.
|
| 122 |
-
| 0.
|
| 123 |
-
| 0.
|
| 124 |
-
| 0.
|
| 125 |
-
| 0.
|
| 126 |
-
| 0.
|
| 127 |
-
| 0.
|
| 128 |
-
| 0.
|
| 129 |
-
| 0.
|
| 130 |
-
| 0.
|
| 131 |
-
| 0.
|
| 132 |
-
| 0.
|
| 133 |
-
| 0.
|
| 134 |
-
| 0.
|
| 135 |
-
| 0.
|
| 136 |
-
| 0.
|
| 137 |
-
| 0.
|
| 138 |
-
| 0.
|
| 139 |
-
| 0.
|
| 140 |
-
| 0.
|
| 141 |
-
| 0.
|
| 142 |
-
| 0.
|
| 143 |
-
| 0.
|
| 144 |
-
| 0.
|
| 145 |
-
| 0.
|
| 146 |
-
| 0.
|
| 147 |
-
| 0.
|
| 148 |
-
| 0.
|
| 149 |
-
| 0.
|
| 150 |
-
| 0.
|
| 151 |
-
| 0.
|
| 152 |
-
| 0.
|
| 153 |
-
| 0.
|
| 154 |
-
| 0.
|
| 155 |
-
| 0.
|
| 156 |
-
| 0.
|
| 157 |
-
| 0.
|
| 158 |
-
| 0.
|
| 159 |
-
| 0.
|
| 160 |
-
| 0.
|
| 161 |
-
| 0.
|
| 162 |
-
| 0.
|
| 163 |
-
| 0.
|
| 164 |
-
| 0.
|
| 165 |
-
| 0.
|
| 166 |
-
| 0.
|
| 167 |
-
| 0.
|
| 168 |
-
| 0.
|
| 169 |
-
| 0.
|
| 170 |
-
| 0.
|
| 171 |
-
| 0.
|
| 172 |
-
| 0.
|
| 173 |
-
| 0.
|
| 174 |
-
| 0.
|
| 175 |
-
| 0.
|
| 176 |
-
| 0.
|
| 177 |
-
| 0.
|
| 178 |
-
| 0.
|
| 179 |
-
| 0.
|
| 180 |
-
| 0.
|
| 181 |
-
| 0.
|
| 182 |
-
| 0.
|
| 183 |
-
| 0.
|
| 184 |
-
| 0.
|
| 185 |
-
| 0.
|
| 186 |
-
| 0.
|
| 187 |
-
| 0.
|
| 188 |
-
| 0.
|
| 189 |
-
| 0.
|
| 190 |
-
| 0.
|
| 191 |
-
| 0.
|
| 192 |
-
| 0.
|
| 193 |
-
| 0.
|
| 194 |
-
| 0.
|
| 195 |
-
| 0.
|
| 196 |
-
| 0.
|
| 197 |
-
| 0.
|
| 198 |
-
| 0.
|
| 199 |
-
| 0.
|
| 200 |
-
| 0.
|
| 201 |
-
| 0.
|
| 202 |
-
| 0.
|
| 203 |
-
| 0.
|
| 204 |
-
| 0.
|
| 205 |
-
| 0.
|
| 206 |
-
| 0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 207 |
|
| 208 |
|
| 209 |
### Framework versions
|
|
|
|
| 16 |
|
| 17 |
This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
|
| 18 |
It achieves the following results on the evaluation set:
|
| 19 |
+
- Loss: 0.9005
|
| 20 |
+
- Accuracy: 0.9455
|
| 21 |
+
- Macro F1: 0.8570
|
| 22 |
|
| 23 |
## Model description
|
| 24 |
|
|
|
|
| 44 |
- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 45 |
- lr_scheduler_type: linear
|
| 46 |
- lr_scheduler_warmup_steps: 6733
|
| 47 |
+
- training_steps: 134675
|
| 48 |
|
| 49 |
### Training results
|
| 50 |
|
| 51 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Macro F1 |
|
| 52 |
|:-------------:|:--------:|:-----:|:---------------:|:--------:|:--------:|
|
| 53 |
+
| 39.0542 | 0.0015 | 202 | 52.4550 | 0.0872 | 0.0428 |
|
| 54 |
+
| 9.995 | 1.0015 | 404 | 111.6770 | 0.4138 | 0.1069 |
|
| 55 |
+
| 7.019 | 2.0015 | 606 | 151.1546 | 0.5324 | 0.1287 |
|
| 56 |
+
| 5.9896 | 3.0015 | 808 | 161.7299 | 0.5608 | 0.1355 |
|
| 57 |
+
| 5.2331 | 4.0015 | 1010 | 127.6348 | 0.5911 | 0.1425 |
|
| 58 |
+
| 4.4417 | 5.0015 | 1212 | 85.7775 | 0.6060 | 0.1540 |
|
| 59 |
+
| 3.7238 | 6.0015 | 1414 | 56.9365 | 0.5991 | 0.1518 |
|
| 60 |
+
| 3.3202 | 7.0015 | 1616 | 38.2729 | 0.6192 | 0.1640 |
|
| 61 |
+
| 3.0289 | 8.0015 | 1818 | 29.3182 | 0.6116 | 0.1670 |
|
| 62 |
+
| 2.851 | 9.0015 | 2020 | 22.7821 | 0.6146 | 0.1740 |
|
| 63 |
+
| 2.6193 | 10.0015 | 2222 | 20.0463 | 0.6157 | 0.1804 |
|
| 64 |
+
| 2.5279 | 11.0015 | 2424 | 12.6429 | 0.6331 | 0.1994 |
|
| 65 |
+
| 2.5246 | 12.0015 | 2626 | 11.5103 | 0.6461 | 0.2189 |
|
| 66 |
+
| 2.387 | 13.0015 | 2828 | 9.9416 | 0.6469 | 0.2234 |
|
| 67 |
+
| 2.2723 | 14.0015 | 3030 | 9.5941 | 0.6758 | 0.2647 |
|
| 68 |
+
| 2.1285 | 15.0015 | 3232 | 9.1184 | 0.7111 | 0.2847 |
|
| 69 |
+
| 1.9754 | 16.0015 | 3434 | 7.8173 | 0.7227 | 0.3092 |
|
| 70 |
+
| 1.9499 | 17.0015 | 3636 | 6.2093 | 0.7161 | 0.3239 |
|
| 71 |
+
| 1.9703 | 18.0015 | 3838 | 6.8571 | 0.7220 | 0.3407 |
|
| 72 |
+
| 1.8392 | 19.0015 | 4040 | 5.9043 | 0.7307 | 0.3679 |
|
| 73 |
+
| 1.6563 | 20.0015 | 4242 | 6.6418 | 0.7494 | 0.3731 |
|
| 74 |
+
| 1.7206 | 21.0015 | 4444 | 5.9527 | 0.7415 | 0.3983 |
|
| 75 |
+
| 1.4991 | 22.0015 | 4646 | 7.9843 | 0.7538 | 0.4166 |
|
| 76 |
+
| 1.4357 | 23.0015 | 4848 | 6.3545 | 0.7619 | 0.4368 |
|
| 77 |
+
| 1.3359 | 24.0015 | 5050 | 6.9233 | 0.7764 | 0.4465 |
|
| 78 |
+
| 1.4795 | 25.0015 | 5252 | 6.6354 | 0.7810 | 0.4767 |
|
| 79 |
+
| 1.3976 | 26.0015 | 5454 | 7.4885 | 0.7918 | 0.4841 |
|
| 80 |
+
| 1.2662 | 27.0015 | 5656 | 7.2623 | 0.7961 | 0.4800 |
|
| 81 |
+
| 1.1697 | 28.0015 | 5858 | 6.7678 | 0.7819 | 0.4808 |
|
| 82 |
+
| 1.1994 | 29.0015 | 6060 | 8.9266 | 0.8038 | 0.5040 |
|
| 83 |
+
| 1.1654 | 30.0015 | 6262 | 8.9018 | 0.7965 | 0.4944 |
|
| 84 |
+
| 1.0817 | 31.0015 | 6464 | 8.8383 | 0.8060 | 0.5435 |
|
| 85 |
+
| 1.0778 | 32.0015 | 6666 | 8.7811 | 0.8029 | 0.5208 |
|
| 86 |
+
| 0.985 | 33.0015 | 6868 | 9.7044 | 0.8169 | 0.5330 |
|
| 87 |
+
| 0.9646 | 34.0015 | 7070 | 10.2150 | 0.8206 | 0.5506 |
|
| 88 |
+
| 0.9595 | 35.0015 | 7272 | 11.9668 | 0.8233 | 0.5671 |
|
| 89 |
+
| 0.8569 | 36.0015 | 7474 | 11.1166 | 0.8329 | 0.5819 |
|
| 90 |
+
| 0.8767 | 37.0015 | 7676 | 13.2788 | 0.8401 | 0.6010 |
|
| 91 |
+
| 0.8094 | 38.0015 | 7878 | 12.9696 | 0.8384 | 0.6067 |
|
| 92 |
+
| 0.7561 | 39.0015 | 8080 | 12.2524 | 0.8382 | 0.5987 |
|
| 93 |
+
| 0.6879 | 40.0015 | 8282 | 14.3620 | 0.8478 | 0.6204 |
|
| 94 |
+
| 0.7386 | 41.0015 | 8484 | 14.7484 | 0.8626 | 0.6337 |
|
| 95 |
+
| 0.6411 | 42.0015 | 8686 | 13.5941 | 0.8660 | 0.6437 |
|
| 96 |
+
| 0.6389 | 43.0015 | 8888 | 15.4027 | 0.8645 | 0.6406 |
|
| 97 |
+
| 0.6128 | 44.0015 | 9090 | 19.0751 | 0.8758 | 0.6587 |
|
| 98 |
+
| 0.496 | 45.0015 | 9292 | 18.6661 | 0.8661 | 0.6372 |
|
| 99 |
+
| 0.5358 | 46.0015 | 9494 | 14.1890 | 0.8757 | 0.6533 |
|
| 100 |
+
| 0.4955 | 47.0015 | 9696 | 17.5556 | 0.8721 | 0.6659 |
|
| 101 |
+
| 0.4513 | 48.0015 | 9898 | 16.8865 | 0.8847 | 0.6857 |
|
| 102 |
+
| 0.4363 | 49.0015 | 10100 | 15.3593 | 0.8801 | 0.6744 |
|
| 103 |
+
| 0.3792 | 50.0015 | 10302 | 14.4326 | 0.8903 | 0.6988 |
|
| 104 |
+
| 0.3841 | 51.0015 | 10504 | 15.5706 | 0.8961 | 0.7055 |
|
| 105 |
+
| 0.3951 | 52.0015 | 10706 | 16.9263 | 0.8958 | 0.7031 |
|
| 106 |
+
| 0.3896 | 53.0015 | 10908 | 16.1398 | 0.8921 | 0.7052 |
|
| 107 |
+
| 0.3263 | 54.0015 | 11110 | 14.3136 | 0.8982 | 0.7045 |
|
| 108 |
+
| 0.3028 | 55.0015 | 11312 | 13.7927 | 0.9024 | 0.7175 |
|
| 109 |
+
| 0.2973 | 56.0015 | 11514 | 15.8300 | 0.9022 | 0.7204 |
|
| 110 |
+
| 0.2723 | 57.0015 | 11716 | 12.2600 | 0.9057 | 0.7185 |
|
| 111 |
+
| 0.274 | 58.0015 | 11918 | 11.5149 | 0.8993 | 0.7156 |
|
| 112 |
+
| 0.2485 | 59.0015 | 12120 | 11.6746 | 0.9085 | 0.7371 |
|
| 113 |
+
| 0.2617 | 60.0015 | 12322 | 10.2702 | 0.9102 | 0.7333 |
|
| 114 |
+
| 0.2402 | 61.0015 | 12524 | 9.3862 | 0.9120 | 0.7369 |
|
| 115 |
+
| 0.2289 | 62.0015 | 12726 | 9.2598 | 0.9106 | 0.7368 |
|
| 116 |
+
| 0.2068 | 63.0015 | 12928 | 7.8129 | 0.9153 | 0.7521 |
|
| 117 |
+
| 0.2011 | 64.0015 | 13130 | 6.5932 | 0.9102 | 0.7382 |
|
| 118 |
+
| 0.1943 | 65.0015 | 13332 | 7.2363 | 0.9133 | 0.7565 |
|
| 119 |
+
| 0.1746 | 66.0015 | 13534 | 6.6193 | 0.9149 | 0.7529 |
|
| 120 |
+
| 0.1848 | 67.0015 | 13736 | 6.6123 | 0.9175 | 0.7517 |
|
| 121 |
+
| 0.1704 | 68.0015 | 13938 | 6.5004 | 0.9159 | 0.7531 |
|
| 122 |
+
| 0.1565 | 69.0015 | 14140 | 5.0556 | 0.9150 | 0.7578 |
|
| 123 |
+
| 0.1533 | 70.0015 | 14342 | 5.4416 | 0.9179 | 0.7662 |
|
| 124 |
+
| 0.1379 | 71.0015 | 14544 | 4.6997 | 0.9216 | 0.7713 |
|
| 125 |
+
| 0.1354 | 72.0015 | 14746 | 4.3266 | 0.9193 | 0.7666 |
|
| 126 |
+
| 0.1441 | 73.0015 | 14948 | 3.9965 | 0.9252 | 0.7782 |
|
| 127 |
+
| 0.1302 | 74.0015 | 15150 | 3.1910 | 0.9221 | 0.7709 |
|
| 128 |
+
| 0.132 | 75.0015 | 15352 | 3.2843 | 0.9252 | 0.7797 |
|
| 129 |
+
| 0.1269 | 76.0015 | 15554 | 3.1127 | 0.9253 | 0.7860 |
|
| 130 |
+
| 0.117 | 77.0015 | 15756 | 2.9740 | 0.9306 | 0.7843 |
|
| 131 |
+
| 0.1135 | 78.0015 | 15958 | 3.0400 | 0.9273 | 0.7851 |
|
| 132 |
+
| 0.1095 | 79.0015 | 16160 | 2.5968 | 0.9261 | 0.7831 |
|
| 133 |
+
| 0.1012 | 80.0015 | 16362 | 2.8566 | 0.9262 | 0.7869 |
|
| 134 |
+
| 0.0958 | 81.0015 | 16564 | 2.4564 | 0.9285 | 0.7847 |
|
| 135 |
+
| 0.0963 | 82.0015 | 16766 | 2.4953 | 0.9333 | 0.8010 |
|
| 136 |
+
| 0.1026 | 83.0015 | 16968 | 2.0616 | 0.9292 | 0.7923 |
|
| 137 |
+
| 0.0853 | 84.0015 | 17170 | 1.8190 | 0.9269 | 0.7881 |
|
| 138 |
+
| 0.0884 | 85.0015 | 17372 | 1.7115 | 0.9261 | 0.7899 |
|
| 139 |
+
| 0.0911 | 86.0015 | 17574 | 1.8535 | 0.9275 | 0.7951 |
|
| 140 |
+
| 0.0853 | 87.0015 | 17776 | 1.5836 | 0.9269 | 0.7920 |
|
| 141 |
+
| 0.0915 | 88.0015 | 17978 | 1.5385 | 0.9321 | 0.8054 |
|
| 142 |
+
| 0.0762 | 89.0015 | 18180 | 1.5013 | 0.9351 | 0.8112 |
|
| 143 |
+
| 0.0876 | 90.0015 | 18382 | 1.6770 | 0.9298 | 0.7870 |
|
| 144 |
+
| 0.0725 | 91.0015 | 18584 | 1.5174 | 0.9366 | 0.8054 |
|
| 145 |
+
| 0.079 | 92.0015 | 18786 | 1.5734 | 0.9339 | 0.8067 |
|
| 146 |
+
| 0.0658 | 93.0015 | 18988 | 1.4919 | 0.9314 | 0.8057 |
|
| 147 |
+
| 0.0661 | 94.0015 | 19190 | 1.3583 | 0.9343 | 0.7936 |
|
| 148 |
+
| 0.0717 | 95.0015 | 19392 | 1.3954 | 0.9374 | 0.8100 |
|
| 149 |
+
| 0.0617 | 96.0015 | 19594 | 1.4305 | 0.9348 | 0.8151 |
|
| 150 |
+
| 0.0665 | 97.0015 | 19796 | 1.3416 | 0.9334 | 0.7926 |
|
| 151 |
+
| 0.0767 | 98.0015 | 19998 | 1.2812 | 0.9351 | 0.8144 |
|
| 152 |
+
| 0.0664 | 99.0015 | 20200 | 1.4248 | 0.9345 | 0.8081 |
|
| 153 |
+
| 0.055 | 100.0015 | 20402 | 1.3525 | 0.9331 | 0.8138 |
|
| 154 |
+
| 0.0586 | 101.0015 | 20604 | 1.2343 | 0.9327 | 0.8121 |
|
| 155 |
+
| 0.0647 | 102.0015 | 20806 | 1.1799 | 0.9326 | 0.8143 |
|
| 156 |
+
| 0.0575 | 103.0015 | 21008 | 1.2465 | 0.9338 | 0.8180 |
|
| 157 |
+
| 0.0564 | 104.0015 | 21210 | 1.2148 | 0.9355 | 0.8141 |
|
| 158 |
+
| 0.0571 | 105.0015 | 21412 | 1.3326 | 0.9334 | 0.8156 |
|
| 159 |
+
| 0.0522 | 106.0015 | 21614 | 1.2063 | 0.9362 | 0.8230 |
|
| 160 |
+
| 0.051 | 107.0015 | 21816 | 1.1642 | 0.9383 | 0.8248 |
|
| 161 |
+
| 0.07 | 108.0015 | 22018 | 1.1145 | 0.9385 | 0.8255 |
|
| 162 |
+
| 0.0538 | 109.0015 | 22220 | 1.1939 | 0.9358 | 0.8187 |
|
| 163 |
+
| 0.05 | 110.0015 | 22422 | 1.1498 | 0.9390 | 0.8298 |
|
| 164 |
+
| 0.0521 | 111.0015 | 22624 | 1.0604 | 0.9338 | 0.8170 |
|
| 165 |
+
| 0.0542 | 112.0015 | 22826 | 1.0632 | 0.9378 | 0.8220 |
|
| 166 |
+
| 0.0522 | 113.0015 | 23028 | 1.1671 | 0.9374 | 0.8276 |
|
| 167 |
+
| 0.0491 | 114.0015 | 23230 | 1.1449 | 0.9408 | 0.8300 |
|
| 168 |
+
| 0.0478 | 115.0015 | 23432 | 1.0877 | 0.9405 | 0.8300 |
|
| 169 |
+
| 0.0496 | 116.0015 | 23634 | 1.1114 | 0.9407 | 0.8278 |
|
| 170 |
+
| 0.047 | 117.0015 | 23836 | 1.0889 | 0.9401 | 0.8250 |
|
| 171 |
+
| 0.0464 | 118.0015 | 24038 | 1.0318 | 0.9421 | 0.8297 |
|
| 172 |
+
| 0.0425 | 119.0015 | 24240 | 0.9645 | 0.9362 | 0.8262 |
|
| 173 |
+
| 0.0445 | 120.0015 | 24442 | 1.0574 | 0.9369 | 0.8271 |
|
| 174 |
+
| 0.0423 | 121.0015 | 24644 | 0.9761 | 0.9417 | 0.8372 |
|
| 175 |
+
| 0.0444 | 122.0015 | 24846 | 1.0535 | 0.9376 | 0.8304 |
|
| 176 |
+
| 0.0463 | 123.0015 | 25048 | 0.9546 | 0.9398 | 0.8298 |
|
| 177 |
+
| 0.042 | 124.0015 | 25250 | 0.9689 | 0.9378 | 0.8336 |
|
| 178 |
+
| 0.0423 | 125.0015 | 25452 | 0.9090 | 0.9419 | 0.8335 |
|
| 179 |
+
| 0.0431 | 126.0015 | 25654 | 1.0322 | 0.9394 | 0.8357 |
|
| 180 |
+
| 0.0371 | 127.0015 | 25856 | 1.0071 | 0.9425 | 0.8339 |
|
| 181 |
+
| 0.0441 | 128.0015 | 26058 | 0.9907 | 0.9415 | 0.8351 |
|
| 182 |
+
| 0.0408 | 129.0015 | 26260 | 1.0066 | 0.9423 | 0.8170 |
|
| 183 |
+
| 0.0413 | 130.0015 | 26462 | 1.0439 | 0.9390 | 0.8256 |
|
| 184 |
+
| 0.0339 | 131.0015 | 26664 | 0.9631 | 0.9428 | 0.8358 |
|
| 185 |
+
| 0.035 | 132.0015 | 26866 | 1.0452 | 0.9365 | 0.8312 |
|
| 186 |
+
| 0.0349 | 133.0015 | 27068 | 1.0405 | 0.9362 | 0.8286 |
|
| 187 |
+
| 0.0395 | 134.0015 | 27270 | 1.0398 | 0.9387 | 0.8303 |
|
| 188 |
+
| 0.0352 | 135.0015 | 27472 | 0.9459 | 0.9420 | 0.8340 |
|
| 189 |
+
| 0.0376 | 136.0015 | 27674 | 0.9511 | 0.9412 | 0.8374 |
|
| 190 |
+
| 0.0362 | 137.0015 | 27876 | 1.0098 | 0.9388 | 0.8416 |
|
| 191 |
+
| 0.0335 | 138.0015 | 28078 | 1.0599 | 0.9422 | 0.8380 |
|
| 192 |
+
| 0.0309 | 139.0015 | 28280 | 1.1331 | 0.9377 | 0.8310 |
|
| 193 |
+
| 0.0354 | 140.0015 | 28482 | 1.1502 | 0.9369 | 0.8304 |
|
| 194 |
+
| 0.0357 | 141.0015 | 28684 | 0.9515 | 0.9386 | 0.8294 |
|
| 195 |
+
| 0.0313 | 142.0015 | 28886 | 0.9854 | 0.9403 | 0.8400 |
|
| 196 |
+
| 0.0329 | 143.0015 | 29088 | 1.0700 | 0.9362 | 0.8351 |
|
| 197 |
+
| 0.0333 | 144.0015 | 29290 | 0.9787 | 0.9429 | 0.8437 |
|
| 198 |
+
| 0.0329 | 145.0015 | 29492 | 1.0145 | 0.9323 | 0.8320 |
|
| 199 |
+
| 0.0318 | 146.0015 | 29694 | 0.9619 | 0.9402 | 0.8358 |
|
| 200 |
+
| 0.0322 | 147.0015 | 29896 | 1.1008 | 0.9348 | 0.8200 |
|
| 201 |
+
| 0.0352 | 148.0015 | 30098 | 0.9330 | 0.9411 | 0.8401 |
|
| 202 |
+
| 0.0309 | 149.0015 | 30300 | 0.9829 | 0.9421 | 0.8226 |
|
| 203 |
+
| 0.0317 | 150.0015 | 30502 | 0.9698 | 0.9424 | 0.8221 |
|
| 204 |
+
| 0.028 | 151.0015 | 30704 | 0.9358 | 0.9452 | 0.8475 |
|
| 205 |
+
| 0.0328 | 152.0015 | 30906 | 1.0448 | 0.9394 | 0.8177 |
|
| 206 |
+
| 0.0318 | 153.0015 | 31108 | 1.0614 | 0.9379 | 0.8396 |
|
| 207 |
+
| 0.0297 | 154.0015 | 31310 | 0.9583 | 0.9421 | 0.8429 |
|
| 208 |
+
| 0.0284 | 155.0015 | 31512 | 0.9899 | 0.9409 | 0.8418 |
|
| 209 |
+
| 0.0287 | 156.0015 | 31714 | 0.9172 | 0.9422 | 0.8430 |
|
| 210 |
+
| 0.0292 | 157.0015 | 31916 | 0.9322 | 0.9426 | 0.8473 |
|
| 211 |
+
| 0.0263 | 158.0015 | 32118 | 1.0263 | 0.9406 | 0.8202 |
|
| 212 |
+
| 0.0297 | 159.0015 | 32320 | 0.9233 | 0.9445 | 0.8252 |
|
| 213 |
+
| 0.0296 | 160.0015 | 32522 | 0.9406 | 0.9424 | 0.8459 |
|
| 214 |
+
| 0.0285 | 161.0015 | 32724 | 0.8934 | 0.9420 | 0.8246 |
|
| 215 |
+
| 0.0277 | 162.0015 | 32926 | 0.8199 | 0.9428 | 0.8454 |
|
| 216 |
+
| 0.0278 | 163.0015 | 33128 | 0.9287 | 0.9452 | 0.8332 |
|
| 217 |
+
| 0.0276 | 164.0015 | 33330 | 1.0221 | 0.9402 | 0.8431 |
|
| 218 |
+
| 0.0266 | 165.0015 | 33532 | 1.0262 | 0.9404 | 0.8476 |
|
| 219 |
+
| 0.0256 | 166.0015 | 33734 | 0.9846 | 0.9396 | 0.8426 |
|
| 220 |
+
| 0.0255 | 167.0015 | 33936 | 0.9476 | 0.9434 | 0.8509 |
|
| 221 |
+
| 0.0264 | 168.0015 | 34138 | 0.8982 | 0.9436 | 0.8305 |
|
| 222 |
+
| 0.0249 | 169.0015 | 34340 | 0.9160 | 0.9420 | 0.8455 |
|
| 223 |
+
| 0.0252 | 170.0015 | 34542 | 0.9905 | 0.9422 | 0.8283 |
|
| 224 |
+
| 0.0285 | 171.0015 | 34744 | 0.9653 | 0.9415 | 0.8406 |
|
| 225 |
+
| 0.0286 | 172.0015 | 34946 | 0.9691 | 0.9415 | 0.8445 |
|
| 226 |
+
| 0.0333 | 173.0015 | 35148 | 0.9424 | 0.9405 | 0.8421 |
|
| 227 |
+
| 0.0252 | 174.0015 | 35350 | 0.9120 | 0.9434 | 0.8485 |
|
| 228 |
+
| 0.0245 | 175.0015 | 35552 | 1.0208 | 0.9429 | 0.8278 |
|
| 229 |
+
| 0.0214 | 176.0015 | 35754 | 0.9905 | 0.9425 | 0.8302 |
|
| 230 |
+
| 0.0248 | 177.0015 | 35956 | 0.9479 | 0.9439 | 0.8288 |
|
| 231 |
+
| 0.0242 | 178.0015 | 36158 | 0.9858 | 0.9443 | 0.8525 |
|
| 232 |
+
| 0.0236 | 179.0015 | 36360 | 1.0952 | 0.9416 | 0.8255 |
|
| 233 |
+
| 0.0206 | 180.0015 | 36562 | 1.1354 | 0.9418 | 0.8276 |
|
| 234 |
+
| 0.0223 | 181.0015 | 36764 | 0.9461 | 0.9427 | 0.8295 |
|
| 235 |
+
| 0.0226 | 182.0015 | 36966 | 0.9072 | 0.9445 | 0.8334 |
|
| 236 |
+
| 0.0249 | 183.0015 | 37168 | 1.1476 | 0.9386 | 0.8449 |
|
| 237 |
+
| 0.0263 | 184.0015 | 37370 | 0.8881 | 0.9421 | 0.8321 |
|
| 238 |
+
| 0.0248 | 185.0015 | 37572 | 0.9298 | 0.9422 | 0.8537 |
|
| 239 |
+
| 0.0216 | 186.0015 | 37774 | 0.9200 | 0.9425 | 0.8352 |
|
| 240 |
+
| 0.0217 | 187.0015 | 37976 | 0.9245 | 0.9455 | 0.8526 |
|
| 241 |
+
| 0.0214 | 188.0015 | 38178 | 1.0350 | 0.9405 | 0.8252 |
|
| 242 |
+
| 0.022 | 189.0015 | 38380 | 0.8831 | 0.9455 | 0.8570 |
|
| 243 |
+
| 0.0206 | 190.0015 | 38582 | 0.8855 | 0.9448 | 0.8361 |
|
| 244 |
+
| 0.021 | 191.0015 | 38784 | 0.9974 | 0.9444 | 0.8513 |
|
| 245 |
+
| 0.0226 | 192.0015 | 38986 | 0.9566 | 0.9420 | 0.8506 |
|
| 246 |
+
| 0.0199 | 193.0015 | 39188 | 0.8891 | 0.9454 | 0.8328 |
|
| 247 |
+
| 0.0237 | 194.0015 | 39390 | 0.9330 | 0.9431 | 0.8525 |
|
| 248 |
+
| 0.0206 | 195.0015 | 39592 | 0.8964 | 0.9441 | 0.8327 |
|
| 249 |
+
| 0.0209 | 196.0015 | 39794 | 0.9579 | 0.9450 | 0.8326 |
|
| 250 |
+
| 0.0199 | 197.0015 | 39996 | 0.9376 | 0.9447 | 0.8342 |
|
| 251 |
+
| 0.0218 | 198.0015 | 40198 | 0.8677 | 0.9454 | 0.8358 |
|
| 252 |
+
| 0.0217 | 199.0015 | 40400 | 1.0234 | 0.9375 | 0.8222 |
|
| 253 |
+
| 0.0229 | 200.0015 | 40602 | 0.9920 | 0.9379 | 0.8282 |
|
| 254 |
+
| 0.0195 | 201.0015 | 40804 | 1.0083 | 0.9457 | 0.8355 |
|
| 255 |
+
| 0.0215 | 202.0015 | 41006 | 0.9446 | 0.9464 | 0.8368 |
|
| 256 |
+
| 0.02 | 203.0015 | 41208 | 0.9566 | 0.9449 | 0.8359 |
|
| 257 |
+
| 0.0194 | 204.0015 | 41410 | 0.8834 | 0.9438 | 0.8543 |
|
| 258 |
+
| 0.0185 | 205.0015 | 41612 | 0.9536 | 0.9454 | 0.8362 |
|
| 259 |
+
| 0.0216 | 206.0015 | 41814 | 0.8942 | 0.9458 | 0.8558 |
|
| 260 |
+
| 0.0178 | 207.0015 | 42016 | 1.0467 | 0.9412 | 0.8317 |
|
| 261 |
+
| 0.0176 | 208.0015 | 42218 | 0.9006 | 0.9468 | 0.8365 |
|
| 262 |
+
| 0.0223 | 209.0015 | 42420 | 0.8957 | 0.9429 | 0.8341 |
|
| 263 |
|
| 264 |
|
| 265 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 126037348
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2372455df658bb554e69016991e1b39d15825ad45c49876f22f9f5dec59f5279
|
| 3 |
size 126037348
|
runs/0-by=2006-psr=1.0/events.out.tfevents.1747568036.yara2.19173.1
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e9133ce471884200332b97e4dfcab3a3b0331a3ccc893ac3b580d87140fe41cd
|
| 3 |
+
size 852
|