Instructions to use Arthur-Tsai/ht-stmini-cls-v6_ftis_noPretrain-cssl-npsNonennsNone-masked with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Arthur-Tsai/ht-stmini-cls-v6_ftis_noPretrain-cssl-npsNonennsNone-masked with Transformers:
# Load model directly from transformers import HiTrans model = HiTrans.from_pretrained("Arthur-Tsai/ht-stmini-cls-v6_ftis_noPretrain-cssl-npsNonennsNone-masked", dtype="auto") - Notebooks
- Google Colab
- Kaggle
End of training
Browse files- README.md +253 -220
- model.safetensors +1 -1
- runs/0-by=2006-psr=0.25/events.out.tfevents.1747032164.ana2.143225.1 +3 -0
README.md
CHANGED
|
@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 16 |
|
| 17 |
This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
|
| 18 |
It achieves the following results on the evaluation set:
|
| 19 |
-
- Loss:
|
| 20 |
-
- Accuracy: 0.
|
| 21 |
-
- Macro F1: 0.
|
| 22 |
|
| 23 |
## Model description
|
| 24 |
|
|
@@ -50,223 +50,256 @@ The following hyperparameters were used during training:
|
|
| 50 |
|
| 51 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Macro F1 |
|
| 52 |
|:-------------:|:--------:|:-----:|:---------------:|:--------:|:--------:|
|
| 53 |
-
|
|
| 54 |
-
|
|
| 55 |
-
| 95.
|
| 56 |
-
|
|
| 57 |
-
| 69.
|
| 58 |
-
|
|
| 59 |
-
| 60.
|
| 60 |
-
|
|
| 61 |
-
| 54.
|
| 62 |
-
| 51.
|
| 63 |
-
| 48.
|
| 64 |
-
| 46.
|
| 65 |
-
| 44.
|
| 66 |
-
| 42.
|
| 67 |
-
| 40.
|
| 68 |
-
| 38.
|
| 69 |
-
| 36.
|
| 70 |
-
| 34.
|
| 71 |
-
| 33.
|
| 72 |
-
|
|
| 73 |
-
| 30.
|
| 74 |
-
| 28.
|
| 75 |
-
|
|
| 76 |
-
|
|
| 77 |
-
|
|
| 78 |
-
|
|
| 79 |
-
|
|
| 80 |
-
| 19.
|
| 81 |
-
| 18.
|
| 82 |
-
| 17.
|
| 83 |
-
| 16.
|
| 84 |
-
| 15.
|
| 85 |
-
|
|
| 86 |
-
| 13.
|
| 87 |
-
| 12.
|
| 88 |
-
| 11.
|
| 89 |
-
| 11.
|
| 90 |
-
| 10.
|
| 91 |
-
| 9.
|
| 92 |
-
| 9.
|
| 93 |
-
| 8.
|
| 94 |
-
| 8.
|
| 95 |
-
| 7.
|
| 96 |
-
| 7.
|
| 97 |
-
| 6.
|
| 98 |
-
| 6.
|
| 99 |
-
| 5.
|
| 100 |
-
| 5.
|
| 101 |
-
| 4.
|
| 102 |
-
| 4.
|
| 103 |
-
| 4.
|
| 104 |
-
| 4.
|
| 105 |
-
| 3.
|
| 106 |
-
| 3.
|
| 107 |
-
| 3.
|
| 108 |
-
| 3.
|
| 109 |
-
| 2.
|
| 110 |
-
| 2.
|
| 111 |
-
| 2.
|
| 112 |
-
| 2.
|
| 113 |
-
| 2.
|
| 114 |
-
| 2.
|
| 115 |
-
| 1.
|
| 116 |
-
| 1.
|
| 117 |
-
| 1.
|
| 118 |
-
| 1.
|
| 119 |
-
| 1.
|
| 120 |
-
| 1.
|
| 121 |
-
| 0.
|
| 122 |
-
| 0.
|
| 123 |
-
| 0.
|
| 124 |
-
| 0.
|
| 125 |
-
| 0.
|
| 126 |
-
| 0.
|
| 127 |
-
| 0.
|
| 128 |
-
| 0.
|
| 129 |
-
| 0.
|
| 130 |
-
| 0.
|
| 131 |
-
| 0.
|
| 132 |
-
| 0.
|
| 133 |
-
| 0.
|
| 134 |
-
| 0.
|
| 135 |
-
| 0.
|
| 136 |
-
| 0.
|
| 137 |
-
| 0.
|
| 138 |
-
| 0.
|
| 139 |
-
| 0.
|
| 140 |
-
| 0.
|
| 141 |
-
| 0.
|
| 142 |
-
| 0.
|
| 143 |
-
| 0.
|
| 144 |
-
| 0.
|
| 145 |
-
| 0.
|
| 146 |
-
| 0.
|
| 147 |
-
| 0.
|
| 148 |
-
| 0.
|
| 149 |
-
| 0.
|
| 150 |
-
| 0.
|
| 151 |
-
| 0.
|
| 152 |
-
| 0.
|
| 153 |
-
| 0.
|
| 154 |
-
| 0.
|
| 155 |
-
| 0.
|
| 156 |
-
| 0.
|
| 157 |
-
| 0.
|
| 158 |
-
| 0.
|
| 159 |
-
| 0.
|
| 160 |
-
| 0.
|
| 161 |
-
| 0.
|
| 162 |
-
| 0.
|
| 163 |
-
| 0.
|
| 164 |
-
| 0.
|
| 165 |
-
| 0.
|
| 166 |
-
| 0.
|
| 167 |
-
| 0.
|
| 168 |
-
| 0.
|
| 169 |
-
| 0.
|
| 170 |
-
| 0.
|
| 171 |
-
| 0.
|
| 172 |
-
| 0.
|
| 173 |
-
| 0.
|
| 174 |
-
| 0.
|
| 175 |
-
| 0.
|
| 176 |
-
| 0.
|
| 177 |
-
| 0.
|
| 178 |
-
| 0.
|
| 179 |
-
| 0.
|
| 180 |
-
| 0.
|
| 181 |
-
| 0.
|
| 182 |
-
| 0.
|
| 183 |
-
| 0.
|
| 184 |
-
| 0.
|
| 185 |
-
| 0.
|
| 186 |
-
| 0.
|
| 187 |
-
| 0.
|
| 188 |
-
| 0.
|
| 189 |
-
| 0.
|
| 190 |
-
| 0.
|
| 191 |
-
| 0.
|
| 192 |
-
| 0.
|
| 193 |
-
| 0.
|
| 194 |
-
| 0.
|
| 195 |
-
| 0.
|
| 196 |
-
| 0.
|
| 197 |
-
| 0.
|
| 198 |
-
| 0.
|
| 199 |
-
| 0.
|
| 200 |
-
| 0.
|
| 201 |
-
| 0.
|
| 202 |
-
| 0.
|
| 203 |
-
| 0.
|
| 204 |
-
| 0.
|
| 205 |
-
| 0.
|
| 206 |
-
| 0.
|
| 207 |
-
| 0.
|
| 208 |
-
| 0.
|
| 209 |
-
| 0.
|
| 210 |
-
| 0.
|
| 211 |
-
| 0.
|
| 212 |
-
| 0.
|
| 213 |
-
| 0.
|
| 214 |
-
| 0.
|
| 215 |
-
| 0.
|
| 216 |
-
| 0.
|
| 217 |
-
| 0.
|
| 218 |
-
| 0.
|
| 219 |
-
| 0.
|
| 220 |
-
| 0.
|
| 221 |
-
| 0.
|
| 222 |
-
| 0.
|
| 223 |
-
| 0.
|
| 224 |
-
| 0.
|
| 225 |
-
| 0.
|
| 226 |
-
| 0.
|
| 227 |
-
| 0.
|
| 228 |
-
| 0.
|
| 229 |
-
| 0.
|
| 230 |
-
| 0.
|
| 231 |
-
| 0.
|
| 232 |
-
| 0.
|
| 233 |
-
| 0.
|
| 234 |
-
| 0.
|
| 235 |
-
| 0.
|
| 236 |
-
| 0.
|
| 237 |
-
| 0.
|
| 238 |
-
| 0.
|
| 239 |
-
| 0.
|
| 240 |
-
| 0.
|
| 241 |
-
| 0.
|
| 242 |
-
| 0.
|
| 243 |
-
| 0.
|
| 244 |
-
| 0.
|
| 245 |
-
| 0.
|
| 246 |
-
| 0.
|
| 247 |
-
| 0.
|
| 248 |
-
| 0.
|
| 249 |
-
| 0.
|
| 250 |
-
| 0.
|
| 251 |
-
| 0.
|
| 252 |
-
| 0.
|
| 253 |
-
| 0.
|
| 254 |
-
| 0.
|
| 255 |
-
| 0.
|
| 256 |
-
| 0.
|
| 257 |
-
| 0.
|
| 258 |
-
| 0.
|
| 259 |
-
| 0.
|
| 260 |
-
| 0.
|
| 261 |
-
| 0.
|
| 262 |
-
| 0.
|
| 263 |
-
| 0.
|
| 264 |
-
| 0.
|
| 265 |
-
| 0.
|
| 266 |
-
| 0.
|
| 267 |
-
| 0.
|
| 268 |
-
| 0.
|
| 269 |
-
| 0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 270 |
|
| 271 |
|
| 272 |
### Framework versions
|
|
|
|
| 16 |
|
| 17 |
This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
|
| 18 |
It achieves the following results on the evaluation set:
|
| 19 |
+
- Loss: 23.8621
|
| 20 |
+
- Accuracy: 0.9006
|
| 21 |
+
- Macro F1: 0.7591
|
| 22 |
|
| 23 |
## Model description
|
| 24 |
|
|
|
|
| 50 |
|
| 51 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Macro F1 |
|
| 52 |
|:-------------:|:--------:|:-----:|:---------------:|:--------:|:--------:|
|
| 53 |
+
| 1071.1105 | 0.0013 | 174 | 916.9260 | 0.0519 | 0.0275 |
|
| 54 |
+
| 307.3144 | 1.0013 | 348 | 220.1818 | 0.0904 | 0.0378 |
|
| 55 |
+
| 95.291 | 2.0013 | 522 | 327.4888 | 0.2279 | 0.0661 |
|
| 56 |
+
| 77.9946 | 3.0013 | 696 | 354.5122 | 0.3472 | 0.0917 |
|
| 57 |
+
| 69.1096 | 4.0013 | 870 | 385.2865 | 0.4179 | 0.1059 |
|
| 58 |
+
| 64.8772 | 5.0013 | 1044 | 409.2585 | 0.4601 | 0.1145 |
|
| 59 |
+
| 60.4624 | 6.0012 | 1218 | 361.9901 | 0.4791 | 0.1188 |
|
| 60 |
+
| 57.818 | 7.0012 | 1392 | 397.4457 | 0.4991 | 0.1244 |
|
| 61 |
+
| 54.002 | 8.0012 | 1566 | 398.1290 | 0.5117 | 0.1266 |
|
| 62 |
+
| 51.7324 | 9.0012 | 1740 | 390.6792 | 0.5269 | 0.1295 |
|
| 63 |
+
| 48.178 | 10.0012 | 1914 | 358.7085 | 0.5314 | 0.1311 |
|
| 64 |
+
| 46.1396 | 11.0012 | 2088 | 281.8654 | 0.5355 | 0.1321 |
|
| 65 |
+
| 44.6578 | 12.0012 | 2262 | 248.2498 | 0.5428 | 0.1342 |
|
| 66 |
+
| 42.1772 | 13.0012 | 2436 | 220.6902 | 0.5463 | 0.1357 |
|
| 67 |
+
| 40.186 | 14.0012 | 2610 | 180.6336 | 0.5597 | 0.1390 |
|
| 68 |
+
| 38.2952 | 15.0012 | 2784 | 167.0331 | 0.5601 | 0.1397 |
|
| 69 |
+
| 36.5276 | 16.0012 | 2958 | 157.5534 | 0.5710 | 0.1424 |
|
| 70 |
+
| 34.946 | 17.0012 | 3132 | 148.7067 | 0.5715 | 0.1445 |
|
| 71 |
+
| 33.2803 | 18.0012 | 3306 | 128.2341 | 0.5720 | 0.1459 |
|
| 72 |
+
| 32.1226 | 19.0012 | 3480 | 122.0039 | 0.5979 | 0.1519 |
|
| 73 |
+
| 30.4977 | 20.0011 | 3654 | 106.4274 | 0.5800 | 0.1536 |
|
| 74 |
+
| 28.9133 | 21.0011 | 3828 | 108.8758 | 0.6021 | 0.1601 |
|
| 75 |
+
| 27.0074 | 22.0011 | 4002 | 100.3312 | 0.5978 | 0.1629 |
|
| 76 |
+
| 26.0635 | 23.0011 | 4176 | 93.0917 | 0.6021 | 0.1671 |
|
| 77 |
+
| 24.0991 | 24.0011 | 4350 | 88.3154 | 0.6072 | 0.1748 |
|
| 78 |
+
| 22.1448 | 25.0011 | 4524 | 82.2642 | 0.6317 | 0.1771 |
|
| 79 |
+
| 21.3017 | 26.0011 | 4698 | 79.5632 | 0.6511 | 0.1937 |
|
| 80 |
+
| 19.762 | 27.0011 | 4872 | 72.8238 | 0.6553 | 0.1947 |
|
| 81 |
+
| 18.3492 | 28.0011 | 5046 | 71.8495 | 0.6644 | 0.2039 |
|
| 82 |
+
| 17.3736 | 29.0011 | 5220 | 71.1042 | 0.6819 | 0.2250 |
|
| 83 |
+
| 16.3902 | 30.0011 | 5394 | 70.4107 | 0.6914 | 0.2287 |
|
| 84 |
+
| 15.1843 | 31.0011 | 5568 | 65.3933 | 0.6926 | 0.2372 |
|
| 85 |
+
| 15.1916 | 32.0011 | 5742 | 65.4141 | 0.6879 | 0.2476 |
|
| 86 |
+
| 13.469 | 33.0010 | 5916 | 60.5838 | 0.7097 | 0.2581 |
|
| 87 |
+
| 12.9061 | 34.0010 | 6090 | 59.3015 | 0.6963 | 0.2640 |
|
| 88 |
+
| 11.7938 | 35.0010 | 6264 | 67.4723 | 0.7167 | 0.2896 |
|
| 89 |
+
| 11.1503 | 36.0010 | 6438 | 55.8701 | 0.7292 | 0.3013 |
|
| 90 |
+
| 10.354 | 37.0010 | 6612 | 55.7133 | 0.7247 | 0.3055 |
|
| 91 |
+
| 9.9429 | 38.0010 | 6786 | 64.5717 | 0.7347 | 0.3190 |
|
| 92 |
+
| 9.3196 | 39.0010 | 6960 | 61.9292 | 0.7433 | 0.3425 |
|
| 93 |
+
| 8.513 | 40.0010 | 7134 | 55.8721 | 0.7472 | 0.3493 |
|
| 94 |
+
| 8.2008 | 41.0010 | 7308 | 61.9761 | 0.7472 | 0.3483 |
|
| 95 |
+
| 7.7605 | 42.0010 | 7482 | 63.0649 | 0.7515 | 0.3643 |
|
| 96 |
+
| 7.1182 | 43.0010 | 7656 | 69.1525 | 0.7575 | 0.3849 |
|
| 97 |
+
| 6.6393 | 44.0010 | 7830 | 64.7897 | 0.7589 | 0.3897 |
|
| 98 |
+
| 6.0149 | 45.0010 | 8004 | 63.6413 | 0.7707 | 0.4001 |
|
| 99 |
+
| 5.6938 | 46.0010 | 8178 | 60.9982 | 0.7680 | 0.3939 |
|
| 100 |
+
| 5.5535 | 47.0009 | 8352 | 75.8102 | 0.7556 | 0.4027 |
|
| 101 |
+
| 4.8991 | 48.0009 | 8526 | 71.2782 | 0.7717 | 0.4160 |
|
| 102 |
+
| 4.54 | 49.0009 | 8700 | 72.8997 | 0.7779 | 0.4300 |
|
| 103 |
+
| 4.4054 | 50.0009 | 8874 | 71.3511 | 0.7873 | 0.4430 |
|
| 104 |
+
| 4.6015 | 51.0009 | 9048 | 72.7990 | 0.7808 | 0.4392 |
|
| 105 |
+
| 3.8771 | 52.0009 | 9222 | 70.6393 | 0.7844 | 0.4518 |
|
| 106 |
+
| 3.4468 | 53.0009 | 9396 | 72.0320 | 0.7799 | 0.4548 |
|
| 107 |
+
| 3.3139 | 54.0009 | 9570 | 71.4426 | 0.7928 | 0.4631 |
|
| 108 |
+
| 3.2179 | 55.0009 | 9744 | 71.2521 | 0.7860 | 0.4661 |
|
| 109 |
+
| 2.7241 | 56.0009 | 9918 | 60.8963 | 0.7946 | 0.4655 |
|
| 110 |
+
| 2.6323 | 57.0009 | 10092 | 68.7927 | 0.7984 | 0.4751 |
|
| 111 |
+
| 2.5876 | 58.0009 | 10266 | 67.8136 | 0.8020 | 0.4906 |
|
| 112 |
+
| 2.3709 | 59.0009 | 10440 | 69.9898 | 0.8020 | 0.4896 |
|
| 113 |
+
| 2.212 | 60.0008 | 10614 | 71.2956 | 0.7976 | 0.4960 |
|
| 114 |
+
| 2.064 | 61.0008 | 10788 | 79.7838 | 0.8062 | 0.5010 |
|
| 115 |
+
| 1.9108 | 62.0008 | 10962 | 79.9406 | 0.8108 | 0.5067 |
|
| 116 |
+
| 1.7361 | 63.0008 | 11136 | 71.6764 | 0.8008 | 0.5043 |
|
| 117 |
+
| 1.5907 | 64.0008 | 11310 | 77.2957 | 0.8116 | 0.5192 |
|
| 118 |
+
| 1.518 | 65.0008 | 11484 | 78.2582 | 0.8135 | 0.5088 |
|
| 119 |
+
| 1.1955 | 66.0008 | 11658 | 82.4455 | 0.8175 | 0.5191 |
|
| 120 |
+
| 1.1365 | 67.0008 | 11832 | 63.2824 | 0.8205 | 0.5155 |
|
| 121 |
+
| 0.9021 | 68.0008 | 12006 | 62.9742 | 0.8240 | 0.5295 |
|
| 122 |
+
| 0.8241 | 69.0008 | 12180 | 67.2848 | 0.8266 | 0.5331 |
|
| 123 |
+
| 0.7961 | 70.0008 | 12354 | 69.8350 | 0.8265 | 0.5385 |
|
| 124 |
+
| 0.7521 | 71.0008 | 12528 | 75.7815 | 0.8280 | 0.5426 |
|
| 125 |
+
| 0.7632 | 72.0008 | 12702 | 76.0700 | 0.8334 | 0.5407 |
|
| 126 |
+
| 0.6186 | 73.0007 | 12876 | 81.6507 | 0.8303 | 0.5533 |
|
| 127 |
+
| 0.5838 | 74.0007 | 13050 | 72.6086 | 0.8375 | 0.5655 |
|
| 128 |
+
| 0.519 | 75.0007 | 13224 | 70.2512 | 0.8400 | 0.5642 |
|
| 129 |
+
| 0.5484 | 76.0007 | 13398 | 70.1500 | 0.8398 | 0.5681 |
|
| 130 |
+
| 0.5408 | 77.0007 | 13572 | 71.5243 | 0.8507 | 0.5828 |
|
| 131 |
+
| 0.4949 | 78.0007 | 13746 | 74.5059 | 0.8468 | 0.5787 |
|
| 132 |
+
| 0.4515 | 79.0007 | 13920 | 60.0536 | 0.8467 | 0.5863 |
|
| 133 |
+
| 0.471 | 80.0007 | 14094 | 66.6615 | 0.8494 | 0.5851 |
|
| 134 |
+
| 0.4411 | 81.0007 | 14268 | 60.4127 | 0.8553 | 0.5960 |
|
| 135 |
+
| 0.4081 | 82.0007 | 14442 | 60.6191 | 0.8557 | 0.5959 |
|
| 136 |
+
| 0.3918 | 83.0007 | 14616 | 61.3452 | 0.8583 | 0.6014 |
|
| 137 |
+
| 0.3885 | 84.0007 | 14790 | 54.9566 | 0.8574 | 0.6082 |
|
| 138 |
+
| 0.3856 | 85.0007 | 14964 | 54.7612 | 0.8581 | 0.6127 |
|
| 139 |
+
| 0.3707 | 86.0007 | 15138 | 54.9271 | 0.8607 | 0.6057 |
|
| 140 |
+
| 0.3777 | 87.0006 | 15312 | 49.6427 | 0.8574 | 0.6100 |
|
| 141 |
+
| 0.3386 | 88.0006 | 15486 | 48.5498 | 0.8639 | 0.6099 |
|
| 142 |
+
| 0.3399 | 89.0006 | 15660 | 49.6015 | 0.8643 | 0.6198 |
|
| 143 |
+
| 0.3298 | 90.0006 | 15834 | 49.1596 | 0.8643 | 0.6253 |
|
| 144 |
+
| 0.3424 | 91.0006 | 16008 | 46.6932 | 0.8693 | 0.6374 |
|
| 145 |
+
| 0.3408 | 92.0006 | 16182 | 51.5574 | 0.8676 | 0.6356 |
|
| 146 |
+
| 0.3119 | 93.0006 | 16356 | 46.2073 | 0.8665 | 0.6329 |
|
| 147 |
+
| 0.3232 | 94.0006 | 16530 | 45.4627 | 0.8612 | 0.6294 |
|
| 148 |
+
| 0.2868 | 95.0006 | 16704 | 44.9902 | 0.8674 | 0.6387 |
|
| 149 |
+
| 0.2772 | 96.0006 | 16878 | 44.2864 | 0.8705 | 0.6507 |
|
| 150 |
+
| 0.297 | 97.0006 | 17052 | 42.0163 | 0.8709 | 0.6524 |
|
| 151 |
+
| 0.2587 | 98.0006 | 17226 | 45.7220 | 0.8736 | 0.6513 |
|
| 152 |
+
| 0.2668 | 99.0006 | 17400 | 41.3406 | 0.8726 | 0.6471 |
|
| 153 |
+
| 0.2739 | 100.0005 | 17574 | 42.2701 | 0.8707 | 0.6567 |
|
| 154 |
+
| 0.2786 | 101.0005 | 17748 | 38.4799 | 0.8744 | 0.6544 |
|
| 155 |
+
| 0.2498 | 102.0005 | 17922 | 40.8993 | 0.8786 | 0.6710 |
|
| 156 |
+
| 0.237 | 103.0005 | 18096 | 40.9156 | 0.8738 | 0.6560 |
|
| 157 |
+
| 0.2361 | 104.0005 | 18270 | 38.4156 | 0.8712 | 0.6630 |
|
| 158 |
+
| 0.2284 | 105.0005 | 18444 | 34.6554 | 0.8739 | 0.6637 |
|
| 159 |
+
| 0.2272 | 106.0005 | 18618 | 38.0209 | 0.8757 | 0.6656 |
|
| 160 |
+
| 0.2229 | 107.0005 | 18792 | 41.2726 | 0.8787 | 0.6688 |
|
| 161 |
+
| 0.2137 | 108.0005 | 18966 | 37.8564 | 0.8781 | 0.6719 |
|
| 162 |
+
| 0.2334 | 109.0005 | 19140 | 34.4903 | 0.8782 | 0.6738 |
|
| 163 |
+
| 0.2048 | 110.0005 | 19314 | 37.8222 | 0.8753 | 0.6691 |
|
| 164 |
+
| 0.2104 | 111.0005 | 19488 | 33.4882 | 0.8788 | 0.6684 |
|
| 165 |
+
| 0.2081 | 112.0005 | 19662 | 35.9320 | 0.8844 | 0.6790 |
|
| 166 |
+
| 0.2018 | 113.0005 | 19836 | 32.8766 | 0.8780 | 0.6791 |
|
| 167 |
+
| 0.1848 | 114.0004 | 20010 | 34.7820 | 0.8807 | 0.6813 |
|
| 168 |
+
| 0.2028 | 115.0004 | 20184 | 29.2460 | 0.8841 | 0.6869 |
|
| 169 |
+
| 0.2023 | 116.0004 | 20358 | 32.3090 | 0.8868 | 0.6957 |
|
| 170 |
+
| 0.1979 | 117.0004 | 20532 | 30.9706 | 0.8844 | 0.6886 |
|
| 171 |
+
| 0.1948 | 118.0004 | 20706 | 36.0117 | 0.8864 | 0.6917 |
|
| 172 |
+
| 0.1832 | 119.0004 | 20880 | 33.0474 | 0.8887 | 0.6901 |
|
| 173 |
+
| 0.1789 | 120.0004 | 21054 | 31.7762 | 0.8874 | 0.6991 |
|
| 174 |
+
| 0.1682 | 121.0004 | 21228 | 31.7379 | 0.8888 | 0.7016 |
|
| 175 |
+
| 0.1748 | 122.0004 | 21402 | 32.4533 | 0.8859 | 0.7047 |
|
| 176 |
+
| 0.1714 | 123.0004 | 21576 | 30.4150 | 0.8880 | 0.6984 |
|
| 177 |
+
| 0.1685 | 124.0004 | 21750 | 29.6585 | 0.8902 | 0.7034 |
|
| 178 |
+
| 0.1627 | 125.0004 | 21924 | 29.7405 | 0.8889 | 0.7043 |
|
| 179 |
+
| 0.1677 | 126.0004 | 22098 | 27.6232 | 0.8854 | 0.6982 |
|
| 180 |
+
| 0.163 | 127.0003 | 22272 | 31.4614 | 0.8869 | 0.6977 |
|
| 181 |
+
| 0.1574 | 128.0003 | 22446 | 29.6253 | 0.8894 | 0.7107 |
|
| 182 |
+
| 0.1514 | 129.0003 | 22620 | 31.2731 | 0.8906 | 0.7095 |
|
| 183 |
+
| 0.1516 | 130.0003 | 22794 | 30.3349 | 0.8877 | 0.7030 |
|
| 184 |
+
| 0.15 | 131.0003 | 22968 | 27.9678 | 0.8890 | 0.7032 |
|
| 185 |
+
| 0.1443 | 132.0003 | 23142 | 27.5203 | 0.8901 | 0.6993 |
|
| 186 |
+
| 0.1616 | 133.0003 | 23316 | 27.7037 | 0.8892 | 0.7107 |
|
| 187 |
+
| 0.1532 | 134.0003 | 23490 | 28.0554 | 0.8860 | 0.7014 |
|
| 188 |
+
| 0.1421 | 135.0003 | 23664 | 24.9238 | 0.8916 | 0.7101 |
|
| 189 |
+
| 0.1444 | 136.0003 | 23838 | 28.1723 | 0.8927 | 0.7156 |
|
| 190 |
+
| 0.141 | 137.0003 | 24012 | 27.4672 | 0.8927 | 0.7170 |
|
| 191 |
+
| 0.1439 | 138.0003 | 24186 | 26.4160 | 0.8924 | 0.7169 |
|
| 192 |
+
| 0.1392 | 139.0003 | 24360 | 25.9160 | 0.8929 | 0.7216 |
|
| 193 |
+
| 0.1439 | 140.0003 | 24534 | 26.6943 | 0.8916 | 0.7145 |
|
| 194 |
+
| 0.1321 | 141.0002 | 24708 | 25.2970 | 0.8877 | 0.7147 |
|
| 195 |
+
| 0.1266 | 142.0002 | 24882 | 25.3811 | 0.8881 | 0.7163 |
|
| 196 |
+
| 0.1274 | 143.0002 | 25056 | 23.8011 | 0.8873 | 0.7110 |
|
| 197 |
+
| 0.1297 | 144.0002 | 25230 | 23.4062 | 0.8943 | 0.7237 |
|
| 198 |
+
| 0.1278 | 145.0002 | 25404 | 20.8234 | 0.8930 | 0.7238 |
|
| 199 |
+
| 0.1426 | 146.0002 | 25578 | 26.0120 | 0.8952 | 0.7181 |
|
| 200 |
+
| 0.1235 | 147.0002 | 25752 | 25.0103 | 0.8939 | 0.7218 |
|
| 201 |
+
| 0.1339 | 148.0002 | 25926 | 22.6269 | 0.8939 | 0.7228 |
|
| 202 |
+
| 0.1198 | 149.0002 | 26100 | 24.3290 | 0.8939 | 0.7258 |
|
| 203 |
+
| 0.1149 | 150.0002 | 26274 | 25.0183 | 0.8951 | 0.7276 |
|
| 204 |
+
| 0.1117 | 151.0002 | 26448 | 23.8628 | 0.8897 | 0.7152 |
|
| 205 |
+
| 0.1119 | 152.0002 | 26622 | 24.9298 | 0.8954 | 0.7344 |
|
| 206 |
+
| 0.1173 | 153.0002 | 26796 | 26.3068 | 0.8864 | 0.7184 |
|
| 207 |
+
| 0.1142 | 154.0001 | 26970 | 24.2188 | 0.8934 | 0.7261 |
|
| 208 |
+
| 0.1269 | 155.0001 | 27144 | 27.1026 | 0.8919 | 0.7194 |
|
| 209 |
+
| 0.1135 | 156.0001 | 27318 | 24.8055 | 0.8972 | 0.7329 |
|
| 210 |
+
| 0.1105 | 157.0001 | 27492 | 23.5371 | 0.8939 | 0.7286 |
|
| 211 |
+
| 0.1065 | 158.0001 | 27666 | 23.3602 | 0.8936 | 0.7291 |
|
| 212 |
+
| 0.1049 | 159.0001 | 27840 | 26.7597 | 0.8936 | 0.7287 |
|
| 213 |
+
| 0.1102 | 160.0001 | 28014 | 28.6464 | 0.8910 | 0.7223 |
|
| 214 |
+
| 0.1085 | 161.0001 | 28188 | 28.2966 | 0.8941 | 0.7266 |
|
| 215 |
+
| 0.1046 | 162.0001 | 28362 | 26.8082 | 0.8948 | 0.7277 |
|
| 216 |
+
| 0.1065 | 163.0001 | 28536 | 25.1373 | 0.8987 | 0.7333 |
|
| 217 |
+
| 0.0964 | 164.0001 | 28710 | 25.3613 | 0.8933 | 0.7273 |
|
| 218 |
+
| 0.0985 | 165.0001 | 28884 | 24.9014 | 0.8989 | 0.7333 |
|
| 219 |
+
| 0.0978 | 166.0001 | 29058 | 24.3461 | 0.8944 | 0.7303 |
|
| 220 |
+
| 0.0957 | 167.0001 | 29232 | 25.6565 | 0.8944 | 0.7276 |
|
| 221 |
+
| 0.0909 | 168.0000 | 29406 | 23.1801 | 0.8970 | 0.7297 |
|
| 222 |
+
| 0.1013 | 169.0000 | 29580 | 24.0874 | 0.8954 | 0.7309 |
|
| 223 |
+
| 0.1032 | 170.0000 | 29754 | 25.6413 | 0.8930 | 0.7317 |
|
| 224 |
+
| 0.0992 | 171.0000 | 29928 | 23.7169 | 0.9003 | 0.7318 |
|
| 225 |
+
| 0.0938 | 172.0000 | 30102 | 24.3248 | 0.8994 | 0.7356 |
|
| 226 |
+
| 0.0864 | 173.0000 | 30276 | 27.2556 | 0.8945 | 0.7341 |
|
| 227 |
+
| 0.0934 | 173.0013 | 30450 | 24.3835 | 0.8988 | 0.7402 |
|
| 228 |
+
| 0.1006 | 174.0013 | 30624 | 26.0400 | 0.8968 | 0.7338 |
|
| 229 |
+
| 0.0905 | 175.0013 | 30798 | 23.9647 | 0.8998 | 0.7379 |
|
| 230 |
+
| 0.0941 | 176.0013 | 30972 | 23.8668 | 0.8993 | 0.7394 |
|
| 231 |
+
| 0.0906 | 177.0013 | 31146 | 26.0899 | 0.8982 | 0.7410 |
|
| 232 |
+
| 0.0885 | 178.0013 | 31320 | 25.4130 | 0.8984 | 0.7428 |
|
| 233 |
+
| 0.0939 | 179.0013 | 31494 | 23.5447 | 0.8966 | 0.7394 |
|
| 234 |
+
| 0.089 | 180.0012 | 31668 | 22.5587 | 0.8946 | 0.7416 |
|
| 235 |
+
| 0.0863 | 181.0012 | 31842 | 26.1199 | 0.8943 | 0.7331 |
|
| 236 |
+
| 0.0837 | 182.0012 | 32016 | 25.6055 | 0.8927 | 0.7346 |
|
| 237 |
+
| 0.0822 | 183.0012 | 32190 | 26.6287 | 0.8974 | 0.7403 |
|
| 238 |
+
| 0.0816 | 184.0012 | 32364 | 24.9176 | 0.8951 | 0.7331 |
|
| 239 |
+
| 0.0862 | 185.0012 | 32538 | 27.8039 | 0.8959 | 0.7405 |
|
| 240 |
+
| 0.0877 | 186.0012 | 32712 | 29.2693 | 0.8945 | 0.7322 |
|
| 241 |
+
| 0.0817 | 187.0012 | 32886 | 24.2025 | 0.9012 | 0.7442 |
|
| 242 |
+
| 0.0871 | 188.0012 | 33060 | 27.0045 | 0.9015 | 0.7462 |
|
| 243 |
+
| 0.0752 | 189.0012 | 33234 | 25.4044 | 0.8951 | 0.7413 |
|
| 244 |
+
| 0.0835 | 190.0012 | 33408 | 24.6893 | 0.8967 | 0.7451 |
|
| 245 |
+
| 0.0749 | 191.0012 | 33582 | 23.4499 | 0.8967 | 0.7409 |
|
| 246 |
+
| 0.0807 | 192.0012 | 33756 | 23.6384 | 0.8948 | 0.7365 |
|
| 247 |
+
| 0.0824 | 193.0012 | 33930 | 25.4436 | 0.8968 | 0.7448 |
|
| 248 |
+
| 0.0794 | 194.0011 | 34104 | 26.6859 | 0.9010 | 0.7490 |
|
| 249 |
+
| 0.0786 | 195.0011 | 34278 | 31.1406 | 0.9000 | 0.7455 |
|
| 250 |
+
| 0.0767 | 196.0011 | 34452 | 28.7380 | 0.8993 | 0.7470 |
|
| 251 |
+
| 0.0775 | 197.0011 | 34626 | 26.2337 | 0.8965 | 0.7430 |
|
| 252 |
+
| 0.0754 | 198.0011 | 34800 | 26.0209 | 0.8983 | 0.7434 |
|
| 253 |
+
| 0.0816 | 199.0011 | 34974 | 25.3235 | 0.8970 | 0.7444 |
|
| 254 |
+
| 0.0871 | 200.0011 | 35148 | 25.7532 | 0.9007 | 0.7411 |
|
| 255 |
+
| 0.075 | 201.0011 | 35322 | 26.9895 | 0.8998 | 0.7477 |
|
| 256 |
+
| 0.0782 | 202.0011 | 35496 | 27.1862 | 0.9019 | 0.7487 |
|
| 257 |
+
| 0.0698 | 203.0011 | 35670 | 29.9924 | 0.9021 | 0.7463 |
|
| 258 |
+
| 0.0805 | 204.0011 | 35844 | 24.1934 | 0.9012 | 0.7499 |
|
| 259 |
+
| 0.0734 | 205.0011 | 36018 | 26.2977 | 0.8990 | 0.7451 |
|
| 260 |
+
| 0.0808 | 206.0011 | 36192 | 25.4156 | 0.8944 | 0.7364 |
|
| 261 |
+
| 0.0738 | 207.0010 | 36366 | 28.7278 | 0.8984 | 0.7452 |
|
| 262 |
+
| 0.0748 | 208.0010 | 36540 | 26.4802 | 0.9062 | 0.7512 |
|
| 263 |
+
| 0.0665 | 209.0010 | 36714 | 25.9425 | 0.9001 | 0.7497 |
|
| 264 |
+
| 0.0794 | 210.0010 | 36888 | 27.9106 | 0.8947 | 0.7424 |
|
| 265 |
+
| 0.0738 | 211.0010 | 37062 | 30.8550 | 0.9026 | 0.7551 |
|
| 266 |
+
| 0.0775 | 212.0010 | 37236 | 25.0435 | 0.8985 | 0.7517 |
|
| 267 |
+
| 0.081 | 213.0010 | 37410 | 26.0296 | 0.9005 | 0.7556 |
|
| 268 |
+
| 0.0711 | 214.0010 | 37584 | 29.7379 | 0.8937 | 0.7400 |
|
| 269 |
+
| 0.0698 | 215.0010 | 37758 | 27.7228 | 0.9012 | 0.7544 |
|
| 270 |
+
| 0.0692 | 216.0010 | 37932 | 29.9567 | 0.9023 | 0.7552 |
|
| 271 |
+
| 0.0738 | 217.0010 | 38106 | 28.4722 | 0.9019 | 0.7551 |
|
| 272 |
+
| 0.0667 | 218.0010 | 38280 | 27.1035 | 0.9021 | 0.7548 |
|
| 273 |
+
| 0.0633 | 219.0010 | 38454 | 29.1530 | 0.9024 | 0.7550 |
|
| 274 |
+
| 0.0663 | 220.0010 | 38628 | 28.9497 | 0.9042 | 0.7611 |
|
| 275 |
+
| 0.0675 | 221.0009 | 38802 | 29.4822 | 0.9002 | 0.7555 |
|
| 276 |
+
| 0.0644 | 222.0009 | 38976 | 26.4538 | 0.9020 | 0.7548 |
|
| 277 |
+
| 0.0676 | 223.0009 | 39150 | 28.7964 | 0.9015 | 0.7606 |
|
| 278 |
+
| 0.0802 | 224.0009 | 39324 | 28.6932 | 0.9027 | 0.7572 |
|
| 279 |
+
| 0.0679 | 225.0009 | 39498 | 29.7335 | 0.9053 | 0.7570 |
|
| 280 |
+
| 0.0694 | 226.0009 | 39672 | 27.3412 | 0.8988 | 0.7535 |
|
| 281 |
+
| 0.0721 | 227.0009 | 39846 | 27.4954 | 0.9014 | 0.7621 |
|
| 282 |
+
| 0.0616 | 228.0009 | 40020 | 24.3438 | 0.9030 | 0.7642 |
|
| 283 |
+
| 0.0593 | 229.0009 | 40194 | 28.3107 | 0.8994 | 0.7526 |
|
| 284 |
+
| 0.0602 | 230.0009 | 40368 | 30.5454 | 0.9029 | 0.7575 |
|
| 285 |
+
| 0.0602 | 231.0009 | 40542 | 24.6443 | 0.9011 | 0.7599 |
|
| 286 |
+
| 0.056 | 232.0009 | 40716 | 27.7230 | 0.9002 | 0.7536 |
|
| 287 |
+
| 0.0644 | 233.0009 | 40890 | 24.9996 | 0.9034 | 0.7586 |
|
| 288 |
+
| 0.0657 | 234.0008 | 41064 | 27.3175 | 0.8993 | 0.7576 |
|
| 289 |
+
| 0.0621 | 235.0008 | 41238 | 30.7901 | 0.8997 | 0.7575 |
|
| 290 |
+
| 0.0605 | 236.0008 | 41412 | 28.9089 | 0.9016 | 0.7571 |
|
| 291 |
+
| 0.0567 | 237.0008 | 41586 | 27.2947 | 0.8991 | 0.7496 |
|
| 292 |
+
| 0.06 | 238.0008 | 41760 | 27.8907 | 0.9048 | 0.7620 |
|
| 293 |
+
| 0.0578 | 239.0008 | 41934 | 28.3218 | 0.9028 | 0.7527 |
|
| 294 |
+
| 0.0539 | 240.0008 | 42108 | 29.9793 | 0.8952 | 0.7507 |
|
| 295 |
+
| 0.0594 | 241.0008 | 42282 | 28.6463 | 0.9012 | 0.7547 |
|
| 296 |
+
| 0.063 | 242.0008 | 42456 | 26.1269 | 0.9002 | 0.7587 |
|
| 297 |
+
| 0.0598 | 243.0008 | 42630 | 34.1019 | 0.8996 | 0.7576 |
|
| 298 |
+
| 0.0606 | 244.0008 | 42804 | 32.7949 | 0.9011 | 0.7568 |
|
| 299 |
+
| 0.0582 | 245.0008 | 42978 | 31.6745 | 0.8992 | 0.7532 |
|
| 300 |
+
| 0.0519 | 246.0008 | 43152 | 31.9092 | 0.8997 | 0.7553 |
|
| 301 |
+
| 0.0584 | 247.0007 | 43326 | 34.1754 | 0.9030 | 0.7563 |
|
| 302 |
+
| 0.0676 | 248.0007 | 43500 | 27.5905 | 0.9002 | 0.7576 |
|
| 303 |
|
| 304 |
|
| 305 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 124847652
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a64ce717848c94805186511112e75486c78a520990bd72701548a7c9a271ff5b
|
| 3 |
size 124847652
|
runs/0-by=2006-psr=0.25/events.out.tfevents.1747032164.ana2.143225.1
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:118a688f266a49328a727225c81c4c5d0f8b91d4e2d21d6880cbecfcf3ea4839
|
| 3 |
+
size 470
|