--- library_name: transformers tags: - generated_from_trainer metrics: - accuracy model-index: - name: ht-stmini-cls-v6_ftis_noPretrain-msm-pos results: [] --- # ht-stmini-cls-v6_ftis_noPretrain-msm-pos This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset. It achieves the following results on the evaluation set: - Loss: 0.9005 - Accuracy: 0.9455 - Macro F1: 0.8570 ## Model description More information needed ## Intended uses & limitations More information needed ## Training and evaluation data More information needed ## Training procedure ### Training hyperparameters The following hyperparameters were used during training: - learning_rate: 0.0001 - train_batch_size: 8 - eval_batch_size: 4 - seed: 42 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments - lr_scheduler_type: linear - lr_scheduler_warmup_steps: 6733 - training_steps: 134675 ### Training results | Training Loss | Epoch | Step | Validation Loss | Accuracy | Macro F1 | |:-------------:|:--------:|:-----:|:---------------:|:--------:|:--------:| | 39.0542 | 0.0015 | 202 | 52.4550 | 0.0872 | 0.0428 | | 9.995 | 1.0015 | 404 | 111.6770 | 0.4138 | 0.1069 | | 7.019 | 2.0015 | 606 | 151.1546 | 0.5324 | 0.1287 | | 5.9896 | 3.0015 | 808 | 161.7299 | 0.5608 | 0.1355 | | 5.2331 | 4.0015 | 1010 | 127.6348 | 0.5911 | 0.1425 | | 4.4417 | 5.0015 | 1212 | 85.7775 | 0.6060 | 0.1540 | | 3.7238 | 6.0015 | 1414 | 56.9365 | 0.5991 | 0.1518 | | 3.3202 | 7.0015 | 1616 | 38.2729 | 0.6192 | 0.1640 | | 3.0289 | 8.0015 | 1818 | 29.3182 | 0.6116 | 0.1670 | | 2.851 | 9.0015 | 2020 | 22.7821 | 0.6146 | 0.1740 | | 2.6193 | 10.0015 | 2222 | 20.0463 | 0.6157 | 0.1804 | | 2.5279 | 11.0015 | 2424 | 12.6429 | 0.6331 | 0.1994 | | 2.5246 | 12.0015 | 2626 | 11.5103 | 0.6461 | 0.2189 | | 2.387 | 13.0015 | 2828 | 9.9416 | 0.6469 | 0.2234 | | 2.2723 | 14.0015 | 3030 | 9.5941 | 0.6758 | 0.2647 | | 2.1285 | 15.0015 | 3232 | 9.1184 | 0.7111 | 0.2847 | | 1.9754 | 16.0015 | 3434 | 7.8173 | 0.7227 | 0.3092 | | 1.9499 | 17.0015 | 3636 | 6.2093 | 0.7161 | 0.3239 | | 1.9703 | 18.0015 | 3838 | 6.8571 | 0.7220 | 0.3407 | | 1.8392 | 19.0015 | 4040 | 5.9043 | 0.7307 | 0.3679 | | 1.6563 | 20.0015 | 4242 | 6.6418 | 0.7494 | 0.3731 | | 1.7206 | 21.0015 | 4444 | 5.9527 | 0.7415 | 0.3983 | | 1.4991 | 22.0015 | 4646 | 7.9843 | 0.7538 | 0.4166 | | 1.4357 | 23.0015 | 4848 | 6.3545 | 0.7619 | 0.4368 | | 1.3359 | 24.0015 | 5050 | 6.9233 | 0.7764 | 0.4465 | | 1.4795 | 25.0015 | 5252 | 6.6354 | 0.7810 | 0.4767 | | 1.3976 | 26.0015 | 5454 | 7.4885 | 0.7918 | 0.4841 | | 1.2662 | 27.0015 | 5656 | 7.2623 | 0.7961 | 0.4800 | | 1.1697 | 28.0015 | 5858 | 6.7678 | 0.7819 | 0.4808 | | 1.1994 | 29.0015 | 6060 | 8.9266 | 0.8038 | 0.5040 | | 1.1654 | 30.0015 | 6262 | 8.9018 | 0.7965 | 0.4944 | | 1.0817 | 31.0015 | 6464 | 8.8383 | 0.8060 | 0.5435 | | 1.0778 | 32.0015 | 6666 | 8.7811 | 0.8029 | 0.5208 | | 0.985 | 33.0015 | 6868 | 9.7044 | 0.8169 | 0.5330 | | 0.9646 | 34.0015 | 7070 | 10.2150 | 0.8206 | 0.5506 | | 0.9595 | 35.0015 | 7272 | 11.9668 | 0.8233 | 0.5671 | | 0.8569 | 36.0015 | 7474 | 11.1166 | 0.8329 | 0.5819 | | 0.8767 | 37.0015 | 7676 | 13.2788 | 0.8401 | 0.6010 | | 0.8094 | 38.0015 | 7878 | 12.9696 | 0.8384 | 0.6067 | | 0.7561 | 39.0015 | 8080 | 12.2524 | 0.8382 | 0.5987 | | 0.6879 | 40.0015 | 8282 | 14.3620 | 0.8478 | 0.6204 | | 0.7386 | 41.0015 | 8484 | 14.7484 | 0.8626 | 0.6337 | | 0.6411 | 42.0015 | 8686 | 13.5941 | 0.8660 | 0.6437 | | 0.6389 | 43.0015 | 8888 | 15.4027 | 0.8645 | 0.6406 | | 0.6128 | 44.0015 | 9090 | 19.0751 | 0.8758 | 0.6587 | | 0.496 | 45.0015 | 9292 | 18.6661 | 0.8661 | 0.6372 | | 0.5358 | 46.0015 | 9494 | 14.1890 | 0.8757 | 0.6533 | | 0.4955 | 47.0015 | 9696 | 17.5556 | 0.8721 | 0.6659 | | 0.4513 | 48.0015 | 9898 | 16.8865 | 0.8847 | 0.6857 | | 0.4363 | 49.0015 | 10100 | 15.3593 | 0.8801 | 0.6744 | | 0.3792 | 50.0015 | 10302 | 14.4326 | 0.8903 | 0.6988 | | 0.3841 | 51.0015 | 10504 | 15.5706 | 0.8961 | 0.7055 | | 0.3951 | 52.0015 | 10706 | 16.9263 | 0.8958 | 0.7031 | | 0.3896 | 53.0015 | 10908 | 16.1398 | 0.8921 | 0.7052 | | 0.3263 | 54.0015 | 11110 | 14.3136 | 0.8982 | 0.7045 | | 0.3028 | 55.0015 | 11312 | 13.7927 | 0.9024 | 0.7175 | | 0.2973 | 56.0015 | 11514 | 15.8300 | 0.9022 | 0.7204 | | 0.2723 | 57.0015 | 11716 | 12.2600 | 0.9057 | 0.7185 | | 0.274 | 58.0015 | 11918 | 11.5149 | 0.8993 | 0.7156 | | 0.2485 | 59.0015 | 12120 | 11.6746 | 0.9085 | 0.7371 | | 0.2617 | 60.0015 | 12322 | 10.2702 | 0.9102 | 0.7333 | | 0.2402 | 61.0015 | 12524 | 9.3862 | 0.9120 | 0.7369 | | 0.2289 | 62.0015 | 12726 | 9.2598 | 0.9106 | 0.7368 | | 0.2068 | 63.0015 | 12928 | 7.8129 | 0.9153 | 0.7521 | | 0.2011 | 64.0015 | 13130 | 6.5932 | 0.9102 | 0.7382 | | 0.1943 | 65.0015 | 13332 | 7.2363 | 0.9133 | 0.7565 | | 0.1746 | 66.0015 | 13534 | 6.6193 | 0.9149 | 0.7529 | | 0.1848 | 67.0015 | 13736 | 6.6123 | 0.9175 | 0.7517 | | 0.1704 | 68.0015 | 13938 | 6.5004 | 0.9159 | 0.7531 | | 0.1565 | 69.0015 | 14140 | 5.0556 | 0.9150 | 0.7578 | | 0.1533 | 70.0015 | 14342 | 5.4416 | 0.9179 | 0.7662 | | 0.1379 | 71.0015 | 14544 | 4.6997 | 0.9216 | 0.7713 | | 0.1354 | 72.0015 | 14746 | 4.3266 | 0.9193 | 0.7666 | | 0.1441 | 73.0015 | 14948 | 3.9965 | 0.9252 | 0.7782 | | 0.1302 | 74.0015 | 15150 | 3.1910 | 0.9221 | 0.7709 | | 0.132 | 75.0015 | 15352 | 3.2843 | 0.9252 | 0.7797 | | 0.1269 | 76.0015 | 15554 | 3.1127 | 0.9253 | 0.7860 | | 0.117 | 77.0015 | 15756 | 2.9740 | 0.9306 | 0.7843 | | 0.1135 | 78.0015 | 15958 | 3.0400 | 0.9273 | 0.7851 | | 0.1095 | 79.0015 | 16160 | 2.5968 | 0.9261 | 0.7831 | | 0.1012 | 80.0015 | 16362 | 2.8566 | 0.9262 | 0.7869 | | 0.0958 | 81.0015 | 16564 | 2.4564 | 0.9285 | 0.7847 | | 0.0963 | 82.0015 | 16766 | 2.4953 | 0.9333 | 0.8010 | | 0.1026 | 83.0015 | 16968 | 2.0616 | 0.9292 | 0.7923 | | 0.0853 | 84.0015 | 17170 | 1.8190 | 0.9269 | 0.7881 | | 0.0884 | 85.0015 | 17372 | 1.7115 | 0.9261 | 0.7899 | | 0.0911 | 86.0015 | 17574 | 1.8535 | 0.9275 | 0.7951 | | 0.0853 | 87.0015 | 17776 | 1.5836 | 0.9269 | 0.7920 | | 0.0915 | 88.0015 | 17978 | 1.5385 | 0.9321 | 0.8054 | | 0.0762 | 89.0015 | 18180 | 1.5013 | 0.9351 | 0.8112 | | 0.0876 | 90.0015 | 18382 | 1.6770 | 0.9298 | 0.7870 | | 0.0725 | 91.0015 | 18584 | 1.5174 | 0.9366 | 0.8054 | | 0.079 | 92.0015 | 18786 | 1.5734 | 0.9339 | 0.8067 | | 0.0658 | 93.0015 | 18988 | 1.4919 | 0.9314 | 0.8057 | | 0.0661 | 94.0015 | 19190 | 1.3583 | 0.9343 | 0.7936 | | 0.0717 | 95.0015 | 19392 | 1.3954 | 0.9374 | 0.8100 | | 0.0617 | 96.0015 | 19594 | 1.4305 | 0.9348 | 0.8151 | | 0.0665 | 97.0015 | 19796 | 1.3416 | 0.9334 | 0.7926 | | 0.0767 | 98.0015 | 19998 | 1.2812 | 0.9351 | 0.8144 | | 0.0664 | 99.0015 | 20200 | 1.4248 | 0.9345 | 0.8081 | | 0.055 | 100.0015 | 20402 | 1.3525 | 0.9331 | 0.8138 | | 0.0586 | 101.0015 | 20604 | 1.2343 | 0.9327 | 0.8121 | | 0.0647 | 102.0015 | 20806 | 1.1799 | 0.9326 | 0.8143 | | 0.0575 | 103.0015 | 21008 | 1.2465 | 0.9338 | 0.8180 | | 0.0564 | 104.0015 | 21210 | 1.2148 | 0.9355 | 0.8141 | | 0.0571 | 105.0015 | 21412 | 1.3326 | 0.9334 | 0.8156 | | 0.0522 | 106.0015 | 21614 | 1.2063 | 0.9362 | 0.8230 | | 0.051 | 107.0015 | 21816 | 1.1642 | 0.9383 | 0.8248 | | 0.07 | 108.0015 | 22018 | 1.1145 | 0.9385 | 0.8255 | | 0.0538 | 109.0015 | 22220 | 1.1939 | 0.9358 | 0.8187 | | 0.05 | 110.0015 | 22422 | 1.1498 | 0.9390 | 0.8298 | | 0.0521 | 111.0015 | 22624 | 1.0604 | 0.9338 | 0.8170 | | 0.0542 | 112.0015 | 22826 | 1.0632 | 0.9378 | 0.8220 | | 0.0522 | 113.0015 | 23028 | 1.1671 | 0.9374 | 0.8276 | | 0.0491 | 114.0015 | 23230 | 1.1449 | 0.9408 | 0.8300 | | 0.0478 | 115.0015 | 23432 | 1.0877 | 0.9405 | 0.8300 | | 0.0496 | 116.0015 | 23634 | 1.1114 | 0.9407 | 0.8278 | | 0.047 | 117.0015 | 23836 | 1.0889 | 0.9401 | 0.8250 | | 0.0464 | 118.0015 | 24038 | 1.0318 | 0.9421 | 0.8297 | | 0.0425 | 119.0015 | 24240 | 0.9645 | 0.9362 | 0.8262 | | 0.0445 | 120.0015 | 24442 | 1.0574 | 0.9369 | 0.8271 | | 0.0423 | 121.0015 | 24644 | 0.9761 | 0.9417 | 0.8372 | | 0.0444 | 122.0015 | 24846 | 1.0535 | 0.9376 | 0.8304 | | 0.0463 | 123.0015 | 25048 | 0.9546 | 0.9398 | 0.8298 | | 0.042 | 124.0015 | 25250 | 0.9689 | 0.9378 | 0.8336 | | 0.0423 | 125.0015 | 25452 | 0.9090 | 0.9419 | 0.8335 | | 0.0431 | 126.0015 | 25654 | 1.0322 | 0.9394 | 0.8357 | | 0.0371 | 127.0015 | 25856 | 1.0071 | 0.9425 | 0.8339 | | 0.0441 | 128.0015 | 26058 | 0.9907 | 0.9415 | 0.8351 | | 0.0408 | 129.0015 | 26260 | 1.0066 | 0.9423 | 0.8170 | | 0.0413 | 130.0015 | 26462 | 1.0439 | 0.9390 | 0.8256 | | 0.0339 | 131.0015 | 26664 | 0.9631 | 0.9428 | 0.8358 | | 0.035 | 132.0015 | 26866 | 1.0452 | 0.9365 | 0.8312 | | 0.0349 | 133.0015 | 27068 | 1.0405 | 0.9362 | 0.8286 | | 0.0395 | 134.0015 | 27270 | 1.0398 | 0.9387 | 0.8303 | | 0.0352 | 135.0015 | 27472 | 0.9459 | 0.9420 | 0.8340 | | 0.0376 | 136.0015 | 27674 | 0.9511 | 0.9412 | 0.8374 | | 0.0362 | 137.0015 | 27876 | 1.0098 | 0.9388 | 0.8416 | | 0.0335 | 138.0015 | 28078 | 1.0599 | 0.9422 | 0.8380 | | 0.0309 | 139.0015 | 28280 | 1.1331 | 0.9377 | 0.8310 | | 0.0354 | 140.0015 | 28482 | 1.1502 | 0.9369 | 0.8304 | | 0.0357 | 141.0015 | 28684 | 0.9515 | 0.9386 | 0.8294 | | 0.0313 | 142.0015 | 28886 | 0.9854 | 0.9403 | 0.8400 | | 0.0329 | 143.0015 | 29088 | 1.0700 | 0.9362 | 0.8351 | | 0.0333 | 144.0015 | 29290 | 0.9787 | 0.9429 | 0.8437 | | 0.0329 | 145.0015 | 29492 | 1.0145 | 0.9323 | 0.8320 | | 0.0318 | 146.0015 | 29694 | 0.9619 | 0.9402 | 0.8358 | | 0.0322 | 147.0015 | 29896 | 1.1008 | 0.9348 | 0.8200 | | 0.0352 | 148.0015 | 30098 | 0.9330 | 0.9411 | 0.8401 | | 0.0309 | 149.0015 | 30300 | 0.9829 | 0.9421 | 0.8226 | | 0.0317 | 150.0015 | 30502 | 0.9698 | 0.9424 | 0.8221 | | 0.028 | 151.0015 | 30704 | 0.9358 | 0.9452 | 0.8475 | | 0.0328 | 152.0015 | 30906 | 1.0448 | 0.9394 | 0.8177 | | 0.0318 | 153.0015 | 31108 | 1.0614 | 0.9379 | 0.8396 | | 0.0297 | 154.0015 | 31310 | 0.9583 | 0.9421 | 0.8429 | | 0.0284 | 155.0015 | 31512 | 0.9899 | 0.9409 | 0.8418 | | 0.0287 | 156.0015 | 31714 | 0.9172 | 0.9422 | 0.8430 | | 0.0292 | 157.0015 | 31916 | 0.9322 | 0.9426 | 0.8473 | | 0.0263 | 158.0015 | 32118 | 1.0263 | 0.9406 | 0.8202 | | 0.0297 | 159.0015 | 32320 | 0.9233 | 0.9445 | 0.8252 | | 0.0296 | 160.0015 | 32522 | 0.9406 | 0.9424 | 0.8459 | | 0.0285 | 161.0015 | 32724 | 0.8934 | 0.9420 | 0.8246 | | 0.0277 | 162.0015 | 32926 | 0.8199 | 0.9428 | 0.8454 | | 0.0278 | 163.0015 | 33128 | 0.9287 | 0.9452 | 0.8332 | | 0.0276 | 164.0015 | 33330 | 1.0221 | 0.9402 | 0.8431 | | 0.0266 | 165.0015 | 33532 | 1.0262 | 0.9404 | 0.8476 | | 0.0256 | 166.0015 | 33734 | 0.9846 | 0.9396 | 0.8426 | | 0.0255 | 167.0015 | 33936 | 0.9476 | 0.9434 | 0.8509 | | 0.0264 | 168.0015 | 34138 | 0.8982 | 0.9436 | 0.8305 | | 0.0249 | 169.0015 | 34340 | 0.9160 | 0.9420 | 0.8455 | | 0.0252 | 170.0015 | 34542 | 0.9905 | 0.9422 | 0.8283 | | 0.0285 | 171.0015 | 34744 | 0.9653 | 0.9415 | 0.8406 | | 0.0286 | 172.0015 | 34946 | 0.9691 | 0.9415 | 0.8445 | | 0.0333 | 173.0015 | 35148 | 0.9424 | 0.9405 | 0.8421 | | 0.0252 | 174.0015 | 35350 | 0.9120 | 0.9434 | 0.8485 | | 0.0245 | 175.0015 | 35552 | 1.0208 | 0.9429 | 0.8278 | | 0.0214 | 176.0015 | 35754 | 0.9905 | 0.9425 | 0.8302 | | 0.0248 | 177.0015 | 35956 | 0.9479 | 0.9439 | 0.8288 | | 0.0242 | 178.0015 | 36158 | 0.9858 | 0.9443 | 0.8525 | | 0.0236 | 179.0015 | 36360 | 1.0952 | 0.9416 | 0.8255 | | 0.0206 | 180.0015 | 36562 | 1.1354 | 0.9418 | 0.8276 | | 0.0223 | 181.0015 | 36764 | 0.9461 | 0.9427 | 0.8295 | | 0.0226 | 182.0015 | 36966 | 0.9072 | 0.9445 | 0.8334 | | 0.0249 | 183.0015 | 37168 | 1.1476 | 0.9386 | 0.8449 | | 0.0263 | 184.0015 | 37370 | 0.8881 | 0.9421 | 0.8321 | | 0.0248 | 185.0015 | 37572 | 0.9298 | 0.9422 | 0.8537 | | 0.0216 | 186.0015 | 37774 | 0.9200 | 0.9425 | 0.8352 | | 0.0217 | 187.0015 | 37976 | 0.9245 | 0.9455 | 0.8526 | | 0.0214 | 188.0015 | 38178 | 1.0350 | 0.9405 | 0.8252 | | 0.022 | 189.0015 | 38380 | 0.8831 | 0.9455 | 0.8570 | | 0.0206 | 190.0015 | 38582 | 0.8855 | 0.9448 | 0.8361 | | 0.021 | 191.0015 | 38784 | 0.9974 | 0.9444 | 0.8513 | | 0.0226 | 192.0015 | 38986 | 0.9566 | 0.9420 | 0.8506 | | 0.0199 | 193.0015 | 39188 | 0.8891 | 0.9454 | 0.8328 | | 0.0237 | 194.0015 | 39390 | 0.9330 | 0.9431 | 0.8525 | | 0.0206 | 195.0015 | 39592 | 0.8964 | 0.9441 | 0.8327 | | 0.0209 | 196.0015 | 39794 | 0.9579 | 0.9450 | 0.8326 | | 0.0199 | 197.0015 | 39996 | 0.9376 | 0.9447 | 0.8342 | | 0.0218 | 198.0015 | 40198 | 0.8677 | 0.9454 | 0.8358 | | 0.0217 | 199.0015 | 40400 | 1.0234 | 0.9375 | 0.8222 | | 0.0229 | 200.0015 | 40602 | 0.9920 | 0.9379 | 0.8282 | | 0.0195 | 201.0015 | 40804 | 1.0083 | 0.9457 | 0.8355 | | 0.0215 | 202.0015 | 41006 | 0.9446 | 0.9464 | 0.8368 | | 0.02 | 203.0015 | 41208 | 0.9566 | 0.9449 | 0.8359 | | 0.0194 | 204.0015 | 41410 | 0.8834 | 0.9438 | 0.8543 | | 0.0185 | 205.0015 | 41612 | 0.9536 | 0.9454 | 0.8362 | | 0.0216 | 206.0015 | 41814 | 0.8942 | 0.9458 | 0.8558 | | 0.0178 | 207.0015 | 42016 | 1.0467 | 0.9412 | 0.8317 | | 0.0176 | 208.0015 | 42218 | 0.9006 | 0.9468 | 0.8365 | | 0.0223 | 209.0015 | 42420 | 0.8957 | 0.9429 | 0.8341 | ### Framework versions - Transformers 4.46.0 - Pytorch 2.3.1+cu121 - Datasets 2.20.0 - Tokenizers 0.20.1