--- library_name: transformers tags: - generated_from_trainer metrics: - accuracy model-index: - name: ht-stmini-cls-v6_ftis_noPretrain-cssl-npsNonennsNone-masked results: [] --- # ht-stmini-cls-v6_ftis_noPretrain-cssl-npsNonennsNone-masked This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset. It achieves the following results on the evaluation set: - Loss: 2.0807 - Accuracy: 0.9008 - Macro F1: 0.7546 ## Model description More information needed ## Intended uses & limitations More information needed ## Training and evaluation data More information needed ## Training procedure ### Training hyperparameters The following hyperparameters were used during training: - learning_rate: 0.0001 - train_batch_size: 8 - eval_batch_size: 4 - seed: 42 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments - lr_scheduler_type: linear - lr_scheduler_warmup_steps: 6733 - training_steps: 134674 ### Training results | Training Loss | Epoch | Step | Validation Loss | Accuracy | Macro F1 | |:-------------:|:--------:|:-----:|:---------------:|:--------:|:--------:| | 1068.6496 | 0.0013 | 174 | 927.3249 | 0.0518 | 0.0275 | | 308.1229 | 1.0013 | 348 | 144.5194 | 0.0902 | 0.0379 | | 95.7417 | 2.0013 | 522 | 87.2913 | 0.2271 | 0.0660 | | 78.3458 | 3.0013 | 696 | 75.5185 | 0.3467 | 0.0916 | | 69.4517 | 4.0013 | 870 | 70.5351 | 0.4176 | 0.1059 | | 65.1834 | 5.0013 | 1044 | 64.9793 | 0.4599 | 0.1145 | | 60.7497 | 6.0012 | 1218 | 61.2821 | 0.4788 | 0.1187 | | 58.0137 | 7.0012 | 1392 | 58.6102 | 0.4989 | 0.1243 | | 54.199 | 8.0012 | 1566 | 56.7198 | 0.5115 | 0.1266 | | 51.9066 | 9.0012 | 1740 | 53.0077 | 0.5267 | 0.1294 | | 48.2793 | 10.0012 | 1914 | 50.1285 | 0.5314 | 0.1311 | | 46.2198 | 11.0012 | 2088 | 47.1320 | 0.5354 | 0.1321 | | 44.744 | 12.0012 | 2262 | 44.2763 | 0.5431 | 0.1342 | | 42.2182 | 13.0012 | 2436 | 42.1682 | 0.5466 | 0.1357 | | 40.2043 | 14.0012 | 2610 | 39.8411 | 0.5594 | 0.1389 | | 38.2283 | 15.0012 | 2784 | 37.6270 | 0.5603 | 0.1398 | | 36.4321 | 16.0012 | 2958 | 36.2176 | 0.5715 | 0.1427 | | 34.8123 | 17.0012 | 3132 | 34.1193 | 0.5726 | 0.1448 | | 33.0587 | 18.0012 | 3306 | 32.6392 | 0.5719 | 0.1462 | | 31.8378 | 19.0012 | 3480 | 31.1957 | 0.5994 | 0.1525 | | 30.045 | 20.0011 | 3654 | 29.0035 | 0.5837 | 0.1545 | | 28.3687 | 21.0011 | 3828 | 28.1111 | 0.6058 | 0.1617 | | 26.1973 | 22.0011 | 4002 | 27.2819 | 0.6039 | 0.1641 | | 25.2214 | 23.0011 | 4176 | 24.7536 | 0.6168 | 0.1710 | | 23.5136 | 24.0011 | 4350 | 24.3032 | 0.6130 | 0.1762 | | 21.5435 | 25.0011 | 4524 | 23.4460 | 0.6454 | 0.1803 | | 20.6734 | 26.0011 | 4698 | 20.8139 | 0.6602 | 0.1977 | | 19.3544 | 27.0011 | 4872 | 19.3059 | 0.6618 | 0.1978 | | 18.0144 | 28.0011 | 5046 | 20.0380 | 0.6707 | 0.2068 | | 17.0738 | 29.0011 | 5220 | 17.7529 | 0.6862 | 0.2279 | | 16.2132 | 30.0011 | 5394 | 17.0097 | 0.6964 | 0.2324 | | 15.0171 | 31.0011 | 5568 | 16.1995 | 0.7004 | 0.2420 | | 14.9223 | 32.0011 | 5742 | 15.5821 | 0.7008 | 0.2536 | | 13.4135 | 33.0010 | 5916 | 14.5521 | 0.7166 | 0.2654 | | 12.7349 | 34.0010 | 6090 | 14.7930 | 0.7008 | 0.2697 | | 11.7896 | 35.0010 | 6264 | 13.5747 | 0.7214 | 0.2948 | | 11.0743 | 36.0010 | 6438 | 12.6981 | 0.7297 | 0.3000 | | 10.2061 | 37.0010 | 6612 | 12.1564 | 0.7270 | 0.3073 | | 9.8949 | 38.0010 | 6786 | 11.5629 | 0.7381 | 0.3211 | | 9.3034 | 39.0010 | 6960 | 11.1978 | 0.7486 | 0.3459 | | 8.456 | 40.0010 | 7134 | 10.3292 | 0.7465 | 0.3483 | | 8.1784 | 41.0010 | 7308 | 10.1043 | 0.7501 | 0.3519 | | 7.6915 | 42.0010 | 7482 | 9.6616 | 0.7555 | 0.3691 | | 7.0886 | 43.0010 | 7656 | 8.8204 | 0.7654 | 0.3921 | | 6.647 | 44.0010 | 7830 | 8.5357 | 0.7653 | 0.3937 | | 6.0331 | 45.0010 | 8004 | 8.3202 | 0.7721 | 0.4060 | | 5.7497 | 46.0010 | 8178 | 7.7233 | 0.7730 | 0.3993 | | 5.5441 | 47.0009 | 8352 | 7.5435 | 0.7597 | 0.4057 | | 4.9054 | 48.0009 | 8526 | 7.0332 | 0.7697 | 0.4215 | | 4.5626 | 49.0009 | 8700 | 6.9012 | 0.7802 | 0.4359 | | 4.3469 | 50.0009 | 8874 | 6.3051 | 0.7884 | 0.4428 | | 4.5584 | 51.0009 | 9048 | 6.2547 | 0.7823 | 0.4393 | | 3.8359 | 52.0009 | 9222 | 6.1606 | 0.7888 | 0.4563 | | 3.4501 | 53.0009 | 9396 | 5.8539 | 0.7832 | 0.4587 | | 3.3335 | 54.0009 | 9570 | 5.5077 | 0.7961 | 0.4685 | | 3.1859 | 55.0009 | 9744 | 5.0799 | 0.7882 | 0.4701 | | 2.717 | 56.0009 | 9918 | 4.9820 | 0.7955 | 0.4663 | | 2.6174 | 57.0009 | 10092 | 4.8697 | 0.7989 | 0.4784 | | 2.6327 | 58.0009 | 10266 | 4.3934 | 0.8039 | 0.4901 | | 2.3842 | 59.0009 | 10440 | 4.6552 | 0.8068 | 0.4925 | | 2.2342 | 60.0008 | 10614 | 4.2204 | 0.7980 | 0.4960 | | 2.0333 | 61.0008 | 10788 | 4.0363 | 0.8088 | 0.5055 | | 1.9246 | 62.0008 | 10962 | 3.9155 | 0.8157 | 0.5089 | | 1.7581 | 63.0008 | 11136 | 3.7119 | 0.8076 | 0.5100 | | 1.5897 | 64.0008 | 11310 | 3.4220 | 0.8159 | 0.5227 | | 1.5181 | 65.0008 | 11484 | 3.4158 | 0.8143 | 0.5136 | | 1.2674 | 66.0008 | 11658 | 2.8214 | 0.8198 | 0.5272 | | 1.1923 | 67.0008 | 11832 | 2.4886 | 0.8220 | 0.5251 | | 0.9016 | 68.0008 | 12006 | 1.9265 | 0.8272 | 0.5332 | | 0.8419 | 69.0008 | 12180 | 1.8848 | 0.8276 | 0.5389 | | 0.7813 | 70.0008 | 12354 | 1.8943 | 0.8257 | 0.5352 | | 0.7618 | 71.0008 | 12528 | 1.7000 | 0.8299 | 0.5459 | | 0.7629 | 72.0008 | 12702 | 1.7150 | 0.8316 | 0.5438 | | 0.6173 | 73.0007 | 12876 | 1.6945 | 0.8339 | 0.5615 | | 0.5882 | 74.0007 | 13050 | 1.5856 | 0.8357 | 0.5651 | | 0.5431 | 75.0007 | 13224 | 1.5878 | 0.8416 | 0.5692 | | 0.551 | 76.0007 | 13398 | 1.6983 | 0.8401 | 0.5667 | | 0.5433 | 77.0007 | 13572 | 1.5201 | 0.8493 | 0.5798 | | 0.5021 | 78.0007 | 13746 | 1.5436 | 0.8470 | 0.5810 | | 0.4741 | 79.0007 | 13920 | 1.5503 | 0.8447 | 0.5849 | | 0.4976 | 80.0007 | 14094 | 1.5099 | 0.8526 | 0.5881 | | 0.4391 | 81.0007 | 14268 | 1.4309 | 0.8594 | 0.6006 | | 0.4102 | 82.0007 | 14442 | 1.6794 | 0.8487 | 0.5905 | | 0.3961 | 83.0007 | 14616 | 1.5327 | 0.8592 | 0.6005 | | 0.3966 | 84.0007 | 14790 | 1.5169 | 0.8553 | 0.6043 | | 0.3942 | 85.0007 | 14964 | 1.6696 | 0.8555 | 0.6113 | | 0.3771 | 86.0007 | 15138 | 1.5324 | 0.8612 | 0.6090 | | 0.3931 | 87.0006 | 15312 | 1.6094 | 0.8598 | 0.6123 | | 0.3416 | 88.0006 | 15486 | 1.6472 | 0.8641 | 0.6096 | | 0.342 | 89.0006 | 15660 | 1.5898 | 0.8668 | 0.6241 | | 0.3307 | 90.0006 | 15834 | 1.4751 | 0.8615 | 0.6214 | | 0.3485 | 91.0006 | 16008 | 1.5752 | 0.8637 | 0.6310 | | 0.3415 | 92.0006 | 16182 | 1.6328 | 0.8641 | 0.6265 | | 0.3132 | 93.0006 | 16356 | 1.5418 | 0.8698 | 0.6363 | | 0.3222 | 94.0006 | 16530 | 1.6462 | 0.8632 | 0.6287 | | 0.2826 | 95.0006 | 16704 | 1.6411 | 0.8688 | 0.6311 | | 0.2863 | 96.0006 | 16878 | 1.4910 | 0.8717 | 0.6417 | | 0.3122 | 97.0006 | 17052 | 1.6034 | 0.8701 | 0.6488 | | 0.2576 | 98.0006 | 17226 | 1.6647 | 0.8712 | 0.6476 | | 0.2627 | 99.0006 | 17400 | 1.5346 | 0.8737 | 0.6543 | | 0.2598 | 100.0005 | 17574 | 1.6360 | 0.8725 | 0.6507 | | 0.2797 | 101.0005 | 17748 | 1.6424 | 0.8734 | 0.6535 | | 0.2593 | 102.0005 | 17922 | 1.4222 | 0.8810 | 0.6695 | | 0.241 | 103.0005 | 18096 | 1.8432 | 0.8754 | 0.6545 | | 0.2433 | 104.0005 | 18270 | 1.7883 | 0.8707 | 0.6591 | | 0.2347 | 105.0005 | 18444 | 1.6862 | 0.8750 | 0.6629 | | 0.2247 | 106.0005 | 18618 | 1.6117 | 0.8781 | 0.6703 | | 0.2346 | 107.0005 | 18792 | 1.8101 | 0.8740 | 0.6637 | | 0.2276 | 108.0005 | 18966 | 1.6605 | 0.8772 | 0.6709 | | 0.237 | 109.0005 | 19140 | 1.7177 | 0.8830 | 0.6766 | | 0.2025 | 110.0005 | 19314 | 2.0185 | 0.8721 | 0.6645 | | 0.2233 | 111.0005 | 19488 | 1.7630 | 0.8767 | 0.6621 | | 0.214 | 112.0005 | 19662 | 1.5183 | 0.8849 | 0.6801 | | 0.1981 | 113.0005 | 19836 | 1.5889 | 0.8805 | 0.6797 | | 0.1891 | 114.0004 | 20010 | 1.7077 | 0.8825 | 0.6862 | | 0.2145 | 115.0004 | 20184 | 1.8261 | 0.8802 | 0.6842 | | 0.2086 | 116.0004 | 20358 | 1.7926 | 0.8852 | 0.6952 | | 0.2123 | 117.0004 | 20532 | 1.8480 | 0.8853 | 0.6866 | | 0.2009 | 118.0004 | 20706 | 2.1595 | 0.8831 | 0.6867 | | 0.1918 | 119.0004 | 20880 | 1.6066 | 0.8894 | 0.6880 | | 0.1849 | 120.0004 | 21054 | 1.8627 | 0.8839 | 0.6901 | | 0.1735 | 121.0004 | 21228 | 1.6292 | 0.8891 | 0.7035 | | 0.1785 | 122.0004 | 21402 | 1.9735 | 0.8827 | 0.6978 | | 0.1735 | 123.0004 | 21576 | 1.6743 | 0.8876 | 0.6935 | | 0.164 | 124.0004 | 21750 | 1.8791 | 0.8875 | 0.7010 | | 0.1621 | 125.0004 | 21924 | 1.6715 | 0.8865 | 0.6989 | | 0.1715 | 126.0004 | 22098 | 1.9634 | 0.8864 | 0.6982 | | 0.1631 | 127.0003 | 22272 | 1.8175 | 0.8896 | 0.7001 | | 0.1595 | 128.0003 | 22446 | 1.9219 | 0.8856 | 0.7048 | | 0.1647 | 129.0003 | 22620 | 1.8467 | 0.8896 | 0.7037 | | 0.159 | 130.0003 | 22794 | 2.0490 | 0.8861 | 0.7026 | | 0.1545 | 131.0003 | 22968 | 1.8514 | 0.8909 | 0.7053 | | 0.1507 | 132.0003 | 23142 | 1.8973 | 0.8874 | 0.7081 | | 0.1606 | 133.0003 | 23316 | 1.9146 | 0.8862 | 0.7079 | | 0.1573 | 134.0003 | 23490 | 2.0957 | 0.8839 | 0.6980 | | 0.1379 | 135.0003 | 23664 | 1.7862 | 0.8915 | 0.7133 | | 0.1524 | 136.0003 | 23838 | 1.7193 | 0.8938 | 0.7123 | | 0.1405 | 137.0003 | 24012 | 1.8991 | 0.8922 | 0.7135 | | 0.1532 | 138.0003 | 24186 | 1.9856 | 0.8902 | 0.7153 | | 0.1353 | 139.0003 | 24360 | 1.7119 | 0.8927 | 0.7175 | | 0.1417 | 140.0003 | 24534 | 1.9171 | 0.8965 | 0.7203 | | 0.1388 | 141.0002 | 24708 | 1.9839 | 0.8859 | 0.7093 | | 0.1298 | 142.0002 | 24882 | 2.0332 | 0.8931 | 0.7192 | | 0.1256 | 143.0002 | 25056 | 2.0786 | 0.8905 | 0.7097 | | 0.1335 | 144.0002 | 25230 | 2.0931 | 0.8933 | 0.7230 | | 0.1341 | 145.0002 | 25404 | 1.7686 | 0.8900 | 0.7166 | | 0.1481 | 146.0002 | 25578 | 1.7871 | 0.8901 | 0.7129 | | 0.1337 | 147.0002 | 25752 | 2.1121 | 0.8931 | 0.7231 | | 0.1309 | 148.0002 | 25926 | 2.1013 | 0.8929 | 0.7162 | | 0.1246 | 149.0002 | 26100 | 2.1843 | 0.8881 | 0.7104 | | 0.1199 | 150.0002 | 26274 | 2.0095 | 0.8970 | 0.7230 | | 0.1229 | 151.0002 | 26448 | 2.4829 | 0.8862 | 0.7048 | | 0.1175 | 152.0002 | 26622 | 2.0915 | 0.8955 | 0.7238 | | 0.1245 | 153.0002 | 26796 | 2.3886 | 0.8874 | 0.7171 | | 0.1229 | 154.0001 | 26970 | 1.8728 | 0.8920 | 0.7196 | | 0.1138 | 155.0001 | 27144 | 2.0265 | 0.8924 | 0.7180 | | 0.1138 | 156.0001 | 27318 | 1.9569 | 0.8957 | 0.7262 | | 0.1126 | 157.0001 | 27492 | 2.2207 | 0.8942 | 0.7258 | | 0.1152 | 158.0001 | 27666 | 2.2116 | 0.8906 | 0.7237 | | 0.1087 | 159.0001 | 27840 | 2.0753 | 0.8945 | 0.7244 | | 0.1085 | 160.0001 | 28014 | 1.9079 | 0.8947 | 0.7282 | | 0.1106 | 161.0001 | 28188 | 2.1649 | 0.8947 | 0.7245 | | 0.1073 | 162.0001 | 28362 | 2.0426 | 0.8941 | 0.7213 | | 0.1088 | 163.0001 | 28536 | 2.2528 | 0.8933 | 0.7277 | | 0.1016 | 164.0001 | 28710 | 2.2057 | 0.8920 | 0.7266 | | 0.1041 | 165.0001 | 28884 | 1.9267 | 0.8980 | 0.7331 | | 0.0988 | 166.0001 | 29058 | 2.2524 | 0.8926 | 0.7288 | | 0.0963 | 167.0001 | 29232 | 2.4129 | 0.8938 | 0.7253 | | 0.1031 | 168.0000 | 29406 | 2.3614 | 0.8945 | 0.7313 | | 0.1061 | 169.0000 | 29580 | 2.3434 | 0.8936 | 0.7317 | | 0.1028 | 170.0000 | 29754 | 2.4533 | 0.8972 | 0.7351 | | 0.0983 | 171.0000 | 29928 | 2.1360 | 0.8979 | 0.7325 | | 0.0995 | 172.0000 | 30102 | 2.3943 | 0.8964 | 0.7311 | | 0.091 | 173.0000 | 30276 | 2.5133 | 0.8954 | 0.7282 | | 0.1024 | 173.0013 | 30450 | 2.2063 | 0.8953 | 0.7351 | | 0.1099 | 174.0013 | 30624 | 2.5222 | 0.8937 | 0.7284 | | 0.0909 | 175.0013 | 30798 | 2.4572 | 0.8972 | 0.7320 | | 0.0939 | 176.0013 | 30972 | 2.1493 | 0.9016 | 0.7400 | | 0.0897 | 177.0013 | 31146 | 2.1848 | 0.8955 | 0.7369 | | 0.0927 | 178.0013 | 31320 | 2.3691 | 0.8972 | 0.7341 | | 0.0955 | 179.0013 | 31494 | 2.5360 | 0.8993 | 0.7424 | | 0.1015 | 180.0012 | 31668 | 2.4618 | 0.8935 | 0.7373 | | 0.0881 | 181.0012 | 31842 | 2.6819 | 0.8922 | 0.7300 | | 0.0852 | 182.0012 | 32016 | 2.3577 | 0.8996 | 0.7405 | | 0.0959 | 183.0012 | 32190 | 2.4324 | 0.8972 | 0.7389 | | 0.0862 | 184.0012 | 32364 | 2.5442 | 0.8928 | 0.7294 | | 0.0831 | 185.0012 | 32538 | 2.2686 | 0.8981 | 0.7429 | | 0.0914 | 186.0012 | 32712 | 2.4523 | 0.8972 | 0.7361 | | 0.0888 | 187.0012 | 32886 | 2.2054 | 0.9001 | 0.7492 | | 0.0842 | 188.0012 | 33060 | 2.2701 | 0.9002 | 0.7416 | | 0.0758 | 189.0012 | 33234 | 2.4414 | 0.8979 | 0.7448 | | 0.0882 | 190.0012 | 33408 | 2.7005 | 0.8937 | 0.7344 | | 0.077 | 191.0012 | 33582 | 2.6340 | 0.8964 | 0.7416 | | 0.0895 | 192.0012 | 33756 | 2.7526 | 0.8951 | 0.7412 | | 0.0862 | 193.0012 | 33930 | 2.6311 | 0.8960 | 0.7457 | | 0.0826 | 194.0011 | 34104 | 2.7022 | 0.8995 | 0.7371 | | 0.0789 | 195.0011 | 34278 | 2.2653 | 0.9003 | 0.7534 | | 0.0786 | 196.0011 | 34452 | 2.5231 | 0.9026 | 0.7508 | | 0.0863 | 197.0011 | 34626 | 2.8395 | 0.8969 | 0.7415 | | 0.0801 | 198.0011 | 34800 | 2.6086 | 0.8974 | 0.7461 | | 0.0764 | 199.0011 | 34974 | 2.7182 | 0.8961 | 0.7440 | | 0.092 | 200.0011 | 35148 | 2.3861 | 0.9032 | 0.7394 | | 0.0798 | 201.0011 | 35322 | 2.4694 | 0.9010 | 0.7462 | | 0.0783 | 202.0011 | 35496 | 2.4647 | 0.8987 | 0.7473 | | 0.0696 | 203.0011 | 35670 | 3.2532 | 0.8956 | 0.7387 | | 0.0871 | 204.0011 | 35844 | 2.4562 | 0.8975 | 0.7499 | | 0.0746 | 205.0011 | 36018 | 2.3953 | 0.9010 | 0.7498 | | 0.0774 | 206.0011 | 36192 | 2.6896 | 0.8975 | 0.7383 | | 0.0751 | 207.0010 | 36366 | 2.6423 | 0.9005 | 0.7471 | | 0.0749 | 208.0010 | 36540 | 2.4105 | 0.9058 | 0.7508 | | 0.0716 | 209.0010 | 36714 | 2.5575 | 0.8989 | 0.7472 | | 0.0749 | 210.0010 | 36888 | 2.8735 | 0.8993 | 0.7474 | | 0.0717 | 211.0010 | 37062 | 2.7920 | 0.9003 | 0.7481 | | 0.074 | 212.0010 | 37236 | 2.9536 | 0.8964 | 0.7438 | | 0.0787 | 213.0010 | 37410 | 2.6125 | 0.9005 | 0.7459 | | 0.0781 | 214.0010 | 37584 | 2.9591 | 0.8968 | 0.7409 | | 0.0612 | 215.0010 | 37758 | 2.8858 | 0.9004 | 0.7492 | ### Framework versions - Transformers 4.46.0 - Pytorch 2.3.1+cu121 - Datasets 2.20.0 - Tokenizers 0.20.1