Arthur-Tsai commited on
Commit
74a1a12
·
verified ·
1 Parent(s): 7f3e807

End of training

Browse files
README.md ADDED
@@ -0,0 +1,348 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ tags:
4
+ - generated_from_trainer
5
+ metrics:
6
+ - accuracy
7
+ model-index:
8
+ - name: ht-stmini-cls-v6_ftis_noPretrain-cssl-msm-pos
9
+ results: []
10
+ ---
11
+
12
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
+ should probably proofread and complete it, then remove this comment. -->
14
+
15
+ # ht-stmini-cls-v6_ftis_noPretrain-cssl-msm-pos
16
+
17
+ This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
18
+ It achieves the following results on the evaluation set:
19
+ - Loss: 52.0508
20
+ - Accuracy: 0.9135
21
+ - Macro F1: 0.8017
22
+
23
+ ## Model description
24
+
25
+ More information needed
26
+
27
+ ## Intended uses & limitations
28
+
29
+ More information needed
30
+
31
+ ## Training and evaluation data
32
+
33
+ More information needed
34
+
35
+ ## Training procedure
36
+
37
+ ### Training hyperparameters
38
+
39
+ The following hyperparameters were used during training:
40
+ - learning_rate: 0.0001
41
+ - train_batch_size: 8
42
+ - eval_batch_size: 4
43
+ - seed: 42
44
+ - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
45
+ - lr_scheduler_type: linear
46
+ - lr_scheduler_warmup_steps: 6733
47
+ - training_steps: 134674
48
+
49
+ ### Training results
50
+
51
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | Macro F1 |
52
+ |:-------------:|:--------:|:-----:|:---------------:|:--------:|:--------:|
53
+ | 2049.1461 | 0.0013 | 174 | 1800.4193 | 0.0689 | 0.0294 |
54
+ | 565.4023 | 1.0013 | 348 | 341.6570 | 0.1097 | 0.0382 |
55
+ | 209.2343 | 2.0013 | 522 | 311.3144 | 0.2400 | 0.0639 |
56
+ | 177.3292 | 3.0013 | 696 | 311.6365 | 0.3554 | 0.0849 |
57
+ | 155.6534 | 4.0013 | 870 | 332.2397 | 0.4248 | 0.1052 |
58
+ | 143.0149 | 5.0013 | 1044 | 344.1388 | 0.4634 | 0.1146 |
59
+ | 128.0991 | 6.0012 | 1218 | 315.7558 | 0.4837 | 0.1205 |
60
+ | 118.0361 | 7.0012 | 1392 | 334.5023 | 0.5028 | 0.1246 |
61
+ | 106.5873 | 8.0012 | 1566 | 326.3535 | 0.5131 | 0.1269 |
62
+ | 101.0085 | 9.0012 | 1740 | 310.9904 | 0.5255 | 0.1299 |
63
+ | 93.4805 | 10.0012 | 1914 | 282.8377 | 0.5332 | 0.1326 |
64
+ | 89.738 | 11.0012 | 2088 | 230.7974 | 0.5392 | 0.1343 |
65
+ | 87.1364 | 12.0012 | 2262 | 211.2657 | 0.5464 | 0.1375 |
66
+ | 82.4582 | 13.0012 | 2436 | 194.0442 | 0.5537 | 0.1404 |
67
+ | 78.5378 | 14.0012 | 2610 | 170.9561 | 0.5649 | 0.1430 |
68
+ | 74.8424 | 15.0012 | 2784 | 159.6624 | 0.5607 | 0.1437 |
69
+ | 71.4314 | 16.0012 | 2958 | 153.6095 | 0.5781 | 0.1482 |
70
+ | 68.4535 | 17.0012 | 3132 | 146.7028 | 0.5778 | 0.1498 |
71
+ | 64.8446 | 18.0012 | 3306 | 131.3660 | 0.5815 | 0.1522 |
72
+ | 62.4702 | 19.0012 | 3480 | 126.0886 | 0.6040 | 0.1563 |
73
+ | 58.8144 | 20.0011 | 3654 | 112.7233 | 0.5828 | 0.1582 |
74
+ | 55.6033 | 21.0011 | 3828 | 111.9374 | 0.6094 | 0.1670 |
75
+ | 51.28 | 22.0011 | 4002 | 106.3416 | 0.6040 | 0.1668 |
76
+ | 49.2113 | 23.0011 | 4176 | 97.6079 | 0.6159 | 0.1755 |
77
+ | 45.3201 | 24.0011 | 4350 | 93.0636 | 0.6165 | 0.1836 |
78
+ | 41.4229 | 25.0011 | 4524 | 88.3754 | 0.6443 | 0.1858 |
79
+ | 39.7556 | 26.0011 | 4698 | 83.4745 | 0.6557 | 0.2018 |
80
+ | 36.5526 | 27.0011 | 4872 | 79.9107 | 0.6630 | 0.2072 |
81
+ | 34.053 | 28.0011 | 5046 | 80.6923 | 0.6622 | 0.2103 |
82
+ | 32.1878 | 29.0011 | 5220 | 78.1180 | 0.6831 | 0.2327 |
83
+ | 30.2641 | 30.0011 | 5394 | 75.4703 | 0.6875 | 0.2420 |
84
+ | 27.5827 | 31.0011 | 5568 | 74.4757 | 0.6933 | 0.2555 |
85
+ | 27.3727 | 32.0011 | 5742 | 74.6789 | 0.6903 | 0.2625 |
86
+ | 24.1575 | 33.0010 | 5916 | 70.0468 | 0.7085 | 0.2770 |
87
+ | 23.1529 | 34.0010 | 6090 | 73.9712 | 0.7002 | 0.2799 |
88
+ | 20.9037 | 35.0010 | 6264 | 76.2687 | 0.7179 | 0.3137 |
89
+ | 19.7302 | 36.0010 | 6438 | 68.7230 | 0.7270 | 0.3149 |
90
+ | 17.8816 | 37.0010 | 6612 | 71.3928 | 0.7315 | 0.3253 |
91
+ | 17.3625 | 38.0010 | 6786 | 77.4764 | 0.7276 | 0.3367 |
92
+ | 15.7429 | 39.0010 | 6960 | 72.2376 | 0.7513 | 0.3669 |
93
+ | 14.4441 | 40.0010 | 7134 | 71.0178 | 0.7470 | 0.3722 |
94
+ | 13.6976 | 41.0010 | 7308 | 74.7617 | 0.7496 | 0.3780 |
95
+ | 12.8759 | 42.0010 | 7482 | 76.6921 | 0.7485 | 0.3849 |
96
+ | 11.4875 | 43.0010 | 7656 | 79.4004 | 0.7594 | 0.4130 |
97
+ | 10.9184 | 44.0010 | 7830 | 80.5496 | 0.7635 | 0.4182 |
98
+ | 9.6914 | 45.0010 | 8004 | 83.1104 | 0.7697 | 0.4192 |
99
+ | 8.8385 | 46.0010 | 8178 | 82.5643 | 0.7690 | 0.4237 |
100
+ | 8.5961 | 47.0009 | 8352 | 87.4413 | 0.7670 | 0.4386 |
101
+ | 7.6754 | 48.0009 | 8526 | 82.4477 | 0.7783 | 0.4398 |
102
+ | 6.9892 | 49.0009 | 8700 | 87.6566 | 0.7813 | 0.4493 |
103
+ | 6.442 | 50.0009 | 8874 | 81.7274 | 0.7856 | 0.4610 |
104
+ | 6.8617 | 51.0009 | 9048 | 83.6309 | 0.7801 | 0.4627 |
105
+ | 5.8208 | 52.0009 | 9222 | 87.4704 | 0.7796 | 0.4553 |
106
+ | 5.1756 | 53.0009 | 9396 | 85.3993 | 0.7876 | 0.4796 |
107
+ | 4.9696 | 54.0009 | 9570 | 88.7034 | 0.7934 | 0.4784 |
108
+ | 4.7399 | 55.0009 | 9744 | 83.7558 | 0.7880 | 0.4851 |
109
+ | 3.9034 | 56.0009 | 9918 | 74.9174 | 0.8016 | 0.4907 |
110
+ | 3.7455 | 57.0009 | 10092 | 82.2448 | 0.7972 | 0.5018 |
111
+ | 3.6231 | 58.0009 | 10266 | 78.0172 | 0.8101 | 0.5119 |
112
+ | 3.2277 | 59.0009 | 10440 | 82.1616 | 0.8017 | 0.5163 |
113
+ | 3.0793 | 60.0008 | 10614 | 83.3934 | 0.8134 | 0.5252 |
114
+ | 2.8513 | 61.0008 | 10788 | 78.8820 | 0.8097 | 0.5257 |
115
+ | 2.5322 | 62.0008 | 10962 | 84.4697 | 0.8157 | 0.5310 |
116
+ | 2.2705 | 63.0008 | 11136 | 76.6802 | 0.8213 | 0.5418 |
117
+ | 1.8824 | 64.0008 | 11310 | 68.8732 | 0.8211 | 0.5384 |
118
+ | 1.585 | 65.0008 | 11484 | 57.1948 | 0.8137 | 0.5150 |
119
+ | 1.1991 | 66.0008 | 11658 | 49.8222 | 0.8240 | 0.5227 |
120
+ | 1.1368 | 67.0008 | 11832 | 38.8429 | 0.8338 | 0.5457 |
121
+ | 0.9509 | 68.0008 | 12006 | 60.7010 | 0.8279 | 0.5354 |
122
+ | 0.8946 | 69.0008 | 12180 | 65.4121 | 0.8373 | 0.5605 |
123
+ | 0.8512 | 70.0008 | 12354 | 62.3615 | 0.8310 | 0.5552 |
124
+ | 0.7906 | 71.0008 | 12528 | 62.2998 | 0.8389 | 0.5698 |
125
+ | 0.7885 | 72.0008 | 12702 | 57.8105 | 0.8408 | 0.5757 |
126
+ | 0.5971 | 73.0007 | 12876 | 58.8721 | 0.8452 | 0.5752 |
127
+ | 0.6079 | 74.0007 | 13050 | 55.2946 | 0.8484 | 0.5856 |
128
+ | 0.5256 | 75.0007 | 13224 | 61.0752 | 0.8518 | 0.5989 |
129
+ | 0.5172 | 76.0007 | 13398 | 62.3961 | 0.8545 | 0.6017 |
130
+ | 0.4981 | 77.0007 | 13572 | 55.6295 | 0.8650 | 0.6072 |
131
+ | 0.4543 | 78.0007 | 13746 | 68.2968 | 0.8605 | 0.6137 |
132
+ | 0.4259 | 79.0007 | 13920 | 56.9472 | 0.8593 | 0.6118 |
133
+ | 0.4571 | 80.0007 | 14094 | 54.4692 | 0.8592 | 0.6172 |
134
+ | 0.4175 | 81.0007 | 14268 | 56.7649 | 0.8676 | 0.6313 |
135
+ | 0.3959 | 82.0007 | 14442 | 50.5353 | 0.8685 | 0.6373 |
136
+ | 0.3605 | 83.0007 | 14616 | 53.6423 | 0.8629 | 0.6354 |
137
+ | 0.3522 | 84.0007 | 14790 | 45.1454 | 0.8725 | 0.6455 |
138
+ | 0.3343 | 85.0007 | 14964 | 47.1450 | 0.8697 | 0.6475 |
139
+ | 0.335 | 86.0007 | 15138 | 45.6049 | 0.8708 | 0.6474 |
140
+ | 0.3395 | 87.0006 | 15312 | 43.6421 | 0.8697 | 0.6446 |
141
+ | 0.2969 | 88.0006 | 15486 | 41.7570 | 0.8753 | 0.6526 |
142
+ | 0.3008 | 89.0006 | 15660 | 41.8292 | 0.8781 | 0.6611 |
143
+ | 0.2823 | 90.0006 | 15834 | 43.5466 | 0.8770 | 0.6594 |
144
+ | 0.3032 | 91.0006 | 16008 | 44.2423 | 0.8771 | 0.6653 |
145
+ | 0.2955 | 92.0006 | 16182 | 46.0140 | 0.8743 | 0.6611 |
146
+ | 0.2768 | 93.0006 | 16356 | 41.9541 | 0.8848 | 0.6742 |
147
+ | 0.2751 | 94.0006 | 16530 | 38.5932 | 0.8850 | 0.6753 |
148
+ | 0.2487 | 95.0006 | 16704 | 34.4618 | 0.8793 | 0.6629 |
149
+ | 0.2333 | 96.0006 | 16878 | 36.7106 | 0.8859 | 0.6768 |
150
+ | 0.2511 | 97.0006 | 17052 | 42.5301 | 0.8807 | 0.6779 |
151
+ | 0.2223 | 98.0006 | 17226 | 36.7938 | 0.8847 | 0.6841 |
152
+ | 0.219 | 99.0006 | 17400 | 43.3022 | 0.8854 | 0.6835 |
153
+ | 0.2216 | 100.0005 | 17574 | 36.1185 | 0.8865 | 0.6887 |
154
+ | 0.2581 | 101.0005 | 17748 | 37.7511 | 0.8890 | 0.6878 |
155
+ | 0.2202 | 102.0005 | 17922 | 36.3711 | 0.8917 | 0.6977 |
156
+ | 0.2065 | 103.0005 | 18096 | 34.7778 | 0.8896 | 0.6862 |
157
+ | 0.1951 | 104.0005 | 18270 | 35.3296 | 0.8933 | 0.6963 |
158
+ | 0.2161 | 105.0005 | 18444 | 35.1388 | 0.8885 | 0.6979 |
159
+ | 0.19 | 106.0005 | 18618 | 37.4090 | 0.8930 | 0.7022 |
160
+ | 0.2014 | 107.0005 | 18792 | 39.9304 | 0.8864 | 0.7021 |
161
+ | 0.1884 | 108.0005 | 18966 | 35.6866 | 0.8928 | 0.7045 |
162
+ | 0.1869 | 109.0005 | 19140 | 31.2140 | 0.8958 | 0.7134 |
163
+ | 0.1679 | 110.0005 | 19314 | 35.2881 | 0.8927 | 0.7023 |
164
+ | 0.1778 | 111.0005 | 19488 | 33.9267 | 0.8958 | 0.7082 |
165
+ | 0.1611 | 112.0005 | 19662 | 35.9019 | 0.8938 | 0.7007 |
166
+ | 0.1501 | 113.0005 | 19836 | 32.8321 | 0.8959 | 0.7068 |
167
+ | 0.1581 | 114.0004 | 20010 | 36.4848 | 0.8866 | 0.6971 |
168
+ | 0.1826 | 115.0004 | 20184 | 32.8341 | 0.8925 | 0.7157 |
169
+ | 0.1618 | 116.0004 | 20358 | 32.4633 | 0.8962 | 0.7197 |
170
+ | 0.1494 | 117.0004 | 20532 | 32.6688 | 0.8980 | 0.7261 |
171
+ | 0.161 | 118.0004 | 20706 | 33.1626 | 0.8955 | 0.7228 |
172
+ | 0.1413 | 119.0004 | 20880 | 31.5254 | 0.9023 | 0.7319 |
173
+ | 0.149 | 120.0004 | 21054 | 29.9773 | 0.8976 | 0.7269 |
174
+ | 0.1273 | 121.0004 | 21228 | 31.7605 | 0.8973 | 0.7279 |
175
+ | 0.1337 | 122.0004 | 21402 | 28.5254 | 0.8967 | 0.7227 |
176
+ | 0.1343 | 123.0004 | 21576 | 25.9094 | 0.9059 | 0.7337 |
177
+ | 0.1265 | 124.0004 | 21750 | 26.7278 | 0.9005 | 0.7359 |
178
+ | 0.127 | 125.0004 | 21924 | 31.3249 | 0.9019 | 0.7416 |
179
+ | 0.1324 | 126.0004 | 22098 | 26.6934 | 0.9023 | 0.7299 |
180
+ | 0.1198 | 127.0003 | 22272 | 25.5277 | 0.9042 | 0.7357 |
181
+ | 0.1158 | 128.0003 | 22446 | 30.6548 | 0.9016 | 0.7369 |
182
+ | 0.1219 | 129.0003 | 22620 | 29.1850 | 0.8997 | 0.7359 |
183
+ | 0.1224 | 130.0003 | 22794 | 29.7951 | 0.9023 | 0.7377 |
184
+ | 0.1091 | 131.0003 | 22968 | 25.7487 | 0.9025 | 0.7397 |
185
+ | 0.1072 | 132.0003 | 23142 | 26.1091 | 0.9032 | 0.7341 |
186
+ | 0.1221 | 133.0003 | 23316 | 30.3684 | 0.9021 | 0.7416 |
187
+ | 0.1111 | 134.0003 | 23490 | 24.3900 | 0.9052 | 0.7435 |
188
+ | 0.104 | 135.0003 | 23664 | 25.5572 | 0.9054 | 0.7292 |
189
+ | 0.1085 | 136.0003 | 23838 | 26.8688 | 0.9061 | 0.7467 |
190
+ | 0.1118 | 137.0003 | 24012 | 27.1255 | 0.9067 | 0.7473 |
191
+ | 0.1092 | 138.0003 | 24186 | 25.9586 | 0.9060 | 0.7465 |
192
+ | 0.1025 | 139.0003 | 24360 | 28.3758 | 0.9035 | 0.7423 |
193
+ | 0.098 | 140.0003 | 24534 | 24.8116 | 0.9050 | 0.7528 |
194
+ | 0.0976 | 141.0002 | 24708 | 25.9563 | 0.9031 | 0.7458 |
195
+ | 0.095 | 142.0002 | 24882 | 26.6460 | 0.9040 | 0.7514 |
196
+ | 0.0946 | 143.0002 | 25056 | 27.5637 | 0.9040 | 0.7468 |
197
+ | 0.1112 | 144.0002 | 25230 | 26.6580 | 0.9059 | 0.7534 |
198
+ | 0.0931 | 145.0002 | 25404 | 25.6440 | 0.9032 | 0.7466 |
199
+ | 0.101 | 146.0002 | 25578 | 25.0267 | 0.9033 | 0.7548 |
200
+ | 0.0837 | 147.0002 | 25752 | 25.2071 | 0.9021 | 0.7547 |
201
+ | 0.092 | 148.0002 | 25926 | 22.9742 | 0.9069 | 0.7561 |
202
+ | 0.0832 | 149.0002 | 26100 | 26.6763 | 0.9047 | 0.7474 |
203
+ | 0.0886 | 150.0002 | 26274 | 28.7782 | 0.9103 | 0.7599 |
204
+ | 0.0791 | 151.0002 | 26448 | 27.2992 | 0.9092 | 0.7587 |
205
+ | 0.08 | 152.0002 | 26622 | 26.8654 | 0.9100 | 0.7660 |
206
+ | 0.0823 | 153.0002 | 26796 | 25.9254 | 0.9101 | 0.7605 |
207
+ | 0.0857 | 154.0001 | 26970 | 26.7305 | 0.9078 | 0.7630 |
208
+ | 0.082 | 155.0001 | 27144 | 28.5294 | 0.9081 | 0.7659 |
209
+ | 0.082 | 156.0001 | 27318 | 24.1481 | 0.9081 | 0.7648 |
210
+ | 0.0792 | 157.0001 | 27492 | 26.6694 | 0.9085 | 0.7623 |
211
+ | 0.075 | 158.0001 | 27666 | 25.4381 | 0.9084 | 0.7731 |
212
+ | 0.0768 | 159.0001 | 27840 | 26.5050 | 0.9106 | 0.7698 |
213
+ | 0.0719 | 160.0001 | 28014 | 32.5789 | 0.9098 | 0.7715 |
214
+ | 0.0755 | 161.0001 | 28188 | 25.7877 | 0.9067 | 0.7634 |
215
+ | 0.077 | 162.0001 | 28362 | 27.9020 | 0.9093 | 0.7669 |
216
+ | 0.0768 | 163.0001 | 28536 | 30.1018 | 0.9095 | 0.7655 |
217
+ | 0.0762 | 164.0001 | 28710 | 29.1469 | 0.9070 | 0.7658 |
218
+ | 0.0721 | 165.0001 | 28884 | 26.5870 | 0.9070 | 0.7687 |
219
+ | 0.0683 | 166.0001 | 29058 | 29.8100 | 0.9014 | 0.7657 |
220
+ | 0.0674 | 167.0001 | 29232 | 29.8396 | 0.9088 | 0.7629 |
221
+ | 0.0684 | 168.0000 | 29406 | 26.3450 | 0.9123 | 0.7683 |
222
+ | 0.0655 | 169.0000 | 29580 | 30.6365 | 0.9089 | 0.7685 |
223
+ | 0.0791 | 170.0000 | 29754 | 27.0922 | 0.9072 | 0.7672 |
224
+ | 0.0635 | 171.0000 | 29928 | 26.5640 | 0.9101 | 0.7725 |
225
+ | 0.0663 | 172.0000 | 30102 | 32.9721 | 0.9074 | 0.7699 |
226
+ | 0.0696 | 173.0000 | 30276 | 31.4842 | 0.9092 | 0.7688 |
227
+ | 0.0672 | 173.0013 | 30450 | 32.9775 | 0.9097 | 0.7732 |
228
+ | 0.0698 | 174.0013 | 30624 | 31.9401 | 0.9116 | 0.7734 |
229
+ | 0.0616 | 175.0013 | 30798 | 31.2141 | 0.9101 | 0.7772 |
230
+ | 0.0609 | 176.0013 | 30972 | 27.9407 | 0.9121 | 0.7742 |
231
+ | 0.0615 | 177.0013 | 31146 | 33.0894 | 0.9113 | 0.7735 |
232
+ | 0.069 | 178.0013 | 31320 | 31.1754 | 0.9103 | 0.7770 |
233
+ | 0.0616 | 179.0013 | 31494 | 29.4448 | 0.9086 | 0.7775 |
234
+ | 0.0688 | 180.0012 | 31668 | 30.4034 | 0.9123 | 0.7794 |
235
+ | 0.0635 | 181.0012 | 31842 | 37.7052 | 0.9088 | 0.7720 |
236
+ | 0.0565 | 182.0012 | 32016 | 30.6736 | 0.9101 | 0.7756 |
237
+ | 0.0647 | 183.0012 | 32190 | 35.0087 | 0.9120 | 0.7790 |
238
+ | 0.0541 | 184.0012 | 32364 | 32.5721 | 0.9102 | 0.7766 |
239
+ | 0.0575 | 185.0012 | 32538 | 39.5180 | 0.9103 | 0.7759 |
240
+ | 0.0642 | 186.0012 | 32712 | 36.0048 | 0.9104 | 0.7817 |
241
+ | 0.0568 | 187.0012 | 32886 | 31.1171 | 0.9116 | 0.7782 |
242
+ | 0.0529 | 188.0012 | 33060 | 36.1413 | 0.9093 | 0.7808 |
243
+ | 0.0509 | 189.0012 | 33234 | 32.2618 | 0.9061 | 0.7843 |
244
+ | 0.0586 | 190.0012 | 33408 | 30.2432 | 0.9088 | 0.7808 |
245
+ | 0.0526 | 191.0012 | 33582 | 38.5739 | 0.9089 | 0.7793 |
246
+ | 0.0624 | 192.0012 | 33756 | 32.6786 | 0.9106 | 0.7765 |
247
+ | 0.0582 | 193.0012 | 33930 | 33.1057 | 0.9106 | 0.7851 |
248
+ | 0.0561 | 194.0011 | 34104 | 31.6344 | 0.9105 | 0.7860 |
249
+ | 0.0497 | 195.0011 | 34278 | 36.1805 | 0.9133 | 0.7834 |
250
+ | 0.0489 | 196.0011 | 34452 | 37.0280 | 0.9101 | 0.7832 |
251
+ | 0.0528 | 197.0011 | 34626 | 34.1204 | 0.9110 | 0.7802 |
252
+ | 0.0517 | 198.0011 | 34800 | 31.8136 | 0.9098 | 0.7837 |
253
+ | 0.0488 | 199.0011 | 34974 | 33.3773 | 0.9107 | 0.7821 |
254
+ | 0.0562 | 200.0011 | 35148 | 31.3113 | 0.9116 | 0.7796 |
255
+ | 0.0479 | 201.0011 | 35322 | 32.9159 | 0.9114 | 0.7861 |
256
+ | 0.0579 | 202.0011 | 35496 | 32.8361 | 0.9084 | 0.7860 |
257
+ | 0.0555 | 203.0011 | 35670 | 44.6282 | 0.9096 | 0.7858 |
258
+ | 0.0595 | 204.0011 | 35844 | 37.1931 | 0.9089 | 0.7724 |
259
+ | 0.0497 | 205.0011 | 36018 | 34.0041 | 0.9103 | 0.7849 |
260
+ | 0.0453 | 206.0011 | 36192 | 33.8356 | 0.9146 | 0.7902 |
261
+ | 0.0527 | 207.0010 | 36366 | 39.8249 | 0.9086 | 0.7816 |
262
+ | 0.0519 | 208.0010 | 36540 | 38.2800 | 0.9123 | 0.7907 |
263
+ | 0.0447 | 209.0010 | 36714 | 37.5709 | 0.9088 | 0.7863 |
264
+ | 0.0481 | 210.0010 | 36888 | 34.6015 | 0.9114 | 0.7877 |
265
+ | 0.0484 | 211.0010 | 37062 | 38.2721 | 0.9095 | 0.7847 |
266
+ | 0.0574 | 212.0010 | 37236 | 33.5939 | 0.9118 | 0.7894 |
267
+ | 0.0481 | 213.0010 | 37410 | 39.5158 | 0.9122 | 0.7887 |
268
+ | 0.0481 | 214.0010 | 37584 | 36.9508 | 0.9115 | 0.7852 |
269
+ | 0.0378 | 215.0010 | 37758 | 35.9266 | 0.9097 | 0.7838 |
270
+ | 0.0538 | 216.0010 | 37932 | 39.2063 | 0.9087 | 0.7793 |
271
+ | 0.0433 | 217.0010 | 38106 | 41.0355 | 0.9109 | 0.7819 |
272
+ | 0.045 | 218.0010 | 38280 | 38.1408 | 0.9117 | 0.7859 |
273
+ | 0.0449 | 219.0010 | 38454 | 44.9038 | 0.9113 | 0.7873 |
274
+ | 0.0399 | 220.0010 | 38628 | 41.5447 | 0.9116 | 0.7850 |
275
+ | 0.0424 | 221.0009 | 38802 | 43.5924 | 0.9120 | 0.7819 |
276
+ | 0.0416 | 222.0009 | 38976 | 45.6428 | 0.9138 | 0.7862 |
277
+ | 0.0484 | 223.0009 | 39150 | 36.4716 | 0.9127 | 0.7918 |
278
+ | 0.0455 | 224.0009 | 39324 | 44.9424 | 0.9151 | 0.7912 |
279
+ | 0.0466 | 225.0009 | 39498 | 43.6592 | 0.9109 | 0.7830 |
280
+ | 0.0449 | 226.0009 | 39672 | 41.9345 | 0.9130 | 0.7897 |
281
+ | 0.044 | 227.0009 | 39846 | 39.3840 | 0.9106 | 0.7848 |
282
+ | 0.0398 | 228.0009 | 40020 | 39.3590 | 0.9104 | 0.7848 |
283
+ | 0.04 | 229.0009 | 40194 | 41.3301 | 0.9094 | 0.7877 |
284
+ | 0.039 | 230.0009 | 40368 | 42.1386 | 0.9151 | 0.7949 |
285
+ | 0.0402 | 231.0009 | 40542 | 39.3484 | 0.9144 | 0.7934 |
286
+ | 0.0381 | 232.0009 | 40716 | 45.7729 | 0.9094 | 0.7849 |
287
+ | 0.0429 | 233.0009 | 40890 | 42.4896 | 0.9134 | 0.7913 |
288
+ | 0.0408 | 234.0008 | 41064 | 45.9568 | 0.9125 | 0.7908 |
289
+ | 0.0364 | 235.0008 | 41238 | 46.4523 | 0.9120 | 0.7936 |
290
+ | 0.0431 | 236.0008 | 41412 | 45.6006 | 0.9148 | 0.7918 |
291
+ | 0.0505 | 237.0008 | 41586 | 41.8685 | 0.9126 | 0.7924 |
292
+ | 0.0365 | 238.0008 | 41760 | 46.5790 | 0.9145 | 0.7879 |
293
+ | 0.0376 | 239.0008 | 41934 | 40.8093 | 0.9073 | 0.7847 |
294
+ | 0.0358 | 240.0008 | 42108 | 43.0555 | 0.9145 | 0.7903 |
295
+ | 0.0377 | 241.0008 | 42282 | 47.9630 | 0.9114 | 0.7910 |
296
+ | 0.0387 | 242.0008 | 42456 | 38.6723 | 0.9084 | 0.7899 |
297
+ | 0.0373 | 243.0008 | 42630 | 44.7078 | 0.9123 | 0.7931 |
298
+ | 0.038 | 244.0008 | 42804 | 44.6724 | 0.9152 | 0.7972 |
299
+ | 0.0377 | 245.0008 | 42978 | 44.1149 | 0.9121 | 0.7965 |
300
+ | 0.036 | 246.0008 | 43152 | 48.4314 | 0.9113 | 0.7924 |
301
+ | 0.0402 | 247.0007 | 43326 | 49.6798 | 0.9123 | 0.7892 |
302
+ | 0.0395 | 248.0007 | 43500 | 46.5585 | 0.9089 | 0.7901 |
303
+ | 0.0414 | 249.0007 | 43674 | 48.7959 | 0.9151 | 0.7958 |
304
+ | 0.0354 | 250.0007 | 43848 | 49.1666 | 0.9150 | 0.7965 |
305
+ | 0.0437 | 251.0007 | 44022 | 47.7063 | 0.9131 | 0.7962 |
306
+ | 0.0367 | 252.0007 | 44196 | 48.3126 | 0.9121 | 0.7978 |
307
+ | 0.0334 | 253.0007 | 44370 | 41.8019 | 0.9126 | 0.7979 |
308
+ | 0.0322 | 254.0007 | 44544 | 43.5699 | 0.9138 | 0.7943 |
309
+ | 0.0343 | 255.0007 | 44718 | 49.6887 | 0.9124 | 0.7930 |
310
+ | 0.0341 | 256.0007 | 44892 | 50.9004 | 0.9110 | 0.7957 |
311
+ | 0.0334 | 257.0007 | 45066 | 46.7507 | 0.9120 | 0.7959 |
312
+ | 0.0364 | 258.0007 | 45240 | 52.3130 | 0.9144 | 0.7963 |
313
+ | 0.0328 | 259.0007 | 45414 | 55.9051 | 0.9115 | 0.7904 |
314
+ | 0.0382 | 260.0007 | 45588 | 51.8432 | 0.9126 | 0.7947 |
315
+ | 0.0343 | 261.0006 | 45762 | 44.2021 | 0.9119 | 0.7910 |
316
+ | 0.0303 | 262.0006 | 45936 | 49.8030 | 0.9094 | 0.7970 |
317
+ | 0.0337 | 263.0006 | 46110 | 48.9785 | 0.9103 | 0.7924 |
318
+ | 0.0368 | 264.0006 | 46284 | 47.8747 | 0.9091 | 0.7921 |
319
+ | 0.0302 | 265.0006 | 46458 | 50.3159 | 0.9095 | 0.7963 |
320
+ | 0.0328 | 266.0006 | 46632 | 51.4730 | 0.9135 | 0.8017 |
321
+ | 0.0311 | 267.0006 | 46806 | 52.1970 | 0.9151 | 0.7956 |
322
+ | 0.0322 | 268.0006 | 46980 | 57.7855 | 0.9110 | 0.7932 |
323
+ | 0.0324 | 269.0006 | 47154 | 54.0878 | 0.9130 | 0.7919 |
324
+ | 0.0317 | 270.0006 | 47328 | 51.4955 | 0.9145 | 0.7951 |
325
+ | 0.0304 | 271.0006 | 47502 | 47.3301 | 0.9123 | 0.7938 |
326
+ | 0.0332 | 272.0006 | 47676 | 51.8781 | 0.9125 | 0.7938 |
327
+ | 0.03 | 273.0006 | 47850 | 63.6576 | 0.9094 | 0.7942 |
328
+ | 0.0262 | 274.0005 | 48024 | 56.7262 | 0.9136 | 0.7974 |
329
+ | 0.0329 | 275.0005 | 48198 | 47.7345 | 0.9092 | 0.7903 |
330
+ | 0.0301 | 276.0005 | 48372 | 54.1646 | 0.9111 | 0.7901 |
331
+ | 0.0334 | 277.0005 | 48546 | 47.4863 | 0.9122 | 0.7950 |
332
+ | 0.0395 | 278.0005 | 48720 | 51.2456 | 0.9123 | 0.7957 |
333
+ | 0.0296 | 279.0005 | 48894 | 51.6280 | 0.9132 | 0.7969 |
334
+ | 0.0352 | 280.0005 | 49068 | 52.0063 | 0.9139 | 0.7946 |
335
+ | 0.027 | 281.0005 | 49242 | 57.0013 | 0.9130 | 0.7893 |
336
+ | 0.0293 | 282.0005 | 49416 | 56.1311 | 0.9121 | 0.7910 |
337
+ | 0.0281 | 283.0005 | 49590 | 59.5521 | 0.9147 | 0.7985 |
338
+ | 0.0325 | 284.0005 | 49764 | 57.0298 | 0.9140 | 0.7984 |
339
+ | 0.0305 | 285.0005 | 49938 | 61.8313 | 0.9127 | 0.7974 |
340
+ | 0.0289 | 286.0005 | 50112 | 51.0739 | 0.9128 | 0.8011 |
341
+
342
+
343
+ ### Framework versions
344
+
345
+ - Transformers 4.46.0
346
+ - Pytorch 2.3.1+cu121
347
+ - Datasets 2.20.0
348
+ - Tokenizers 0.20.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e5b5ba5e1b388511c85fc7edbbab97536af19c3f51ccd93b286581922a0d2ff2
3
  size 126037432
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6002a930eb25e2041985174f5dcd1f1e507b216a870a9790cc75479e6c099e75
3
  size 126037432
runs/0-by=2006-psr=0.25/events.out.tfevents.1747134808.yara2.2696447.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cc24a6eb07c72f0643c5837df892cd278644b650f6c56e4a783a5ee72cd568c4
3
+ size 470