Instructions to use Arthur-Tsai/ht-stmini-cls-v6_ftis_noPretrain-cssl-msm-pos with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Arthur-Tsai/ht-stmini-cls-v6_ftis_noPretrain-cssl-msm-pos with Transformers:
# Load model directly from transformers import HiTrans model = HiTrans.from_pretrained("Arthur-Tsai/ht-stmini-cls-v6_ftis_noPretrain-cssl-msm-pos", dtype="auto") - Notebooks
- Google Colab
- Kaggle
End of training
Browse files
README.md
CHANGED
|
@@ -2,6 +2,8 @@
|
|
| 2 |
library_name: transformers
|
| 3 |
tags:
|
| 4 |
- generated_from_trainer
|
|
|
|
|
|
|
| 5 |
model-index:
|
| 6 |
- name: ht-stmini-cls-v6_ftis_noPretrain-cssl-msm-pos
|
| 7 |
results: []
|
|
@@ -14,14 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 14 |
|
| 15 |
This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
|
| 16 |
It achieves the following results on the evaluation set:
|
| 17 |
-
-
|
| 18 |
-
-
|
| 19 |
-
-
|
| 20 |
-
- eval_runtime: 14.6137
|
| 21 |
-
- eval_samples_per_second: 45.984
|
| 22 |
-
- eval_steps_per_second: 11.496
|
| 23 |
-
- epoch: 0.0001
|
| 24 |
-
- step: 14
|
| 25 |
|
| 26 |
## Model description
|
| 27 |
|
|
@@ -49,6 +46,220 @@ The following hyperparameters were used during training:
|
|
| 49 |
- lr_scheduler_warmup_steps: 6733
|
| 50 |
- training_steps: 134674
|
| 51 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 52 |
### Framework versions
|
| 53 |
|
| 54 |
- Transformers 4.46.0
|
|
|
|
| 2 |
library_name: transformers
|
| 3 |
tags:
|
| 4 |
- generated_from_trainer
|
| 5 |
+
metrics:
|
| 6 |
+
- accuracy
|
| 7 |
model-index:
|
| 8 |
- name: ht-stmini-cls-v6_ftis_noPretrain-cssl-msm-pos
|
| 9 |
results: []
|
|
|
|
| 16 |
|
| 17 |
This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
|
| 18 |
It achieves the following results on the evaluation set:
|
| 19 |
+
- Loss: 4.6898
|
| 20 |
+
- Accuracy: 0.9046
|
| 21 |
+
- Macro F1: 0.7715
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 22 |
|
| 23 |
## Model description
|
| 24 |
|
|
|
|
| 46 |
- lr_scheduler_warmup_steps: 6733
|
| 47 |
- training_steps: 134674
|
| 48 |
|
| 49 |
+
### Training results
|
| 50 |
+
|
| 51 |
+
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Macro F1 |
|
| 52 |
+
|:-------------:|:--------:|:-----:|:---------------:|:--------:|:--------:|
|
| 53 |
+
| 126.5483 | 0.0013 | 174 | 111.1027 | 0.0648 | 0.0287 |
|
| 54 |
+
| 76.1361 | 1.0013 | 348 | 90.6096 | 0.1034 | 0.0387 |
|
| 55 |
+
| 48.057 | 2.0013 | 522 | 57.0785 | 0.2897 | 0.0665 |
|
| 56 |
+
| 32.3575 | 3.0013 | 696 | 34.9420 | 0.3859 | 0.0746 |
|
| 57 |
+
| 17.1856 | 4.0013 | 870 | 30.2187 | 0.4439 | 0.0994 |
|
| 58 |
+
| 9.1641 | 5.0013 | 1044 | 38.1436 | 0.5078 | 0.1241 |
|
| 59 |
+
| 7.3734 | 6.0012 | 1218 | 36.7879 | 0.5212 | 0.1276 |
|
| 60 |
+
| 6.5681 | 7.0012 | 1392 | 54.2617 | 0.5536 | 0.1346 |
|
| 61 |
+
| 5.7147 | 8.0012 | 1566 | 48.2767 | 0.5728 | 0.1386 |
|
| 62 |
+
| 5.2871 | 9.0012 | 1740 | 37.4026 | 0.5954 | 0.1463 |
|
| 63 |
+
| 4.9769 | 10.0012 | 1914 | 30.2233 | 0.6020 | 0.1477 |
|
| 64 |
+
| 4.6209 | 11.0012 | 2088 | 32.4927 | 0.6052 | 0.1501 |
|
| 65 |
+
| 4.4394 | 12.0012 | 2262 | 25.6421 | 0.6107 | 0.1568 |
|
| 66 |
+
| 4.3723 | 13.0012 | 2436 | 19.3662 | 0.6127 | 0.1597 |
|
| 67 |
+
| 4.0033 | 14.0012 | 2610 | 20.5622 | 0.6211 | 0.1553 |
|
| 68 |
+
| 3.9379 | 15.0012 | 2784 | 19.4049 | 0.6235 | 0.1573 |
|
| 69 |
+
| 3.6634 | 16.0012 | 2958 | 14.8564 | 0.6078 | 0.1596 |
|
| 70 |
+
| 3.5798 | 17.0012 | 3132 | 14.3299 | 0.5876 | 0.1583 |
|
| 71 |
+
| 3.5315 | 18.0012 | 3306 | 16.1368 | 0.6416 | 0.1823 |
|
| 72 |
+
| 3.39 | 19.0012 | 3480 | 14.0253 | 0.6337 | 0.1812 |
|
| 73 |
+
| 3.2767 | 20.0011 | 3654 | 15.0987 | 0.6606 | 0.2020 |
|
| 74 |
+
| 3.1694 | 21.0011 | 3828 | 12.1632 | 0.6091 | 0.2124 |
|
| 75 |
+
| 3.0645 | 22.0011 | 4002 | 12.7286 | 0.6809 | 0.2309 |
|
| 76 |
+
| 2.9893 | 23.0011 | 4176 | 11.4389 | 0.6947 | 0.2409 |
|
| 77 |
+
| 2.7377 | 24.0011 | 4350 | 9.8009 | 0.6568 | 0.2572 |
|
| 78 |
+
| 2.6703 | 25.0011 | 4524 | 13.0192 | 0.6925 | 0.2548 |
|
| 79 |
+
| 2.5916 | 26.0011 | 4698 | 8.7858 | 0.7210 | 0.3056 |
|
| 80 |
+
| 2.4572 | 27.0011 | 4872 | 9.5413 | 0.7235 | 0.2965 |
|
| 81 |
+
| 2.3576 | 28.0011 | 5046 | 10.5421 | 0.7270 | 0.3148 |
|
| 82 |
+
| 2.2748 | 29.0011 | 5220 | 9.0379 | 0.7363 | 0.3782 |
|
| 83 |
+
| 2.1568 | 30.0011 | 5394 | 8.2278 | 0.7335 | 0.3570 |
|
| 84 |
+
| 2.1567 | 31.0011 | 5568 | 8.5514 | 0.7496 | 0.3705 |
|
| 85 |
+
| 1.9851 | 32.0011 | 5742 | 7.6294 | 0.7139 | 0.3522 |
|
| 86 |
+
| 1.9502 | 33.0010 | 5916 | 8.6820 | 0.7517 | 0.4070 |
|
| 87 |
+
| 1.8211 | 34.0010 | 6090 | 8.8881 | 0.7593 | 0.4236 |
|
| 88 |
+
| 1.8441 | 35.0010 | 6264 | 8.7511 | 0.7661 | 0.4279 |
|
| 89 |
+
| 1.7461 | 36.0010 | 6438 | 9.6183 | 0.7638 | 0.4296 |
|
| 90 |
+
| 1.6554 | 37.0010 | 6612 | 9.2041 | 0.7574 | 0.4346 |
|
| 91 |
+
| 1.6534 | 38.0010 | 6786 | 7.9498 | 0.7716 | 0.4507 |
|
| 92 |
+
| 1.5795 | 39.0010 | 6960 | 10.6264 | 0.7690 | 0.4348 |
|
| 93 |
+
| 1.3498 | 40.0010 | 7134 | 8.5518 | 0.7722 | 0.4354 |
|
| 94 |
+
| 1.3898 | 41.0010 | 7308 | 8.2780 | 0.7972 | 0.4779 |
|
| 95 |
+
| 1.3115 | 42.0010 | 7482 | 8.5481 | 0.7993 | 0.4868 |
|
| 96 |
+
| 1.2597 | 43.0010 | 7656 | 9.3766 | 0.7992 | 0.4959 |
|
| 97 |
+
| 1.1297 | 44.0010 | 7830 | 10.5128 | 0.8071 | 0.5101 |
|
| 98 |
+
| 1.1462 | 45.0010 | 8004 | 11.4308 | 0.8068 | 0.5031 |
|
| 99 |
+
| 1.0983 | 46.0010 | 8178 | 13.3206 | 0.8148 | 0.5210 |
|
| 100 |
+
| 1.0061 | 47.0009 | 8352 | 11.6017 | 0.8202 | 0.5307 |
|
| 101 |
+
| 0.9957 | 48.0009 | 8526 | 13.2191 | 0.8111 | 0.5321 |
|
| 102 |
+
| 0.9112 | 49.0009 | 8700 | 11.1954 | 0.8103 | 0.5300 |
|
| 103 |
+
| 0.8629 | 50.0009 | 8874 | 9.9911 | 0.8169 | 0.5272 |
|
| 104 |
+
| 0.9148 | 51.0009 | 9048 | 11.7675 | 0.8181 | 0.5338 |
|
| 105 |
+
| 0.8082 | 52.0009 | 9222 | 13.5731 | 0.8290 | 0.5519 |
|
| 106 |
+
| 0.7414 | 53.0009 | 9396 | 12.3935 | 0.8300 | 0.5587 |
|
| 107 |
+
| 0.7528 | 54.0009 | 9570 | 13.0348 | 0.8411 | 0.5753 |
|
| 108 |
+
| 0.6769 | 55.0009 | 9744 | 14.9218 | 0.8454 | 0.5722 |
|
| 109 |
+
| 0.6312 | 56.0009 | 9918 | 14.8211 | 0.8394 | 0.5745 |
|
| 110 |
+
| 0.6002 | 57.0009 | 10092 | 14.5870 | 0.8486 | 0.5877 |
|
| 111 |
+
| 0.6106 | 58.0009 | 10266 | 13.3761 | 0.8487 | 0.5943 |
|
| 112 |
+
| 0.5756 | 59.0009 | 10440 | 16.0584 | 0.8477 | 0.5932 |
|
| 113 |
+
| 0.5628 | 60.0008 | 10614 | 16.0316 | 0.8581 | 0.6048 |
|
| 114 |
+
| 0.513 | 61.0008 | 10788 | 16.3738 | 0.8495 | 0.5992 |
|
| 115 |
+
| 0.5238 | 62.0008 | 10962 | 18.2736 | 0.8539 | 0.6103 |
|
| 116 |
+
| 0.4706 | 63.0008 | 11136 | 14.2835 | 0.8548 | 0.6128 |
|
| 117 |
+
| 0.4749 | 64.0008 | 11310 | 14.2810 | 0.8585 | 0.6174 |
|
| 118 |
+
| 0.4349 | 65.0008 | 11484 | 15.6633 | 0.8569 | 0.6175 |
|
| 119 |
+
| 0.4185 | 66.0008 | 11658 | 16.3260 | 0.8585 | 0.6174 |
|
| 120 |
+
| 0.4082 | 67.0008 | 11832 | 15.6747 | 0.8662 | 0.6314 |
|
| 121 |
+
| 0.3754 | 68.0008 | 12006 | 15.9374 | 0.8695 | 0.6359 |
|
| 122 |
+
| 0.3712 | 69.0008 | 12180 | 14.9069 | 0.8717 | 0.6462 |
|
| 123 |
+
| 0.3322 | 70.0008 | 12354 | 15.5259 | 0.8709 | 0.6474 |
|
| 124 |
+
| 0.327 | 71.0008 | 12528 | 14.1831 | 0.8697 | 0.6454 |
|
| 125 |
+
| 0.3513 | 72.0008 | 12702 | 13.8472 | 0.8745 | 0.6570 |
|
| 126 |
+
| 0.3022 | 73.0007 | 12876 | 14.9613 | 0.8707 | 0.6552 |
|
| 127 |
+
| 0.319 | 74.0007 | 13050 | 14.3645 | 0.8748 | 0.6704 |
|
| 128 |
+
| 0.2791 | 75.0007 | 13224 | 14.3885 | 0.8758 | 0.6666 |
|
| 129 |
+
| 0.2745 | 76.0007 | 13398 | 13.0991 | 0.8741 | 0.6642 |
|
| 130 |
+
| 0.2707 | 77.0007 | 13572 | 12.0244 | 0.8800 | 0.6743 |
|
| 131 |
+
| 0.2635 | 78.0007 | 13746 | 11.8089 | 0.8724 | 0.6686 |
|
| 132 |
+
| 0.24 | 79.0007 | 13920 | 11.0137 | 0.8818 | 0.6731 |
|
| 133 |
+
| 0.2585 | 80.0007 | 14094 | 13.0278 | 0.8858 | 0.6827 |
|
| 134 |
+
| 0.2356 | 81.0007 | 14268 | 12.4481 | 0.8790 | 0.6801 |
|
| 135 |
+
| 0.2362 | 82.0007 | 14442 | 11.1037 | 0.8849 | 0.6842 |
|
| 136 |
+
| 0.2207 | 83.0007 | 14616 | 11.8355 | 0.8778 | 0.6785 |
|
| 137 |
+
| 0.2198 | 84.0007 | 14790 | 11.4067 | 0.8818 | 0.6836 |
|
| 138 |
+
| 0.2178 | 85.0007 | 14964 | 11.2733 | 0.8825 | 0.6826 |
|
| 139 |
+
| 0.1961 | 86.0007 | 15138 | 9.9639 | 0.8833 | 0.6868 |
|
| 140 |
+
| 0.2047 | 87.0006 | 15312 | 10.8682 | 0.8838 | 0.6958 |
|
| 141 |
+
| 0.1884 | 88.0006 | 15486 | 10.2078 | 0.8880 | 0.6899 |
|
| 142 |
+
| 0.1772 | 89.0006 | 15660 | 10.2804 | 0.8851 | 0.6953 |
|
| 143 |
+
| 0.1878 | 90.0006 | 15834 | 8.5898 | 0.8849 | 0.7019 |
|
| 144 |
+
| 0.1865 | 91.0006 | 16008 | 9.4423 | 0.8833 | 0.6914 |
|
| 145 |
+
| 0.2171 | 92.0006 | 16182 | 8.8286 | 0.8852 | 0.6955 |
|
| 146 |
+
| 0.1646 | 93.0006 | 16356 | 9.2409 | 0.8909 | 0.7067 |
|
| 147 |
+
| 0.1672 | 94.0006 | 16530 | 9.1058 | 0.8918 | 0.7094 |
|
| 148 |
+
| 0.1563 | 95.0006 | 16704 | 9.3175 | 0.8869 | 0.7003 |
|
| 149 |
+
| 0.1559 | 96.0006 | 16878 | 9.5856 | 0.8892 | 0.7094 |
|
| 150 |
+
| 0.1603 | 97.0006 | 17052 | 8.5476 | 0.8845 | 0.6930 |
|
| 151 |
+
| 0.1388 | 98.0006 | 17226 | 8.9783 | 0.8880 | 0.7128 |
|
| 152 |
+
| 0.1381 | 99.0006 | 17400 | 8.0858 | 0.8934 | 0.7093 |
|
| 153 |
+
| 0.1421 | 100.0005 | 17574 | 8.9669 | 0.8928 | 0.7150 |
|
| 154 |
+
| 0.1637 | 101.0005 | 17748 | 8.5299 | 0.8904 | 0.7124 |
|
| 155 |
+
| 0.1261 | 102.0005 | 17922 | 7.3294 | 0.8944 | 0.7175 |
|
| 156 |
+
| 0.1371 | 103.0005 | 18096 | 8.0279 | 0.8936 | 0.7140 |
|
| 157 |
+
| 0.1239 | 104.0005 | 18270 | 7.3118 | 0.8845 | 0.7048 |
|
| 158 |
+
| 0.1366 | 105.0005 | 18444 | 6.6411 | 0.8939 | 0.7157 |
|
| 159 |
+
| 0.1128 | 106.0005 | 18618 | 6.3256 | 0.8961 | 0.7262 |
|
| 160 |
+
| 0.1265 | 107.0005 | 18792 | 7.9040 | 0.8910 | 0.7203 |
|
| 161 |
+
| 0.122 | 108.0005 | 18966 | 7.7399 | 0.8902 | 0.7192 |
|
| 162 |
+
| 0.1276 | 109.0005 | 19140 | 7.0885 | 0.8989 | 0.7314 |
|
| 163 |
+
| 0.1128 | 110.0005 | 19314 | 7.2520 | 0.8953 | 0.7161 |
|
| 164 |
+
| 0.1113 | 111.0005 | 19488 | 7.1691 | 0.8946 | 0.7204 |
|
| 165 |
+
| 0.1027 | 112.0005 | 19662 | 7.1210 | 0.8952 | 0.7238 |
|
| 166 |
+
| 0.1018 | 113.0005 | 19836 | 6.6842 | 0.8973 | 0.7254 |
|
| 167 |
+
| 0.1023 | 114.0004 | 20010 | 6.7983 | 0.8940 | 0.7231 |
|
| 168 |
+
| 0.1155 | 115.0004 | 20184 | 6.5399 | 0.8957 | 0.7273 |
|
| 169 |
+
| 0.0974 | 116.0004 | 20358 | 5.9089 | 0.9025 | 0.7388 |
|
| 170 |
+
| 0.1069 | 117.0004 | 20532 | 6.4540 | 0.8954 | 0.7322 |
|
| 171 |
+
| 0.101 | 118.0004 | 20706 | 6.8240 | 0.8992 | 0.7322 |
|
| 172 |
+
| 0.0908 | 119.0004 | 20880 | 5.6922 | 0.8966 | 0.7346 |
|
| 173 |
+
| 0.0929 | 120.0004 | 21054 | 5.8899 | 0.8997 | 0.7390 |
|
| 174 |
+
| 0.0795 | 121.0004 | 21228 | 6.7321 | 0.8980 | 0.7299 |
|
| 175 |
+
| 0.0905 | 122.0004 | 21402 | 6.0948 | 0.8939 | 0.7299 |
|
| 176 |
+
| 0.0886 | 123.0004 | 21576 | 6.3848 | 0.8991 | 0.7363 |
|
| 177 |
+
| 0.0867 | 124.0004 | 21750 | 6.9547 | 0.8959 | 0.7384 |
|
| 178 |
+
| 0.085 | 125.0004 | 21924 | 6.7207 | 0.8945 | 0.7352 |
|
| 179 |
+
| 0.0823 | 126.0004 | 22098 | 6.6024 | 0.9002 | 0.7442 |
|
| 180 |
+
| 0.0803 | 127.0003 | 22272 | 6.8851 | 0.8947 | 0.7307 |
|
| 181 |
+
| 0.0826 | 128.0003 | 22446 | 6.2815 | 0.9025 | 0.7453 |
|
| 182 |
+
| 0.0749 | 129.0003 | 22620 | 5.9657 | 0.8966 | 0.7327 |
|
| 183 |
+
| 0.0845 | 130.0003 | 22794 | 6.5562 | 0.8990 | 0.7408 |
|
| 184 |
+
| 0.0758 | 131.0003 | 22968 | 6.1722 | 0.8957 | 0.7368 |
|
| 185 |
+
| 0.0759 | 132.0003 | 23142 | 5.6417 | 0.8966 | 0.7355 |
|
| 186 |
+
| 0.0872 | 133.0003 | 23316 | 5.1942 | 0.8963 | 0.7464 |
|
| 187 |
+
| 0.0755 | 134.0003 | 23490 | 4.9223 | 0.8968 | 0.7401 |
|
| 188 |
+
| 0.0739 | 135.0003 | 23664 | 5.9289 | 0.8933 | 0.7409 |
|
| 189 |
+
| 0.072 | 136.0003 | 23838 | 5.5412 | 0.8978 | 0.7470 |
|
| 190 |
+
| 0.0719 | 137.0003 | 24012 | 5.5881 | 0.8983 | 0.7407 |
|
| 191 |
+
| 0.0774 | 138.0003 | 24186 | 6.1457 | 0.8948 | 0.7407 |
|
| 192 |
+
| 0.0667 | 139.0003 | 24360 | 6.6187 | 0.8986 | 0.7486 |
|
| 193 |
+
| 0.0671 | 140.0003 | 24534 | 5.5951 | 0.8983 | 0.7496 |
|
| 194 |
+
| 0.0668 | 141.0002 | 24708 | 6.0237 | 0.9024 | 0.7513 |
|
| 195 |
+
| 0.0578 | 142.0002 | 24882 | 5.8030 | 0.8988 | 0.7512 |
|
| 196 |
+
| 0.067 | 143.0002 | 25056 | 5.2183 | 0.9030 | 0.7521 |
|
| 197 |
+
| 0.0722 | 144.0002 | 25230 | 4.4895 | 0.9052 | 0.7565 |
|
| 198 |
+
| 0.0679 | 145.0002 | 25404 | 5.9771 | 0.8980 | 0.7496 |
|
| 199 |
+
| 0.0716 | 146.0002 | 25578 | 5.8164 | 0.8962 | 0.7492 |
|
| 200 |
+
| 0.0612 | 147.0002 | 25752 | 5.6126 | 0.8973 | 0.7529 |
|
| 201 |
+
| 0.058 | 148.0002 | 25926 | 4.9595 | 0.8992 | 0.7480 |
|
| 202 |
+
| 0.0615 | 149.0002 | 26100 | 4.9541 | 0.9057 | 0.7532 |
|
| 203 |
+
| 0.059 | 150.0002 | 26274 | 4.8776 | 0.8996 | 0.7501 |
|
| 204 |
+
| 0.0576 | 151.0002 | 26448 | 4.0766 | 0.9000 | 0.7519 |
|
| 205 |
+
| 0.0535 | 152.0002 | 26622 | 4.9512 | 0.9017 | 0.7591 |
|
| 206 |
+
| 0.0601 | 153.0002 | 26796 | 4.6140 | 0.9010 | 0.7521 |
|
| 207 |
+
| 0.0591 | 154.0001 | 26970 | 4.3280 | 0.9006 | 0.7520 |
|
| 208 |
+
| 0.0555 | 155.0001 | 27144 | 4.5710 | 0.9017 | 0.7584 |
|
| 209 |
+
| 0.0546 | 156.0001 | 27318 | 4.8572 | 0.8963 | 0.7458 |
|
| 210 |
+
| 0.0538 | 157.0001 | 27492 | 4.5045 | 0.8992 | 0.7588 |
|
| 211 |
+
| 0.0517 | 158.0001 | 27666 | 4.4098 | 0.9042 | 0.7575 |
|
| 212 |
+
| 0.0536 | 159.0001 | 27840 | 5.0623 | 0.9010 | 0.7560 |
|
| 213 |
+
| 0.0524 | 160.0001 | 28014 | 5.2407 | 0.9008 | 0.7568 |
|
| 214 |
+
| 0.0534 | 161.0001 | 28188 | 5.2307 | 0.8995 | 0.7552 |
|
| 215 |
+
| 0.0544 | 162.0001 | 28362 | 4.7759 | 0.9043 | 0.7604 |
|
| 216 |
+
| 0.0534 | 163.0001 | 28536 | 5.1011 | 0.8999 | 0.7496 |
|
| 217 |
+
| 0.0489 | 164.0001 | 28710 | 4.6319 | 0.9009 | 0.7615 |
|
| 218 |
+
| 0.0481 | 165.0001 | 28884 | 4.9649 | 0.8972 | 0.7516 |
|
| 219 |
+
| 0.045 | 166.0001 | 29058 | 4.4254 | 0.9034 | 0.7620 |
|
| 220 |
+
| 0.0468 | 167.0001 | 29232 | 4.9603 | 0.8986 | 0.7561 |
|
| 221 |
+
| 0.0437 | 168.0000 | 29406 | 5.1275 | 0.9044 | 0.7644 |
|
| 222 |
+
| 0.0502 | 169.0000 | 29580 | 4.9316 | 0.9028 | 0.7595 |
|
| 223 |
+
| 0.0465 | 170.0000 | 29754 | 4.6009 | 0.9020 | 0.7615 |
|
| 224 |
+
| 0.0494 | 171.0000 | 29928 | 5.2097 | 0.9001 | 0.7597 |
|
| 225 |
+
| 0.0439 | 172.0000 | 30102 | 4.4983 | 0.9011 | 0.7636 |
|
| 226 |
+
| 0.0444 | 173.0000 | 30276 | 4.5724 | 0.8993 | 0.7678 |
|
| 227 |
+
| 0.0445 | 173.0013 | 30450 | 4.3684 | 0.9006 | 0.7622 |
|
| 228 |
+
| 0.0484 | 174.0013 | 30624 | 4.4104 | 0.8945 | 0.7566 |
|
| 229 |
+
| 0.0451 | 175.0013 | 30798 | 4.3420 | 0.9040 | 0.7657 |
|
| 230 |
+
| 0.0455 | 176.0013 | 30972 | 4.6109 | 0.9048 | 0.7672 |
|
| 231 |
+
| 0.0466 | 177.0013 | 31146 | 4.3886 | 0.9034 | 0.7646 |
|
| 232 |
+
| 0.04 | 178.0013 | 31320 | 4.4757 | 0.8993 | 0.7606 |
|
| 233 |
+
| 0.0396 | 179.0013 | 31494 | 4.3714 | 0.9061 | 0.7709 |
|
| 234 |
+
| 0.0448 | 180.0012 | 31668 | 4.3684 | 0.8984 | 0.7627 |
|
| 235 |
+
| 0.0404 | 181.0012 | 31842 | 4.3439 | 0.9004 | 0.7605 |
|
| 236 |
+
| 0.043 | 182.0012 | 32016 | 4.8352 | 0.8983 | 0.7582 |
|
| 237 |
+
| 0.0458 | 183.0012 | 32190 | 4.1427 | 0.9028 | 0.7660 |
|
| 238 |
+
| 0.0426 | 184.0012 | 32364 | 4.3354 | 0.9037 | 0.7664 |
|
| 239 |
+
| 0.0408 | 185.0012 | 32538 | 4.2703 | 0.8948 | 0.7561 |
|
| 240 |
+
| 0.0447 | 186.0012 | 32712 | 4.7411 | 0.9046 | 0.7715 |
|
| 241 |
+
| 0.0425 | 187.0012 | 32886 | 4.8592 | 0.9060 | 0.7670 |
|
| 242 |
+
| 0.0406 | 188.0012 | 33060 | 5.1796 | 0.9039 | 0.7641 |
|
| 243 |
+
| 0.0354 | 189.0012 | 33234 | 4.9643 | 0.8978 | 0.7564 |
|
| 244 |
+
| 0.0417 | 190.0012 | 33408 | 4.7806 | 0.9006 | 0.7606 |
|
| 245 |
+
| 0.0354 | 191.0012 | 33582 | 4.4736 | 0.9034 | 0.7622 |
|
| 246 |
+
| 0.0354 | 192.0012 | 33756 | 5.2124 | 0.9004 | 0.7592 |
|
| 247 |
+
| 0.0387 | 193.0012 | 33930 | 5.0472 | 0.8992 | 0.7579 |
|
| 248 |
+
| 0.0361 | 194.0011 | 34104 | 4.6059 | 0.9028 | 0.7684 |
|
| 249 |
+
| 0.0376 | 195.0011 | 34278 | 4.3687 | 0.9050 | 0.7658 |
|
| 250 |
+
| 0.0367 | 196.0011 | 34452 | 4.5347 | 0.9025 | 0.7626 |
|
| 251 |
+
| 0.0376 | 197.0011 | 34626 | 4.1590 | 0.9023 | 0.7610 |
|
| 252 |
+
| 0.0374 | 198.0011 | 34800 | 3.8541 | 0.9071 | 0.7699 |
|
| 253 |
+
| 0.0344 | 199.0011 | 34974 | 4.2261 | 0.9058 | 0.7707 |
|
| 254 |
+
| 0.0399 | 200.0011 | 35148 | 3.7801 | 0.9053 | 0.7713 |
|
| 255 |
+
| 0.033 | 201.0011 | 35322 | 4.4107 | 0.8966 | 0.7575 |
|
| 256 |
+
| 0.0377 | 202.0011 | 35496 | 4.2622 | 0.8950 | 0.7569 |
|
| 257 |
+
| 0.0345 | 203.0011 | 35670 | 3.9799 | 0.9060 | 0.7698 |
|
| 258 |
+
| 0.0372 | 204.0011 | 35844 | 4.0399 | 0.8990 | 0.7577 |
|
| 259 |
+
| 0.036 | 205.0011 | 36018 | 4.1664 | 0.8987 | 0.7557 |
|
| 260 |
+
| 0.0341 | 206.0011 | 36192 | 4.2112 | 0.9010 | 0.7639 |
|
| 261 |
+
|
| 262 |
+
|
| 263 |
### Framework versions
|
| 264 |
|
| 265 |
- Transformers 4.46.0
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 126037432
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9b1f9fc7a990d704be70fbf4118962f69f3cff45aebaea49342f5007480230cf
|
| 3 |
size 126037432
|
runs/2-by=2006-psr=0.25/events.out.tfevents.1747294740.dipa.185435.1
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:325058284be51e31240f722543364a38738052349f7bd22f97a700dade4daa47
|
| 3 |
+
size 470
|