LLaMAX commited on
Commit
1537317
·
verified ·
1 Parent(s): cd77648

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +158 -0
README.md ADDED
@@ -0,0 +1,158 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - Multilingual
4
+ license: mit
5
+ language:
6
+ - af
7
+ - am
8
+ - ar
9
+ - hy
10
+ - as
11
+ - ast
12
+ - az
13
+ - be
14
+ - bn
15
+ - bs
16
+ - bg
17
+ - my
18
+ - ca
19
+ - ceb
20
+ - zho
21
+ - hr
22
+ - cs
23
+ - da
24
+ - nl
25
+ - en
26
+ - et
27
+ - tl
28
+ - fi
29
+ - fr
30
+ - ff
31
+ - gl
32
+ - lg
33
+ - ka
34
+ - de
35
+ - el
36
+ - gu
37
+ - ha
38
+ - he
39
+ - hi
40
+ - hu
41
+ - is
42
+ - ig
43
+ - id
44
+ - ga
45
+ - it
46
+ - ja
47
+ - jv
48
+ - kea
49
+ - kam
50
+ - kn
51
+ - kk
52
+ - km
53
+ - ko
54
+ - ky
55
+ - lo
56
+ - lv
57
+ - ln
58
+ - lt
59
+ - luo
60
+ - lb
61
+ - mk
62
+ - ms
63
+ - ml
64
+ - mt
65
+ - mi
66
+ - mr
67
+ - mn
68
+ - ne
69
+ - ns
70
+ - no
71
+ - ny
72
+ - oc
73
+ - or
74
+ - om
75
+ - ps
76
+ - fa
77
+ - pl
78
+ - pt
79
+ - pa
80
+ - ro
81
+ - ru
82
+ - sr
83
+ - sn
84
+ - sd
85
+ - sk
86
+ - sl
87
+ - so
88
+ - ku
89
+ - es
90
+ - sw
91
+ - sv
92
+ - tg
93
+ - ta
94
+ - te
95
+ - th
96
+ - tr
97
+ - uk
98
+ - umb
99
+ - ur
100
+ - uz
101
+ - vi
102
+ - cy
103
+ - wo
104
+ - xh
105
+ - yo
106
+ - zu
107
+ ---
108
+ ### Model Sources
109
+
110
+ - **Paper**: LLaMAX2: Your Translation-Enhanced Model Also Performs Well in Reasoning
111
+ - **Link**: https://arxiv.org/pdf/2510.09189
112
+ - **Repository**: https://github.com/CONE-MT/LLaMAX2.0
113
+
114
+ ### Model Description
115
+
116
+ GlotMAX is a language model with powerful multilingual capabilities and strong reasoning.
117
+
118
+ Qwen3-GlotMAX series models start from Qwen3-8B instruct models with layer-slective tuning using small amount of parallel data alone.
119
+
120
+ Meanwhile, comprehensive testing on 16 reasoning tasks, such as bbeh, Livecodebench, Olymmath and so on, shows that it surpasses existing translation-enhanced models and performs on par with Qwen3 instruct models.
121
+
122
+
123
+
124
+ ### 🔥 Excellent Translation Performance
125
+
126
+ LLaMAX3-8B-Alpaca achieves an average spBLEU score improvement of over **5 points** compared to the LLaMA3-8B-Alpaca model on the Flores-101 dataset.
127
+
128
+
129
+ ### Supported Languages
130
+ Akrikaans (af), Amharic (am), Arabic (ar), Armenian (hy), Assamese (as), Asturian (ast), Azerbaijani (az), Belarusian (be), Bengali (bn), Bosnian (bs), Bulgarian (bg), Burmese (my), Catalan (ca), Cebuano (ceb), Chinese Simpl (zho), Chinese Trad (zho), Croatian (hr), Czech (cs), Danish (da), Dutch (nl), English (en), Estonian (et), Filipino (tl), Finnish (fi), French (fr), Fulah (ff), Galician (gl), Ganda (lg), Georgian (ka), German (de), Greek (el), Gujarati (gu), Hausa (ha), Hebrew (he), Hindi (hi), Hungarian (hu), Icelandic (is), Igbo (ig), Indonesian (id), Irish (ga), Italian (it), Japanese (ja), Javanese (jv), Kabuverdianu (kea), Kamba (kam), Kannada (kn), Kazakh (kk), Khmer (km), Korean (ko), Kyrgyz (ky), Lao (lo), Latvian (lv), Lingala (ln), Lithuanian (lt), Luo (luo), Luxembourgish (lb), Macedonian (mk), Malay (ms), Malayalam (ml), Maltese (mt), Maori (mi), Marathi (mr), Mongolian (mn), Nepali (ne), Northern Sotho (ns), Norwegian (no), Nyanja (ny), Occitan (oc), Oriya (or), Oromo (om), Pashto (ps), Persian (fa), Polish (pl), Portuguese (pt), Punjabi (pa), Romanian (ro), Russian (ru), Serbian (sr), Shona (sn), Sindhi (sd), Slovak (sk), Slovenian (sl), Somali (so), Sorani Kurdish (ku), Spanish (es), Swahili (sw), Swedish (sv), Tajik (tg), Tamil (ta), Telugu (te), Thai (th), Turkish (tr), Ukrainian (uk), Umbundu (umb), Urdu (ur), Uzbek (uz), Vietnamese (vi), Welsh (cy), Wolof (wo), Xhosa (xh), Yoruba (yo), Zulu (zu)
131
+
132
+ ### Model Index
133
+ We implement multiple versions of the LLaMAX model, the model links are as follows:
134
+ | Model | GlotMAX |
135
+ |-------------------------------|----------------------------------------------------------|
136
+ | GlotMAX-17-8B | [Link](https://huggingface.co/LLaMAX/GlotMAX-17-8B) |
137
+ | GlotMAX-17-14B | [Link](https://huggingface.co/LLaMAX/GlotMAX-17-14B) |
138
+ | 👉 **GlotMAX-101-8B** | [Link](https://huggingface.co/LLaMAX/GlotMAX-101-8B) |
139
+ | GlotMAX-101-14B | [Link](https://huggingface.co/LLaMAX/GlotMAX-101-14B) |
140
+
141
+ ### Citation
142
+ If our model helps your work, please cite this paper:
143
+
144
+ ```
145
+ @misc{gaoLLaMAX2YourTranslationEnhanced2025,
146
+ title = {{{LLaMAX2}}: {{Your Translation-Enhanced Model}} Also {{Performs Well}} in {{Reasoning}}},
147
+ shorttitle = {{{LLaMAX2}}},
148
+ author = {Gao, Changjiang and Huang, Zixian and Gong, Jingyang and Huang, Shujian and Li, Lei and Yuan, Fei},
149
+ year = {2025},
150
+ month = oct,
151
+ number = {arXiv:2510.09189},
152
+ eprint = {2510.09189},
153
+ primaryclass = {cs},
154
+ publisher = {arXiv},
155
+ doi = {10.48550/arXiv.2510.09189},
156
+ archiveprefix = {arXiv}
157
+ }
158
+ ```