HY-MT1.5-1.8B_GPTQ_INT4-AX620E / bin /axllm.version.json
yongqiang
Update axllm binary and token config
3f3c4b6
Raw
History Blame Contribute Delete
1.21 kB
{
"binary": "bin/axllm",
"target": "aarch64 binary built from ax-hymt1_5",
"notes": "This is the same packaged axllm binary as the AX650 repository. The binary has been verified on AX650 with HY-MT OpenAI serving. AX620E board validation for axllm serve is still pending.",
"ax_llm_branch": "ax-hymt1_5",
"ax_llm_commit": "760c3a9f3586d233d27811b08f3863dbb7ad4c0a",
"openai_api_cpp_commit": "f56cf8c296d1002f6602226db392325ba42f6775",
"build_command": "cmake --build build --target install -j$(nproc)",
"sha256": "eeb39d339e8044f9036dd773e8e9704b4131c601eb5123ea08cfe71d01617196",
"verified": {
"date": "2026-05-26",
"board": "AX650",
"command": "./bin/axllm serve . --port 18120",
"api_url": "http://10.168.232.217:18120/v1/chat/completions",
"model": "AXERA-TECH/HY-MT1.5-1.8B_GPTQ_INT4",
"smoke_tests": [
"GET /v1/models returned AXERA-TECH/HY-MT1.5-1.8B_GPTQ_INT4 only",
"English to Chinese request returned 这是免费的。",
"Natural-language request 请将下面的文字翻译成日文 returned Japanese text without target_language",
"Request with max_tokens=4096 was clamped to 1024 and returned Japanese translation"
]
}
}