Instructions to use flylcw/Qwen3-4B-Instruct-2507_w8a8_g128_1.2.2_rkllm with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- RKLLM
How to use flylcw/Qwen3-4B-Instruct-2507_w8a8_g128_1.2.2_rkllm with RKLLM:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
Qwen3-4B-Instruct-2507_w8a8_g128
- My quant for Rock5B (RK3588 board)
- Author: @flylcw
- 建议运行环境:RKLLM-Toolkit在1.2.x应该都可运行,NPU驱动版本需>v0.9.8
Conversion details:
- RKLLM-Toolkit version: v1.2.2
- NPU driver: v0.9.8
- Python: 3.11
- Quantization: w8a8_g128
- Output: single-file .rkllm artifact
- Modifications: quantization (w8a8_g128), export to .rkllm format for RK3588 SBCs
- Downloads last month
- 16
Model tree for flylcw/Qwen3-4B-Instruct-2507_w8a8_g128_1.2.2_rkllm
Base model
Qwen/Qwen3-4B-Instruct-2507