Instructions to use 4ntoine/Qwen2.5-Coder-3B-Instruct-LiteRTLM with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- LiteRT-LM
How to use 4ntoine/Qwen2.5-Coder-3B-Instruct-LiteRTLM with LiteRT-LM:
# LiteRT-LM runs on various platforms (Android, iOS, Windows, Linux, macOS, IoT, Web/WASM) # and supports many APIs (C++, Python, Kotlin, Swift, JavaScript, Flutter). # For platform-specific integration guides, please refer to the official developer website: # https://ai.google.dev/edge/litert-lm # To try LiteRT-LM, the easiest way is to use our CLI tool. # 1. Install the LiteRT-LM CLI tool: pip install litert-lm # 2. Download and run this model locally: # See: https://ai.google.dev/edge/litert-lm/cli litert-lm run \ --from-huggingface-repo=4ntoine/Qwen2.5-Coder-3B-Instruct-LiteRTLM \ model.litertlm \ --prompt="Write me a poem"
- Notebooks
- Google Colab
- Kaggle
The model is converted from the original Qwen/Qwen2.5-Coder-3B-Instruct using:
litert-torch export_hf \
--model=Qwen/Qwen2.5-Coder-3B-Instruct \
--output_dir="./dynamic_wi8_afp32" \
--quantization_recipe="dynamic_wi8_afp32" \
--bundle_litert_lm=true
- Downloads last month
- 653
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support