Instructions to use huggingkot/Llama-3.1-Nemotron-Nano-8B-v1-q4f16_1-MLC with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLC-LLM
How to use huggingkot/Llama-3.1-Nemotron-Nano-8B-v1-q4f16_1-MLC with MLC-LLM:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
File size: 444 Bytes
8345ef1 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 |
---
library_name: mlc-llm
tags:
- mlc-llm
- web-llm
language:
- en
base_model:
- nvidia/Llama-3.1-Nemotron-Nano-8B-v1
pipeline_tag: text-generation
---
This is a MLC converted weight from [Llama-3.1-Nemotron-Nano-8B-v1](https://huggingface.co/nvidia/Llama-3.1-Nemotron-Nano-8B-v1) model in MLC format `q4f16_1`.
The model can be used for projects [MLC-LLM](https://github.com/mlc-ai/mlc-llm) and [WebLLM](https://github.com/mlc-ai/web-llm).
|