Ubuntu committed
Commit 0402336 · Parent(s): 39fbe95
First model version
Files changed:
- config.json +3 -0
- generation_config.json +3 -0
- pytorch_model-00001-of-00003.bin +3 -0
- pytorch_model-00002-of-00003.bin +3 -0
- pytorch_model-00003-of-00003.bin +3 -0
- pytorch_model.bin.index.json +3 -0
- special_tokens_map.json +3 -0
- tokenizer.model +3 -0
- tokenizer_config.json +3 -0
config.json
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e57a07dc885818f1057ab97c2280e3cb1f994ff6f961f421380e5f63a5513ecb
+size 507
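Each added file in this commit is stored as a Git LFS pointer: three `key value` lines giving the spec version, the object id (hash algorithm and digest), and the size in bytes. A minimal sketch of reading such a pointer is given here, using the config.json pointer as input. The function name `parse_lfs_pointer` and the returned dictionary keys are illustrative choices, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer content for config.json, copied from the diff above.
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e57a07dc885818f1057ab97c2280e3cb1f994ff6f961f421380e5f63a5513ecb
size 507
"""

pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algo"], pointer["size"])
```

The same helper should work unchanged on the other pointers in this commit, since they all follow the same three-line layout.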
generation_config.json
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e594efa3d5f29e494af0ab38ec7f341f31cdd45a3a570657f260d1f4b5c6ced2
+size 132
pytorch_model-00001-of-00003.bin
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:80b484d3fa83addadc4f256a9b1362ec94002b604b4e5b7c9cf938086cbd372e
+size 9948730542
pytorch_model-00002-of-00003.bin
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2653d899a87252fda5a2dbe66e5cf068cc30f8eeafd5d9e731a84bea59b4ed29
+size 9904167200
pytorch_model-00003-of-00003.bin
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:353e4f23234087894b4f7d1aea29cf6b2bd52e1d3aa61b9a3da9539691b3beba
+size 6178984969
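The `size` fields of the three weight shards give the total download size of the checkpoint. The short sketch below adds them up; the sizes are copied from the pointers above, and the variable names are illustrative. The total is roughly what one would expect for about 13 billion parameters stored at 2 bytes each, although the stored dtype is not stated in this commit.

```python
# Shard sizes in bytes, copied from the LFS pointers in this commit.
shard_sizes = {
    "pytorch_model-00001-of-00003.bin": 9948730542,
    "pytorch_model-00002-of-00003.bin": 9904167200,
    "pytorch_model-00003-of-00003.bin": 6178984969,
}

total_bytes = sum(shard_sizes.values())

# Report in decimal gigabytes (1 GB = 10**9 bytes).
print(f"{total_bytes} bytes = {total_bytes / 1e9:.2f} GB")
```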
pytorch_model.bin.index.json
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:92c31ad5e853e9a6660e248871f9bd0cd273518de7ec12ca893e1b1078c10898
+size 33444
special_tokens_map.json
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3833453daeb8222e7a04fc85758cfe429dab30f5fc901d18d389105c501784ad
+size 330
tokenizer.model
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ab1b681ec7fc02fed5edd3026687d7a692a918c4dd8e150ca2e3994a6229843b
+size 534194
tokenizer_config.json
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5a979515b1e98186bdceb56439d17f24bfa68baa669e50e150cc3933c07da3a2
+size 593
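Because every pointer in this commit records a SHA-256 digest and a byte size, a downloaded file can be checked against its pointer. The following is a minimal sketch of such a check, assuming the file is already on local disk. The helper name `matches_pointer` and the throwaway demo file are illustrative; the demo does not touch the real model files.

```python
import hashlib
import os
import tempfile


def matches_pointer(path, expected_sha256, expected_size, chunk_size=1 << 20):
    """Return True if the file at `path` has the given size and SHA-256 digest."""
    # Cheap check first: a size mismatch means hashing is unnecessary.
    if os.path.getsize(path) != expected_size:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so multi-gigabyte shards do not need to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_sha256


# Demo on a throwaway file (hypothetical payload, not a real model file).
payload = b"example payload, not a real model file"
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(payload)
    demo_path = tmp.name

demo_ok = matches_pointer(demo_path, hashlib.sha256(payload).hexdigest(), len(payload))
demo_bad = matches_pointer(demo_path, "0" * 64, len(payload))
os.remove(demo_path)
print(demo_ok, demo_bad)
```

For a real check, the `expected_sha256` and `expected_size` arguments would be the `oid` digest and `size` values from the corresponding pointer above.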