How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="TeeZee/Fimbulvetr-10.7B-v1-bpw8.0-h8-exl2")
# Load model directly
from transformers import AutoTokenizer, AutoModelForMultimodalLM

tokenizer = AutoTokenizer.from_pretrained("TeeZee/Fimbulvetr-10.7B-v1-bpw8.0-h8-exl2")
model = AutoModelForMultimodalLM.from_pretrained("TeeZee/Fimbulvetr-10.7B-v1-bpw8.0-h8-exl2")
Quick Links

Fimbulvetr-10.7B-v1

exllamav2 quant for Sao10K/Fimbulvetr-10.7B-v1Sao10K/Fimbulvetr-10.7B-v1

Runs smoothly on single 3090 in webui with context length set to 4096, ExLlamav2_HF loader and cache_8bit=True

All comments are greatly appreciated, download, test and if you appreciate my work, consider buying me my fuel: Buy Me A Coffee

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including TeeZee/Fimbulvetr-10.7B-v1-bpw8.0-h8-exl2