Weather LLM - SFT (run1 stage2, step 2500)

Supervised fine-tuning checkpoint: run1_stage2/checkpoint-2500.

The tokenizer config was adjusted for the Hub: the tokenizer class is LlamaTokenizer and model_max_length=2048.

Load

from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "Nagacharan25/weather-llama-initial"
model = AutoModelForCausalLM.from_pretrained(repo, torch_dtype="auto", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(repo)

Requires the transformers and sentencepiece packages (the Llama tokenizer is SentencePiece-based). Passing device_map="auto" additionally requires accelerate.
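Once the model and tokenizer are loaded, text generation could look roughly like the sketch below. This is a minimal example, not the author's documented usage: the helper names `load` and `generate_text`, the default `max_new_tokens=128`, and the choice of greedy decoding are all assumptions made for illustration. The card documents no prompt template, so the prompt is passed through unchanged. The prompt is truncated so that prompt plus new tokens stay within the 2048-token `model_max_length` mentioned above.

```python
def load(repo="Nagacharan25/weather-llama-initial"):
    """Load model and tokenizer as in the Load snippet (needs network access)."""
    # Imported lazily so generate_text can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model = AutoModelForCausalLM.from_pretrained(
        repo, torch_dtype="auto", device_map="auto"
    )
    tokenizer = AutoTokenizer.from_pretrained(repo)
    return model, tokenizer


def generate_text(model, tokenizer, prompt, max_new_tokens=128, max_length=2048):
    """Greedy-decode a continuation of `prompt` and return only the new text.

    Hypothetical helper: the defaults here are illustrative assumptions.
    """
    # Truncate the prompt so prompt + new tokens fit within max_length.
    inputs = tokenizer(
        prompt,
        return_tensors="pt",
        truncation=True,
        max_length=max_length - max_new_tokens,
    ).to(model.device)

    # do_sample=False selects greedy decoding (deterministic output).
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=False
    )

    # generate() returns prompt + continuation; drop the prompt tokens.
    prompt_len = inputs["input_ids"].shape[1]
    new_tokens = output_ids[0][prompt_len:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Typical use would be `model, tokenizer = load()` followed by `generate_text(model, tokenizer, "Your weather question here")`. Greedy decoding is chosen only to keep the output reproducible; sampling parameters may suit this checkpoint better.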

Safetensors weights. Model size: 1B params. Tensor type: F32.