Percena/locomo-mc10
Preview • Updated • 970 • 7
How to use thebnbrkr/tinyllama-salience with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("TinyLlama/TinyLlama-1.1B-Chat-v1.0")
model = PeftModel.from_pretrained(base_model, "thebnbrkr/tinyllama-salience")Fine-tuned TinyLlama-1.1B-Chat for conversation memory and salience detection using the LoCoMo dataset.
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline
model = AutoModelForCausalLM.from_pretrained("thebnbrkr/tinyllama-salience")
tokenizer = AutoTokenizer.from_pretrained("thebnbrkr/tinyllama-salience")
pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)
prompt = """Context:
Alice told Bob she's moving to Seattle next month.
Question: Where is Alice moving?
Answer:"""
result = pipe(prompt, max_new_tokens=50)
print(result[0]["generated_text"])
This model is designed to answer questions based on conversational context, identifying salient information from dialogue history.
Trained on the LoCoMo (Long Context Memory) dataset, specifically the MC10 variant which tests conversation memory through question-answering.
Apache 2.0
Base model
TinyLlama/TinyLlama-1.1B-Chat-v1.0