--- license: llama3.3 base_model_relation: quantized base_model: meta-llama/Llama-3.3-70B-Instruct tags: - llama-3.3 - finetune - roleplay - chat - wings-of-fire - nsfw - not-for-all-audiences ---
Send me your support to help me feed the data beast! also taking comissions for universe specific models
Support on Ko-fiThe quantized model files are available for download. Click the buttons below to view the files.
Download GGUF Files → Download EXL2 Files →For the best roleplaying experience, it is highly recommended to use the provided character card and lore book. These files help guide the model's persona and provide rich, in-universe context.
Download Files →For a seamless setup in SillyTavern, you can download pre-configured sampler presets. These are tuned to provide an optimal balance between creativity and narrative coherence for this model.
Simply download the .json file below and import it into SillyTavern's sampler presets menu.
Temp: 0.8-1.2
Min P: 0.02
Dry: 0.8 , 1.75, 4
Temp: 1
Min P: 0.03
Nsigma: 2
Dry: 0.8 , 1.75, 4
For the best results, use this structured format. This helps the AI clearly distinguish between actions, inner thoughts, and dialogue.
*He walked across the room and stared out the window.**-I wonder what she's thinking.-*Alex (Curious): "What do you see out there?"*-I wonder what she's thinking.-* Standard novel-style formatting is also understood, but this structured format is preferred for clarity.
This is Version 12.1, a significant advancement in the Animus series built on a completely new training philosophy. Instead of merging previous models, V12.1 is a direct fine-tune of Llama 3.3 70B Instruct. This focused approach has resulted in what is being called the most coherent and lore-adherent version to date.
V12.1 is an earlier checkpoint of V12.0, that one was slightly overcooked, while for WOF it was very good, it was much more difficult to steer, it was too condfident on its choices of tokens.
V12.1's strength comes from a novel dataset designed to teach the model the why behind the lore, not just the what. The training data is a mix of:
The result is a model with exceptionally strong prose and a deep grasp of in-universe lore, making for a highly immersive and accurate roleplaying experience.
Note for roleplay, it follows system prompt and first message, meaning if the first assistant message is short, the following messages will be short.
V12.1 marks a shift from model merging to a focused, direct fine-tuning approach. This allows for greater control over the final model's characteristics.
A key feature in previous test versions—the presentation of multiple-choice actions (e.g., A, B, C) to guide the user—has been removed.
While a promising concept, this feature needs further refinement to ensure it enhances, rather than restricts, the roleplaying experience. It may be reintroduced in a more polished form in a future release. For now, the model returns to a more traditional, open-ended prose format.
The V12.1 dataset consists of 6,000 high-quality examples, a combination of two distinct types:
Both datasets underwent a rigorous cleaning process to remove formatting artifacts, such as **scene transitions**, resulting in a cleaner and more natural narrative style.
**scene transitions**. The model should now produce cleaner prose.