--- license: other license_name: circlestone-labs-non-commercial-license license_link: https://huggingface.co/circlestone-labs/Anima/blob/main/LICENSE.md base_model: - circlestone-labs/Anima base_model_relation: quantized --- # GGUF models of ANIMA ## Someone already converted v1.0 -> [Abiray/Anima-base-v1.0-GGUF](https://huggingface.co/Abiray/Anima-base-v1.0-GGUF) # How to use - **Update ComfyUI to v0.14.1 or above.** *(You don't need custom script.)* - Download GGUF you want. (Recommend at least Q5_K_M) - Use [ComfyUI-GGUF](https://github.com/city96/ComfyUI-GGUF) custom node to load the model. ## Generation speed Tested on - RTX5090(400W), ComfyUI with `--fast` option and `Patch Sage Attention KJ` node(AUTO). - 832x1216, cfg 5.0, 50steps | Quant | it/s | Time (s) | Speed vs BF16 (%) | | ------ | ---- | --------- | ---------------- | | BF16 | **4.65** | **11.70** | 0.00% | | Q8_0 | *4.46* | *12.07* | -4.09% | | Q6_K | 3.60 | 14.91 | -22.58% | | Q5_K_S | 3.35 | 15.94 | -28.03% | | Q5_K_M | 3.41 | 15.67 | -26.67% | | Q5_1 | 3.42 | 15.24 | -26.45% | | Q5_0 | 3.40 | 15.73 | -26.88% | | Q4_K_S | 3.55 | 15.12 | -23.66% | | Q4_K_M | 3.59 | 14.98 | -22.80% | | Q4_1 | 4.01 | 13.46 | -13.76% | | Q4_0 | 3.97 | 13.50 | -14.62% | ## Sample ### anima-preview3-base ![26-04-09-Anima_00007_](https://cdn-uploads.huggingface.co/production/uploads/63fbf6951b4b1bd4e706fed1/BSEQSYZniROqpfqiivPmB.webp) ### anima-preview2 ![26-03-12-Anima_00031_](https://cdn-uploads.huggingface.co/production/uploads/63fbf6951b4b1bd4e706fed1/XXtcZh7Au0RO8JC1A0R6A.webp) ### anima-preview ![Anima_GGUF_Comparison](https://cdn-uploads.huggingface.co/production/uploads/63fbf6951b4b1bd4e706fed1/mTWE0WdRsuvoJEwRMeEWr.webp) ## How to reproduce 1. Convert BF16 model to FP32 ```python import torch import safetensors.torch import os import sys def convert_to_fp32(input_path, output_path): state_dict = safetensors.torch.load_file(input_path) new_state_dict = {} for key, tensor in state_dict.items(): print(f"{key} ({tensor.dtype}) -> torch.float32") new_tensor = tensor.to(torch.float32) new_state_dict[key] = new_tensor safetensors.torch.save_file(new_state_dict, output_path) print(f"output_path: {output_path}") if __name__ == "__main__": assert len(sys.argv) == 3, f"usage: {sys.argv[0]} SOURCE TARGET" input_path, output_path = sys.argv[1:3] convert_to_fp32(input_path, output_path) ``` 2. Read [this manual](https://github.com/city96/ComfyUI-GGUF/blob/main/tools/README.md). 3. make F32 GGUF using https://github.com/city96/ComfyUI-GGUF/blob/main/tools/convert.py#L258 4. Run `llama-quantize`.