This is the official release of the model for paper **[SYNTHIA: Novel Concept Design with Affordance Composition](https://arxiv.org/abs/2502.17793)**. . Hugging Face Paper: [https://huggingface.co/papers/2502.17793](https://huggingface.co/papers/2502.17793) Project page: [https://synthia-uiuc.github.io/synthia.github.io/](https://synthia-uiuc.github.io/synthia.github.io/) # 🪄 Affordance-based Novel Concept Generator (Kandinsky-3 Fine-Tuned) This is a fine-tuned version of the **Kandinsky-3** text-to-image pipeline, designed to generate **novel object and furniture concepts** by combining affordance-driven functionalities (e.g., "sofa + bed + cargo + bicycle"). --- ## 🚀 How to Use ```python import os import sys import torch from kandinsky3 import get_T2I_pipeline, get_T2I_Flash_pipeline # Add kandinsky3 to Python path sys.path.append('..') # Set device and dtype maps device_map = torch.device('cuda:0') dtype_map = { 'unet': torch.float32, 'text_encoder': torch.float32, 'movq': torch.float32, } # Load the Flash text-to-image pipeline t2i_pipe = get_T2I_Flash_pipeline( device_map=device_map, dtype_map=dtype_map, cache_dir="./cache/" ) # Load fine-tuned UNet weights t2i_pipe.unet.load_state_dict(torch.load( "unet_model_checkpoint.pt", map_location=device_map )) # Generate image from prompt res = t2i_pipe( text="a new furniture design that has functions from sofa, bed, cargo, bicycle", steps=50 )[0] # Save the result res.save("generated_image.jpg") ```