Waifu Diffusion is a text-to-image latent diffusion model fine-tuned on high-quality anime-style artwork using Stable Diffusion as its base. Tailored for anime fans and artists, it allows users to generate detailed and stylized anime images from written prompts. The model performs especially well with common anime tropes and visual features like eye color, hairstyles, and character poses. It integrates seamlessly with the diffusers library and supports fast inference on GPU using PyTorch. Users can run it locally or via web UIs like Gradio or Google Colab for ease of use. The generated outputs are unrestricted in ownership, though usage must comply with the CreativeML OpenRAIL-M license. The project is maintained by independent contributors and builds upon work from Stability AI and NovelAI.
Features
- Fine-tuned for high-quality anime-style image generation
- Text-to-image synthesis using Stable Diffusion framework
- Compatible with Hugging Face’s diffusers pipeline
- Accepts detailed prompts with anime tags and attributes
- Supports GPU acceleration with PyTorch and autocasting
- OpenRAIL-M license allows commercial and non-commercial use
- Gradio and Colab integration for easy web-based use
- Community-supported with ongoing development on Discord