Search Results for "stable-diffusion-webui" - Page 4

Showing 49677 open source projects for "stable-diffusion-webui"

1

Roadmap To Learn Generative AI In 2025

Basic Machine Learning Natural Language Processing Roadmap

Roadmap To Learn Generative AI In 2025 is a curated learning path focused on contemporary generative AI — covering large language models (LLMs), diffusion-based image generation, prompt engineering, multi-modal AI, fine-tuning techniques, and the practical considerations for deploying generative models. It’s aimed at learners and developers who already have some programming or ML basics and wish to specialize in generative AI, offering a modern, structured plan that reflects the state of the art as of 2025. ...

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
2

DreamO

A Unified Framework for Image Customization

DreamO is a unified, open-source framework from ByteDance for advanced image customization and generation that consolidates multiple “image manipulation” tasks into a single system, rather than requiring separate specialized models. Built on a diffusion-transformer (DiT) backbone, it supports a diverse set of tasks — including identity preservation, virtual “try-on” (e.g. clothing, accessories), style transfer, IP adaptation (objects/characters), and layout/condition-aware customizations — all handled within the same unified architecture. DreamO’s design introduces a feature routing constraint that helps disentangle different control conditions (like identity, style, clothing) when more than one is specified, which significantly reduces conflicts and artifacts when combining controls. ...

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
3

Step1X-Edit

A SOTA open-source image editing model

Step1X-Edit is a state-of-the-art open-source image editing model/framework that uses a multimodal large language model (LLM) together with a diffusion-based image decoder to let users edit images simply via natural-language instructions plus a reference image. You supply an existing image and a textual command — e.g. “add a ruby pendant on the girl’s neck” or “make the background a sunset over mountains” — and the model interprets the instruction, computes a latent embedding combining the image content and user intent, then decodes a new image implementing the edit. ...

Downloads: 0 This Week

Last Update: 2025-12-29
See Project
4

ESP3D

Firmware for ESP8266/ESP8285/ESP32 used with 3D printers

...I currently use it with my personal flavor of Repetier for Due-based boards. The web interface files are present in the data directory, but the UI has its own repository, ESP3D-WEBUI. To hard-reset all settings, connect GPIO2 to ground 2-6 seconds after boot, not before: grounding GPIO2 before boot changes the boot mode and enters a special boot that never reaches the firmware. Boot currently takes 10 seconds, which leaves 8 seconds to connect GPIO2 to GND and perform a hard recovery of the settings. Configuration can be completed from a web browser (Station or Access Point mode) or with Serial/telnet commands.

Downloads: 14 This Week

Last Update: 2025-12-16
See Project
5

Step1X-3D

High-Fidelity and Controllable Generation of Textured 3D Assets

...It uses a hybrid architecture: a geometry generation stage using a VAE-DiT model to output a watertight 3D representation (e.g. TSDF surface), and a texture synthesis stage that conditions on geometry and optionally reference input (or prompts) to produce view-consistent textures using a diffusion-based texture module. The result is a complete 3D asset (mesh plus textures) that can be rendered from any viewpoint with consistent texturing and used in 3D applications. To achieve this, the project includes a large curated dataset: from more than 5 million candidate 3D assets, it filters and standardizes a high-quality subset of 2 million assets suitable for training.

Downloads: 0 This Week

Last Update: 2025-12-01
See Project
6

Flow Matching

A PyTorch library for implementing flow matching algorithms

flow_matching is a PyTorch library implementing flow matching algorithms in both continuous and discrete settings, enabling generative modeling via matching vector fields rather than diffusion. The underlying idea is to parameterize a flow (a time-dependent vector field) that transports samples from a simple base distribution to a target distribution, and train via matching of flows without requiring score estimation or noisy corruption—this can lead to more efficient or stable generative training. The library supports both continuous-time flows (via differential equations) and discrete-time analogues, giving flexibility in design and tradeoffs. ...

Downloads: 1 This Week

Last Update: 2026-01-05
See Project
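To make the flow matching idea in the entry above concrete, here is a minimal sketch in plain Python. It does not use the flow_matching library's API: the function names (interpolate, target_velocity, euler_sample) are hypothetical, and the straight-line path between a base sample and a target sample is one common choice of path, assumed here purely for illustration.

```python
def interpolate(x0, x1, t):
    """Point on the assumed straight-line path from x0 (t=0) to x1 (t=1)."""
    return (1.0 - t) * x0 + t * x1


def target_velocity(x0, x1):
    """Velocity of the straight-line path; a model would be regressed onto this."""
    return x1 - x0


def euler_sample(velocity, x0, steps=10):
    """Transport x0 along a velocity field v(x, t) with fixed-step Euler integration."""
    x = x0
    dt = 1.0 / steps
    for i in range(steps):
        t = i * dt
        x = x + dt * velocity(x, t)
    return x


# Usage: with the exact conditional velocity, integrating from the base
# sample arrives at the target sample.
base, target = -1.0, 3.0
field = lambda x, t: target_velocity(base, target)
sample = euler_sample(field, base, steps=10)
```

In training, a neural network would stand in for the velocity field and be fitted to target_velocity at points produced by interpolate; sampling then only requires integrating the learned field, as euler_sample does here.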
7

llama2-webui

Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere

Running Llama 2 with gradio web UI on GPU or CPU from anywhere (Linux/Windows/Mac).

Downloads: 0 This Week

Last Update: 2023-10-04
See Project
8

BepInEx

Unity / XNA game patcher and plugin framework

BepInEx is a plugin / modding framework for Unity Mono, IL2CPP, and .NET framework games (XNA, FNA, MonoGame, etc.). Stable builds are released once a new iteration of BepInEx is considered feature-complete. They have the fewest bugs, but some of the newest features might not be available. Bleeding-edge builds, available on BepisBuilds, are always the latest builds of the source code. They are therefore the opposite of stable builds: they have the newest features and bug fixes, but tend to be the buggiest. ...

Downloads: 508 This Week

Last Update: 2025-09-25
See Project
9

HunyuanImage-3.0

A Powerful Native Multimodal Model for Image Generation

...It unifies multimodal understanding and generation in a single autoregressive framework, combining text and image modalities seamlessly rather than relying on separate image-only diffusion components. It uses a Mixture-of-Experts (MoE) architecture with many expert subnetworks to scale efficiently, deploying only a subset of experts per token, which allows large parameter counts without linear inference cost explosion. The model is intended to be competitive with closed-source image generation systems, aiming for high fidelity, prompt adherence, fine detail, and even “world knowledge” reasoning (i.e. leveraging context, semantics, or common sense in generation). ...

1 Review

Downloads: 19 This Week

Last Update: 2025-10-31
See Project
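The sparse expert activation described in the entry above can be illustrated with a small top-k routing sketch in plain Python. This is not HunyuanImage-3.0's code: the gate, the value of k, and the function names (softmax, top_k_route, moe_forward) are assumptions chosen to show the general Mixture-of-Experts pattern, in which only the selected experts are evaluated for a given token.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of gate logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(gate_logits, k):
    """Pick the k highest-scoring experts and renormalize their weights."""
    probs = softmax(gate_logits)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in chosen)
    return {i: probs[i] / norm for i in chosen}


def moe_forward(x, experts, gate_logits, k=2):
    """Weighted sum over the selected experts; the others are never called."""
    weights = top_k_route(gate_logits, k)
    return sum(w * experts[i](x) for i, w in weights.items())


# Usage: four toy experts, of which only two run for this token.
experts = [lambda x: x, lambda x: 2 * x, lambda x: 3 * x, lambda x: 4 * x]
output = moe_forward(1.0, experts, gate_logits=[0.0, 5.0, 5.0, -1.0], k=2)
```

Because only k experts run per token, compute per token grows with k rather than with the total number of experts, which is the property the description refers to.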
10

CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo

CogVideo is an open source text-/image-/video-to-video generation project that hosts the CogVideoX family of diffusion-transformer models and end-to-end tooling. The repo includes SAT and Diffusers implementations, turnkey demos, and fine-tuning pipelines (including LoRA) designed to run across a wide range of NVIDIA GPUs, from desktop cards (e.g., RTX 3060) to data-center hardware (A100/H100). Current releases cover CogVideoX-2B, CogVideoX-5B, and the upgraded CogVideoX1.5-5B variants, plus image-to-video (I2V) models, with options for BF16/FP16/FP32—and INT8 quantized inference via TorchAO for memory-constrained setups. ...

Downloads: 25 This Week

Last Update: 2025-10-04
See Project
11

Linfa

A Rust machine learning framework

linfa aims to provide a comprehensive toolkit to build Machine Learning applications with Rust. Kin in spirit to Python's scikit-learn, it focuses on common preprocessing tasks and classical ML algorithms for your everyday ML tasks.

Downloads: 0 This Week

Last Update: 2025-12-23
See Project
12

101-0250-00

ETH course - Solving PDEs in parallel on GPUs

This course aims to cover state-of-the-art methods in modern parallel Graphics Processing Unit (GPU) computing, supercomputing and code development with applications to natural sciences and engineering.

Downloads: 0 This Week

Last Update: 2025-04-15
See Project
13

FLUX.1

Official inference repo for FLUX.1 models

FLUX.1 repository contains inference code and tooling for the FLUX.1 text-to-image diffusion models, enabling developers and researchers to generate and edit images from natural-language prompts using open-weight versions of the model on their own hardware or within custom applications. The project is part of a larger family of FLUX models developed by Black Forest Labs, designed to produce high-quality, detailed visuals from text descriptions with competitive prompt adherence and artistic fidelity. ...

Downloads: 10 This Week

Last Update: 2026-01-19
See Project
14

MethodOfLines.jl

Automatic Finite Difference PDE solving with Julia SciML

MethodOfLines.jl is a Julia package for automated finite difference discretization of symbolically defined PDEs in N dimensions. It uses symbolic expressions for systems of partial differential equations as defined with ModelingToolkit.jl, and Interval from DomainSets.jl to define the space(time) over which the simulation runs. This project is under active development, therefore the interface is subject to change. The docs will be updated to reflect any changes, please check back for current...

Downloads: 0 This Week

Last Update: 2024-10-12
See Project
15

Luna AI

Virtual AI anchor that combines state-of-the-art technology

Luna AI is a virtual AI streamer framework designed to power an interactive VTuber that can go live on major platforms and chat with viewers in real time. It is built around a core assistant persona called “Luna AI,” which can be driven by a wide range of large language models and platforms, including GPT-style APIs, Claude, LangChain-based backends, ChatGLM, Kimi, Ollama, and many others. The project supports multiple rendering backends for the avatar, such as Live2D, Unreal Engine (UE),...

Downloads: 3 This Week

Last Update: 2025-11-28
See Project
16

Matcha-TTS

A fast TTS architecture with conditional flow matching

...It models speech as an ODE-based generative process, and conditional flow matching lets it reach high-quality audio in only a few synthesis steps, which greatly reduces latency compared to score-matching diffusion approaches. The model is fully probabilistic, so it can generate diverse realizations of the same text while still sounding stable and intelligible. The repository provides an end-to-end TTS pipeline: a PyTorch/Lightning training stack, configuration files, pre-trained checkpoints, a command-line interface, and a Gradio app for interactive testing. ...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
17

trellis-stable-projectorz

A one-click installer for Windows (Python 3.11, CUDA 11.8, Torch 2.1.2). Repository for integration with StableProjectorz, a free AI-texturing tool (https://stableprojectorz.com). Our Discord server: https://discord.gg/aWbnX2qan2 Supports float16 and int32 optimizations.

1 Review

Downloads: 1 This Week

Last Update: 2025-08-08
See Project
18

PEFT

State-of-the-art Parameter-Efficient Fine-Tuning

Parameter-Efficient Fine-Tuning (PEFT) methods enable efficient adaptation of pre-trained language models (PLMs) to various downstream applications without fine-tuning all the model's parameters. Fine-tuning large-scale PLMs is often prohibitively costly. In this regard, PEFT methods only fine-tune a small number of (extra) model parameters, thereby greatly decreasing the computational and storage costs. Recent State-of-the-Art PEFT techniques achieve performance comparable to that of full...

Downloads: 1 This Week

Last Update: 2026-01-09
See Project
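The cost saving described in the entry above can be checked with simple arithmetic. The sketch below assumes LoRA, one of the methods the PEFT library supports, in which a frozen weight matrix of shape d_out x d_in is adapted by two trainable low-rank factors of shapes d_out x rank and rank x d_in. The function names are hypothetical and are not part of the PEFT API.

```python
def full_param_count(d_out, d_in):
    """Parameters updated when the whole weight matrix is fine-tuned."""
    return d_out * d_in


def lora_param_count(d_out, d_in, rank):
    """Parameters in the two low-rank factors (d_out x rank and rank x d_in)."""
    return rank * (d_out + d_in)


def trainable_fraction(d_out, d_in, rank):
    """Share of the full matrix's parameter count that the adapter trains."""
    return lora_param_count(d_out, d_in, rank) / full_param_count(d_out, d_in)


# Usage: a 4096 x 4096 projection adapted at rank 8.
adapter = lora_param_count(4096, 4096, 8)
fraction = trainable_fraction(4096, 4096, 8)
```

For this layer the adapter trains 65,536 parameters against 16,777,216 in the full matrix, about 0.4% of the total.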
19

Fish Speech

SOTA Open Source TTS

Fish Speech is a state-of-the-art open-source text-to-speech project that has evolved into the OpenAudio series of advanced TTS models. The repository hosts the code and tooling for training, fine-tuning, and serving high-quality TTS, while the current flagship models (OpenAudio-S1 and S1-mini) are distributed via Fish Audio’s playground and Hugging Face. The models are evaluated with Seed TTS metrics and achieve exceptionally low word and character error rates, indicating strong...

Downloads: 3 This Week

Last Update: 2025-11-28
See Project
20

Speech-AI-Forge

Speech-AI-Forge is a project developed around TTS generation models

Speech-AI-Forge is a full-stack project built around modern text-to-speech generation models, providing both an API server and a Gradio-based web UI for interactive use. At its core, it acts as a hub that wires together multiple speech-related capabilities, including TTS, speech-to-text and LLM-based control flows, behind a consistent interface. The system is designed to be deployed in several ways: you can try it online via hosted demos, spin it up in a one-click Colab environment, run it...

Downloads: 3 This Week

Last Update: 2025-11-28
See Project
21

gotify/server

A simple server for sending and receiving messages in real-time

...Docker images are automatically built on every release. Several static code analyzers and many unit and end-to-end tests run on every travis-ci build. gotify/server is the heart of this project: it features a WebUI and functionality for sending messages via a REST API, subscribing to and receiving messages via a WebSocket connection, and managing users, clients, and applications.

Downloads: 0 This Week

Last Update: 2026-01-02
See Project
22

WeUI

Tailor-made UI framework for WeChat web development

WeUI is a UI framework that was officially designed by the WeChat design team and tailor-made for WeChat web development. It contains a set of basic style libraries that are consistent with WeChat’s standard visual experience. WeUI was made to ensure that WeChat users could have the same clean, uniform experience with all WeChat web services, and to make their perception of WeChat applets more unified and cohesive. WeUI covers everything from form factors (buttons, lists, sliders, etc.)...

Downloads: 0 This Week

Last Update: 2025-09-12
See Project
23

NetherSX2 Classic

Continuation of NetherSX2 based on AetherSX2 3668

NetherSX2-classic is a companion and variant of NetherSX2 that targets a specific older base version of the AetherSX2 emulator (based on the 3668 branch), applying similar custom patches to provide a stable and performant PS2 emulation environment on Android devices. The project stitches in anti-tampering modifications, RetroAchievements notification fixes, and controller and GameDB updates while maintaining the legacy behavior of the classic build for compatibility with titles that might perform better on the older codebase. Because this classic branch starts from a slightly different upstream version than NetherSX2-patch, users often choose it for performance reasons on lower-power devices or for games with known regressions in newer builds. ...

Downloads: 738 This Week

Last Update: 2026-01-05
See Project
24

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper

WhisperSpeech is an open-source text-to-speech system created by “inverting” OpenAI’s Whisper, reusing its strengths as a semantic audio model to generate speech instead of only transcribing it. The project aims to be for speech what Stable Diffusion is for images: powerful, hackable, and safe for commercial use, with code under Apache-2.0/MIT and models trained only on properly licensed data. Its architecture follows a token-based, multi-stage pipeline inspired by AudioLM and SPEAR-TTS: Whisper is used to produce semantic tokens, EnCodec compresses the waveform into acoustic tokens, and Vocos reconstructs high-fidelity audio from those tokens. ...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
25

DeepSpeed MII

MII makes low-latency and high-throughput inference possible

...The Deep Learning (DL) open-source community has seen tremendous growth in the last few months. Incredibly powerful text generation models such as Bloom 176B, or image generation models such as Stable Diffusion, are now available to anyone with access to a handful of GPUs, or even a single GPU, through platforms such as Hugging Face. While open-sourcing has democratized access to AI capabilities, their application is still restricted by two critical factors: inference latency and cost. DeepSpeed-MII is a new open-source Python library from DeepSpeed, aimed at making low-latency, low-cost inference of powerful models not only feasible but also easily accessible. ...

Downloads: 0 This Week

Last Update: 2025-03-25
See Project