Showing 236 open source projects for "diffusion"

  • 1
    DragonianVoice

    C++ inference library for multiple SVC/TTS

    DragonianVoice is a C++ inference library that unifies multiple speech synthesis, voice conversion, and singing voice synthesis models under a single, high-performance ONNX-based framework. It focuses on being a reusable native library rather than a full UI product, with bindings for C, C++, and C# so it can be embedded into other applications or engines. The project supports a wide range of model families: TTS models such as Tacotron2, VITS, EmotionalVITS, BERTVits2, GPT-SoVITS, SVC systems...
    Downloads: 0 This Week
  • 2
    Qwen3-Omni

    Qwen3-Omni is a natively end-to-end, omni-modal LLM

    ...It achieves state-of-the-art results: across 36 audio and audio-visual benchmarks, it hits open-source SOTA on 32 and overall SOTA on 22, outperforming or matching strong closed-source models such as Gemini-2.5 Pro and GPT-4o. To reduce latency, especially in audio/video streaming, Talker predicts discrete speech codecs via a multi-codebook scheme and replaces heavier diffusion approaches.
    Downloads: 0 This Week
  • 3
    ConsistencyDecoder

    Consistency Distilled Diff VAE

    ConsistencyDecoder is a Python package from OpenAI that introduces an improved decoding method for variational autoencoders (VAEs) used in Stable Diffusion pipelines. Instead of relying solely on the standard GAN or VAE decoder, this approach leverages a Consistency Distilled Diff VAE, designed to produce higher-quality and more stable outputs from encoded latents. The project provides a simple API for encoding with a Stable Diffusion VAE and decoding using the new consistency model, allowing for side-by-side comparisons with traditional decoders. ...
    Downloads: 6 This Week
  • 4
    Stable Diffusion in Docker

    Run the Stable Diffusion releases in a Docker container

    Run the Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint. Run the Stable Diffusion releases on Huggingface in a GPU-accelerated Docker container. By default, the pipeline uses the full model and weights, which requires a CUDA-capable GPU with 8GB+ of VRAM. It should take a few seconds to create one image.
    Downloads: 0 This Week
  • 5
    Alpaca Electron

    The simplest way to run Alpaca on your own computer

    Alpaca Electron is built from the ground up to be the easiest way to chat with the Alpaca AI models. No command line or compiling is needed. Only Windows is currently supported. The new llama.cpp binaries that support GGUF have not yet been built for other platforms.
    Downloads: 1 This Week
  • 6
    DiffRhythm

    Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation

    DiffRhythm is an open-source, diffusion-based model designed to generate full-length songs. Focused on music creation, it combines advanced AI techniques to produce coherent and creative audio compositions. The model utilizes a latent diffusion architecture, making it capable of producing high-quality, long-form music. It can be accessed on Huggingface, where users can interact with a demo or download the model for further use.
    Downloads: 21 This Week
  • 7
    AnimateDiff

    Plug-n-play module turning text-to-image models into animation

    AnimateDiff is an open-source project designed to enhance text-to-image diffusion models by adding animation capabilities. It allows users to turn static images generated by popular text-to-image models into animated sequences without requiring additional model training. This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models.
    Downloads: 13 This Week
  • 8

    jags-wiener

    Wiener functions in JAGS

    The JAGS Wiener module is an extension for JAGS that provides Wiener process distribution functions, mainly the Wiener first passage time density. It allows stochastic nodes to be included with the first hitting time distribution of a diffusion process. Ubuntu users can also check out our PPA: https://launchpad.net/~cidlab/+archive/jwm
    Downloads: 15 This Week
  • 9
    OpenAssistant

    Chat-based assistant that understands tasks

    OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so. In the same way that Stable Diffusion helped the world make art and images in new ways, we want to improve the world by providing amazing conversational AI. We are in the early stages of development, working from established research in applying RLHF to large language models. Open Assistant is a project organized by LAION and individuals around the world interested in bringing this technology to everyone. ...
    Downloads: 6 This Week
  • 10
    Shap-E

    Generate 3D objects conditioned on text or images

    ...The model is built with a two-stage architecture: first an encoder that maps existing 3D assets into parameterizations of implicit functions, and then a conditional diffusion model trained on those parameterizations to generate new assets. Because it works at the level of implicit functions, Shap-E can render output both as textured meshes and NeRF-style volumetric renderings. The repository contains sample notebooks (e.g. sample_text_to_3d.ipynb, sample_image_to_3d.ipynb) so users can try out text → 3D or image → 3D generation. ...
    Downloads: 1 This Week
  • 11
    VideoCrafter2

    Overcoming Data Limitations for High-Quality Video Diffusion Models

    ...It features models for both text-to-video and image-to-video generation. The system is optimized for generating videos from textual descriptions or still images, leveraging advanced diffusion models. VideoCrafter2, an upgraded version, improves on its predecessor by enhancing motion dynamics and concept combinations, especially in low-data scenarios. Users can explore a wide range of creative possibilities, producing cinematic videos that combine artistic styles and real-world scenes.
    Downloads: 13 This Week
  • 12
    CVSim

    Cyclic Voltammetry Simulation Program for Research and Education

    This is a cyclic voltammetry (CV) program created for teaching CV to undergraduate chemistry students. For research, download the version from https://sourceforge.net/projects/cvsimplus/ . CVSim uses the implicit method to simulate diffusion. Development of new methods for more accurate simulation of diffusion is under way. For more details and discussion contact me at akeramid@ucy.ac.cy.
    Downloads: 0 This Week
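The CVSim description names "the implicit method" for diffusion without giving details. As a rough illustration only, the sketch below applies one common implicit scheme (backward Euler in time, central differences in space) to the 1D diffusion equation and solves the resulting tridiagonal system with the Thomas algorithm. The function names, the fixed-concentration boundaries, and the dimensionless parameter `lam` (D·Δt/Δx²) are assumptions made for this example, not CVSim's actual code.

```python
def thomas_solve(lower, diag, upper, rhs):
    """Solve a tridiagonal linear system with the Thomas algorithm.

    lower[i] multiplies x[i-1] and upper[i] multiplies x[i+1] in row i.
    """
    n = len(diag)
    c = [0.0] * n
    d = [0.0] * n
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    # Forward elimination.
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    # Back substitution.
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def implicit_diffusion_step(conc, lam):
    """Advance a concentration profile by one backward-Euler time step.

    Interior rows encode -lam*c[i-1] + (1 + 2*lam)*c[i] - lam*c[i+1] = old c[i].
    The first and last grid points are held fixed (an assumed boundary choice).
    """
    n = len(conc)
    lower = [0.0] + [-lam] * (n - 2) + [0.0]
    diag = [1.0] + [1.0 + 2.0 * lam] * (n - 2) + [1.0]
    upper = [0.0] + [-lam] * (n - 2) + [0.0]
    return thomas_solve(lower, diag, upper, list(conc))


# Hypothetical setup: electrode surface (index 0) held at 0,
# bulk solution (last index) held at 1, uniform start elsewhere.
profile = [0.0] + [1.0] * 10
for _ in range(200):
    profile = implicit_diffusion_step(profile, lam=2.0)
```

The value `lam=2.0` is deliberately larger than the 0.5 limit at which the simple explicit scheme becomes unstable; an implicit step stays stable there, which is the usual reason for choosing it.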
  • 13
    PromptSniffer

    View, extract, and remove AI generation metadata with a right click

    A powerful tool for reading, extracting, and removing AI generation metadata from image files. It is specifically designed to handle metadata from AI image generation tools like ComfyUI, Stable Diffusion, SwarmUI, InvokeAI, and more.

    Core Functionality
      • Read EXIF/Metadata: Extract and display comprehensive metadata from images
      • AI Metadata Detection: Automatically identify and highlight AI generation metadata
      • Metadata Removal: Strip AI generation metadata while preserving image quality
      • Batch Processing: Handle multiple files with wildcard patterns
      • Cross-Platform: Works on Windows, macOS, and Linux

    AI Tool Support
      • ComfyUI: Detects and extracts workflow JSON data
      • Stable Diffusion: Identifies prompts, parameters, and generation settings
      • SwarmUI/StableSwarmUI: Handles JSON-formatted metadata
      • Midjourney, DALL-E, NovelAI: Recognizes generation signatures
      • Automatic1111, InvokeAI: Extracts generation parameters
    Downloads: 3 This Week
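To make the read-and-remove idea in the PromptSniffer entry concrete, here is a minimal sketch of how text metadata can be listed and stripped from a PNG using only the chunk layout defined by the PNG format (length, type, body, CRC). It is not PromptSniffer's code. All function names are hypothetical, and the `parameters` keyword in the usage lines is an assumption based on how the Automatic1111 web UI is commonly described as storing its settings.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
TEXT_CHUNK_TYPES = {b"tEXt", b"zTXt", b"iTXt"}


def iter_chunks(data):
    """Yield (type, body, raw_bytes) for each chunk in a PNG byte string."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    offset = len(PNG_SIGNATURE)
    while offset + 12 <= len(data):
        length, chunk_type = struct.unpack(">I4s", data[offset:offset + 8])
        end = offset + 12 + length  # 4 length + 4 type + body + 4 CRC
        yield chunk_type, data[offset + 8:offset + 8 + length], data[offset:end]
        offset = end


def read_text_chunks(data):
    """Return {keyword: text} for every uncompressed tEXt chunk."""
    texts = {}
    for chunk_type, body, _raw in iter_chunks(data):
        if chunk_type == b"tEXt":
            keyword, _sep, text = body.partition(b"\x00")
            texts[keyword.decode("latin-1")] = text.decode("latin-1")
    return texts


def strip_text_chunks(data):
    """Rebuild the PNG without any text chunks; pixel data is untouched."""
    kept = [raw for chunk_type, _body, raw in iter_chunks(data)
            if chunk_type not in TEXT_CHUNK_TYPES]
    return PNG_SIGNATURE + b"".join(kept)


def make_chunk(chunk_type, body):
    """Helper for the demo below: serialize one chunk with its CRC."""
    crc = zlib.crc32(chunk_type + body) & 0xFFFFFFFF
    return struct.pack(">I", len(body)) + chunk_type + body + struct.pack(">I", crc)


# Demo on a hand-built chunk sequence (header, one text chunk, end marker).
# It has no image data, so it only exercises the chunk parser.
demo = (PNG_SIGNATURE
        + make_chunk(b"IHDR", struct.pack(">IIBBBBB", 1, 1, 8, 0, 0, 0, 0))
        + make_chunk(b"tEXt", b"parameters\x00a cat, Steps: 20")
        + make_chunk(b"IEND", b""))
found = read_text_chunks(demo)
cleaned = strip_text_chunks(demo)
```

Because removal only drops whole chunks and copies the rest byte for byte, image quality is unaffected, which matches the "while preserving image quality" claim in the description.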
  • 14
    CVSimPlus

    Cyclic Voltammetry Simulation Program for Research

    This is a cyclic voltammetry (CV) program created for teaching CV to undergraduate chemistry students and for research. It can be downloaded from https://sites.google.com/site/keramidasscientificsoftware/cvsim. CVSim uses either implicit or RK4 methods to simulate diffusion. Development of new methods to simulate diffusion more accurately is under way. A lot of time has been, and will continue to be, invested in making the CV simulation as accurate as possible. The current CV results are satisfactory but can be improved. For more details and discussion contact me at akeramid@ucy.ac.cy.
    Downloads: 0 This Week
  • 15
    Teem

    Tools to process and visualize scientific data and images

    Teem is a set of C libraries for manipulation, measurement, and visualization of structured scientific data. Includes N-dimensional image I/O and processing, volume rendering, diffusion tensor processing, and more. Latest code via git clone http://git.code.sf.net/p/teem/teem.git . Join us at https://discord.gg/xBBqZGXkF7
    Downloads: 30 This Week
  • 16
    DALL-E 2 - Pytorch

    Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis

    Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch. The main novelty seems to be an extra layer of indirection with the prior network (whether it is an autoregressive transformer or a diffusion network), which predicts an image embedding based on the text embedding from CLIP. Specifically, this repository will only build out the diffusion prior network, as it is the best performing variant (but which incidentally involves a causal transformer as the denoising network). Training DALLE-2 is a 3-step process, with the training of CLIP being the most important. ...
    Downloads: 0 This Week
  • 17
    SpatGRIS

    SpatGRIS is an external multichannel sound spatializer.

    UPDATE (October 2024): New version 3.3.7 • SpatGRIS is now Universal (Intel and M1-M2-M3) and tested under macOS 14 Sonoma and Windows 10 • SpatGRIS uses BlackHole, a very powerful and flexible virtual audio device with 256 channels. New: • A general MUTE button • BlackHole 0.6.0 • Speaker Setup 3D view is now a separate application: SpeakerView • French Manual • SpatGRIS PLAYER: a function that allows using SpatGRIS3 as a standalone software to play any piece encoded by it to any...
    Downloads: 57 This Week
  • 18
    ControlNet

    Let us control diffusion models

    ControlNet is a neural network architecture designed to add conditional control to text-to-image diffusion models. Rather than training from scratch, ControlNet “locks” the weights of a pre-trained diffusion model and introduces a parallel trainable branch that learns additional conditions—like edges, depth maps, segmentation, human pose, scribbles, or other guidance signals. This allows the system to control where and how the model should focus during generation, enabling users to steer layout, structure, and content more precisely than prompt text alone. ...
    Downloads: 0 This Week
  • 19
    KoboldCpp

    Run GGUF models easily with a UI or API. One File. Zero Install.

    KoboldCpp is an easy-to-use AI text-generation software for GGML and GGUF models, inspired by the original KoboldAI. It's a single self-contained distributable that builds off llama.cpp and adds many additional powerful features.
    Downloads: 246 This Week
  • 20
    Float-ditherer

    A dithering software based around floats

    Float-ditherer is a tool that converts images into pure 1-bit images using a user-specified palette. It is tailored to converting images, textures, and sprites with a float-based sampling method and a float matrix. The software is meant to be lightweight and is aimed at power users.
    Downloads: 5 This Week
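The Float-ditherer entry mentions a float matrix but does not describe the algorithm. One well-known technique that fits that wording is ordered dithering, where each pixel is compared against a tiled matrix of float thresholds. The sketch below shows that generic technique, not Float-ditherer's implementation. The 2x2 Bayer-style matrix, the function name, and the input format (rows of brightness values in the range 0.0 to 1.0) are assumptions for illustration.

```python
# Bayer-style 2x2 threshold matrix, expressed as floats in (0, 1).
THRESHOLDS_2X2 = [
    [0.125, 0.625],
    [0.875, 0.375],
]


def dither_1bit(pixels, matrix=THRESHOLDS_2X2):
    """Map rows of float brightness values (0.0 to 1.0) to rows of 0/1.

    The matrix is tiled across the image; a pixel becomes 1 when its
    brightness exceeds the threshold at its tiled position.
    """
    size = len(matrix)
    return [
        [1 if value > matrix[y % size][x % size] else 0
         for x, value in enumerate(row)]
        for y, row in enumerate(pixels)
    ]


# A flat mid-gray patch turns into a checkerboard of on and off pixels.
gray = [[0.5] * 4 for _ in range(4)]
bits = dither_1bit(gray)
```

Each output bit can then be used as an index into a two-colour palette, which is one way a user-specified palette could be combined with a 1-bit result.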
  • 21
    chromabook-coloring-book-prompt-builder

    AI Coloring Pages Prompt Builder

    Chromabook Prompt Builder is a lightweight Chrome extension that simplifies the process of generating AI coloring-page prompts. It supports direct export to MidJourney, Stable Diffusion, Ideogram, and ChatGPT. The tool includes presets for kids, kawaii, mandalas, and affirmations, plus a surprise randomizer and history tracking.
    Downloads: 0 This Week
  • 22

    Fosite - advection problem solver

    Numerical simulation code for solving transport equations in 1D/2D/3D

    ...The method is 2nd-order accurate in space and uses high-order Runge-Kutta and multistep schemes for time evolution. In addition to the pure advection code, several source terms have been implemented, including viscous diffusion and gravitational acceleration. Fosite is written with object-oriented patterns in Fortran 2003 and follows the Structure of Arrays (SoA) layout, operating on generic field datatypes. This allows for high performance on modern architectures (SIMD). It is parallelized and vectorized. The software is thereby optimized for the NEC SX-Aurora TSUBASA Vector Engine.
    Downloads: 0 This Week
  • 23
    OptionMatrix

    Financial Derivatives Calculator with 171+ Models (Options Calculator)

    ...A generalized date engine can calculate recurring distances to any industry used expiration into the future. Spread engine with spread views. Models Supported: Black-Scholes, Merton-73, Black-76, Roll Geske Whaley, Garman KohlHagen, Jump Diffusion, Quanto, Vasicek Bond Option, Turnbull Wakeman Asian, TimeSwitchOption, Look Barrier, Bachelier, PartialTimeBarrier, GapOption, Extreme Spread Option, Simple Chooser, ComplexChooser, PartialFixedLB, Executive, CashOrNothing, Extendible Writer, OptionsOnOptions, BAWAmericanApprox, BSAmericanApprox, AssetOrNothing, Bisection, BAWbisection, BSbisection, Gfrench, Gcarry, Swapoption, Complex Chooser, Super Share, EquityLinkedFXO, Spread Approximation, BinaryBarrier and more
    Downloads: 2 This Week
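Black-Scholes is the first model in the OptionMatrix list, so a small worked example may help readers who have not seen the formula written out. The sketch below prices a European call and put on a non-dividend-paying asset with the standard closed-form expressions. It is independent of OptionMatrix's own code, and the function and parameter names are chosen for this example only.

```python
from math import erf, exp, log, sqrt


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + erf(x / sqrt(2.0)))


def black_scholes(spot, strike, rate, vol, years):
    """Return (call, put) prices for a European option, no dividends.

    rate is the continuously compounded risk-free rate,
    vol is the annualized volatility, years is the time to expiry.
    """
    d1 = (log(spot / strike) + (rate + 0.5 * vol * vol) * years) / (vol * sqrt(years))
    d2 = d1 - vol * sqrt(years)
    discount = exp(-rate * years)
    call = spot * norm_cdf(d1) - strike * discount * norm_cdf(d2)
    put = strike * discount * norm_cdf(-d2) - spot * norm_cdf(-d1)
    return call, put


# At-the-money example: spot 100, strike 100, 5% rate, 20% volatility, 1 year.
call_price, put_price = black_scholes(100.0, 100.0, 0.05, 0.20, 1.0)
```

A quick sanity check for any implementation is put-call parity: the call price minus the put price should equal the spot minus the discounted strike.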
  • 24
    StoryTeller

    Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.

    A multimodal AI story teller, built with Stable Diffusion, GPT, and neural text-to-speech (TTS). Given a prompt as an opening line of a story, GPT writes the rest of the plot; Stable Diffusion draws an image for each sentence; a TTS model narrates each line, resulting in a fully animated video of a short story, replete with audio and visuals. To develop locally, install dev dependencies and install pre-commit hooks.
    Downloads: 1 This Week
  • 25
    Consistency Models

    Official repo for consistency models

    consistency_models is the repository for Consistency Models, a new family of generative models introduced by OpenAI that aim to generate high-quality samples by mapping noise directly into data — circumventing the need for lengthy diffusion chains. It builds on and extends diffusion model frameworks (e.g. based on the guided-diffusion codebase), adding techniques like consistency distillation and consistency training to enable fast, often one-step, sample generation. The repo is implemented in PyTorch and includes support for large-scale experiments on datasets like ImageNet-64 and LSUN variants. ...
    Downloads: 0 This Week