user interface free download

Showing 138 open source projects for "user interface"

Filters: Artificial Intelligence, Python

1

Open Interface

Control Any Computer Using LLMs

Open Interface is a cross-platform application that allows users to control their computers using large language models (LLMs). By sending user requests to an LLM backend, it determines the necessary steps and executes them by simulating keyboard and mouse inputs. The system can adjust its actions based on real-time feedback, providing a self-driving computer experience.

Downloads: 2 This Week

Last Update: 2025-05-21
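
The plan, execute, and adjust cycle described for Open Interface can be pictured as a small feedback loop. The following is only a minimal sketch under stated assumptions: `run_task`, `plan_steps`, `execute`, and `observe` are hypothetical names chosen for illustration and are not taken from the project, and the LLM backend and input simulation are replaced by plain callables.

```python
from typing import Callable, List


def run_task(
    request: str,
    plan_steps: Callable[[str, str], List[str]],  # hypothetical stand-in for the LLM backend
    execute: Callable[[str], None],               # hypothetical stand-in for keyboard/mouse simulation
    observe: Callable[[], str],                   # hypothetical stand-in for screen feedback
    max_rounds: int = 3,
) -> List[str]:
    """Plan steps, execute them, then re-plan using the latest feedback."""
    history: List[str] = []
    feedback = ""
    for _ in range(max_rounds):
        steps = plan_steps(request, feedback)
        if not steps:
            # The planner has nothing left to do, so the task is treated as finished.
            break
        for step in steps:
            execute(step)
            history.append(step)
        feedback = observe()
    return history


if __name__ == "__main__":
    # A fake planner that issues one step, then stops once it sees feedback.
    fake_plan = lambda request, feedback: ["open editor"] if feedback == "" else []
    print(run_task("write a note", fake_plan, lambda step: None, lambda: "editor is open"))
```

Passing the three collaborators in as callables keeps the loop testable without a real model or real input devices.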
2

Ultimate Vocal Remover (UVR5)

GUI for a Vocal Remover that uses Deep Neural Networks

This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).

1 Review

Downloads: 8,658 This Week

Last Update: 2025-01-20
3

ComfyUI

The most powerful and modular diffusion model GUI, API, and backend

ComfyUI is a powerful, modular GUI and backend for diffusion models. This UI will let you design and execute advanced Stable Diffusion pipelines using a graph/nodes/flowchart-based interface. We are a team dedicated to iterating and improving ComfyUI, supporting the ComfyUI ecosystem with tools like node manager, node registry, CLI, automated testing, and public documentation. Open source AI models will win in the long run against closed models and we are only at the beginning. Our core mission...

Downloads: 671 This Week

Last Update: 6 days ago
4

Fooocus

Focus on prompting and generating

...Built on Gradio and leveraging Stable Diffusion XL, Fooocus eliminates the need for manual parameter tweaking, allowing users to focus solely on crafting prompts. It offers a user-friendly interface with minimal setup, making advanced image synthesis accessible to a broader audience.

Downloads: 378 This Week

Last Update: 2025-06-03
5

AUTOMATIC1111 Stable Diffusion web UI

Stable Diffusion web UI

AUTOMATIC1111's stable-diffusion-webui is a powerful, user-friendly web interface built on the Gradio library that allows users to easily interact with Stable Diffusion models for AI-powered image generation. Supporting both text-to-image (txt2img) and image-to-image (img2img) generation, this open-source UI offers a rich feature set including inpainting, outpainting, attention control, and multiple advanced upscaling options.

1 Review

Downloads: 217 This Week

Last Update: 2025-06-02
6

Gradio

Create UIs for your machine learning model in Python in 3 minutes

Gradio is the fastest way to demo your machine learning model with a friendly web interface so that anyone can use it, anywhere! Gradio can be installed with pip. Creating a Gradio interface only requires adding a couple of lines of code to your project. You can choose from a variety of interface types to wrap your function. Gradio can be embedded in Python notebooks or presented as a webpage. A Gradio interface can automatically generate a public link you can share with colleagues that...

1 Review

Downloads: 8 This Week

Last Update: 2026-06-17
7

Hermes Agent

The agent that grows with you

...The agent interfaces with messaging platforms like Telegram, Discord, Slack, and WhatsApp through a single gateway process, and also offers an interactive terminal user interface with history, autocomplete, and streamable tool output. It supports scheduled automation in natural language, allowing users to set up recurring tasks such as daily briefings or system audits that it runs unattended.

Downloads: 28 This Week

Last Update: 5 days ago
8

MoneyPrinterTurbo

Generate short videos with one click using AI LLM

MoneyPrinterTurbo is an AI-driven tool that enables users to generate high-definition short videos with minimal input. By providing a topic or keyword, the system automatically creates video scripts, sources relevant media assets, adds subtitles, and incorporates background music, resulting in a polished video ready for distribution.

Downloads: 241 This Week

Last Update: 19 hours ago
9

OmniParser

A simple screen parsing tool towards pure vision-based GUI agents

OmniParser is a comprehensive method for parsing user interface screenshots into structured elements, significantly enhancing the ability of multimodal models like GPT-4 to generate actions accurately grounded in corresponding regions of the interface. It reliably identifies interactable icons within user interfaces and understands the semantics of various elements in a screenshot, associating intended actions with the correct screen regions.

Downloads: 5 This Week

Last Update: 2025-09-09
10

Video-subtitle-extractor

A GUI tool for extracting hard-coded subtitles (hardsub) from videos

Extracts hard-coded subtitles from videos and generates SRT files. It is a deep learning-based video subtitle extraction framework that covers subtitle region detection and subtitle content extraction. Text recognition runs locally with OCR, so there is no need to apply for, set up, or call any third-party API, and no need to access online OCR services such as Baidu...

1 Review

Downloads: 62 This Week

Last Update: 2026-04-05
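
The SRT files mentioned above are plain text: each cue is a sequence number, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` time range, and the subtitle text, with a blank line between cues. The following is a minimal sketch of writing that format, assuming cues arrive as `(start_seconds, end_seconds, text)` tuples. The function names `format_timestamp` and `to_srt` are hypothetical and are not taken from the project.

```python
from typing import Iterable, Tuple


def format_timestamp(seconds: float) -> str:
    """Render a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(cues: Iterable[Tuple[float, float, str]]) -> str:
    """Build SRT text from (start_seconds, end_seconds, text) cues."""
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        time_range = f"{format_timestamp(start)} --> {format_timestamp(end)}"
        blocks.append(f"{index}\n{time_range}\n{text}\n")
    # Joining with a newline leaves one blank line between consecutive cues.
    return "\n".join(blocks)


if __name__ == "__main__":
    print(to_srt([(0.0, 1.25, "Hello"), (1.5, 3.0, "World")]))
```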
11

SwarmUI

Modular AI image and video generation web UI with extensible tools

SwarmUI is a modular web-based user interface designed for AI-driven image generation, with a strong focus on usability, performance, and extensibility. It serves as a unified environment for working with multiple AI models, including Stable Diffusion and newer image and video generation systems, allowing users to create and manage outputs through a browser interface.

Downloads: 10 This Week

Last Update: 2026-03-18
12

Search with Lepton

Lightweight demo to build a conversational AI search engine quickly

Search with Lepton is an open source demonstration project that shows how to build a conversational search engine using the Lepton AI framework. It combines traditional web search with large language models to provide natural language answers to user queries. It retrieves information from supported search engines and uses that context to generate responses through a retrieval-augmented generation approach. The implementation is intentionally minimal, containing fewer than 500 lines of code while still providing a complete working example of an AI-powered search system. It includes both a backend service written in Python and a web interface that allows users to interact with the search engine in a conversational format. ...

Downloads: 2 This Week

Last Update: 2026-06-29
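
The retrieval-augmented generation approach described for Search with Lepton comes down to two steps: fetch relevant context, then hand that context to a language model together with the question. The following is a minimal sketch of that flow, not the project's code. It assumes an in-memory document list with a naive word-overlap ranking in place of a real search engine, and a caller-supplied `generate` callable in place of a real LLM. All names are hypothetical.

```python
from typing import Callable, List


def retrieve(query: str, documents: List[str], top_k: int = 2) -> List[str]:
    """Rank documents by how many words they share with the query."""
    query_words = set(query.lower().split())
    scored = [(len(query_words & set(doc.lower().split())), doc) for doc in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    # Keep only documents that share at least one word with the query.
    return [doc for score, doc in scored[:top_k] if score > 0]


def build_prompt(query: str, context: List[str]) -> str:
    """Combine the retrieved context and the question into one prompt."""
    context_block = "\n".join(f"- {doc}" for doc in context)
    return f"Context:\n{context_block}\n\nQuestion: {query}\nAnswer:"


def answer(query: str, documents: List[str], generate: Callable[[str], str]) -> str:
    """Retrieve context first, then ask the model to answer using it."""
    return generate(build_prompt(query, retrieve(query, documents)))


if __name__ == "__main__":
    docs = ["Paris is the capital of France", "SRT is a subtitle format"]
    # An echoing stand-in for the model, used only to show the prompt it would receive.
    print(answer("capital of France", docs, generate=lambda prompt: prompt))
```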
13

InvokeAI

InvokeAI is a leading creative engine for Stable Diffusion models

InvokeAI is an implementation of Stable Diffusion, the open source text-to-image and image-to-image generator. It provides a streamlined process with various new features and options to aid the image generation process. It runs on Windows, Mac, and Linux machines, and on GPU cards with as little as 4 GB of RAM. InvokeAI is a leading creative engine built to empower professionals and enthusiasts alike. Generate and create stunning visual media using the latest AI-driven technologies....

1 Review

Downloads: 14 This Week

Last Update: 1 day ago
14

UFO³

Weaving the Digital Agent Galaxy

UFO is an open-source framework developed by Microsoft for building intelligent agents that automate interactions with graphical user interfaces on the Windows operating system. The system allows users to issue natural language instructions that are translated into automated actions across multiple desktop applications. Using a dual-agent architecture, the framework analyzes both visual interface elements and system control structures in order to understand how applications should be manipulated. ...

Downloads: 5 This Week

Last Update: 2026-06-12
15

GenMedia Creative Studio

AI generative media user experience highlighting use of APIs

GenMedia Creative Studio is a Google Cloud reference application for experimenting with generative media workflows on Vertex AI. It provides a user experience for working with models and APIs such as Gemini, Veo, Imagen, Gemini Image, Gemini TTS, Chirp 3, and Lyria. The project is built to showcase multimodal creation across text, image, video, speech, and music from one deployable interface. It is useful for creators, marketers, developers, and technical teams that want to prototype media-generation experiences using Google Cloud services. ...

Downloads: 4 This Week

Last Update: 2026-06-02
16

clone-voice

A sound cloning tool with a web interface, using your voice

Clone-voice is a local voice-cloning tool that lets you synthesize speech in any target voice or convert one recording into another voice using the same timbre. It is built around Coqui’s XTTS-v2 model, so it inherits multilingual support and modern neural TTS quality while wrapping it in a user-friendly desktop workflow. The app is designed to be very easy to use: you download a precompiled package, double-click app.exe, and it launches a browser-based web interface where you control cloning and synthesis. It does not require an NVIDIA GPU to run basic tasks, although GPU acceleration can be used when available, making it accessible on modest machines. ...

Downloads: 15 This Week

Last Update: 2025-11-28
17

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Open-Sora is an open-source initiative aimed at democratizing high-quality video production. It offers a user-friendly platform that simplifies the complexities of video generation, making advanced video techniques accessible to everyone. The project embraces open-source principles, fostering creativity and innovation in content creation. Open-Sora provides tools, models, and resources to create high-quality videos, aiming to lower the entry barrier for video production and support diverse...

1 Review

Downloads: 20 This Week

Last Update: 2025-03-17
18

AG-UI

The Agent-User Interaction Protocol

AG-UI is an open, lightweight protocol for connecting AI agents to user-facing applications through a standardized event-based interface. It is designed to make agent behavior visible, interactive, and controllable inside real-time front-end experiences. Instead of treating an AI agent as a black-box chat endpoint, AG-UI defines structured events for messages, tool calls, state changes, lifecycle updates, and user interactions.

Downloads: 0 This Week

Last Update: 3 days ago
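
To make the idea of structured events concrete, the following sketch models an agent's output as a stream of typed events that a front end folds into UI state, rather than as a single block of chat text. It is an illustration only and does not reproduce the AG-UI schema: the `AgentEvent` class, the event type strings, and the payload keys are all hypothetical.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Any, Dict, Iterable


@dataclass
class AgentEvent:
    """A hypothetical event envelope: a type tag plus a free-form payload."""
    type: str  # e.g. "message", "tool_call", "state_change" (illustrative names)
    payload: Dict[str, Any] = field(default_factory=dict)


def encode(event: AgentEvent) -> str:
    """Serialize an event to JSON so it can be sent to a front end."""
    return json.dumps(asdict(event), sort_keys=True)


def apply_events(events: Iterable[AgentEvent]) -> Dict[str, Any]:
    """Fold an event stream into a simple UI state dictionary."""
    ui: Dict[str, Any] = {"messages": [], "tool_calls": [], "state": {}}
    for event in events:
        if event.type == "message":
            ui["messages"].append(event.payload["text"])
        elif event.type == "tool_call":
            ui["tool_calls"].append(event.payload["name"])
        elif event.type == "state_change":
            ui["state"].update(event.payload)
    return ui


if __name__ == "__main__":
    stream = [
        AgentEvent("message", {"text": "Searching..."}),
        AgentEvent("tool_call", {"name": "web_search"}),
        AgentEvent("state_change", {"status": "done"}),
    ]
    print(apply_events(stream))
```

Because each event carries its own type, the front end can show tool calls and state changes as they happen instead of waiting for a final reply.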
19

Clay Foundation Model

The Clay Foundation Model - An open source AI model and interface

The Clay Foundation Model is an open-source AI model and interface designed to provide comprehensive data and insights about Earth. It aims to serve as a foundational tool for environmental monitoring, research, and decision-making by integrating various data sources and offering an accessible platform for analysis.

Downloads: 0 This Week

Last Update: 2025-07-05
20

AppAgent

Multimodal Agents as Smartphone Users, an LLM-based multimodal agent

...AppAgent combines vision capabilities with language reasoning to understand interface elements and determine which actions are required to accomplish a task. The system also includes mechanisms for exploration and learning, allowing the agent to analyze user interface layouts and build structured knowledge about how different apps function.

Downloads: 0 This Week

Last Update: 2026-03-04
21

KeepChatGPT

Browser userscript that enhances ChatGPT reliability and usability

...By automating session refresh and maintaining active connections, KeepChatGPT reduces the need for repeated manual steps when recovering from errors or expired sessions. KeepChatGPT also introduces a variety of enhancements that improve the overall interface and user experience, including page cleanup, expanded display layouts, conversation cloning, and detailed chat information.

Downloads: 1 This Week

Last Update: 2026-06-29
22

Magentic UI

A research prototype of a human-centered web agent

Magentic-UI is a research prototype developed by Microsoft that serves as a human-centered interface powered by a multi-agent system. It enables users to automate complex web tasks, such as browsing, form filling, and data analysis, while maintaining control over the process. The system emphasizes transparency and user involvement, making it suitable for tasks requiring both automation and human oversight.

Downloads: 0 This Week

Last Update: 2026-05-21
23

Ollama RAG Chatbot

Chat with multiple PDFs locally

Ollama RAG Chatbot is a local-first retrieval chatbot project built to let users chat with the contents of multiple PDF documents through a simple interface. The project is framed as an experiment, but its setup and packaging make it approachable for practical local use as well. It supports running on a local machine or in Kaggle, which lowers the barrier for users who want to test RAG workflows without building everything from scratch. Model support is flexible, with compatibility for both Hugging Face models and Ollama-based models, and the interface is delivered through Gradio for a lightweight user experience. ...

Downloads: 0 This Week

Last Update: 2026-04-20
24

LibrePhotos

A self-hosted open source photo management service

LibrePhotos is an open-source self-hosted photo management platform designed to organize, browse, and analyze personal media libraries while preserving user privacy. The system allows individuals to store and manage their photos and videos locally rather than relying on commercial cloud services. It provides features similar to services like Google Photos but runs on a private server controlled by the user. The application includes AI-powered tools that automatically analyze images to detect faces, objects, and locations, allowing photos to be grouped and searched more efficiently. ...

Downloads: 1 This Week

Last Update: 2026-06-27
25

SD.Next

All-in-one WebUI for AI generative image and video creation

SD.Next is an all-in-one web user interface for generative image creation that expands beyond basic Stable Diffusion workflows to cover broader image and video generation, captioning, and processing tasks. It is designed as a power-user environment where model management, generation features, and workflow controls are centralized in a single UI rather than spread across separate scripts and utilities.

Downloads: 9 This Week

Last Update: 2026-06-16