Showing 135 open source projects for "requirements"

  • 1
    ACE-Step 1.5

    The most powerful local music generation model

    ...It integrates cutting-edge generative techniques—such as diffusion-based synthesis combined with compressed autoencoders and lightweight transformer elements—to produce high-quality full-length music tracks with rapid inference times, capable of generating a complete song in seconds on modern GPUs while remaining efficient enough to run on consumer-grade hardware with minimal memory requirements. Beyond straightforward text-to-music synthesis, ACE-Step 1.5 enables flexible creative workflows, including tasks like cover generation, editing existing tracks, transforming vocals to background accompaniment, and stylistic personalization using low-rank adaptation from just a few example songs.
    Downloads: 75 This Week
  • 2
    Kiro

    Kiro is an agentic IDE that works alongside you from prototype

    Kiro is an agentic AI-powered integrated development environment and command-line tool designed to help developers move from early-stage prototypes to production-ready software through structured, spec-driven development. Unlike traditional AI coding assistants that rely heavily on ad hoc prompts, Kiro transforms natural language input into formalized requirements, architectural plans, and step-by-step implementation tasks, ensuring clarity and maintainability throughout the development lifecycle. It introduces the concept of “specs,” which act as executable documentation that defines system behavior, constraints, and acceptance criteria, allowing developers to guide AI agents more precisely. ...
    Downloads: 17 This Week
  • 3
    SillyTavern

    LLM Frontend for Power Users

    Mobile-friendly, Multi-API (KoboldAI/CPP, Horde, NovelAI, Ooba, OpenAI, OpenRouter, Claude, Scale), VN-like Waifu Mode, Horde SD, System TTS, WorldInfo (lorebooks), customizable UI, auto-translate, and more prompt options than you'd ever want or need. Optional Extras server for more SD/TTS options + ChromaDB/Summarize. SillyTavern is a user interface you can install on your computer (and Android phones) that allows you to interact with text generation AIs and chat/roleplay with characters...
    Downloads: 630 This Week
  • 4
    SpeechRecognition

    Speech recognition module for Python

    ...PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.
    Downloads: 16 This Week
  • 5
    llama.cpp

    Port of Facebook's LLaMA model in C/C++

    The llama.cpp project enables the inference of Meta's LLaMA model (and other models) in pure C/C++ without requiring a Python runtime. It is designed for efficient and fast model execution, offering easy integration for applications needing LLM-based capabilities. The repository focuses on providing a highly optimized and portable implementation for running large language models directly within C/C++ environments.
    Downloads: 177 This Week
  • 6
    AutoBE

    AI Vibe Coding Agent of TS backend server

    ...AutoBE can be explored through a local playground where users chat with agents and manage sessions. Its main value is giving developers and non-programmers a structured way to generate backend systems from requirements while still producing documentation and tests.
    Downloads: 0 This Week
  • 7
    ChatGLM-6B

    ChatGLM-6B: An Open Bilingual Dialogue Language Model

    ...It is optimized for dialogue and question answering with a balance between performance and deployability in consumer hardware settings. Support for quantized inference (INT4, INT8) to reduce GPU memory requirements. Automatic mode switching between precision/memory tradeoffs (full/quantized).
    Downloads: 7 This Week
  • 8
    Chrome DevTools MCP

    Chrome DevTools for coding agents

    ...It makes a running Chrome instance visible to MCP clients, enabling agents to debug websites end-to-end—launching Chrome, navigating, profiling, and collecting artifacts in a structured way. The repository spells out environment requirements and cautions that exposing a live browser to agents grants powerful access, so sensitive data should be handled carefully. Beyond static inspection, it exposes operational tools like starting a performance trace that an agent can later analyze to propose optimizations. The server is intended to slot into MCP-capable assistants and IDEs, giving them reliable, typed tools and resource endpoints rather than ad-hoc automation. ...
    Downloads: 4 This Week
  • 9
    WanGP

    AI video generator optimized for low VRAM and older GPUs

    ...It acts as a unified interface for running multiple video, image, and audio generation models, including Wan-based models as well as other systems like Hunyuan Video, Flux, and Qwen. A key focus of the project is reducing VRAM requirements, enabling some workflows to run on as little as 6 GB while still supporting older Nvidia and certain AMD GPUs. Wan2GP provides a full web-based interface that simplifies interaction with complex generative pipelines, making it easier to configure prompts, models, and rendering settings. It also integrates a wide range of utilities such as prompt enhancement, mask editing, motion design, and extraction tools for pose, depth, and flow data to support advanced video workflows.
    Downloads: 52 This Week
  • 10
    LatentSync

    Taming Stable Diffusion for Lip Sync

    ...The system leverages a U-Net diffusion backbone, with cross-attention of audio embeddings (via an audio encoder) and reference video frames to guide generation, and applies a set of loss functions (temporal, perceptual, sync-net based) to enforce lip-sync accuracy, visual fidelity, and temporal consistency. Over versions, LatentSync has improved temporal stability and lowered resource requirements — making inference more practical (e.g. 8 GB VRAM for earlier versions, somewhat higher for latest models).
    Downloads: 5 This Week
  • 11
    rwkv.cpp

    INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

    Besides the usual FP32, it supports FP16, quantized INT4, INT5 and INT8 inference. This project is focused on CPU, but cuBLAS is also supported. RWKV is a novel large language model architecture, with the largest model in the family having 14B parameters. In contrast to Transformer with O(n^2) attention, RWKV requires only state from the previous step to calculate logits. This makes RWKV very CPU-friendly on large context lengths.
    Downloads: 0 This Week
  • 12
    Noi

    Power Your World with AI

    Power your world with AI, to explore, extend, and empower.
    Downloads: 4 This Week
  • 13
    ChatDev

    Create Customized Software using Natural Language Idea

    ChatDev is an AI-powered development tool designed to simulate the software development lifecycle using multi-agent collaboration. It allows multiple AI agents to take on roles such as product managers, developers, and testers to collaboratively generate, refine, and evaluate software code. This project explores how AI can be leveraged to automate and optimize development workflows.
    Downloads: 9 This Week
  • 14
    Paseo

    Orchestrate coding agents remotely from your phone, desktop and CLI

    Paseo is a modern development platform designed to streamline the creation and management of web applications through a focus on simplicity, performance, and developer experience. It provides a structured environment for building applications with minimal configuration, allowing developers to focus on core functionality rather than setup complexity. The platform likely includes tools for routing, state management, and component organization, enabling rapid development of scalable...
    Downloads: 10 This Week
  • 15
    Spec Kit

    Toolkit to help you get started with Spec-Driven Development

    ...It introduces a structured process in which developers define detailed specifications first, then allow AI tools to generate plans, tasks, and implementation code aligned with those requirements. The toolkit provides scaffolding, prompt templates, and automation scripts that help teams maintain a clear source of truth throughout the development lifecycle. By emphasizing intent before code, Spec Kit reduces ambiguity and improves the reliability of AI-generated output. It integrates with popular AI coding tools such as GitHub Copilot and similar assistants, allowing developers to embed spec-driven practices directly into their existing workflows. ...
    Downloads: 10 This Week
  • 16
    DESIGN.md

    A format specification for describing a visual identity

    ...The file combines machine-readable design tokens with human-readable explanations, enabling agents to generate consistent user interfaces aligned with a brand. By providing persistent design context, it eliminates the need to repeatedly describe styling requirements to AI tools. The format supports interoperability across platforms and tools, making it a potential standard for agent-driven UI generation. It also includes tooling for validation and exporting design tokens. The goal is to enable agents to produce accurate, on-brand designs automatically.
    Downloads: 9 This Week
  • 17
    llmfit

    157 models, 30 providers, one command to find what runs on hardware

    llmfit is a terminal-based utility that helps developers determine which large language models can realistically run on their local hardware by analyzing system resources and model requirements. The tool automatically detects CPU, RAM, GPU, and VRAM specifications, then ranks available models based on performance factors such as speed, quality, and memory fit. It provides both an interactive terminal user interface and a traditional CLI mode, enabling flexible workflows for different user preferences. llmfit also supports advanced configurations including multi-GPU setups, mixture-of-experts architectures, and dynamic quantization recommendations. ...
    Downloads: 17 This Week
  • 18
    Mochi Diffusion

    Run Stable Diffusion on Mac natively

    Run Stable Diffusion on Mac natively. This app uses Apple's Core ML Stable Diffusion implementation to achieve maximum performance and speed on Apple Silicon based Macs while reducing memory requirements. Extremely fast and memory efficient (~150MB with Neural Engine). Runs well on all Apple Silicon Macs by fully utilizing Neural Engine. Generate images locally and completely offline. Generate images based on an existing image (commonly known as Image2Image). Generated images are saved with prompt info inside EXIF metadata (view in Finder's Get Info window). Convert generated images to high resolution (using RealESRGAN). Autosave & restore images. ...
    Downloads: 16 This Week
  • 19
    Xianyu Intelligent Monitor Bot

    AI tool for real-time monitoring and analysis of Goofish listings

    ...A built-in web management interface allows users to create tasks, review results, and manage monitoring rules without relying solely on command line tools. AI models analyze product descriptions, images, and seller information to determine whether a listing meets defined requirements and should be recommended to the user.
    Downloads: 0 This Week
  • 20
    Chipper

    AI interface for tinkerers (Ollama, Haystack RAG, Python)

    ...It offers integration with tools like Ollama and Haystack for Retrieval-Augmented Generation (RAG), enabling users to build and test AI applications efficiently. Chipper supports Python and provides a modular architecture, allowing for customization and extension based on specific project requirements.
    Downloads: 0 This Week
  • 21
    Meetily

    Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper

    ...It’s built for organizations that want meeting intelligence without sending recordings or transcripts to third-party cloud services, which helps address compliance and data sovereignty requirements. The app supports live transcription with local model options (including Whisper- and Parakeet-based workflows) and presents the transcript as the meeting happens, making it useful both for note-taking and accessibility. After or during the session, it can produce structured, AI-generated summaries, and it’s designed to be flexible about where that summarization comes from, supporting local providers as well as external endpoints when allowed by policy.
    Downloads: 9 This Week
  • 22
    OpenSpec

    Spec-driven development (SDD) for AI coding assistants

    OpenSpec is a lightweight specification layer designed to improve reliability when working with AI coding assistants by formalizing requirements before code generation begins. The project addresses the common issue where AI tools produce inconsistent results when specifications exist only in chat history. It introduces a structured workflow that encourages teams to agree on what should be built before implementation starts. OpenSpec integrates into development pipelines and acts as a source of truth for AI-assisted coding sessions. ...
    Downloads: 3 This Week
  • 23
    GLM-4.1V

    GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

    ...Though smaller in scale, GLM-4.1V maintains competitive performance, particularly impressive on many benchmarks for models of its size: in fact, on a number of multimodal reasoning and vision-language tasks it outperforms some much larger models from other families. It represents a trade-off: somewhat reduced capacity compared to 4.5V or 4.6V, but with benefits in terms of speed, deployability, and lower hardware requirements — making it especially useful for developers experimenting locally, building lightweight agents, or deploying on limited infrastructure. Given its open-source availability under the same project repository, it provides an accessible entry point for testing multimodal reasoning and building proof-of-concept applications.
    Downloads: 0 This Week
  • 24
    IndexTTS2

    Industrial-level controllable zero-shot text-to-speech system

    IndexTTS is a modern, zero-shot text-to-speech (TTS) system engineered to deliver high-quality, natural-sounding speech synthesis with few requirements and strong voice-cloning capabilities. It builds on state-of-the-art models such as XTTS and other modern neural TTS backbones, improving them with a conformer-based speech conditional encoder and upgrading the decoder to a high-quality vocoder (BigVGAN2), leading to clearer and more natural audio output. The system supports zero-shot voice cloning — meaning it can mimic a target speaker’s voice from a short reference sample — making it versatile for multi-voice uses. ...
    Downloads: 10 This Week
  • 25
    HivisionIDPhoto

    HivisionIDPhotos: a lightweight and efficient AI ID photo tool

    HivisionIDPhotos is an open-source AI project designed to automatically generate professional ID photographs from ordinary portrait images. The system uses computer vision and machine learning models to detect faces, segment the subject from the background, and produce standardized identification photos suitable for official documents. It is designed as a lightweight tool that can perform inference offline and run efficiently on CPUs without requiring powerful GPUs. The software analyzes...
    Downloads: 7 This Week