Showing 1182 open source projects for "control"

  • 1
    ManiSkill

    SAPIEN Manipulation Skill Framework

    ManiSkill is a benchmark platform for training and evaluating reinforcement learning agents on dexterous manipulation tasks using physics-based simulations. Developed by Hao Su Lab, it focuses on robotic manipulation with diverse, high-quality 3D tasks designed to challenge perception, control, and planning in robotics. ManiSkill provides both low-level control and visual observation spaces for realistic learning scenarios.
    Downloads: 0 This Week
  • 2
    LTX-2

    Python inference and LoRA trainer package for the LTX-2 audio–video

    LTX-2 is an open-source audio-video generation model developed by Lightricks. This package provides Python code for running inference with the model and for training LoRA adapters on top of it.
    Downloads: 18 This Week
  • 3
    Stable Diffusion web UI for AMDGPUs

    Stable Diffusion WebUI optimized for AMD GPUs with editing tools

    Stable Diffusion WebUI AMDGPU is a browser-based interface for generating images using Stable Diffusion, built with Gradio and adapted for AMD graphics hardware. It provides both text-to-image and image-to-image workflows, allowing users to create, refine, and upscale visuals within a single interface. It includes tools such as inpainting and outpainting for editing specific areas of an image, along with features like prompt matrix generation and attention controls to fine-tune outputs....
    Downloads: 14 This Week
  • 4
    MolmoWeb

    Open multimodal web agent built by Ai2

    MolmoWeb is an open-source multimodal web agent designed to autonomously navigate and interact with web browsers using vision-language models, representing a significant step toward fully agentic AI systems that can operate in real-world digital environments. The system takes natural language instructions and translates them into sequences of browser actions such as clicking, typing, scrolling, and navigating, effectively performing tasks on behalf of the user. Unlike traditional automation...
    Downloads: 1 This Week
  • 5
    rnet

    Python HTTP client with TLS and HTTP/2 fingerprint emulation support

    rnet is an ergonomic and modular Python HTTP client designed for developers who need advanced control over network requests and protocol behavior. It provides a flexible API for making HTTP requests while supporting both asynchronous and blocking workflows, allowing it to integrate easily into different Python applications and runtimes. rnet focuses on low-level protocol customization, giving users fine-grained control over TLS and HTTP/2 configuration in order to emulate specific browser behaviors. ...
    Downloads: 1 This Week
  • 6
    Werkzeug

    The comprehensive WSGI web application library

    ...Includes a routing system for matching URLs to endpoints and generating URLs for endpoints, with an extensible system for capturing variables from URLs. Includes HTTP utilities to handle entity tags, cache control, dates, user agents, cookies, files, and more.
    Downloads: 1 This Week
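    As a rough illustration of the routing description above, the sketch below matches URLs to endpoints, captures variables from the path, and builds a URL back from an endpoint. The endpoint names and the `example.com` host are made up for the example, and Werkzeug needs to be installed.

```python
from werkzeug.routing import Map, Rule

# Endpoint names here are illustrative assumptions, not part of Werkzeug.
url_map = Map([
    Rule("/", endpoint="index"),
    Rule("/user/<name>", endpoint="user_profile"),
    # The "int" converter captures the variable and converts it to an integer.
    Rule("/post/<int:post_id>", endpoint="post_detail"),
])

# Bind the map to a (hypothetical) server name to get a matcher/builder.
adapter = url_map.bind("example.com")

# Match a path to an endpoint and its captured variables.
endpoint, args = adapter.match("/post/42")

# Generate a URL for an endpoint from its variables.
url = adapter.build("user_profile", {"name": "alice"})
```

    Here `endpoint` and `args` hold the matched endpoint name and the captured path variables, and `url` holds the generated path.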
  • 7
    mcp-use

    A solution to build and deploy MCP agents and applications

    ...It simplifies authentication, access control, audit logging, observability, sandboxed runtime environments, and deployment workflows, whether self-hosted or managed, making MCP development production-ready. With integrations for popular frameworks like LangChain (Python) and LangChain.js (TypeScript), mcp-use accelerates the creation of tool-enabled AI agents.
    Downloads: 2 This Week
  • 8
    Open Interface

    Control Any Computer Using LLMs

    Open Interface is a cross-platform application that allows users to control their computers using large language models (LLMs). By sending user requests to an LLM backend, it determines the necessary steps and executes them by simulating keyboard and mouse inputs. The system can adjust its actions based on real-time feedback, providing a self-driving computer experience.
    Downloads: 0 This Week
  • 9
    Hyperledger Cello

    Operating System for Enterprise Blockchain

    Hyperledger Cello is a blockchain operation and provisioning system designed to automate the deployment, management, and scaling of Hyperledger Fabric networks. As part of the Hyperledger project under the Linux Foundation, Cello aims to offer Blockchain-as-a-Service (BaaS) by abstracting the complexity of infrastructure setup for consortiums and enterprises. It provides a dashboard, APIs, and orchestration tools to help users create, monitor, and manage blockchain nodes, ledgers, and...
    Downloads: 0 This Week
  • 10
    Instaloader

    Download pictures (or videos) along with their captions

    Instaloader is a mature open-source utility for downloading and archiving Instagram content along with rich metadata. It enables users to retrieve posts, stories, reels, highlights, profile pictures, and associated information such as captions, comments, timestamps, and geotags. The tool supports both public and permitted private content when proper authentication is provided, making it useful for research, digital archiving, and social media analysis. Instaloader can be run as a simple...
    Downloads: 10 This Week
  • 11
    Step-Audio

    Open-source framework for intelligent speech interaction

    ...Through its architecture, Step-Audio supports multilingual interaction, dialects, emotional tones (joy, sadness, etc.), and even more creative speech styles (like rap or singing), while allowing dynamic control over speech characteristics. It also provides a “generative data engine,” which can produce synthetic speech data (cloning voices, varying style) to support TTS training.
    Downloads: 0 This Week
  • 12
    pyttsx3

    Offline text-to-speech synthesis for Python

    pyttsx3 is an offline text-to-speech library for Python that wraps native speech engines instead of calling cloud APIs. It is designed to work entirely without an internet connection, making it suitable for local automation, kiosks, accessibility tools, and embedded applications. On Windows it uses SAPI5, on Linux it typically uses eSpeak or eSpeak-NG, and on macOS it can use NSSpeechSynthesizer or AVSpeechSynthesizer, giving it broad cross-platform compatibility. The library exposes a...
    Downloads: 14 This Week
  • 13
    ChatTTS

    A generative speech model for daily dialogue

    ChatTTS is an open-source conversational text-to-speech model optimized for dialogue, developed by 2Noise. Trained on 100,000+ hours of English and Chinese conversation data, it excels at generating expressive prosody—pauses, interjections, laughter—for more natural-sounding speech synthesis in assistant and chatbot applications.
    Downloads: 3 This Week
  • 14
    OpenKore

    A free/open source client and automation tool for Ragnarok Online

    ...With a strong community and extensive documentation, it is widely used by players and developers who want to explore automation, bot behavior, or server testing. The software emphasizes configurability, giving users control over character strategies and in-game decision-making.
    Downloads: 9 This Week
  • 15
    Translation Agent

    Agentic translation using reflection workflow

    ...Instead of asking a language model for a single direct translation, it first generates a translation, then asks the model to critique it, and finally uses that critique to produce a stronger version. This structure makes the system more steerable than a traditional translation pipeline. Users can adjust prompts to control tone, formality, terminology, idiom handling, and regional language choices. The project includes example scripts and a simple Python interface for translating between source and target languages. It is useful for researchers, developers, and localization teams exploring how LLM-based workflows can produce better, more customizable translations.
    Downloads: 0 This Week
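    The translate, critique, and improve loop described for Translation Agent can be sketched roughly as follows. This is not the project's actual interface: `reflective_translate`, the `LLM` type, and the prompt wording are all assumptions made for illustration, and `llm` stands in for any function that sends a prompt to a model and returns its reply.

```python
from typing import Callable

# Hypothetical stand-in for a model call: prompt in, text reply out.
LLM = Callable[[str], str]


def reflective_translate(
    llm: LLM,
    text: str,
    source_lang: str,
    target_lang: str,
    style_notes: str = "",
) -> str:
    """Translate, ask the model to critique the draft, then revise."""
    # Step 1: produce an initial translation.
    draft = llm(
        f"Translate the following text from {source_lang} to {target_lang}.\n"
        f"{style_notes}\n\nText:\n{text}"
    )
    # Step 2: ask the model to critique its own draft.
    critique = llm(
        f"You are reviewing a {source_lang}-to-{target_lang} translation.\n"
        f"{style_notes}\n\nSource:\n{text}\n\nTranslation:\n{draft}\n\n"
        "List concrete suggestions to improve accuracy, fluency, and style."
    )
    # Step 3: use the critique to produce a stronger version.
    return llm(
        "Revise the translation using the suggestions.\n\n"
        f"Source:\n{text}\n\nTranslation:\n{draft}\n\nSuggestions:\n{critique}"
    )
```

    In this sketch, `style_notes` is where instructions about tone, formality, terminology, or regional language choices would go, since it is passed to both the drafting and the critique prompts.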
  • 16
    OpenPlanter

    Language-model investigation agent with a terminal UI

    OpenPlanter is an open-source Python project that provides a language-model investigation agent operated through a terminal UI.
    Downloads: 0 This Week
  • 17
    NitroGen

    A Foundation Model for Generalist Gaming Agents

    ...As an open research model, NitroGen is trained on extensive gameplay data spanning thousands of hours and hundreds of games to instill broad, generalizable gaming competency rather than skill at a single title. This approach enables the model to control agents in different game genres and contexts, performing tasks that range from complex exploration and combat to fine-grained control in platformers, demonstrating adaptability across unseen environments. The project draws on MineDojo’s broader ecosystem for embodied AI, where multi-modal inputs and richly diverse benchmarks help push toward generalist AI capable of interactive decision making.
    Downloads: 0 This Week
  • 18
    Flask-Migrate

    SQLAlchemy database migrations for Flask applications using Alembic

    ...Flask-Migrate also supports branching and merging workflows by generating human-readable migration scripts that developers can review, edit, and commit to version control.
    Downloads: 0 This Week
  • 19
    Unrud Video Downloader

    Download videos from websites like YouTube and many others

    ...The software is distributed across multiple platforms, including Linux package managers and containerized environments, ensuring broad accessibility. It includes configuration options and debugging capabilities for advanced users who want more control over the download process.
    Downloads: 13 This Week
  • 20
    Meta-World

    Collections of robotics environments

    Meta-World is an open-source benchmark suite of robotic manipulation environments focused on multi-task and meta reinforcement learning. It provides a large collection of continuous-control tasks, such as reaching, pushing, opening doors, and manipulating objects with a simulated robot arm. The library defines standardized benchmarks like MT1, MT10, and MT50 for multi-task learning, where a single policy is trained across different numbers of tasks. It also offers meta-learning benchmarks (ML1, ML10, ML45) that evaluate few-shot adaptation to new goals or entirely new tasks. ...
    Downloads: 1 This Week
  • 21
    GELab-Zero

    GUI Exploration Lab. One of the best GUI agent solutions

    ...The idea is to let developers or users harness an AI agent that can simulate clicking, typing, reading UI elements, and interacting with apps in a human-like way via the GUI, which can enable tasks like automated testing, scriptable workflows, or even autonomous usage of GUI-based applications. Because GELab-Zero is fully open-source and doesn’t require external services, it offers privacy and control: everything runs locally under your control. The project provides a lightweight base model (4B parameters in its public release) that can run on modest hardware (depending on quantization), making it more accessible than many large-scale AI solutions.
    Downloads: 0 This Week
  • 22
    Operit AI

    Powerful Android AI agent with tools, automation, and Linux shell

    Operit is a full-featured AI assistant and agent platform designed specifically for Android devices, aiming to go far beyond traditional chat-based interfaces. It integrates deep system-level capabilities with a wide range of tools, allowing the AI to perform real tasks such as file management, automation, and system control directly on the device. A standout aspect of the project is its built-in Ubuntu 24 environment, which enables users to run Linux commands, scripts, and development tools in a mobile context. Operit supports both local and remote AI models, including offline execution through frameworks like llama.cpp and MNN, helping preserve user privacy while maintaining flexibility. ...
    Downloads: 12 This Week
  • 23
    edge-tts

    Use Microsoft Edge's online text-to-speech service from Python

    ...It also supports generating subtitle files (such as SRT or VTT) alongside the speech, which is handy for video narration, e-learning, or accessibility workflows. From the CLI you can adjust parameters such as speaking rate, volume, and pitch, giving you some control over prosody without diving into SSML. The library is asynchronous under the hood, which makes it efficient for batch jobs or web services that need to synthesize many utterances concurrently.
    Downloads: 17 This Week
  • 24
    Open Interpreter

    A natural language interface for computers

    ...It prompts you to approve code before executing it, and supports both hosted LLMs and local inference servers. It seeks to combine convenience (like ChatGPT’s code interpreter) with control and flexibility by running on your own machine.
    Downloads: 18 This Week
  • 25
    Toloka-Kit

    Toloka-Kit is a Python library for working with Toloka API

    ...For example, you can pass data between two related projects: one for data labeling, and another for its validation. The AutoQuality feature automatically finds the best-fitting quality control rules for your project.
    Downloads: 1 This Week