control free download

Showing 266 open source projects for "control"

View related business solutions

Artificial Intelligence Python Clear Filters & Widen Search

Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
1

Agent Control

Centralized agent control plane for governing runtime agent behavior

Agent Control is a centralized control plane for governing AI agent behavior at runtime across different frameworks and deployment environments. It lets teams define controls once and apply them consistently to agents without rewriting the agent’s core code. The platform evaluates agent inputs and outputs against configurable policies to reduce risks such as prompt injection, unsafe responses, sensitive data exposure, and policy drift.

Downloads: 3 This Week

Last Update: 3 days ago
See Project
2

Data Version Control

Git-based data version control for machine learning workflows

DVC (Data Version Control) is an open source tool designed to bring version control principles to machine learning and data science workflows. It enables developers and data scientists to track datasets, machine learning models, and experiment results in a way that integrates with existing Git repositories. Instead of storing large datasets directly in Git, DVC keeps lightweight metadata in the repository while storing the actual data in external storage systems.

Downloads: 2 This Week

Last Update: 2026-03-31
See Project
3

AUTOMATIC1111 Stable Diffusion web UI

Stable Diffusion web UI

...Supporting both text-to-image (txt2img) and image-to-image (img2img) generation, this open-source UI offers a rich feature set including inpainting, outpainting, attention control, and multiple advanced upscaling options. With a flexible installation process across Windows, Linux, and Apple Silicon, plus support for GPUs and CPUs, it caters to a wide range of users—from hobbyists to professionals. The interface also supports prompt editing, batch processing, custom scripts, and many community extensions, making it a highly customizable and continually evolving platform for creative AI art generation.

1 Review

Downloads: 171 This Week

Last Update: 2025-06-02
See Project
4

OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model

OpenVoice is a versatile instant voice cloning system that can replicate a speaker’s tone color from just a short audio clip and then generate speech in multiple languages. It is designed not only to match the timbre of the reference voice, but also to give granular control over style parameters such as emotion, accent, rhythm, pauses, and intonation. The model supports cross-lingual and even zero-shot cross-lingual voice cloning, so a speaker recorded in one language can be made to speak naturally in others. Architecturally, OpenVoice separates “tone color” cloning from style control, which makes it easier to keep a consistent identity while flexibly changing prosody or language. ...

Downloads: 91 This Week

Last Update: 2025-11-28
See Project
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
5

KaTrain

Improve your Baduk skills by training with KataGo

...One of its key strengths is its ability to generate detailed post-game analyses, highlighting the moves that resulted in the greatest loss of points and suggesting improvements. KaTrain also includes interactive learning features such as retrying moves, exploring variations, and visualizing territory control probabilities.

Downloads: 50 This Week

Last Update: 2026-06-08
See Project
6

Ideogram 4

Open image model at the forefront of design

Ideogram 4 is an open-weight text-to-image model focused on high-quality visual generation, design control, and accurate text rendering inside images. It is built for users who need more than generic image generation, especially when layout, typography, composition, color, and language understanding matter. The project introduces a structured JSON prompting workflow that gives creators more explicit control over scene details and visual constraints.

Downloads: 6 This Week

Last Update: 2026-06-05
See Project
7

MoneyPrinterTurbo

Generate short videos with one click using AI LLM

MoneyPrinterTurbo is an AI-driven tool that enables users to generate high-definition short videos with minimal input. By providing a topic or keyword, the system automatically creates video scripts, sources relevant media assets, adds subtitles, and incorporates background music, resulting in a polished video ready for distribution.

Downloads: 167 This Week

Last Update: 2026-06-11
See Project
8

Open Notebook

An Open Source implementation of Notebook LM with more flexibility

Open Notebook is an open-source, privacy-focused alternative to Google’s Notebook LM that gives users full control over their research and AI workflows. Designed to be self-hosted, it ensures complete data sovereignty by keeping your content local or within your own infrastructure. The platform supports 16+ AI providers—including OpenAI, Anthropic, Ollama, Google, and LM Studio—allowing flexible model choice and cost optimization. Open Notebook enables users to organize and analyze multi-modal content such as PDFs, videos, audio files, web pages, and Office documents. ...

Downloads: 34 This Week

Last Update: 3 days ago
See Project
9

Wan2.2

Wan2.2: Open and Advanced Large-Scale Video Generative Model

...It introduces a Mixture-of-Experts (MoE) architecture that splits the denoising process across specialized expert models, increasing total model capacity without raising computational costs. Wan2.2 integrates meticulously curated cinematic aesthetic data, enabling precise control over lighting, composition, color tone, and more, for high-quality, customizable video styles. The model is trained on significantly larger datasets than its predecessor, greatly enhancing motion complexity, semantic understanding, and aesthetic diversity. Wan2.2 also open-sources a 5-billion parameter high-compression VAE-based hybrid text-image-to-video (TI2V) model that supports 720P video generation at 24fps on consumer-grade GPUs like the RTX 4090. ...

1 Review

Downloads: 98 This Week

Last Update: 2026-03-17
See Project
Atera - an All-in-one platform for IT management
Ideal for IT departments and MSPs (managed service providers)

Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!

Try Atera now
10

Guidance

A guidance language for controlling large language models

Guidance is an efficient programming paradigm for steering language models. With Guidance, you can control how output is structured and get high-quality output for your use case—while reducing latency and cost vs. conventional prompting or fine-tuning. It allows users to constrain generation (e.g. with regex and CFGs) as well as to interleave control (conditionals, loops, tool use) and generation seamlessly.

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
11

Open-AutoGLM

An open phone agent model & framework

Open-AutoGLM is an open-source framework and model designed to empower autonomous mobile intelligent assistants by enabling AI agents to understand and interact with phone screens in a multimodal manner, blending vision and language capability to control real devices. It aims to create an “AI phone agent” that can perceive on-screen content, reason about user goals, and execute sequences of taps, swipes, and text input via automated device control interfaces like ADB, enabling hands-off completion of multi-step tasks such as navigating apps, filling forms, and more. Unlike traditional automation scripts that depend on brittle heuristics, Open-AutoGLM uses pretrained large language and vision-language models to interpret visual context and natural language instructions, giving the agent robust adaptability across apps and interfaces.

Downloads: 10 This Week

Last Update: 2026-03-06
See Project
12

OpenMontage

World's first open-source, agentic video production system

OpenMontage is an open-source, agent-driven video production system that transforms AI coding assistants into fully automated multimedia creation pipelines. Instead of focusing on a single capability such as text-to-video generation, it treats video production as a structured, multi-stage workflow that mirrors how a real production team operates, including research, scripting, asset generation, editing, and final rendering. The system orchestrates a large collection of tools and models...

Downloads: 190 This Week

Last Update: 2026-05-07
See Project
13

YandexStation

Management of Yandex Station and other smart home devices

...In local control mode, the component can read back what is currently playing, including album art, and supports seeking and track skipping, which is more limited in cloud-only mode.

Downloads: 2 This Week

Last Update: 2026-06-11
See Project
14

OmniVoice

High-Quality Voice Cloning TTS for 600+ Languages

...One of its most notable capabilities is zero-shot voice cloning, allowing users to replicate a speaker’s voice using only a short reference audio clip. In addition, it supports voice design through configurable attributes such as gender, accent, pitch, and speaking style, giving users fine-grained control over generated speech. The system also includes advanced features like non-verbal expression tags and pronunciation overrides, enabling expressive and precise output. With support for both API-based and command-line usage, it is designed for research, production, and experimentation alike.

Downloads: 23 This Week

Last Update: 2026-04-28
See Project
15

LTX-2.3

Official Python inference and LoRA trainer package

...This unified approach allows creators to generate complete multimedia sequences where motion, timing, and sound are aligned automatically. LTX-2 is designed for both research and production workflows and can generate high-resolution video clips with precise control over structure, motion, and camera behavior.

Downloads: 117 This Week

Last Update: 2026-05-28
See Project
16

Chatterbox

SoTA open-source TTS

...Whether you're working on memes, videos, games, or AI agents, Chatterbox brings your content to life. It's also the first open source TTS model to support emotion exaggeration control, a powerful feature that makes your voices stand out. Try it now on our Hugging Face Gradio app. If you like the model but need to scale or tune it for higher accuracy, check out our competitively priced TTS service (link). It delivers reliable performance with ultra-low latency of sub-200ms—ideal for production use in agents, applications, or interactive media.

Downloads: 11 This Week

Last Update: 2025-06-25
See Project
17

Windows-MCP

MCP server enabling AI agents to control and automate Windows OS

Windows-MCP is a lightweight open source project designed to connect AI agents with the Windows operating system through a Model Context Protocol server. It acts as a bridge that allows large language models to directly interact with desktop environments, enabling automated control over applications, files, and system interfaces. Windows-MCP provides capabilities such as file navigation, application management, UI interaction, and QA testing workflows, making it suitable for building autonomous desktop agents. It focuses on native interaction with Windows UI elements rather than relying on traditional computer vision techniques, which simplifies integration and improves efficiency. ...

Downloads: 3 This Week

Last Update: 2026-06-09
See Project
18

ChatTTS webUI & API

A simple native web interface that uses ChatTTS to synthesize text

...It runs a small backend server (Python + Torch + ffmpeg) and exposes a simple webpage where you can type text, adjust parameters, and generate audio. The project supports Chinese, English, and mixed text with digits and control symbols, making it suitable for bilingual content and numerically heavy text like announcements or prompts. From version 0.96 onward, ffmpeg installation is required for deployment, and previous CSV/PT voice tables are no longer valid, so users instead work with updated “voice value” parameters. For convenience, there is a prepackaged Windows build: you download a release archive, extract it, and double-click app.exe to start the web UI, which opens on localhost:9966.

Downloads: 17 This Week

Last Update: 7 days ago
See Project
19

MCP for Unity

AI bridge enabling assistants to control and automate Unity Editor

...It exposes Unity functionality as callable tools so that AI systems can understand and manipulate game development workflows programmatically. This approach allows developers to control Unity using natural language prompts and automated workflows rather than manual editor interaction. Unity MCP supports various AI assistants and development tools that implement MCP clients, enabling flexible integration with existing AI development environments.

Downloads: 3 This Week

Last Update: 6 days ago
See Project
20

firerpa LAMDA

The most powerful Android RPA agent framework

lamda is an Android RPA agent framework that provides visual remote desktop control and automation at scale, geared toward testing, automation validation, and device management. It exposes a clean UI to monitor and interact with connected devices and includes tooling to script actions reliably across apps and OS versions. The project emphasizes low-friction setup and powerful control primitives so teams can move from interactive validation to repeatable automation.

Downloads: 0 This Week

Last Update: 2026-03-22
See Project
21

HunyuanVideo-Avatar

Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model

...Innovations include a character image injection module, an Audio Emotion Module for transferring emotion cues, and a Face-Aware Audio Adapter to isolate audio effects on faces, enabling multiple characters to be animated in a scene. Character image injection module for better consistency between training and inference conditioning. Emotion control by extracting emotion reference images and transferring emotional style into video sequences.

Downloads: 2 This Week

Last Update: 2025-12-16
See Project
22

RL Games

RL implementations

rl_games is a high-performance reinforcement learning framework optimized for GPU-based training, particularly in environments like robotics and continuous control tasks. It supports advanced algorithms and is built with PyTorch.

Downloads: 1 This Week

Last Update: 2026-02-20
See Project
23

PySyft

Data science on data without acquiring a copy

Most software libraries let you compute over the information you own and see inside of machines you control. However, this means that you cannot compute on information without first obtaining (at least partial) ownership of that information. It also means that you cannot compute using machines without first obtaining control over those machines. This is very limiting to human collaboration and systematically drives the centralization of data, because you cannot work with a bunch of data without first putting it all in one (central) place. ...

Downloads: 2 This Week

Last Update: 2025-02-13
See Project
24

Video-subtitle-remover (VSR)

AI tool that removes hardcoded subtitles and text from videos locally

...In addition to video processing, the project supports removing text-like watermarks from images through similar techniques. The processing runs locally without requiring any external API services, enabling offline use and greater control over the data being processed.

Downloads: 67 This Week

Last Update: 2026-04-11
See Project
25

BlenderMCP

Blender Model Context Protocol Integration

BlenderMCP is a bridge that connects Blender, a 3D modeling and rendering software, with AI systems like Claude through the Model Context Protocol, enabling direct AI-driven interaction with 3D environments. It allows users to control Blender using natural language prompts, effectively turning AI into a co-creator for 3D modeling, scene construction, and asset manipulation. The system establishes a two-way communication channel between Blender and the AI, where commands can be sent and results retrieved in real time. It includes features for object manipulation, material editing, and scene inspection, giving the AI deep control over the modeling environment. ...

Downloads: 1 This Week

Last Update: 2026-06-11
See Project