Showing 57 open source projects for "video-making"

View related business solutions
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • The Original Buy Center Software. Icon
    The Original Buy Center Software.

    Never Go To The Auction Again.

    VAN sources private-party vehicles from over 20 platforms and provides all necessary tools to communicate with sellers and manage opportunities. Franchise and Independent dealers can boost their buy center strategies with our advanced tools and an experienced Acquisition Coaching™ team dedicated to your success.
    Learn More
  • 1
    Director

    Director

    AI video agents framework for next-gen video interactions

    Director is a video database management system designed to organize, search, and retrieve large collections of video content efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Android Use

    Android Use

    Automate native Android apps with AI using accessibility APIs

    ...The project works by using Android’s accessibility API to extract structured UI state (as XML) from the device, which is then fed to a large language model (LLM) like OpenAI’s models for decision-making, and actions are executed via the Android Debug Bridge (ADB). This approach bypasses expensive vision-based models and provides faster, cheaper automation with fine-grained interaction capabilities (for example, tapping buttons, typing text, navigating screens).
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    Semantic Router

    Semantic Router

    Superfast AI decision making and processing of multi-modal data

    Semantic Router is a superfast decision-making layer for your LLMs and agents. Rather than waiting for slow, unreliable LLM generations to make tool-use or safety decisions, we use the magic of semantic vector space — routing our requests using semantic meaning. Combining LLMs with deterministic rules means we can be confident that our AI systems behave as intended. Cramming agent tools into the limited context window is expensive, slow, and fundamentally limited.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Letta

    Letta

    Letta (formerly MemGPT) is a framework for creating LLM services

    Letta is an AI-powered task automation framework designed to handle workflow automation, natural language commands, and AI-driven decision-making.
    Downloads: 6 This Week
    Last Update:
    See Project
  • All-in-one security tool helps you prevent ransomware and breaches. Icon
    All-in-one security tool helps you prevent ransomware and breaches.

    SIEM + Detection and Response for IT Teams

    Blumira’s detection and response platform enables faster resolution of threats to help you stop ransomware attacks and prevent data breaches. We surface real threats, providing meaningful findings so you know what to prioritize. With our 3-step rapid response, you can automatically block known threats, use our playbooks for easy remediation, or contact our security team for additional guidance. Our responsive security team helps with onboarding, triage and ongoing consultations to continuously help your organization improve your security coverage.
    Learn More
  • 5
    SwarmZero

    SwarmZero

    SwarmZero's SDK for building AI agents, swarms of agents and much more

    SwarmZero is an open-source platform designed for deploying and managing autonomous robot swarms. It enables collective coordination, decentralized decision-making, and real-time collaboration among large groups of autonomous agents, focusing on multi-robot systems and research in swarm robotics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Activepieces

    Activepieces

    Open Source AI Automation

    ...It’s tailored for technical and non-technical users alike, enabling teams to automate repetitive tasks using a visual editor and a large library of pre-built connectors. Activepieces can be self-hosted or used via a cloud deployment, making it flexible for teams of all sizes. It supports integrations with popular services like Slack, Google Sheets, and Discord, and allows users to create custom pieces to suit unique needs. With real-time logs, version history, and scheduling, Activepieces is positioned as a compelling alternative to Zapier for open-source and privacy-conscious users.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 7
    GPT Researcher

    GPT Researcher

    LLM based autonomous agent that does online comprehensive research

    Say Hello to GPT Researcher, your AI agent for rapid insights and comprehensive research. GPT Researcher is the leading autonomous agent that takes care of everything from accurate source gathering to organization of research results.
    Downloads: 20 This Week
    Last Update:
    See Project
  • 8
    AgentForge

    AgentForge

    Extensible AGI Framework

    AgentForge is a framework for creating and deploying AI agents that can perform autonomous decision-making and task execution. It enables developers to define agent behaviors, train models, and integrate AI-powered automation into various applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    MindsDB

    MindsDB

    Making Enterprise Data Intelligent and Responsive for AI

    ...MindsDB connects to diverse data sources and applications, and unifies petabyte-scale structured and unstructured data. Powered by an industry-first cognitive engine that can operate anywhere (on-prem, VPC, serverless), it empowers both humans and AI with highly informed decision-making capabilities. A federated query engine that tidies up your data-sprawl chaos while meticulously answering every single question you throw at it. MindsDB has an MCP server built in that enables your MCP applications to connect, unify and respond to questions over large-scale federated data—spanning databases, data warehouses, and SaaS applications.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Free and Open Source HR Software Icon
    Free and Open Source HR Software

    OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.

    Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
    Learn More
  • 10
    GELab-Zero

    GELab-Zero

    GUI Exploration Lab. One of the best GUI agent solutions

    ...Because GELab-Zero is fully open-source and doesn’t require external services, it offers privacy and control: everything runs locally under your control. The project provides a lightweight base model (4B parameters in its public release) that can run on modest hardware (depending on quantization), making it more accessible than many large-scale AI solutions.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Cybergod

    Cybergod

    A program that can do anything to earn money without human operators

    AGI Computer Control is an experimental autonomous software system designed to operate independently and generate income without human intervention. It aims to simulate artificial general intelligence (AGI) by leveraging evolutionary algorithms, deep active inference, and other advanced AI techniques. The project explores the boundaries of machine autonomy and self-directed behavior in computational environments.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    TEN

    TEN

    Open-source framework for conversational voice AI agents

    TEN (Transformative Extensions Network) is an open source framework designed to empower developers to build real-time multimodal AI agents capable of voice, video, text, image, and data-stream interaction with ultra-low latency. It includes a full ecosystem, TEN Turn Detection, TEN Agent, and TMAN Designer, allowing developers to rapidly assemble human-like, responsive agents that can see, speak, hear, and interact. With support for languages like Python, C++, and Go, it offers flexible deployment on both edge and cloud environments. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    Moltbot

    Moltbot

    Your own personal AI assistant. Any OS. Any Platform.

    ...It connects seamlessly to the communication channels you already use, including popular messaging apps, collaboration tools, and web chat interfaces. Instead of being a cloud-hosted bot, Moltbot centers around a local Gateway that acts as the control plane, making the assistant feel responsive and under your control. It supports advanced capabilities like voice interaction, live visual canvases, browser automation, and multi-agent workflows. Moltbot is highly configurable, allowing you to choose models, security policies, and integrations that match your needs. Built for single users who want power without bloat, it delivers a polished assistant experience while keeping ownership of data and execution local.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 14
    Grok CLI

    Grok CLI

    An open-source AI agent that brings the power of Grok

    ...It lets you run Grok queries from your shell, scripting environment, or automation workflows without switching to a browser, enabling utility in scripting, quick data exploration, code generation, and assistant-guided tasks directly where you write code. The CLI supports streaming responses, so outputs appear in real time as the Grok model generates them, making interactions feel responsive and fluid in terminal contexts. Grok CLI is designed to integrate with existing terminal habits—aliases, pipes, editors, and tooling—so you can combine AI assistance with native command-line workflows like grep, awk, and git. It also includes authentication support, configuration management, and caching options so frequent queries are efficient.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 15
    Skyvern

    Skyvern

    Automate browser-based workflows with LLMs and Computer Vision

    Skyvern uses a combination of computer vision and AI to understand content on a webpage, making it adaptable to any website. Skyvern takes instructions in natural language, allowing it to execute complex objectives with simple commands. Skyvern is an API-first product. Workflows execute in the cloud, allowing it to run hundreds of workflows at the same time. Skyvern's AI decisions come with built-in explanations, providing clear summaries and justifications for every action.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    DeepChat

    DeepChat

    A smart assistant that connects powerful AI to your personal world

    ...As a cross-platform AI assistant application, DeepChat not only supports basic chat functionality but also offers advanced features such as search enhancement, tool calling, and multimodal interaction, making AI capabilities more accessible and efficient.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    Agent S2

    Agent S2

    Agent S: an open agentic framework that uses computers like a human

    ...Through modular architecture, it efficiently handles complex tasks, such as navigating UIs, performing low-level actions like text selection, and executing high-level strategies like planning. Additionally, the system's proactive hierarchical planning allows for real-time adaptation, making it an ideal solution for businesses seeking to streamline operations and automate digital workflows. Agent S2 is designed with flexibility, enabling seamless scaling for future applications and tasks.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    kagent

    kagent

    Kubernetes native framework for building AI agents

    ...It models core agent concepts declaratively using Kubernetes custom resources, so teams can manage agents similarly to other platform components via YAML, controllers, and standard cluster workflows. In kagent’s design, an “Agent” represents a system prompt plus a set of tools and other agents, along with an LLM configuration, making the agent definition portable and repeatable across environments. It supports multiple model providers through a dedicated configuration resource, allowing teams to switch providers or run mixed environments while keeping the agent spec stable. A major focus is tool integration via MCP: agents can connect to MCP servers for tool access, and kagent includes an MCP server with tools for common Kubernetes and platform engineering systems.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    Gemini CLI

    Gemini CLI

    Open source AI agent CLI tool to bring Gemini into your terminal

    Gemini CLI is an open‑source AI agent that brings the capabilities of Google’s Gemini 2.5 Pro large‑language model directly into your terminal, enabling tasks ranging from coding and debugging to content creation and research via natural‑language prompts, with support for multimodal outputs like image and video generation. Gemini CLI integrates with external tools and MCP servers, enabling media generation and enhanced workflow automation. It also includes a built-in Google Search tool to ground queries with relevant information. Users can authenticate with their Google accounts for free usage limits or configure API keys for higher capacity and access to specific models. ...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 20
    Dendrite

    Dendrite

    Tools to build web AI agents that can authenticate

    Dendrite Python SDK is a toolkit for building web AI agents that can authenticate, interact with, and extract data from any website, facilitating web automation tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Qwen Code

    Qwen Code

    Qwen Code is a coding agent that lives in the digital world

    ...Adapted from the Google Gemini CLI, it features an enhanced parser optimized specifically for Qwen-Coder models, enabling deep code understanding and manipulation. The tool supports querying and editing large codebases beyond traditional context limits, making it ideal for modern, complex projects. Qwen Code automates various development workflows, including handling pull requests and performing complex git rebases. It runs on Node.js (version 20 or higher) and can be installed globally via npm or from source. Users configure Qwen Code by setting API keys and endpoints, supporting both mainland China and international access. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 22
    AskUI Vision Agent

    AskUI Vision Agent

    Enable AI to control your desktop, mobile and HMI devices

    AskUI’s Vision Agent is an automation framework that allows you—and AI agents—to control real desktops, mobile devices, and HMI systems by perceiving the UI and performing actions like clicking, typing, scrolling, and drag-and-drop. It is designed for multi-platform compatibility and supports multiple AI models so you can tailor perception and decision-making to your workload. The repository presents a feature overview, sample media, and frequent release notes, which show ongoing improvements such as CORS checks and other operational tweaks. The broader AskUI documentation covers the Python Vision Agent along with suite services and inference APIs, indicating a productized ecosystem rather than a single library. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    E2B

    E2B

    Secure open source cloud runtime for AI apps & AI agents

    E2B's Code Interpreter SDK allows you to add code-interpreting capabilities to your AI apps. E2B Sandbox is a secure sandboxed cloud environment made for AI agents and AI apps. Sandboxes allow AI agents and apps to have long-running cloud secure environments. In these environments, large language models can use the same tools as humans do.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    Chatbox

    Chatbox

    The Ultimate AI Copilot on Your Desktop

    ...It runs locally on Windows, macOS, and Linux, keeping your conversations and data stored on your own device. Chatbox acts as a unified interface for popular LLMs like ChatGPT, Claude, Gemini, and local models via Ollama, making it easy to switch providers without changing tools. Built with an ergonomic UI, it’s optimized for long sessions, prompt experimentation, and everyday productivity. The app supports rich formatting, streaming responses, and advanced prompting to help you get clearer, more useful outputs. For individuals and teams alike, Chatbox serves as a powerful desktop copilot that blends simplicity with flexibility.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 25
    Phantasm

    Phantasm

    Toolkits to create a human-in-the-loop approval layer

    Phantasm offers toolkits to create a human-in-the-loop approval layer to monitor and guide AI agents' workflows in real-time, ensuring safety and reliability in AI operations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next