Showing 35 open source projects for "video-making"

  • 1
    MiniCPM-o

    A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming

    MiniCPM-o 2.6 is a cutting-edge multimodal large language model (MLLM) designed for high-performance tasks across vision, speech, and video. Capable of running on end-side devices such as smartphones and tablets, it provides powerful features like real-time speech conversation, video understanding, and multimodal live streaming. With 8 billion parameters, MiniCPM-o 2.6 surpasses its predecessors in versatility and efficiency, making it one of the most robust models available. ...
    Downloads: 0 This Week
  • 2
    Moltbot

    Your own personal AI assistant. Any OS. Any Platform.

    ...It connects seamlessly to the communication channels you already use, including popular messaging apps, collaboration tools, and web chat interfaces. Instead of being a cloud-hosted bot, Moltbot centers around a local Gateway that acts as the control plane, making the assistant feel responsive and under your control. It supports advanced capabilities like voice interaction, live visual canvases, browser automation, and multi-agent workflows. Moltbot is highly configurable, allowing you to choose models, security policies, and integrations that match your needs. Built for single users who want power without bloat, it delivers a polished assistant experience while keeping ownership of data and execution local.
    Downloads: 392 This Week
  • 3
    Android Use

    Automate native Android apps with AI using accessibility APIs

    ...The project works by using Android’s accessibility API to extract structured UI state (as XML) from the device, which is then fed to a large language model (LLM) like OpenAI’s models for decision-making, and actions are executed via the Android Debug Bridge (ADB). This approach bypasses expensive vision-based models and provides faster, cheaper automation with fine-grained interaction capabilities (for example, tapping buttons, typing text, navigating screens).
    Downloads: 5 This Week
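    The extract-decide-act loop described for Android Use can be sketched roughly as follows. This is a minimal illustration, not the project's code: the XML layout is assumed to follow the attribute style of a uiautomator hierarchy dump, `choose_element` is a hypothetical stand-in for the LLM decision step, and the result is only the argument list for an `adb shell input tap` call (nothing is executed).

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical UI dump; the attribute names (text, clickable, bounds) are an
# assumption modeled on uiautomator-style hierarchy dumps.
SAMPLE_XML = """<hierarchy>
  <node text="" class="android.widget.FrameLayout" clickable="false" bounds="[0,0][1080,1920]">
    <node text="Search" class="android.widget.EditText" clickable="true" bounds="[40,100][1040,200]" />
    <node text="Send" class="android.widget.Button" clickable="true" bounds="[800,1700][1040,1800]" />
  </node>
</hierarchy>"""

BOUNDS_RE = re.compile(r"\[(\d+),(\d+)\]\[(\d+),(\d+)\]")


def clickable_elements(xml_text):
    """Return the text and center point of every clickable node in the dump."""
    root = ET.fromstring(xml_text)
    elements = []
    for node in root.iter("node"):
        if node.get("clickable") != "true":
            continue
        match = BOUNDS_RE.fullmatch(node.get("bounds", ""))
        if match is None:
            continue
        x1, y1, x2, y2 = map(int, match.groups())
        elements.append(
            {"text": node.get("text", ""), "center": ((x1 + x2) // 2, (y1 + y2) // 2)}
        )
    return elements


def choose_element(elements, goal):
    """Hypothetical stand-in for the LLM: pick the element whose text occurs in the goal."""
    for element in elements:
        if element["text"] and element["text"].lower() in goal.lower():
            return element
    return None


def tap_command(element):
    """Build the adb argument list that taps the center of the element."""
    x, y = element["center"]
    return ["adb", "shell", "input", "tap", str(x), str(y)]


elements = clickable_elements(SAMPLE_XML)
target = choose_element(elements, "Press Send to submit the message")
command = tap_command(target)
```

    In a real setup the argument list would be passed to something like `subprocess.run`, and the keyword match would be replaced by a model call that receives the element list as structured text.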
  • 4
    Image Crop Picker

    iOS/Android image picker with support for camera, video, etc.

    Image Crop Picker is an iOS/Android image picker with support for camera, video, configurable compression, multiple images, and cropping. The module creates temporary images that are cleaned up automatically at some later point. To force cleanup, use clean to remove all temporary files, or cleanSingle(path) to remove a single temporary file. Some of these types may not be available on all iOS versions.
    Downloads: 0 This Week
  • 5
    DeepSeek R1

    Open-source, high-performance AI model with advanced reasoning

    DeepSeek-R1 is an open-source large language model developed by DeepSeek, designed to excel in complex reasoning tasks across domains such as mathematics, coding, and language. DeepSeek R1 offers unrestricted access for both commercial and academic use. The model employs a Mixture of Experts (MoE) architecture, comprising 671 billion total parameters with 37 billion active parameters per token, and supports a context length of up to 128,000 tokens. DeepSeek-R1's training regimen uniquely...
    Downloads: 103 This Week
  • 6
    OpenCV

    Open Source Computer Vision Library

    OpenCV (Open Source Computer Vision Library) is a comprehensive open-source library for computer vision, machine learning, and image processing. It enables developers to build real-time vision applications ranging from facial recognition to object tracking. OpenCV supports a wide range of programming languages including C++, Python, and Java, and is optimized for both CPU and GPU operations.
    Downloads: 13 This Week
  • 7
    AskUI Vision Agent

    Enable AI to control your desktop, mobile and HMI devices

    AskUI’s Vision Agent is an automation framework that allows you—and AI agents—to control real desktops, mobile devices, and HMI systems by perceiving the UI and performing actions like clicking, typing, scrolling, and drag-and-drop. It is designed for multi-platform compatibility and supports multiple AI models so you can tailor perception and decision-making to your workload. The repository presents a feature overview, sample media, and frequent release notes, which show ongoing improvements such as CORS checks and other operational tweaks. The broader AskUI documentation covers the Python Vision Agent along with suite services and inference APIs, indicating a productized ecosystem rather than a single library. ...
    Downloads: 6 This Week
  • 8
    Chatbox

    The Ultimate AI Copilot on Your Desktop

    ...It runs locally on Windows, macOS, and Linux, keeping your conversations and data stored on your own device. Chatbox acts as a unified interface for popular LLMs like ChatGPT, Claude, Gemini, and local models via Ollama, making it easy to switch providers without changing tools. Built with an ergonomic UI, it’s optimized for long sessions, prompt experimentation, and everyday productivity. The app supports rich formatting, streaming responses, and advanced prompting to help you get clearer, more useful outputs. For individuals and teams alike, Chatbox serves as a powerful desktop copilot that blends simplicity with flexibility.
    Downloads: 7 This Week
  • 9
    RWKV Runner

    A RWKV management and startup tool, full automation, only 8MB

    RWKV (pronounced as RwaKuv) is an RNN with GPT-level LLM performance, which can also be directly trained like a GPT transformer (parallelizable). It thus combines the best of RNN and transformer: great performance, fast inference, fast training, VRAM savings, "infinite" ctxlen, and free text embedding. Moreover, it is 100% attention-free. The default configs enable custom CUDA kernel acceleration, which is much faster and consumes much less VRAM. If you encounter possible compatibility...
    Downloads: 2 This Week
  • 10
    Protenix

    A trainable PyTorch reproduction of AlphaFold 3

    ...PDB/CIF) into full 3D atomic-level structure predictions. It supports both “full” models and lightweight variants such as “Protenix-Mini,” offering a trade-off between speed/compute cost and predictive accuracy — making structure prediction accessible even in resource-constrained environments. The project also includes support for constraints (e.g., specifying residue- or atom-level contact constraints, or pocket constraints) to guide predictions toward biologically or experimentally relevant conformations, which enhances its utility for tasks like modeling complexes, ligands, or antibody–antigen interactions.
    Downloads: 3 This Week
  • 11
    AndroidEnv

    RL research on Android devices

    android_env is a reinforcement learning (RL) environment developed by Google DeepMind that enables agents to interact with Android applications directly as a learning environment. It provides a standardized API for training agents to perform tasks on Android apps, supporting tasks ranging from games to productivity apps, making it suitable for research in real-world RL settings.
    Downloads: 0 This Week
  • 12
    MNN

    MNN is a blazing fast, lightweight deep learning framework

    ...At present, MNN has been integrated into more than 20 Alibaba apps, such as Taobao, Tmall, Youku, Dingtalk, and Xianyu, covering more than 70 usage scenarios such as live broadcast, short video capture, search recommendation, product search by image, interactive marketing, equity distribution, and security risk control. In addition, MNN is used on embedded devices, such as IoT devices. MNN Workbench can be downloaded from MNN's homepage; it provides pretrained models, visualized training tools, and one-click deployment of models to devices. ...
    Downloads: 5 This Week
  • 13
    ChatGPT Telegram Bot

    A Telegram bot that integrates with OpenAI's official ChatGPT APIs

    A Telegram bot that integrates with OpenAI's official ChatGPT, DALL·E and Whisper APIs to provide answers. Ready to use with minimal configuration required.
    Downloads: 0 This Week
  • 14
    Agent Payments Protocol (AP2)

    Building a Secure and Interoperable Future for AI-Driven Payments

    AP2 is a project released by Google’s “Agentic Commerce” initiative, focusing on a protocol and reference implementation for agent-driven or AI-mediated payments. In effect, AP2 aims to define a secure, interoperable protocol that allows software agents to act on behalf of users (making payments or shopping decisions autonomously) while preserving necessary security, auditability, and trust. The repository contains sample scenarios (in Python, Android, etc.) that illustrate how agents, servers, and payment flows would work under the protocol. It includes “types” definitions (the core message and object schema) and example agent implementations to demonstrate the mechanics of agent-to-agent and agent-to-server interactions. ...
    Downloads: 0 This Week
  • 15
    OpenCV

    Open Source Computer Vision Library

    The Open Source Computer Vision Library has >2500 algorithms, extensive documentation, and sample code for real-time computer vision. It works on Windows, Linux, Mac OS X, Android, and iOS, and in your browser through JavaScript. Languages: C++, Python, Julia, JavaScript Homepage: https://opencv.org Q&A forum: https://forum.opencv.org/ Documentation: https://docs.opencv.org Source code: https://github.com/opencv Please pay special attention to our tutorials!...
    Downloads: 3,046 This Week
  • 16
    Qwen Chat

    An AI assistant for everyone, powered by the Qwen series models

    ...Qwen Chat’s web search feature delivers fast, accurate, and real-time answers sourced from authoritative references. It can analyze and summarize large documents, extracting key insights and visualizing data for better decision-making. With multimodal understanding, Qwen Chat processes audio, images, and videos seamlessly within a single conversation. Users can also generate images, videos, and code, including real-time HTML and SVG visualizations. Available across web, mobile, and desktop, Qwen Chat offers a powerful, all-in-one AI experience for diverse user needs.
    Downloads: 101 This Week
  • 17
    Dead Deer 3.14.56.2025

    3D modeler, 3D game maker, 3D demo maker

    A 3D modeler, 3D game maker, and 3D demo maker for modeling and creating games and demos. The scripting language lets you code interactions in pseudo-C with the animation and synthesize your own rendering with your own shaders. Import FBX, BLEND, GLTF, OBJ, 3DS, DAE, X, XML, STL, PCB, ASC, PLY, GSPLATS. Cross-platform project WINDOWS 32/64 /MACOSX 10.6/ 10.8+/APPLSilicon /LINUX/iOS/ANDROID/WINDOWS PHONE/GOOGLE VR/OPEN VR/OCULUS VR/WEBASM/UWP8/10/OPENXR, PIs (ARM32/64), RISCV Players and...
    Downloads: 350 This Week
  • 18
    File-Studio

    A tool that automates complex file operations.

    File-Studio is a tool that assists with complex file operations such as bulk renaming, organizing folders, and more.
    Downloads: 17 This Week
  • 19
    chatbot

    A secure offline chatbot that stores your personal documents

    ChatBot is a lightweight, privacy-focused web application that works as a personal offline vault where users can store, search, and manage their documents, notes, images, and PDFs directly inside their browser without relying on any server or cloud service. All data stays on the user’s device in local browser storage, which keeps sensitive information off remote servers. The app features a simple chatbot-style interface that makes searching and accessing stored files fast and easy, while also offering import and export options for backups and transfers.
    Downloads: 0 This Week
  • 20

    uweb browser: unlimited power

    minimal suckless android web browser with unlimited power

    - AI bot as search engine; append file content as input for complex query.
    - Powerful: html5 enhancement; any urls to host a website; javascript and shell scripting for general processing; and more with Termux.
    - Customizable: user-defined menus, (new) buttons and gestures for user agents, bookmarklets, url services, shell commands, internal functionality links and text processing etc.
    - Convenient: book/dictionary/txt/command line/app can be search engine.
    - Tiny: less than 200k -...
    Downloads: 6 This Week
  • 21
    Poker Bot AI

    Artificial Intelligence Poker Bot for popular apps on Android

    Official site: https://pokerbotai.com/ Poker bot guides: https://pokerbotai.com/knowledge-hub/ Poker Bot AI+ is advanced poker bot software designed for research, strategy training, and decision simulations. The bot continuously analyzes table state in real time, acting as both an AI poker assistant and a full autopilot poker bot depending on your configuration. You can receive live hints from the AI (like RTA‑style advice) or choose automated play under your defined...
    Downloads: 0 This Week
  • 22
    Ainee

    Ainee - AI Notetaking and Learning Companion

    ...Capture lecture notes in real-time and effortlessly transform audio, text, files, and YouTube videos into formatted notes, mindmaps, quizzes, flashcards, podcasts, and more. Explore our AI meeting note taker, AI notes, video transcript generator, PDF to AI converter, and AI flashcard maker. Enhance your learning with our AI voice recorder, article summarizer AI, and AI quiz generator. Additionally, share your knowledge base with others to foster the flow of information and help new users benefit from collective insights. Experience smarter learning with Ainee today! ...
    Downloads: 0 This Week
  • 23
    pipeless

    A computer vision framework to create and deploy apps in minutes

    ...Pipeless is inspired by modern serverless technologies. It provides the development experience of serverless frameworks applied to computer vision. You provide some functions that are executed for new video frames and Pipeless takes care of everything else. You can easily use industry-standard models, such as YOLO, or load your custom model in one of the supported inference runtimes. Pipeless ships some of the most popular inference runtimes, such as the ONNX Runtime, allowing you to run inference with high performance on CPU or GPU out-of-the-box. ...
    Downloads: 0 This Week
  • 24

    Liveliness and Face Identification

    Leading free and open-source liveliness check & face recognition system

    ...The application has a customizable, React-based, mobile-friendly UI and a Python-based backend. The program is a real-time face detection application. It detects faces using your webcam and displays the video feed with an oval drawn around each detected face. When you run the program, a GUI window appears that performs the liveliness check and face detection. The on-screen description guides you to adjust the settings and click the "Start" button to begin face detection. If the user's picture is in the database, the matching name is shown; otherwise, you can upload your picture with a name to enable detection. ...
    Downloads: 0 This Week
  • 25
    TTS Server

    Android system TTS application with Microsoft demo interface

    tts-server-android is an Android system TTS application that acts both as a powerful local text-to-speech engine and as a flexible TTS “server” for other apps via HTTP. It includes a built-in Microsoft TTS demo interface and lets users configure custom HTTP requests, making it possible to route TTS through various cloud providers or local servers. The app can import other local TTS engines, giving Android devices a unified interface to multiple voices and providers, and it features simple narration/dialogue detection based on Chinese quotation marks so it can read text with different styles for narration and dialogue. ...
    Downloads: 31 This Week
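    The narration/dialogue detection mentioned for TTS Server can be illustrated with a short sketch. This is not the app's implementation: it assumes that dialogue is any span enclosed in the Chinese curly quotation marks “ and ”, that everything else is narration, and that quotes are not nested. The function name `split_narration_dialogue` is hypothetical.

```python
import re

# Assumption: dialogue is text wrapped in the Chinese quotation marks “ ”.
DIALOGUE_RE = re.compile(r"“[^”]*”")


def split_narration_dialogue(text):
    """Split text into ordered ("narration" | "dialogue", segment) pairs."""
    segments = []
    pos = 0
    for match in DIALOGUE_RE.finditer(text):
        if match.start() > pos:
            # Text between the previous quote and this one is narration.
            segments.append(("narration", text[pos:match.start()]))
        segments.append(("dialogue", match.group()))
        pos = match.end()
    if pos < len(text):
        # Trailing text after the last quoted span is narration.
        segments.append(("narration", text[pos:]))
    return segments


segments = split_narration_dialogue("他说:“你好。”然后离开了。")
```

    Each labeled segment could then be routed to a different voice or speaking style, which is the behavior the description attributes to the app.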