Open Source Artificial Intelligence Software - Page 12

Sort By:

Artificial Intelligence Software

View 13598 business solutions

Artificial Intelligence Clear Filters

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Fully Managed MySQL, PostgreSQL, and SQL Server
Automatic backups, patching, replication, and failover. Focus on your app, not your database.

Cloud SQL handles your database ops end to end, so you can focus on your app.

Try Free
1

SpeechRecognition

Speech recognition module for Python

Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip install SpeechRecognition. The first software requirement is Python 2.6, 2.7, or Python 3.3+. This is required to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.

Downloads: 26 This Week

Last Update: 2026-04-05
See Project
2

Strix

Open-source AI hackers to find and fix your app’s vulnerabilities

Strix is an open source agent-driven security platform that uses autonomous AI agents to identify, investigate, and validate vulnerabilities in software applications. The system is designed to mimic the behavior of real attackers by executing dynamic testing and verifying findings through proof-of-concept exploitation. Unlike traditional vulnerability scanners that rely heavily on static analysis, Strix agents actively run code, probe systems, and attempt exploitation to confirm whether vulnerabilities are genuinely exploitable. The platform is intended for developers and security teams that need rapid security assessments without the overhead of manual penetration testing engagements. Strix can orchestrate multiple cooperating agents that divide investigation tasks and collaboratively analyze complex applications or infrastructure.

Downloads: 26 This Week

Last Update: 2026-03-23
See Project
3

Tabby

Self-hosted AI coding assistant

Tabby is an open-source, self-hosted AI coding assistant that provides code completion and suggestions within your preferred development environment. It enhances developer productivity by leveraging machine learning models to predict and generate code snippets.

Downloads: 26 This Week

Last Update: 2026-01-25
See Project
4

CoPaw

Your Personal AI Assistant; easy to install, deploy on local or coud

CoPaw is a personal AI assistant designed to run on your own machine or in the cloud, giving you full control over memory, models, and data. Built by the AgentScope team, it connects to multiple chat platforms—including DingTalk, Feishu, QQ, Discord, iMessage, and more—through a single unified assistant. CoPaw supports both cloud-based LLM providers and fully local models such as llama.cpp, MLX, and Ollama, allowing you to operate without API keys if preferred. It includes a browser-based Console for chatting, configuring models, managing memory, and extending capabilities with custom skills. With built-in cron scheduling, heartbeat check-ins, and extensible skill loading, CoPaw grows with your workflow over time. Easy installation options—including pip, one-line scripts, Docker, and cloud deployment—make it accessible for both developers and non-technical users.

1 Review

Downloads: 25 This Week

Last Update: 2 days ago
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
5

CodePilot

A native desktop GUI for Claude Code

CodePilot is a native desktop graphical user interface built for Claude Code that lets developers chat with, code with, and manage AI-assisted projects visually rather than through the terminal. Created with Electron and Next.js, CodePilot delivers a polished experience where users can talk to Claude models, view syntax-highlighted responses, attach files, and inspect project context via a live file tree. It supports session management so chats and project work persist between restarts, letting users pick up where they left off without losing history. Unlike traditional CLI-only workflows, CodePilot brings panels, drag-to-resize layouts, and controls for tool permissions that make it feel like a modern desktop code assistant. It also includes project-aware context so Claude understands the specific codebase you’re working on, helping generate smarter suggestions and clearer explanations.

Downloads: 25 This Week

Last Update: 2 days ago
See Project
6

GitHub Copilot CLI

GitHub Copilot CLI brings the power of Copilot coding agent

GitHub Copilot CLI is a command-line interface tool. It brings AI-powered coding assistance directly into your terminal. GitHub Copilot CLI allows you to build, debug, refactor, and understand code via natural language conversations. You can have these conversations within the Active Directory. It integrates tightly with your GitHub context—repositories, issues, pull requests. The Copilot can leverage repository context when making suggestions. The tool is currently in public preview and is evolving. Preview proposed changes before committing or applying them. Extensibility via MCP (Modular Copilot Platform) servers or custom servers.

1 Review

Downloads: 25 This Week

Last Update: 1 day ago
See Project
7

GoogleTest

Google Testing and Mocking Framework

GoogleTest is Google's C++ mocking and test framework. It's used by many internal projects at Google, as well as a number of notable projects such as The Chromium projects, the OpenCV computer vision library, and the LLVM compiler. This GoogleTest project is actually a union of what used to be two separate projects: the old GoogleTest and GoogleMock, an extension of GoogleTest for writing and using C++ mock classes. Since they were so closely related, they were merged to create an even better GoogleTest. GoogleTest features an xUnit test framework, a rich set of assertions, user-defined assertions, death tests, among many others. It's been used on a variety of platforms, including Cygwin, Symbian, MinGW and PlatformIO.

Downloads: 25 This Week

Last Update: 2025-04-30
See Project
8

Happy Coder

Mobile and Web client for Codex and Claude Code, with realtime voice

Happy is an open-source, cross-platform mobile and web client designed to bring powerful AI coding agents such as Claude Code and Codex to your fingertips no matter where you are. At its core, Happy wraps existing AI coding tools with a unified interface, providing real-time voice interactions, encrypted communication, and seamless device switching between desktop and mobile. You can start a coding session locally through the Happy CLI or connect from a phone or browser, allowing developers to inspect, interact with, and guide the AI as it generates, tests, or explains code. The project includes components like a dedicated backend server for encrypted sync, a rich front-end experience across web and native apps, and support for push notifications when your coding agent encounters permission requests or errors. Happy prioritizes security with end-to-end encryption so your code and interactions remain private and auditable.

Downloads: 25 This Week

Last Update: 24 hours ago
See Project
9

KrillinAI

Video translation and dubbing tool powered by LLMs

KrillinAI is an end-to-end content localization, translation, and dubbing tool aimed at helping creators transform videos into multiple languages with minimal manual effort. It integrates several stages of the pipeline: video acquisition (either from local files or remote via download tools), speech recognition (ASR), subtitle segmentation and alignment, machine translation (with context-aware translation to preserve semantics), and voice cloning + text-to-speech (TTS) to produce dubbed audio tracks. KrillinAI supports both landscape and portrait videos, which makes it suitable for a wide range of platforms — from YouTube to TikTok or other vertical-video sites — and ensures correct formatting and layout for the final video. The tool offers “one-click” workflows and desktop versions, lowering the barrier for users who may not be familiar with video editing or audio processing pipelines.

Downloads: 25 This Week

Last Update: 2025-11-28
See Project
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
10

MemU

MemU is an open-source memory framework for AI companions

MemU is an agentic memory layer for LLM applications, specifically designed for AI companions. Transform your memory into an intelligent file system that automatically organizes, connects, and evolves with your memories. Simple, fast, and reliable memory infrastructure for AI applications. Powerful tools and dedicated support to scale your AI applications with confidence. Full proprietary features, commercial usage rights, and white-labeling options for your enterprise needs. SSO/RBAC integration and a dedicated algorithm team for scenario-specific optimization. User behavior analysis, real-time monitoring, and automated agent optimization tools. 24/7 dedicated support team, custom SLAs, and professional implementation services.

Downloads: 25 This Week

Last Update: 2026-03-23
See Project
11

Mochi Diffusion

Run Stable Diffusion on Mac natively

Run Stable Diffusion on Mac natively. This app uses Apple's Core ML Stable Diffusion implementation to achieve maximum performance and speed on Apple Silicon based Macs while reducing memory requirements. Extremely fast and memory efficient (~150MB with Neural Engine) Runs well on all Apple Silicon Macs by fully utilizing Neural Engine. Generate images locally and completely offline. Generate images based on an existing image (commonly known as Image2Image) Generated images are saved with prompt info inside EXIF metadata (view in Finder's Get Info window) Convert generated images to high resolution (using RealESRGAN) Autosave & restore images. Use custom Stable Diffusion Core ML models. No worries about pickled models. macOS native app using SwiftUI.

Downloads: 25 This Week

Last Update: 2026-02-28
See Project
12

Paperclip

Open-source orchestration for zero-human companies

Paperclip is an open-source tool designed to help AI systems and developer tools access academic research papers through a standardized interface. The project implements a server based on the Model Context Protocol (MCP), a framework that allows large language models and AI agents to connect to external data sources and tools in a consistent way. By acting as a middleware layer, Paperclip aggregates multiple academic databases and exposes them through a single interface, allowing AI applications to search and retrieve scholarly papers without needing to integrate with each provider individually. The system supports repositories such as arXiv, OpenAlex, and the Open Science Framework, giving AI agents access to a large body of research literature. Instead of requiring separate APIs and authentication flows for each service, Paperclip provides unified search and retrieval capabilities that simplify integration into AI workflows.

Downloads: 25 This Week

Last Update: 2026-04-04
See Project
13

ShellGPT

A command-line productivity tool powered by AI large language models

A command-line productivity tool powered by AI large language models (LLM). This command-line tool offers a streamlined generation of shell commands, code snippets, and documentation, eliminating the need for external resources (like Google search). Supports Linux, macOS, and Windows and is compatible with all major Shells like PowerShell, CMD, Bash, Zsh, etc. By default, ShellGPT uses OpenAI's API and GPT-4 model. You'll need an API key, you can generate one here. You will be prompted for your key which will then be stored in ~/.config/shell_gpt/.sgptrc. OpenAI API is not free of charge, please refer to the OpenAI pricing for more information.

Downloads: 25 This Week

Last Update: 2026-01-28
See Project
14

VibeVoice

Open-source multi-speaker long-form text-to-speech model

VibeVoice-1.5B is Microsoft’s frontier open-source text-to-speech (TTS) model designed for generating expressive, long-form, multi-speaker conversational audio such as podcasts. Unlike traditional TTS systems, it excels in scalability, speaker consistency, and natural turn-taking for up to 90 minutes of continuous speech with as many as four distinct speakers. A key innovation is its use of continuous acoustic and semantic speech tokenizers operating at an ultra-low frame rate of 7.5 Hz, enabling high audio fidelity with efficient processing of long sequences. The model integrates a Qwen2.5-based large language model with a diffusion head to produce realistic acoustic details and capture conversational context. Training involved curriculum learning with increasing sequence lengths up to 65K tokens, allowing VibeVoice to handle very long dialogues effectively. Safety mechanisms include an audible disclaimer and imperceptible watermarking in all generated audio to mitigate misuse risks.

Downloads: 25 This Week

Last Update: 5 days ago
See Project
15

pyttsx3

Offline Text To Speech synthesis for python

pyttsx3 is an offline text-to-speech library for Python that wraps native speech engines instead of calling cloud APIs. It is designed to work entirely without an internet connection, making it suitable for local automation, kiosks, accessibility tools, and embedded applications. On Windows it uses SAPI5, on Linux it typically uses eSpeak or eSpeak-NG, and on macOS it can use NSSpeechSynthesizer or AVSpeechSynthesizer, giving it broad cross-platform compatibility. The library exposes a simple but flexible API for controlling voice selection, speaking rate, volume, and other synthesis parameters from Python code. It supports both a high-level speak convenience function and a lower-level engine object with event hooks, queuing, and saving output to audio files. The repository includes examples and documentation that show how to adjust properties dynamically, persist synthesized output, and integrate pyttsx3 into GUIs or background services.

Downloads: 25 This Week

Last Update: 2025-11-28
See Project
16

FLUX.1

Official inference repo for FLUX.1 models

FLUX.1 repository contains inference code and tooling for the FLUX.1 text-to-image diffusion models, enabling developers and researchers to generate and edit images from natural-language prompts using open-weight versions of the model on their own hardware or within custom applications. The project is part of a larger family of FLUX models developed by Black Forest Labs, designed to produce high-quality, detailed visuals from text descriptions with competitive prompt adherence and artistic fidelity. This repo focuses on running the open-source model variants efficiently, providing scripts, model loading logic, and examples for local installations, and supports integration with Python toolchains like PyTorch and popular generative pipelines. Users can launch CLI tools to generate images, experiment with different FLUX variants, and extend the base code for research-oriented applications.

Downloads: 24 This Week

Last Update: 2026-01-19
See Project
17

HolmesGPT

CNCF Sandbox Project

HolmesGPT is an open-source AI agent designed to help DevOps and site reliability engineering teams diagnose and resolve production incidents. The system aggregates signals from observability tools such as logs, metrics, alerts, and distributed traces, then analyzes them using large language models to identify potential root causes. Rather than requiring engineers to manually correlate large volumes of monitoring data, HolmesGPT automatically synthesizes evidence and presents explanations in natural language. The project is developed by Robusta and has been accepted as a Cloud Native Computing Foundation Sandbox project, highlighting its relevance to the cloud-native ecosystem. It is designed to operate as an automated troubleshooting assistant that can analyze incidents continuously and support on-call engineers during outages.

Downloads: 24 This Week

Last Update: 5 days ago
See Project
18

LiteRT

LiteRT is the new name for TensorFlow Lite (TFLite)

LiteRT is an experimental, real-time inference runtime built by Google AI Edge to run lightweight ML models on edge devices with ultra-low latency. It focuses on delivering predictable and consistent performance for models used in time-critical applications like robotics, AR/VR, and IoT. LiteRT is designed to be hardware-agnostic, with minimal dependencies and tight control over execution scheduling.

Downloads: 24 This Week

Last Update: 20 hours ago
See Project
19

Loki Mode

Multi-agent autonomous startup system for Claude Code

Loki Mode is a multi-agent autonomous execution system designed to take structured product requirements or specifications and autonomously drive the creation, testing, deployment, and scaling of complex software projects using a large team of specialized AI agents. It orchestrates dozens of agent types across swarms that handle designated roles — such as architecture, coding, QA, deployment, and business workflows — running in parallel to cover both engineering and operational tasks without continuous human intervention. By supporting multiple AI providers (like Claude Code, OpenAI Codex CLI, and Google Gemini CLI), loki-mode dynamically selects and spawns only the needed agents for a given project, optimizing computational resources and task throughput. Its Reason-Act-Reflect-Verify (RARV) cycle with self-verification loops emphasizes quality and resilience, automating end-to-end development lifecycles.

Downloads: 24 This Week

Last Update: 2026-04-06
See Project
20

MoneyPrinterTurbo

Generate short videos with one click using AI LLM

MoneyPrinterTurbo is an AI-driven tool that enables users to generate high-definition short videos with minimal input. By providing a topic or keyword, the system automatically creates video scripts, sources relevant media assets, adds subtitles, and incorporates background music, resulting in a polished video ready for distribution.

Downloads: 24 This Week

Last Update: 2026-04-03
See Project
21

ONNX

Open standard for machine learning interoperability

ONNX is an open format built to represent machine learning models. ONNX defines a common set of operators - the building blocks of machine learning and deep learning models - and a common file format to enable AI developers to use models with a variety of frameworks, tools, runtimes, and compilers. Open Neural Network Exchange (ONNX) is an open ecosystem that empowers AI developers to choose the right tools as their project evolves. ONNX provides an open source format for AI models, both deep learning and traditional ML. It defines an extensible computation graph model, as well as definitions of built-in operators and standard data types. Currently we focus on the capabilities needed for inferencing (scoring). ONNX is widely supported and can be found in many frameworks, tools, and hardware. Enabling interoperability between different frameworks and streamlining the path from research to production helps increase the speed of innovation in the AI community.

Downloads: 24 This Week

Last Update: 2026-03-27
See Project
22

Open Interpreter

A natural language interface for computers

Open Interpreter is an open-source tool that provides a natural-language interface for interacting with your computer. It lets large language models (LLMs) run code locally (Python, JavaScript, shell, etc.), enabling you to ask your computer to do tasks like data analysis, file manipulation, browsing, etc. in human terms (“chat with your computer”), with safeguards. Runs locally or via configured remote LLM servers/inference backends, giving flexibility to use models you trust or have locally. It prompts you to approve code before executing, and supports both online LLM models and local inference servers. It seeks to combine convenience (like ChatGPT’s code interpreter) with control and flexibility by running on your own machine.

Downloads: 24 This Week

Last Update: 2025-09-12
See Project
23

OpenFace

A state-of-the-art facial behavior analysis toolkit

OpenFace is an advanced facial behavior analysis toolkit intended for computer vision and machine learning researchers, those in the affective computing community, and those who are simply interested in creating interactive applications based on facial behavior analysis. The OpenFace toolkit is capable of performing several complex facial analysis tasks, including facial landmark detection, eye-gaze estimation, head pose estimation and facial action unit recognition. OpenFace is able to deliver state-of-the-art results in all of these mentioned tasks. OpenFace is available for Windows, Ubuntu and macOS installations. It is capable of real-time performance and does not need to run on any specialist hardware, a simple webcam will suffice.

Downloads: 24 This Week

Last Update: 2023-11-30
See Project
24

Tesseract.js

A pure Javascript Multilingual OCR

Tesseract.js is a pure Javascript port of the popular Tesseract OCR engine. Tesseract.js' library supports more than 100 languages, automatic text orientation and script detection, a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser and on a server with NodeJS. Tesseract.js is a javascript library that gets words in almost any spoken language out of images. The main Tesseract.js functions (ex. recognize, detect) take an image parameter, which should be something that is like an image. What's considered "image-like" differs depending on whether it is being run from the browser or through NodeJS.

Downloads: 24 This Week

Last Update: 2025-12-15
See Project
25

Turi Create

Simplifies the development of custom machine learning models

Turi Create simplifies the development of custom machine learning models. You don't have to be a machine learning expert to add recommendations, object detection, image classification, image similarity or activity classification to your app. If you want your app to recognize specific objects in images, you can build your own model with just a few lines of code. Turi Create supports macOS 10.12+, Linux (with glibc 2.10+), Windows 10 (via WSL). Turi Create requires Python 2.7, 3.5, 3.6, 3.7, 3.8. Also, x86_64 architecture, and at least 4 GB of RAM. We recommend using virtualenv to use, install, or build Turi Create. The package User Guide and API Docs contain more details on how to use Turi Create. If you want to build Turi Create from source, see BUILD.md. Turi Create does not require a GPU, but certain models can be accelerated 9-13x by utilizing a GPU.

Downloads: 24 This Week

Last Update: 2021-06-02
See Project