image free download - SourceForge

MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server

...It acts as a bridge between tools like Claude Desktop, Cursor, Windsurf, OpenAI Agents, and the MiniMax platform, exposing capabilities such as text-to-speech, voice cloning, image generation, text-to-image, video generation, image-to-video, text-to-video, and music generation. The server is written in Python and distributed under the MIT license, with a pyproject.toml and uv-based workflow that makes installation and execution reproducible. Configuration is handled through JSON files that tell MCP clients how to launch the server (typically via uvx minimax-mcp) and which environment variables to use for the API key, host, and output directory. ...

Downloads: 0 This Week

Last Update: 2026-01-07

See Project

AI Runner

Offline inference engine for art, real-time voice conversations

AI Runner is an offline inference engine designed to run a collection of AI workloads on your own machine, including image generation for art, real-time voice conversations, LLM-powered chatbots and automated workflows. It is implemented as a desktop-oriented Python application and emphasizes privacy and self-hosting, allowing users to work with text-to-speech, speech-to-text, text-to-image and multimodal models without sending data to external services.

Downloads: 2 This Week

Last Update: 2025-12-11

See Project

Readest

Readest is a modern, feature-rich ebook reader

Readest is a project meant to facilitate reading, studying, or consuming content by integrating reading tools with AI-powered assistance. Although the repository is not as widely documented or popular as some, the idea is that Readest supports features to help with reading comprehension — likely combining OCR / text retrieval, translation, note-taking, or summarization for reading materials (eBooks, articles, PDFs). The goal appears to be to let users feed in arbitrary reading material and...

Downloads: 27 This Week

Last Update: 6 days ago

See Project

Luna AI

Virtual AI anchor that combines state-of-the-art technology

Luna AI is a virtual AI streamer framework designed to power an interactive VTuber that can go live on major platforms and chat with viewers in real time. It is built around a core assistant persona called “Luna AI,” which can be driven by a wide range of large language models and platforms, including GPT-style APIs, Claude, LangChain-based backends, ChatGLM, Kimi, Ollama, and many others. The project supports multiple rendering backends for the avatar, such as Live2D, Unreal Engine (UE),...

Downloads: 4 This Week

Last Update: 2025-11-28

See Project

comfyui-mixlab-nodes

Workflow and speech recognition app

comfyui-mixlab-nodes is a large collection of custom nodes for ComfyUI that turns workflows into interactive apps and adds real-time multimedia, LLM, and TTS capabilities. It introduces a “Workflow-to-APP” concept, where a ComfyUI graph can be transformed into a Web App through an AppInfo node, complete with categories, batch prompts, and editable configurations. The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that...

Downloads: 2 This Week

Last Update: 2025-11-28

See Project

Lingvo

Framework for building neural networks

...It has been used to implement state of the art architectures such as recurrent neural networks, Transformer models, variational autoencoder hybrids, and multi task systems. Lingvo includes reference models and configurations for domains like machine translation, automatic speech recognition, language modeling, image understanding, and 3D object detection. Centralized hyperparameter configuration files allow researchers to share exact experiment setups so others can retrain and compare results reliably.

Downloads: 0 This Week

Last Update: 2025-11-28

See Project

OpenAI-Compatible Edge-TTS API

Free, high-quality text-to-speech API endpoint to replace OpenAI

...Because it relies on Edge’s TTS, the audio generation itself is free, and the project essentially acts as a smart proxy that handles formatting and streaming. The server supports Server-Sent Events (SSE) for streaming audio, enabling low-latency playback in chat UIs and other interactive tools. A Docker image is provided for one-command deployment, and environment variables can be used to configure default voice, language, response format, authentication, and logging options.

Downloads: 0 This Week

Last Update: 2025-11-28

See Project

EmotiVoice

Multi-Voice and Prompt-Controlled TTS Engine

...The core idea is prompt-based emotional and style control: you can ask the engine to speak “happy,” “sad,” “excited,” or with other high-level style prompts that shape prosody, pitch, speed, and energy. EmotiVoice provides multiple ways to interact with it, including a web interface, a Docker image, an HTTP API (including an OpenAI-compatible TTS API), and Python scripts for batch synthesis. It also supports voice cloning with your own data, backed by recipes for popular datasets like DataBaker and LJSpeech, so you can train or adapt voices to custom personas.

Downloads: 7 This Week

Last Update: 2025-11-30

See Project

Mice MX OS speech to text Voice Control

Mice speech to text with MX Cinnamon OS ISO

Note about this image This image contains a system based on Linux MX, which was created to improve accessibility within the Linux environment. The distribution uses the Cinnamon desktop interface, which is configured to be operated using voice commands and outputs. The user interface and the control of your own devices and home automation systems can be customized and extended.

Downloads: 1 This Week

Last Update: 2025-05-14

See Project

PaddlePaddle models

Pre-trained and Reproduced Deep Learning Models

Pre-trained and Reproduced Deep Learning Models ("Flying Paddle" official model library, including a variety of academic frontier and industrial scene verification of deep learning models) Flying Paddle's industrial-level model library includes a large number of mainstream models that have been polished by industrial practice for a long time and models that have won championships in international competitions; it provides many scenarios for semantic understanding, image classification, target detection, image segmentation, text recognition, speech synthesis, etc. An end-to-end development kit that meets the needs of enterprises for low-cost development and rapid integration. The model library of Flying Paddle is an industrial-level model library tailored around the actual R&D process of domestic enterprises, serving enterprises in many fields such as energy, finance, industry, and agriculture.

Downloads: 0 This Week

Last Update: 2022-08-01

See Project

VoiceShot API - CF SDK

ColdFusion SDK for the VoiceShot API.

...Send notification phone calls and text messages, automate customer service calls/texts, provide order status and integrate with your own custom applications to provide services customized to your needs. Improve your business's efficiency and enhance your image.

Downloads: 0 This Week

Last Update: 2020-03-11

See Project

VoiceShot API - PHP SDK

PHP SDK for processing phone calls and SMS through the VoiceShot API.

...Send notification phone calls and text messages, automate customer service calls/texts, provide order status and integrate with your own custom applications to provide services customized to your needs. Improve your business's efficiency and enhance your image.

Downloads: 0 This Week

Last Update: 2020-03-11

See Project

VoiceShot API - .NET SDK

.NET SDK for processing phone calls and SMS through the VoiceShot API.

...Send notification phone calls and text messages, automate customer service calls/texts, provide order status and integrate with your own custom applications to provide services customized to your needs. Improve your business's efficiency and enhance your image.

Downloads: 0 This Week

Last Update: 2020-03-11

See Project

VoiceShot API - ASP SDK

ASP SDK for processing phone calls and SMS through the VoiceShot API.

...Send notification phone calls and text messages, automate customer service calls/texts, provide order status and integrate with your own custom applications to provide services customized to your needs. Improve your business's efficiency and enhance your image.

Downloads: 0 This Week

Last Update: 2020-03-11

See Project

Search Results for "image"

Showing 14 open source projects for "image"

MiniMax-MCP

AI Runner

Readest

Luna AI

comfyui-mixlab-nodes

Lingvo

OpenAI-Compatible Edge-TTS API

EmotiVoice

Mice MX OS speech to text Voice Control

PaddlePaddle models

VoiceShot API - CF SDK

VoiceShot API - PHP SDK

VoiceShot API - .NET SDK

VoiceShot API - ASP SDK

Search Results for "image"

Showing 14 open source projects for "image"

MiniMax-MCP

AI Runner

Readest

Luna AI

comfyui-mixlab-nodes

Lingvo

OpenAI-Compatible Edge-TTS API

EmotiVoice

Mice MX OS speech to text Voice Control

PaddlePaddle models

VoiceShot API - CF SDK

VoiceShot API - PHP SDK

VoiceShot API - .NET SDK

VoiceShot API - ASP SDK

Related Searches

Related Categories