GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
The AI toolkit for the AI developer
Example client of oagi-python developed with Tauri
Run Stable Diffusion on Mac natively
Enable AI to control your desktop, mobile and HMI devices
Convert AI papers to GUI
Weaving the Digital Agent Galaxy
A single Gradio + React WebUI with extensions for ACE-Step
AnyTool: Universal Tool-Use Layer for AI Agents
AI-powered tool for developers, simplifying coding tasks
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Agent S: an open agentic framework that uses computers like a human
Meta Agents Research Environments is a comprehensive platform
StreamSpeech is a seamless model for offline speech recognition
A simple screen parsing tool towards a pure vision-based GUI agent
Lightweight PC GUI framework for AI with Gemini-like typewriter streaming
A graphical manager for ollama that can manage your LLMs
Graphical User Interface Face Anonymization Tool
RetroScheme is used for molecule sketching and retrosynthesis
Unlimited, private and free Speech-To-Text program