Enable AI to control your desktop, mobile and HMI devices
Generate audiobooks from e-books, voice cloning & 1107+ languages
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
The AI toolkit for the AI developer
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Meta Agents Research Environments is a comprehensive platform
Weaving the Digital Agent Galaxy
Agent S: an open agentic framework that uses computers like a human
AI-powered tool for developers, simplifying coding tasks
A graphical manager for ollama that can manage your LLMs
AnyTool: Universal Tool-Use Layer for AI Agents
StreamSpeech is a seamless model for offline speech recognition
A simple screen parsing tool towards pure vision based GUI agent
Graphical User Interface Face Anonymization Tool
RetroScheme is used for molecule sketching and retrosynthesis
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Unlimited, private and free Speech-To-Text program
AI Suite for upscaling, interpolating & restoring images/videos
AI-powered quiz solver for Windows. Free to use, easy to set up.
Meta-database application for the audio and TV broadcasts of CC2.TV
Leading free and open-source liveness check & face recognition system
Official Code for DragGAN (SIGGRAPH 2023)