Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Python Client for Supabase. Query Postgres from Flask, Django
c/ua is the Docker Container for Computer-Use AI Agents
Control Any Computer Using LLMs
Python scraper based on AI
Comprehensive Gradio WebUI for audio processing
A framework to enable multimodal models to operate a computer
An LLM-powered knowledge curation system that researches topics
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Open source machine learning framework to automate text conversations
Multi-Voice and Prompt-Controlled TTS Engine
A Model Context Protocol (MCP) Gateway & Registry
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Generate Any 3D Scene in Seconds
State-of-the-art diffusion models for image and audio generation
Virtual AI anchor that combines state-of-the-art technology
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Trainable models and NN optimization tools
Photorealistic Synthetic Dataset for Holistic Indoor Scene
A library for deep learning end-to-end dialog systems and chatbots
Agent S: an open agentic framework that uses computers like a human
Foundational model for human-like, expressive TTS
Benchmarking synthetic data generation methods
Best practices on recommendation systems
Simple and powerful voice changer for Linux, written with Python & GTK