Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Python Client for Supabase. Query Postgres from Flask, Django
c/ua is the Docker Container for Computer-Use AI Agents
Python scraper based on AI
Control Any Computer Using LLMs
Comprehensive Gradio WebUI for audio processing
A framework to enable multimodal models to operate a computer
An LLM-powered knowledge curation system that researches topics
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Open-source machine learning framework to automate text conversations
A Model Context Protocol (MCP) Gateway & Registry
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
State-of-the-art diffusion models for image and audio generation
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Generate Any 3D Scene in Seconds
Virtual AI anchor that combines state-of-the-art technology
Multi-Voice and Prompt-Controlled TTS Engine
Trainable models and NN optimization tools
Photorealistic Synthetic Dataset for Holistic Indoor Scene
A library for deep learning end-to-end dialog systems and chatbots
Agent S: an open agentic framework that uses computers like a human
Benchmarking synthetic data generation methods
Foundational model for human-like, expressive TTS
Best practices on recommendation systems
Simple and powerful voice changer for Linux, written in Python with GTK