Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Python example app from the OpenAI API quickstart tutorial
Offline speech recognition API for Android, iOS, Raspberry Pi
Comprehensive Gradio WebUI for audio processing
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
A framework to enable multimodal models to operate a computer
A Model Context Protocol (MCP) Gateway & Registry
Python scraper based on AI
Open source machine learning framework to automate text conversations
Python Client for Supabase. Query Postgres from Flask, Django
Photorealistic Synthetic Dataset for Holistic Indoor Scene
Trainable models and NN optimization tools
Agent S: an open agentic framework that uses computers like a human
State-of-the-art diffusion models for image and audio generation
c/ua is the Docker Container for Computer-Use AI Agents
Benchmarking synthetic data generation methods
Control Any Computer Using LLMs
An LLM-powered knowledge curation system that researches topics
GLM-4-Voice | End-to-End Chinese-English Conversational Model
A library for deep learning end-to-end dialog systems and chatbots
Best practices on recommendation systems
Simple and powerful voice changer for Linux, written with Python & GTK
Interpretability and explainability of data and machine learning model
Create software using visual programming
An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM