ChemCrow
Robust Speech Recognition via Large-Scale Weak Supervision
Offline inference engine for art, real-time voice conversations
Official MiniMax Model Context Protocol (MCP) server
Speakr is a personal, self-hosted web application
Synthesizing and manipulating 2048x1024 images with conditional GANs
Build and run agents you can see, understand and trust
A Systematic Framework for Interactive World Modeling
Repo of Qwen2-Audio chat & pretrained large audio language model
Refactoring ChatBot+LLM, GPT-3.5-turbo, ChatGPT Bot/Voice Assistant
AI assistant based on large models that can actively think and plan
AI-powered tool for efficient abstract and PDF screening
A fast TTS architecture with conditional flow matching
A TTS model capable of generating ultra-realistic dialogue
A generative speech model for daily dialogue
SDG is a specialized framework
An Open Source text-to-speech system built by inverting Whisper
Inference code for CodeLlama models
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
A specialized Claude Code workspace for creating long-form
Sharp Monocular View Synthesis in Less Than a Second
Context-aware desktop AI assistant that understands screen content
Run a full local LLM stack with one command using Docker
AI-Researcher: Autonomous Scientific Innovation
Transforming Multimodal Content into Captivating Multilingual Audio