A long-running autonomous coding agent powered by the Claude Agent
Open speech-to-speech models and pipelines by Hugging Face
Generate high-definition short story videos with one click using AI
Generate audiobooks from e-books
A guidance language for controlling large language models
A theoretical reconstruction of the Claude Mythos architecture
State-of-the-art diffusion models for image and audio generation
StarVector is a foundation model for SVG generation
Flexible Photo Recrafting While Preserving Your Identity
Taming Stable Diffusion for Lip Sync
A high-quality PDF-to-Markdown tool based on large language models
Multimodal Diffusion with Representation Alignment
High-Quality Voice Cloning TTS for 600+ Languages
Zero-code platform for building AI agents from natural language input
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Unofficial Python API and agentic skill for Google NotebookLM
Minimal Claude Code alternative. Single Python file, zero dependencies
Lightweight package to simplify LLM API calls
Use Microsoft Edge's online text-to-speech service from Python
Renderer for the harmony response format to be used with gpt-oss
Toolkit to help you get started with Spec-Driven Development
AI coding workstation: Claude Code + web UI + 5 AI CLIs + headless
A Python module to repair invalid JSON from LLMs
Long-form streaming TTS system for multi-speaker dialogue generation
Open source healthcare AI