Chat with it via text and voice
General-purpose image editing model that delivers high-fidelity
An Open Source text-to-speech system built by inverting Whisper
Open-source multi-speaker long-form text-to-speech model
Transforming Multimodal Content into Captivating Multilingual Audio
Open source NLP guide with models, methods, and real use cases
Standalone, small, language-neutral
go1pylib is a Python library designed to control the Go1 robot
Agent harness to make your slop code well-engineered and beautiful
Open Source Speech Language Model
Speakr is a personal, self-hosted web application
Spark-TTS Inference Code
Multimodal embedding and reranking models built on Qwen3-VL
A Unified Framework for Text-to-3D and Image-to-3D Generation
A theme for Sublime Text 3 by Mattia Astorino
Lightweight package to simplify LLM API calls
Instant voice cloning by MIT and MyShell. Audio foundation model
A Web UI for easy subtitle generation using the Whisper model
A list of free LLM inference resources accessible via API
Audiocraft is a library for audio processing and generation
21 Lessons, Get Started Building with Generative AI
An easy-to-use backup tool for GNU/Linux using rsync in the back
The most powerful local music generation model
Qwen-Image is a powerful image generation foundation model
Designed for text embedding and ranking tasks