A lightweight audio-to-MIDI converter with pitch bend detection
Unofficial Python API and agentic skill for Google NotebookLM
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Deepfakes Software For All
Tokenizer-Free TTS for Multilingual Speech Generation
Faster Whisper transcription with CTranslate2
Generate audiobooks from e-books, voice cloning & 1107+ languages
Fast stable diffusion on CPU and AI PC
Aider is AI pair programming in your terminal
Agentic IM Chatbot infrastructure
Open-source AI agent framework
A Domain-Fronting Relay that routes traffic though GAS
Powerful Android AI agent with tools, automation, and Linux shell
Universal LLM Deployment Engine with ML Compilation
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Use Microsoft Edge's online text-to-speech service from Python
Machine learning in Python
The official Meta Llama 3 GitHub site
From Images to High-Fidelity 3D Assets
High-Quality Voice Cloning TTS for 600+ Languages
Python inference and LoRA trainer package for the LTX-2 audio–video
Synchronized Translation for Videos
A community-supported supercharged version of paperless
Python tool for converting files and office documents to Markdown
A set of ready to use Agent Skills for research, science, engineering