State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Capable of understanding text, audio, vision, video
Speakr is a personal, self-hosted web application
Streaming Real-time Audio-Driven Avatar Generation
An open source digital image forensic toolset
SOTA Open Source TTS
Free, high-quality text-to-speech API endpoint to replace OpenAI
Oobabooga - The definitive Web UI for local AI, with powerful features
ChatGPT interface with better UI
Fast multimodal LLM for real-time voice interaction and AI apps
Cross platform GUI tool for downloading videos from Bilibili sites
Automatically translates the text of a video based on a subtitle file
Offline Text To Speech synthesis for python
Sample code and notebooks for Generative AI on Google Cloud
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
AI video generator optimized for low VRAM and older GPUs use
Implementation of AudioLM audio generation model in Pytorch
A PyTorch-based Speech Toolkit
PersonaPlex code
Robust Speech Recognition via Large-Scale Weak Supervision
Unofficial Python API and agentic skill for Google NotebookLM
An open-source music player with simple UI
The most powerful and modular diffusion model GUI, api and backend
EPUB to audiobook converter, optimized for Audiobookshelf
Chinese Financial Trading Framework Based on Multi-Agent LLM