The official implementation of RAPTOR
Web interface for searching and downloading books and audiobooks
Self-hosted platform to unify wearable health data
A modern selfhosted media management system for your media library
Taming Stable Diffusion for Lip Sync
Speech-AI-Forge is a project developed around TTS generation model
Enables the best performance on NVIDIA RTX Graphics Cards
Easy-to-use Speech Toolkit including Self-Supervised Learning model
TikTok releases/likes/compilations/live streams/videos/atlases/music
Unifying 3D Mesh Generation with Language Models
AudioMuse-AI is an Open Source Dockerized environment
Build and run agents you can see, understand and trust
A fast TTS architecture with conditional flow matching
Unified Multimodal Understanding and Generation Models
Gemma open-weight LLM library, from Google DeepMind
Combination of multiple linters to install as a GitHub Action
AI-powered penetration testing assistant using local LLM on linux
Personal Information “Leakage ” Detection Interface
AI agents running research on single-GPU nanochat training
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Python framework for building scalable multi-agent systems
The fastest way to bring multi-agent workflows to production
Extract schema, statistics and entities from datasets
Module for automatic summarization of text documents and HTML pages
Jupyter notebook integration with Spyder