Set of Ansible scripts that simplifies the setup of a personal VPN
Label Studio is a multi-type data labeling and annotation tool
A youtube-dl fork with additional features and fixes
A TTS model capable of generating ultra-realistic dialogue
An Open Source implementation of Notebook LM with more flexibility
Trying to be a robust, user-friendly and hackable music player
Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Qwen3-ASR is an open-source series of ASR models
VMZ: Model Zoo for Video Modeling
NeoDB is a self-hosted server tracking what you read/watch/listen/play
GPU environment management and cluster orchestration
Unified web UI for training and running open models locally
Run macOS VM in a Docker! Run near native OSX-KVM in Docker
A Web UI for easy subtitle using whisper model
A python tool that uses GPT-4, FFmpeg, and OpenCV
An event-driven framework designed to build multi-agent AI systems
Open Source Speech Language Model
Towards Human-Sounding Speech
Spring AI Alibaba examples for building and testing AI apps
Industrial-level controllable zero-shot text-to-speech system
LLM based data scientist, AI native data application
Python inference and LoRA trainer package for the LTX-2 audio–video
A sound cloning tool with a web interface, using your voice
Framework for building real-time voice and multimodal AI agents
Code and models for ICML 2024 paper, NExT-GPT