Fast stable diffusion on CPU and AI PC
Instant voice cloning by MIT and MyShell. Audio foundation model
Scalable data pre processing and curation toolkit for LLMs
User toolkit for analyzing and interfacing with Large Language Models
Open source terminal session recorder
Interface for OuteTTS models
Automatically translates the text of a video based on a subtitle file
A very simple framework for state-of-the-art NLP
State-of-the-art (SoTA) text-to-video pre-trained model
Code and models for ICML 2024 paper, NExT-GPT
Extract audio and video content and organize it into a Markdown note
The open-source data curation platform for LLMs
Implementation of AudioLM audio generation model in Pytorch
Minimalist Vim Plugin Manager
Adding guardrails to large language models
Framework for building, orchestrating, and deploying AI agents
Documentation for Google's Gen AI site - including Gemini API & Gemma
Low-latency AI inference engine optimized for mobile devices
An Open Source implementation of Notebook LM with more flexibility
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Foundational model for human-like, expressive TTS
The Markdown Editor for Linux
Interactively find and recover deleted or overwritten files
Search all of YouTube from the command line
Real-time voice interactive digital human