Fast and memory-efficient exact attention
text and image to video generation: CogVideoX (2024) and CogVideo
Simplest working implementation of Stylegan2
Traditional Mandarin LLMs for Taiwan
AI video generator optimized for low VRAM and older GPUs use
Run your own AI cluster at home with everyday devices
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
A set of Docker images for training and serving models in TensorFlow
A nearly-live implementation of OpenAI's Whisper
950 line, minimal, extensible LLM inference engine built from scratch
An open sourced end-to-end VLM-based GUI Agent
InvokeAI is a leading creative engine for Stable Diffusion models
Open platform for training, serving, and evaluating language models
Real-Time High-Resolution Background Matting
A2M is a desktop app that converts AUDIO TO MIDI in one click.
AI-powered PC monitoring that explains. Not shows numbers/spikes.
Transformers4Rec is a flexible and efficient library
Unofficial Parallel WaveGAN
Application that simplifies the installation of AI-related projects
A simple command-line utility for querying and monitoring GPU status
Discord bot and Interface for Stable Diffusion
GFPGAN aims at developing Practical Algorithms
BCI: Breast Cancer Immunohistochemical Image Generation
Facebook AI Research Sequence-to-Sequence Toolkit written in Python
[WIP] VoiceSmith makes training text to speech models easy