Unified Model Serving Framework
Hackable and optimized Transformers building blocks
Trainable models and NN optimization tools
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Data manipulation and transformation for audio signal processing
Simplest working implementation of Stylegan2
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Low-latency REST API for serving text-embeddings
InvokeAI is a leading creative engine for Stable Diffusion models
Pytorch domain library for recommendation systems
2D and 3D Face alignment library build using pytorch
A set of Docker images for training and serving models in TensorFlow
Miso TTS is an 8 billion, highly emotive text-to-speech model
LLM training in simple, raw C/CUDA
GPU accelerated decision optimization
4M: Massively Multimodal Masked Modeling
Synchronized Translation for Videos
AI Suite for upscaling, interpolating & restoring images/videos
A Conversational Speech Generation Model
Serving multiple LoRA finetuned LLM as one
MMEditing is a low-level vision toolbox based on PyTorch
A computer vision framework to create and deploy apps in minutes
FAIR's research platform for object detection research
Run the Stable Diffusion releases in a Docker container