Phi-3.5 for Mac: Locally Run Vision and Language Models
PyTorch implementation of VALL-E (Zero-Shot Text-to-Speech)
An implementation of model-parallel GPT-2 and GPT-3-style models
Llama-3.3-70B-Instruct is a multilingual LLM optimized for helpful chat
Speaker segmentation model for 10s audio chunks with powerset labels
Efficient text-to-image model with enhanced quality and typography
Multilingual 8B-parameter chat-optimized LLM fine-tuned by Meta
Instruction-tuned 8B LLM by Meta for helpful, safe English dialogue
Advanced MMDiT text-to-image model for high-quality visual generation