Official inference repo for FLUX.2 models
A Family of Open Sourced Music Foundation Models
A Customizable Image-to-Video Model based on HunyuanVideo
Towards Human-Level Text-to-Speech through Style Diffusion
MARS5 speech model (TTS) from CAMB.AI
A high-quality rapid TTS voice cloning model
Interface for OuteTTS models
A lightweight text-to-speech model with zero-shot voice cloning
Implementation of Vision Transformer, a simple way to achieve SOTA
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Framework for building neural networks
The official Meta Llama 3 GitHub site
Utilities intended for use with Llama models
Set of tools to assess and improve LLM security
An implementation of a deep learning recommendation model (DLRM)
Official DeiT repository
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Reference PyTorch implementation and models for DINOv3
Volcano Engine Reinforcement Learning for LLMs
Documentation for Google's Gen AI site - including Gemini API & Gemma
Flexible Photo Recrafting While Preserving Your Identity
Real-time voice interactive digital human
Open source full-stack AI vibe coding platform & web app generator
OpenAI Assistants API quickstart with Next.js
Industrial-level controllable zero-shot text-to-speech system