Flexible Photo Recrafting While Preserving Your Identity
A format specification for describing a visual identity
Build and run AI agents like microservices
Standard and implementation for AI agent authentication
Multimodal-Driven Architecture for Customized Video Generation
A universal git-native AI agent framework
Official inference repo for FLUX.2 models
A Unified Framework for Image Customization
The common language for platforms, agents and businesses.
A Customizable Image-to-Video Model based on HunyuanVideo
A Universal Customization Method for Single and Multi Conditioning
Pushing the Frontier of Long Audio-Visual Generation
super expressive prompting model based on ltx2.3
AI agent microservice
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Interface for OuteTTS models
MARS5 speech model (TTS) from CAMB.AI
Implementation of Make-A-Video, new SOTA text to video generator
CoTracker is a model for tracking any point (pixel) on a video
A framework for autonomous economic agent (AEA) development
Demo for the "Talking Head Anime from a Single Image"
VGGFace2 Dataset for Face Recognition
A python package to analyze and compare voices with deep learning
FaceAccess is an Access Control System based on Facial Recognition