Simplest working implementation of StyleGAN2
Spatiotemporal Signal Processing with Neural Machine Learning Models
A lightweight vision library for performing large object detection
This repo contains the code for a 1D tokenizer and generator
Flexible Photo Recrafting While Preserving Your Identity
Taming Stable Diffusion for Lip Sync
A SOTA open-source image editing model
Multi-Agent daTa geneRation Infra and eXperimentation framework
Deploy and share agents with open infrastructure
Build cross-modal and multimodal applications on the cloud
UI-TARS-desktop version that can operate on your local personal device
High-Fidelity and Controllable Generation of Textured 3D Assets
GUI Exploration Lab. One of the best GUI agent solutions
Large language model and vision-language model based on linear attention
Serving LangChain LLM apps automagically with FastAPI
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
FAIR's research platform for object detection research
LLMFlows - Simple, Explicit and Transparent LLM Apps
Optimized Workforce Learning for General Multi-Agent Assistance
RL implementations
Framework for Accelerating LLM Generation with Multiple Decoding Heads
A Customizable Image-to-Video Model based on HunyuanVideo
A computer vision framework to create and deploy apps in minutes
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Scientific Visualisation Made Easy