Recovering the Visual Space from Any Views
Fast-stable-diffusion + DreamBooth
Revolutionizing Database Interactions with Private LLM Technology
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Easy Docker setup for Stable Diffusion with user-friendly UI
Hackable and optimized Transformers building blocks
A Systematic Framework for Interactive World Modeling
The official repo of Qwen chat & pretrained large language model
Official implementation of Watermark Anything with Localized Messages
Miso TTS is an 8 billion, highly emotive text-to-speech model
MOSS‑TTS Family open‑source speech and sound generation model
Foundation Models for Time Series
tiktoken is a fast BPE tokeniser for use with OpenAI's models
High-Resolution Image Synthesis with Latent Diffusion Models
Repo for SeedVR2 & SeedVR
Sharp Monocular Metric Depth in Less Than a Second
code for Mesh R-CNN, ICCV 2019
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
LTX-Video Support for ComfyUI
High-Fidelity and Controllable Generation of Textured 3D Assets
Generate Any 3D Scene in Seconds
Achieving 3+ generation speedup on reasoning tasks
A series of math-specific large language models of our Qwen2 series
FAIR Sequence Modeling Toolkit 2
GPT4V-level open-source multi-modal model based on Llama3-8B