Official Python inference and LoRA trainer package
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Continuous Autonomy for the AI SDK
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Controllable & emotion-expressive zero-shot TTS
Super-expressive prompting model based on ltx2.3
Python inference and LoRA trainer package for the LTX-2 audio–video
Claude Code action for GitHub PRs
Official implementation of DreamCraft3D
Diffusion Transformer with Fine-Grained Chinese Understanding
MOSS‑TTS Family open‑source speech and sound generation model
Renderer for the harmony response format to be used with gpt-oss
Foundation model for image generation
Hunyuan Translation Model Version 1.5
Bidirectional token-classification model for identifiable info
Achieving 3+ generation speedup on reasoning tasks
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Safety reasoning models built upon gpt-oss
Powerful open source image generation model
Let us control diffusion models
800,000 step-level correctness labels on LLM solutions to MATH problem
Learning to Act by Watching Unlabeled Online Videos
Code for reproducing key results in the paper
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201