RGBD video generation model conditioned on camera input
Let's make video diffusion practical
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Foundation Models for Time Series
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
LTX-Video Support for ComfyUI
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Industrial-level controllable zero-shot text-to-speech system
An experimental version of DeepSeek model
A Powerful Native Multimodal Model for Image Generation
HY-Motion model for 3D character animation generation
Generate Any 3D Scene in Seconds
Qwen-Image is a powerful image generation foundation model
Qwen3-Coder is the code version of Qwen3
Qwen2.5-VL is the multimodal large language model series
Diffusion Transformer with Fine-Grained Chinese Understanding
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
A Systematic Framework for Interactive World Modeling
Programmatic access to the AlphaGenome model
Renderer for the harmony response format to be used with gpt-oss
Generating Immersive, Explorable, and Interactive 3D Worlds
Collection of Gemma 3 variants that are trained for performance
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Qwen-Image-Layered: Layered Decomposition for Inherent Editability