Text and image to video generation: CogVideoX and CogVideo
Open-source, high-performance AI model with advanced reasoning
A Family of Open Sourced Music Foundation Models
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Reference PyTorch implementation and models for DINOv3
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Open-source multi-speaker long-form text-to-speech model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Code for running inference with the SAM 3D Body Model 3DB
Advanced language and coding AI model
LTX-Video Support for ComfyUI
Lets make video diffusion practical
A theoretical reconstruction of the Claude Mythos architecture
Qwen3 is the large language model series developed by Qwen team
Foundation model for image generation
Uncommon Objects in 3D dataset
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Qwen3-Coder is the code version of Qwen3
An experimental version of DeepSeek model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
AlphaFold 3 inference pipeline
Industrial-level controllable zero-shot text-to-speech system
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
High-Resolution Image Synthesis with Latent Diffusion Models