Port of Facebook's LLaMA model in C/C++
Advanced language and coding AI model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Agentic, Reasoning, and Coding (ARC) foundation models
From Images to High-Fidelity 3D Assets
Open-source, high-performance AI model with advanced reasoning
Generating Immersive, Explorable, and Interactive 3D Worlds
LTX-Video Support for ComfyUI
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation
An experimental version of the DeepSeek model
State-of-the-art TTS model under 25MB
Tool for exploring and debugging transformer model behaviors
Release for Improved Denoising Diffusion Probabilistic Models
Controllable & emotion-expressive zero-shot TTS
Open-source multi-speaker long-form text-to-speech model
Official inference repo for FLUX.1 models
Industrial-level controllable zero-shot text-to-speech system
Tiny vision language model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Code for Mesh R-CNN, ICCV 2019
HY-Motion model for 3D character animation generation
Multimodal-Driven Architecture for Customized Video Generation
C++ implementation of ChatGLM-6B, ChatGLM2-6B, ChatGLM3, and GLM4(V)