Python bindings for llama.cpp
Official inference repo for FLUX.1 models
Python inference and LoRA trainer package for the LTX-2 audio–video
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Robust Speech Recognition Across Languages, Dialects
Open-source, high-performance AI model with advanced reasoning
Qwen3-ASR is an open-source series of ASR models
Official repository for LTX-Video
Advancing Open-source World Models
Collection of Gemma 3 variants that are trained for performance
Qwen3-TTS is an open-source series of TTS models
Python SDK for Claude Agent
A Systematic Framework for Interactive World Modeling
The official repo of Qwen chat & pretrained large language model
GLM-4-Voice | End-to-End Chinese-English Conversational Model
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
AlphaFold 3 inference pipeline
Code for running inference and finetuning with the SAM 3 model
Long-form streaming TTS system for multi-speaker dialogue generation
Open Source Speech Language Model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
State-of-the-art TTS model under 25MB
Large Multimodal Models for Video Understanding and Editing
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
GLM-4.5: Open-source LLM for intelligent agents by Z.ai