Awesome multilingual OCR toolkits based on PaddlePaddle
gpt-oss-120b and gpt-oss-20b are two open-weight language models
An easy 1-click way to create beautiful artwork on your PC using AI
Open-source, high-performance AI model with advanced reasoning
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Advanced language and coding AI model
Multimodal Diffusion with Representation Alignment
Kimi K2 is the large language model series developed by Moonshot AI
Qwen3-Coder is the code version of Qwen3
tiktoken is a fast BPE tokeniser for use with OpenAI's models
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Image generation model with single-stream diffusion transformer
Text and image to video generation: CogVideoX and CogVideo
An experimental version of DeepSeek model
Reference PyTorch implementation and models for DINOv3
Open-source multi-speaker long-form text-to-speech model
A Family of Open Sourced Music Foundation Models
New set of lightweight state-of-the-art, open foundation models
DeepSeek Coder: Let the Code Write Itself
A Systematic Framework for Interactive World Modeling
Python bindings for llama.cpp
Towards Real-World Vision-Language Understanding