Awesome multilingual OCR toolkits based on PaddlePaddle
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Open-source, high-performance AI model with advanced reasoning
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Wan2.1: Open and Advanced Large-Scale Video Generative Model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Python bindings for llama.cpp
Advanced language and coding AI model
Qwen3-Coder is the code version of Qwen3
DeepSeek Coder: Let the Code Write Itself
An experimental version of DeepSeek model
Reference PyTorch implementation and models for DINOv3
Open-source multi-speaker long-form text-to-speech model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
A Family of Open Sourced Music Foundation Models
Text and image to video generation: CogVideoX and CogVideo
Towards Real-World Vision-Language Understanding
AlphaFold 3 inference pipeline
Multimodal Diffusion with Representation Alignment
Models for object and human mesh reconstruction
Fast-stable-diffusion + DreamBooth
Industrial-level controllable zero-shot text-to-speech system
Programmatic access to the AlphaGenome model
Stable Diffusion with Core ML on Apple Silicon
A Systematic Framework for Interactive World Modeling