Fast stable diffusion on CPU and AI PC
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
State-of-the-art TTS model under 25MB
A Customizable Image-to-Video Model based on HunyuanVideo
AlphaFold 3 inference pipeline
Easy Docker setup for Stable Diffusion with user-friendly UI
Text and image to video generation: CogVideoX and CogVideo
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Z80-μLM is a 2-bit quantized language model
Large Multimodal Models for Video Understanding and Editing
FAIR Sequence Modeling Toolkit 2
CodeGeeX2: A More Powerful Multilingual Code Generation Model
LLM-based Reinforcement Learning audio edit model
StudioOllamaUI is a local, portable interface for Ollama
AI Suite for upscaling, interpolating & restoring images/videos
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Reference implementation of the Transformer architecture optimized
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Dia-1.6B generates lifelike English dialogue and vocal expressions
Tiny pre-trained IBM model for multivariate time series forecasting
Vision-language-action model for robot control via images and text