Fast stable diffusion on CPU and AI PC
State-of-the-art TTS model under 25MB
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Text and image to video generation: CogVideoX and CogVideo
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Tiny pre-trained IBM model for multivariate time series forecasting
Dia-1.6B generates lifelike English dialogue and vocal expressions