Qwen-Image is a powerful image generation foundation model
Qwen3-omni is a natively end-to-end, omni-modal LLM
Han Language Processing
Fast multimodal LLM for real-time voice interaction and AI apps
A Web UI for easy subtitle using whisper model
A sound cloning tool with a web interface, using your voice
Lightning-fast, on-device TTS, running natively via ONNX
Unified web UI for training and running open models locally
Collection of Gemma 3 variants that are trained for performance
A modular graph-based Retrieval-Augmented Generation (RAG) system
Machine learning, conversational dialog engine for creating chat bots
Framework for building AI-powered interactive digital humans and agent
Algorithms for outlier, adversarial and drift detection
Open source machine learning framework to automate text conversations
Towards Human-Sounding Speech
High-Resolution Image Synthesis with Latent Diffusion Models
The most powerful local music generation model
StreamSpeech is a seamless model for offline speech recognition
Stable Diffusion web UI
Open source personal AI Assistant for Linux, Windows and Mac
Extract schema, statistics and entities from datasets
Open Source Document Management System for Digital Archives
Flowly is 100x faster than OpenClaw
Multilingual sentence & image embeddings with BERT
Open source NLP guide with models, methods, and real use cases