Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Enhances Tesseract OCR output using LLMs (local or API)
Replace OpenAI GPT with another LLM in your app
Toolkit for conversational AI
Repo of Qwen2-Audio chat & pretrained large audio language model
LLM for live-stream sales hosts
Large Audio Language Model built for natural interactions
Capable of understanding text, audio, vision, and video
Integrating LLMs into structured NLP pipelines
Qwen3-Coder is the code version of Qwen3
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Refer and Ground Anything Anywhere at Any Granularity
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Chat & pretrained large vision language model
Label, clean and enrich text datasets with LLMs