Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Enhances Tesseract OCR output using LLMs (local or API)
Toolkit for conversational AI
Replace OpenAI GPT with another LLM in your app
Repo of Qwen2-Audio chat & pretrained large audio language model
LLM for livestream sales hosts
Large Audio Language Model built for natural interactions
Integrating LLMs into structured NLP pipelines
Flock is a workflow-based low-code platform for building chatbots
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Capable of understanding text, audio, vision, video
Qwen3-Coder is the code version of Qwen3
Refer and Ground Anything Anywhere at Any Granularity
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Chat & pretrained large vision language model
Label, clean and enrich text datasets with LLMs