Document (PDF, Word, PPTX ...) extraction and parse API
Hypernetworks that adapt LLMs for specific benchmark tasks
Unifying 3D Mesh Generation with Language Models
Qwen3-omni is a natively end-to-end, omni-modal LLM
Enhances Tesseract OCR output using LLMs (local or API)
Qwen-Image is a powerful image generation foundation model
Designed for text embedding and ranking tasks
Knowledge Graph Generation from Any Text
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
Using AI models to automatically provide commentary and edit videos
Text-space optimizer that trains reusable natural-language skills
Qwen2.5-VL is the multimodal large language model series
Code and models for ICML 2024 paper, NExT-GPT
GLM-4-Voice | End-to-End Chinese-English Conversational Model
StarVector is a foundation model for SVG generation
A straightforward method for training your LLM
Autoregressive Model Beats Diffusion
Toolkit for conversational AI
Capable of understanding text, audio, vision, video
Lightweight package to simplify LLM API calls
Unleashing 10,000+ Word Generation from Long Context LLMs
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
A high-quality PDF to Markdown tool based on large language models
LLM
Large language model and vision-language model based on linear attention