Document (PDF, Word, PPTX ...) extraction and parse API
Hypernetworks that adapt LLMs for specific benchmark tasks
Qwen3-omni is a natively end-to-end, omni-modal LLM
Qwen-Image is a powerful image generation foundation model
Unifying 3D Mesh Generation with Language Models
A high-quality PDF to Markdown tool based on large language model
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
Capable of understanding text, audio, vision, video
Search all of YouTube from the command line
Toolkit for conversational AI
Qwen2.5-VL is the multimodal large language model series
GLM-4-Voice | End-to-End Chinese-English Conversational Model
LLM abstractions that aren't obstructions
Knowledge Graph Generation from Any Text
Multilingual sentence & image embeddings with BERT
Enhances Tesseract OCR output using LLMs (local or API)
Code and models for ICML 2024 paper, NExT-GPT
A Pioneering Open-Source Alternative to GPT-4o
lightweight package to simplify LLM API calls
Large-language-model & vision-language-model based on Linear Attention
A list of free LLM inference resources accessible via API
Designed for text embedding and ranking tasks
Low-latency REST API for serving text-embeddings
A modular graph-based Retrieval-Augmented Generation (RAG) system
Using AI models to automatically provide commentary and edit videos