The ChatGPT Retrieval Plugin lets you easily find personal documents
Contexts Optical Compression
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Qwen3 is the large language model series developed by the Qwen team
Qwen2.5-VL is the multimodal large language model series
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Foundation Models for Time Series
This repository contains the official implementation of FastVLM
Large language model and vision-language model based on linear attention
Reasoning-powered OCR VLM for converting complex documents to Markdown
Multimodal Transformer for document image understanding and layout
Multimodal 7B model for image, video, and text understanding tasks
Versatile 8B-base multimodal LLM, flexible foundation for custom AI
Lightweight multimodal translation model for 55 languages
Summarization model fine-tuned on CNN/DailyMail articles
Qwen3-Next: 80B instruct LLM with ultra-long context up to 1M tokens
Efficient 13B MoE language model with long context and reasoning modes
Small 3B-base multimodal model ideal for custom AI on edge hardware
Efficient 8B multimodal model tuned for advanced reasoning tasks
VaultGemma: 1B DP-trained Gemma variant for private NLP tasks
Powerful 14B-base multimodal model, flexible base for fine-tuning