Open-source large language model family from Tencent Hunyuan
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Llama Chinese community, real-time aggregation
A frontier, first-principles handbook
Multimodal embedding and reranking models built on Qwen3-VL
"Big Model" trains a visual multimodal VLM with 26M parameters
Collaborative & Open-Source Quality Assurance for all AI models
MII makes low-latency and high-throughput inference possible
Conditional GAN for generating synthetic tabular data
Open-weight, large-scale hybrid-attention reasoning model
Simplest working implementation of Stylegan2
Neural Network Compression Framework for enhanced OpenVINO
Sunfish: a Python Chess Engine in 111 lines of code
Tooling for the Common Objects In 3D dataset
The Cradle framework is a first attempt at General Computer Control
General-purpose image editing model that delivers high-fidelity
Running large language models on a single GPU
A simple, performant and scalable Jax LLM
Skywork-R1V is an advanced multimodal AI model series
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Robust recipes to align language models with human and AI preferences
Unified framework for building enterprise RAG pipelines
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Language modeling in a sentence representation space