Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Diffusion Transformer with Fine-Grained Chinese Understanding
Qwen2.5-VL is the multimodal large language model series
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
TextWorld is a sandbox learning environment for the training
An industrial grade federated learning framework
All-in-one WebUI for AI generative image and video creation
Lemonade helps users run local LLMs with the highest performance
Googles NotebookLM but local
Repo of Qwen2-Audio chat & pretrained large audio language model
A trainable PyTorch reproduction of AlphaFold 3
Experimental, AI/ML-powered and open sourced Marketing Mix Modeling
Open-source abilities for OpenHome agents
Benchmark LLMs by fighting in Street Fighter 3
Traditional Mandarin LLMs for Taiwan
Plug-and-play library to enable agents to call MCP and UTCP tools
UI-TARS-desktop version that can operate on your local personal device
LLM Large Model of Selling Anchor
New family of code large language models (LLMs)
Controllable & emotion-expressive zero-shot TTS
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Tooling for the Common Objects In 3D dataset
State-of-the-art Image & Video CLIP, Multimodal Large Language Models