Open Source Differentiable Computer Vision Library
InvokeAI is a leading creative engine for Stable Diffusion models
UI-TARS-desktop version that can operate on your local personal device
Integrate cutting-edge LLM technology quickly and easily into your app
Deep Learning Visualization Toolkit
Diffusion Transformer with Fine-Grained Chinese Understanding
Context data platform for building observable, self-learning AI agents
Implementation of the Surya Foundation Model for Heliophysics
Open source file indexing & storage analytics powered by Elasticsearch
Open-Source Dual-Arm Mobile Robot with Motorized Lift
Latent Collaboration in Multi-Agent Systems
Motion-controllable Video Generation via Latent Trajectory Guidance
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
Multimodal embedding and reranking models built on Qwen3-VL
Contains the code for CM-SS13
Generate high-definition story short videos with one click using AI
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Mini website for testing both general CS knowledge and enforce coding
A collective list of free APIs
Single-package Kubernetes for developers, IoT and edge
SOTA discrete acoustic codec models with 40/75 tokens per second
Language modeling in a sentence representation space
Proofs, cases, concept supplements, and reference explanations
High-Fidelity and Controllable Generation of Textured 3D Assets
Large Multimodal Models for Video Understanding and Editing