A New Axis of Sparsity for Large Language Models
The knowledge and task management backbone for AI coding assistants
Open-source infrastructure for Computer-Use Agents. Sandboxes
"Big Model" trains a visual multimodal VLM with 26M parameters
Simplifies the local serving of AI models from any source
Collection of Gemma 3 variants that are trained for performance
Language Model Reinforcement Learning Environments frameworks
AI Agent Networks for Open Collaboration
Collection of reference environments, offline reinforcement learning
Simple and easily configurable grid world environments
LLM training in simple, raw C/CUDA
Fast and accurate AI powered file content types detection
Less Code, Lower Barrier, Faster Deployment
PPTAgent: Generating and Evaluating Presentations
Implementation of "MobileCLIP" CVPR 2024
A fast, powerful, and simple hierarchical vision transformer
Code release for Cut and Learn for Unsupervised Object Detection
High-resolution models for human tasks
Ling is a MoE LLM provided and open-sourced by InclusionAI
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
Personalize Any Characters with a Scalable Diffusion Transformer
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Extensible AGI Framework