A single Gradio + React WebUI with extensions for ACE-Step
OCR expert VLM powered by Hunyuan's native multimodal architecture
Collection of reference environments, offline reinforcement learning
Simple and easily configurable grid world environments
LLM training in simple, raw C/CUDA
Fast and accurate AI powered file content types detection
Less Code, Lower Barrier, Faster Deployment
A simple, secure MCP-to-OpenAPI proxy server
Code release for Cut and Learn for Unsupervised Object Detection
Official implementation of Watermark Anything with Localized Messages
Training Large Language Model to Reason in a Continuous Latent Space
High-resolution models for human tasks
Tool for exploring and debugging transformer model behaviors
CLIP, Predict the most relevant text snippet given an image
Ling is a MoE LLM provided and open-sourced by InclusionAI
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
A Unified Framework for Text-to-3D and Image-to-3D Generation
Personalize Any Characters with a Scalable Diffusion Transformer
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Extensible AGI Framework
Open Source Generative Process Automation
LLM based autonomous agent that does online comprehensive research
Harness LLMs with Multi-Agent Programming
Flower: A Friendly Federated Learning Framework
PyTorch version of Stable Baselines