Open image model at the forefront of design
Enterprise multi-agent orchestration framework for scalable AI apps
A Unified Library for Parameter-Efficient Learning
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Stable Diffusion web UI
ImageBind One Embedding Space to Bind Them All
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Deep learning optimization library: makes distributed training easy
A Universal Customization Method for Single and Multi Conditioning
Open source codebase for Scale Agentex
Less Code, Lower Barrier, Faster Deployment
A refreshing functional take on deep learning
Build multimodal language agents for fast prototype and production
Open source AI model for generating full songs from lyrics prompts
Build and run agents you can see, understand and trust
Democratizing AI scientists with ToolUniverse
Build your own Cowork, AI Scientist and other SoTA Agents
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
A Unified Framework for Image Customization
An AI for Music Generation
Evals is a framework for evaluating LLMs and LLM systems
Official code for Style Aligned Image Generation via Shared Attention
PyTorch Lightning + Hydra. A very user-friendly template
Implementation of Nougat Neural Optical Understanding
Task of transcribing piano recordings into MIDI files