A Production-Ready Reinforcement Learning AI Agent Library
A PyTorch library for implementing flow matching algorithms
PyTorch code and models for DINOv2 self-supervised learning
Official implementation of DreamCraft3D
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Language modeling in a sentence representation space
Designed for text embedding and ranking tasks
Large language model and vision-language model based on Linear Attention
Capable of understanding text, audio, vision, video
A Unified Framework for Text-to-3D and Image-to-3D Generation
Reproduction of Poetiq's record-breaking submission to ARC-AGI-1
Chat & pretrained large audio language model proposed by Alibaba Cloud
Release for Improved Denoising Diffusion Probabilistic Models
Official DeiT repository
Real-time behaviour synthesis with MuJoCo, using Predictive Control
High-Resolution Image Synthesis with Latent Diffusion Models
Open-source, high-performance Mixture-of-Experts large language model
StudioOllamaUI is a local, portable interface for Ollama
AI Suite for upscaling, interpolating & restoring images/videos
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
A Conversational Speech Generation Model
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Powerful open source image generation model
AI-powered tool to quickly and cleanly remove watermarks from images