Accurate × Fast × Comprehensive
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Provides convenient access to the Anthropic REST API from any Python 3
Phi-3.5 for Mac: Locally-run Vision and Language Models
Open-source framework for intelligent speech interaction
Diversity-driven optimization and large-model reasoning ability
Implementation of the Surya Foundation Model for Heliophysics
Chat & pretrained large audio language model proposed by Alibaba Cloud
Multi-modal large language model designed for audio understanding
Large Multimodal Models for Video Understanding and Editing
The official PyTorch implementation of Google's Gemma models
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
The most powerful local music generation model
Real-time behaviour synthesis with MuJoCo, using Predictive Control
LLM-based Reinforcement Learning audio edit model
Open-weight, large-scale hybrid-attention reasoning model
This repository contains the official implementation of FastVLM
Memory-efficient and performant finetuning of Mistral's models
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Fast stable diffusion on CPU and AI PC
Fast-stable-diffusion + DreamBooth
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
VMZ: Model Zoo for Video Modeling
High-resolution models for human tasks
Video understanding codebase from FAIR for reproducing video models