Generate Any 3D Scene in Seconds
Memory-efficient and performant finetuning of Mistral's models
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Uncommon Objects in 3D dataset
GPT4V-level open-source multi-modal model based on Llama3-8B
The ChatGPT Retrieval Plugin lets you easily find personal documents
Implementation of the Surya Foundation Model for Heliophysics
Chinese and English multimodal conversational language model
Multi-modal large language model designed for audio understanding
Release for Improved Denoising Diffusion Probabilistic Models
AI Suite for upscaling, interpolating & restoring images/videos
High-Resolution Image Synthesis with Latent Diffusion Models
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
StudioOllamaUI is a local, portable interface for Ollama
Open-source, high-performance Mixture-of-Experts large language model
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
AI-powered tool to quickly and flawlessly remove watermarks from images
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Open Multilingual Multimodal Chat LMs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Official repo for consistency models
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Official PyTorch Implementation of "Scalable Diffusion Models"
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)