A Customizable Image-to-Video Model based on HunyuanVideo
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
GPT4V-level open-source multi-modal model based on Llama3-8B
Implementation of the Surya Foundation Model for Heliophysics
Chinese and English multimodal conversational language model
Multi-modal large language model designed for audio understanding
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
High-Resolution Image Synthesis with Latent Diffusion Models
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Open-source, high-performance Mixture-of-Experts large language model
Blazeface is a lightweight model that detects faces in images
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Encoder of greater-than-word length text trained on a variety of data
Runtime extension of Proximus enabling Deployment on AMD Ryzen™ AI
Open Multilingual Multimodal Chat LMs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Official repo for consistency models
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Official PyTorch Implementation of "Scalable Diffusion Models"
llama.go is like llama.cpp in pure Golang
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Locally run an Instruction-Tuned Chat-Style LLM
A method to increase the speed and lower the memory footprint
LLaMA: Open and Efficient Foundation Language Models
Implementation of model parallel autoregressive transformers on GPUs