State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Language modeling in a sentence representation space
GPT4V-level open-source multi-modal model based on Llama3-8B
The ChatGPT Retrieval Plugin lets you easily find personal documents
A SOTA open-source image editing model
Chinese and English multimodal conversational language model
Multi-modal large language model designed for audio understanding
Open-source framework for intelligent speech interaction
LLM-based Reinforcement Learning audio edit model
High-Resolution Image Synthesis with Latent Diffusion Models
A Conversational Speech Generation Model
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Suite with Real-ESRGAN, BSRGAN , RealESRNet, IRCNN, GFPGAN & RIFE.
Open-source, high-performance Mixture-of-Experts large language model
Open-Source Financial Large Language Models!
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Powerful open source image generation model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open Multilingual Multimodal Chat LMs
An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM
Let us control diffusion models
This repository contains the official implementation of research
Fine-tuning ChatGLM-6B with PEFT
Official repo for consistency models
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)