GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Multi-modal large language model designed for audio understanding
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
LLM-based Reinforcement Learning audio edit model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Open-weight, large-scale hybrid-attention reasoning model
Capable of understanding text, audio, vision, video
FlashMLA: Efficient Multi-head Latent Attention Kernels
Example Discord bot written in Python that uses the completions API
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
High-Resolution Image Synthesis with Latent Diffusion Models
Open-source, high-performance Mixture-of-Experts large language model
Dataset of GPT-2 outputs for research in detection, biases, and more
A Conversational Speech Generation Model
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Official code for Style Aligned Image Generation via Shared Attention
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Powerful open source image generation model
Open Multilingual Multimodal Chat LMs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Let us control diffusion models
Fine-tuning ChatGLM-6B with PEFT
Official repo for consistency models