GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
A trainable PyTorch reproduction of AlphaFold 3
Multi-modal large language model designed for audio understanding
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
LLM-based reinforcement learning audio editing model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Open-weight, large-scale hybrid-attention reasoning model
Capable of understanding text, audio, vision, and video
Example Discord bot written in Python that uses the completions API
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
High-Resolution Image Synthesis with Latent Diffusion Models
StudioOllamaUI is a local, portable interface for Ollama
Open-source, high-performance Mixture-of-Experts large language model
AI Suite for upscaling, interpolating & restoring images/videos
Dataset of GPT-2 outputs for research in detection, biases, and more
A Conversational Speech Generation Model
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Official code for Style Aligned Image Generation via Shared Attention
Powerful open-source image generation model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open Multilingual Multimodal Chat LMs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Let us control diffusion models
Fine-tuning ChatGLM-6B with PEFT