OCR expert VLM powered by Hunyuan's native multimodal architecture
Open-weight, large-scale hybrid-attention reasoning model
Large language model and vision-language model based on linear attention
Capable of understanding text, audio, vision, and video
StudioOllamaUI is a local, portable interface for Ollama
High-Resolution Image Synthesis with Latent Diffusion Models
Example Discord bot written in Python that uses the completions API
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
A Conversational Speech Generation Model
Powerful open source image generation model
AI Suite for upscaling, interpolating & restoring images/videos
Open-source, high-performance Mixture-of-Experts large language model
Dataset of GPT-2 outputs for research in detection, biases, and more
Open Multilingual Multimodal Chat LMs
Official code for Style Aligned Image Generation via Shared Attention
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Let us control diffusion models
This repository contains the official implementation of research
Fine-tuning ChatGLM-6B with PEFT
Official repo for consistency models
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Chinese LLaMA & Alpaca large language model + local CPU/GPU training