High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Open-Source Financial Large Language Models!
AI-suite for image and video upscaling and enhancement. v4.1
Qwen (通义千问) chat/pretrained large language model Alibaba Cloud
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Open-source, high-performance Mixture-of-Experts large language model
Generating Immersive, Explorable, and Interactive 3D Worlds from Words
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
A Conversational Speech Generation Model
Open Multilingual Multimodal Chat LMs
An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM
Janus-Series: Unified Multimodal Understanding and Generation Models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Open-source pre-training implementation of Google's LaMDA in PyTorch
An implementation of model parallel GPT-2 and GPT-3-style models
Dia-1.6B generates lifelike English dialogue and vocal expressions
ERNIE 4.5 MoE model in FP8 for efficient high-performance inference
Code generation model trained on 80+ languages with FIM support
State-of-the-art RL-trained coding agent for complex SWE tasks
CTC-based forced aligner for audio-text in 158 languages
Mirror of Ultralytics YOLO-World model weights for object detection
Speaker segmentation model for 10s audio chunks with powerset labels
Open-weight, large-scale hybrid-attention reasoning model