High-Resolution Image Synthesis with Latent Diffusion Models
Qwen3 is the large language model series developed by the Qwen team
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Qwen3-Coder is the code version of Qwen3
An experimental version of DeepSeek model
ChatGPT interface with better UI
Qwen-Image is a powerful image generation foundation model
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
A Unified Framework for Text-to-3D and Image-to-3D Generation
GPT4V-level open-source multi-modal model based on Llama3-8B
Hunyuan Translation Model Version 1.5
Stable Diffusion with Core ML on Apple Silicon
Tooling for the Common Objects In 3D dataset
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Chat & pretrained large audio language model proposed by Alibaba Cloud
High-Resolution Image Synthesis with Latent Diffusion Models
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
StudioOllamaUI is a local, portable interface for Ollama
LLaMA: Open and Efficient Foundation Language Models
Code release for ConvNeXt V2 model
Reference implementation of the Transformer architecture optimized
A mix of GAN implementations including progressive growing
Code for reproducing key results in the paper
Dia-1.6B generates lifelike English dialogue and vocal expressions
High-compute ultra-reasoning model surpassing GPT-5