Open-source large language model family from Tencent Hunyuan
A series of math-specific large language models of our Qwen2 series
Repo for SeedVR2 & SeedVR
An experimental version of DeepSeek model
High-Resolution Image Synthesis with Latent Diffusion Models
Tool for exploring and debugging transformer model behaviors
Open-source industrial-grade ASR models
A Systematic Framework for Interactive World Modeling
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Ling is a MoE LLM provided and open-sourced by InclusionAI
Multimodal-Driven Architecture for Customized Video Generation
The Clay Foundation Model - An open source AI model and interface
Bidirectional token-classification model for identifiable info
Recovering the Visual Space from Any Views
ChatGPT interface with better UI
Controllable & emotion-expressive zero-shot TTS
Designed for text embedding and ranking tasks
Audio foundation model excelling in audio understanding
DeepSeek Coder: Let the Code Write Itself
Renderer for the harmony response format to be used with gpt-oss
Easy Docker setup for Stable Diffusion with user-friendly UI
ICLR2024 Spotlight: curation/training code, metadata, distribution
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model