Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Diffusion Transformer with Fine-Grained Chinese Understanding
Open-source large language model family from Tencent Hunyuan
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
DeepMind model for tracking arbitrary points across videos & robotics
code for Mesh R-CNN, ICCV 2019
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Language modeling in a sentence representation space
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Dataset of GPT-2 outputs for research in detection, biases, and more
The ChatGPT Retrieval Plugin lets you easily find personal documents
Designed for text embedding and ranking tasks
Implementation of the Surya Foundation Model for Heliophysics
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Diversity-driven optimization and large-model reasoning ability
Chinese and English multimodal conversational language model
GLM-4 series: Open Multilingual Multimodal Chat LMs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
High-Resolution Image Synthesis with Latent Diffusion Models
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Let us control diffusion models
Suite with Real-ESRGAN, BSRGAN , IRCNN, GFPGAN & RIFE. v4.3
A Conversational Speech Generation Model
Open-Source Financial Large Language Models!
Open-source, high-performance Mixture-of-Experts large language model