A Production-ready Reinforcement Learning AI Agent Library
Official DeiT repository
Language modeling in a sentence representation space
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Inference script for Oasis 500M
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Open-weight, large-scale hybrid-attention reasoning model
Suite with Real-ESRGAN, BSRGAN , RealESRNet, IRCNN, GFPGAN & RIFE.
High-Resolution Image Synthesis with Latent Diffusion Models
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Open-source, high-performance Mixture-of-Experts large language model
Powerful open source image generation model
Open Multilingual Multimodal Chat LMs
Fine-tuning ChatGLM-6B with PEFT
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Learning to Act by Watching Unlabeled Online Videos
PyTorch implementation of MAE
GLIDE: a diffusion-based text-conditional image synthesis model
The official pytorch implementation of our paper
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Generate embeddings from large-scale graph-structured data
Tencent’s 36-language state-of-the-art translation model