The ChatGPT Retrieval Plugin lets you easily find personal documents
CLIP: predict the most relevant text snippet given an image
Ling is a MoE LLM provided and open-sourced by InclusionAI
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Diffusion Transformer with Fine-Grained Chinese Understanding
A Unified Framework for Text-to-3D and Image-to-3D Generation
A Customizable Image-to-Video Model based on HunyuanVideo
Open-source large language model family from Tencent Hunyuan
Multimodal-Driven Architecture for Customized Video Generation
Personalize Any Characters with a Scalable Diffusion Transformer
Implementation of the Surya Foundation Model for Heliophysics
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Code for the paper "Hybrid Spectrogram and Waveform Source Separation"
Let us control diffusion models
This repository contains the official implementation of research
Fine-tuning ChatGLM-6B with PEFT
Official repo for consistency models
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
800,000 step-level correctness labels on LLM solutions to MATH problems
Repo for external large-scale work
Official PyTorch Implementation of "Scalable Diffusion Models"
High-Resolution Image Synthesis with Latent Diffusion Models
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)