The ChatGPT Retrieval Plugin lets you easily find personal documents
Pushing the Limits of Mathematical Reasoning in Open Language Models
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open-source, high-performance Mixture-of-Experts large language model
Release for Improved Denoising Diffusion Probabilistic Models
Blazeface is a lightweight model that detects faces in images
Open source large language model by Alibaba
Powerful open source image generation model
Open Multilingual Multimodal Chat LMs
DeepSeek LLM: Let there be answers
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Official code for Style Aligned Image Generation via Shared Attention
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Fine-tuning ChatGLM-6B with PEFT
Official PyTorch Implementation of "Scalable Diffusion Models"
llama.go is like llama.cpp in pure Golang
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Code release for ConvNeXt V2 model
A minimal PyTorch re-implementation of the OpenAI GPT
Learning to Act by Watching Unlabeled Online Videos
Code release for "Masked-attention Mask Transformer
GLIDE: a diffusion-based text-conditional image synthesis model
Large-scale autoregressive pixel model for image generation by OpenAI
Environment generation code for the paper "Emergent Tool Use"