Show usage stats for OpenAI Codex and Claude Code
GPT4V-level open-source multi-modal model based on Llama3-8B
LTX-Video Support for ComfyUI
DeepSeek Coder: Let the Code Write Itself
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
GLIDE: a diffusion-based text-conditional image synthesis model
Example Discord bot written in Python that uses the completions API
An implementation of model parallel GPT-2 and GPT-3-style models
A Multi-Modal World Model for Reconstructing, Generating, Simulation
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal Diffusion with Representation Alignment
Release for Improved Denoising Diffusion Probabilistic Models
Code for the paper "Improved Techniques for Training GANs"
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Ultra-Efficient LLMs on End Device
Achieving 3+ generation speedup on reasoning tasks
State of the art LLM and coding model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Qwen3-omni is a natively end-to-end, omni-modal LLM
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
A method to increase the speed and lower the memory footprint
Tool for exploring and debugging transformer model behaviors
Chinese and English multimodal conversational language model
Open-source, high-performance Mixture-of-Experts large language model
Chinese LLaMA & Alpaca large language model + local CPU/GPU training