tiktoken is a fast BPE tokeniser for use with OpenAI's models
Controllable & emotion-expressive zero-shot TTS
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Language modeling in a sentence representation space
Advancing Formal Mathematical Reasoning via Reinforcement Learning
Clean and efficient FP8 GEMM kernels with fine-grained scaling
FlashMLA: Efficient Multi-head Latent Attention Kernels
The ChatGPT Retrieval Plugin lets you easily find personal documents
PyTorch implementation of JiT
A Unified Framework for Text-to-3D and Image-to-3D Generation
Open-source large language model family from Tencent Hunyuan
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Audio foundation model excelling in audio understanding
DeepSeek LLM: Let there be answers
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
High-Resolution Image Synthesis with Latent Diffusion Models
Open-source, high-performance Mixture-of-Experts large language model
Open-source large language model by Alibaba
Dataset of GPT-2 outputs for research in detection, biases, and more
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
A Conversational Speech Generation Model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model