Inference framework for 1-bit LLMs
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Block Diffusion for Ultra-Fast Speculative Decoding
tiktoken is a fast BPE tokeniser for use with OpenAI's models
A Conversational Speech Generation Model
OpenAI’s compact 20B open model for fast, agentic, and local use