GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Port of Facebook's LLaMA model in C/C++
Fast backend for long-term AI user memory via structured profiles
Block Diffusion for Ultra-Fast Speculative Decoding
Image generation model with single-stream diffusion transformer
Run local LLMs like llama, deepseek, kokoro, etc. inside your browser
An undetectable, powerful, flexible, high-performance Python library
A lightweight text-to-speech model with zero-shot voice cloning
tiktoken is a fast BPE tokeniser for use with OpenAI's models
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
100–200× Acceleration for Video Diffusion Models
A New Axis of Sparsity for Large Language Models
Java enterprise application development framework
Learn AI and LLMs from scratch using free resources
Blazeface is a lightweight model that detects faces in images
Detect faces in an image
A Conversational Speech Generation Model
C++-based high-performance parallel environment execution engine
Encoder of greater-than-word length text trained on a variety of data
Editing large language models within 10 seconds
Fast, modular reference implementation of Instance Segmentation
R-FCN: Object Detection via Region-based Fully Convolutional Networks
Bulk delete your ChatGPT conversations easily with this Chrome extension
Efficient 13B MoE language model with long context and reasoning modes