Fast-stable-diffusion + DreamBooth
Port of Facebook's LLaMA model in C/C++
Open-source image generative foundation model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Block Diffusion for Ultra-Fast Speculative Decoding
Video understanding codebase from FAIR for reproducing video models
Image generation model with single-stream diffusion transformer
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Bidirectional token-classification model for identifiable info
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Accurate × Fast × Comprehensive
26m function call model that runs on incredibly small devices
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Achieving 3+ generation speedup on reasoning tasks
This repository contains the official implementation of FastVLM
ICLR2024 Spotlight: curation/training code, metadata, distribution
Foundational Models for State-of-the-Art Speech and Text Translation
Detect faces in an image
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Blazeface is a lightweight model that detects faces in images
A CNN model that predicts human joints from RGB images of a person
Encoder of greater-than-word length text trained on a variety of data
A Conversational Speech Generation Model
Python example app from the OpenAI API quickstart tutorial
Official repo for consistency models