Code for the paper Language Models are Unsupervised Multitask Learners
Node.js example app from the OpenAI API quickstart tutorial
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Skills for AI coding agents
Strong, Economical, and Efficient Mixture-of-Experts Language Model
A neural network that transforms a design mock-up into static websites
Research code artifacts for Code World Model (CWM)
Contexts Optical Compression
Reverse-engineered Python API for Google Gemini web app
Dramatron uses large language models to generate coherent scripts
"Big Model" trains a visual multimodal VLM with 26M parameters
TTS with kokoro and onnx runtime
Chinese Llama-3 LLMs) developed from Meta Llama 3
Curated list of datasets and tools for post-training
Official implementation of DreamCraft3D
Towards Human-Level Text-to-Speech through Style Diffusion
Python example app from the OpenAI API quickstart tutorial
The repository provides code for running inference with SAM 2
Audiocraft is a library for audio processing and generation
Reference PyTorch implementation and models for DINOv3
A Powerful Native Multimodal Model for Image Generation
A collection of various deep learning architectures, models, and tips
CLIP, Predict the most relevant text snippet given an image
Official inference repo for FLUX.1 models
Advanced techniques for RAG systems