Annotate and review coding agent plans visually, then share them with your team
Documentation for Google's Gen AI site - including Gemini API & Gemma
A modular graph-based Retrieval-Augmented Generation (RAG) system
Fast multimodal LLM for real-time voice interaction and AI apps
AI bridge enabling Cursor agents to read and modify Figma designs
Autoregressive Model Beats Diffusion
Foundational Models for State-of-the-Art Speech and Text Translation
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Diffusion Transformer with Fine-Grained Chinese Understanding
Real-time, voice-interactive digital human
OCR expert VLM powered by Hunyuan's native multimodal architecture
Generate music based on natural language prompts using LLMs
Audiocraft is a library for audio processing and generation
Free, ultrafast Copilot alternative for Vim and Neovim
Slimmed, cleaned and fine-tuned oh-my-opencode fork
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
The easiest way to use Ollama in .NET
Convert any URL to an LLM-friendly input with a simple prefix
HY-Motion model for 3D character animation generation
21 Lessons, Get Started Building with Generative AI
Crafting engine for artists, designers, and filmmakers
LLM abstractions that aren't obstructions
Qwen3.5 is the large language model series developed by the Qwen team
Visual Causal Flow
AI tool that turns Hacker News posts into daily podcast updates