Multimodal AI chat app with dynamic conversation routing
Bidirectional token-classification model for identifiable info
Annotations docblock parser
A Python tool that uses GPT-4, FFmpeg, and OpenCV
Large-language-model & vision-language-model based on Linear Attention
Quick illustration of how one can easily read books together with LLMs
A Systematic Framework for Interactive World Modeling
Fast multimodal LLM for real-time voice interaction and AI apps
Autoregressive Model Beats Diffusion
Diffusion Transformer with Fine-Grained Chinese Understanding
A speech-text foundation model for real-time dialogue
The Markdown Editor for Linux
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
MARS5 speech model (TTS) from CAMB.AI
Search all of YouTube from the command line
Python CLI utility and library for manipulating SQLite databases
Stable Diffusion built into Blender
Context-aware desktop AI assistant that understands screen content
Real-time voice interactive digital human
Create native Mac applications from command line scripts
A collection of notebooks/recipes showcasing ways of using Claude
Re-editable LaTeX/Typst graphics for Inkscape
Username OSINT tool for discovering accounts across many websites
An adaptive web scraping framework
High-Resolution 3D Assets Generation with Large Scale Diffusion Models