Unified Multimodal Understanding and Generation Models
Community-maintained approach to improving access to GitHub services
Circuit diagrams and firmware source code for Gboard DIY keyboards
Quick illustration of how one can easily read books together with LLMs
A bitmap programming font optimized for coziness
OCR expert VLM powered by Hunyuan's native multimodal architecture
Context database designed specifically for AI Agents
Fast-stable-diffusion + DreamBooth
ComfyUI wrapper nodes for WanVideo and related models
Ultra-Efficient LLMs on End Device
Advanced NLP with spaCy: A free online course
Multi-tool for semantic search
SQL-Driven RAG Engine
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
CineCLI is a cross-platform command-line movie browser
Stable Diffusion web UI
95% token savings. 155x faster queries. 16 languages
Chinese XLNet pre-trained model
Framework for building neural networks
Memory-efficient and performant finetuning of Mistral's models
TensorBoard for PyTorch (and Chainer, MXNet, NumPy, etc.)
AI framework for automated short video creation and editing tools
A lightweight framework for building LLM-based agents
Structured data extraction and instruction calling with ML and LLMs
Edit videos with Claude Code