Private chat with local GPT with document, images, video, etc.
A reactive notebook for Python
A batteries-included library for building AI-powered software
GLM-4 series: Open Multilingual Multimodal Chat LMs
Qwen-Image is a powerful image generation foundation model
Qwen3-omni is a natively end-to-end, omni-modal LLM
An Async Bot/API wrapper for Twitch made in Python
Learn to build your Second Brain AI assistant with LLMs
A course of learning LLM inference serving on Apple Silicon
Automate native Android apps with AI using accessibility APIs
Unified Multimodal Understanding and Generation Models
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
kaldi-asr/kaldi is the official location of the Kaldi project
An Efficient Agentic Model for Computer Use
Finding the Scaling Law of Agents. A multi-agent framework
Comprehensive paid advertising audit & optimization skill
MARS5 speech model (TTS) from CAMB.AI
An LLM-powered knowledge curation system that researches topics
Claude Code skill for generating production-quality SVG+PNG technical
High-performance inference server for text embeddings models API layer
VMZ: Model Zoo for Video Modeling
Official implementation of Watermark Anything with Localized Messages
Multilingual Automatic Speech Recognition with word-level timestamps
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
Bidirectional token-classification model for identifiable info