14-stage Fusion Pipeline for LLM token compression
State-of-the-art 2D and 3D Face Analysis Project
Audiocraft is a library for audio processing and generation
Qwen3-TTS is an open-source series of TTS models
Semantic search and workflows for medical/scientific papers
Official repository for LTX-Video
GUI/CLI tool for downloading Xiaohongshu
Open-source autonomous AI software engineer
Run a full local LLM stack with one command using Docker
Foundational model for human-like, expressive TTS
Use Microsoft Edge's online text-to-speech service from Python
Framework to easily create LLM powered bots over any dataset
Models for the spaCy Natural Language Processing (NLP) library
Kimi Code CLI is your next CLI agent
Phi-3.5 for Mac: Locally-run Vision and Language Models
Go ahead and axolotl questions
Official inference repo for FLUX.1 models
A curated collection of skills for AI coding agents
Follow along with my AI Agents Masterclass videos
Focus on prompting and generating
Qwen2.5-VL is the multimodal large language model series
K8s-mcp-server is a Model Context Protocol (MCP) server
Qwen3-Coder is the code version of Qwen3
One-click deployment (including offline integration package)