A general fine-tuning kit geared toward image/video/audio diffusion
Quick illustration of how one can easily read books together with LLMs
An opinionated CLI to transcribe audio files with Whisper on-device
Automate native Android apps with AI using accessibility APIs
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Minimal Claude Code alternative. Single Python file, zero dependencies
Open-source AI marketing skills for Claude Code
MCP server enabling AI agents to control and automate Windows OS
Fast State-of-the-Art Static Embeddings
Build reliable Gen AI solutions without overhead
An AI agent that automatically builds AI models
Lightweight demo to build a conversational AI search engine quickly
Deploy your agentic workflows to production
Ready-to-run cloud templates for RAG
An end-to-end Data Scientist
"Big Model" trains a multimodal vision-language model (VLM) with 26M parameters
A simple, secure MCP-to-OpenAPI proxy server
Bash is all you need: write your own Claude Code in only 16 lines of code
Project-scoped Lean workflow orchestrator from Math, Inc.
Automatically visualize any dataset, any size
Code and models for ICML 2024 paper, NExT-GPT
LightLLM is a Python-based LLM (Large Language Model) inference
Generate Any 3D Scene in Seconds
Implementation of Vision Transformer, a simple way to achieve SOTA