Follow along with my AI Agents Masterclass videos
Document Image Parsing via Heterogeneous Anchor Prompting”
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
Open-source Video Translation Skill
Multi-source content processor for NotebookLM
Claude Code, but it runs on your Mac for free
Build reliable Gen AI solutions without overhead
The first AI agent that builds permissionless integrations
AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories
Spark-TTS Inference Code
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
StreamSpeech is a seamless model for offline speech recognition
Easy Docker setup for Stable Diffusion with user-friendly UI
Build Vision Agents quickly with any model or video provider
MCP integration platforms for AI agents to use tools at any scale
Synchronized Translation for Videos
Smart Thermodynamic Modeling with Graph Neural Networks
ChatGPT interface with better UI
Turn your website into a GIF
Automatic question answering for local knowledge bases based on LLM
StudioOllamaUI is a local, portable interface for Ollama
Chatbot with GNNPCSAFT
Did you say you like data?
Code for Language models can explain neurons in language models paper
FaceOnLive Open KYC: Streamlining Identity Verification with AI