GPT4V-level open-source multi-modal model based on Llama3-8B
An open sourced end-to-end VLM-based GUI Agent
Replace OpenAI GPT with another LLM in your app
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Simplifies the local serving of AI models from any source
Rename anything
Python app to work with pictures and associated metadata
Easy-to-use Speech Toolkit including Self-Supervised Learning model
A GraphQL client in Python
Python library to compile, build & package AWS Lambda functions
Chinese and English multimodal conversational language model
A user friendly TUI for SQL databases
A best practices guide for day 2 operations
Stable Diffusion with Core ML on Apple Silicon
Configuration Management for Python
Project structure for doing and sharing data science work
Efficiently diff rows across two different databases
AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework
Context management for Claude Code. Hooks maintain state via ledgers
When LLM Meets Domain Experts
A fast TTS architecture with conditional flow matching
A formatter for Python files
A command-line utility for taking automated screenshots of websites
MetricFlow allows you to define, build, and maintain metrics in code
A minimal yet professional single agent demo project