A Python tool that uses GPT-4, FFmpeg, and OpenCV
InvokeAI is a leading creative engine for Stable Diffusion models
A text-to-speech, speech-to-text and speech-to-speech library
Fast and accurate AI-powered file content type detection
The best ChatGPT that $100 can buy
Open-source tool designed to enhance the efficiency of workloads
Definitions for AI/ML tasks like dataset creation
Meta Agents Research Environments is a comprehensive platform
Arcade Tool Development Kit (TDK), Worker, Evals, and CLI
AI-powered tool for automated pull request analysis
Python SDK for the Computer Use model Lux, developed by OpenAGI
Open-source codebase for Scale Agentex
GPT-4V-level open-source multimodal model based on Llama3-8B
An open-source end-to-end VLM-based GUI agent
Replace OpenAI GPT with another LLM in your app
Simplifies the local serving of AI models from any source
Easy-to-use Speech Toolkit including Self-Supervised Learning model
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Chinese and English multimodal conversational language model
Stable Diffusion with Core ML on Apple Silicon
AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework
Context management for Claude Code. Hooks maintain state via ledgers
When LLM Meets Domain Experts
A fast TTS architecture with conditional flow matching
A minimal yet professional single agent demo project