Reverse-engineered Python API for Google Gemini web app
Core ML tools contain supporting tools for Core ML model conversion
Evaluation and Tracking for LLM Experiments
Convert AI papers to GUI
gpt-oss-120b and gpt-oss-20b are two open-weight language models
gpt-4o for windows, macos and linux
A sound cloning tool with a web interface, using your voice
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools
SoTA open-source TTS
Implementation of "MobileCLIP" CVPR 2024
Framework to easily create LLM powered bots over any dataset
text and image to video generation: CogVideoX (2024) and CogVideo
Open source AI wearable platform for recording and summarizing speech
Context database designed specifically for AI Agents
Label Studio is a multi-type data labeling and annotation tool
Run a full local LLM stack with one command using Docker
A fast TTS architecture with conditional flow matching
Replace OpenAI GPT with another LLM in your app
GUI/CLI tool for downloading Xiaohongshu
An MCP server that autonomously evaluates web applications
Offline inference engine for art, real-time voice conversations
The most powerful Android RPA agent framework
21 Lessons, Get Started Building with Generative AI
⚡ Building applications with LLMs through composability ⚡
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph