Reverse-engineered Python API for Google Gemini web app
Core ML tools contain supporting tools for Core ML model conversion
Evaluation and Tracking for LLM Experiments
Convert AI papers to GUI
gpt-oss-120b and gpt-oss-20b are two open-weight language models
gpt-4o for windows, macos and linux
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools
A sound cloning tool with a web interface, using your voice
SoTA open-source TTS
Implementation of "MobileCLIP" CVPR 2024
Framework to easily create LLM powered bots over any dataset
text and image to video generation: CogVideoX (2024) and CogVideo
Open source AI wearable platform for recording and summarizing speech
GUI/CLI tool for downloading Xiaohongshu
21 Lessons, Get Started Building with Generative AI
Label Studio is a multi-type data labeling and annotation tool
The most powerful Android RPA agent framework
Run a full local LLM stack with one command using Docker
A fast TTS architecture with conditional flow matching
Replace OpenAI GPT with another LLM in your app
Offline inference engine for art, real-time voice conversations
⚡ Building applications with LLMs through composability ⚡
An MCP server that autonomously evaluates web applications
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
Habit Tracker for the AI Coding Workshop