A community-supported supercharged version of paperless
Use Microsoft Edge's online text-to-speech service from Python
Visual intelligence for your home.
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
An experimental version of the DeepSeek model
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
SOTA Open Source TTS
Everything you need to build state-of-the-art foundation models
AI-powered tool for generating, optimizing, and translating subtitles
From Images to High-Fidelity 3D Assets
A high-throughput and memory-efficient inference and serving engine
Reverse engineering Gemini's SynthID detection
Open-source AI hackers to find and fix your app’s vulnerabilities
Automate native Android apps with AI using accessibility APIs
Sample code and notebooks for Generative AI on Google Cloud
Generate audiobooks from e-books
Label Studio is a multi-type data labeling and annotation tool
Create UIs for your machine learning model in Python in 3 minutes
A backup-first Codex skill for keeping local Codex state fast
Python library for building agents that leverages Google Antigravity
Agent Skill for generating 2D sprite sheets and map, transparent PNG
Lightweight package to simplify LLM API calls
A Python wrapper you can't refuse
Advancing Open-source World Models
A nearly-live implementation of OpenAI's Whisper