End-to-end speech processing toolkit
Readest is a modern, feature-rich ebook reader
Framework for building real-time multimodal voice AI agent apps
Relax! Flux is the ML library that doesn't make you tensor
Models for the spaCy Natural Language Processing (NLP) library
Improve your resumes with Resume Matcher
AI-Powered Knowledge Graph Generator
Multimodal model achieving SOTA performance
Self-hosted AI audio transcription
Fast backend for long-term AI user memory via structured profiles
The AI toolkit for the AI developer
A theoretical reconstruction of the Claude Mythos architecture
Run local LLMs like Llama, DeepSeek, Kokoro, etc. inside your browser
Use any LLM (Large Language Model) for Deep Research
Open Agent Harness with a built-in personal agent, Ohmo
A powerful Zotero AI and MCP plugin with ChatGPT, Gemini 3.1, Claude
OCR expert VLM powered by Hunyuan's native multimodal architecture
AI health assistant for private, local, data-driven insights management
AudioMuse-AI is an open-source Dockerized environment
Open source no-code system for text annotation and building of text
Edit videos with Claude Code
Please do not feed the models
This repository contains code released by Google Research
An end-to-end Data Scientist
Open-source multi-speaker long-form text-to-speech model