Dealing with all unstructured data, such as reverse image search
The data structure for multimodal data
Open Source Document Management System for Digital Archives
Generate audiobooks from e-books, voice cloning & 1107+ languages
The open-source data curation platform for LLMs
Python binding to the Apache Tika™ REST services
Context database designed specifically for AI Agents
Scrape job websites into a single spreadsheet with no duplicates.
An advanced paper search agent powered by large language models
Open-source choice to scale, assess and maintain natural language data
Local RAG engine for private multimodal knowledge search on devices
Open source file indexing & storage analytics powered by Elasticsearch
A youtube-dl fork with additional features and fixes
Powerful open source team chat application
Reading book source
The ultimate RAG for your monorepo
Cut videos with a text editor
Rename anything
Ready-to-run cloud templates for RAG
Open source personal AI Assistant for Linux, Windows and Mac
Implementation of "MobileCLIP" CVPR 2024
The official Python SDK for the ElevenLabs API
Claude Code skill implementing Manus-style persistent planning
Speech recognition module for Python
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts