Showing 40 open source projects for "metadata tool"

  • 1
    Aim

    An easy-to-use & supercharged open-source experiment tracker

    Aim logs all your AI metadata (experiments, prompts, etc.), providing a UI to compare and observe them and an SDK to query them programmatically. The Aim standard package comes with all integrations. If you'd like to customize an integration, create a new integration package and share it with others. Aim is an open-source, self-hosted AI metadata tracking tool designed to handle 100,000s of tracked metadata sequences.
    Downloads: 2 This Week
  • 2
    ebook2audiobook

    Generate audiobooks from e-books, voice cloning & 1107+ languages

    ebook2audiobook is a tool to convert legally obtained eBooks (non-DRM) into fully narrated audiobooks, complete with chapters and metadata. It automates the pipeline: it reads the eBook file, splits it into appropriate segments (chapters, paragraphs), uses text-to-speech (TTS) models to synthesize audio, optionally applies voice cloning, and outputs a final audiobook — ideal for people who prefer listening over reading, or for accessibility purposes.
    Downloads: 16 This Week
  • 3
    Embedding Atlas

    Tool that provides interactive visualizations for large embeddings

    Embedding Atlas is an open-source tool by Apple that provides scalable, interactive visualizations for large embedding datasets. It enables users to visualize, cross-filter, and search through embeddings alongside rich metadata, all in real time using modern web-based technologies. In addition to the command line tool, Embedding Atlas is also available as a Jupyter widget.
    Downloads: 0 This Week
  • 4
    MinerU

    A high-quality tool for converting PDF to Markdown and JSON

    MinerU is an open-source, high-quality document extraction toolkit focused on converting PDFs (and other document formats) into structured Markdown and JSON. It leverages OCR and layout analysis to preserve semantic structure and metadata, ideal for research and data science workflows.
    Downloads: 15 This Week
  • 5
    MCPJungle

    Self-hosted MCP Gateway and Registry for AI agents

    ...The platform supports enterprise-grade workflows: centralized tool management, access control, self-hosting so that internal servers and tools remain under your organization’s control, and registry metadata to track what tools exist and who can use them. For organizations building internal AI automation systems, MCPJungle helps enforce governance, tool discovery, and integration scalability.
    Downloads: 0 This Week
  • 6
    Metarank

    A low code Machine Learning service that personalizes articles

    ...It’s often considered "too risky" to spend 6+ months on an in-house moonshot project to reinvent the wheel without an experienced team and with no existing open-source tools. Metarank makes personalization easy not only for Amazon but for everyone else. Ingest historical item listings, clicks, and item metadata in our simple JSON format so Metarank can find hidden dependencies in the data. No machine learning experience is required: run our CLI tool with a set of features in a YAML configuration. Run the Metarank API service, feed it real-time events, and receive a personalized ranking for your items that will boost conversion, click-through rate, or any other business-critical metric you define.
    Downloads: 0 This Week
  • 7
    Gollama

    Go manage your Ollama models

    ...Beyond standard model management, Gollama can display metadata such as size, quantization level, model family, and modification date, which helps users compare models quickly. One of its more distinctive capabilities is a VRAM estimation system that can calculate memory requirements, estimate context limits, and help users choose quantization settings that fit available hardware.
    Downloads: 2 This Week
  • 8
    GitHub MCP Server

    GitHub's official MCP Server

    The GitHub MCP Server exposes GitHub as a Model Context Protocol server so AI assistants can safely act on repos, issues, pull requests, gists, and more through a consistent tool interface. It’s designed to run locally or remotely and then be attached to MCP-capable clients (for example, Copilot Chat) so an LLM can search code, open files, create branches, draft PRs, label or triage issues, and query metadata without hard-coding GitHub APIs. The server defines tools and resources with fine-grained scopes, leaning on GitHub’s auth to enforce least privilege and auditable access. ...
    Downloads: 6 This Week
  • 9
    Paperclip

    Open-source orchestration for zero-human companies

    Paperclip is an open-source tool designed to help AI systems and developer tools access academic research papers through a standardized interface. The project implements a server based on the Model Context Protocol (MCP), a framework that allows large language models and AI agents to connect to external data sources and tools in a consistent way. By acting as a middleware layer, Paperclip aggregates multiple academic databases and exposes them through a single interface, allowing AI...
    Downloads: 18 This Week
  • 10
    EPUB to Audiobook Converter

    EPUB to audiobook converter, optimized for Audiobookshelf

    EPUB to Audiobook Converter is a tool designed to convert EPUB ebooks into chaptered audiobooks, optimized specifically for Audiobookshelf servers. It reads each chapter from an EPUB file, generates audio using a chosen text-to-speech backend, and outputs separate MP3 files with chapter titles preserved as metadata to make navigation easier. The project supports multiple TTS providers, including Microsoft Azure TTS, EdgeTTS, OpenAI TTS, local Piper, and Kokoro via an OpenAI-compatible endpoint, allowing users to choose between cloud and self-hosted voices. ...
    Downloads: 8 This Week
  • 11
    ai-renamer

    A Node.js CLI that uses Ollama and LM Studio models

    ai-renamer is a Node.js-based command-line tool that uses large language models to automatically rename files based on their content, enabling more meaningful and organized file management. Instead of relying on manual naming or metadata, the tool analyzes the actual content of files, including images, videos, and documents, to generate descriptive and context-aware filenames.
    Downloads: 1 This Week
  • 12
    tldw Server

    Your Personal Research Multi-Tool

    tldw-server (mirror) is a mirrored distribution of an open-source backend service designed to store, process, and serve summarized information extracted from long pieces of content. The name “tldw” reflects the phrase “too long; didn’t watch,” which refers to tools that condense lengthy videos, articles, or documents into concise summaries. The server component typically acts as the core infrastructure that manages summaries, metadata, and retrieval operations for client applications or user...
    Downloads: 0 This Week
  • 13
    Gate22

    Open-source MCP gateway and control plane for teams

    ...It provides a centralized layer where organizations can configure permission boundaries, role-based access, and operational constraints that govern agent behavior and tool invocation across agentic IDEs or custom agent stacks. By integrating with MCP-aware systems, Gate22 helps maintain security and compliance while enabling teams to scale agent-enabled workflows without losing observability into what actions are taken and why. It can be used to enforce fine-grained policies that restrict dangerous or unauthorized operations, track which agents are calling which tools, and record metadata for auditing and debugging.
    Downloads: 0 This Week
  • 14
    Repomix

    Repomix is a powerful tool that packs your entire repository

    Repomix is a developer utility that packages an entire code repository into a single AI-friendly file for analysis by large language models. Its core purpose is to simplify the process of feeding complex codebases into AI tools by consolidating files, metadata, and structure into a unified representation. The tool is particularly valuable for code review, refactoring assistance, and automated documentation workflows where context size matters. Repomix intelligently respects ignore rules and can compress code structure to reduce token usage while preserving meaning. It supports multiple output formats and provides token counting to help developers stay within model limits. ...
    Downloads: 0 This Week
  • 15
    OpenLIT

    OpenLIT is an open-source LLM Observability tool

    OpenLIT is an OpenTelemetry-native tool designed to help developers gain insights into the performance of their LLM applications in production. It automatically collects LLM input and output metadata and monitors GPU performance for self-hosted LLMs. OpenLIT makes integrating observability into GenAI projects effortless with just a single line of code. Whether you're working with popular LLM providers such as OpenAI and HuggingFace, or leveraging vector databases like ChromaDB, OpenLIT ensures your applications are monitored seamlessly, providing critical insights including GPU performance stats for self-hosted LLMs to improve performance and reliability. ...
    Downloads: 0 This Week
  • 16
    Data Version Control

    Git-based data version control for machine learning workflows

    DVC (Data Version Control) is an open source tool designed to bring version control principles to machine learning and data science workflows. It enables developers and data scientists to track datasets, machine learning models, and experiment results in a way that integrates with existing Git repositories. Instead of storing large datasets directly in Git, DVC keeps lightweight metadata in the repository while storing the actual data in external storage systems.
    Downloads: 0 This Week
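The idea of keeping lightweight metadata in the repository while the data lives elsewhere can be sketched as follows. This is a minimal illustration, not DVC's actual file format or API: the `track` and `restore` helpers, the `.ptr.json` suffix, and the local `cache_dir` standing in for external storage are all hypothetical.

```python
import hashlib
import json
import shutil
from pathlib import Path


def track(data_file: Path, cache_dir: Path) -> Path:
    """Copy data_file into cache_dir under its content hash and write a small pointer file.

    Hypothetical helper: the pointer file is what would be committed to Git,
    while cache_dir stands in for external storage.
    """
    digest = hashlib.sha256(data_file.read_bytes()).hexdigest()
    cache_dir.mkdir(parents=True, exist_ok=True)
    shutil.copy2(data_file, cache_dir / digest)

    pointer = data_file.with_name(data_file.name + ".ptr.json")
    meta = {
        "path": data_file.name,
        "sha256": digest,
        "size": data_file.stat().st_size,
    }
    pointer.write_text(json.dumps(meta, indent=2))
    return pointer


def restore(pointer: Path, cache_dir: Path) -> Path:
    """Recreate the data file next to its pointer from the cached copy."""
    meta = json.loads(pointer.read_text())
    target = pointer.with_name(meta["path"])
    shutil.copy2(cache_dir / meta["sha256"], target)
    return target
```

Only the small JSON pointer would need to be versioned; the large file can be deleted locally and brought back with `restore` as long as the cache still holds the object named by the hash.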
  • 17
    abogen

    Generate audiobooks from EPUBs, PDFs and text with captions

    abogen is a tool designed to generate audiobooks (or speech narrations) from textual sources such as EPUBs, PDFs, or plain text, with synchronized captions. In other words, it automates the pipeline of reading a digital book (or document), converting its text into speech via a TTS engine, and packaging the result into an audiobook format — likely along with timestamped captions or subtitles that align with the spoken audio.
    Downloads: 7 This Week
  • 18
    Skill Scanner

    Security Scanner for Agent Skills

    This repository is a public security-focused scanning tool intended to analyze and assess AI agent skills for potential issues, quality concerns, and vulnerabilities. It acts as a scanner that inspects Agent Skills packages to flag structural problems, inconsistencies, or security flaws before they are deployed or integrated into agent workflows. Because agent skills can contain executable instructions and logic, scanning them for risky patterns is essential to prevent inadvertent...
    Downloads: 1 This Week
  • 19
    shuyuan

    Reading book source

    shuyuan is a project oriented around reading and knowledge consumption, especially targeting large-scale text content such as books, articles, or educational material. The name suggests “academy” or “study hall,” and the tool aims to help users ingest, organize, and manage reading content — possibly offering features like text parsing, annotation, metadata generation, translation, or storage for later reference. The repository is set up to support document ingestion, indexing, and maybe some AI-aided summarization or lookup functions, which helps users convert large text corpora into a structured, searchable knowledge base. ...
    Downloads: 0 This Week
  • 20
    agentation

    The visual feedback tool for agents

    Agentation is a visual annotation and feedback tool designed to make interacting with AI coding agents more intuitive and precise by letting developers visually click on frontend elements in a browser and annotate them with context before sending structured feedback to an agent. Instead of describing UI elements in text — like “the blue button in the sidebar” — users click directly on elements to automatically capture selectors, positions, and contextual metadata that can be consumed by AI agents to locate exact code references. ...
    Downloads: 0 This Week
  • 21
    ContextGem

    ContextGem: Effortless LLM extraction from documents

    ContextGem is an open-source framework designed to simplify the extraction of structured data and insights from documents using large language models (LLMs). It provides a flexible, intuitive API that minimizes boilerplate code, enabling developers to build complex extraction workflows efficiently. ContextGem supports various document formats and integrates with multiple LLM providers, making it a versatile tool for tasks like contract analysis, anomaly detection, and information retrieval.
    Downloads: 0 This Week
  • 22
    XcodeBuildMCP

    A Model Context Protocol (MCP) server

    XcodeBuildMCP is a Model Context Protocol server that exposes Xcode operations as typed tools and resources so AI assistants can build, test, and debug iOS apps programmatically. It’s organized with a modern plugin architecture and workflow-scoped tool directories, covering common developer actions across projects, schemes, targets, simulators, real devices, and Swift packages. The server aims to be “agent-ready,” surfacing capabilities (build, clean, test, archive, install, run, log...
    Downloads: 0 This Week
  • 23
    Ollama Grid Search

    A multi-platform desktop application to evaluate and compare LLM

    Ollama Grid Search is a desktop application designed to automate the evaluation and comparison of large language models, prompts, and inference parameters in a structured and repeatable way. Instead of manually testing combinations, the tool performs grid search experiments by iterating across different models, prompt variations, and parameter configurations, allowing users to quickly identify optimal setups for specific tasks. It provides a visual interface where experiment results can be...
    Downloads: 2 This Week
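Iterating across models, prompt variations, and parameter configurations amounts to enumerating a Cartesian product. The sketch below shows that enumeration only; the `grid` function, the option names, and the model names are hypothetical, and it does not call Ollama or reflect how Ollama Grid Search is implemented.

```python
from itertools import product
from typing import Any, Dict, Iterator, List


def grid(
    models: List[str],
    prompts: List[str],
    params: Dict[str, List[Any]],
) -> Iterator[Dict[str, Any]]:
    """Yield one experiment description per (model, prompt, parameter combination)."""
    keys = list(params)
    # Every combination of the parameter values, in the order of `keys`.
    combos = product(*(params[k] for k in keys))
    for model, prompt, values in product(models, prompts, combos):
        yield {
            "model": model,
            "prompt": prompt,
            "options": dict(zip(keys, values)),
        }


if __name__ == "__main__":
    # Hypothetical model names and option values, for illustration only.
    runs = grid(
        ["model-a", "model-b"],
        ["Summarize: {text}"],
        {"temperature": [0.0, 0.7], "top_k": [20, 40]},
    )
    for run in runs:
        print(run)
```

Each yielded dictionary could then be handed to whatever inference client is in use, with the results stored alongside the configuration that produced them.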
  • 24
    CL4R1T4S

    Archive of leaked AI system prompts and internal instruction sets

    CL4R1T4S is a public repository that collects and archives extracted system prompts, internal guidelines, and behavioral instructions used by various artificial intelligence models and agents. Its stated goal is to promote transparency by documenting the hidden prompt scaffolding that shapes how AI systems behave and respond to users. CL4R1T4S organizes these materials by company or product, with directories containing prompt files and related instructions for many well-known AI systems....
    Downloads: 3 This Week
  • 25
    GitDiagram

    AI tool that converts GitHub repositories into interactive diagrams

    GitDiagram is an open source web application designed to help developers quickly understand the structure and architecture of GitHub repositories by automatically generating interactive diagrams. It analyzes repository metadata such as the file tree and project documentation to build a visual representation of how different components of a project relate to one another. It uses an AI-powered pipeline to interpret repository structure and transform that information into system design diagrams...
    Downloads: 3 This Week