Showing 127 open source projects for "metadata management"

  • 1
    JabRef Bibliography Management

    Graphical Java application for managing BibTeX and biblatex

    JabRef is an open-source, cross-platform citation and reference management tool. Stay on top of your literature: JabRef helps you to collect and organize sources, find the paper you need and discover the latest research. JabRef is available free of charge and is actively developed. It supports you in every step of your research work.
    Downloads: 0 This Week
  • 2
    DataHub

    The Metadata Platform for your Data and AI Stack

    DataHub is an open source metadata platform that helps organizations discover, understand, and trust their data assets at scale. It models data as a richly connected graph spanning datasets, dashboards, pipelines, ML features, and services, so users can explore relationships like lineage and ownership across tools and domains. The platform focuses on continuous metadata ingestion from many sources, treating metadata as a stream that stays fresh as systems change. A modern web UI and search...
    Downloads: 2 This Week
  • 3
    Apache Polaris

    Apache Polaris, the interoperable, open source catalog

    Apache Polaris is an open-source metadata catalog and data management service designed to manage Apache Iceberg tables in modern data lakehouse environments. It provides a centralized catalog that allows multiple compute engines and analytics systems to interact with the same datasets through a standardized interface. By implementing the Iceberg REST catalog API, Polaris enables distributed data platforms to access shared table metadata without tightly coupling storage systems and query engines. ...
    Downloads: 0 This Week
  • 4
    Copybara

    Copybara: A tool for transforming and moving code between repositories

    ...Copybara is particularly useful in workflows where projects maintain both confidential (internal) and public (open source) repositories, enabling controlled synchronization and contribution management between them. The tool supports advanced transformations—such as file relocation, content replacement, and metadata adjustments—defined declaratively in configuration files. It operates in a stateless manner, storing synchronization state within commit metadata to ensure reproducibility and collaboration among multiple users. Copybara currently supports Git repositories (with experimental Mercurial support) and can be integrated with CI/CD systems or run manually.
    Downloads: 96 This Week
  • 5
    Booklore

    Digital library with smart shelves

    Booklore is a comprehensive self-hosted digital library platform that helps readers organize, manage, track, and even read their personal collections of books and comics from a centralized web interface. It provides powerful metadata management that automatically fetches rich details like titles, authors, covers, and publication info so your library looks organized and beautiful without manual data entry. Users can create smart shelves with custom filters, build dynamic collections that update automatically, and search through thousands of entries instantly, turning chaotic folders of files into a curated reading experience. ...
    Downloads: 26 This Week
  • 6
    Datacap

    DataCap is integrated software for data transformation

    Datacap is an open-source data catalog and governance tool that helps organizations manage and document their data assets. It provides metadata management, lineage tracking, and collaboration features to ensure data transparency and quality. Datacap is designed for teams that need a lightweight, self-hosted solution to organize and govern their data ecosystems.
    Downloads: 0 This Week
  • 7
    BigQuery Utils

    Useful scripts, UDFs, views, and other utilities for migration

    BigQuery Utils is a large utility repository focused on helping users operate, optimize, and migrate workloads in BigQuery through reusable assets rather than a single application. It brings together scripts, user-defined functions, views, stored procedures, dashboards, notebooks, and supporting tools that address common data warehouse and analytics tasks. The repository is especially useful for organizations that need practical building blocks for migration from other database systems,...
    Downloads: 4 This Week
  • 8
    Nacos

    Dynamic Naming and Configuration Service

    Nacos is an easy-to-use, one-stop solution for dynamic service discovery, configuration and service management that allows you to easily build cloud native applications and microservices platforms. It supports almost all types of services, such as Kubernetes service, Spring Cloud RESTful service, or Dubbo/gRPC service. Nacos is lightweight, easy to deploy and production-ready, having originated from time-tested internal products from Alibaba Group. It’s highly adaptive to cloud...
    Downloads: 4 This Week
  • 9
    Grimmory

    Grimmory is the successor of Booklore

    ...It includes a built-in browser-based reader that allows users to read, annotate, and track progress directly within the application. Grimmory also emphasizes automation through features like metadata enrichment and automatic file ingestion, reducing manual effort when managing large collections. Multi-user support enables separate libraries, preferences, and reading progress for different users within the same instance.
    Downloads: 0 This Week
  • 10
    Semantic Type Detection

    Metadata/data identification Java library

    Metadata/data identification Java library. Identifies Base Type (e.g. Boolean, Double, Long, String, LocalDate, LocalTime, ...) and Semantic Type information (e.g. Gender, Age, Color, Country, ...). Extensive country/language support. Extensible via user-defined plugins. Comprehensive Profiling support. Large set of built-in Semantic Types (extensible via JSON-defined plugins). Extensive Profiling metrics (e.g. Min, Max, Distinct, signatures, …). Sufficiently fast to be used inline. See Speed...
    Downloads: 1 This Week
  • 11
    Genie

    Distributed Big Data Orchestration Service

    Genie is a completely open source distributed job orchestration engine developed by Netflix. Genie provides RESTful APIs to run a variety of big data jobs like Hadoop, Pig, Hive, Spark, Presto, Sqoop and more. It also provides APIs for managing the metadata of many distributed processing clusters and the commands and applications which run on them.
    Downloads: 0 This Week
  • 12
    Signal Server

    Server supporting the Signal Private Messenger applications on Android

    Signal Server is the official backend server component that supports the Signal Private Messenger ecosystem on Android, iOS, and desktop platforms, serving as the trusted message relay, contact discovery, and push notification hub while the clients handle end-to-end encryption. Built in Java and licensed under the AGPL-3.0 license, this server software handles protocols for delivering encrypted messages, synchronizing user metadata like public keys and push tokens, and providing REST and...
    Downloads: 0 This Week
  • 13
    Apache Hudi

    Upserts, Deletes And Incremental Processing on Big Data

    Apache Hudi (pronounced Hoodie) stands for Hadoop Upserts Deletes and Incrementals. Hudi manages the storage of large analytical datasets on DFS (Cloud stores, HDFS or any Hadoop FileSystem compatible storage). Apache Hudi is a transactional data lake platform that brings database and data warehouse capabilities to the data lake. Hudi reimagines slow old-school batch data processing with a powerful new incremental processing framework for low latency minute-level analytics. Hudi provides...
    Downloads: 0 This Week
  • 14
    LogicalDOC Document Management - DMS

    smart and open source document management system

    LogicalDOC is both a document management system and a collaboration system. The software offers many functions for organizing, indexing, retrieving, controlling and distributing important business documents securely for any organization or individual. Gone are the days when companies used paper-based processes such as printing, mailing and manual filing of paper documents; our document management system replaces all of this with electronic procedures that allow your...
    Downloads: 195 This Week
  • 15
    gravitino

    Unified metadata lake for data & AI assets.

    Apache Gravitino is a high-performance, geo-distributed, and federated metadata lake. It manages metadata directly in different sources, types, and regions, providing users with unified metadata access for data and AI assets.
    Downloads: 0 This Week
  • 16
    OpenKM Document Management - DMS

    Document Management System and Content Management System

    OpenKM Community Edition is a free Document Management System (DMS) that helps businesses control the production, storage, management and distribution of electronic documents, boosting effectiveness and productivity. It integrates document management, collaboration and advanced search into one easy-to-use solution, including administration tools for user roles, access control, security levels, activity logs and automation setup. With OpenKM Community Edition you can: Collect information...
    Downloads: 457 This Week
  • 17
    CERCA

    CERCA – Citation Extraction & Reference Checking Assistant

    CERCA is an open-source research tool that supports the verification of bibliographic references in scientific manuscripts. It extracts references from PDF files and checks their existence and consistency against authoritative metadata sources, producing explainable diagnostics, audit logs, and reproducible reports. It is intended for: - Researchers performing final manuscript checks - Reviewers assessing reference consistency - Editors supporting editorial quality control -...
    Downloads: 5 This Week
  • 18
    ResCarta

    Archive your personal history

    ResCarta Toolkit offers an open source solution to creating, storing, viewing, and searching digital collections. Applications in the toolkit let users create and edit metadata, convert data to open standard ResCarta format, index and host collections.
    Downloads: 44 This Week
  • 19
    Picsimilar

    Search and compare similar and identical photos.

    Use reverse image search to find similar and duplicate images in your local photo collection; Use comparison features to select the best images in a set of similar ones.
    Downloads: 6 This Week
  • 20
    MGB OpenSource Guestbook
    MGB is a free OpenSource Guestbook completely written in PHP, using JavaScript and a MySQL Database. Easy to use, flexible and customizable with templates to make it fit 100% to your homepage.
    Downloads: 4 This Week
  • 21
    Web-based cataloging and deduplication application, highly optimized for processing journal articles. Reads MarcXML and dedupes records using field 773 combined with a fuzzy search on the title. Written for bibnet.org
    Downloads: 0 This Week
  • 22
    kmttg

    TiVo To Go and advanced Remote Control

    NOTE: Check Wiki for abandoned project notes. kmttg is a Java-based program I wrote to facilitate TivoToGo (TTG) transfers. It can download shows, create pyTivo metadata, decrypt, run comskip & comcut (commercial detection and removal), create closed captions files and re-encode multiple shows you select from your Tivos, all automatically. The program can also transfer and process shows automatically from your Tivos based on titles and keywords you set up. kmttg also has an advanced Remote Control which, among many other things, allows for Season Pass management, backup/restore, copying, etc. ...
    Downloads: 33 This Week
  • 23
    ModelDB

    Open Source ML Model Versioning, Metadata, and Experiment Management

    An open-source system for machine learning model versioning, metadata, and experiment management. ModelDB versions machine learning models, including their ingredients (code, data, config, and environment), and tracks ML metadata across the model lifecycle.
    Downloads: 0 This Week
  • 24
    MyCoRe

    your repository framework

    MyCoRe is an Open Source project for the development of Repositories, Digital Library and archive solutions. The technical base of the system is formed of Java class libraries, XML technology and different database backends. Since 2015 we use https://mycore.atlassian.net/ for bug tracking. Please use our ticket system there.
    Downloads: 0 This Week
  • 25
    Cerberus Content Management System

    Cerberus Content Management System

    Cerberus Content Management System is a Monolithic and Modular Content Management System that is written in 100% Pure PHP code with 100% Pure HTML output, and it supports multiple Database Management Systems. Cerberus Content Management System source code is completely handwritten by the author(s). The CerberusCMS project is focused on data security and ease of use, therefore we have decided to make very little use of JavaScript in the PurePHP Releases. The still-secure, and...
    Downloads: 0 This Week