Showing 380 open source projects for "metadata"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    DataHub

    DataHub

    The Metadata Platform for your Data and AI Stack

    DataHub is an open source metadata platform that helps organizations discover, understand, and trust their data assets at scale. It models data as a richly connected graph spanning datasets, dashboards, pipelines, ML features, and services, so users can explore relationships like lineage and ownership across tools and domains. The platform focuses on continuous metadata ingestion from many sources, treating metadata as a stream that stays fresh as systems change. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Copybara

    Copybara

    Copybara: A tool for transforming and moving code between repositories

    ...Copybara is particularly useful in workflows where projects maintain both confidential (internal) and public (open source) repositories, enabling controlled synchronization and contribution management between them. The tool supports advanced transformations—such as file relocation, content replacement, and metadata adjustments—defined declaratively in configuration files. It operates in a stateless manner, storing synchronization state within commit metadata to ensure reproducibility and collaboration among multiple users. Copybara currently supports Git repositories (with experimental Mercurial support) and can be integrated with CI/CD systems or run manually.
    Downloads: 59 This Week
    Last Update:
    See Project
  • 3
    Apache Polaris

    Apache Polaris

    Apache Polaris, the interoperable, open source catalog

    Apache Polaris is an open-source metadata catalog and data management service designed to manage Apache Iceberg tables in modern data lakehouse environments. It provides a centralized catalog that allows multiple compute engines and analytics systems to interact with the same datasets through a standardized interface. By implementing the Iceberg REST catalog API, Polaris enables distributed data platforms to access shared table metadata without tightly coupling storage systems and query engines. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Booklore

    Booklore

    Digital library with smart shelves

    Booklore is a comprehensive self-hosted digital library platform that helps readers organize, manage, track, and even read their personal collections of books and comics from a centralized web interface. It provides powerful metadata management that automatically fetches rich details like titles, authors, covers, and publication info so your library looks organized and beautiful without manual data entry. Users can create smart shelves with custom filters, build dynamic collections that update automatically, and search through thousands of entries instantly, turning chaotic folders of files into a curated reading experience. ...
    Downloads: 28 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    Spring Initializr

    Spring Initializr

    A quickstart generator for Spring projects

    A quickstart generator for Spring projects. The various options for the projects are expressed in a metadata model that allows you to configure the list of dependencies, supported JVM and platform versions, etc. Spring Initializr also exposes web endpoints to generate an actual project and also serve its metadata in a well-known format to allow third-party clients to provide the necessary assistance. A set of optional conventions for Spring Boot projects is provided and are used in our production instance at https://start.spring.io. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 6
    Datacap

    Datacap

    DataCap is integrated software for data transformation

    Datacap is an open-source data catalog and governance tool that helps organizations manage and document their data assets. It provides metadata management, lineage tracking, and collaboration features to ensure data transparency and quality. Datacap is designed for teams that need a lightweight, self-hosted solution to organize and govern their data ecosystems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    OpenDataLoader PDF

    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    ...It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes. The tool combines deterministic parsing methods with an optional hybrid AI-powered mode that improves extraction quality for difficult layouts such as multi-column documents, scanned files, and scientific papers. It includes built-in OCR capabilities supporting dozens of languages, making it suitable for digitizing low-quality or image-based PDFs. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 8
    BigQuery Utils

    BigQuery Utils

    Useful scripts, udfs, views, and other utilities for migration

    ...It also supports day-to-day operational work by offering optimization scripts, billing queries, performance testing examples, and dashboards built on INFORMATION_SCHEMA metadata so teams can better understand slot usage, reservations, job execution, and errors.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    Google Cloud Dataflow Template Pipelines

    Google Cloud Dataflow Template Pipelines

    Cloud Dataflow Google-provided templates for solving data tasks

    ...The repository is centered on templated pipelines powered by Google Cloud Dataflow and Apache Beam, making it easier to run common integration and movement jobs such as data import, export, backup, restore, and bulk API operations. Its structure shows support for multiple generations of templates, including v1 and v2 implementations, as well as related metadata, YAML assets, plugins, and Python components that support broader template execution and maintenance. This design makes the project more than a sample set, because it acts as the implementation base for official Google-provided templates used in real cloud data workflows.
    Downloads: 7 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Spring Data REST

    Spring Data REST

    Simplifies building hypermedia-driven REST web services

    ...Exposes dedicated search resources for query methods defined in your repositories. Allows to hook into the handling of REST requests by handling Spring ApplicationEvents. Exposes metadata about the model discovered as ALPS and JSON Schema. Allows to define client specific representations through projections. Ships a customized variant of the HAL Explorer to leverage the exposed metadata.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Picture Metadata-WP

    Picture Metadata-WP

    Picture Metadata Workplace

    This is a simple GUI tool to handle JPG metadata like EXIF and IPTC/XMP keywords and dates according to my needs. It uses ExifTool by Phil Harvey to read and write data and can get keywords and star ratings from Adobes Photoshop Elements Organizer (trademark of adobe). It can do manual geo tagging and can read location informations from Open Street Map. Geo tagging from GPX file is also possible.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    GeoNetwork opensource - Metadata Catalog
    A web based Metadata Catalog Server for data description and discovery. Supports both generic and geospatial data discovery. It implements international standards (e.g. ISO19115/19139/19119, ISO19115-3, DCAT-AP, CSW 2.0, OGC API Records). It originates from the United Nations and is used by many governments as geoportal software. Active development and discussion takes place on GitHub and OSGeo Discourse.
    Leader badge
    Downloads: 191 This Week
    Last Update:
    See Project
  • 13
    Zipkin

    Zipkin

    Distributed tracing system to gather timing data

    Zipkin is a distributed tracing system. It helps gather timing data needed to troubleshoot latency problems in service architectures. Features include both the collection and lookup of this data. If you have a trace ID in a log file, you can jump directly to it. Otherwise, you can query based on attributes such as service, operation name, tags and duration. Some interesting data will be summarized for you, such as the percentage of time spent in a service, and whether or not operations...
    Downloads: 27 This Week
    Last Update:
    See Project
  • 14
    Signal Server

    Signal Server

    Server supporting the Signal Private Messenger applications on Android

    ...Its codebase is public to ensure transparency and auditability for a security-focused community willing to inspect how message metadata and delivery mechanisms are implemented.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Semantic Type Detection

    Semantic Type Detection

    Metadata/data identification Java library

    Metadata/data identification Java library. Identifies Base Type (e.g. Boolean, Double, Long, String, LocalDate, LocalTime, ...) and Semantic Type information (e.g. Gender, Age, Color, Country, ...). Extensive country/language support. Extensible via user-defined plugins. Comprehensive Profiling support. Large set of built-in Semantic Types (extensible via JSON defined plugins).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Metadata Extractor

    Metadata Extractor

    Extracts Exif, IPTC, XMP, ICC and other metadata from image and video

    metadata-extractor is a Java library for reading metadata from media files. The library understands several formats of metadata, many of which may be present in a single image.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 17
    MonsterMusic

    MonsterMusic

    A music player on android platform, developed by Andoroid composer

    MonsterMusic is a command-line utility to manage and download music from various online platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Nacos

    Nacos

    Dynamic Naming and Configuration Service

    Nacos is an easy-to-use, one-stop solution for dynamic service discovery, configuration and service management that allows you to easily build cloud native applications and microservices platforms. It supports almost all types of services, such as Kubernetes service, Spring Cloud RESTFul service, or Dubbo/gRPC service. Nacos is lightweight, easy to deploy and production-ready, having originated from time-tested internal products from Alibaba Group. It’s highly adaptive to cloud...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    SchemaCrawler

    SchemaCrawler

    Free database schema discovery and comprehension tool

    ...SchemaCrawler supports almost any database that has a JDBC driver, but for convenience is bundled with drivers for some commonly used RDBMS systems. SchemaCrawler works with any operating system that supports Java SE 8 or better. SchemaCrawler is also a Java API that makes working with database metadata as easy as working with plain old Java objects.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 20
    Apache Iceberg

    Apache Iceberg

    Apache Iceberg

    ...Iceberg brings the reliability and simplicity of SQL tables to big data while making it possible for engines like Spark, Trino, Flink, Presto, Hive, and Impala to safely work with the same tables, at the same time. The core Java library that tracks table snapshots and metadata is complete, but still evolving. Current work is focused on adding row-level deletes and upserts, and integration work with new engines like Flink and Hive. The Iceberg format specification is being actively updated and is open for comment. Until the specification is complete and released, it carries no compatibility guarantees. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    Genie

    Genie

    Distributed Big Data Orchestration Service

    Genie is a completely open source distributed job orchestration engine developed by Netflix. Genie provides REST-ful APIs to run a variety of big data jobs like Hadoop, Pig, Hive, Spark, Presto, Sqoop and more. It also provides APIs for managing the metadata of many distributed processing clusters and the commands and applications which run on them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    JabRef Bibliography Management

    JabRef Bibliography Management

    Graphical Java application for managing BibTeX and biblatex

    JabRef is an open-source, cross-platform citation and reference management tool. Stay on top of your literature: JabRef helps you to collect and organize sources, find the paper you need and discover the latest research. JabRef is available free of charge and is actively developed. It supports you in every step of your research work.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    GROBID

    GROBID

    A machine learning software for extracting information

    ...References extraction and parsing from articles in PDF format, around .87 F1-score against on an independent PubMed Central set of 1943 PDF containing 90,125 references, and around .89 on a similar bioRxiv set of 2000 PDF (using the Deep Learning citation model). All the usual publication metadata are covered (including DOI, PMID, etc.).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    BinExport

    BinExport

    Export disassemblies into Protocol Buffers

    ...This exported data can then be used for binary comparison, diffing, and advanced analysis tasks through BinDiff or other compatible tools. BinExport captures detailed information such as instructions, functions, control flow graphs, and metadata, providing a machine-readable representation of a program’s disassembled structure. It supports multiple export formats, including binary, text, and statistics outputs, and can be used interactively or via scripting (IDC, IDAPython, or Ghidra’s headless mode). The project includes complete build instructions for Linux, macOS, and Windows, ensuring reproducibility across platforms.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    The Ballerina programming language

    The Ballerina programming language

    The Ballerina Programming Language

    Ballerina is an open-source programming language for the cloud that makes it easier to use, combine, and create network services. Network primitives in the language make it simpler to write services and run them in the cloud. Structural types with support for openness are used both for static typing within a program and for describing service interfaces. Type-safe, declarative processing of JSON, XML, and tabular data with language-integrated queries. Explicit error handling, static types,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB