Search Results for "metadata management" - Page 2

Showing 84 open source projects for "metadata management"

  • 1
    Polymarket Data

    Polymarket Data Retriever that fetches, processes, and structures data

    Polymarket Data is a comprehensive data engineering pipeline designed to collect, process, and structure trading activity from the Polymarket prediction market ecosystem into analyzable datasets. The system operates as a multi-stage pipeline that integrates data from both off-chain APIs and on-chain event sources, enabling users to reconstruct full trading activity including markets, order events, and executed trades. It begins by fetching market metadata such as questions, outcomes, and...
    Downloads: 5 This Week
  • 2
    Fondant

    Production-ready data processing made easy and shareable

    Fondant is a modular, pipeline-based framework designed to simplify the preparation of large-scale datasets for training machine learning models, especially foundation models. It offers an end-to-end system for ingesting raw data, applying transformations, filtering, and formatting outputs—all while remaining scalable and traceable. Fondant is designed with reproducibility in mind and supports containerized steps using Docker, making it easy to share and reuse data processing components....
    Downloads: 0 This Week
  • 3
    LLM TLDR

    95% token savings. 155x faster queries. 16 languages

    LLM TLDR is a tool that leverages large language models (LLMs) to generate concise, coherent summaries (TL;DRs) of long documents, articles, or text files, helping users quickly understand large amounts of content without reading every word. It integrates with LLM APIs to handle input texts of varying lengths and complexity, applying techniques like chunking, context management, and multi-pass summarization to preserve accuracy even when the source is very large. The system supports both...
    Downloads: 0 This Week
  • 4
    MLRun

    Machine Learning automation and tracking

    MLRun is an open MLOps framework for quickly building and managing continuous ML and generative AI applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications, significantly reducing engineering efforts, time to production, and computation resources. MLRun breaks the silos between data, ML, software, and DevOps/MLOps teams, enabling collaboration and fast continuous...
    Downloads: 0 This Week
  • 5
    Local File Organizer

    An AI-powered file management tool that ensures privacy

    Local-File-Organizer is an AI-powered file management system designed to automatically analyze, categorize, and reorganize files stored on a user’s local machine. The project focuses on privacy-first file organization by performing all processing locally rather than sending data to external cloud services. It uses language and vision models to understand the contents of documents, images, and other file types so that files can be grouped intelligently according to their meaning or context. ...
    Downloads: 0 This Week
  • 6
    Quod Libet

    Music player and music library manager for Linux, Windows, and macOS

    Quod Libet is a cross-platform audio/music management program. It provides many ways to view your local library, and supports streaming audio and feeds (podcasts, etc). It has extremely flexible metadata editing and searching capabilities. With over 90 plugins included, you can extend and integrate with almost anything, or write your own! Ex Falso is a bare-bones tag editor with the same editing interface as Quod Libet.
    Downloads: 4 This Week
  • 7
    Paperless-AI

    AI-powered document analysis and tagging for Paperless-ngx

    Paperless-AI is an AI-powered extension designed to enhance document management within Paperless-ngx by automating analysis, classification, and organization tasks. It continuously monitors incoming documents and processes them using various AI backends, enabling automatic assignment of titles, tags, document types, and correspondents. It integrates with multiple OpenAI-compatible services as well as local models, giving users flexibility in how document intelligence is handled. A key...
    Downloads: 1 This Week
  • 8
    SuggestArr

    Request recommended movies, TV shows and anime to Jellyseerr/Overseerr

    SuggestArr is an open-source automation platform designed to recommend and automatically request movies, TV shows, and anime based on a user’s viewing history in self-hosted media servers. The project integrates with popular media management systems such as Jellyfin, Plex, and Emby, allowing it to analyze recently watched content and identify similar titles using metadata from the TMDb database. Once potential recommendations are identified, SuggestArr can automatically send download or request instructions to services like Jellyseerr or Overseerr, which then coordinate with media download tools and libraries. ...
    Downloads: 0 This Week
  • 9
    gravitino

    Unified metadata lake for data & AI assets.

    Apache Gravitino is a high-performance, geo-distributed, and federated metadata lake. It manages metadata directly in different sources, types, and regions, providing users with unified metadata access for data and AI assets.
    Downloads: 0 This Week
  • 10
    PSCoverDL

    Small tool to download PS1/PS2 covers for DuckStation and PCSX2

    PSCoverDL is a companion tool designed to automate the downloading and management of PlayStation game cover art for emulators, providing a user-friendly interface for populating game libraries with visuals. It allows users to select their emulator, specify directories, and automatically retrieve matching cover images based on game metadata. The tool supports multiple platforms and includes features such as multithreading for faster downloads and configuration files for customization. ...
    Downloads: 6 This Week
  • 11
    Exile is a Python-based image collection manager application. Easily add metadata to photos, including Caption, People, Event, Location and Tags. No external database: stores metadata in Exif/IPTC/XMP tags. Three-level categorization for easy photo sorting and management. Clone GPS data between files.
    Downloads: 0 This Week
  • 12
    Metador

    Metador is a modern GTK4/Libadwaita application that securely cleans, edits, and manages sensitive metadata from your files.
    Downloads: 1 This Week
  • 13
    AIConfig

    AIConfig is a config-based framework to build generative AI apps

    AIConfig is an open-source framework designed to simplify the development and management of generative AI applications by separating AI logic from application code. The framework allows prompts, model configurations, and parameters to be stored as structured configuration files that can be version controlled and managed independently from the rest of the software system. This approach improves collaboration between developers, prompt engineers, and machine learning practitioners by turning...
    Downloads: 0 This Week
  • 14
    Metagify

    Audio metadata editor with MusicBrainz integration.

    Metagify is an open-source desktop application designed to provide a streamlined solution for editing audio file metadata. Built with Python and PyQt5, it offers a powerful and intuitive interface for single-file and batch-editing of tags, as well as seamless integration with the MusicBrainz database.
    Downloads: 1 This Week
  • 15
    Barman for PostgreSQL

    Backup and Recovery Manager for PostgreSQL

    Barman (Backup and Recovery Manager) is an administration tool, written in Python, for disaster recovery of PostgreSQL servers. It lets you perform remote backups of multiple servers in business-critical environments and helps DBAs during the recovery phase. Barman's most wanted features include backup catalogs, retention policies, remote recovery, and archiving and compression of WAL files and backups. Barman is written and maintained by the PostgreSQL professionals at 2ndQuadrant.
    Leader badge
    Downloads: 15 This Week
  • 16
    dbt-re-data

    re_data - fix data issues before your users & CEO would discover them

    re_data is an open-source data reliability framework for the modern data stack. Currently, re_data focuses on observing the dbt project (together with the underlying data warehouse: Postgres, BigQuery, Snowflake, or Redshift). Data transformations in re_data are implemented and exposed as models & macros in this dbt package. Gather all relevant outputs about your data in one place using our cloud. Invite your team and debug it easily from there. Go back in time, and see your past metadata. Set up...
    Downloads: 0 This Week
  • 17
    Nostalgic Photo DataBase (platform)

    Active repository of jpeg-photos with tags suitable for personal needs

    NPDB offers a comprehensive platform for creating and maintaining a database of both old, digitized images and new snapshots captured by smartphones. This versatile system allows users to organize and search through their collection using customizable tags, catering to images of any vintage. One of NPDB's key features is its flexible tagging system, which allows users to categorize their images using an arbitrary set of tags tailored to their preferences. This intuitive approach streamlines...
    Downloads: 0 This Week
  • 18
    odd-collector

    Open-source metadata collector based on ODD Specification

    ODD Collector is a lightweight service that gathers metadata from all your data sources. Push-client is a provider which sends information directly to the central repository of the Platform. ODDRN (Open Data Discovery Resource Name) is a unique resource name that identifies entities such as data sources, data entities, dataset fields etc. It is used to build lineage and update metadata.
    Downloads: 0 This Week
  • 19
    S3cmd

    Command line tool for managing Amazon S3 and CloudFront services

    ...Lots of features and options have been added to S3cmd, since its very first release in 2008.... we recently counted more than 60 command-line options, including multipart uploads, encryption, incremental backup, s3 sync, ACL and Metadata management, S3 bucket size, bucket policies, and more!
    Downloads: 0 This Week
  • 20
    odd-collector-gcp

    Open-source GCP metadata collector based on ODD Specification

    ODD Collector GCP is a lightweight service which gathers metadata from all your Google Cloud Platform data sources.
    Downloads: 0 This Week
  • 21
    s3cmd

    Command line tool for managing Amazon S3 and CloudFront services

    ...Lots of features and options have been added to s3cmd since its very first release in 2008.... we recently counted more than 60 command line options, including multipart uploads, encryption, incremental backup, s3 sync, ACL and Metadata management, S3 bucket size, bucket policies, and more!
    Leader badge
    Downloads: 988 This Week
  • 22
    RasaGPT

    Headless Rasa chatbot platform with LLM integration and APIs

    ...RasaGPT includes a FastAPI backend for creating custom bot endpoints, along with document ingestion and a training pipeline. It simplifies integration challenges between Rasa and LLM libraries, including metadata handling and library conflicts. RasaGPT supports multi-tenant deployments, session management, and custom schemas using pgvector. It also enables Telegram bot integration and remote access via ngrok. Docker support allows easier setup and deployment, particularly on macOS environments. While designed as a working prototype, it provides a practical foundation for developers building LLM-powered chatbot applications with extensible architecture and preconfigured components.
    Downloads: 1 This Week
  • 23
    Termux APT Repo

    Script to create Termux apt repositories

    termux-apt-repo is a script designed to create APT repositories for Termux, allowing users to publish and distribute their own packages. It supports cross-compiled packages created using the Termux build setup or on-device packages created with termux-create-package. This tool facilitates the sharing and installation of custom packages within the Termux environment.
    Downloads: 4 This Week
  • 24
    QuickPlot

    Simple user interface for gnuplot aimed at reflectometry data

    Graphical user interface for gnuplot to create publication-quality figures very quickly. It supports templates for fast formatting of graphics, different plot styles, insets, and axis and label options. One important feature is storing metadata in PNG and PDF files, which can be used to reload any graph saved with QuickPlot.
    Downloads: 0 This Week
  • 25
    Zero Install
    Zero Install is a decentralised cross-distribution software installation system. Create one package that works everywhere! With dependency handling and automatic updates, full support for shared libraries, and integration with native package managers.
    Leader badge
    Downloads: 3,270 This Week