Showing 139 open source projects for "metadata"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    html-metadata

    html-metadata

    MetaData html scraper and parser for Node.js (supports Promises

    ...Planned is support for RDFa, AGLS, and other yet unheard-of metadata types. Contributions and requests for other metadata types are welcome! You can also pass an options object as the first argument containing extra parameters. Some websites require the user-agent or cookies to be set in order to get the response.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    MDCx

    MDCx

    Movie metadata scraper and organizer for media libraries and NFO

    MDCx is an open source media metadata scraping and organization tool designed to automate the process of collecting detailed information for movie files. It retrieves metadata from multiple online sources and applies it to local media collections, helping users maintain structured and well-organized libraries. MDCx can download information such as titles, cast data, artwork, and other metadata, then generate standardized NFO files compatible with media management systems. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    JavDB

    JavDB

    javdb official app

    ...It includes distribution options through various channels such as direct downloads, third-party installers, and release builds, making it accessible across different user environments. The application is relatively lightweight and centered around providing quick access to indexed content, metadata, and media listings available on the platform. Its development is maintained through a GitHub repository with ongoing releases and updates, indicating active iteration and refinement.
    Downloads: 46 This Week
    Last Update:
    See Project
  • 4
    diskover-community

    diskover-community

    Open source file indexing & storage analytics powered by Elasticsearch

    Diskover Community Edition is an open source file system indexing and storage analytics platform designed to help organizations understand and manage large volumes of file data. It crawls file systems and indexes metadata using Elasticsearch, enabling fast search, analysis, and organization of files stored across different storage systems. It allows administrators and users to explore file structures, monitor storage usage, and gain insights into how data is distributed across infrastructure. By indexing file metadata from sources such as local file systems, network shares like NFS and SMB, and cloud storage, the tool provides a centralized way to analyze heterogeneous storage environments. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Bili23 Downloader

    Bili23 Downloader

    Cross platform GUI tool for downloading videos from Bilibili sites

    ...It also allows users to choose video resolution, audio quality, and encoding format based on the available sources. Additional features include downloading subtitles, comments, metadata, and artwork associated with videos.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 6
    Web Scrobbler

    Web Scrobbler

    Scrobble music all around the web!

    ...Just open any of the websites supported (e.g. Spotify) and start listening to music. Scrobble your music to Last.fm, Libre.fm, and ListenBrainz scrobbling services. The metadata filter cleans track info up and removes redundant track suffixes. Notifications of now playing tracks with extended track info (a website name, a track play count). We currently support scrobbling to 280 different websites with more being added with each release.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    Weibo Crawler

    Weibo Crawler

    Python crawler for collecting and downloading Sina Weibo user data

    ...It automates the process of gathering posts, user profile details, and engagement metrics from one or more target accounts. weibo-crawler can extract comprehensive information about users, including profile attributes such as nickname, follower count, following count, and account metadata. It also captures detailed data about each post, including the content, publishing time, topics, mentions, likes, reposts, and comments. In addition to textual data, the project can download original media from posts, such as images, videos, and Live Photo content. Collected data can be exported to structured formats such as CSV or JSON or stored in databases for further analysis and research. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8

    seaart-save-metadata

    Chrome extention to save metadata of works on seart.ai in XMP in files

    (modified 2025-12-20) v1.1.0 Now it also works on `postDetail` pages. Chrome extention to save metadata of works on seart.ai in XMP and embed them into files. Works on /explore/detail pages and /postDetail/ pages. Does NOT work to videos. Filenames are automatically sanitized. It saves: prompt, model, checkpoint, lora, creator, source, etc. of each work. Press a button to start fetching metadata and download a work on a target page
    Downloads: 7 This Week
    Last Update:
    See Project
  • 9
    AWS CDI SDK

    AWS CDI SDK

    AWS Cloud Digital Interface (CDI) SDK

    ...The SDK takes care of the complex task of ensuring that large amounts of data are seamlessly transferred across the AWS network using multi-path technology to ensure lossless quality without the need for extensive network buffering and delay. The SDK consists of two main API layers, depending on the desired level of functionality, and a common core layer. The CDI Audio, Video and Metadata (CDI-AVM) layer takes care of transferring multi-stream, modern video signals that contain video, audio and metadata. The CDI Raw Payload (CDI-RAW) layer takes care of transferring raw generic payloads. The CDI Core (CDI-CORE) layer takes care of common core functionality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    GeoNetwork opensource - Metadata Catalog
    A web based Metadata Catalog Server for data description and discovery. Supports both generic and geospatial data discovery. It implements international standards (e.g. ISO19115/19139/19119, ISO19115-3, DCAT-AP, CSW 2.0, OGC API Records). It originates from the United Nations and is used by many governments as geoportal software. Active development and discussion takes place on GitHub and OSGeo Discourse.
    Leader badge
    Downloads: 191 This Week
    Last Update:
    See Project
  • 11
    AWS Node Termination Handler

    AWS Node Termination Handler

    Gracefully handle EC2 instance shutdown within Kubernetes

    This project ensures that the Kubernetes control plane responds appropriately to events that can cause your EC2 instance to become unavailable, such as EC2 maintenance events, EC2 Spot interruptions, ASG Scale-In, ASG AZ Rebalance, and EC2 Instance Termination via the API or Console. If not handled, your application code may not stop gracefully, take longer to recover full availability, or accidentally schedule work to nodes that are going down. Secure and resizable compute capacity for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Umbraco CMS

    Umbraco CMS

    Umbraco is a free and open source .NET content management system

    An open-source content management system built on ASP.NET, empowering developers to create and manage websites with flexibility.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    newspaper4k

    newspaper4k

    Python library for scraping and analyzing online news articles easily

    ...It is a continuation and active fork of the original newspaper3k library, which had stopped receiving updates, with the goal of keeping the ecosystem maintained while adding improvements and bug fixes. It provides developers with tools to automatically download web pages, extract the main article content, and collect associated metadata such as titles, authors, images, and publication dates. Newspaper4k also includes natural language processing capabilities that can generate summaries and identify keywords from extracted article text. Newspaper4k supports both single-article extraction and full news site processing, allowing users to build sources representing entire publications and iterate through their articles. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Social-Analyzer

    Social-Analyzer

    API, CLI, and Web App for analyzing and finding a person's profile

    Social Analyzer is an open source OSINT tool that helps investigators discover and analyze a person’s presence across a very large number of social media platforms. It provides a unified API, CLI, and web interface capable of scanning hundreds or thousands of sites for username matches and related metadata. The project includes modular detection and analysis components that users can enable depending on their investigative needs. It is commonly used in cybersecurity, digital forensics, and reconnaissance workflows where identity correlation is required. Social Analyzer emphasizes flexibility, allowing integration into automated pipelines or manual investigations. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 15
    Nacos

    Nacos

    Dynamic Naming and Configuration Service

    Nacos is an easy-to-use, one-stop solution for dynamic service discovery, configuration and service management that allows you to easily build cloud native applications and microservices platforms. It supports almost all types of services, such as Kubernetes service, Spring Cloud RESTFul service, or Dubbo/gRPC service. Nacos is lightweight, easy to deploy and production-ready, having originated from time-tested internal products from Alibaba Group. It’s highly adaptive to cloud...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    news-please

    news-please

    Python tool for crawling and extracting structured data from news site

    news-please is an open source news crawler and information extraction tool designed to collect and structure articles from online news websites. It provides an integrated pipeline that crawls news sites, retrieves article pages, and extracts structured information such as headlines, authors, publication dates, and article text. news-please can recursively follow internal links and read RSS feeds to gather both recent and archived articles from a news outlet when given only the root URL of a...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    owllook

    owllook

    Vertical novel search engine with unified reading and tracking tools

    ...It focuses on providing a simple and comfortable reading experience with features such as searching for books, following updates, bookmarking chapters, and maintaining a personal bookshelf. It aggregates results from multiple search engines and applies parsing rules to extract novel metadata, chapters, and content in a consistent format. Owllook also includes functionality for tracking reading history, displaying rankings based on search activity, and recommending books using a similarity-based approach. Owllook is built using asynchronous technologies to support efficient data retrieval and responsive interactions while reading or searching.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    CloudEvents

    CloudEvents

    CloudEvents Specification

    Events are everywhere. However, event producers tend to describe events differently. The lack of a common way of describing events means developers must constantly re-learn how to consume events. This also limits the potential for libraries, tooling and infrastructure to aide the delivery of event data across environments, like SDKs, event routers or tracing systems. The portability and productivity we can achieve from event data is hindered overall. CloudEvents is a specification for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Jekyll Admin

    Jekyll Admin

    Jekyll plugin that provides users with a traditional CMS interface

    ...A Ruby-based HTTP API that handles Jekyll and filesystem operations, and a Javascript-based front end, built on that API. The plugin allows you to customize the name of a collection that is displayed in the sidebar by defining it in the collection's metadata in the config file. For example, if your source's posts are actually news-items on the deployed site, then it can be distracting to see the label Posts in the admin's sidebar.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Athens

    Athens

    A Go module datastore and proxy

    ...Athens is a project building on top of vgo (or go1.11+) trying to bring dependencies closer to you so you can count on repeatable builds even at a time when VCS is down. The big goal of Athens is to provide a new place where dependencies — not code — live. Dependencies are immutable blobs of code and associated metadata that come from Github. They live in storage that Athens controls. You probably already know what “immutable” means, but let me just point it out again because it’s really important for this whole system. When folks change their packages, iterate, experiment, or whatever else, code on Athens won’t change.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Trafilatura

    Trafilatura

    Python & command-line tool to gather text on the Web

    Trafilatura is a Python package and command-line tool designed to gather text on the Web. It includes discovery, extraction and text-processing components. Its main applications are web crawling, downloads, scraping, and extraction of main texts, metadata and comments. It aims at staying handy and modular: no database is required, the output can be converted to various commonly used formats. Going from raw HTML to essential parts can alleviate many problems related to text quality, first by avoiding the noise caused by recurring elements (headers, footers, links/blogroll etc.) and second by including information such as author and date in order to make sense of the data. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    videodl

    videodl

    Lightweight Python tool for downloading videos from many platforms

    Videodl is a lightweight video downloader implemented entirely in Python that allows users to retrieve videos from a wide range of online media platforms. It focuses on providing a fast and simple way to parse video pages and download media files, often prioritizing high-definition versions without watermarks when available. It supports numerous video platforms across both Chinese and international streaming ecosystems, enabling users to fetch content from many popular services through a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Interactsh

    Interactsh

    An OOB interaction gathering server and client library

    Interactsh is an open-source tool for detecting out-of-band interactions. It is a tool designed to detect vulnerabilities that cause external interactions. Interactsh Cli client requires go1.17+ to install successfully. interactsh-client with -sf, -session-file flag can be used store/read the current session information from user defined file which is useful to resume the same session to poll the interactions even after the client gets stopped or closed. Running the interactsh-client in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Middleman

    Middleman

    Hand-crafted frontend development

    Middleman is a Ruby-powered static site generator that emphasizes clean templates, flexible layouts, and a great developer experience. It supports common templating languages (ERB, Haml, Slim) and content formats (Markdown, AsciiDoc), plus front-matter metadata to drive layouts, navigation, and per-page behavior. A built-in dev server with live reload keeps iteration fast, while an asset pipeline handles fingerprinting, minification, and cache-friendly builds. Extensions cover blogging, internationalization, sitemaps, data-driven pages, and external build steps, so complex sites remain manageable without server-side code. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Coraza

    Coraza

    OWASP Coraza WAF is a golang modsecurity compatible firewall library

    ...CRS protects from many common attack categories including: SQL Injection (SQLi), Cross Site Scripting (XSS), PHP & Java Code Injection, HTTPoxy, Shellshock, Scripting/Scanner/Bot Detection & Metadata & Error Leakages. Coraza is a library at its core, with many integrations to deploy on-premise Web Application Firewall instances.
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB