metadata free download

87 projects for "metadata" with 2 filters applied:

Internet BSD Clear Filters & Widen Search

$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
1

MDCx

Movie metadata scraper and organizer for media libraries and NFO

MDCx is an open source media metadata scraping and organization tool designed to automate the process of collecting detailed information for movie files. It retrieves metadata from multiple online sources and applies it to local media collections, helping users maintain structured and well-organized libraries. MDCx can download information such as titles, cast data, artwork, and other metadata, then generate standardized NFO files compatible with media management systems. ...

Downloads: 7 This Week

Last Update: 2026-03-10
See Project
2

JavDB

javdb official app

...It includes distribution options through various channels such as direct downloads, third-party installers, and release builds, making it accessible across different user environments. The application is relatively lightweight and centered around providing quick access to indexed content, metadata, and media listings available on the platform. Its development is maintained through a GitHub repository with ongoing releases and updates, indicating active iteration and refinement.

Downloads: 658 This Week

Last Update: 2026-03-23
See Project
3

diskover-community

Open source file indexing & storage analytics powered by Elasticsearch

Diskover Community Edition is an open source file system indexing and storage analytics platform designed to help organizations understand and manage large volumes of file data. It crawls file systems and indexes metadata using Elasticsearch, enabling fast search, analysis, and organization of files stored across different storage systems. It allows administrators and users to explore file structures, monitor storage usage, and gain insights into how data is distributed across infrastructure. By indexing file metadata from sources such as local file systems, network shares like NFS and SMB, and cloud storage, the tool provides a centralized way to analyze heterogeneous storage environments. ...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
4

Bili23 Downloader

Cross platform GUI tool for downloading videos from Bilibili sites

...It also allows users to choose video resolution, audio quality, and encoding format based on the available sources. Additional features include downloading subtitles, comments, metadata, and artwork associated with videos.

Downloads: 3 This Week

Last Update: 2026-03-10
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

GeoNetwork opensource - Metadata Catalog

A web based Metadata Catalog Server for data description and discovery. Supports both generic and geospatial data discovery. It implements international standards (e.g. ISO19115/19139/19119, ISO19115-3, DCAT-AP, CSW 2.0, OGC API Records). It originates from the United Nations and is used by many governments as geoportal software. Active development and discussion takes place on GitHub and OSGeo Discourse.

Downloads: 188 This Week

Last Update: 2025-10-07
See Project
6

Weibo Crawler

Python crawler for collecting and downloading Sina Weibo user data

...It automates the process of gathering posts, user profile details, and engagement metrics from one or more target accounts. weibo-crawler can extract comprehensive information about users, including profile attributes such as nickname, follower count, following count, and account metadata. It also captures detailed data about each post, including the content, publishing time, topics, mentions, likes, reposts, and comments. In addition to textual data, the project can download original media from posts, such as images, videos, and Live Photo content. Collected data can be exported to structured formats such as CSV or JSON or stored in databases for further analysis and research. ...

Downloads: 1 This Week

Last Update: 5 days ago
See Project
7

newspaper4k

Python library for scraping and analyzing online news articles easily

...It is a continuation and active fork of the original newspaper3k library, which had stopped receiving updates, with the goal of keeping the ecosystem maintained while adding improvements and bug fixes. It provides developers with tools to automatically download web pages, extract the main article content, and collect associated metadata such as titles, authors, images, and publication dates. Newspaper4k also includes natural language processing capabilities that can generate summaries and identify keywords from extracted article text. Newspaper4k supports both single-article extraction and full news site processing, allowing users to build sources representing entire publications and iterate through their articles. ...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
8

videodl

Lightweight Python tool for downloading videos from many platforms

Videodl is a lightweight video downloader implemented entirely in Python that allows users to retrieve videos from a wide range of online media platforms. It focuses on providing a fast and simple way to parse video pages and download media files, often prioritizing high-definition versions without watermarks when available. It supports numerous video platforms across both Chinese and international streaming ecosystems, enabling users to fetch content from many popular services through a...

Downloads: 10 This Week

Last Update: 3 days ago
See Project
9

Coraza

OWASP Coraza WAF is a golang modsecurity compatible firewall library

...CRS protects from many common attack categories including: SQL Injection (SQLi), Cross Site Scripting (XSS), PHP & Java Code Injection, HTTPoxy, Shellshock, Scripting/Scanner/Bot Detection & Metadata & Error Leakages. Coraza is a library at its core, with many integrations to deploy on-premise Web Application Firewall instances.

Downloads: 3 This Week

Last Update: 5 days ago
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
10

owllook

Vertical novel search engine with unified reading and tracking tools

...It focuses on providing a simple and comfortable reading experience with features such as searching for books, following updates, bookmarking chapters, and maintaining a personal bookshelf. It aggregates results from multiple search engines and applies parsing rules to extract novel metadata, chapters, and content in a consistent format. Owllook also includes functionality for tracking reading history, displaying rankings based on search activity, and recommending books using a similarity-based approach. Owllook is built using asynchronous technologies to support efficient data retrieval and responsive interactions while reading or searching.

Downloads: 0 This Week

Last Update: 5 days ago
See Project
11

news-please

Python tool for crawling and extracting structured data from news site

news-please is an open source news crawler and information extraction tool designed to collect and structure articles from online news websites. It provides an integrated pipeline that crawls news sites, retrieves article pages, and extracts structured information such as headlines, authors, publication dates, and article text. news-please can recursively follow internal links and read RSS feeds to gather both recent and archived articles from a news outlet when given only the root URL of a...

Downloads: 0 This Week

Last Update: 5 days ago
See Project
12

dude uncomplicated data extraction

dude uncomplicated data extraction: A simple framework

Dude is a very simple framework for writing web scrapers using Python decorators. The design, inspired by Flask, was to easily build a web scraper in just a few lines of code. Dude has an easy-to-learn syntax. Dude is currently in Pre-Alpha. Please expect breaking changes. You can run your scraper from terminal/shell/command-line by supplying URLs, the output filename of your choice and the paths to your python scripts to dude scrape command.

Downloads: 0 This Week

Last Update: 2024-03-02
See Project
13

LogicalDOC Document Management - DMS

smart and open source document management system

LogicalDOC is both document management and collaboration system. The software is loaded with many functions and allows organizing, index, retrieving, controlling and distributing important business documents securely and safely for any organization and individual. Gone are the days when companies used paper-based processes such as printing, mailing and manual filing of paper documents; our document management system replaces all of this with electronic procedures that allow your...

36 Reviews

Downloads: 238 This Week

Last Update: 2025-08-11
See Project
14

twitch-batch-downloader

Automate the download of entire Twitch.tv channels

Automate the download of entire Twitch.tv channels with its metadata. Save each Twitch video into its own folder, with date and time values, video ID, stream metadata, frame screenshot, .ts parts list and sha256 hash. Keep the original ts files and generate mp4 files from them. It requires a shell and some command line utilities. See README.md for details in the Code/git section.

Downloads: 9 This Week

Last Update: 1 day ago
See Project
15

ResCarta

Archive your personal history

ResCarta Toolkit offers an open source solution to creating, storing, viewing, and searching digital collections. Applications in the toolkit let users create and edit metadata, convert data to open standard ResCarta format, index and host collections.

4 Reviews

Downloads: 25 This Week

Last Update: 2025-01-27
See Project
16

MGB OpenSource Guestbook

MGB is a free OpenSource Guestbook completely written in PHP, using JavaScript and a MySQL Database. Easy to use, flexible and customizable with templates to make it fit 100% to your homepage.

Downloads: 11 This Week

Last Update: 2026-02-20
See Project
17

instagram-profilecrawl

Instagram profile crawler that extracts posts, tags, and stats

instagram-profilecrawl is a Python-based automation script designed to collect publicly available information from Instagram profiles. It crawls profile data such as follower counts, post information, hashtags, and other engagement-related metadata. It operates by automating a web browser using Selenium and performing requests to gather structured information from the platform. instagram-profilecrawl can analyze multiple usernames in a single run and store the extracted information locally in structured formats such as JSON. The collected data can include profile metadata, post details, engagement metrics, and commenter activity, allowing users to analyze account behavior or monitor profile growth over time. ...

Downloads: 6 This Week

Last Update: 4 days ago
See Project
18

Zero Install

Zero Install is a decentralised cross-distribution software installation system. Create one package that works everywhere! With dependency handling and automatic updates, full support for shared libraries, and integration with native package managers

24 Reviews

Downloads: 50,062 This Week

Last Update: 2021-02-17
See Project
19

RED HAWK

All-in-one reconnaissance and vulnerability scanning toolkit for sites

...RED HAWK includes utilities for performing DNS lookups, port scans, subdomain discovery, and reverse IP analysis, giving users a comprehensive view of a target environment. In addition to vulnerability detection, RED HAWK offers crawling features that gather links and metadata from websites to support deeper reconnaissance.

Downloads: 0 This Week

Last Update: 5 days ago
See Project
20

MyCoRe

your repository framework

MyCoRe is an Open Source project for the development of Repositories, Digital Library and archive solutions. The technical base of the system is formed of Java class libraries, XML technology and different database backends. Since 2015 we use https://mycore.atlassian.net/ for bug tracking. Please use our ticket system there.

Downloads: 0 This Week

Last Update: 2020-02-18
See Project
21

panFMP

panFMP is a generic framework suitable for harvested XML metadata that is searchable through Apache Lucene without any additional RDBMS. Fields can be defined by XPath allowing for full text queries on all types of fields including numerical ranges. The code was moved to Github: https://github.com/pangaea-data-publisher/panfmp

Downloads: 0 This Week

Last Update: 2019-05-01
See Project
22

pxer

Pixiv crawler userscript for downloading artwork and galleries easily

...It is implemented primarily in client-side JavaScript and runs directly in the browser through a userscript environment, allowing it to integrate seamlessly with Pixiv pages. Pxer provides functionality to crawl and gather images, artwork metadata, and other related content from supported Pixiv pages. It is designed to be accessible even for users who are not developers, emphasizing ease of use and quick setup through browser extensions such as Tampermonkey. Once installed, Pxer adds controls to compatible Pixiv pages so users can trigger batch retrieval of illustrations and related assets. ...

Downloads: 4 This Week

Last Update: 2026-03-11
See Project
23

KORA

...GO DOWNLOAD KORA 3: https://github.com/matrix-msu/Kora3/releases yours truly, Kora Development Team KORA is a database driven, online digital repository that allows institutions to ingest, manage, and deliver digital objects and their corresponding metadata.

Downloads: 0 This Week

Last Update: 2018-12-04
See Project
24

OpenSearchServer Search Engine

An open source search engine with RESTFul API and crawlers

OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on...

31 Reviews

Downloads: 3 This Week

Last Update: 2018-08-26
See Project
25

WeChatSogou

Python library to crawl and retrieve data from WeChat accounts

WechatSogou is an open source Python library designed to retrieve data from WeChat official accounts by using the Sogou WeChat search service as its data source. It provides developers with a programmatic way to search for public accounts and collect article information without manually browsing the search interface. It functions as a crawler interface that sends requests to the search engine, retrieves results, and converts the returned pages into structured data that can be used in...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project