Search Results for "data scraper website" - Page 2

Showing 545 open source projects for "data scraper website"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    LaTeX.CSS

    LaTeX.CSS

    LaTeX.css is a library that makes your website look like a LaTeX doc

    This almost class-less CSS library turns your HTML document into a website that looks like a LATEX document. Write semantic HTML, and you are good to go. The source code can be found on GitHub. LaTeX.css is a minimal, almost class-less CSS library that makes any website look like a LaTeX document. Add any optional classes to elements with special styles (author subtitle, abstract, lemmas, theorems, etc.). The labels of theorems, definitions, lemmas and proofs can be changed to other...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Jikan REST

    Jikan REST

    The REST API for Jikan

    Jikan REST is an unofficial RESTful API for MyAnimeList.net, providing access to anime, manga, and user data by scraping the website. It allows developers to integrate MyAnimeList data into their applications without relying on the official API. ​
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Skyvern

    Skyvern

    Automate browser-based workflows with LLMs and Computer Vision

    ...Skyvern understands how to solve CAPTCHAs to complete complicated workflows. Support for authenticating into user accounts, including support for 2FA/TOTP. Extract data from workflows in any schema of your choice including CSV or JSON. Automate procurement pipelines, breeze through government forms, and complete workflows in any language.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 4
    workflowr

    workflowr

    Organize your project into a research website

    workflowr is an R package that helps researchers organize, version, and share their data science projects in a reproducible and transparent manner. It combines R Markdown, Git, and a structured file system to create a research website that tracks analysis, results, and code changes over time. It’s ideal for academic and collaborative research workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    OSCAL

    OSCAL

    Open Security Controls Assessment Language (OSCAL)

    ...Public contributions to this project are welcome. With this effort, we are stressing the agile development of a set of minimal formats that are generic enough to capture the breadth of data in scope (controls specifications), while also capable of ad-hoc tuning and extension to support peculiarities of both (industry or sector) standards and new control types. The OSCAL website provides an overview of the OSCAL project, including an XML and JSON schema reference, examples, and other resources.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 6
    RuoYi

    RuoYi

    The warehouse's SpringBoot-based rights management system

    The warehouse's SpringBoot-based rights management system is easy to read and understand, and the interface is simple and beautiful. The core technology uses Spring, MyBatis, and Shiro without any other heavy dependencies. I have always wanted to make a background management system, and I have seen many excellent open source projects but found no suitable ones. So I started to write a background system in my spare time. So there is Zoe. She can be used for all web applications, such as...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    Parsera

    Parsera

    Lightweight library for scraping web-sites with LLMs

    Scrape data from any website with only a link and column descriptions. Parsera is a tool designed to scrape web content, specifically handling poorly structured or messy websites.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    SkyCrypt

    SkyCrypt

    A Hypixel skyblock stats website

    SkyCrypt is a web-based application that allows players of Hypixel SkyBlock to view and share detailed information about their in-game profiles through a visually rich interface. It aggregates data from the Hypixel API and presents it in an organized format, including player statistics, skills, equipment, and inventory details. The project is built with a Node.js-based stack and integrates additional technologies such as MongoDB and Redis to handle data storage and caching. SkyCrypt enhances...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 9
    MDCx

    MDCx

    Movie metadata scraper and organizer for media libraries and NFO

    MDCx is an open source media metadata scraping and organization tool designed to automate the process of collecting detailed information for movie files. It retrieves metadata from multiple online sources and applies it to local media collections, helping users maintain structured and well-organized libraries. MDCx can download information such as titles, cast data, artwork, and other metadata, then generate standardized NFO files compatible with media management systems. It also supports...
    Downloads: 6 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    Publii

    Publii

    Publii is a desktop-based CMS for Windows, Mac and Linux

    Publii is a powerful blogging app perfect for anyone looking to create a privacy-focused website. Whether you're a beginner or a developer, it has all the tools you need to get started. Publii is a static site generator that makes it easy to create a personal blog, portfolio, or corporate website. With instant site switching and no databases or other credentials to remember, Publii is the perfect platform for anyone who wants a hassle-free way to build and manage an online presence. Websites...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    101-0250-00

    101-0250-00

    ETH course - Solving PDEs in parallel on GPUs

    This course aims to cover state-of-the-art methods in modern parallel Graphical Processing Unit (GPU) computing, supercomputing and code development with applications to natural sciences and engineering.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PlutoSliderServer.jl

    PlutoSliderServer.jl

    Web server to run just the `@bind` parts of a Pluto.jl notebook

    Web server to run just the @bind parts of a Pluto.jl notebook. PlutoSliderServer can run a notebook and generate the export HTML file. This will give you the same file as the export button inside Pluto (top right), but automatically, without opening a browser. One use case is to automatically create a GitHub Pages site from a repository with notebooks. For this, take a look at our template repository that used GitHub Actions and PlutoSliderServer to generate a website on every commit. Many...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    GeoServer

    GeoServer

    GeoServer repository

    GeoServer is an open-source software server written in Java that allows users to share and edit geospatial data. Designed for interoperability, it publishes data from any major spatial data source using open standards. Being a community-driven project, GeoServer is developed, tested, and supported by a diverse group of individuals and organizations from around the world. GeoServer is the reference implementation of the Open Geospatial Consortium (OGC) Web Feature Service (WFS) and Web...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 14
    DuckDB

    DuckDB

    DuckDB is an in-process SQL OLAP Database Management System

    ...DuckDB supports arbitrary and nested correlated subqueries, window functions, collations, complex types (arrays, structs), and more. For more information on the goals of DuckDB, please refer to the Why DuckDB page on our website. Processing and storing tabular datasets, e.g. from CSV or Parquet files. Interactive data analysis, e.g. Joining & aggregate multiple large tables. Concurrent large changes, to multiple large tables, e.g. appending rows, adding/removing/updating columns. Large result set transfer to client. For development, DuckDB requires CMake, Python3 and a C++11 compliant compiler. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    I Still Don't Care About Cookies

    I Still Don't Care About Cookies

    Debloated fork of the extension "I don't care about cookies"

    Debloated fork of the extension "I don't care about cookies". Get rid of cookie warnings from almost all websites! This extension has been acquired by Avast and simply I don't trust Avast with my data. Also having it on Github allows us to improve the code & add support for websites faster. The EU regulations require that any website using tracking cookies must get user's permission before installing them. These warnings appear on most websites until the visitor agrees with the website's terms and conditions. Imagine how irritating that becomes when you surf anonymously or if you delete cookies automatically every time you close the browser. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    TURF

    TURF

    A modular geospatial engine written in JavaScript

    Turf is a JavaScript library for spatial analysis. It includes traditional spatial operations, helper functions for creating GeoJSON data, and data classification and statistics tools. Turf can be added to your website as a client-side plugin, or you can run Turf server-side with Node.js. Modular, simple-to-understand JavaScript functions that speak GeoJSON. Turf is a collection of small modules, you only need to take what you want to use. Takes advantage of the newest algorithms and doesn't require you to send data to a server. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Astro

    Astro

    The web framework for content-driven websites

    Astro powers the world's fastest marketing sites, blogs, e-commerce websites, and more. Astro improves website performance by rendering components on the server, sending lightweight HTML to the browser with zero unnecessary JavaScript overhead. Astro was designed to work with your content, no matter where it lives. Load data from your file system, external API, or your favorite CMS. Extend Astro with your favorite tools. Bring your own JavaScript UI components, CSS libraries, themes, integrations, and more. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 18
    ZetaJS

    ZetaJS

    JS wrapper for ZetaOffice in the browser

    The zeta.js library provides the facilities to run an instance of ZetaOffice integrated into your web site, allowing you to control it with JavaScript code via the LibreOffice UNO technology. Use cases range from an in-browser office suite that looks and feels just like its desktop counterpart, to fine-tuned custom text editing and spreadsheet capabilities embedded in your website, to a headless zetajs instance that does document conversion in the background.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Logseq

    Logseq

    A privacy-first, open-source platform for knowledge management

    ...Logseq is a platform for knowledge management and collaboration. It focuses on privacy, longevity, and user control. The server will never store or analyze your private notes. Your data are plain text files and we currently support both Markdown and Emacs Org-mode (more to be added soon). In the unlikely event that the website is down or cannot be maintained, your data is, and will always be yours. No data lock-in, no proprietary formats, you can edit the same Markdown/Org-mode file with any tools at the same time. ...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 20
    finvizfinance

    finvizfinance

    Finviz analysis python library

    finvizfinance is a package that collects financial information from FinViz website. Stock charts, fundamental & technical information, insider information and stock news. Forex charts and performance. Crypto charts and performance. Screener and Group provide data frames for comparing stocks according to different filters and trading signals. Getting information (fundament, description, outer rating, stock news, inside trader) of an individual stock.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 21
    Surmon.me

    Surmon.me

    Personal website and blog

    Surmon.me is a full-featured personal website and blog platform built with Vue and designed as part of a larger ecosystem of interconnected applications and services. The project functions as a server-side rendered (SSR) web application that delivers content dynamically while maintaining performance and SEO optimization. It is powered by a dedicated backend service called NodePress, which provides RESTful APIs for content management, data retrieval, and system operations. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 22
    Fidus Writer

    Fidus Writer

    Fidus Writer is an online collaborative editor for academics

    Fidus Writer is an online collaborative editor especially made for academics who need to use citations and/or formulas. The editor focuses on the content rather than the layout, so that with the same text, you can later on publish it in multiple ways: On a website, as a printed book, or as an ebook. In each case, you can choose from a number of layouts that are adequate for the medium of choice.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Zotonic

    Zotonic

    The Erlang Web Framework & CMS

    Zotonic is the content management system and framework for semantic data and high-connectivity websites. Your data outlives your website. Grow your data and share your data. The Zotonic content management system excels in managing structured data from multiple sources. It brings structure to your documents and makes semantic data manageable for normal humans. Real-time connectivity is deeply integrated using the Internet of Things publish/subscribe protocol MQTT. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    Bruno

    Bruno

    Opensource IDE For Exploring and Testing Api's

    Bruno is a Fast and Git-Friendly Opensource API client, aimed at revolutionizing the status quo represented by Postman, Insomnia, and similar tools out there. Bruno stores your collections directly in a folder on your filesystem. We use a plain text markup language, Bru, to save information about API requests. You can use git or any version control of your choice to collaborate over your API collections. Bruno is offline only. There are no plans to add cloud-sync to Bruno, ever. We value...
    Downloads: 61 This Week
    Last Update:
    See Project
  • 25
    Checkmate

    Checkmate

    Checkmate is an open-source, self-hosted tool

    Checkmate is an open-source, self-hosted infrastructure monitoring platform that provides real-time visibility into server health, uptime, response times, and incident activity through a modern web interface. The application continuously checks whether websites and services are accessible and performing optimally, generating alerts and reports when availability or performance degrades. It supports detailed infrastructure monitoring through an optional agent called Capture, which collects...
    Downloads: 2 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB