Search Results for "data scraper website" - Page 2

Showing 474 open source projects for "data scraper website"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 1
    SkyCrypt

    SkyCrypt

    A Hypixel skyblock stats website

    SkyCrypt is a web-based application that allows players of Hypixel SkyBlock to view and share detailed information about their in-game profiles through a visually rich interface. It aggregates data from the Hypixel API and presents it in an organized format, including player statistics, skills, equipment, and inventory details. The project is built with a Node.js-based stack and integrates additional technologies such as MongoDB and Redis to handle data storage and caching. SkyCrypt enhances...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 2
    crwlr

    crwlr

    Library for Rapid (Web) Crawler and Scraper Development

    This library provides kind of a framework and a lot of ready-to-use, so-called steps, that you can use as building blocks, to build your own crawlers and scrapers with. Before diving into the library, let's have a look at the terms crawling and scraping. For most real-world use cases, those two things go hand in hand, which is why this library helps with and combines both. A (web) crawler is a program that (down)loads documents and follows the links in it to load them as well. A crawler...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Skyvern

    Skyvern

    Automate browser-based workflows with LLMs and Computer Vision

    ...Skyvern understands how to solve CAPTCHAs to complete complicated workflows. Support for authenticating into user accounts, including support for 2FA/TOTP. Extract data from workflows in any schema of your choice including CSV or JSON. Automate procurement pipelines, breeze through government forms, and complete workflows in any language.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Parsera

    Parsera

    Lightweight library for scraping web-sites with LLMs

    Scrape data from any website with only a link and column descriptions. Parsera is a tool designed to scrape web content, specifically handling poorly structured or messy websites.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    Jikan REST

    Jikan REST

    The REST API for Jikan

    Jikan REST is an unofficial RESTful API for MyAnimeList.net, providing access to anime, manga, and user data by scraping the website. It allows developers to integrate MyAnimeList data into their applications without relying on the official API. ​
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Publii

    Publii

    Publii is a desktop-based CMS for Windows, Mac and Linux

    Publii is a powerful blogging app perfect for anyone looking to create a privacy-focused website. Whether you're a beginner or a developer, it has all the tools you need to get started. Publii is a static site generator that makes it easy to create a personal blog, portfolio, or corporate website. With instant site switching and no databases or other credentials to remember, Publii is the perfect platform for anyone who wants a hassle-free way to build and manage an online presence. Websites...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    RuoYi

    RuoYi

    The warehouse's SpringBoot-based rights management system

    The warehouse's SpringBoot-based rights management system is easy to read and understand, and the interface is simple and beautiful. The core technology uses Spring, MyBatis, and Shiro without any other heavy dependencies. I have always wanted to make a background management system, and I have seen many excellent open source projects but found no suitable ones. So I started to write a background system in my spare time. So there is Zoe. She can be used for all web applications, such as...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    MDCx

    MDCx

    Movie metadata scraper and organizer for media libraries and NFO

    MDCx is an open source media metadata scraping and organization tool designed to automate the process of collecting detailed information for movie files. It retrieves metadata from multiple online sources and applies it to local media collections, helping users maintain structured and well-organized libraries. MDCx can download information such as titles, cast data, artwork, and other metadata, then generate standardized NFO files compatible with media management systems. It also supports...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    DuckDB

    DuckDB

    DuckDB is an in-process SQL OLAP Database Management System

    ...DuckDB supports arbitrary and nested correlated subqueries, window functions, collations, complex types (arrays, structs), and more. For more information on the goals of DuckDB, please refer to the Why DuckDB page on our website. Processing and storing tabular datasets, e.g. from CSV or Parquet files. Interactive data analysis, e.g. Joining & aggregate multiple large tables. Concurrent large changes, to multiple large tables, e.g. appending rows, adding/removing/updating columns. Large result set transfer to client. For development, DuckDB requires CMake, Python3 and a C++11 compliant compiler. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 10
    101-0250-00

    101-0250-00

    ETH course - Solving PDEs in parallel on GPUs

    This course aims to cover state-of-the-art methods in modern parallel Graphical Processing Unit (GPU) computing, supercomputing and code development with applications to natural sciences and engineering.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    GeoServer

    GeoServer

    GeoServer repository

    GeoServer is an open-source software server written in Java that allows users to share and edit geospatial data. Designed for interoperability, it publishes data from any major spatial data source using open standards. Being a community-driven project, GeoServer is developed, tested, and supported by a diverse group of individuals and organizations from around the world. GeoServer is the reference implementation of the Open Geospatial Consortium (OGC) Web Feature Service (WFS) and Web...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 12
    Astro

    Astro

    The web framework for content-driven websites

    Astro powers the world's fastest marketing sites, blogs, e-commerce websites, and more. Astro improves website performance by rendering components on the server, sending lightweight HTML to the browser with zero unnecessary JavaScript overhead. Astro was designed to work with your content, no matter where it lives. Load data from your file system, external API, or your favorite CMS. Extend Astro with your favorite tools. Bring your own JavaScript UI components, CSS libraries, themes, integrations, and more. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 13
    PlutoSliderServer.jl

    PlutoSliderServer.jl

    Web server to run just the `@bind` parts of a Pluto.jl notebook

    Web server to run just the @bind parts of a Pluto.jl notebook. PlutoSliderServer can run a notebook and generate the export HTML file. This will give you the same file as the export button inside Pluto (top right), but automatically, without opening a browser. One use case is to automatically create a GitHub Pages site from a repository with notebooks. For this, take a look at our template repository that used GitHub Actions and PlutoSliderServer to generate a website on every commit. Many...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    I Still Don't Care About Cookies

    I Still Don't Care About Cookies

    Debloated fork of the extension "I don't care about cookies"

    Debloated fork of the extension "I don't care about cookies". Get rid of cookie warnings from almost all websites! This extension has been acquired by Avast and simply I don't trust Avast with my data. Also having it on Github allows us to improve the code & add support for websites faster. The EU regulations require that any website using tracking cookies must get user's permission before installing them. These warnings appear on most websites until the visitor agrees with the website's terms and conditions. Imagine how irritating that becomes when you surf anonymously or if you delete cookies automatically every time you close the browser. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Datasette

    Datasette

    An open source multi-tool for exploring and publishing data

    Datasette is a tool for exploring and publishing data. It helps people take data of any shape or size, analyze and explore it, and publish it as an interactive website and accompanying API. Datasette is aimed at data journalists, museum curators, archivists, local governments, scientists, researchers and anyone else who has data that they wish to share with the world. It is part of a wider ecosystem of tools and plugins dedicated to making working with structured data as productive as possible. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Surmon.me

    Surmon.me

    Personal website and blog

    Surmon.me is a full-featured personal website and blog platform built with Vue and designed as part of a larger ecosystem of interconnected applications and services. The project functions as a server-side rendered (SSR) web application that delivers content dynamically while maintaining performance and SEO optimization. It is powered by a dedicated backend service called NodePress, which provides RESTful APIs for content management, data retrieval, and system operations. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 17
    finvizfinance

    finvizfinance

    Finviz analysis python library

    finvizfinance is a package that collects financial information from FinViz website. Stock charts, fundamental & technical information, insider information and stock news. Forex charts and performance. Crypto charts and performance. Screener and Group provide data frames for comparing stocks according to different filters and trading signals. Getting information (fundament, description, outer rating, stock news, inside trader) of an individual stock.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 18
    NVIDIA Merlin

    NVIDIA Merlin

    Library providing end-to-end GPU-accelerated recommender systems

    ...For more information, see NVIDIA Merlin on the NVIDIA developer website. Transform data (ETL) for preprocessing and engineering features. Accelerate your existing training pipelines in TensorFlow, PyTorch, or FastAI by leveraging optimized, custom-built data loaders. Scale large deep learning recommender models by distributing large embedding tables that exceed available GPU and CPU memory. Deploy data transformations and trained models to production with only a few lines of code.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    AdGuard Home

    AdGuard Home

    Network-wide ads and trackers blocking DNS server

    ...With the rise of Internet-Of-Things and connected devices, it becomes more and more important to be able to control your whole network. Block throughout the whole system. This includes video ads and ads in your favorite apps, browsers, games, and on any website you can imagine. Dozens of ad filters are available to you and are updated on a regular basis, guaranteeing the best filtering quality. Protecting your personal data is our top priority. With AdGuard, you and your sensitive data will be safe from any online tracker and analytics system that may attempt to steal your data while surfing the web. ...
    Downloads: 44 This Week
    Last Update:
    See Project
  • 20
    Logseq

    Logseq

    A privacy-first, open-source platform for knowledge management

    ...Logseq is a platform for knowledge management and collaboration. It focuses on privacy, longevity, and user control. The server will never store or analyze your private notes. Your data are plain text files and we currently support both Markdown and Emacs Org-mode (more to be added soon). In the unlikely event that the website is down or cannot be maintained, your data is, and will always be yours. No data lock-in, no proprietary formats, you can edit the same Markdown/Org-mode file with any tools at the same time. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 21
    sitespeed.io

    sitespeed.io

    Monitor, analyze and optimize your website speed and performance

    Sitespeed.io is an open source tool that helps you monitor, analyze and optimize your website speed and performance, based on performance best practices advices from the coach and collecting browser metrics using the Navigation Timing API, User Timings and Visual Metrics (FirstVisualChange, SpeedIndex & LastVisualChange). Measuring performance shouldn’t be hard, you should be able to have full control of your metrics, own your own data and you should be able to do it without paying top dollars. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Mina

    Mina

    Mina is a new cryptocurrency with a constant size blockchain

    We are the world’s lightest blockchain, powered by participants. Using zero-knowledge technology, Mina is creating the infrastructure for the secure, democratic future we all deserve. Build on Mina with zkApps, smart contracts powered by zero-knowledge. The SDK, based on Typescript, will be available soon. The Mina blockchain remains ~22kb, enabling users to quickly and directly access the current state from their smartphone and other blockchains. Users today are compelled to hand over their...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Home Assistant

    Home Assistant

    Open source home automation that puts local control and privacy first

    ...See more examples on the official website: https://www.home-assistant.io/
    Downloads: 24 This Week
    Last Update:
    See Project
  • 24
    Bruno

    Bruno

    Opensource IDE For Exploring and Testing Api's

    Bruno is a Fast and Git-Friendly Opensource API client, aimed at revolutionizing the status quo represented by Postman, Insomnia, and similar tools out there. Bruno stores your collections directly in a folder on your filesystem. We use a plain text markup language, Bru, to save information about API requests. You can use git or any version control of your choice to collaborate over your API collections. Bruno is offline only. There are no plans to add cloud-sync to Bruno, ever. We value...
    Downloads: 42 This Week
    Last Update:
    See Project
  • 25
    Checkmate

    Checkmate

    Checkmate is an open-source, self-hosted tool

    Checkmate is an open-source, self-hosted infrastructure monitoring platform that provides real-time visibility into server health, uptime, response times, and incident activity through a modern web interface. The application continuously checks whether websites and services are accessible and performing optimally, generating alerts and reports when availability or performance degrades. It supports detailed infrastructure monitoring through an optional agent called Capture, which collects...
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB