164 projects for "extensible web spider" with 2 filters applied:

  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    Python-Spider

    Python-Spider

    Python3 web crawler practice

    ...As part of the author’s public learning-path repositories, python-spider likely includes examples of HTTP requests, HTML parsing, maybe concurrency or scheduling to crawl multiple pages, and techniques to handle common web-scraping issues. For people wanting to get hands-on with building scrapers, collecting data, or learning how to navigate web programming in Python, this repository acts as a didactic reference or starting point.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Node Crawler

    Node Crawler

    Web Crawler/Spider for NodeJS + server-side jQuery

    Most powerful, popular and production crawling/scraping package for Node, happy hacking.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 3
    Heritrix

    Heritrix

    Internet Archive's open-source, web-scale, web crawler project

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. Heritrix (sometimes spelled heretrix, or misspelled or missaid as heratrix/heritix/heretix/heratix) is an archaic word for heiress (woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Ash

    Ash

    A declarative, extensible framework for building Elixir applications

    Ash is a declarative framework for building resource-oriented apps in Elixir. It emphasizes composability, DSL-driven definitions of resources/actions/relationships, and extensibility through plugins for API, database, and UI layers.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    Mako

    Mako

    An extremely fast, production-grade web bundler based on Rust

    Mako is a new web bundler for web apps, libraries, and frameworks. It's designed to be fast, reliable, and easy to use. It has been used in hundreds of projects in production by Ant Group, and other companies. If you are looking for a modern web bundler, Mako is the right choice.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Django

    Django

    The Web framework for perfectionists with deadlines

    Django is a high-level, free and open-source Python web framework founded on the Model–Template–View (MTV) pattern, designed to facilitate rapid development of secure, maintainable, and scalable database-driven websites. First, read docs/intro/install.txt for instructions on installing Django. Next, work through the tutorials in order (docs/intro/tutorial01.txt, docs/intro/tutorial02.txt, etc.). If you want to set up an actual deployment server, read docs/howto/deployment/index.txt for...
    Downloads: 32 This Week
    Last Update:
    See Project
  • 7
    Flight Core

    Flight Core

    An extensible micro-framework for PHP

    FlightPHP is a lightweight, fast, and flexible micro-framework for PHP, designed to build RESTful web applications and APIs. It provides a simple routing system, middleware support, and a powerful templating engine. FlightPHP is ideal for developers looking for a minimalist framework that doesn't impose a lot of structure, while still offering essential features for building modern web applications.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications. It can be used for data mining, monitoring...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 9
    Monaco Editor

    Monaco Editor

    A browser based code editor

    Monaco Editor is the rich, browser-based code editor that powers Visual Studio Code, providing advanced editing capabilities as a standalone embeddable library for web applications. Models are at the heart of Monaco editor. It's what you interact with when managing content. A model represents a file that has been opened. This could represent a file that exists on a file system, but it doesn't have to. For example, the model holds the text content, determines the language of the content, and...
    Downloads: 17 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    TanStack Form

    TanStack Form

    Headless, performant, and type-safe form state management

    TanStack Form is a powerful, headless form management library designed to simplify form handling in web applications. It offers a flexible and extensible API that allows developers to manage form state, validation, and submission with ease. By providing a headless architecture, TanStack Form enables seamless integration with various UI frameworks and custom components, promoting a clean separation between form logic and presentation.​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Prism

    Prism

    Lightweight, robust, elegant syntax highlighting

    Prism is a lightweight, extensible syntax highlighter, built with modern web standards in mind. It’s used in millions of websites, including some of those you visit daily.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    Logbook

    Logbook

    An extensible Java library for HTTP request and response logging

    Logbook is an extensible Java library to enable complete request and response logging for different client- and server-side technologies. It satisfies a special need by a) allowing web application developers to log any HTTP traffic that an application receives or sends b) in a way that makes it easy to persist and analyze it later. This can be useful for traditional log analysis, meeting audit requirements or investigating individual historic traffic issues.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Servant

    Servant

    Haskell DSL for describing, serving, querying, mocking web apps

    Servant provides a type-level domain-specific language (DSL) in Haskell for describing web APIs. From a single API specification, developers can derive server implementations, client libraries, documentation, and more—ensuring consistency and type safety across the stack. We have a tutorial that introduces the core features of servant. After this article, you should be able to write your first server web services, learning the rest from the haddocks' examples. The core documentation can be...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    Behat

    Behat

    BDD in PHP

    Behat is a Behavior-Driven Development (BDD) framework for PHP that helps developers write tests in a human-readable format. It uses Gherkin syntax to describe expected application behavior and allows developers to write scenarios that map to automated tests. Behat is highly extensible, making it suitable for testing both web applications and APIs, and it is often used alongside Mink for browser automation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    React ChatBotify

    React ChatBotify

    A modern React library for creating flexible and extensible chatbots

    Welcome to React ChatBotify, an intuitive and versatile library that allows you to build responsive chatbots with ease. Whether you're aiming to create a simple FAQ bot or a sophisticated conversational interface integrated with Large Language Models (LLMs), React ChatBotify provides the tools and flexibility you need to bring your vision to life.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    JupyterLite

    JupyterLite

    Wasm powered Jupyter running in the browser

    JupyterLite is a browser-based distribution of the Jupyter ecosystem that enables interactive computing entirely within a web browser without requiring a backend server. Built using JupyterLab components and powered by WebAssembly technologies, it allows users to run Python and other language kernels directly in the browser through tools like Pyodide or Xeus. This architecture eliminates the need for installation or server infrastructure, making it highly accessible for education,...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 17
    Perfetto

    Perfetto

    Production-grade client-side tracing, profiling, and analysis

    Perfetto is a production-grade tracing platform for Android, Linux, and Chrome that captures extremely detailed information about what a system is doing over time. It’s designed around a low-overhead producer/consumer model: instrumented components (“producers”) write binary events into shared memory buffers and a collector (“service”) reliably streams them to storage. The data model spans kernel and userspace, so you can stitch together CPU scheduling, app lifecycles, binder/IPC hops, GPU...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 18
    Lyricist

    Lyricist

    Missing I18N/L10N (internationalization/localization) multiplatform

    Jetpack Compose greatly improved the way we build UIs on Android, but not how we interact with strings. stringResource() works well, but doesn't benefit from the idiomatic Kotlin like Compose. Lyricist tries to make working with strings as powerful as building UIs with Compose, i.e., working with parameterized string is now typesafe, use of when expression to work with plurals with more flexibility, and even load/update the strings dynamically via an API.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    Papis

    Papis

    Powerful and highly extensible command-line based document

    Papis is a powerful and highly extensible CLI document and bibliography manager. With Papis, you can search your library for books and papers, add documents and notes, import and export to and from other formats, and much much more. Papis uses a human-readable and easily hackable .yaml file to store each entry's bibliographical data. It strives to be easy to use while providing a wide range of features. And for those who still want more, Papis makes it easy to write scripts that extend its...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    miso

    miso

    A tasty Haskell front-end framework

    Miso is a small, production-ready, "isomorphic" Haskell front-end framework for quickly building highly interactive single-page web applications. It features a virtual-dom, recursive diffing / patching algorithm, attribute and property normalization, event delegation, event batching, SVG, Server-sent events, Websockets, type-safe servant-style routing and an extensible Subscription-based subsystem. Inspired by Elm, Redux and Bobril. Miso is pure by default, but side effects (like XHR) can be introduced into the system via the Effect data type. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    BentoCache

    BentoCache

    Bentocache is a robust multi-tier caching library for Node.js app

    Bentocache is a flexible caching library for Python that supports multiple backends like memory, disk, and Redis. It offers decorators for easy function-level caching and is designed to be lightweight, extensible, and developer-friendly. Bentocache is well-suited for performance optimization in web apps, scripts, and data pipelines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Jovo Framework

    Jovo Framework

    The React for Voice and Chat, build apps for Alexa, Google Assistant

    The multimodal experience platform enables professional teams to build and run apps that work across smart speakers, the web, mobile, and more. Fully customizable and open source. The Jovo product ecosystem allows you to build, test, and run powerful experiences for voice, chat, and web platforms. From local development to production, Jovo allows you to build robust experiences, faster. Build across devices and platforms and use all supported modalities thanks to the Jovo output template...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Aimeos core

    Aimeos core

    Aimeos PHP ecommerce framework for ultra fast online shops

    Aimeos-Core is the core library and framework for the Aimeos ecommerce ecosystem, providing the essential building blocks for flexible, scalable online shops, marketplaces, and enterprise commerce applications. Unlike a monolithic shop application, Aimeos-Core is designed as a modular framework that you can embed into your own web app, enabling you to define custom front-ends and workflows while relying on its robust commerce services for catalog, cart, checkout, orders, and customer...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Silverstripe CMS

    Silverstripe CMS

    Silverstripe CMS - this is a module for Silverstripe Framework

    Silverstripe CMS is an intuitive content management system and flexible framework loved by editors and developers alike. Equip your web teams to achieve outstanding results. Silverstripe CMS fits the outcomes you want and doesn't force your business outcomes into an out-of-the-box solution. Customize to your needs. You can be the CMS expert in no time. Get started quickly and deliver your content to your users fast. Don’t stay awake at night worrying! Silverstripe CMS is solid as a rock,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Graphile's Crystal Monorepo

    Graphile's Crystal Monorepo

    Graphile's Crystal Monorepo; home to Grafast, PostGraphile

    The next-generation planning and execution engine for GraphQL. Enables greater performance and efficiency via deeper integration into your existing Node.js or remote business logic, no matter the shape of your GraphQL schema. Grafast and PostGraphile V5 are in sponsors-only alpha; sponsor at any tier, then reach out for access. A quick-start project for full-stack application development in React, Node.js, GraphQL and PostgreSQL. At Graphile we love GraphQL so much we named ourself for our...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next