19 projects for "extensible web spider" with 2 filters applied:

  • 1
    Python-Spider

    Python3 web crawler practice

    ...As part of the author’s public learning-path repositories, python-spider likely includes examples of HTTP requests, HTML parsing, maybe concurrency or scheduling to crawl multiple pages, and techniques to handle common web-scraping issues. For people wanting to get hands-on with building scrapers, collecting data, or learning how to navigate web programming in Python, this repository acts as a didactic reference or starting point.
    Downloads: 0 This Week
  • 2
    Node Crawler

    Web Crawler/Spider for NodeJS + server-side jQuery

    Most powerful, popular and production crawling/scraping package for Node, happy hacking.
    Downloads: 13 This Week
  • 3
    Heritrix

    Internet Archive's open-source, web-scale, web crawler project

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. Heritrix (sometimes spelled heretrix, or misspelled or missaid as heratrix/heritix/heretix/heratix) is an archaic word for heiress (woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.
    Downloads: 5 This Week
  • 4
    Mako

    An extremely fast, production-grade web bundler based on Rust

    Mako is a new web bundler for web apps, libraries, and frameworks. It's designed to be fast, reliable, and easy to use. It has been used in hundreds of projects in production by Ant Group, and other companies. If you are looking for a modern web bundler, Mako is the right choice.
    Downloads: 2 This Week
  • 5
    Prism

    Lightweight, robust, elegant syntax highlighting

    Prism is a lightweight, extensible syntax highlighter, built with modern web standards in mind. It’s used in millions of websites, including some of those you visit daily.
    Downloads: 4 This Week
  • 6
    Logbook

    An extensible Java library for HTTP request and response logging

    Logbook is an extensible Java library to enable complete request and response logging for different client- and server-side technologies. It satisfies a special need by a) allowing web application developers to log any HTTP traffic that an application receives or sends b) in a way that makes it easy to persist and analyze it later. This can be useful for traditional log analysis, meeting audit requirements or investigating individual historic traffic issues.
    Downloads: 1 This Week
  • 7
    Servant

    Haskell DSL for describing, serving, querying, mocking web apps

    Servant provides a type-level domain-specific language (DSL) in Haskell for describing web APIs. From a single API specification, developers can derive server implementations, client libraries, documentation, and more—ensuring consistency and type safety across the stack. We have a tutorial that introduces the core features of servant. After this article, you should be able to write your first server web services, learning the rest from the haddocks' examples. The core documentation can be...
    Downloads: 4 This Week
  • 8
    React ChatBotify

    A modern React library for creating flexible and extensible chatbots

    Welcome to React ChatBotify, an intuitive and versatile library that allows you to build responsive chatbots with ease. Whether you're aiming to create a simple FAQ bot or a sophisticated conversational interface integrated with Large Language Models (LLMs), React ChatBotify provides the tools and flexibility you need to bring your vision to life.
    Downloads: 4 This Week
  • 9
    Perfetto

    Production-grade client-side tracing, profiling, and analysis

    Perfetto is a production-grade tracing platform for Android, Linux, and Chrome that captures extremely detailed information about what a system is doing over time. It’s designed around a low-overhead producer/consumer model: instrumented components (“producers”) write binary events into shared memory buffers and a collector (“service”) reliably streams them to storage. The data model spans kernel and userspace, so you can stitch together CPU scheduling, app lifecycles, binder/IPC hops, GPU...
    Downloads: 15 This Week
  • 10
    Lyricist

    Missing I18N/L10N (internationalization/localization) multiplatform

    Jetpack Compose greatly improved the way we build UIs on Android, but not how we interact with strings. stringResource() works well, but doesn't benefit from the idiomatic Kotlin like Compose. Lyricist tries to make working with strings as powerful as building UIs with Compose, i.e., working with parameterized string is now typesafe, use of when expression to work with plurals with more flexibility, and even load/update the strings dynamically via an API.
    Downloads: 5 This Week
  • 11
    Papis

    Powerful and highly extensible command-line based document

    Papis is a powerful and highly extensible CLI document and bibliography manager. With Papis, you can search your library for books and papers, add documents and notes, import and export to and from other formats, and much much more. Papis uses a human-readable and easily hackable .yaml file to store each entry's bibliographical data. It strives to be easy to use while providing a wide range of features. And for those who still want more, Papis makes it easy to write scripts that extend its...
    Downloads: 0 This Week
  • 12
    BentoCache

    Bentocache is a robust multi-tier caching library for Node.js app

    Bentocache is a flexible caching library for Node.js that supports multiple backends like memory, disk, and Redis. It offers decorators for easy function-level caching and is designed to be lightweight, extensible, and developer-friendly. Bentocache is well-suited for performance optimization in web apps, scripts, and data pipelines.
    Downloads: 0 This Week
  • 13
    ClojureDart

    Clojure dialect for Flutter and Dart

    ClojureDart is a modern dialect of Clojure that compiles to Dart, enabling developers to create native mobile, desktop, web, and CLI applications using the Flutter and Dart ecosystem with Clojure syntax while offering production-ready performance. You should get the sample running either in Chrome or as a desktop app. Compiles Clojure code into Dart to produce mobile, desktop, and web apps. Production-ready toolchain, with stable compiler used in real-world apps.
    Downloads: 0 This Week
  • 14
    C++ Standard Airline IT Object Library
    This project aims to provide a clean API, and the corresponding C++ implementation, for the basis of an Airline IT Business Object Model (BOM), i.e., one to be used by several other open-source projects, such as RMOL, Air-Sched, Travel-CCM, OpenTREP, etc.
    Downloads: 0 This Week
  • 15
    Android Saripaar

    UI form validation library for Android

    Android Saripaar is a simple, feature-rich, and powerful rule-based UI form validation library for Android. It is the SIMPLEST UI validation library available for Android. It is built on top of Apache Commons Validator, a validation framework with a proven track record on web, desktop, and mobile platforms. It offers declarative validation using annotations and is extensible through custom annotations. It supports synchronous and asynchronous validation, so you don't have to worry about threading, as well as both BURST and IMMEDIATE modes. It works with stock Android widgets, with no custom view dependencies, and isolates validation logic using rules.
    Downloads: 0 This Week
  • 16
    C++ Simulated Travel Distribution System
    This project aims to provide a clean API and a simple implementation, as a C++ library, of a travel-oriented distribution system. It corresponds to a simulated version of real-world Computerized Reservation Systems (CRS).
    Downloads: 0 This Week
  • 17
    C++ Airline Inventory Management Library
    This project aims to provide a clean API and a simple implementation, as a C++ library, of an airline inventory management system. The library uses the Standard Airline IT C++ object model (http://sf.net/projects/stdair).
    Downloads: 0 This Week
  • 18
    Open Airline Revenue Accounting
    This project aims to deliver a reference implementation of a library that estimates and serves average prices paid for air travel products. It is not intended for use by an actual airline, but rather by simulators or other airline-related modules of
    Downloads: 0 This Week
  • 19
    You know the component scanning feature of Spring? You'll love this: component scanning as easy as select(javaClasses()).from("your.package").returning(allAnnotatedWith(YourAnnotation.class)). Can support any language running on the JVM. eXtcos is now also available from Maven Central. To include it in your Maven project, just add this dependency: groupId: net.sf.extcos, artifactId: extcos, version: 0.4b. Unfortunately Sourceforge doesn't support XML snippets in the...
    Downloads: 0 This Week