Showing 28 open source projects for "extensible web spider"

  • 1
    Scrapy-Redis

    Redis-based components for Scrapy

    You can start multiple spider instances that share a single Redis queue. Best suited for broad multi-domain crawls. Scraped items get pushed into a Redis queue, meaning that you can start as many post-processing processes as needed, all sharing the items queue. Components include Scheduler + Duplication Filter, Item Pipeline, and Base Spiders. The default requests serializer is pickle, but it can be changed to any module with loads and dumps functions. Note that pickle is not compatible between Python versions. Version...
    Downloads: 4 This Week
  • 2
    Luakit

    Fast, small, webkit based browser framework extensible by Lua

    Luakit is a highly configurable browser framework based on the WebKit web content engine and the GTK+ toolkit. It is very fast, extensible with Lua, and licensed under the GNU GPLv3 license. It is primarily targeted at power users, developers and anyone who wants to have fine-grained control over their web browser’s behavior and interface. While switching to the WebKit 2 API means a vastly improved security situation, not all distributions of Linux package the most up-to-date version of WebKitGTK+, and several package very outdated versions that have many known vulnerabilities. ...
    Downloads: 9 This Week
  • 3
    Lighthouse

    Automated auditing, performance metrics, & best practices for the web

    Lighthouse is an open-source, automated tool that analyzes and audits web apps and web pages in order to improve their quality. Lighthouse collects modern performance metrics and insights on developer best practices; auditing for performance, accessibility, SEO and more. After auditing it produces a report either in JSON or HTML. Included in the report is a reference doc that explains the importance of the audit and how to fix the problem areas, which you can use to improve the web app or...
    Downloads: 14 This Week
  • 4
    Open ChatGPT Atlas

    Open Source and Free Alternative to ChatGPT Atlas

    Open ChatGPT Atlas is an open-source toolkit and interface for working with OpenAI’s ChatGPT models in a more extensible, adaptable, and composable way than standard web UIs allow. It provides an architecture where developers and power users can manage state, tool integrations, and multi-turn workflows with more control, enabling custom UIs, automation layers, and advanced routing logic. Unlike a fixed chat app, Atlas is designed as a foundation that can be extended with plugins, external APIs, and custom logic to support domain-specific assistants, agent-like behaviors, and multi-task workflows. ...
    Downloads: 13 This Week
  • 5
    Melt UI

    A set of headless, accessible component builders for Svelte

    Melt UI is an open-source headless component builder library created specifically for the Svelte ecosystem, designed to help developers construct accessible and highly customizable user interface components. Rather than providing fully styled widgets, the library focuses on supplying the behavioral logic and accessibility patterns needed to build UI components while allowing developers to control the visual appearance. Melt UI introduces a builder-based architecture where component logic is...
    Downloads: 7 This Week
  • 6
    Mink

    PHP web browser emulator abstraction

    Mink is an open-source PHP library that provides a browser abstraction layer for web application testing, allowing developers to simulate user interactions with websites in a consistent and flexible way. Instead of tying test logic to a specific browser driver, Mink introduces a unified API that can work with multiple drivers such as Goutte, Selenium, ChromeDriver, or BrowserKit. This abstraction enables developers to write tests once and run them across different environments without...
    Downloads: 0 This Week
  • 7
    Lobo Evolution - Java Web Browser

    Lobo Evolution is an extensible all-Java web browser and RIA platform

    Lobo Evolution is a fork of Lobo Browser that continues the work of Lobo Browser (lobochief). Lobo Evolution is an extensible all-Java web browser and RIA platform. It supports HTML 4, HTML5, JavaScript, CSS 3 and Java (Swing) rendering. CobraEvolution is the web browser's renderer API and also a JavaScript-aware HTML parser. Lobo Evolution 5.0 released. CHANGELOG: https://github.com/LoboEvolution/LoboEvolution/releases Read wiki: https://loboevolution.github.io/LoboEvolution/project-info.html Javadoc site: https://oswetto.github.io/LoboEvolution Now you can fork the project and help me with code. ...
    Downloads: 5 This Week
  • 8
    Eric Integrated Development Environment

    Python Development Environment with all batteries included

    Eric is a Python IDE written using PyQt and QScintilla. It provides various features such as any number of open editors, an integrated (remote) debugger, project management facilities, unit test, refactoring and much more.
    Downloads: 136 This Week
  • 9
    Web Spider, Web Crawler, Email Extractor

    Free tool that extracts emails, phone numbers and custom text from the web using Java regex

    The Files section includes WebCrawlerMySQL.jar, which supports a MySQL connection. Please follow this link to get the latest version: https://sourceforge.net/projects/web-spider-web-crawler-extract/ Free web spider and crawler. Extracts information from the web by parsing millions of pages. Data is stored in a Derby or MySQL database and is not lost if the spider is force-closed.
    - Free web spider, parser, extractor, crawler
    - Extraction of emails, phone numbers and custom text from the web
    - Export to Excel file
    - Data saved into a Derby database
    - Written in Java, cross-platform
    See also the Free Email Sender at this link: https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application: https://www.microsoft.com/openjdk
    Downloads: 0 This Week
  • 10
    generator-ngx-rocket

    Extensible Angular 13+ enterprise-grade project generator

    Extensible Angular 5+ enterprise-grade project generator based on angular-cli with best practices from the community. Bootstrap, Angular Material or even Ionic are available, just pick one and start to code. Save time and maintenance effort by creating Progressive Web App and mobile app with the same code base. The app template is based on HTML5, TypeScript and Sass.
    Downloads: 0 This Week
  • 11
    Browser Pool

    A Node.js library to easily manage and rotate a pool of web browsers

    ...We also wanted to easily and reliably manage the whole browser/page lifecycle. You can use Browser Pool for scraping the internet at scale, testing your website in multiple browsers at the same time or launching web automation robots.
    Downloads: 5 This Week
  • 12
    LibreWeb Browser

    Decentralized Web Browser

    LibreWeb is an open-source decentralized web browser, leveraging IPFS. What would you do differently if you could reinvent the Internet in the 21st century, with all the knowledge and new technologies available today? I was inspired by Douglas Engelbart, Tim Berners-Lee, and Ted Nelson as well as projects like IPFS, Jekyll, ARPANET, and more. Everyone should be able to easily read and create a site/blog/news page and publish the content online (with minimal technical knowledge). Built-in...
    Downloads: 0 This Week
  • 13
    Wexond

    Modern and feature-rich web browser base based on Electron

    Wexond is a beautiful, open-source, extensible web browser that's focused on privacy and offers a totally new user experience. Because it's made with Electron, it's not bloated with redundant Google tracking services and only utilizes minimal resources. And thanks to its Wexond Shield powered by Cliqz, websites can load up to eight times faster, and you can freely browse the web without pesky ads or worrying about websites tracking you.
    Downloads: 9 This Week
  • 14
    Literally Canvas

    A canvas in your browser

    Literally Canvas is an extensible, open source (BSD-licensed), HTML5 drawing widget. Its only dependency is React.js. You can use it to embed drawing boards in web pages. It’s kind of like an extensible MS Paint in JavaScript. Users can sketch drawings and you can do what you like with the results. It comes with an easy-to-use jQuery plugin. It’s developed by the community for many purposes and is free to use and extend.
    Downloads: 0 This Week
  • 15
    Nightingale

    A community supported fork of the Songbird media player and library.

    Nightingale is a community created fork of the Songbird media player. It is developed by a proud community and we are equally proud to bring you the most extensible, feature-rich media experience on Windows, Mac, and Linux. See the official website at http://getnightingale.com for the source, builds, and information. On Sourceforge, we provide our releases, the binary deps for building, as well as builds for testing purposes.
    Downloads: 7 This Week
  • 16
    webStraktor is a programmable World Wide Web data extraction client. Its purpose is to scrape HTML based content via the HTTP protocol and extract relevant information. webStraktor features a scripting language to facilitate the collection, the extraction and the storage of information available on the web, including images. The scripting language uses elements of the Regular Expression and xPath syntax. The webStraktor scripting language has a small instruction set and its syntax is easy...
    Downloads: 0 This Week
  • 17

    jCounter

    jCounter is a simple light-weight extensible jQuery plugin that helps you create various date-based countdowns, countups and custom timers. It comes with server-side timezone support to display accurate countdowns across the globe and pure css scalable themes.
    Downloads: 0 This Week
  • 18
    Midori is a lightweight web browser.
    Downloads: 14 This Week
  • 19
    VxmlSurfer is a modular, multi-platform, extensible VoiceXML 2.0 interpreter, geared to Web browsing as opposed to telephony. VxmlSurfer is currently used as a user interface for the HearSay non-visual Web browser.
    Downloads: 0 This Week
  • 20
    XUP (Extensible User Interface Protocol) is a SOAP-based protocol for delivering events and incremental UI updates. This project builds a web application development framework on top of XUP, consisting of a client, a server toolkit, and a template viewer.
    Downloads: 0 This Week
  • 21
    A Firefox extension that resolves an eXtensible Resource Identifier (XRI) and renders the eXtensible Resource Descriptor Set (XRDS) document in a human-friendly interface.
    Downloads: 0 This Week
  • 22
    Pirate is a programmable API for developing complex, extensible JavaScript objects and modules.
    Downloads: 0 This Week
  • 23
    Robust, featureful, multi-threaded CLI web spider written in Java using Apache Commons HttpClient v3.0. ASpider downloads any files matching your given MIME types from a website. By default it tries to match emails with a regular expression, logging all results using log4j.
    Downloads: 0 This Week
  • 24
    The goal of the project is to create a 100% pure Java-based browser that supports the latest standards from the W3C. This project is made up of two parts: the actual browser application written in Java Swing, and a Swing component that renders HTML.
    Downloads: 0 This Week
  • 25
    ETC is a tiny Linux distribution combined with a PNG-displaying web browser. Using an inexpensive surplus touchscreen PC, ETC becomes an interface for home control, MP3 players, or whatever you can imagine.
    Downloads: 0 This Week