Showing 215 open source projects for "extensible web spider"

View related business solutions
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby and MySQL Database - Written in Java Cross Platform Also See Free email Sender : https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application https://www.microsoft.com/openjdk
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Heritrix

    Heritrix

    Internet Archive's open-source, web-scale, web crawler project

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. Heritrix (sometimes spelled heretrix, or misspelled or missaid as heratrix/heritix/heretix/heratix) is an archaic word for heiress (woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    IntelliJ Community Edition

    IntelliJ Community Edition

    IntelliJ IDEA & IntelliJ Platform

    IntelliJ Community is the open source upstream of JetBrains’ IntelliJ IDEA, forming the core of a powerful, extensible, and intelligent development environment. It provides foundational features like a robust editor with code completion, syntax highlighting, refactoring tools, version control integrations, terminal, debugger, and plugin architecture. Since it’s open, community developers can contribute to language supports, UI tweaks, and platform enhancements.
    Downloads: 1,637 This Week
    Last Update:
    See Project
  • 4
    Stirling-PDF

    Stirling-PDF

    Web application that allows you to perform operations on PDF files

    Stirling PDF is a powerful, locally hosted web-based PDF manipulation tool offering a wide range of editing, conversion, and utility features. It allows users to merge, split, compress, convert, OCR, and perform other operations on PDF files directly from a browser without uploading data to third-party servers. The tool is privacy-conscious, self-hostable via Docker, and built with modularity in mind to allow future expansion and integration.
    Downloads: 32 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    ZAP

    ZAP

    The OWASP ZAP core project

    The OWASP Zed Attack Proxy (ZAP) is one of the world’s most popular free security tools and is actively maintained by a dedicated international team of volunteers. It can help you automatically find security vulnerabilities in your web applications while you are developing and testing your applications. It's also a great tool for experienced pentesters to use for manual security testing. ZAP is an easy to use integrated penetration testing tool for finding vulnerabilities in web...
    Downloads: 104 This Week
    Last Update:
    See Project
  • 6
    Jenkins

    Jenkins

    Build great things at any scale

    Jenkins is the leading open-source automation server that allows you to build great things at any scale. Jenkins is built with Java and provides hundreds of plugins for building, deploying and automating virtually anything, allowing you to focus on more important things. Jenkins is often used for building projects, running tests, analyzing static code and deployment. Whatever is done repetitively, Jenkins can most likely execute and execute well, saving you time and optimizing your...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 7
    Logbook

    Logbook

    An extensible Java library for HTTP request and response logging

    Logbook is an extensible Java library to enable complete request and response logging for different client- and server-side technologies. It satisfies a special need by a) allowing web application developers to log any HTTP traffic that an application receives or sends b) in a way that makes it easy to persist and analyze it later. This can be useful for traditional log analysis, meeting audit requirements or investigating individual historic traffic issues.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Activiti

    Activiti

    Light-weight workflow and business process management pltaform

    Helping businesses solve automation challenges in distributed, highly-scalable and cost effective infrastructures. Activiti is the leading lightweight, java-centric open-source BPMN engine supporting real-world process automation needs. Activiti Cloud is now the new generation of business automation platform offering a set of cloud native building blocks designed to run on distributed infrastructures. Inmutable, scalable & pain free Process & Decision Runtimes designed to integrate with your...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    RStudio

    RStudio

    RStudio is an integrated development environment (IDE) for R

    RStudio is a powerful, full-featured integrated development environment (IDE) tailored primarily for the R programming language but increasingly supportive of other languages like Python and Julia. It brings together console, editor, plotting, workspace, history, and file-management panes into a unified interface, helping data scientists, statisticians, and analysts to work more productively. The IDE is cross-platform: there are desktop versions for Windows, macOS and Linux, as well as a...
    Downloads: 101 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    Eclipse Jetty Canonical Repository

    Eclipse Jetty Canonical Repository

    Eclipse Jetty - Web Container & Clients - supports HTTP/2, HTTP

    Jetty provides a web server and servlet container, additionally providing support for HTTP/2, WebSocket, OSGi, JMX, JNDI, JAAS and many other integrations. These components are open source and are freely available for commercial use and distribution. Jetty is used in a wide variety of projects and products, both in development and production. Jetty has long been loved by developers due to its long history of being easily embedded in devices, tools, frameworks, application servers, and modern...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    eXist-db

    eXist-db

    eXist Native XML Database and Application Platform

    eXist-db is an open-source, native XML database and application platform that provides a powerful environment for storing, querying, and managing XML documents. It is designed for complex data management needs, offering XQuery, XSLT, and RESTful web services for interacting with structured data.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Spring Security

    Spring Security

    Authentication and access-control framework

    Spring Security is a powerful and highly customizable authentication and access-control framework. It is the de-facto standard for securing Spring-based applications. Spring Security is a framework that focuses on providing both authentication and authorization to Java applications. Like all Spring projects, the real power of Spring Security is found in how easily it can be extended to meet custom requirements. Spring Security uses a Gradle-based build system. In the instructions, ./gradlew...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Agents-Flex

    Agents-Flex

    Agents-Flex is an elegant LLM Application Framework like LangChain

    Agents-Flex includes a variety of network protocols for connecting LLMs, such as HTTP, SSE and WS. Its simple and flexible design allows developers to easily connect to various LLMs, including OpenAI, LLama, and other AI. Agents-Flex provides a rich set of development templates and Prompt Frameworks, including FEW-SHOT, CRISPE, BROKE, and ICIO. Developers can also customize their own unique prompt templates. Agents-Flex has a very flexible Function Calling component. It supports local method...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    Lobo Evolution - Java Web Browser

    Lobo Evolution - Java Web Browser

    Lobo Evolution is an extensible all-Java web browser and RIA platform

    Lobo Evolution is a fork of Lobo Browser. The project continuing the work of Lobo Browser(lobochief). Lobo Evolution is an extensible all-Java web browser and RIA platform. It supports HTML 4, HTML5 Javascript, CSS 3 and Java (Swing) rendering. CobraEvolution is the web browser's renderer API; also a Javascript-aware HTML parser. Lobo Evolution 5.0 relesed CHANGELOG: https://github.com/LoboEvolution/LoboEvolution/releases Read wiki: https://loboevolution.github.io/LoboEvolution/project-info.html Javadoc site: https://oswetto.github.io/LoboEvolution Now you can fork the project and help me with code. ...
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    TJWS is an Open Source HTTP Server and Servlet container written in 100% Java. It's designed to be a light weight, high performing, secure, embeddable, extensible and flexible. Very small footprinted (~100K), CGI, J2EE/JSP compatible. Servlet spec 3.1
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    jEdit

    jEdit

    jEdit is a programmer's text editor written in Java.

    jEdit is a programmer's text editor written in Java. It uses the Swing toolkit for the GUI and can be configured as a rather powerful IDE through the use of its plugin architecture.
    Leader badge
    Downloads: 402 This Week
    Last Update:
    See Project
  • 17
    A service-oriented, cross-platform application framework (v1 was implemented in .net, current codebase is in Java), which is trageted to speed up and standartise development by providing a robust freely extensible component structure.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    WFDownloader App

    WFDownloader App

    Free batch downloader for image, wallpaper, video, audio, document,

    Use as an image gallery, wallpaper, audio/music, video, document, and other media bulk downloader from supported websites. Also use to download sequential website urls that have a certain pattern (e.g. image01.png to image100.png). Also use app's built-in site crawler for advanced link search or extraction. There is also special support for forum media downloading, forum thread offline archiving, rss feed downloading, and open directory downloading. It's a programmable downloader and also...
    Leader badge
    Downloads: 358 This Week
    Last Update:
    See Project
  • 19
    OpenKM Document Management - DMS

    OpenKM Document Management - DMS

    Document Management System and Content Management System

    OpenKM Community Edition is a free Document Management System (DMS) that helps businesses control the production, storage, management and distribution of electronic documents, boosting effectiveness and productivity. It integrates document management, collaboration and advanced search into one easy-to-use solution, including administration tools for user roles, access control, security levels, activity logs and automation setup. With OpenKM Community Edition you can: Collect information...
    Leader badge
    Downloads: 337 This Week
    Last Update:
    See Project
  • 20
    LogicalDOC Document Management - DMS

    LogicalDOC Document Management - DMS

    smart and open source document management system

    LogicalDOC is both document management and collaboration system. The software is loaded with many functions and allows organizing, index, retrieving, controlling and distributing important business documents securely and safely for any organization and individual. Gone are the days when companies used paper-based processes such as printing, mailing and manual filing of paper documents; our document management system replaces all of this with electronic procedures that allow your...
    Leader badge
    Downloads: 120 This Week
    Last Update:
    See Project
  • 21
    Joget

    Joget

    AI Powered Open Source Platform to Easily Build Enterprise Web Apps

    Joget offers an open-source, AI-powered platform that converges no-code/low-code development with AI to rapidly build and customize enterprise applications at scale. By combining AI with visual app builders—not raw code—Joget makes app generation faster, safer, and more accessible for everyone. With Generative AI and Agentic AI capabilities, Joget Intelligence enables organizations to automate and enhance processes while maintaining oversight and compliance. Unlike typical AI code...
    Downloads: 21 This Week
    Last Update:
    See Project
  • 22
    x3d

    x3d

    X3D is the open-standard format for 3D graphics scenes on the Web.

    Extensible 3D (X3D) Graphics is a royalty-free International Standard for real-time interactive 3D graphics on the Web, providing unsurpassed interoperability for 3D communications on the Web. This project includes source for example X3D scene libraries and multiple X3D codebases produced by Web3D Consortium members. All open-source contributions are welcome.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    Scribe
    Scribe is a CMS for the Liferay Portal framework. It includes Web Content Management as well as Learning Management System features.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Cinnamon is an Enterprise CMS with all functions of a Component CMS, and is ideal for versioned management of assets of all types, like modular technical documentation (DITA and other standards), media assets, documents, contracts, software releases etc. Cinnamon consists of the server, a desktop client and all libraries the desktop client is built on. Cinnamon is an ideal platform for technical documentation systems, custom web or desktop applications in the fields of document...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    eXo Platform - Digital Workplace

    eXo Platform - Digital Workplace

    The open-source digital workplace for growing teams and enterprises.

    eXo Platform is an open-source digital workplace solutions for growing teams and enterprises, featuring: ✅ Internal Communications ✅ Team Collaboration ✅ Knowledge Management ✅ Productivity and Employee Recognition use cases. eXo stands out by: 👍 its fluid and integrated employe experience, on desktop and mobile 👍 the platform’s ease of use 👍 innovative employee engagement features. eXo Platform is developed on open-source technology and supports open...
    Downloads: 10 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next