Showing 245 open source projects for "xpath"

  • 1
    Scrapy

    A fast, high-level web crawling and web scraping framework

    Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications. It can be used for data mining, monitoring...
    Downloads: 23 This Week
  • 2
    gain

    Asyncio-based Python framework for building fast web crawling spiders

    ...Developers define crawlers using components such as spiders, parsers, and items, allowing them to organize crawling logic and data extraction rules clearly. Gain supports CSS selectors and XPath expressions for parsing page content and extracting specific elements. Gain also allows developers to configure headers, concurrency levels, and proxy settings to control how crawlers interact with target websites. Because it uses asynchronous programming, Gain can handle multiple requests efficiently while minimizing blocking operations.
    Downloads: 2 This Week
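To make the XPath-based extraction described for crawlers like this concrete, here is a minimal sketch using only Python's standard library. It does not use Gain's own API. The sample markup, the `project` class name, and the `extract_projects` helper are all hypothetical, and `xml.etree.ElementTree` supports only a limited subset of XPath.

```python
import xml.etree.ElementTree as ET

# Hypothetical page fragment (an assumption for illustration); it is
# well-formed, so the stdlib XML parser accepts it.
PAGE = """
<html>
  <body>
    <div class="project"><a href="/p/scrapy">Scrapy</a></div>
    <div class="project"><a href="/p/hurl">Hurl</a></div>
    <div class="ad"><a href="/ads/1">Sponsored</a></div>
  </body>
</html>
"""


def extract_projects(markup: str) -> list[dict[str, str]]:
    """Return the name and URL of every link inside a div with class 'project'."""
    root = ET.fromstring(markup.strip())
    # ElementTree's XPath subset: descendant search plus an attribute predicate.
    links = root.findall(".//div[@class='project']/a")
    return [{"name": a.text or "", "url": a.get("href", "")} for a in links]


projects = extract_projects(PAGE)
print(projects)
```

The attribute predicate skips the `ad` block, so only the two project links are returned. Real-world HTML is rarely well-formed XML, which is one reason dedicated scraping frameworks ship their own selector engines.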
  • 3
    Crawl4AI

    Open-source LLM Friendly Web Crawler & Scraper

    Crawl4AI is a high-performance, AI-ready web crawler tailored for LLM data ingestion and RAG pipelines. It supports adaptive crawling heuristics (stopping when enough info is gathered), structured markdown output, and high-speed parallel execution. It is designed to operate at scale, with optional Docker deployment and framework integrations.
    Downloads: 1 This Week
  • 4
    Hurl

    Hurl, run and test HTTP requests with plain text

    ...Hurl makes it easy to work with HTML content, REST / SOAP / GraphQL APIs, or any other XML / JSON-based APIs. Hurl can run HTTP requests but can also be used to test HTTP responses. Different types of queries and predicates are supported, from XPath and JSONPath on body response, to assert on status code and response headers.
    Downloads: 0 This Week
  • 5
    kimuraframework

    AI-first Ruby framework for building fast, flexible web scraping spiders

    Kimurai is an open source web scraping framework written in Ruby that simplifies the process of building automated data extraction tools. It provides a clean domain-specific language that allows developers to define scraping logic and data schemas with minimal boilerplate code. Kimurai can use AI-assisted extraction to identify where data resides in HTML pages, automatically generating selectors that are cached for future use so subsequent scraping runs operate with pure Ruby performance....
    Downloads: 1 This Week
  • 6
    VuFind® is a library resource discovery portal designed and developed for libraries by libraries. The goal of VuFind® is to enable your users to search and browse through all of your library's resources by replacing the traditional OPAC.
    Downloads: 8 This Week
  • 7
    GeoNetwork opensource - Metadata Catalog
    A web-based Metadata Catalog Server for data description and discovery. Supports both generic and geospatial data discovery. It implements international standards (e.g. ISO19115/19139/19119, ISO19115-3, DCAT-AP, CSW 2.0, OGC API Records). It originates from the United Nations and is used by many governments as geoportal software. Active development and discussion take place on GitHub and OSGeo Discourse. You can also connect directly with the companies supporting the...
    Downloads: 158 This Week
  • 8
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well-proven XML and text processing technologies to easily extract useful data from arbitrary web pages.
    Downloads: 2 This Week
  • 9
    Scraper of Death
    Scraper of Death is a web scraper with multiple scraping methods: Requests + BeautifulSoup (fast, lightweight) and Selenium (JavaScript support, dynamic content).
    Downloads: 1 This Week
  • 10
    PHPScraper

    A universal web-util for PHP

    PHPScraper is a universal web-scraping util for PHP, built with simplicity in mind. The goal is to make XPath selectors optional and avoid the commonly needed boilerplate code. Just create an instance of PHPScraper, go to a website, and start collecting data. All scraping functionality can be accessed either as a function call or a property call. For example, the title can be accessed in two ways. Many common use cases are covered already.
    Downloads: 0 This Week
  • 11
    dude uncomplicated data extraction

    dude uncomplicated data extraction: A simple framework

    Dude is a very simple framework for writing web scrapers using Python decorators. The design, inspired by Flask, aims to make it easy to build a web scraper in just a few lines of code. Dude has an easy-to-learn syntax. Dude is currently in Pre-Alpha. Please expect breaking changes. You can run your scraper from the terminal/shell/command line by supplying URLs, the output filename of your choice, and the paths to your Python scripts to the dude scrape command.
    Downloads: 0 This Week
  • 12
    RPA for Python

    Python package for doing RPA

    Python package for doing RPA. RPA for Python's simple and powerful API makes robotic process automation fun! You can use it to quickly automate away repetitive time-consuming tasks on websites, desktop applications, or the command line. See sample Python script, the RPA Challenge solution, and RedMart groceries example. To send a Telegram app notification, simply look up @rpapybot to allow receiving messages. To automate Chrome browser invisibly, use headless mode. To run 10X faster instead...
    Downloads: 0 This Week
  • 13
    Saxon XSLT and XQuery Processor

    The Saxon XSLT and XQuery processor, developed by Saxonica

    The home page for the Saxon XSLT 3.0, XQuery 3.1, and XPath 3.1 processor has moved to http://www.saxonica.com/. Information on this site is no longer kept up to date. SourceForge was the home for open-source versions of the Saxon XSLT 3.0, XQuery 3.1, and XPath 3.1 processor from 1999 until 2022. From version 12.0 in January 2023, you need to visit the Saxonica site at http://www.saxonica.com/ for the latest information and releases.
    Downloads: 831 This Week
  • 14
    XForms to XHTML+JavaScript (AJAX) conversion based on a unique XSL transformation. Suitable for server-side (PHP) or client-side (Internet Explorer, Mozilla Firefox, Opera, Safari) browser treatment where an XSLT 1.0 engine is available.
    Downloads: 4 This Week
  • 15
    Relative XPath Assistant

    Just right click on the html elements to compare them

    This WebExtension gives you the relative XPath expression of two web elements. Release 1.4.2 (04/08/2022): added control key functionality. The add-on is based on the relative-xpath-helper open-source project on GitHub [https://github.com/syamsasi99/relative-xpath-helper]. Details: compared to the original version, some incompatibilities of the scripts with the Firefox browser have been fixed; no data is sent to or fetched from any server. Limitations: the WebExtension only works on sites that allow content scripts...
    Downloads: 0 This Week
  • 16
    mlscraper

    ML-based HTML scraper that learns extraction rules from examples

    mlscraper is a Python library designed to automatically extract structured data from HTML pages without requiring developers to manually write CSS selectors or XPath rules. Instead of defining extraction logic by hand, users provide a few examples of the data they want to retrieve from a webpage. It analyzes those examples within the HTML document and determines patterns or rules that can be used to extract the same type of information from similar pages. Once trained, the generated scraper can process new pages and return the extracted data in structured formats such as dictionaries or lists. ...
    Downloads: 3 This Week
  • 17
    JDynamiTe, Dynamic Template in Java

    Dynamically generate documents from templates

    JDynamiTe is a tool which allows you to dynamically create documents in any format from "template" documents. And very few lines of code (or no line at all!) are needed to do that. Some typical usage domains of JDynamiTe are: - dynamic Web pages creation, - text document generation, - source code generation... In fact, it can be useful in any case where pre-defined documents (templates) have to be dynamically populated with data. The main benefit of JDynamiTe is to allow a true...
    Downloads: 0 This Week
  • 18
    This is software to create web dictionaries, especially for Esperanto, such as Reta Vortaro (http://reta-vortaro.de). A dictionary is built from articles written in a special XML dialect by transformations using XSLT, Ant and some Java code.
    Downloads: 0 This Week
  • 19
    MySQL::Admin

    cpan MySQL::Admin

    The Web Site Administration Tool lets you view and manage MySQL databases through an Ajax-XSL Web 2.0 interface. The easy way to install MySQL::Admin is to download mysql-admin-0.82-webinstaller.tar.gz, upload the whole folder to your webspace, open install.html in your browser, and then open index.html in your browser. Afterwards, delete cgi-bin/install.pl and install.html. Alternatively, simply install via cpan MySQL::Admin. Support for mod_perl is available from the CPAN package.
    Downloads: 2 This Week
  • 20
    Whitebeam Web Application Server

    JavaScript/XML engine for Apache

    Whitebeam is a complete rapid-development Web application server integrating Apache, PostgreSQL, and Mozilla's SpiderMonkey JavaScript engine with its own powerful XML/XPath-based secure application framework.
    Downloads: 0 This Week
  • 21
    MyCoRe

    your repository framework

    MyCoRe is an Open Source project for the development of Repositories, Digital Library and archive solutions. The technical base of the system is formed of Java class libraries, XML technology and different database backends. Since 2015 we use https://mycore.atlassian.net/ for bug tracking. Please use our ticket system there.
    Downloads: 0 This Week
  • 22
    pulse - Java web app framework / CMS

    Built for enterprise level and highly customized websites

    ... - Mature user and role management - Easy-to-use administration interface based on AJAX (using Ext JS) - Built for internationalisation - Flexible templating with XSLT 2.0/XPath 2.0 pulse is designed with brand sites in mind where absolute flexibility in presentation is a core requirement.
    Downloads: 1 This Week
  • 23
    eurephia is an authentication and access control plug-in for OpenVPN. It improves authentication by adding user/password auth in addition to certificates. Access control is managed via iptables on Linux servers. See web page for more info
    Downloads: 0 This Week
  • 24
    XML Viewer Plus

    A powerful XML Viewer add-on for the Firefox browser

    A powerful XML viewer that supports text/regex, jQuery/CSS, and XPath selectors. Release 1.2.6 (17.06.2019): added UTF-8 encoding for the XML viewer. Get your XML Editor and Validator on Play Store > https://play.google.com/store/apps/details?id=com.ulm.xmleditor Libraries and techs used: 1. `Vue` 2. `Vuex` 3. `lodash/debounce` 4. `jQuery` 5. `document.querySelectorAll` 6. `document.evaluate` for `XPath` 7.
    Downloads: 13 This Week
  • 25
    mod_psldap

    Apache LDAP Directory Manager

    mod_psldap is an Apache module for leveraging LDAP services built on the OpenLDAP library and the Apache APIs, to include web based A&A, web based updates to the LDAP store, server-side XSLT processing, and session management across servers.
    Downloads: 0 This Week