Showing 30 open source projects for "js-xpath"

View related business solutions
  • 99.99% Uptime for MySQL and PostgreSQL Databases Icon
    99.99% Uptime for MySQL and PostgreSQL Databases

    Sub-second maintenance. 2x read/write performance. Built-in vector search for AI apps.

    Cloud SQL Enterprise Plus delivers near-zero downtime with 35 days of point-in-time recovery. Supports MySQL, PostgreSQL, and SQL Server.
    Try Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications. It can be used for data mining, monitoring...
    Downloads: 21 This Week
    Last Update:
    See Project
  • 2
    gain

    gain

    Asyncio-based Python framework for building fast web crawling spiders

    ...Developers define crawlers using components such as spiders, parsers, and items, allowing them to organize crawling logic and data extraction rules clearly. Gain supports CSS selectors and XPath expressions for parsing page content and extracting specific elements. Gain also allows developers to configure headers, concurrency levels, and proxy settings to control how crawlers interact with target websites. Because it uses asynchronous programming, Gain can handle multiple requests efficiently while minimizing blocking operations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Crawl4AI

    Crawl4AI

    Open-source LLM Friendly Web Crawler & Scraper

    Crawl4AI is a high-performance, AI‑ready web crawler tailored for LLM data ingestion and RAG pipelines. It supports adaptive crawling heuristics (stopping when enough info is gathered), structured markdown output, and high-speed parallel execution. Designed to operate at scale with optional Docker deployment and framework integrations.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    crawler

    crawler

    Collection of JS reverse engineering examples for web scraping study

    crawler is a collection of web scraping and JavaScript reverse engineering examples designed for learning how modern websites protect their data and how those protections can be analyzed. It contains many case studies that demonstrate how to analyze and replicate request parameters, cookies, and encryption logic used by real websites. Each directory in the project focuses on a specific target service or scenario, showing how browser network requests and JavaScript code can be studied to...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    Shynet

    Shynet

    Modern, privacy-friendly, and detailed web analytics

    Modern, privacy-friendly, and detailed web analytics that works without cookies or JS. There are a lot of web analytics tools. Unfortunately, most of them come with the following caveats. They require handing all of your visitors' info to a third-party company They use cookies to track visitors across sessions, so you need to have those annoying cookie notices. They collect so much personal data that even the NSA is jealous.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    DjangoBlog

    DjangoBlog

    A blog system based on python3.8 and Django3.0

    ...Simple SEO Features, notify Google and Baidu when there was a new article or other things. Simple picture bed feature integrated. django-compressor integrated, auto-compressed css, js. Website exception email notification. When there is an unhandle exception, system will send an email notification. Wechat official account feature integrated. Now, you can use wechat official account to manage your VPS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    WhakerKit

    WhakerKit

    A seamless toolkit to manage dynamic websites and shared documents

    ...It is based on the following technologies: * python >= 3.9 * (optional) PyJWT and ldap3 for authentication (install with pip) * WhakerPy >= 1.3: <https://whakerpy.sourceforge.io> (install with pip) * Whakerexa >= 0.7: <https://whakerexa.sourceforge.io> (download package and unzip) * HTML-5, CSS-4 and JS technologies
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    dude uncomplicated data extraction

    dude uncomplicated data extraction

    dude uncomplicated data extraction: A simple framework

    Dude is a very simple framework for writing web scrapers using Python decorators. The design, inspired by Flask, was to easily build a web scraper in just a few lines of code. Dude has an easy-to-learn syntax. Dude is currently in Pre-Alpha. Please expect breaking changes. You can run your scraper from terminal/shell/command-line by supplying URLs, the output filename of your choice and the paths to your python scripts to dude scrape command.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    RPA for Python

    RPA for Python

    Python package for doing RPA

    Python package for doing RPA. RPA for Python's simple and powerful API makes robotic process automation fun! You can use it to quickly automate away repetitive time-consuming tasks on websites, desktop applications, or the command line. See sample Python script, the RPA Challenge solution, and RedMart groceries example. To send a Telegram app notification, simply look up @rpapybot to allow receiving messages. To automate Chrome browser invisibly, use headless mode. To run 10X faster instead...
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    mlscraper

    mlscraper

    ML-based HTML scraper that learns extraction rules from examples

    mlscraper is a Python library designed to automatically extract structured data from HTML pages without requiring developers to manually write CSS selectors or XPath rules. Instead of defining extraction logic by hand, users provide a few examples of the data they want to retrieve from a webpage. It analyzes those examples within the HTML document and determines patterns or rules that can be used to extract the same type of information from similar pages. Once trained, the generated scraper can process new pages and return the extracted data in structured formats such as dictionaries or lists. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    AET

    AET

    Detects visual changes on websites and performs page health checks

    AET is a system that detects visual changes on websites and performs basic page health checks (like w3c compliance, accessibility, HTTP status codes, JS Error checks and others). AET is designed as a flexible system that can be adapted and tailored to the regression requirements of a given project. The tool has been developed to aid front-end client-side layout regression testing of websites or portfolios, in essence assessing the impact or change of a website from one snapshot to the next.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Requests-HTML

    Requests-HTML

    Pythonic HTML Parsing for Humans

    ...When using this library you automatically get full JavaScript support! (Using Chromium, thanks to puppeteer) CSS Selectors (a.k.a jQuery-style, thanks to PyQuery). XPath Selectors, for the faint of heart. Mocked user-agent (like a real web browser). Automatic following of redirects. Connection–pooling and cookie persistence. The Requests experience you know and love, with magical parsing abilities, and async support. The rest of the code operates the same way as the synchronous version except that results is a list containing multiple response objects however the same basic processes can be applied as above to extract the data you want.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    pgal

    Online gallery factory

    Given images folders, generate a static web gallery. Note: This project has been moved to: https://github.com/viewplatgh/pgal
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    dynamide
    dynamide is a dynamic web application framework for handling the presentation and business layers in a traditional web app. See http://dynamide.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    webgeno is a light-weight Content Management System (CMS) for generating websites offline. It is primarily designed for personal websites where the user doesn't have access to the server except to upload files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Serves music files, playlists, HTML files, etc using http/WebDAV. Music files are categorized using metainformation (e.g. id3 tags) and this metainformation is stored in a MySQL database for sorting and searching operations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    process runtime detection with xml / image statistic output - like xfire .com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Wolfpack is an open source server implementing the protocol used by the massive multiplayer online game Ultima Online. It aims for full support of the Ultima Online protocol and tries to mimic the same gameplay as the original.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    JS library for downloading image list via an Ajax Backend and show them in a Mootools Slideshow only when image has been loaded! Plus a Server-side application (in Python), that can resize your image on-the-fly or apply a wathermark.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    CMS based on XSLT templates and usability tricks. Edit the page content directly in HTML, keeping the essential data in only one place.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    PennAve is a dynamic photo gallery software written in Python and designed for use alongside F-Spot. It makes heavy use of XML and XSLT for ease of presentation modification and sharing of information with other users, web sites, and programs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    process runtime detection with xml/image statistic output - like xfire .com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Pirate is a programmable API for developing complex, extensible JavaScript objects and modules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Syncato is a Weblog Web Services system built on top of Berkeley DB XML, Webware and Python. It has a number of unique features; XPath access to all content via URLs, XSL-T presentation and extremely flexible database structure.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Python Web development framework based on XSLT engine.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo