Showing 21 open source projects for "xpath"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications. It can be used for data mining, monitoring...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 2
    gain

    gain

    Asyncio-based Python framework for building fast web crawling spiders

    ...Developers define crawlers using components such as spiders, parsers, and items, allowing them to organize crawling logic and data extraction rules clearly. Gain supports CSS selectors and XPath expressions for parsing page content and extracting specific elements. Gain also allows developers to configure headers, concurrency levels, and proxy settings to control how crawlers interact with target websites. Because it uses asynchronous programming, Gain can handle multiple requests efficiently while minimizing blocking operations.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Crawl4AI

    Crawl4AI

    Open-source LLM Friendly Web Crawler & Scraper

    Crawl4AI is a high-performance, AI‑ready web crawler tailored for LLM data ingestion and RAG pipelines. It supports adaptive crawling heuristics (stopping when enough info is gathered), structured markdown output, and high-speed parallel execution. Designed to operate at scale with optional Docker deployment and framework integrations.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    dude uncomplicated data extraction

    dude uncomplicated data extraction

    dude uncomplicated data extraction: A simple framework

    Dude is a very simple framework for writing web scrapers using Python decorators. The design, inspired by Flask, was to easily build a web scraper in just a few lines of code. Dude has an easy-to-learn syntax. Dude is currently in Pre-Alpha. Please expect breaking changes. You can run your scraper from terminal/shell/command-line by supplying URLs, the output filename of your choice and the paths to your python scripts to dude scrape command.
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 5
    RPA for Python

    RPA for Python

    Python package for doing RPA

    Python package for doing RPA. RPA for Python's simple and powerful API makes robotic process automation fun! You can use it to quickly automate away repetitive time-consuming tasks on websites, desktop applications, or the command line. See sample Python script, the RPA Challenge solution, and RedMart groceries example. To send a Telegram app notification, simply look up @rpapybot to allow receiving messages. To automate Chrome browser invisibly, use headless mode. To run 10X faster instead...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    mlscraper

    mlscraper

    ML-based HTML scraper that learns extraction rules from examples

    mlscraper is a Python library designed to automatically extract structured data from HTML pages without requiring developers to manually write CSS selectors or XPath rules. Instead of defining extraction logic by hand, users provide a few examples of the data they want to retrieve from a webpage. It analyzes those examples within the HTML document and determines patterns or rules that can be used to extract the same type of information from similar pages. Once trained, the generated scraper can process new pages and return the extracted data in structured formats such as dictionaries or lists. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Requests-HTML

    Requests-HTML

    Pythonic HTML Parsing for Humans

    ...When using this library you automatically get full JavaScript support! (Using Chromium, thanks to puppeteer) CSS Selectors (a.k.a jQuery-style, thanks to PyQuery). XPath Selectors, for the faint of heart. Mocked user-agent (like a real web browser). Automatic following of redirects. Connection–pooling and cookie persistence. The Requests experience you know and love, with magical parsing abilities, and async support. The rest of the code operates the same way as the synchronous version except that results is a list containing multiple response objects however the same basic processes can be applied as above to extract the data you want.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    pgal

    Online gallery factory

    Given images folders, generate a static web gallery. Note: This project has been moved to: https://github.com/viewplatgh/pgal
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    dynamide
    dynamide is a dynamic web application framework for handling the presentation and business layers in a traditional web app. See http://dynamide.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 10
    webgeno is a light-weight Content Management System (CMS) for generating websites offline. It is primarily designed for personal websites where the user doesn't have access to the server except to upload files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Serves music files, playlists, HTML files, etc using http/WebDAV. Music files are categorized using metainformation (e.g. id3 tags) and this metainformation is stored in a MySQL database for sorting and searching operations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    process runtime detection with xml / image statistic output - like xfire .com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Wolfpack is an open source server implementing the protocol used by the massive multiplayer online game Ultima Online. It aims for full support of the Ultima Online protocol and tries to mimic the same gameplay as the original.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    CMS based on XSLT templates and usability tricks. Edit the page content directly in HTML, keeping the essential data in only one place.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    PennAve is a dynamic photo gallery software written in Python and designed for use alongside F-Spot. It makes heavy use of XML and XSLT for ease of presentation modification and sharing of information with other users, web sites, and programs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    process runtime detection with xml/image statistic output - like xfire .com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Syncato is a Weblog Web Services system built on top of Berkeley DB XML, Webware and Python. It has a number of unique features; XPath access to all content via URLs, XSL-T presentation and extremely flexible database structure.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Python Web development framework based on XSLT engine.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Radix is a RAD framework for creating native XML web applications. Complete web applications can be created without programming knowledge using XML, XSLT, XPath and related technologies. Radix can be extended using Java, JavaScript, Python, and Tcl.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    XMDNS is an extensible DNS management scheme that uses XML to store data. It features easy manipulation of views (or split horizon DNS). There is also support for hand-crafting records for situations where complicated rules must be enforced.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    PyLogAlyzer is a Web Log Analyzer in Pure Python (a clone of Awstats). PyLogAlyzer produces a XML result and uses XSLT to generate the HTML files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo