Showing 1150 open source projects for "xpath"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    CssSelector Component

    CssSelector Component

    Converts CSS selectors to XPath expressions

    XPath expressions are incredibly flexible, so there is almost always an XPath expression that will find the element you need. Unfortunately, they can also become very complicated, and the learning curve is steep. Even common operations (such as finding an element with a particular class) can require long and unwieldy expressions. CSS selectors are less powerful than XPath, but far easier to write, read and understand.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Kazumi

    Kazumi

    Flutter-based, rule-driven anime collection

    Kazumi is a cross-platform “anime (番剧)” fetching and streaming application built with Flutter. It allows users to define custom scraping rules using XPath-style selectors (up to five lines) to collect anime metadata and streaming sources. The app supports streaming with real-time super resolution (via Anime4K), danmaku (on-screen comments), multiple video sources, offline caching, and even collaborative watching modes. It targets many platforms (Android, iOS, Windows, macOS, Linux) and supports custom rule importing and sharing. ...
    Downloads: 279 This Week
    Last Update:
    See Project
  • 3
    PDFPatcher

    PDFPatcher

    A versatile toolkit for PDF manipulation

    PDFPatcher (aka “PDF补丁丁”) is a versatile toolkit for PDF manipulation—editing document metadata, bookmarks, page layout, content restrictions, rotation, compression, merging/splitting, image extraction, and more, all within an intuitive interface. Merge/split PDFs or images, preserve or add bookmarks, and set page dimensions. Batch style/color/target changes, regex/XPath search/replace, mid‑page positioning. Modify PDF metadata, page numbers, links, initial view mode, and remove open actions.
    Downloads: 57 This Week
    Last Update:
    See Project
  • 4
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications. It can be used for data mining, monitoring...
    Downloads: 16 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    Symfony DomCrawler

    Symfony DomCrawler

    Eases DOM navigation for HTML and XML documents

    Symfony DomCrawler is a PHP component that provides powerful tools for navigating and extracting data from HTML and XML documents. It allows developers to parse, filter, and manipulate web pages using CSS selectors and XPath expressions. DomCrawler is widely used for web scraping, testing, and processing structured content, and integrates well with other Symfony components like BrowserKit.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    EventCatalog

    EventCatalog

    Open source tool for documenting and exploring event-driven systems

    EventCatalog is an open source documentation tool designed for event-driven architectures. It helps teams organize, document, and understand complex systems by mapping events, services, domains, and flows in one place. Instead of scattered knowledge, it creates a central, searchable catalog that improves visibility and collaboration. Documentation is generated and maintained automatically, making it easier to keep systems up to date as they evolve. With built-in discoverability features,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    lxml

    lxml

    The lxml XML toolkit for Python

    A Python library for efficient XML and HTML processing, known for speed and compatibility. The lxml XML toolkit is a Pythonic binding for the C libraries libxml2 and libxslt. It is unique in that it combines the speed and XML feature completeness of these libraries with the simplicity of a native Python API, mostly compatible but superior to the well-known ElementTree API. The latest release works with all CPython versions from 3.6 to 3.12. See the introduction for more information about the...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 8
    Google Style Guides

    Google Style Guides

    Style guides for Google-originated open source projects

    Google Styleguide is a comprehensive collection of coding style guides created and maintained by Google to ensure consistency, readability, and maintainability across its vast array of software projects. These guides define best practices and conventions for writing code in multiple programming languages, from C++ and Python to JavaScript, Go, and Swift. By adhering to these standards, developers can more easily collaborate, review code, and maintain high-quality software across teams and...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    EzXML.jl

    EzXML.jl

    XML/HTML handling tools for primates

    EzXML.jl is a package to handle XML/HTML documents for primates. This package depends on libxml2, which will be automatically installed as an artifact via XML2_jll.jl if you use Julia 1.3 or later. Currently, Windows, Linux, macOS, and FreeBSD are now supported.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    Json.NET

    Json.NET

    A popular high-performance JSON framework for .NET

    Serialize and deserialize any .NET object with Json.NET's powerful JSON serializer. Create, parse, query and modify JSON using Json.NET's JObject, JArray and JValue objects. Query JSON with an XPath-like syntax. 50% faster than DataContractJsonSerializer, and 250% faster than JavaScriptSerializer. Json.NET makes the simple easy and the complex possible. If you need it, Json.NET supports converting between XML and JSON. Json.NET is open source software and is completely free for commercial use. Json.NET supports Windows, MacOS, Linux, Mono, and Xamarin. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 11
    SafeLine

    SafeLine

    Serve as a reverse proxy to protect your web services from attacks

    ...A web application firewall helps protect web apps by filtering and monitoring HTTP traffic between a web application and the Internet. It typically protects web apps from attacks such as SQL injection, XSS, code injection, os command injection, CRLF injection, LDAP injection, XPath injection, RCE, XXE, SSRF, path traversal, backdoor, brute force, HTTP-flood, bot abuse, among others. By deploying a WAF in front of a web application, a shield is placed between the web application and the Internet. While a proxy server protects a client machine’s identity by using an intermediary, a WAF is a type of reverse-proxy, protecting the server from exposure by having clients pass through the WAF before reaching the server. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12
    Crawl4AI

    Crawl4AI

    Open-source LLM Friendly Web Crawler & Scraper

    Crawl4AI is a high-performance, AI‑ready web crawler tailored for LLM data ingestion and RAG pipelines. It supports adaptive crawling heuristics (stopping when enough info is gathered), structured markdown output, and high-speed parallel execution. Designed to operate at scale with optional Docker deployment and framework integrations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Hurl

    Hurl

    Hurl, run and test HTTP requests with plain text

    ...Hurl makes it easy to work with HTML content, REST / SOAP / GraphQL APIs, or any other XML / JSON-based APIs. Hurl can run HTTP requests but can also be used to test HTTP responses. Different types of queries and predicates are supported, from XPath and JSONPath on body response, to assert on status code and response headers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Karate

    Karate

    Test automation made simple

    Karate is the only open-source tool to combine API test-automation, mocks, performance-testing and even UI automation into a single, unified framework. The BDD syntax popularized by Cucumber is language-neutral, and easy for even non-programmers. Assertions and HTML reports are built-in, and you can run tests in parallel for speed. There’s also a cross-platform stand-alone executable for teams not comfortable with Java. You don’t have to compile code. Just write tests in a simple, readable...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 15
    Taiko

    Taiko

    A node.js library for testing modern web applications

    ...Taiko works with Chromium-based browsers such as Chrome, Microsoft Edge, and Opera, and can also interact with Firefox, enabling developers to test modern web interfaces across multiple environments. The library includes intelligent element selection capabilities that automatically locate elements on the page, reducing the need for fragile CSS selectors or XPath expressions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Nokogiri

    Nokogiri

    Tool to work with XML and HTML from Ruby

    Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby. It provides a sensible, easy-to-understand API for reading, writing, modifying, and querying documents. It is fast and standards-compliant by relying on native parsers like libxml2 (C) and xerces (Java). Be secure-by-default by treating all documents as untrusted by default. Be a thin-as-reasonable layer on top of the underlying parsers, and don't attempt to fix behavioral differences between the parsers. "Native...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    kimuraframework

    kimuraframework

    AI-first Ruby framework for building fast, flexible web scraping spide

    Kimurai is an open source web scraping framework written in Ruby that simplifies the process of building automated data extraction tools. It provides a clean domain-specific language that allows developers to define scraping logic and data schemas with minimal boilerplate code. Kimurai can use AI-assisted extraction to identify where data resides in HTML pages, automatically generating selectors that are cached for future use so subsequent scraping runs operate with pure Ruby performance....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    FreshRSS

    FreshRSS

    A free, self-hostable news aggregator

    FreshRSS is a self-hosted RSS and Atom feed aggregator. It is lightweight, easy to work with, powerful, and customizable. Follow websites, podcasts, and video channels in a single place. Read your articles directly in FreshRSS. Search and save queries for quick access. Generate feeds by scraping external websites. Generate new feeds based on your filters. Import and export your feeds with OPML. Stay connected to your feeds in real time. Adapt to your needs thanks to a lot of options. Follow...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    PHPScraper

    PHPScraper

    A universal web-util for PHP

    PHPScraper is a universal web-scraping util for PHP, built with simplicity in mind. The goal is to make xPath Selectors optional and avoid the commonly needed boilerplate code. Just create an instance of PHPScraper, go to a website, and start collecting data. All scraping functionality can be accessed either as a function call or a property call. For example, the title can be accessed in two ways. Many common use cases are covered already.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    GnuCash

    GnuCash

    Personal and small-business finance manager

    GnuCash is a personal and small-business finance manager with a check-book like register GUI to enter and track bank accounts, stocks, income and expenses. GnuCash is designed to be simple and easy to use but still based on formal accounting principles.
    Leader badge
    Downloads: 4,724 This Week
    Last Update:
    See Project
  • 21
    Vidiot

    Vidiot

    Non-linear video editor targeted for home video editing.

    Vidiot is a non-linear video editor targeted for home video editing. It supports compositing (scaling/rotating/positioning), cropping, changing speed, adding transitions and titles, trimming, key frames, etc.
    Leader badge
    Downloads: 519 This Week
    Last Update:
    See Project
  • 22
    GanttProject

    GanttProject

    Free project scheduling and management tool

    GanttProject is a project scheduling application written in Java and featuring Gantt chart, resource management, calendaring, import/export (MS Project, HTML, PDF, spreadsheets). Learn more on http://ganttproject.biz
    Leader badge
    Downloads: 313 This Week
    Last Update:
    See Project
  • 23

    xsd2pgschema

    Relational database replication tool based on XML Schema

    ...File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema. Obtained PostgreSQL database can be optimized at user's discretion. Moreover, differential update, and XPath query evaluation over PostgreSQL are possible. PgSchema server, serialized relational data model server, can be used to speed up the analysis of complex XML Schema. Large XML file can be split through xmlsplitter, a flexible XML splitter based on XPath and StAX.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 24
    XML Copy Editor
    XML Copy Editor is a fast, free, validating XML editor.
    Leader badge
    Downloads: 911 This Week
    Last Update:
    See Project
  • 25
    malware-samples

    malware-samples

    A collection of malware samples and relevant dissection information

    This repo is a public collection of malware samples and related dissection/analysis information, maintained by InQuest. It gathers various kinds of malicious artifacts, executables, scripts, macros, obfuscated documents, etc., with metadata (e.g., VirusTotal reports), file carriers, and sample hashes. It’s intended for malware analysts/researchers to help study how malware works, how they are delivered, and how it evolves.
    Downloads: 113 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB