xpath free download - SourceForge

Showing 21 open source projects for "xpath"

Filters: Internet, Python

1

Scrapy

A fast, high-level web crawling and web scraping framework

Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from them. Portable and written in Python, it runs on Windows, Linux, macOS and BSD. Scrapy is powerful, fast, simple, and easily extensible. Write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications. It can be used for data mining, monitoring...

Downloads: 23 This Week

Last Update: 2026-05-19
See Project
2

gain

Asyncio-based Python framework for building fast web crawling spiders

...Developers define crawlers using components such as spiders, parsers, and items, allowing them to organize crawling logic and data extraction rules clearly. Gain supports CSS selectors and XPath expressions for parsing page content and extracting specific elements. Gain also allows developers to configure headers, concurrency levels, and proxy settings to control how crawlers interact with target websites. Because it uses asynchronous programming, Gain can handle multiple requests efficiently while minimizing blocking operations.

Downloads: 1 This Week

Last Update: 20 minutes ago
See Project
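As a rough illustration of the XPath-style extraction mentioned in the Gain entry above, here is a minimal sketch that uses only Python's standard library. It does not use Gain's own API. The page fragment, the `extract_posts` helper, and the field names are all hypothetical, and `xml.etree.ElementTree` supports only a limited subset of XPath on well-formed markup.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed page fragment used only for illustration.
PAGE = """
<html>
  <body>
    <div class="post">
      <h2>First post</h2>
      <a href="/posts/1">Read more</a>
    </div>
    <div class="post">
      <h2>Second post</h2>
      <a href="/posts/2">Read more</a>
    </div>
    <div class="ad">
      <a href="/sponsor">Sponsored</a>
    </div>
  </body>
</html>
"""


def extract_posts(markup):
    """Return one {'title', 'url'} dict per <div class="post"> element."""
    root = ET.fromstring(markup.strip())
    posts = []
    # Attribute predicates such as [@class='post'] are part of the
    # XPath subset that ElementTree understands.
    for div in root.findall(".//div[@class='post']"):
        title = div.findtext("h2")
        link = div.find("a")
        posts.append({"title": title, "url": link.get("href")})
    return posts


if __name__ == "__main__":
    for post in extract_posts(PAGE):
        print(post["title"], post["url"])
```

Real-world HTML is often not well-formed XML, so a dedicated HTML parser with full XPath support would normally replace `ElementTree` here.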
3

Crawl4AI

Open-source LLM Friendly Web Crawler & Scraper

Crawl4AI is a high-performance, AI‑ready web crawler tailored for LLM data ingestion and RAG pipelines. It supports adaptive crawling heuristics (stopping when enough info is gathered), structured markdown output, and high-speed parallel execution. Designed to operate at scale with optional Docker deployment and framework integrations.

Downloads: 1 This Week

Last Update: 3 days ago
See Project
4

dude uncomplicated data extraction

dude uncomplicated data extraction: A simple framework

Dude is a very simple framework for writing web scrapers using Python decorators. The design, inspired by Flask, aims to make it easy to build a web scraper in just a few lines of code. Dude has an easy-to-learn syntax. Dude is currently in Pre-Alpha. Please expect breaking changes. You can run your scraper from a terminal by supplying URLs, the output filename of your choice, and the paths to your Python scripts to the dude scrape command.

Downloads: 0 This Week

Last Update: 2024-03-02
See Project
5

RPA for Python

Python package for doing RPA

Python package for doing RPA. RPA for Python's simple and powerful API makes robotic process automation fun! You can use it to quickly automate away repetitive time-consuming tasks on websites, desktop applications, or the command line. See sample Python script, the RPA Challenge solution, and RedMart groceries example. To send a Telegram app notification, simply look up @rpapybot to allow receiving messages. To automate Chrome browser invisibly, use headless mode. To run 10X faster instead...

Downloads: 0 This Week

Last Update: 2023-07-07
See Project
6

mlscraper

ML-based HTML scraper that learns extraction rules from examples

mlscraper is a Python library designed to automatically extract structured data from HTML pages without requiring developers to manually write CSS selectors or XPath rules. Instead of defining extraction logic by hand, users provide a few examples of the data they want to retrieve from a webpage. It analyzes those examples within the HTML document and determines patterns or rules that can be used to extract the same type of information from similar pages. Once trained, the generated scraper can process new pages and return the extracted data in structured formats such as dictionaries or lists. ...

Downloads: 3 This Week

Last Update: 19 minutes ago
See Project
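The idea in the mlscraper entry above, deriving an extraction rule from an example value and reusing it on similar pages, can be sketched as a toy with the standard library. This is not mlscraper's API or algorithm. The `learn_path` and `apply_path` helpers and the sample pages are hypothetical, and the approach assumes well-formed markup with stable class names.

```python
import xml.etree.ElementTree as ET


def _step(el):
    """Describe one element as a path step, using its class if present."""
    cls = el.get("class")
    return f"{el.tag}[@class='{cls}']" if cls else el.tag


def learn_path(root, sample_text, prefix="."):
    """Depth-first search for the element whose text equals sample_text.

    Returns an ElementTree path relative to root, or None if not found.
    """
    for child in root:
        path = f"{prefix}/{_step(child)}"
        if (child.text or "").strip() == sample_text:
            return path
        found = learn_path(child, sample_text, path)
        if found:
            return found
    return None


def apply_path(root, path):
    """Return the stripped text of every element matching the learned path."""
    return [(el.text or "").strip() for el in root.findall(path)]


# Hypothetical training page with one known value, and a new similar page.
TRAIN = (
    "<html><body><div class='item'>"
    "<span class='name'>Lamp</span><span class='price'>19.99</span>"
    "</div></body></html>"
)
NEW = (
    "<html><body>"
    "<div class='item'><span class='name'>Desk</span>"
    "<span class='price'>120.00</span></div>"
    "<div class='item'><span class='name'>Chair</span>"
    "<span class='price'>45.50</span></div>"
    "</body></html>"
)

rule = learn_path(ET.fromstring(TRAIN), "19.99")
prices = apply_path(ET.fromstring(NEW), rule)
```

A single example yields a brittle rule; learning from several examples and pages is what makes the generalization meaningful.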
7

Requests-HTML

Pythonic HTML Parsing for Humans

...When using this library you automatically get full JavaScript support (using Chromium, thanks to puppeteer); CSS selectors (a.k.a. jQuery-style, thanks to PyQuery); XPath selectors, for the faint of heart; a mocked user-agent (like a real web browser); automatic following of redirects; and connection pooling and cookie persistence. The Requests experience you know and love, with magical parsing abilities and async support. The rest of the code operates the same way as the synchronous version, except that results is a list containing multiple response objects; the same basic process can be applied as above to extract the data you want.

Downloads: 0 This Week

Last Update: 2023-04-10
See Project
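The Requests-HTML entry above notes that the asynchronous version returns a list of response objects that can each be processed like a single synchronous response. A minimal sketch of that pattern follows, using only `asyncio` from the standard library rather than Requests-HTML itself. The `fetch` coroutine is a hypothetical stand-in for a real HTTP request, and the returned dictionaries are invented for illustration.

```python
import asyncio
import re


async def fetch(url):
    """Stand-in for a real asynchronous HTTP request (hypothetical helper)."""
    await asyncio.sleep(0)  # yield control, as real network I/O would
    return {"url": url, "html": f"<title>Page for {url}</title>"}


async def fetch_all(urls):
    # gather() runs the coroutines concurrently and returns their results
    # as a list in the same order as the inputs.
    return await asyncio.gather(*(fetch(u) for u in urls))


def titles(urls):
    """Fetch every URL, then extract each page title from the result list."""
    results = asyncio.run(fetch_all(urls))
    # Each item is handled exactly as a single response would be.
    return [
        re.search(r"<title>(.*?)</title>", r["html"]).group(1)
        for r in results
    ]


if __name__ == "__main__":
    print(titles(["https://example.org/a", "https://example.org/b"]))
```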
8

pgal

Online gallery factory

Given folders of images, generate a static web gallery. Note: This project has been moved to: https://github.com/viewplatgh/pgal

Downloads: 0 This Week

Last Update: 2014-05-08
See Project
9

dynamide

dynamide is a dynamic web application framework for handling the presentation and business layers in a traditional web app. See http://dynamide.com

Downloads: 0 This Week

Last Update: 2013-10-06
See Project
10

webgeno

webgeno is a light-weight Content Management System (CMS) for generating websites offline. It is primarily designed for personal websites where the user doesn't have access to the server except to upload files.

Downloads: 0 This Week

Last Update: 2013-08-11
See Project
11

Remus music server

Serves music files, playlists, HTML files, etc. using HTTP/WebDAV. Music files are categorized using metainformation (e.g. ID3 tags), which is stored in a MySQL database for sorting and searching.

Downloads: 0 This Week

Last Update: 2014-06-09
See Project
12

pyFire

Process runtime detection with XML/image statistics output, like xfire.com

Downloads: 0 This Week

Last Update: 2014-06-09
See Project
13

Wolfpack

Wolfpack is an open source server implementing the protocol used by the massively multiplayer online game Ultima Online. It aims for full support of the Ultima Online protocol and tries to mimic the gameplay of the original.

Downloads: 0 This Week

Last Update: 2014-06-09
See Project
14

InPlace CMS

CMS based on XSLT templates and usability tricks. Edit the page content directly in HTML, keeping the essential data in only one place.

Downloads: 0 This Week

Last Update: 2013-03-21
See Project
15

PennAve

PennAve is dynamic photo gallery software written in Python and designed for use alongside F-Spot. It makes heavy use of XML and XSLT to make it easy to modify the presentation and to share information with other users, web sites, and programs.

Downloads: 0 This Week

Last Update: 2013-04-17
See Project
16

pyFire

Process runtime detection with XML/image statistics output, like xfire.com

Downloads: 0 This Week

Last Update: 2014-04-28
See Project
17

Syncato

Syncato is a Weblog Web Services system built on top of Berkeley DB XML, Webware and Python. It has a number of unique features: XPath access to all content via URLs, XSLT presentation, and an extremely flexible database structure.

Downloads: 0 This Week

Last Update: 2015-05-06
See Project
18

Plim

Python Web development framework based on an XSLT engine.

Downloads: 0 This Week

Last Update: 2014-05-23
See Project
19

Radix: XML web-application framework

Radix is a RAD framework for creating native XML web applications. Complete web applications can be created without programming knowledge using XML, XSLT, XPath and related technologies. Radix can be extended using Java, JavaScript, Python, and Tcl.

Downloads: 0 This Week

Last Update: 2013-02-27
See Project
20

XMDNS

XMDNS is an extensible DNS management scheme that uses XML to store data. It features easy manipulation of views (or split horizon DNS). There is also support for hand-crafting records for situations where complicated rules must be enforced.

Downloads: 0 This Week

Last Update: 2015-04-09
See Project
21

PyLogAlyzer another web log analyzer

PyLogAlyzer is a web log analyzer in pure Python (a clone of AWStats). PyLogAlyzer produces an XML result and uses XSLT to generate the HTML files.

Downloads: 0 This Week

Last Update: 2013-03-08
See Project