45 projects for "html parse" with 1 filter applied:

  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 1
    Symfony DomCrawler

    Symfony DomCrawler

    Eases DOM navigation for HTML and XML documents

    Symfony DomCrawler is a PHP component that provides powerful tools for navigating and extracting data from HTML and XML documents. It allows developers to parse, filter, and manipulate web pages using CSS selectors and XPath expressions. DomCrawler is widely used for web scraping, testing, and processing structured content, and integrates well with other Symfony components like BrowserKit.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    commonmark-java

    commonmark-java

    Java library for parsing and rendering CommonMark (Markdown)

    Java library for parsing and rendering Markdown text according to the CommonMark specification (and some extensions). Provides classes for parsing input to an abstract syntax tree of nodes (AST), visiting and manipulating nodes, and rendering to HTML. It started out as a port of commonmark.js, but has since evolved into a full library with a nice API.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    GoNB

    GoNB

    GoNB, a Go Notebook Kernel for Jupyter

    ...GoNB leverages that compilation speed to implement a full-featured (at least it's getting there) Jupyter notebook kernel. As a side benefit it works with packages that use CGO — although it won't parse C code in the cells, so it can't be used as a C kernel. It already includes many goodies: cache between cell of results, contextual help and auto-complete (with gopls), compilation error context (by mousing over), bash command execution, images, html, etc. See the tutorial. It's been heavily used by the author (in developing GoMLX, a machine learning framework for Go), but should still be seen as experimental — if we hear success stories from others, we can change this.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    wombat

    wombat

    Lightweight Ruby DSL for scraping structured data from web pages

    Wombat is a lightweight web crawling and scraping library written in Ruby that focuses on extracting structured data from web pages using a concise domain-specific language (DSL). It is designed to simplify the process of defining how information should be collected from HTML documents without requiring large amounts of scraping boilerplate code. Developers can declare the data fields they want and specify selectors or rules for retrieving them, allowing Wombat to parse and return structured results. The DSL approach helps make scraping definitions more readable and maintainable, especially when dealing with multiple fields or nested data structures. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Atera - an All-in-one platform for IT management Icon
    Atera - an All-in-one platform for IT management

    Ideal for IT departments and MSPs (managed service providers)

    Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!
    Try Atera now
  • 5
    XRAY

    XRAY

    XRay for recon, mapping and OSINT gathering from public networks

    XRAY is a modular security toolset that helps developers and security professionals analyze, fuzz, and test web applications, protocols, and network services for vulnerabilities. It provides a framework for writing and executing inspection modules that can parse structured data (JSON, XML, HTML), traverse graphs of endpoints, and perform intelligent probing guided by discovered surface area. XRay is typically used as a reconnaissance and vulnerability discovery engine in red-team or app-security workflows: it leverages extensible plugins to adapt to different protocols, inject payloads, and detect common bug classes such as injection flaws, misconfigurations, and unsafe endpoints. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    AngleSharp

    AngleSharp

    The ultimate angle brackets parser library parsing HTML5, MathML, SVG

    AngleSharp follows the W3C specifications and gives you the same results as state of the art browsers. Besides the official API AngleSharp adds some useful extension methods on top. This makes working with the DOM convenient. AngleSharp integrates everything you need to explore and mutate the DOM tree. Node retrieval is straight forward by using powerful CSS query selectors. The CSS queries in AngleSharp are super fast and very simple to use. AngleSharp respects the relationship of HTML with...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    Python-Spider

    Python-Spider

    Python3 web crawler practice

    Python-Spider is a repository intended to teach or provide examples for writing web spiders / crawlers in Python — part of a broader learning and resource collection by its author. The code and documentation are oriented toward beginners or intermediate learners who want to learn how to fetch, parse, and extract data from websites programmatically. As part of the author’s public learning-path repositories, python-spider likely includes examples of HTTP requests, HTML parsing, maybe concurrency or scheduling to crawl multiple pages, and techniques to handle common web-scraping issues. For people wanting to get hands-on with building scrapers, collecting data, or learning how to navigate web programming in Python, this repository acts as a didactic reference or starting point. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    metaf2xml

    metaf2xml

    Parse and decode METAR, TAF, SYNOP, BUOY, AMDAR and write data as XML

    ...It also provides XSLT style sheets to convert the XML to plain language (text, HTML), or XML with different schemas. A web-based (CGI) user interface can be used to download and display up-to-date weather data from NOAA, Ogimet, and other sources.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    readpe

    readpe

    The PE file analysis toolkit

    readpe (formerly known as pev) is a multiplatform toolkit to work with PE (Portable Executable) binaries. Its main goal is to provide feature-rich tools for properly analyze binaries with a strong focus on suspicious ones.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    JDynamiTe, Dynamic Template in Java

    JDynamiTe, Dynamic Template in Java

    Dynamically generate documents from templates

    JDynamiTe is a tool which allows you to dynamically create documents in any format from "template" documents. And very few lines of code (or no line at all!) are needed to do that. Some typical usage domains of JDynamiTe are: - dynamic Web pages creation, - text document generation, - source code generation... In fact, it can be useful in any case where pre-defined documents (templates) have to be dynamically populated with data. The main benefit of JDynamiTe is to allow a true...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Forms

    Forms

    An easy way to create, parse and validate forms in node.js

    Forms is a Node.js library designed to simplify the creation, validation, and handling of web forms on the server side, providing a structured approach to managing user input. It allows developers to define form schemas using a declarative API, specifying fields, validation rules, and data transformations in a consistent format. The library supports a wide range of input types, including text, numbers, dates, and custom fields, making it adaptable to different application needs. It includes...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    htmlcxx is a simple non-validating html parser library for C++. It allows to fully dump the original html document, character by character, from the parse tree. It also has an intuitive tree traversal API.
    Downloads: 39 This Week
    Last Update:
    See Project
  • 13
    Gallop

    Gallop

    A framework for build smooth asynchronous iOS APP

    ...It encapsulates CoreText's rich text functions and commonly used image processing capabilities. just need use LWTextStorage object instead of UILabel object and use LWImageStorage object instead of UIImageView object,Gallop will make sure your app scroll smoothly. You can also use Gallop to parse HTML pages and customize machining to parse HTML pages into iOS native pages. Use Gallop Building complex rich text interface application, can get a great experience. You can easy to insert local images,web images or UIView object in the text. Easily add click and long press events to the text. Easily draw text frame bouding, drawing the hollow words, sets the text vertical Alignment property, etc Easy to parse the expression in the text, like http(s) link, @ user, # theme #, phone number. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Open Source software (PHP) managing iCal (rfc2445) files, for non-calendar systems like CMS, project mgmt systems etc. able to manage calendar information like agendas, tasks, reports, totos, journaling data and communicate with calendar software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    PAS2PHP

    PAS2PHP

    Transpiles/Converts Pascal to Working PHP Source Code

    ...As a guide to what PAS2PHP can achieve... my web site is written in Pascal, and then converted to PHP. (The code is in the source package). PAS2PHP uses the Pascal Parse Tree Classes (PasTree) which comes with FreePascal, and therefore the parsing limits are defined by this library.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    HyperSQL is like a doxygen plus javadoc for SQL, hypermapping SQL views, packages, procedures, and functions to HTML source code listings and showing all code locations where these are used.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17

    Z Notation E-Mail Mark-up Tools

    Tools to convert Z mark-up to HTML or text.

    A small library and two command-line tools to parse and convert Z notation from the "e-mail" mark-up into HTML code, or into UTF-8 text with box-drawing graphics, or into the Z Standard text format. See the project's Wiki Home Page for details --- the "Wiki" button in the bar above, or the following link:
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    TinyHTML++

    TinyHTML++

    Allows to parse HTML in C++

    This library allows to parse HTML in C++. Its main focus is put on stability and simplicity rather than pure performance. TinyHTML++ is even able to parse and correct wrong HTML documents. It is also possible to get the page in text format as a user of a webbrowser reads it. TinyHTML++ also allows to convert it back to a HTML string. Please let me know if you find any bugs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    Spondulas

    Spondulas is browser emulator designed to retrieve web pages for hunti

    Spondulas is browser emulator and parser designed to retrieve web pages for hunting malware. It supports generation of browser user agents, GET/POST requests, and SOCKS5 proxy. It can be used to parse HTML files sent via e-mail. Monitor mode allows a website to be monitored at intervals to discover changes in DNS or content over time. Autolog mode creates an investigation file that documents redirection chains. The retrieved web pages are parsed for links and reported to an output file. More information is available on the wiki.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    WSciParser (Web Science Parser) is a handy small library created in C#.NET to parse data from web science databases (ISI Web, NCBI, PubMed...)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    ShaniXmlParser is a small and fast Xml/Html DOM/SAX non validating parser written in java. It can parse invalid xml files. It uses the org.w3c.dom interfaces and the jaxp interfaces. It works also on mono/.net thanks to ikvm.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Wow Log Parser is a combat log parser for the game World of Warcraft. The purpose of the program is to parse the files generated with the /combatlog command. The source code can be found on: http://www.gurre.eu/wowlogparser/forum
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    xBB-code is the PHP library to parse and edit text formatted with BBCode.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    quarReminder is a simple PHP script to parse the Amavisd-new Spam/Virus quarantine database, and provide an HTML Message to the user reminding them of these messages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    TeXConverter is a converter to parse LaTeX input and create representations of the document in other formats (currently HTML, XDocs(Maven), a specific Wiki-format). Internally a Java object model is used to represent the document, to ease extensibility.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo