Showing 171 open source projects for "html parser"

  • 1
    cebe/markdown

    A super fast, highly extensible markdown parser for PHP

    cebe/markdown is a super fast, highly extensible markdown parser for PHP. It is a set of PHP classes, each representing a Markdown flavor, plus a command line tool for converting Markdown files to HTML files. The implementation focuses on being fast (see benchmark) and extensible. You can add language elements by hooking directly into the parser; no (possibly error-prone) pre- or post-processing is needed to extend the language.
    Downloads: 1 This Week
  • 2
    Himalaya

    JavaScript HTML to JSON Parser

    Parse HTML into JSON. Himalaya bundles well with Browserify and Webpack. Himalaya transforms HTML into JSON; that's it. Himalaya is synchronous and does not require any complicated callbacks. Himalaya does not cut corners and returns an accurate representation of the HTML supplied. To remove whitespace, post-process the JSON; check out an example script. Himalaya can include the start and end positions of nodes in the parse output. To enable this, you can pass parse the parseDefaults...
    Downloads: 0 This Week
  • 3
    React HTML Parser

    Converts HTML strings directly into React components

    Utility for converting HTML strings into React components. Converts standard HTML elements, attributes and inline styles into their React equivalents and provides a simple way to modify and replace the content. Note that this library is not a substitute for properly sanitized HTML: it offers only the same level of protection that React does, which is not 100% protection. All HTML should be properly sanitized using a...
    Downloads: 0 This Week
  • 4
    Snowflake HTML

    SVG in Swift

    SVG in Swift. Uses the XML parser from Reindeer. The Shape object maps to SVG elements. The Style object encapsulates style information. The cool thing about CALayer is that most of its properties are animatable. Snowflake is available through CocoaPods and Carthage. It can also be installed manually: just download the Sources folders and drop them into your project.
    Downloads: 0 This Week
  • 5

    TemplateLite

    A small fast Template Engine for PHP, without a huge framework.

    Template Lite is a very fast, small HTML template engine written in PHP. The engine supports most of the Smarty2 template engine functions and filters. This template engine is no longer a Smarty replacement, but it is still similar to Smarty. The new TemplateLite3 is currently in the works and has a new parser and compiler structure along with a modified syntax. The new TemplateLite is not 100% backward compatible for templates, but the usage from PHP should be.
    Downloads: 1 This Week
  • 6
    The MangaStream Downloader is an open source application written in Java for managing and downloading manga from the sites mangastream.com and mangafox.me. It is released under the GNU GPL license and uses an open source HTML parser, TagSoup. Follow the project page on Facebook for updates: https://www.facebook.com/MangastreamDownloader
    Downloads: 0 This Week
  • 7
    Aglio

    An API Blueprint renderer with theme support that outputs static HTML

    Aglio is a renderer for API Blueprint documents (a Markdown-based API description format), producing static HTML documentation with theme support. It functions as both a command-line executable and a Node.js library, allowing integration into pipelines or live-preview workflows. The static HTML it outputs can be served by any web host. API Blueprint is a Markdown-based document format that lets you write API descriptions and...
    Downloads: 1 This Week
  • 8
    Luna Editor

    Write and edit your Lua scripts in an easy way.
    Downloads: 2 This Week
  • 9

    JSONjuicer

    JSON parser and encoder

    A Java open-source library which makes encoding and decoding Java data-structures to and from JSON text easy and intuitive.
    Downloads: 0 This Week
  • 10
    jQuery Calx

    A jQuery plugin for creating formula-based calculation forms

    jQuery Calx is an Excel calculation engine and formula parser built as a jQuery plugin. It offers a spreadsheet's ability to process and calculate formulas without bringing in a table-like spreadsheet interface. At the core of jQuery Calx are a formula parser generated using jison, which parses and processes the formulas, and a set of formulas imported from formula.js.
    Downloads: 0 This Week
  • 11
    Charm Pascal IDE

    The newest free IDE for Pascal programming language

    Charm Pascal is an IDE designed to be as useful as possible. Unlike other IDEs with similar features, it is completely free for any user, and it comes with new features that make coding in Pascal more fun, easier, and much better than before. Created by Alaa Ben Fatma.
    Downloads: 34 This Week
  • 12
    Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 9 This Week
  • 13

    go-sslscan

    go-sslscan is an orchestrator, parser and HTML reporter for sslscan

    go-sslscan is an orchestrator, parser and HTML reporter for rbsec/sslscan with a Fair Source License. Download the sslscan executable into your go-sslscan folder, edit the subnetlist.txt, run go-sslscan and open your browser at localhost:8080. Check out https://github.com/jelmerdereus/go-sslscan
    Downloads: 0 This Week
  • 14

    PHP Advanced HTML DOM Parser

    A drop-in replacement for simple html dom

    This project has moved to https://github.com/monkeysuffrage/advanced_html_dom
    Downloads: 0 This Week
  • 15
    NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
    Downloads: 4 This Week
  • 16
    Products of the project: Java HTMLParser - VietSpider Web Data Extractor - Extractor VietSpider News. Click on "Show project details" to see more features of each product.
    Downloads: 0 This Week
  • 17
    Torrtux

    A terminal-program for downloading torrents from PirateBay

    ...It also lets you get the details of your torrent (author, date, type, size, etc.), just as on the TPB site. It also retrieves subtitles from www.opensubtitles.org. It retrieves information from the source code of the TPB page and parses it with regular expressions and the html-parser library. In the config file ~/.torrtuxrc, you can choose your display, subtitle, and comment preferences, your torrent manager, and a proxy if needed. Thanks for reporting any bugs you find!
    Downloads: 2 This Week
  • 18
    PangView

    Text Viewer that displays text similar to pango

    A text viewer utility that displays text in a Pango-like markup. The utility is written in Python and uses the PyGTK toolkit. The parser reads Pango tags, which are similar to HTML. The utility was written so that formatted text can be displayed quickly. NEW! The parser has been updated to use a faster internal data structure.
    Downloads: 0 This Week
  • 19
    go_spider

    An awesome Go concurrent Crawler(spider) framework

    ...It can easily be extended into an individualized crawler, or you can use only the default crawl components. Spider gets a Request from Scheduler containing the URL to be crawled. Then Downloader downloads the result (HTML, JSON, JSONP, text) of the Request. The result is saved in Page for parsing in PageProcesser. HTML parsing is based on the goquery package. JSON parsing is based on the simple JSON package. JSONP is converted to JSON. Text form represents plain text content without a parser. The PageProcesser module only parses results: it produces results (key-value pairs) and the URLs to be crawled in the next step. ...
    Downloads: 0 This Week
  • 20
    A Java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets.
    Downloads: 0 This Week
  • 21
    A Java m3u8 playlist parser for HTTP Live Streaming. See the IETF draft http://tools.ietf.org/html/draft-pantos-http-live-streaming-02 for details. See http://m3u8parser.chilicat.net for the JavaDoc and a simple Web Start application to test the parser.
    Downloads: 0 This Week
  • 22
    FigDice

    PHP Templating System

    FigDice is a template rendering system, primarily designed for web applications and the dynamic generation of HTML documents. FigDice puts the view screens (templates) at the centre of the rendering process. FigDice defines a simple notation based on XML tags and attributes, allowing the engine, using source files very close to the target HTML, to produce the rendering, dynamically enhanced by the Web application's services and presentation layers. FigDice is independent of the...
    Downloads: 0 This Week
  • 23

    QueLang

    QueLang is a tool for questionnaire design.

    This is the first implementation of QueLang, a language I designed for questionnaire design and implementation. This software can compile your code (written in .ql text files) into a special .qlc format (a kind of database). It can then read those .qlc files to open them in a viewer and export them to PDF. It can also be used for designing exams and tests! Tested on: Linux Ubuntu 12.04, Windows 7 64-bit. QueLang can be run by double-clicking the .jar (or .exe)...
    Downloads: 0 This Week
  • 24

    Web Page Parser

    This is a simple web page parser based on Perl

    The purpose of this script is to "beautify" raw HTML code that might not be properly formatted, turning it into a properly parsed and easy-to-read HTML page. It attempts to auto-correct unclosed tags that may render correctly in a browser but are incorrectly formatted. The script is set up to take a single URL as input but can easily be modified to handle multiple URLs or local input.
    Downloads: 0 This Week
  • 25
    CppWeb - C++ Web developement framework

    Cross-platform C++ library for developing CGI Web applications

    CppWeb is a cross-platform C++ library for developing web applications with server push support. The library decodes CGI variables and cookies, supports file uploads, performs automatic cookie detection, provides URL and HTML entity encode/decode functions, supports server push (long polling via Ajax), and includes a built-in HTML parser, an SQLite database wrapper, etc. CppWeb compiles on Windows, Linux and Mac OS X (tested with GNU C++, MinGW, MS Visual C++ and Borland C++ compilers) and can run with almost any web server (Apache, IIS, Boa, etc.). It can be used in embedded systems (tested with FriendlyARM Mini2440 and Raspberry Pi).
    Downloads: 0 This Week