Showing 47 open source projects for "processing"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Automate contact and company data extraction Icon
    Automate contact and company data extraction

    Build lead generation pipelines that pull emails, phone numbers, and company details from directories, maps, social platforms. Full API access.

    Generate leads at scale without building or maintaining scrapers. Use 10,000+ ready-made tools that handle authentication, pagination, and anti-bot protection. Pull data from business directories, social profiles, and public sources, then export to your CRM or database via API. Schedule recurring extractions, enrich existing datasets, and integrate with your workflows.
    Explore Apify Store
  • 1
    html-loader

    html-loader

    HTML Loader

    Exports HTML as a string. HTML is minimized when the compiler demands. The true value enables the processing of all default elements and attributes, the false value disables the processing of all attributes. Allows you to specify which tags and attributes to process, filter them, filter urls and process sources starting with /. Allows to setup which tags and attributes to process and how, as well as the ability to filter some of them.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    geckodriver

    geckodriver

    WebDriver for Firefox

    ...Generally speaking, Selenium 3 enabled geckodriver as the default WebDriver implementation for Firefox. With the release of Firefox 47, FirefoxDriver had to be discontinued for its lack of support for the new multi-processing architecture in Gecko. Since geckodriver is a separate HTTP server that is a complete remote end implementation of WebDriver, it is possible to avoid using the Selenium remote server if you have no requirements to distribute processes across a matrix of systems.
    Downloads: 99 This Week
    Last Update:
    See Project
  • 3
    Tesla

    Tesla

    The flexible HTTP client library for Elixir

    ...Tesla supports multiple HTTP adapter that do the actual HTTP request processing. Each piece of the stream will be encoded as JSON and sent as a new line (conforming to JSON stream format). You can set the adapter to Tesla.Mock in tests.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Lexbor

    Lexbor

    Lexbor is development of an open source HTML Renderer library

    Lexbor is the development of a web browser engine available as a software library; it ships with a free license and has no extra dependencies. For us, speed is an absolute must-have. In our development process, we focus on fastest parsing techniques for HTML, CSS, and fonts, fastest data processing methods, and fastest ways to serve content to end users. Whether you are building a backend that handles millions of HTML documents or a UI-heavy user app, your software’s response rate always matters to users and developers alike. Lexbor’s code is optimized for ease of access in end-user applications and across programming languages. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Deliver trusted data with dbt Icon
    Deliver trusted data with dbt

    dbt Labs empowers data teams to build reliable, governed data pipelines—accelerating analytics and AI initiatives with speed and confidence.

    Data teams use dbt to codify business logic and make it accessible to the entire organization—for use in reporting, ML modeling, and operational workflows.
    Learn More
  • 5
    SingleFile

    SingleFile

    Web Extension for Firefox/Chrome/MS Edge and CLI tool

    ...Wait until the page is fully loaded. Click on the SingleFile button in the extension toolbar to save the page. You can click again on the button to cancel the action when processing a page. Open the context menu by right-clicking the SingleFile button in the extension toolbar or on the webpage. It allows you to save the current tab, the selected content, the selected frame. With auto-save active, pages are automatically saved every time after being loaded (or before being unloaded if not). Right-click on the SingleFile button and select "Manage extension" (Firefox) / "Options" (Chrome) to open the options page.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    mp-html

    mp-html

    Small program rich text component, supports rendering and editing html

    A powerful applet-rich text component. Small program rich text component supports rendering and editing HTML and supports use on WeChat, QQ, Baidu, Alipay, Toutiao, and uni-app platforms. Displaying dynamic HTML rich text is a necessary requirement for many applications. The applet platform does not support dom operations, making this a problem. The built-in rich-text component supports few tags and blocks all events, making it difficult for practical application. Therefore, there is such a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Emogrifier

    Emogrifier

    Converts CSS styles into inline style attributes in your HTML code

    Emogrifier converts CSS styles into inline style attributes in your HTML code. This ensures proper display on email and mobile device readers that lack stylesheet support. This utility was developed as part of Intervals to deal with the problems posed by certain email clients (namely Outlook 2007 and GoogleMail) when it comes to the way they handle styling contained in HTML emails. As many web developers and designers already know, certain email clients are notorious for their lack of CSS...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    CSSBox

    CSSBox

    Pure Java HTML / CSS rendering engine

    CSSBox is an (X)HTML/CSS rendering engine written in pure Java. Its primary purpose is to provide a complete information about the rendered page suitable for further processing. However, it also allows displaying the rendered document.
    Downloads: 13 This Week
    Last Update:
    See Project
  • Fully managed relational database service for MySQL, PostgreSQL, and SQL Server Icon
    Fully managed relational database service for MySQL, PostgreSQL, and SQL Server

    Focus on your application, and leave the database to us

    Cloud SQL manages your databases so you don't have to, so your business can run without disruption. It automates all your backups, replication, patches, encryption, and storage capacity increases to give your applications the reliability, scalability, and security they need.
    Try for free
  • 10
    ModularAdmin

    ModularAdmin

    Free Dashboard Theme Built On Bootstrap 4 | HTML Version

    ...Main variables are defined in the src/_variables.scss folder. For making life easier we broke down styles into components, and on build we're just merging all .scss files together and processing it to the dist/css/app.css file. There are also different theme variations located in src/_themes/ folder, where you can change the main variables to get different themes. There are a few predefined themes built-in. You can add new themes by adding a new file in src/_themes/ folder. We separate the application's scripts across its components. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    unfluff

    unfluff

    Automatically extract body content (and other cool stuff) from HTML

    unfluff is a Node.js library designed to automatically extract the main content from an HTML document — stripping away navigation bars, ads, footers and other boilerplate to leave you with the “body content”, metadata (title, author, date) and other useful fields. It’s a tool very much aimed at content-analysis, web scraping, building datasets, or repurposing article text for downstream processing (like machine-learning or summarization). The API is simple: you feed in raw HTML and it returns a structured object with the extracted text and other fields. It supports caching internal representations to speed up repeated extractions. While its language support is best for English, it is still widely used in web-content-processing pipelines. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    React HTML Parser

    React HTML Parser

    Converts HTML strings directly into React components

    Utility for converting HTML strings into React components. Converts standard HTML elements, attributes and inline styles into their React equivalents and provides a simple way to modify and replace the content. It is important to understand that this library should not be used as a direct replacement for using properly sanitized HTML and that it only provides the same level of protection that React does which does not provide 100% protection. All HTML should be properly sanitized using a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Gallop

    Gallop

    A framework for build smooth asynchronous iOS APP

    ...Easily draw text frame bouding, drawing the hollow words, sets the text vertical Alignment property, etc Easy to parse the expression in the text, like http(s) link, @ user, # theme #, phone number. Fast to sets the image corner radius property and blur processing, can be processed directly after the image.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    dpanalyzer

    postprocessing tool for Project Gutenberg Distributed Proofreaders

    ...Planned future features: generation of normalized dp output by rejoining split paragraphs and moving around footnotes, renumbering of pages; conversion to basic LaTeX and basic HTML markup for further processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    pup

    pup

    Parsing HTML at the command line

    pup is a command line tool for processing HTML. It reads from stdin, prints to stdout, and allows the user to filter parts of the page using CSS selectors. Inspired by jq, pup aims to be a fast and flexible way of exploring HTML from the terminal. If you have Go installed on your computer just run go get. If you're on OS X, use Homebrew to install (no Go required). By default pup will fill in missing tags and properly indent the page.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 17
    A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    jStyleParser

    Java CSS parser and DOM style assignment library

    jStyleParser is a CSS parser written in Java. It has its own application interface that is designed to allow an efficient CSS processing in Java and mapping the values to the Java data types. It is also able to apply the parsed style sheets to a DOM that represents an HTML or XML document and to compute the resulting style of the individual document elements. It supports CSS 2.1 and a large subset of CSS3.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    now here: https://github.com/plastex/plastex plasTeX is a Python-based LaTeX document processing framework. It gives DOM-like access to a LaTeX document, as well as the ability to generate mulitple output formats (e.g. HTML, DocBook, tBook, etc.).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A stand-alone editor using Mediawiki markup language to generate HTML code. You can create and preview pages written using Mediawiki markup (i.e. Wikipedia pages) while off-line.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 21
    The xslt2 script semAuth (semantic authoring) translates a freemind mindmap into an xhtml website and an RDF ontology
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Storm MVC is a php framework based on the model-view-controller design pattern featuring pretty URLs, site themes via inherited master pages, and easy forms processing. It is a mix of the best ideas from Rails, Django and ASP.NET MVC.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    nanoWIME is a simple, flexible, easy-to-use javascript based WikiMarkup editor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    ServingXML is an open source, Apache 2.0 licensed, framework for flat/XML data transformations. It defines an extensible markup vocabulary for expressing flat-XML, XML-flat, flat-flat, and XML-XML processing in pipelines.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    HTMLtools includes several Java HTML tools for preparing Web pages. The HTMLtools program automates batch conversion of tab-delimited spreadsheet text files to HTML Web-page files, file & table editing, keyword mapping, templates, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next