Showing 57 open source projects for "processing"

  • 1
    html-loader

    HTML Loader

    Exports HTML as a string. HTML is minimized when the compiler demands. A value of true enables the processing of all default elements and attributes; a value of false disables the processing of all attributes. You can also specify which tags and attributes to process and how, filter them, filter URLs, and process sources starting with /.
    Downloads: 1 This Week
  • 2
    geckodriver

    WebDriver for Firefox

    ...Generally speaking, Selenium 3 enabled geckodriver as the default WebDriver implementation for Firefox. With the release of Firefox 47, FirefoxDriver had to be discontinued for its lack of support for the new multi-processing architecture in Gecko. Since geckodriver is a separate HTTP server that is a complete remote end implementation of WebDriver, it is possible to avoid using the Selenium remote server if you have no requirements to distribute processes across a matrix of systems.
    Downloads: 99 This Week
  • 3
    Tesla

    The flexible HTTP client library for Elixir

    ...Tesla supports multiple HTTP adapters that do the actual HTTP request processing. Each piece of the stream will be encoded as JSON and sent as a new line (conforming to JSON stream format). You can set the adapter to Tesla.Mock in tests.
    Downloads: 0 This Week
  • 4
    Lexbor

    Lexbor is the development of an open source HTML renderer library

    Lexbor is the development of a web browser engine available as a software library; it ships with a free license and has no extra dependencies. For us, speed is an absolute must-have. In our development process, we focus on the fastest parsing techniques for HTML, CSS, and fonts, the fastest data processing methods, and the fastest ways to serve content to end users. Whether you are building a backend that handles millions of HTML documents or a UI-heavy user app, your software’s response rate always matters to users and developers alike. Lexbor’s code is optimized for ease of access in end-user applications and across programming languages. ...
    Downloads: 0 This Week
  • 5
    SingleFile

    Web Extension for Firefox/Chrome/MS Edge and CLI tool

    ...Wait until the page is fully loaded. Click on the SingleFile button in the extension toolbar to save the page. You can click again on the button to cancel the action when processing a page. Open the context menu by right-clicking the SingleFile button in the extension toolbar or on the webpage. It allows you to save the current tab, the selected content, the selected frame. With auto-save active, pages are automatically saved every time after being loaded (or before being unloaded if not). Right-click on the SingleFile button and select "Manage extension" (Firefox) / "Options" (Chrome) to open the options page.
    Downloads: 3 This Week
  • 6
    mp-html

    Mini program rich text component that supports rendering and editing HTML

    A powerful rich text component for mini programs. It supports rendering and editing HTML and can be used on WeChat, QQ, Baidu, Alipay, Toutiao, and uni-app platforms. Displaying dynamic HTML rich text is a requirement for many applications. The mini program platform does not support DOM operations, which makes this a problem. The built-in rich-text component supports few tags and blocks all events, making it impractical for real applications. Therefore, there is such a...
    Downloads: 0 This Week
  • 7
    Emogrifier

    Converts CSS styles into inline style attributes in your HTML code

    Emogrifier converts CSS styles into inline style attributes in your HTML code. This ensures proper display on email and mobile device readers that lack stylesheet support. This utility was developed as part of Intervals to deal with the problems posed by certain email clients (namely Outlook 2007 and GoogleMail) when it comes to the way they handle styling contained in HTML emails. As many web developers and designers already know, certain email clients are notorious for their lack of CSS...
    Downloads: 0 This Week
  • 8
    docconv

    Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text

    ...To add image support to the docconv library you first need to install and build gosseract. Now you can add -tags ocr to any go command when building/fetching/testing docconv to include support for processing images. Documents can be sent as a multipart POST request and the plain text (body) and meta information are then returned as a JSON object.
    Downloads: 0 This Week
  • 9
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well-proven XML and text processing technologies to easily extract useful data from arbitrary web pages.
    Downloads: 2 This Week
  • 10

    HTML parser in Delphi

    A Delphi class with functions to read and dissect an HTML file

    ...The file to be parsed must have valid HTML4/5 tags. The HTML need not be ‘correct’, however: end tags may be wrongly placed or absent altogether. The speed of processing (reading+parsing) is formidable: 15-40 msec per Mbyte or around 1 msec per 1000 HTML tags.
    Downloads: 20 This Week
  • 11
    CSSBox

    Pure Java HTML / CSS rendering engine

    CSSBox is an (X)HTML/CSS rendering engine written in pure Java. Its primary purpose is to provide complete information about the rendered page, suitable for further processing. However, it also allows displaying the rendered document.
    Downloads: 13 This Week
  • 12
    ModularAdmin

    Free Dashboard Theme Built On Bootstrap 4 | HTML Version

    ...Main variables are defined in the src/_variables.scss file. To make life easier, we broke styles down into components; on build, all .scss files are merged together and processed into the dist/css/app.css file. There are also different theme variations located in the src/_themes/ folder, where you can change the main variables to get different themes. There are a few predefined themes built in. You can add new themes by adding a new file in the src/_themes/ folder. We separate the application's scripts across its components. ...
    Downloads: 0 This Week
  • 13
    unfluff

    Automatically extract body content (and other cool stuff) from HTML

    unfluff is a Node.js library designed to automatically extract the main content from an HTML document — stripping away navigation bars, ads, footers and other boilerplate to leave you with the “body content”, metadata (title, author, date) and other useful fields. It’s a tool very much aimed at content-analysis, web scraping, building datasets, or repurposing article text for downstream processing (like machine-learning or summarization). The API is simple: you feed in raw HTML and it returns a structured object with the extracted text and other fields. It supports caching internal representations to speed up repeated extractions. While its language support is best for English, it is still widely used in web-content-processing pipelines. ...
    Downloads: 0 This Week
  • 14
    React HTML Parser

    Converts HTML strings directly into React components

    Utility for converting HTML strings into React components. Converts standard HTML elements, attributes and inline styles into their React equivalents and provides a simple way to modify and replace the content. It is important to understand that this library should not be used as a direct replacement for properly sanitized HTML, and that it only provides the same level of protection that React does, which is not 100% protection. All HTML should be properly sanitized using a...
    Downloads: 0 This Week
  • 15
    Gallop

    A framework for building smooth asynchronous iOS apps

    ...Easily draw text frame bounding boxes, draw hollow text, set the text vertical alignment property, etc. Easily parse expressions in the text, such as http(s) links, @ user, # theme #, and phone numbers. Quickly set the image corner radius property and apply blur processing; the image can be processed directly.
    Downloads: 0 This Week
  • 16

    dpanalyzer

    postprocessing tool for Project Gutenberg Distributed Proofreaders

    ...Planned future features: generation of normalized dp output by rejoining split paragraphs and moving around footnotes, renumbering of pages; conversion to basic LaTeX and basic HTML markup for further processing.
    Downloads: 0 This Week
  • 17
    pup

    Parsing HTML at the command line

    pup is a command line tool for processing HTML. It reads from stdin, prints to stdout, and allows the user to filter parts of the page using CSS selectors. Inspired by jq, pup aims to be a fast and flexible way of exploring HTML from the terminal. If you have Go installed on your computer just run go get. If you're on OS X, use Homebrew to install (no Go required). By default pup will fill in missing tags and properly indent the page.
    Downloads: 2 This Week
  • 18
    Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 11 This Week
  • 19
    A Java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets.
    Downloads: 0 This Week
  • 20

    jStyleParser

    Java CSS parser and DOM style assignment library

    jStyleParser is a CSS parser written in Java. It has its own application interface designed to allow efficient CSS processing in Java and mapping of the values to Java data types. It can also apply the parsed style sheets to a DOM that represents an HTML or XML document and compute the resulting style of the individual document elements. It supports CSS 2.1 and a large subset of CSS3.
    Downloads: 0 This Week
  • 21
    DocFrac is a document converter that can convert between RTF, HTML and ASCII text. This includes RTF to HTML and HTML to RTF. Supports text formatting (e.g. bold), tables, and most European languages. Available for Windows and Linux, and as an ActiveX control and DLL.
    Downloads: 5 This Week
  • 22
    DPRK pull is a script that pulls the English-language North Korean news articles from the KCNA website and puts them into one file for reading by a text-to-speech program.
    Downloads: 0 This Week
  • 23
    (Hyper)Markup can: 1) edit HTML or text files, 2) tidy up your HTML source code and produce formatted text, 3) flexibly generate XML code or XHTML files from given HTML files with an XML template and XSL stylesheet.
    Downloads: 0 This Week
  • 24
    Now here: https://github.com/plastex/plastex. plasTeX is a Python-based LaTeX document processing framework. It gives DOM-like access to a LaTeX document, as well as the ability to generate multiple output formats (e.g. HTML, DocBook, tBook, etc.).
    Downloads: 0 This Week
  • 25
    Maintain your resume like the lazy slob you are! Write a simple text file, produce simple HTML. Don't screw around with XML or any other form of overkill.
    Downloads: 0 This Week