Showing 1207 open source projects for "linux file parser"

  • 1
    BSON parser

    BSON Parser for node and browser

    BSON parser is the core JavaScript implementation of BSON (Binary JSON), the data format used by MongoDB to represent documents on the wire and in storage. It provides encoding and decoding capabilities so JavaScript environments like Node.js and browsers can serialize rich data structures (including types not native to JSON) into the compact, efficient BSON binary representation used by MongoDB drivers and servers. Value types such as ObjectId, Binary, UTC datetime, Decimal128, and Long are...
    Downloads: 0 This Week
  • 2
    html-react-parser

    HTML to React parser

    HTML to React parser that works on both the server (Node.js) and the client (browser). The parser converts an HTML string to one or more React elements. Available as part of the Tidelift Subscription. For TypeScript projects, you may need to check that domNode is an instance of domhandler's Element. Make sure to render parsed adjacent elements under a parent element.
    Downloads: 2 This Week
  • 3
    py-pdf-parser

    A Python tool to help extract information from structured PDFs

    py-pdf-parser is a Python tool designed to help extract information from structured PDFs. It provides a simple interface to define parsing rules and extract data from PDF documents.
    Downloads: 1 This Week
  • 4
    Json Parser in Standard C

    JSON parser in standard C

    This JSON parser was created for the C++ Workflow project.
    Downloads: 0 This Week
  • 5
    fast-xml-parser

    Validate XML, Parse XML and Build XML rapidly

    Validate XML, parse XML to a JS object, or build XML from a JS object, without C/C++-based libraries and without callbacks.
    Downloads: 0 This Week
  • 6
    Vince's CSV Parser

    A modern C++ library for reading, writing, and analyzing CSV

    There are plenty of other CSV parsers in the wild, but I had a hard time finding what I wanted. Inspired by Python's csv module, I wanted a library with simple, intuitive syntax. Furthermore, I wanted support for special use cases such as calculating statistics on very large files. With the deluge of large datasets available, a performant CSV parser is a necessity. By using overlapped threads, memory-mapped IO, and efficient data structures, this parser can quickly tackle large CSV files....
    Downloads: 0 This Week
  • 7
    PHP7

    PHP7 / Laravel Multi-format Streaming Parser

    When it comes to parsing XML/CSV/JSON/... documents, there are two approaches to consider. DOM loading loads the entire document, making it easy to navigate and parse, and so gives developers maximum flexibility. Streaming iterates through the document like a cursor, stopping at each element along the way, and thus avoids excessive memory use. With big files, callbacks are executed while the file is still downloading, which is much more efficient as far as...
    Downloads: 4 This Week
  • 8
    asar

    Simple extensive tar-like archive format with indexing

    Asar is a simple extensive archive format. It works like tar in that it concatenates all files together without compression, while also supporting random access. You can pass in a transform option, which is a function that returns either nothing or a stream.Transform. The latter will be used on files that will be in the .asar file to transform them. There is also an unofficial grunt plugin to generate asar archives at bwin/grunt-asar. Asar uses Pickle to safely serialize binary value to file,...
    Downloads: 27 This Week
  • 9
    LOL HTML

    Low output latency streaming HTML parser/rewriter with CSS API

    Low Output Latency streaming HTML rewriter/parser with CSS-selector based API. It is designed to modify HTML on the fly with minimal buffering. It can quickly handle very large documents, and operate in environments with limited memory resources. The crate serves as a back-end for the HTML rewriting functionality of Cloudflare Workers, but can be used as a standalone library with a convenient API for a wide variety of HTML rewriting/analysis tasks. The parser switches back to the tag scanner...
    Downloads: 3 This Week
  • 10
    parse5

    HTML parsing/serialization toolset for Node.js.

    HTML parsing/serialization toolset for Node.js. WHATWG HTML Living Standard (aka HTML5)-compliant. parse5 provides nearly everything you may need when dealing with HTML. It's the fastest spec-compliant HTML parser for Node to date. It parses HTML the way the latest version of your browser does. It has proven itself reliable in such projects as jsdom, Angular, Lit, Cheerio, rehype and many more.
    Downloads: 0 This Week
  • 11
    JC

    CLI tool and Python library

    CLI tool and Python library that converts the output of popular command-line tools and file types to JSON or dictionaries. This allows output to be piped to tools like jq or jello for further command-line processing and simplifies automation scripts. The jc parsers can also be used as Python modules.
    Downloads: 0 This Week
  • 12
    PostHTML

    PostHTML is a tool to transform HTML/XML with JS plugins

    PostHTML is a tool for transforming HTML/XML with JS plugins. PostHTML itself is very small. It includes only an HTML parser, an HTML node tree API, and a node tree stringifier. All HTML transformations are made by plugins. These plugins are just small plain JS functions, which receive an HTML node tree, transform it, and return a modified tree.
    Downloads: 1 This Week
  • 13
    mistletoe

    A fast, extensible and spec-compliant Markdown parser in pure Python

    mistletoe is a Markdown parser in pure Python, designed to be fast, spec-compliant and fully customizable. Apart from being the fastest CommonMark-compliant Markdown parser implementation in pure Python, mistletoe also supports easy definitions of custom tokens. Parsing Markdown into an abstract syntax tree also allows us to swap out renderers for different output formats, without touching any of the core components.
    Downloads: 0 This Week
  • 14
    jackson-core

    Core part of Jackson that defines Streaming API

    This project contains core low-level incremental ("streaming") parser and generator abstractions used by Jackson Data Processor. It also includes the default implementation of handler types (parser, generator) that handle JSON format. The core abstractions are not JSON specific, although naming does contain 'JSON' in many places, due to historical reasons. Only packages that specifically contain the word 'json' are JSON-specific. This package is the base on which Jackson's data-binding...
    Downloads: 12 This Week
  • 15
    Floki

    Floki is a simple HTML parser that enables search for nodes using CSS selectors

    Floki is a simple HTML parser that enables search for nodes using CSS selectors. Floki needs the :leex module in order to compile. Normally this module is installed with Erlang in a complete installation. By default, Floki uses a patched version of mochiweb_html for parsing fragments due to its ease of installation (it's written in Erlang and has no outside dependencies). fast_html is generally faster, according to the benchmarks conducted by its developers.
    Downloads: 1 This Week
  • 16
    simdjson

    Parsing gigabytes of JSON per second

    JSON is everywhere on the Internet. Servers spend a *lot* of time parsing it. We need a fresh approach. The simdjson library uses commonly available SIMD instructions and microparallel algorithms to parse JSON 4x faster than RapidJSON and 25x faster than JSON for Modern C++. The simdjson library uses three-quarters fewer instructions than state-of-the-art parser RapidJSON. To our knowledge, simdjson is the first fully-validating JSON parser to run at gigabytes per second (GB/s) on commodity...
    Downloads: 0 This Week
  • 17
    Nokogiri

    Tool to work with XML and HTML from Ruby

    Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby. It provides a sensible, easy-to-understand API for reading, writing, modifying, and querying documents. It is fast and standards-compliant by relying on native parsers like libxml2 (C) and xerces (Java). It aims to be secure by default by treating all documents as untrusted, and to be a thin-as-reasonable layer on top of the underlying parsers, without attempting to fix behavioral differences between the parsers. "Native...
    Downloads: 0 This Week
  • 18
    omniparser

    Native Golang ETL streaming parser and transform library

    Omniparser is a native Golang ETL parser that ingests input data of various formats (CSV, txt, fixed length/width, XML, EDI/X12/EDIFACT, JSON, and custom formats) in streaming fashion and transforms data into desired JSON output based on a schema written in JSON.
    Downloads: 0 This Week
  • 19
    ojg

    Optimized JSON for Go

    Optimized JSON for Go is a high-performance parser with a variety of additional JSON tools. OjG is optimized for processing huge data sets where data does not necessarily conform to a fixed structure.
    Downloads: 0 This Week
  • 20
    html-loader

    HTML Loader

    ...Allows you to set up which tags and attributes to process and how, and to filter some of them. Filter can also be used to extend the supported elements and attributes. By default, the parser in html-loader interprets content inside noscript tags as #text, so processing of content inside this tag will be ignored. A very common scenario is exporting the HTML into its own .html file, to serve it directly instead of injecting it with JavaScript.
    Downloads: 4 This Week
  • 21
    Parsedown

    Better markdown parser in PHP

    Parsedown is capable of escaping user input within the HTML that it generates. Additionally, Parsedown will apply sanitisation to additional scripting vectors (such as scripting link destinations) that are introduced by the markdown syntax itself. If you wish to allow HTML within untrusted user input, but still want output to be free from XSS, it is recommended that you make use of an HTML sanitiser that allows HTML tags to be whitelisted, like HTML Purifier. In both cases you should strongly...
    Downloads: 0 This Week
  • 22
    Jawn

    Jawn is for parsing jay-sawn (JSON)

    The term "jawn" comes from the Philadelphia area. It conveys about as much information as "thing" does. I chose the name because I had moved to Montreal so I remembered Philly fondly. Also, there isn't a better way to describe objects encoded in JSON than "things". Finally, we get a catchy slogan. Jawn was designed to parse JSON into an AST as quickly as possible. Currently, Jawn is competitive with the fastest Java JSON libraries (GSON and Jackson) and in the author's benchmarks, it often...
    Downloads: 0 This Week
  • 23
    python-bibtexparser v2

    Bibtex parser for Python 3

    Welcome to python-bibtexparser, a parser for .bib files with a long history and wide adoption. Bibtexparser is available in two versions: v1 and v2. For new projects, we recommend using v2 which, in the long run, will provide an overall more robust and faster experience. For now, however, note that v2 is an early beta and does not contain all features of v1.
    Downloads: 0 This Week
  • 24
    jsoup

    Java library for working with real-world HTML

    ...It provides a very convenient API for fetching URLs and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers do. jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree. The parser will make every attempt to create a clean parse from the HTML you provide, regardless of whether the HTML is well-formed or not. You have HTML in a Java String, and you want to parse that HTML to get at its contents, or to make sure it's well formed, or to modify it. The String may have come from user input, a file, or from the web.
    Downloads: 2 This Week
  • 25
    Coost

    A tiny boost library in C++11

    coost is an elegant and efficient cross-platform C++ base library. Its goal is to create a sword of C++ to make C++ programming easy and enjoyable. Coost, co for short, is like boost but more lightweight: the static library built on Linux or Mac is only about 1MB in size. log supports two types of logs. One is the level log, which is divided into 5 levels (debug, info, warning, error, and fatal), where printing a fatal log will terminate the program. The other is the topic log, where logs are grouped by topic...
    Downloads: 0 This Week