Showing 1589 open source projects for "parsing"

  • 1
    GROBID

    A machine learning software for extracting information

    ...Header extraction and parsing from articles in PDF format. The extraction here covers the usual bibliographical information (e.g. title, abstract, authors, affiliations, keywords, etc.). Reference extraction and parsing from articles in PDF format, with an F1-score of around 0.87 on an independent PubMed Central set of 1,943 PDFs containing 90,125 references, and around 0.89 on a similar bioRxiv set of 2,000 PDFs (using the Deep Learning citation model).
    Downloads: 4 This Week
  • 2
    mail-parser

    Fast and robust e-mail parsing library for Rust

    Stalwart Mail Parser is a high-performance Rust library designed for parsing and analyzing email messages. It efficiently handles complex MIME structures, supports various encodings, and is optimized for speed and robustness. The library is suitable for applications requiring detailed email processing and analysis.
    Downloads: 2 This Week
  • 3
    commonmark-java

    Java library for parsing and rendering CommonMark (Markdown)

    Java library for parsing and rendering Markdown text according to the CommonMark specification (and some extensions). Provides classes for parsing input to an abstract syntax tree of nodes (AST), visiting and manipulating nodes, and rendering to HTML. It started out as a port of commonmark.js, but has since evolved into a full library with a nice API.
    Downloads: 2 This Week
  • 4
    Aeson

    A fast Haskell JSON library

    ...Fast JSON parsing and serialization.
    Downloads: 3 This Week
  • 5
    Douyin TikTok Download API

    Douyin TikTok Download API

    ...Fast, asynchronous, free, open source, ad-free, and maintained long-term. This project is a fast, asynchronous Douyin / TikTok data-crawling tool based on PyWebIO, FastAPI, and HTTPX. It provides online batch parsing and downloading of watermark-free videos or image albums through the web interface, a data-crawling API, and iOS shortcuts for watermark-free downloads, among other functions. You can deploy or adapt this project yourself to add more functions, call scraper.py directly in your own project, or install the existing pip package as a parsing library to crawl data. ...
    Downloads: 13 This Week
  • 6
    ArgParse.jl

    Package for parsing command-line arguments to Julia programs

    ArgParse.jl is a package for parsing command-line arguments to Julia programs.
    Downloads: 0 This Week
  • 7
    ProxyPin

    ProxyPin: free, open source software for capturing HTTP(S) traffic

    ...Developed in Flutter, it enables interception, inspection, rewriting, and blocking of network traffic across devices via scans and QR-based connection. It supports scripting, filtering, statistics, and integrates HTTP/2, zstd, WebSocket parsing, and AES tools.
    Downloads: 52 This Week
  • 8
    WhatsApp Chat Exporter

    A cross-platform tool for parsing WhatsApp chat databases

    WhatsApp-Chat-Exporter is a customizable tool for parsing WhatsApp databases from Android and iOS devices. It extracts chat histories and exports them in HTML and JSON formats, supporting various encryption standards.
    Downloads: 12 This Week
  • 9
    LlamaParse

    Parse files for optimal RAG

    LlamaParse is a GenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents). Load in 160+ data sources and data formats, from unstructured and semi-structured to structured data (APIs, PDFs, documents, SQL, etc.). Store and index your data for different use cases. Integrate with 40+ vector stores, document stores, graph stores, and SQL database providers.
    Downloads: 4 This Week
  • 10
    Trurl

    A command line tool for URL parsing and manipulation

    trurl is a command-line tool developed by the curl project for parsing and manipulating URLs. It allows users to modify URL components easily, aiding in tasks like scripting and testing.
    Downloads: 0 This Week
  • 11
    dots.ocr

    Multilingual Document Layout Parsing in a Single Vision-Language Model

    dots.ocr is a cutting-edge multilingual document parsing system built on a unified vision-language model that combines layout detection, text recognition, and structural understanding into a single architecture. Unlike traditional OCR pipelines that rely on multiple specialized components, dots.ocr integrates these processes end-to-end, reducing error propagation and improving consistency across tasks.
    Downloads: 0 This Week
  • 12
    DAW JSON Link v3

    Fast, convenient JSON serialization and parsing in C++

    The DAW JSON Link library is a high-performance, no-allocation C++ JSON library. It parses JSON directly into your own data structures via simple declarative mappings and reports unexpected data and errors earlier. Its memory requirement is low: the parser needs a small amount of stack but does not allocate itself. It also applies optimizations based on data types.
    Downloads: 0 This Week
  • 13
    Rust-Peg

    Parsing Expression Grammar (PEG) parser generator for Rust

    rust-peg is a Parsing Expression Grammar (PEG) parser generator for Rust. It allows developers to define grammars in a concise syntax and generates efficient parsers, facilitating the creation of domain-specific languages and interpreters.
    Downloads: 0 This Week
  • 14
    Pyparsing

    Python library for creating PEG parsers

    pyparsing is a Python library that facilitates the creation of parsers using a parsing expression grammar (PEG) approach. It allows developers to construct grammars directly in Python code, offering an alternative to traditional parsing methods.
    Downloads: 0 This Week
  • 15
    Fast CSV

    CSV parser and formatter for node

    A high-performance Node.js library for parsing and formatting CSV data efficiently.
    Downloads: 0 This Week
  • 16
    Swift Argument Parser

    Straightforward, type-safe argument parsing for Swift

    Swift Argument Parser is a type-safe, declarative library for building Swift command-line tools by annotating your command types with property wrappers like @Option, @Argument, and @Flag. It handles parsing, validation, automatic help generation, and dispatch to your run() or async entry point, letting you focus on command logic instead of boilerplate. The package supports subcommands, default values, custom parsing strategies, and rich error messages that match platform conventions. It integrates cleanly with Swift Package Manager, works well in CI, and provides incremental improvements such as Sendable conformances for wrappers to help in concurrent contexts. ...
    Downloads: 0 This Week
  • 17
    ChordSheetJS

    A JavaScript library for parsing and formatting chords and chord sheets

    ChordSheetJS is a JavaScript library for parsing, formatting, and transposing chord sheets. It supports various chord sheet formats and provides tools for rendering and manipulating chord and lyric data.
    Downloads: 5 This Week
  • 18
    YAML

    JavaScript parser and stringifier for YAML

    ...This library supports both YAML 1.1 and YAML 1.2 and all common data schemas, and passes all of the yaml-test-suite tests. It can accept any string as input without throwing, parsing as much YAML out of it as it can, and supports parsing, modifying, and writing YAML comments and blank lines. The library is released under the ISC open source license, and the code is available on GitHub. It has no external dependencies and runs on Node.js as well as modern browsers. For the purposes of versioning, any changes that break any of the endpoints or APIs documented here will be considered semver-major breaking changes. ...
    Downloads: 1 This Week
  • 19
    TextFSM

    Python module for parsing semi-structured text into Python tables

    TextFSM is a Python library created by Google that provides a template-based state machine engine for parsing semi-structured text. It is particularly useful for extracting structured data from command-line interface (CLI) outputs, such as those from network devices, routers, and switches. By defining parsing logic through reusable template files, TextFSM transforms unstructured text into structured data like lists or tables without requiring complex regular expression code. ...
    Downloads: 0 This Week
  • 20
    JSON for Modern C++

    JSON that's part of C++

    ...It's been heavily unit-tested covering 100% of the code, and follows the Core Infrastructure Initiative (CII) best practices to ensure the highest quality at all times. Among its many features are JSON pointers, JSON patches, Iterators, SAX parsing and various container operations.
    Downloads: 36 This Week
  • 21
    Rust lexical

    Fast numeric to- and from-string conversion routines

    ...Designed for efficiency and flexibility, it operates in no_std environments, making it suitable for embedded systems and performance-critical applications. The library supports various numeric formats and provides customizable parsing and formatting options.
    Downloads: 0 This Week
  • 22
    Cpp-Peglib

    A single-file, header-only C++ PEG (Parsing Expression Grammars) library

    cpp-peglib is a single-file, header-only C++17 library for Parsing Expression Grammars (PEG). It enables developers to define grammars and build parsers directly within C++ code without external dependencies.
    Downloads: 0 This Week
  • 23
    lxml

    The lxml XML toolkit for Python

    A Python library for efficient XML and HTML processing, known for speed and compatibility. The lxml XML toolkit is a Pythonic binding for the C libraries libxml2 and libxslt. It is unique in that it combines the speed and XML feature completeness of these libraries with the simplicity of a native Python API, mostly compatible with but superior to the well-known ElementTree API. The latest release works with all CPython versions from 3.6 to 3.12. See the introduction for more information about the...
    Downloads: 13 This Week
  • 24
    LiteParse

    A fast, helpful, and open-source document parser

    LiteParse is an open-source lightweight parsing library designed to extract structured data from unstructured text using large language models in an efficient and cost-effective manner. It focuses on simplifying the process of turning raw text into structured outputs such as JSON by providing a streamlined interface for prompt-based parsing. The system is designed to minimize overhead, making it suitable for applications where performance and cost are critical considerations. ...
    Downloads: 2 This Week
  • 25
    DeviceDetector

    The Universal Device Detection library will parse any User Agent

    Device Detector is a PHP library for parsing user-agent strings to detect devices, operating systems, and browsers, widely used in analytics and tracking.
    Downloads: 3 This Week