Showing 22 open source projects for "structured text"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    jq

    jq

    Lightweight and flexible command-line JSON processor

    jq is like sed for JSON data - you can use it to slice, filter, map and transform structured data with the same ease that sed, awk, grep and friends let you play with text. jq is written in portable C, and it has zero runtime dependencies. You can download a single binary, scp it to a far away machine of the same type, and expect it to work. jq can mangle the data format that you have into the one that you want with very little effort, and the program to do so is often shorter and simpler than you'd expect. ...
    Downloads: 72 This Week
    Last Update:
    See Project
  • 2
    OCRBase

    OCRBase

    MD/.JSON Document OCR and structured data extraction API

    OCRBase is a self-hostable document OCR and structured extraction system built to turn PDFs into machine-usable outputs at scale, aiming to bridge the gap between raw text extraction and production-ready pipelines. Instead of treating OCR as a one-off script, it presents an API-driven workflow where documents are submitted as jobs and processed through a queue-based architecture that can handle high throughput.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    py-pdf-parser

    py-pdf-parser

    A Python tool to help extracting information from structured PDFs

    py-pdf-parser is a Python tool designed to help extract information from structured PDFs. It provides a simple interface to define parsing rules and extract data from PDF documents. ​
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    PDFCraft

    PDFCraft

    PDFCraft is a free, privacy-focused PDF toolkit

    PDFCraft is an extensible toolkit for creating, editing, and transforming PDF documents with both a graphical interface and a scripting API, making it useful for users ranging from casual editors to automated document processors. At its core, the project provides a clean, modern UI where you can rearrange pages, annotate text, insert images, fill forms, and export to multiple formats, all without needing a heavyweight commercial PDF suite. But beyond manual editing, it also offers a programmable layer so developers can write scripts to batch process documents, generate templated reports, or extract structured data from PDFs for integration in workflows. ...
    Downloads: 16 This Week
    Last Update:
    See Project
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • 5
    JSON Editor

    JSON Editor

    A web-based tool to view, edit, format, and validate JSON

    JSON Editor is a web-based JSON editing and visualization tool designed for viewing, editing, formatting, validating, and transforming JSON documents in multiple interactive modes. The project provides several editing interfaces including tree view, code editor, form-based editing, and plain text modes, allowing users to work with structured data in the format most suitable for their workflow. It can be embedded directly into web applications as a reusable component and supports large JSON documents with schema validation and formatting capabilities. JSONEditor emphasizes usability by combining developer-focused functionality with accessible visual editing tools for non-technical users. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    jsonrepair

    jsonrepair

    Repair invalid JSON documents

    ...The project focuses on automation and fault tolerance, reducing the need for manual cleanup of corrupted JSON data. Its lightweight architecture and practical functionality have made it valuable for modern applications that process unpredictable structured text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    CSV Quick Viewer

    CSV Quick Viewer

    CSV Quick Viewer

    ...CSVQuickViewer runs without administrative rights, provides column insights, and allows exporting filtered data, offering a fast, reliable, and local solution for inspecting structured text data. Ideal for analysts, developers, and support teams working with large or messy data files.
    Leader badge
    Downloads: 28 This Week
    Last Update:
    See Project
  • 8
    Go support for Protocol Buffers

    Go support for Protocol Buffers

    The Go support for Google's protocol buffers

    Protocol buffers are Google's language-neutral, platform-neutral, extensible mechanism for serializing structured data, think XML, but smaller, faster, and simpler. You define how you want your data to be structured once, then you can use special generated source code to easily write and read your structured data to and from a variety of data streams and using a variety of languages. Protocol buffers currently support generated code in Java, Python, Objective-C, and C++. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    JSONVue

    JSONVue

    Fork of JSONView for Chromium-based browsers

    JSONVue is a Chromium browser extension derived from JSONView for displaying JSON responses in a more readable format. It improves the experience of opening raw JSON in the browser by formatting the data instead of showing unstructured text. The extension is useful for developers, API testers, and anyone who frequently inspects JSON returned by web services. It makes nested objects and arrays easier to scan by applying structured presentation in the browser tab. Because it targets Chromium-based browsers, it fits naturally into Chrome-style web development workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 10
    RTextDoc

    RTextDoc

    An editor for structured documents

    RTextDoc is an editor for structured text documents such as LaTeX, AsciiDoc, DocBook. RTextDoc has proofreading capabilities: on-the-fly spelling, instant grammar checking and built-in free dictionaries. RTextDoc has syntax highlighting, bracket matching, folding, document structure browser for sections and labels, bookmarks, manager for LaTeX symbols, an editor for mathematical equations,integrated BibTeX database manager and several tools to convert LaTeX to HTML and back. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    rq

    rq

    A tool for doing record analysis and transformation

    ...The goal is to make ad-hoc exploration of data sets easy without having to use more heavy-weight tools like SQL/MapReduce/custom programs. rq fills a similar niche as tools like awk or sed, but works with structured (record) data instead of text. It was created with love out of the best parts of Rust, and is distributed as a dependency-free binary on many operating systems and architectures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    unfluff

    unfluff

    Automatically extract body content (and other cool stuff) from HTML

    unfluff is a Node.js library designed to automatically extract the main content from an HTML document — stripping away navigation bars, ads, footers and other boilerplate to leave you with the “body content”, metadata (title, author, date) and other useful fields. It’s a tool very much aimed at content-analysis, web scraping, building datasets, or repurposing article text for downstream processing (like machine-learning or summarization). The API is simple: you feed in raw HTML and it returns a structured object with the extracted text and other fields. It supports caching internal representations to speed up repeated extractions. While its language support is best for English, it is still widely used in web-content-processing pipelines. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    data2bin

    Create structured binary files from XML data.

    Need to create binary files with data for Your program, game etc.? Tired of using hex-editor and editing the file manually with the risk of structure-mismatches? Too lazy to reedit complete file after changing structure members order or size? "data2bin" is a utility that takes: 1. Your structures description (you can use integers of different sizes and endiannesses, null-terminated text strings, fixed-size binary strings, structures, arrays...) 2. Your data in a XML file written...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 14
    OGDL is a structured format for representing graphs of information, alternative to XML. Its grammar is very simple allowing for compact parsers. The text version is easily readable; the binary version is used for storage and interprocess communication.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A bundle of lisp extensions, largely original, for GNU Emacs with the goal to obtain a more user friendly and powerful interface. The new features include contextual tool bars, new TeX interface, very complete menus, and a well structured IDE.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    Configuration markup language

    Human readable and human writable format for the config files

    For object-oriented programs is useful to supply the configuration information in structured manner also. Seems the XML is the an answer. But too much <tag></tag> everywhere make the XML for *.config or *.ini files almost human unreadable and uneditable. This library intended to read text of markup configurations files in uniform way. Text information from the file is loaded by your program as a structural tree. After slurping a *.config file we can supply the resulting objects to object instances of our program to let them configure themselves. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ePubHub
    A structured document editor designed for creating ePub eBook files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Klang is a project that allows viewing and editing of binary files in a structured way. Unlike traditional hex editors, Klang provides a hierarchical view of many binary file types that can be 'chunked', such as WAV and AIFF.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    DendrEd
    DendrEd is a tree-structured text data editor, that helps entry-level computer users to create and store such data in XML files avoiding markup and confusion caused by this markup.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Generates PHP code from QML source. QML is for "Quest Markup Language", a structured description language to create text-based quests created by Philipp Lenssen. PHP output is customizable using CSS and overriding the default header and footer files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    The Comparative Genomics Vocabulary (CGV) is a SKOS representation of comparative genomics containing terms, text definitions and synonyms of the domain. The vocabulary is structured with broader and narrower relationships between the concepts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Prototype for a framework and user interface for combining various structured search and document clustering techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo