Showing 141 open source projects for "text based"

  • 1
    TeXworks

    A simple interface for working with TeX documents

    TeXworks is a free and simple working environment for authoring TeX (LaTeX, ConTeXt and XeTeX) documents. Inspired by Dick Koch's award-winning TeXShop program for Mac OS X, it makes entry into the TeX world easier for those using desktop operating systems other than OS X. It provides an integrated, easy-to-use environment for users on other platforms, particularly GNU/Linux and Windows, and features a clean, simple interface accessible to casual and non-technical users.
    Downloads: 67 This Week
  • 2
    Pix2Text

    Open-Source Python3 tool for recognizing layouts, tables, and math

    An Open-Source Python3 tool for recognizing layouts, tables, math formulas, and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported. Pix2Text (P2T) aims to be a free and open-source Python alternative to Mathpix, and it can already accomplish Mathpix's core functionality. Pix2Text (P2T) can recognize layouts, tables, images, text, and mathematical formulas, and integrate all of these contents into Markdown format. ...
    Downloads: 4 This Week
  • 3
    Better BibTeX for Zotero

    Make Zotero effective for us LaTeX holdouts

    Better BibTeX (BBT) is a plugin for Zotero and Juris-M that makes it easier to manage bibliographic data, especially for people authoring documents using text-based toolchains (e.g. based on LaTeX / Markdown). Zotero does all its work in UTF-8 Unicode, which is absolutely the right thing to do. Unfortunately, for those shackled to BibTeX and who cannot (yet) move to BibLaTeX, Unicode is a major PITA. Also, Zotero supports some simple HTML markup in your items that Bib(La)TeX won’t understand.
    Downloads: 90 This Week
  • 4
    Memvid

    Video-based AI memory library. Store millions of text chunks in MP4

    Memvid encodes text chunks as QR codes within MP4 frames to build a portable “video memory” for AI systems. This innovative approach uses standard video containers and offers millisecond-level semantic search across large corpora with dramatically less storage than vector DBs. It's self-contained—no DB needed—and supports features like PDF indexing, chat integration, and cloud dashboards.
    Downloads: 6 This Week
  • 5
    PDF4QT

    Open source PDF editor

    ...Basic browsing and lots of other functionalities, such as encryption, reading a document, verification of digital signatures, editing of annotations, searching for text using regular expressions, drawing pages into an image, and much more. Several plug-ins are available.
    Downloads: 56 This Week
  • 6
    JSON Editor

    A web-based tool to view, edit, format, and validate JSON

    JSON Editor is a web-based JSON editing and visualization tool designed for viewing, editing, formatting, validating, and transforming JSON documents in multiple interactive modes. The project provides several editing interfaces including tree view, code editor, form-based editing, and plain text modes, allowing users to work with structured data in the format most suitable for their workflow.
    Downloads: 5 This Week
  • 7
    KOReader

    An ebook reader application supporting PDF, DjVu, EPUB, FB2, etc.

    KOReader is a document viewer for E Ink devices. Supported file formats include EPUB, PDF, DjVu, XPS, CBT, CBZ, FB2, PDB, TXT, HTML, RTF, CHM, DOC, MOBI and ZIP files. It runs on embedded devices (Cervantes, Kindle, Kobo, PocketBook, reMarkable), Android and desktop Linux. Developers can run a KOReader emulator on Linux and macOS. Multi-lingual user interface with a highly customizable reader view and many typesetting...
    Downloads: 136 This Week
  • 8
    svelte-jsoneditor

    Web-based tool to view, edit, format, repair, query, & transform JSON

    A web-based tool to view, edit, format, transform, and validate JSON. The library is written with Svelte, but can be used in plain JavaScript too, and in any framework (SolidJS, React, Vue, Angular, etc.). View and edit JSON with a low-level text editor, a high-level tree view, and a table view. Format (beautify) and compact JSON. Sort, query, filter, and transform JSON.
    Downloads: 0 This Week
  • 9
    OCRBase

    MD/.JSON Document OCR and structured data extraction API

    OCRBase is a self-hostable document OCR and structured extraction system built to turn PDFs into machine-usable outputs at scale, aiming to bridge the gap between raw text extraction and production-ready pipelines. Instead of treating OCR as a one-off script, it presents an API-driven workflow where documents are submitted as jobs and processed through a queue-based architecture that can handle high throughput. The core output is designed for downstream automation, producing structured results like JSON according to user-defined schemas while also providing readable formats like Markdown for human review or indexing. ...
    Downloads: 0 This Week
  • 10
    Static Web Server

    A cross-platform, high-performance and asynchronous web server

    Static Web Server (SWS for short) is a tiny and fast production-ready web server suitable for serving static web files or assets. It focuses on being lightweight and easy to use while keeping high performance and safety, powered by the Rust programming language. Written on top of Hyper and the Tokio runtime, it provides concurrent and asynchronous networking abilities and the latest HTTP/1 and HTTP/2 implementations. Cross-platform and available for Linux, macOS, Windows, FreeBSD, NetBSD,...
    Downloads: 1 This Week
  • 11
    Elixir Language Server

    A frontend-independent IDE "smartness" server for Elixir

    Implementing features such as auto-complete or go-to-definition for a programming language is not trivial. Traditionally, this work had to be repeated for each development tool and it required a mix of expertise in both the targeted programming language and the programming language internally used by the development tool of choice. The Elixir Language Server (ElixirLS) provides a server that runs in the background, providing IDEs, editors, and other tools with information about Elixir Mix...
    Downloads: 0 This Week
  • 12
    CssSelector Component

    Converts CSS selectors to XPath expressions

    XPath expressions are incredibly flexible, so there is almost always an XPath expression that will find the element you need. Unfortunately, they can also become very complicated, and the learning curve is steep. Even common operations (such as finding an element with a particular class) can require long and unwieldy expressions. CSS selectors are less powerful than XPath, but far easier to write, read and understand. Since they are less powerful, almost all CSS selectors can be converted to...
    Downloads: 0 This Week
  • 13
    Microsoft Works format import library
    libwps is a Microsoft Works file format import filter built on top of librevenge (see https://sourceforge.net/p/libwpd/wiki/librevenge/ ). Currently, libwps can import all word processing Works formats since about 1995 with some success. It may also be able to import some basic database and spreadsheet files.
    Downloads: 338 This Week
  • 14

    RecordEditor

    Editor for Fixed Width, Csv and Existing Xml files.

    The RecordEditor is a data file editor for flat files (delimited and fixed field position). It supports Unix / PC / legacy (e.g. mainframe) file formats, both text and binary. The editor uses a record-layout description to format the files, which is ideal for fixed-width (text or binary) files, Cobol data files, mainframe files and complicated CSV files. Cobol copybooks can be used to format Cobol data files. As well as an editor, the following utilities are supplied: * Formatted...
    Downloads: 28 This Week
  • 15
    Krajee

    An enhanced HTML 5 file input for Bootstrap 5.x/4.x/3.x

    ...The plugin offers a simple way to set up an advanced file picker/upload control built to work specifically with Bootstrap CSS3 styles. It further enhances the file input by supporting previews of a wide variety of files, i.e. images, text, HTML, video, audio, Flash, and objects. In addition, it includes AJAX-based uploads, drag and drop of files, upload progress display, and selective previewing, adding, or deleting of files. Bootstrap 5.x support is available with release v5.2.0. From release v5.2.0 onward, the plugin also auto-detects the Bootstrap version based on the Bootstrap JS library loaded. ...
    Downloads: 0 This Week
  • 16

    JRecord

    Read Cobol data files in Java

    Provides Java record-based I/O routines for fixed-width (including text, mainframe, Cobol and binary) and delimited flat files via a record layout (Cobol, CSV or XML). The source is now available at https://github.com/bmTas/JRecord
    Projects using JRecord include:
      * https://github.com/thospfuller/rcoboldi - Cobol files in R
      * https://github.com/tmalaska/CopybookInputFormat - Cobol files in Hadoop
      * https://github.com/gss2002/copybook_formatter
      * https://github.com/gss2002/ftp2hdfs - has some code that allows FTPing RDW files directly from the mainframe into Hadoop/HDFS as a MapReduce job or standalone client
    Downloads: 19 This Week
  • 17
    metaf2xml

    Parse and decode METAR, TAF, SYNOP, BUOY, AMDAR and write data as XML

    ...The extracted data can be written as XML or passed to a user-defined function (all done in Perl). It also provides XSLT style sheets to convert the XML to plain language (text, HTML), or XML with different schemas. A web-based (CGI) user interface can be used to download and display up-to-date weather data from NOAA, Ogimet, and other sources.
    Downloads: 1 This Week
  • 18
    Chord5

    A version of CHORD4 updated to cooperate with ChordSmith.

    CHORD5 is a ChordPro editor and renderer, useful for formatting and printing song sheets ("lead sheets"). This version of the CHORD program (based on CHORD4) has been modified to add functionality that enables it to cooperate with the ChordSmith program (available at https://sourceforge.net/projects/chordsmith/). This modified version has been renamed to CHORD5. Although it is revised to work with ChordSmith, it also works well as a standalone program. NOTE: If you have...
    Downloads: 2 This Week
  • 19
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well-proven XML and text processing technologies to easily extract useful data from arbitrary web pages.
    Downloads: 3 This Week
  • 20
    jPicEdt

    Another drawing editor for LaTeX with PSTricks & TikZ

    jPicEdt is an extensible, internationalized, vector-based drawing editor for LaTeX and related packages (TikZ, PSTricks, ...), written in Java. It is also a library of reusable high-level graphic primitives.
    Downloads: 8 This Week
  • 21

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite that converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without loss of information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search, using the relational data model. File conversion from XML to CSV, TSV, or JSON is possible, as is mapping XML Schema to JSON Schema. The resulting PostgreSQL database can be optimized at the user's discretion. ...
    Downloads: 5 This Week
  • 22
    BWTC32Key

    A file compressor with AES256CTR and Base32768 binary-to-text encoding

    BWTC32Key is a program I wrote that compresses data, optionally encrypts it, and then outputs a Base32768 representation as the final result.
    Downloads: 0 This Week
  • 23

    damo-blog-engine

    DAMO Blog Engine is a light posting articles, comment, ...

    Generates static HTML from simple-syntax articles, builds a tag index, supports comment posting, and applies smart formatting to titles, images, text, ...
    Downloads: 0 This Week
  • 24
    Kile LaTeX Editor

    Kile is a feature-rich LaTeX editor

    Kile is a LaTeX editor, a TeX shell that simplifies the creation of LaTeX documents.
    Downloads: 232 This Week
  • 25
    HTML_And_CSS_Basics_With_Important_Tags

    An easy to edit webpage template / cheatsheet

    Just view this HTML file in your favourite web browser and edit it with your favourite text editor, according to your needs. That's all.
    Downloads: 0 This Week