Showing 25 open source projects for "python text parser"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Papermerge

    Papermerge

    Open Source Document Management System for Digital Archives

    Papermerge is an open source document management system (DMS) primarily designed for archiving and retrieving your digital documents. Instead of having piles of paper documents all over your desk, office or drawers - you can quickly scan them and configure your scanner to directly upload to Papermerge DMS. Store, organize and index scanned documents in PDF, JPEG and TIFF formats. Instantly find relevant information using full text, tags and metadata-based search. Papermerge is free and open...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 2
    Google Open Source Project Style Guide

    Google Open Source Project Style Guide

    Chinese version of Google open source project style guide

    .... If the project you are modifying originates from Google, you may be directed to the English version of the project page to understand the style used by the project. The Chinese version of the project uses reStructuredText plain text markup syntax, and uses Sphinx to generate document formats such as HTML / CHM / PDF.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    DevDocs

    DevDocs

    API Documentation Browser

    The devdocs repository powers the DevDocs web application, a fast, offline-friendly documentation browser for many programming languages, libraries, and APIs. It aggregates documentation from multiple sources (e.g., MDN, Python, Ruby, Git, etc.), converts them into a uniform format, and indexes them for instant text searching. The codebase includes a backend that handles ingestion, parsing, and transformation of documentation sources into a static site structure, as well as the client side UI...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    Utilities for general- and special-purpose documentation. Includes reStructuredText, the easy to read, easy to use, what-you-see-is-what-you-get plaintext markup language.
    Leader badge
    Downloads: 137 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    Paperless-ng

    Paperless-ng

    A supercharged version of paperless, scan, index and archive docs

    Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Moritz

    Moritz

    transfer xml into specific text-formats (html, dot, source-code, ...)

    Moritz is an "addon" to the well known tool doxygen. It generates nassi shneiderman diagramms of functions and methods in a c/c++ source as html-files, which could be included in a software-dokumentaion or simple whached by using a html-browser.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 7
    PROJECT MOVED TO https://github.com/paulhtremblay/rtf2xml The script rtf2xml faithfully converts Microsoft's RTF format to structured XML. Developers can make further transformations using standard XML tools, or use the stylsheets provided to convert to sdocbook or TEI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    inFolder

    A personal wiki created by your directory structure

    A graphical text editor used to maintain a collection of pages, whose content is created by the user, but whose hierarchical structure is dictated by the directory structure.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    pyWeb Literate Programming Tool

    Literate Programming in pure Python

    pyWeb is a Literate Programming tool that will work with any markup language and any programming langauge. The idea is to allow you to create great documentation with as constraints or limitations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Photo and Video Editing APIs and SDKs Icon
    Photo and Video Editing APIs and SDKs

    Trusted by 150 million+ creators and businesses globally

    Unlock Picsart's full editing suite by embedding our Editor SDK directly into your platform. Offer your users the power of a full design suite without leaving your site.
    Learn More
  • 10
    AsciiDoc is a presentable text document format for writing articles, UNIX man pages and other small to medium sized documents. The asciidoc(1) command translates AsciiDoc files to HTML, DocBook and LinuxDoc formats.
    Downloads: 40 This Week
    Last Update:
    See Project
  • 11
    Txt2tags converts a text file with minimal markup to HTML, XHTML, SGML, LaTeX, Lout, UNIX Man Page, Wikipedia, Google Code Wiki, DokuWiki, MoinMoin, MagicPoint(mgp), PageMaker. Features: simple, fast, automatic TOC, macros, filters, include, GUI/CLI/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    QuickDoc is a java document parser that reads documents from plain text files using a simple language and exports the document to other formats like PDF, HTML, Java Help and XML.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Some text parser, that transforms a custom TeX-like syntax into XML. Un parseur, qui produit du XML à partir d'une syntaxe ressemblant à TeX.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Lightweight requirements management tool, with export, search and traceability functions. Uses plain text files as backend.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A Python tool for creating websites or project documentation. Pages can be stored as reST (text) or html. With a simple templating and macro system it can autogenerate index pages and navigation links. Facilities for multiple translations as well.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    DocLite is a simple documentation authoring system, it produces multi-page HTML output (like this document) in a style similar to that found in the Linux HOTWTOs or other DocBook created documents.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 17
    This is a parser which reads plain-text input files and generates HTML output files. It combines the presentation features of HTML with the simplicity of plain-text notes. Generates HTML index files and hyperlinks for the words you choose to index.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    POST (Python Obviously Simple Text) provides support for simple, flexible dynamic document generation in multiple output formats. Supports inputs in text or XML, outputs in HTML, PDF, RTF, LaTeX source, nroff source, postscript, and plain text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Tools for extracting and transforming XML-like mark-up, embedded in source code comments, into proper external entities or well-formed XML files. Can be used for JavaDoc-like "literate programming", or embedding other build-related or CM metadata.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A modular system for extracting and converting Python docstrings into useful structured formats like HTML, XML, and TeX. Project inactive. Development taken over by Docutils, http://docutils.sourceforge.net/.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    reStructuredText defines & implement a markup syntax for use in Python docstrings and other documentation domains, that is readable & simple, yet powerful. Project inactive. Development taken over by Docutils, http://docutils.sourceforge.net/.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    man2dbk converts Unix man pages to XML DocBook.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    SrcComDoc allows documentation written in source comment rows to be extracted, formatted and highlighted according to the chosen documentation format. The basic SrcComDoc syntax is source and documentation language independent.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Yet Another Meeting Assistant (YaMA) can help you with minutes of meetings in a simple and efficient manner.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    This editor aims to help users creating their own ebooks in the newly released Open Publishing standard defined by International Digital Publishing Forum. The editor will permit the creatioin of ebook in OCF-1.0 format (.epub)
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.