Showing 31 open source projects for "process open"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    HarfBuzz

    HarfBuzz

    Open source text shaping engine

    HarfBuzz is an open source text-shaping engine with a C API that turns fonts and strings of character codes into a form that is correctly arranged for the corresponding language and writing system. This is essentially the process of text shaping: translating a string of character codes into a properly arranged sequence of glyphs that can be rendered onto a screen or into final output form for inclusion in a document.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 2
    ANTLR

    ANTLR

    Parser generator to read, process, or translate structured text

    ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files. It's widely used to build languages, tools, and frameworks. From a grammar, ANTLR generates a parser that can build and walk parse trees. It’s widely used in academia and industry to build all sorts of languages, tools, and frameworks. Twitter search uses ANTLR for query parsing, with over 2 billion queries a day. The languages for...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    tika-python

    tika-python

    Python binding to the Apache Tika™ REST services

    A Python port of the Apache Tika library that makes Tika available using the Tika REST Server. This makes Apache Tika available as a Python library, installable via Setuptools, Pip and easy to install. To use this library, you need to have Java 7+ installed on your system as tika-python starts up the Tika REST server in the background. To get this working in a disconnected environment, download a tika server file (both tika-server.jar and tika-server.jar.md5, which can be found here) and set...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    TextExtractor

    TextExtractor

    Extracts plain text from a variety of different file types

    TextExtractor extracts plain text from hundreds of different file types, storing the text extracted in suitably named text files. TextExtractor 1.10 works in six different modes :- Instant Mode - Just select any file and extract the text from it. Batch Mode - Select a group of files and extract the text from all of them in one go. Polling Mode - Watch a folder location, processing new files as they appear there. Hierarchical Mode - Extract Text from files in a directory...
    Downloads: 5 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    Chord5

    Chord5

    A version of CHORD4 updated to cooperate with ChordSmith.

    CHORD5 is a ChordPro editor and renderer, useful for formatting and printing song sheets ("lead sheets"). This version of the CHORD program (based on CHORD4) has been modified to add functionality that enables it to cooperate with the ChordSmith program (available at https://sourceforge.net/projects/chordsmith/). This modified version has been renamed to CHORD5. Although it is revised to work with ChordSmith, it also works well as a standalone program. NOTE: If you have...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    EBookGenTools

    EBookGenTools

    EBook Generation Tools - scripts to create ebook formats EPUB, DOC

    EBookGenTools is a set of GNU/Linux shell scripts to process plain text for a book into HTML and electronic book formats. It was developed to create EPUB and DOC files from book text exported from novel writing software such as Manuskript, StoryBook, or your favourite text editor. EBookGenTools builds on the power of other software to create the following ebook formats: - EPUB: Calibre - ebook management - DOC: LibreOffice - free office suite These tools can be used directly to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    The goal of this tool is to simplify and accelerate the process of creating bookmarks for DjVu and PDF documents. You can see additional information on the project page.
    Leader badge
    Downloads: 34 This Week
    Last Update:
    See Project
  • 9

    SimpleTextFormatter

    STF automatically generates documentation

    STF is a system of automatically generating documentation under control of a program or a script. It is frequently used to automatically generate test reports. STF is also used to clean up the output of a process and turn it into a nice looking report.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    wordTabulator
    Program wordTabulator is intended for text analysis. It can generate index of word elements extracted from defined text set. Word elements may be words, N-grams or phrases (syntagmes). The program can process texts as in ordinary 1-byte encoding (ANSI), as in multibyte UTF-8 encoding.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Distributed Proofreaders
    Project has moved to https://github.com/DistributedProofreaders/dproofreaders Distributed Proofreaders is a web application intended to ease the process of converting public domain books and other printed materials into e-texts. The main site is at http://www.pgdp.net By breaking the work into individual pages, many proofreaders can be working on the same book at the same time. This significantly speeds up the proofreading/E-Text creation process. When a proofer elects to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    RPTParse

    Quartus project compilation reports parsing tool

    Altera Quartus creates .rpt files during synthesis, place&route and bitstream generation stages of FPGA project compilation. Verification engineer checks these reports, finds warning messages and put them into own report. But big projects can contain a lot of warnings, and manual warning search is very boring and long process. That's why this parsing tool was created. This software parse report file, classifies warning types and place them into own report, where types of warnings and their...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Roman Life Manager

    Roman Life Manager

    Personal information manager. Free, opensource. Powerful and simple.

    Personal information management application. Powerful and simple. Free and opensoure. Inspired by todos.txt and markdown.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    GeditChangelog

    A Gedit plugin for exact, character based change logging

    This is a plugin for the famous text editor "Gedit" (Available for Linux, WIndows, ...?) When writing a text, GeditChangelog records every character typed. For instance, capture a video of your text creation process. It adds another dimension of text writing, the time dimension. It is very exciting to experiment with this type of writing. Since this is a plugin for gedit, any type of text can be tracked: Programming, Writing short or long text, etc. For a nice visualization, have a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A simple text process library, aims to assist parsing all kinds of text including plain text, XML, HTML, etc., which means it can be used as a simple XML parser or a HTML parser.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    AleSimpleLexer is a C# class that tokenizes given text. Can process lists, numbers, text constants, parentheses and comments. Supports unicode.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    Trim Lines

    Trim Lines removes trailing whitespace from source code files

    This is simple command line tool to batch process source code files to remove trailing whitespaces and convert all line endings to your system native style. Usage example: trimlines d:\Projects\SomeProject\src *.c;*.cpp;*.h;*.hpp;*.inc .svn;.git With this command all files in "d:\Projects\SomeProject\src" including sub-folders that match search masks "*.c;*.cpp;*.h;*.hpp;*.inc" and excluding specified folders ".svn;.git" will be processed. Can process unicode and system native...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    TextMarker
    TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    JTextEncoding
    Small Program which lets you convert the encoding of text files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Simple Text Splitter
    A very simple text splitter that can split text based files (txt, log, srt etc.) into smaller chunks.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 21
    A multi-line string formatting tool to take in any delimited input and process it into a well defined format. Especially useful for creating bulk scripts, SQL and otherwise, from spreadsheet values and other input.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Apostilator is a XML based front end to LaTeX that simplifies the process of writing all kinds of documents from letters to books, reports and articles. It's easy to customize and create templates defining a style for all your documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    WebSHi (Web Syntax Highlighter) is just another code syntax highlighting engine. Though it's written in JavaScript, WebSHi is very fast and scalable, can process 30,000+ lines of code in seconds even on slower browsers like IE6.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    XVCL is a general-purpose language for configuring variants in all sorts of textual documents (including programs). It is based on frame technology. XVCL processor automates the customization process to produce system from specification of variants.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    xReplace is a procedural text editor to define a process of text manipulation which will be performed for multiple files described using XML. xReplace also supports third party plug-ins for extra features. Read the Wiki for more information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB