Showing 1024 open source projects for "text processing"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    A student of the Franklin W. Olin College of Engineering wrote his own extremely customizeable, extraordinarily functional, tabbed text editor in Python and pygtk. Works on windows and POSIX-compliant systems. For the scripter and excessive customizer.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    JTF (Java Text Formatter) is a plain (latin) text simple formatter. JTF will format the inputed text to an well-formed text, with considering: line width, justification, table, cell, padding, and other parameters that correspond to formatting a text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Early Access iText, a PDF generation library in Java
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    ClipView is a Windows clipboard viewer that lets you view content on the Windows clipboard as text and as HTML.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    This is simple and tiny template framework module. It processing is speedy. And provides extract variables, dictionary reference and sequencial variable loop. Import a tinpy module and call the build function, so it became generate document with templat
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    TMG - Text Mining for german language documents
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Java library to convert FCK Editor XHTML into pdf, using iText. The goal is to provide implementation through API and also via Java Servlet; and to embed a PDF Preview into FCK Editor (as in the HTML Preview), referring to a Servlet URL.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    YRX: easily use regular expressions in C. Useful for creating lexers (lexical scanners)and other text processing software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Strip out useless tags and other junk from HTML files. Shrink files, enhance readability of HTML source, promote privacy, and clean HTML exported from Microsoft Word (MS-Word). Run HTMLStrip as-is or customize it with your own regular expressions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Streamline Azure Security with Palo Alto Networks VM-Series Icon
    Streamline Azure Security with Palo Alto Networks VM-Series

    Centrally manage physical and virtualized firewalls with Panorama

    Improve your security posture and reduce incident response time. Use the VM-Series to natively analyze Azure traffic and dynamically drive policy updates based on workload changes.
    Learn more
  • 10
    MonkeyFish is an editor with features oriented toward programmers and programming.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    XiMoL is an XML reader/writer (non-validating) library written in C++. It is a iostream-oriented library based on the STL and not a SAX or DOM library (like Xerces, expat, ...). Each object has its own reader/writer (operator<< and operator>>).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    This is an Emacs-Lisp package that enables easy editting and maintenance of DocBook XML files within GNU Emacs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Word segment utility for Chinese(simplified) language, open for segment strategy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A set of LaTeX packages for different purposes. facsimile is for creating faxes with LaTeX, blacklettert1 lets you use Fraktur fonts, and retro is for typrewriter-based LaTeX documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    (Almost) all a scholar in the Humanities needs (polytonic Greek fonts, stylistic and metrical analysis tools, search engines on TLG and PHI) concentrated in only one Linux Live CD, ready to use everywhere at home or at University, without installation
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    FictionVocabulary is a tool for counting words in text files. It uses vocabularies with word sets e.g. 1000 most useful words etc. Counting occurs through vocabularies, so word list consists only of words which are not represent in vocabularies
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    This is an easy to use tool for the manipulation (case, expand, order,...) of strings and text. Its main purpose is to help users change the format and look of texts efficiently and at large scales. This JAVA utility will run on the major pc platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Prototype for a framework and user interface for combining various structured search and document clustering techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    SiSMS alows you to read Siemens SMI/SMO files, which are archieved SMS (short messages) by Siemens mobile phones. SiSMS supports EMS, message saving and printing, searching in SMI/SMO files, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    The converter performs automatically the full process of converting the files of a C project into the equivalent C++ files. Classes are created, var and functions becomes attributes and methods and the changes are propagated into all files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Tabfmt is a command line utility to format tabular data. It reads lines from one or more files or from standard input, breaks the lines into fields given a set of field delimiters, and prints a table with constant-width columns to standard output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    SDValidator implements a generalized method for validating document structure and content. The application validates based on user-defined Structured Document Definitions and provides an environment for validation, SDD development, and document editting.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Estraier is a personal full-text search system for web sites, local file systems, mail boxes, and so on. Estraier has flexible interface and it can handle multilingual documents and various file formats with external plug-ins.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Gothe is a writing aid in picking the most appropriate prepositions or synonyms in a text. It does this by checking the frequency of appearance of different combination on Google.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB