Showing 34 open source projects for "text processing"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 1
    Midnight Commander for Windows

    Midnight Commander for Windows

    Native Midnight Commander 4.8.33 for Windows/Win32

    Windows XP+/32 bit native port of GNU Midnight Commander, based on the current 4.8.33 development stream. Midnight Commander (also known as mc) is a free cross-platform orthodox file manager and a clone of Norton Commander. Supports both the recent Windows 10 enhanced plus legacy consoles, 16 and 256+ colour modes are available within either. Features include the ability work with common archive formats as if they were simply another directory, and function as an SFTP/FTP...
    Leader badge
    Downloads: 181 This Week
    Last Update:
    See Project
  • 2
    Swiss File Knife

    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file,...
    Leader badge
    Downloads: 400 This Week
    Last Update:
    See Project
  • 3
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4

    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5

    docx_cleanup

    free DOCX Cleanup - Embedded Font Remover - easy batch processing

    A lightweight, drag-and-drop freeware Windows utility that instantly reduces Microsoft Word (.docx) file sizes by stripping out heavy, embedded fonts without corrupting the document structure. Drag and Drop: No installation. Just drop a file or folder onto the .exe. Batch Processing: Automatically process an entire folder of .docx files (with a choice of including or excluding subfolders). 100% Safe for Word: Surgically removes the saved font files (if any) from inside the DOCX file...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    AI File Sorter

    AI File Sorter

    Local AI file organization with categorization and rename suggestions

    AI File Sorter is a cross-platform desktop application that uses AI (local LLMs run on your computer) to organize files and suggest meaningful file names based on real content, not just filenames or extensions. The app can analyze images locally and propose descriptive rename suggestions (for example, IMG_2048.jpg → clouds_over_lake.jpg). It can also analyze document text to improve categorization and renaming. Supported formats include PDF, DOCX, XLSX, PPTX, ODT, ODS, ODP, and common...
    Leader badge
    Downloads: 516 This Week
    Last Update:
    See Project
  • 7

    Safe Harbor Deidentification

    Safe Harbor Deidentification for medical documents

    Phalanx - Deidentify Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    PanDocElectron

    Graphical User Interface for PanDoc for Linux, Mac & Windows

    PanDoc Graphical User Interface implemented with Electron for Linux, Mac and Windows. It support users in converting source documents into various other formats like docx, odt, html and reveal documentation. The zip files contain the full source code because PanDocElectron is written in HTML/Javascript. Electron is used more or less as browser that runs the HTML/Javascript application. [Download PanDocElectron](https://sourceforge.net/p/pandocelectron/wiki/Home/) Extract the zip-file...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    XML Truncator-Fixer

    XML Truncator-Fixer

    Truncates XML at first error-uses the xmllint to add correct end tags.

    XML is intolerant of errors. This program will automatically salvage XML files by finding the first XML error, then truncating a configurable number of characters before the error and finally using the great xmllint to recover the xml file by automatically adding the correct end tags. The program starts with a default of 50 characters truncation. This truncation value may need to be changed if the truncation happens in the middle of a complex tag as xmllint may not know how to recover the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    GitSync is a shell script designed to simplify the usage of the version control system GIT (see www.git-scm.com for more information) by providing a "do everything to sync my repository" command.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Replasol is basically Visual Fart. The purpose is to create a simple front end GUI for FART (Find and Replace Text) in JAVA that allows you to use all of the command line options quickly and easily, as well as store .wind files, containing multiple Farts
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Drag-and-drop files/directories/HTML-URLs into a Java GUI. Perform text operations on the files into output files. Operations include concatention, text and regex editing, and other file/string/row/column/script operations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    PTools is a set of useful tools written in Pascal. It includes: scientific calculator, archiver, text editor, remote adminitration and more. It is designed to be portable across operating systems, specially Java-based mobiles, Windows and Unixes.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14
    Filecmp is a command-line application that gets two filenames as argument and outputs the comparison between them - e.g. if they are the same or not... it may look irrelevant but sometime it's very useful, specially inside scripts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    This program is used as a tool to check for ASCII code and strip it out to a new file to recover documents that got corrupted or find hidden data in viruses that contain text by the virus writer or spies puting comments in images or other files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    This application reads the output of Web forms posted on your website (usually via email) and converts them to csv files for importing into a database, or managing in Excel
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Smart Id3 Tag from filename. Rename file from Tag. Full Regex Search & Replace. Intuitive parsing. Smart file numbering. Artist Album report. Split compilation album. Configurable text processing. mp3, flac, ogg, mp4, m4a, mp4p + more. Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Moved to http://code.google.com/p/evil-programmers/
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    csvtoxml will convert parse csv comma separated value data into xml. a command line console utility that uses stdin and stdout pipe with more cat, pr, wget, zip, find -exec for added functionality. file stream term c c++ small fast parser unix win osx
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Visual xsltproc is a tool which help to write xslt file, and debug it to find errors. It writes xml, and generates xml (Syntax highlighting of XML & line Nr.). Finally if the result is XSL-FO it generates the pdf on Apache FOP java. Build on QT4.2.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    QReader - The High Quality Book Reader for Smart Mobiles. Symbian OS - different phones, different supported file types, designed to be most useful book reader for PDA/cellular.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    This project utilizes the iPod's ability to store and display short text files to allow you to view RSS Feeds, Weather Forecasts, Movie Showtimes, and other text documents on your iPod when you are away from your computer
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    A graphical MS Windows version of the ever useful "tail" command in *nix. Features RegEx highlighting, multiple notification methods (Flash, Beep, Email, Balloon), alternating line colors for readability, Threshold Seperators, and simple XML Config
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A backup/transfer files program to move and process files between computers in a production (industrial) process. The processing of file begins when it are created or copied to directory and they can be redirected to other machine to be used.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ZipDB is a program that allows databases to be made easily and efficiently using compressed zip files. An easy-to-use gui is the key feature allowing quick access to all files in the archive, and since the archives are .zip they can always be opened on
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB