Showing 41 open source projects for "batch text processing"

View related business solutions
  • Cut Cloud Costs with Google Compute Engine Icon
    Cut Cloud Costs with Google Compute Engine

    Save up to 91% with Spot VMs and get automatic sustained-use discounts. One free VM per month, plus $300 in credits.

    Save on compute costs with Compute Engine. Reduce your batch jobs and workload bill 60-91% with Spot VMs. Compute Engine's committed use offers customers up to 70% savings through sustained use discounts. Plus, you get one free e2-micro VM monthly and $300 credit to start.
    Try Compute Engine
  • Host LLMs in Production With On-Demand GPUs Icon
    Host LLMs in Production With On-Demand GPUs

    NVIDIA L4 GPUs. 5-second cold starts. Scale to zero when idle.

    Deploy your model, get an endpoint, pay only for compute time. No GPU provisioning or infrastructure management required.
    Try Free
  • 1
    Bowtie, an ultrafast, memory-efficient short read aligner for short DNA sequences (reads) from next-gen sequencers. Please cite: Langmead B, et al. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25.
    Leader badge
    Downloads: 414 This Week
    Last Update:
    See Project
  • 2
    Swiss File Knife

    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file,...
    Leader badge
    Downloads: 408 This Week
    Last Update:
    See Project
  • 3

    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    ansifilter

    ansifilter

    ANSI sequence filter

    Ansifilter handles text files containing ANSI terminal escape codes. The command sequences may be stripped or be interpreted to generate formatted output (HTML, RTF, TeX, LaTeX, BBCode and Pango Markup).
    Downloads: 3 This Week
    Last Update:
    See Project
  • Go From Idea to Deployed AI App Fast Icon
    Go From Idea to Deployed AI App Fast

    One platform to build, fine-tune, and deploy. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    AI File Sorter

    AI File Sorter

    Local AI file organization with categorization and rename suggestions

    AI File Sorter is a cross-platform desktop application that uses AI to organize files and suggest meaningful file names based on real content, not just filenames or extensions. The app can analyze image files locally and propose human-readable rename suggestions (for example, IMG_2048.jpg → clouds_over_lake.jpg). It can also analyze the text content of documents to improve categorization and renaming. Supported formats include PDF, DOCX, XLSX, PPTX, ODT, ODS, ODP, and common text files....
    Downloads: 261 This Week
    Last Update:
    See Project
  • 7
    Windows Native Unix Shell (wnus prompt)

    Windows Native Unix Shell (wnus prompt)

    A comprehensive Unix/Linux-like shell environment for Windows 7+ 64bit

    This project is implemented entirely in C++ without external dependencies. Windows Native Unix Shell (wnus prompt) (pronounced Wenus) brings the power and familiarity of bash sh ash commands and common terminal utilities to native Windows, with full NTFS support and Windows API integration. . It contains 342 fully implemented commands built-in , encompassing file managements operation and so much more. It has a comprehensive help system. This doesn't require WSL to work, or any...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    AnalysePlugin
    Shorten your time of reading mega bytes of log files! DE 2016-01 Supporting Don HO's NotePad++ see http://notepad-plus-plus.org/ This sources are a dockable pattern search plugin for Notepad++ version 5.1 or later. With this plugin you can search for multiple patterns in any of the opened documents in NotePad++. You may want to tune your search using all fancy tricks from NPP like regular expressions or escaped patterns and give each of the searches different colors. The...
    Leader badge
    Downloads: 604 This Week
    Last Update:
    See Project
  • 9

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian) GPU/CPU Encode/Decode/Post Processing Stream statistics CCTV Adaptive hls streams Load balancing Temporary urls HLS push EPG scanning Subtitles to text conversions AD insertion Logo overlay Video effects Relays Timeshifts Catchups Playlists Restream/Transcode from online streaming services like Youtube, Twitch Mozaic Many Outputs Physical Inputs Streaming Protocols File Formats Presets Vods/Series server-side support Pay per view channels Channels on demand HTTP Live Streaming (HLS) server-side support Public API, client server communication via JSON RPC Protocol gzip compression Deep learning video analysis Supported deep learning frameworks: Tensorflow NCSDK Caffe ML Hardware:
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Across 150+ Cloud Services Icon
    $300 in Free Credit Across 150+ Cloud Services

    VMs, containers, AI, databases, storage | build anything. No commitment to start.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale with Google Cloud.
    Start Building Free
  • 10
    UltraDefrag

    UltraDefrag

    Increase your system performance with this easy to use tool

    Please note, here you can find the latest open source release of UltraDefrag (7.1.4). The most recent UltraDefrag release (15.0.0) can be found on our official website: https://ultradefrag.net UltraDefrag is a disk defragmenter for Windows, which supports defragmentation of locked system files by running during the boot process. It is easy to use without any complicated scripting or a huge load of configuration settings. You can filter the files processed by size, number of fragments,...
    Leader badge
    Downloads: 1,000 This Week
    Last Update:
    See Project
  • 11

    Safe Harbor Deidentification

    Safe Harbor Deidentification for medical documents

    Phalanx - Deidentify Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    QStripper
    Convert Sinclair QL word processing (Quill etc) documents to pdf, html, text and DocBook XML.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    IDTE- ID3 Tag Editor

    IDTE- ID3 Tag Editor

    A Smarter Way to Tag FLAC, Vorbis, Lyrics, WMA, WavPack, and ID3 Tags

    ...IDTE offers a wide range of advanced features: • Rename files using tag information • Export and manage tag data • Create and manage playlists • Search online for missing or incomplete tags • Fetch lyrics and album artwork • Full CUE sheet support • Batch processing and multi-file editing • SFX, visualizations, and more A new, minimalistic version of IDTE is also available for download: https://sourceforge.net/projects/idte ⚠️ Important Notice Tagging MP4 video files and WAV audio files using IDTE is not recommended. WAV tag support is ambiguous and inconsistently handled across players (see: http://wiki.audacityteam.org/wiki/WAV).
    Downloads: 37 This Week
    Last Update:
    See Project
  • 14

    bnf2xml

    simple BNF parser makes xml markup of matches

    bnf2xml a simple BNF parser that takes text as input, searches according to a BNF query file, and outputs text marked up by the xml labels that show context. bnf2xml is as simple to use as any text binary ie, awk(1) grep(1). bnf2xml does not require C API because it outputs simple xml labeling. README is visible on file dl page. EXAMPLE: $ echo "hi" | bnf2xml patternfile <word><alph>h</alph><alph>i</alph></word> or <gas>hydrogen iodide</gas> patternfile says how to find...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    TEA is a text editor that provides a wide range of text-processing functions (over 100) and the syntax highlighting. There are two branches of TEA: Qt-based and GTK-based.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16

    Hierarchical cluster engine HCE

    Hierarchical Cluster Engine project

    The main idea of this project – to implement the solution that can be used to: construct custom network mesh or distributed network cluster structure with several relations types between nodes, formalize the data flow processing goes from upper node level central source point to down nodes and backward, formalize the management requests handling from multiple source points, support native reducing of multiple nodes results (aggregation, duplicates elimination, sorting and so on), internally support powerful full-text search engine and data storage, provide transactions-less and transactional requests processing, support flexible run-time changes of cluster infrastructure, have many languages bindings for client-side integration APIs in one product build on C++ language... ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    A utility to read/write from: pipes, sockets, standard handles, files, block devices, NT Native API objects, file mapping/section objects, comm ports, etc. Supports various text/data conversions, multiple inputs/outputs, and various IO types/options.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    etherpaste

    Cut and paste across computers on a network with EtherClip.

    With no programme to do it with, it is hard to move short snippets of information across computers on a network; one can not simply press Shift-Delete on one computer and Shift-Insert on the other. This requires a "shared clipboard"; EtherPaste (formerly known as NetClip) fulfills this niche quite well, at least for older versions of Windows. This project's aim is to make EtherPaste work for Windows Vista, 7, and 8.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    7zip Batch Compression

    7zip Batch Compression

    Small Windows-GUI for Batch-Compression with 7zip

    My first try of a GUI with wxWidgets. Now 7zip can do "each file to a seperate archive" from Winrar. Sends commands to the CMD to process through the list, though maybe Admin rights are required. Maybe a Linux-Version later...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    Find And Replace Text command line utility. New & improved version of the well-known grep command, with advanced features such as: case-adaption of the replace string; find (& replace) in filenames, auto CVS edit. Moved to https://github.com/lionello/fart-it
    Leader badge
    Downloads: 31 This Week
    Last Update:
    See Project
  • 21
    Search files for a text string. Like grep but really dumbed down
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    EzerKb is a virtual keyboard for Windows. It emulates a keyboard with, for example, Russian, Greek, or Hebrew characters without actually installing a keyboard driver for that language. EzerKb works with most (but not all) Windows programs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    CMDREGED is a simple text-mode registry editor for Windows. The goal is to provide a fast way of making changes to the registry, through simple commands, as well as scripts (either batch/cmd scripts or registry files).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Yet Another Meta Languge Indentation Tool YAMLIT can indent or compress HTML and XML for easy reading, or bandwidth saving respecively. Similar to HTML Tidy but will NOT verify the document against a DTD.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Tiny Notes is a note/code organizer. It is meant as a small and simple tool that helps you organize small snippets of text, through which you can quickly search. Supports syntax highlighting.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB