Showing 35 open source projects for "text processing"

  • 1
    Bowtie, an ultrafast, memory-efficient short read aligner for short DNA sequences (reads) from next-gen sequencers. Please cite: Langmead B, et al. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25.
    Downloads: 382 This Week
  • 2
    ansifilter

    ANSI sequence filter

    Ansifilter handles text files containing ANSI terminal escape codes. The command sequences may be stripped or be interpreted to generate formatted output (HTML, RTF, TeX, LaTeX, BBCode and Pango Markup).
    Downloads: 4 This Week
  • 3
    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file,...
    Downloads: 400 This Week
  • 4
    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout AI-driven data processing tool written in C++20, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, enabling efficient text extraction, web data extraction, and document analysis. For businesses, it offers comprehensive document format support and the ability to extract insights from email boxes, databases, and websites using AI. ...
    Downloads: 1 This Week
  • 5

    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
  • 6
    AI File Sorter

    Local AI file organization with categorization and rename suggestions

    AI File Sorter is a cross-platform desktop application that uses AI (local LLMs run on your computer) to organize files and suggest meaningful file names based on real content, not just filenames or extensions. The app can analyze images locally and propose descriptive rename suggestions (for example, IMG_2048.jpg → clouds_over_lake.jpg). It can also analyze document text to improve categorization and renaming. Supported formats include PDF, DOCX, XLSX, PPTX, ODT, ODS, ODP, and common...
    Downloads: 516 This Week
  • 7
    AnalysePlugin
    Shorten the time you spend reading megabytes of log files! DE 2016-01 Supporting Don Ho's Notepad++, see http://notepad-plus-plus.org/ These sources are a dockable pattern search plugin for Notepad++ version 5.1 or later. With this plugin you can search for multiple patterns in any of the documents open in Notepad++. You may want to tune your search using all the fancy tricks from NPP, like regular expressions or escaped patterns, and give each of the searches a different color. The...
    Downloads: 563 This Week
  • 8

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud. Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian); GPU/CPU Encode/Decode/Post Processing; Stream statistics; CCTV; Adaptive hls streams; Load balancing; Temporary urls; HLS push; EPG scanning; Subtitles to text conversions; AD insertion; Logo overlay; Video effects; Relays; Timeshifts; Catchups; Playlists; Restream/Transcode from online streaming services like Youtube, Twitch; Mozaic; Many Outputs; Physical Inputs; Streaming Protocols; File Formats; Presets; Vods/Series server-side support; Pay per view channels; Channels on demand; HTTP Live Streaming (HLS) server-side support; Public API, client server communication via JSON RPC Protocol gzip compression; Deep learning video analysis. Supported deep learning frameworks: Tensorflow, NCSDK, Caffe. ML Hardware:
    Downloads: 1 This Week
  • 9

    Safe Harbor Deidentification

    Safe Harbor Deidentification for medical documents

    Phalanx - Deidentify. The Safe Harbor Deidentification mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators that write text offsets as output. It uses the Safe Harbor deidentification method.
    Downloads: 0 This Week
  • 10
    QStripper
    Convert Sinclair QL word processing (Quill etc) documents to pdf, html, text and DocBook XML.
    Downloads: 0 This Week
  • 11

    bnf2xml

    simple BNF parser makes xml markup of matches

    bnf2xml is a simple BNF parser that takes text as input, searches it according to a BNF query file, and outputs the text marked up with XML labels that show context. bnf2xml is as simple to use as any text-processing binary, such as awk(1) or grep(1). bnf2xml does not require a C API because it outputs simple XML labeling. The README is visible on the file download page. EXAMPLE: $ echo "hi" | bnf2xml patternfile <word><alph>h</alph><alph>i</alph></word> or <gas>hydrogen iodide</gas> patternfile says how to find...
    Downloads: 0 This Week
  • 12
    TEA is a text editor that provides a wide range of text-processing functions (over 100) and syntax highlighting. There are two branches of TEA: Qt-based and GTK-based.
    Downloads: 8 This Week
  • 13

    Hierarchical cluster engine HCE

    Hierarchical Cluster Engine project

    The main idea of this project is to implement a solution that can be used to: construct a custom network mesh or distributed network cluster structure with several relation types between nodes; formalize data flow processing from the central source point at the upper node level down to the lower nodes and back; formalize the handling of management requests from multiple source points; support native reduction of results from multiple nodes (aggregation, duplicate elimination, sorting and so on); internally support a powerful full-text search engine and data storage; provide transaction-less and transactional request processing; support flexible run-time changes of cluster infrastructure; have many language bindings for client-side integration APIs in one product built on the C++ language... ...
    Downloads: 0 This Week
  • 14
    A utility to read/write from: pipes, sockets, standard handles, files, block devices, NT Native API objects, file mapping/section objects, comm ports, etc. Supports various text/data conversions, multiple inputs/outputs, and various IO types/options.
    Downloads: 0 This Week
  • 15

    etherpaste

    Cut and paste across computers on a network with EtherClip.

    Without a programme for the job, it is hard to move short snippets of information between computers on a network; one cannot simply press Shift-Delete on one computer and Shift-Insert on the other. This requires a "shared clipboard"; EtherPaste (formerly known as NetClip) fills this niche quite well, at least for older versions of Windows. This project's aim is to make EtherPaste work on Windows Vista, 7, and 8.
    Downloads: 0 This Week
  • 16
    Find And Replace Text command line utility. New & improved version of the well-known grep command, with advanced features such as case adaptation of the replace string, find (& replace) in filenames, and auto CVS edit. Moved to https://github.com/lionello/fart-it
    Downloads: 64 This Week
  • 17
    Search files for a text string. Like grep, but really dumbed down.
    Downloads: 0 This Week
  • 18
    EzerKb is a virtual keyboard for Windows. It emulates a keyboard with, for example, Russian, Greek, or Hebrew characters without actually installing a keyboard driver for that language. EzerKb works with most (but not all) Windows programs.
    Downloads: 0 This Week
  • 19
    Yet Another Meta Language Indentation Tool. YAMLIT can indent or compress HTML and XML for easy reading or bandwidth saving, respectively. Similar to HTML Tidy, but will NOT verify the document against a DTD.
    Downloads: 0 This Week
  • 20
    Tiny Notes is a note/code organizer. It is meant as a small and simple tool that helps you organize small snippets of text, through which you can quickly search. Supports syntax highlighting.
    Downloads: 0 This Week
  • 21
    A Java wrapper for Microsoft's IFilter technology.
    Downloads: 0 This Week
  • 22
    Removes specified trailing characters from the end of a string. Useful for removing filename extensions for use in scripts.
    Downloads: 0 This Week
  • 23
    Moved to http://code.google.com/p/evil-programmers/
    Downloads: 1 This Week
  • 24
    csvtoxml parses CSV (comma-separated value) data and converts it into XML. It is a command-line console utility that reads stdin and writes stdout, so it can be piped with more, cat, pr, wget, zip, and find -exec for added functionality. Keywords: file stream term c c++ small fast parser unix win osx
    Downloads: 0 This Week
  • 25
    wxCocoaDialog is a multi-platform port of the CocoaDialog application for OS X that allows the use of common GUI controls such as file selectors, text input, progress bars, yes/no confirmations and more with a command-line application.
    Downloads: 0 This Week