10 projects for "text processing" with 2 filters applied:

  • 1
    The Art of Command Line

    Master the command line, in one page

    The Art of Command Line is a single, highly curated page of tips that distills years of Unix command-line experience into practical, memorable guidance. It emphasizes fluency: small habits and commands that compound into faster debugging, data wrangling, and system navigation. The content spans basic shell usage, text processing with tools like grep/sed/awk, networking and performance inspection, and advice for working safely with root and destructive commands. Many entries highlight lesser-known flags or idioms that save keystrokes or avoid pitfalls, and the list aims to be dense but scannable. It is written for Linux first while acknowledging macOS and Windows differences where relevant. ...
    Downloads: 0 This Week
  • 2
    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives, and other difficult-to-access resources. It is a command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image metadata, scanned documents, Maven dependencies, and web resources. CRGREP searches resources nested within other resources in any combination and to any depth, for example text within a document within a zip archive. Here you...
    Downloads: 1 This Week
  • 3
    Flat file extractor (ffe) reads and parses different flat file structures and prints them in different formats. ffe is a command line tool developed in a GNU/Linux environment and distributed under the GPL. The project has moved to https://github.com/igitur/ffe
    Downloads: 7 This Week
  • 4
    pdfsandwich generates "sandwich" OCR PDF files: PDF files that contain only images (and no editable text) are processed by optical character recognition (OCR), and the recognized text is added to each page invisibly "behind" the images. pdfsandwich is a command line tool intended for OCR of scanned books or journals. It is able to recognize the page layout even for multicolumn text. Essentially, pdfsandwich is a wrapper script which calls the following binaries: convert, unpaper, tesseract, gs, and hocr2pdf (if tesseract < 3.03). ...
    Downloads: 342 This Week
  • 5
    The tlve program is a command-line tool for parsing different TLV (tag-length-value) structures and printing them in various text-based formats. tlve is developed in a GNU/Linux environment and distributed under the GPL.
    Downloads: 1 This Week
  • 6
    binreplace is a command line tool to search and replace binary sequences in a file. It can be very powerful when used together with grep and sed. It uses a prefix tree internally for searching and is efficient even with millions of search terms.
    Downloads: 0 This Week
  • 7
    Random name generator library with rule files. Fast, lightweight, and easy to use. It can generate rule files from custom text. You can easily integrate it into your application, or use the command line tool to generate the most suitable name for your needs.
    Downloads: 0 This Week
  • 8
    Chordeus is yet another Chord Pro to PDF converter that creates nice-looking guitar chord sheets. You can create single sheets or a whole songbook using the command line tool or a simple wizard-like GUI.
    Downloads: 0 This Week
  • 9
    xmlsnipe is a command line tool to display, remove, and modify the content of nodes within an XML document.
    Downloads: 0 This Week
  • 10
    Atox is a fully customizable Python library and command-line tool for converting plain text into XML.
    Downloads: 0 This Week