Showing 14 open source projects for "text processing"

  • 1

    ReductStore

    The fastest time series object store for Edge AI

    History storage and management of images, vibration data, text, labels, and more - all in one place with the highest performance. Merge blob and time series functionalities, reducing the need for multiple databases. Customize real-time data retention policies and replication strategies. Store billions of time-stamped blobs with AI labels and access them with low latency. Outperform other databases with a customized solution for time-series object data. Capture and access blob data as time...
    Downloads: 0 This Week
  • 2

    sed-pcre2

    GNU sed with PCRE2 regexp support
    Downloads: 1 This Week
  • 3

    The Art of Command Line

    Master the command line, in one page

    The Art of Command Line is a single, highly curated page of tips that distills years of Unix command-line experience into practical, memorable guidance. It emphasizes fluency: small habits and commands that compound into faster debugging, data wrangling, and system navigation. The content spans basic shell usage, text processing with tools like grep/sed/awk, networking and performance inspection, and advice for working safely with root and destructive commands. Many entries highlight lesser-known flags or idioms that save keystrokes or avoid pitfalls, and the list aims to be dense but scannable. It is written for Linux first while acknowledging macOS and Windows differences where relevant. ...
    Downloads: 0 This Week
  • 4

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives, and other difficult-to-access resources. It is a command-line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image metadata, scanned documents, Maven dependencies, and web resources. CRGREP searches resources nested within resources in any combination and to any depth, for example text within a document within a zip archive. Here you...
    Downloads: 1 This Week
  • 5
    Flat file extractor (ffe) reads and parses different flat file structures and prints them in different formats. ffe is a command-line tool developed in a GNU/Linux environment and distributed under the GPL. The project has moved to https://github.com/igitur/ffe
    Downloads: 7 This Week
  • 6

    CSVfix

    Command-line tool specifically designed to deal with CSV data

    ...Unfortunately, the CSV files you are given, or are required to produce, never seem to be in quite the right format for your particular business application. And because of the structure of CSV records, using standard text processing tools like sed, awk and perl is not as simple as it might be. CSVfix aims to provide a solution to these problems. It is a command-line stream editor specifically designed to deal with CSV data. With it you can, among other things:
    Usage: http://csvfix.byethost5.com/csvfix15/csvfix.html?csvfix.html?Usage.html?i=1&i=2
    Downloads: 48 This Week
  • 7
    pdfsandwich generates "sandwich" OCR PDF files: PDF files that contain only images (and no editable text) are processed by optical character recognition (OCR), and the recognized text is added to each page invisibly "behind" the images. pdfsandwich is a command-line tool intended for OCR of scanned books or journals. It can recognize the page layout even for multicolumn text. Essentially, pdfsandwich is a wrapper script which calls the following binaries: convert, unpaper, tesseract, gs, and hocr2pdf (if tesseract < 3.03). ...
    Downloads: 342 This Week
  • 8
    The tlve program is a command-line tool for parsing different TLV (tag-length-value) structures and printing them in various text-based formats. tlve is developed in a GNU/Linux environment and distributed under the GPL.
    Downloads: 1 This Week
  • 9

    isbntools

    A command line tool to extract, transform and get metadata for ISBNs

    As of 2015-06-02, this project is no longer under active development.
    Downloads: 0 This Week
  • 10
    binreplace is a command-line tool to search and replace binary sequences in a file. It can be very powerful when used together with grep and sed. It uses a prefix tree internally to search, so it is efficient even with millions of search terms.
    Downloads: 0 This Week
  • 11
    Random name generator library with rule files. Fast, lightweight, and easy to use. It can generate rule files from custom text. You can easily integrate it into your application, or use the command-line tool to generate the most suitable name for your needs.
    Downloads: 0 This Week
  • 12
    Chordeus is yet another Chord Pro to PDF converter that creates nice-looking guitar chord sheets. You can create single sheets or a whole songbook using the command-line tool or a simple wizard-like GUI.
    Downloads: 0 This Week
  • 13
    xmlsnipe is a command-line tool to display, remove, and modify the content of nodes within an XML document.
    Downloads: 0 This Week
  • 14
    Atox is a fully customizable Python library and command-line tool for converting plain text into XML.
    Downloads: 0 This Week