Search Results for "text processing" - Page 17

Showing 1568 open source projects for "text processing"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Text Hex Convert v2.1.1

    Text Hex Convert v2.1.1

    THC - Convert Hex, or String with ease!

    A tool for converting hex values with ease! Download now, and you wont get your head hurt again. This tool converts Text values to Hex values, and vice-versa. Written in VB.NET. Created solely for aiding ROM Hacking at first, but became rather an important tool. Source Code: https://sourceforge.net/u/kakarot1212/profile/
    Downloads: 8 This Week
    Last Update:
    See Project
  • 2
    prose NLP

    prose NLP

    Golang library for text processing

    Prose is a natural language processing library for Go, designed for text analysis tasks like tokenization, named entity recognition, and dependency parsing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Synonyms

    Synonyms

    Chinese synonyms, chat robot, intelligent question and answer toolkit

    Chinese Synonyms for natural language processing and understanding. Better Chinese synonyms, chatbot, intelligent question and answer toolkit. synonymsCan be used for many tasks in natural language understanding, text alignment, recommendation algorithms, similarity calculation, semantic shifting, keyword extraction, concept extraction, automatic summarization, search engines, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Parsr

    Parsr

    Transforms PDF, Documents and Images into Enriched Structured Data

    Parsr is an open-source document parsing tool that converts PDFs, scanned images, and other structured documents into structured, machine-readable data formats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    MyDiary1000

    MyDiary1000

    Its A Diary/TextEditor that encrypts your text data using images

    This is a diary that allows you to digitally write entries and view them. What makes this a masterpiece from its counterparts is how user data is created and abstracted. User data is protected and stored in PNG images, the image it self is not encrypted but the data inside the image is encrypted. If you open any one of the created PNG images, they look like blank images!. These "normal" looking images protect your information from prying eyes. The simplicity and elegance of my program...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    fastNLP

    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    ...Provide a variety of neural network components and recurrence models (covering tasks such as Chinese word segmentation, named entity recognition, syntactic analysis, text classification, text matching, metaphor resolution, summarization, etc.). Trainer provides a variety of built-in Callback functions to facilitate experiment recording, exception capture, etc. Automatic download of some datasets and pre-trained models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    htmleditor.py

    A Python based HTML and CSS Editor

    Requires PyQt >= 5.2 QsciScintilla >= 2.8 Python >=3.4
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Blackfriday

    Blackfriday

    Blackfriday, a markdown processor for Go

    Blackfriday is a Markdown processor implemented in Go. It is paranoid about its input (so you can safely feed it user-supplied data), it is fast, it supports common extensions (tables, smart punctuation substitutions, etc.), and it is safe for all utf-8 (unicode) input. HTML output is currently supported, along with Smartypants extensions. It started as a translation from C of Sundown. Blackfriday itself does nothing to protect against malicious content. If you are dealing with user-supplied...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    CC-Net

    CC-Net

    Tools to download and cleanup Common Crawl data

    cc_net provides tools to download, segment, clean, and filter Common Crawl to build large-scale text corpora, including monolingual datasets and the multilingual CC-100 collection introduced in the associated paper. It includes pipelines to fetch snapshots, extract text, de-duplicate, identify language, and apply quality filtering based on heuristics and language models. The outputs are intended for pretraining language models and for creating standardized corpora that can be reproduced or...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10

    QVASIMODO's String Tools

    This project is a quick way of applying macros to a portion of text.

    This project is basically a quick way of applying macros, i.e., sequences of editing commands, to a portion of text. You create a "ruleset", which is basically a sequence of "rules" to be sequentially applied to the source text. The original text is entered in the top left box, while the result appears in the top right box. Program configuration is stored in a sqlite file in a '.panglossa' directory in the user's home folder. Remember that the "rules", or commands, will be...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    up

    up

    A tool for writing Linux pipes with instant live preview

    ...The main goal of the Ultimate Plumber is to help interactively and incrementally explore textual data in Linux, by making it easier to quickly build complex pipelines, thanks to a fast feedback loop. This is achieved by boosting any typical Linux text-processing utils such as grep, sort, cut, paste, awk, wc, perl, etc., etc., by providing a quick, interactive, scrollable preview of their results. To start using up, redirect any text-emitting command (or pipeline) into it. Use PgUp/PgDn and Ctrl-[←]/Ctrl-[→] for basic browsing through the command output. In the input box at the top of the screen, start writing any bash pipeline; then press Enter to execute the command you typed, and the Ultimate Plumber will immediately show you the output of the pipeline in the scrollable window.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    NLP.js

    NLP.js

    An NLP library for building bots

    ...Sentiment Analysis for phrases (with negation support). Named Entity Recognition and management, multi-language support, and acceptance of similar strings, so the introduced text does not need to be exact. Natural Language Processing Classifier, to classify an utterance into intents. NLP Manager, a tool able to manage several languages, the Named Entities for each language, the utterances, and intents for the training of the classifier, and for a given utterance return the entity extraction, the intent classification and the sentiment analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Flat file extractor can be used for reading and parsing different flat file structures and printing them in different formats. ffe is a command line tool developed in GNU/Linux environment and it is distributed under GPL. Project moved to https://github.com/igitur/ffe
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    Pure Bash Bible

    Pure Bash Bible

    A collection of pure bash alternatives to external processes

    ...Its goal is to reduce reliance on external tools like sed, awk, or grep, which can slow down scripts and add unnecessary dependencies. The project is organized as a reference book of function-based code snippets, each showcasing practical solutions for string manipulation, text processing, file operations, and more. By relying exclusively on Bash built-ins, these methods can make scripts faster, more portable, and easier to maintain. Every snippet is linted with shellcheck, and many have accompanying tests, ensuring correctness and usability. The project is not only a coding resource but also an educational tool for Bash users who want to better understand the full power of the language.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    FAR - Find And Replace
    Search and replace operations on file content accross multiple files. Recursive operations within entire directory trees. FAR comes with support for regular expressions (regex) over multiple lines, automatic backup and various character encodings. Run grep like extractions to condense or rearrange sources, or perform bulk file renaming.
    Downloads: 27 This Week
    Last Update:
    See Project
  • 16
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    CSVfix

    CSVfix

    Command-line tool specifically designed to deal with CSV data

    ...Unfortunately, the CSV files you are given, or are required to produce, never seem to be in quite the right format for your particular business application. And because of the structure of CSV records, using standard text processing tools like sed, awk and perl is not as simple as it might be. Usage: http://csvfix.byethost5.com/csvfix15/csvfix.html?csvfix.html?Usage.html?i=1&i=2 CSVfix aims to provide a solution to these problems. It is a command-line stream editor specifically designed to deal with CSV data. With it you can, among other things:
    Downloads: 42 This Week
    Last Update:
    See Project
  • 19
    GluonNLP

    GluonNLP

    NLP made easy

    GluonNLP is a toolkit that helps you solve NLP problems. It provides easy-to-use tools that helps you load the text data, process the text data, and train models. To facilitate both the engineers and researchers, we provide command-line-toolkits for downloading and processing the NLP datasets. Gluon NLP makes it easy to evaluate and train word embeddings. Here are examples to evaluate the pre-trained embeddings included in the Gluon NLP toolkit as well as example scripts for training embeddings on custom datasets. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    DeText

    DeText

    A Deep Neural Text Understanding Framework

    DeText is a Deep Text understanding framework for NLP-related ranking, classification, and language generation tasks. It leverages semantic matching using deep neural networks to understand member intents in search and recommender systems. As a general NLP framework, DeText can be applied to many tasks, including search & recommendation ranking, multi-class classification and query understanding tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A collection of small utilities for: data extraction (text or binary files), data buffering, message queue control, column addition, date/time manipulation, and data recovery testing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    NLP-Models-Tensorflow

    NLP-Models-Tensorflow

    Gathers machine learning and Tensorflow deep learning models for NLP

    NLP-Models-Tensorflow is a collection of natural language processing model implementations built using the TensorFlow deep learning framework. The repository provides numerous examples of neural network architectures used in modern NLP research and applications, including text classification, language modeling, machine translation, and sentiment analysis. Each model implementation is designed to illustrate how common NLP architectures operate, such as recurrent neural networks, convolutional models for text processing, and transformer-style attention mechanisms. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Leseratte is a Java parser for German written language. Currently, it contains a German lexicon (based on the Wiktionary), inflexion rules, a grammar and a parser. (Semantics component planned.) Usable as a Java library, also provides a graphical UI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    agent99

    Passive network monitor.

    agent99 watches your network and generates a stream of events from the traffic it sees. Processing of the events is the job of other software, but some of it is immediately useful. It can log clear text usernames and passwords if it sees them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Delta ML

    Delta ML

    Deep learning based natural language and speech processing platform

    ...It helps you to train, develop, and deploy NLP and/or speech models. Use configuration files to easily tune parameters and network structures. What you see in training is what you get in serving: all data processing and features extraction are integrated into a model graph. Text classification, named entity recognition, question and answering, text summarization, etc. Uniform I/O interfaces and no changes for new models.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB