Search Results for "text processing" - Page 21

Showing 1568 open source projects for "text processing"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    JUSH is a syntax highlighting component written in JavaScript. It highlights HTML, CSS, JS, PHP and SQL code embedded into each other. Beside syntax highlighting, it provides links to the documentation for all supported languages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Deeplearning-papernotes

    Deeplearning-papernotes

    Summaries and notes on Deep Learning research papers

    Deeplearning-papernotes is an implementation of Convolutional Neural Networks for sentence and text classification in TensorFlow, based on a well-known research paper that applies CNN architectures to natural language processing tasks with strong performance in sentiment analysis and similar classification problems. The repository provides the complete network definition, including an embedding layer to convert words into dense representations, convolution and max-pooling layers to extract informative features, and a final softmax classifier to distinguish between target classes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    IceNLP is an open source Natural Language Processing (NLP) toolkit for analyzing and processing Icelandic text. The toolkit is implemented in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techinques. A command line executeable is shipped that allows to sort documents by codepage.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5

    Indexmeister

    automatic indexing for large LaTex documents

    Indexmeister reads a variety of formats (.tex, .docx, .epub, and others) and suggests keywords for indexing. The included program Imbrowse provides a semi-automatic interface to rapidly add index tags to multi-file latex documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    JETT

    JETT - Java Excel Template Translator

    JETT is a Java 5.0 API that reads an Excel spreadsheet as a template, takes your data, and creates a new Excel spreadsheet that contains your data, formatted as in the template. It works with .xls and .xlsx template spreadsheets.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    uncaptcha

    uncaptcha

    Defeating Google's audio reCaptcha with 85% accuracy

    uncaptcha is an open-source proof-of-concept system designed to demonstrate vulnerabilities in Google’s audio reCAPTCHA challenges by automatically solving them using speech recognition techniques. The project uses browser automation to navigate to CAPTCHA challenges, extract audio files, and process them through multiple speech-to-text services. By combining outputs from several transcription engines, the system increases the likelihood of correctly identifying the spoken digits or phrases required to solve the challenge. It employs signal processing techniques such as segmenting audio clips into individual components before transcription, which improves accuracy in noisy or complex audio conditions. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    XMLStarlet is a set of command line utilities (tools) to transform, query, validate, and edit XML documents and files using simple set of shell commands in similar way it is done for text files with UNIX grep, sed, awk, diff, patch, join, etc utilities.
    Leader badge
    Downloads: 1,286 This Week
    Last Update:
    See Project
  • 9
    High-Throughput Tabular Data Processor
    ...Citation: Madanecki P, Bałut M, Buckley PG, Ochocka JR, Bartoszewski R, Crossman DK, et al. (2018) High-Throughput Tabular Data Processor – Platform independent graphical tool for processing large data sets. PLoS ONE 13(2): e0192858. https://doi.org/10.1371/journal.pone.0192858
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    Distributed Proofreaders
    Project has moved to https://github.com/DistributedProofreaders/dproofreaders Distributed Proofreaders is a web application intended to ease the process of converting public domain books and other printed materials into e-texts. The main site is at http://www.pgdp.net By breaking the work into individual pages, many proofreaders can be working on the same book at the same time. This significantly speeds up the proofreading/E-Text creation process. When a proofer elects to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Vim provides a rich set of tools which makes generating latex easy, pain-free and quite pleasurable. This web-site aims at bringing together the rich set of tools the vim community has produced over the years into a central repository
    Downloads: 9 This Week
    Last Update:
    See Project
  • 12
    fif

    fif

    A simple tool to find text in multiple files

    fif (find in files) is a small desktop application that makes searching files for a string simple.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Lied is a FLTK text editor (with some modifications) with the addition of a specific menu that collects several definitions in Lilypond syntax ordered by argument.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Capsim(r) C Text Mode Kernel(TMK),DSP and communication blocks, topologies, libraries and tools for the development of high performance block diagram digital signal processing and communications systems,built in interpreter for scripting.SystemC support.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Perl Text Editor

    Perl Text Editor

    Source code for perl simple text editor.

    Simple text editor. Run: $perl 01text0.pl
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    file_lemmater

    text file quick lemmater

    This executable get a text file (input name "in.txt" at the same folder where the executable is) and creates a file called "out.txt" with the same content but each noun, adjective or verb is lemmatized. From the Aseryla (https://memla.000webhostapp.com/index.html) system that combines the Stanford Core NLP (https://stanfordnlp.github.io/CoreNLP/index.html) and the CSTlemmatiser(http://cst.dk/online/lemmatiser/uk/)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    transformenator

    transformenator

    Text and binary file transformation

    Text and binary file transformation. Text-based sed, awk, etc. failed me when I needed a simple way to transform binary files to something else. Transformenator is what resulted.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    A Swiss Army Knife GUI application for PDF documents: combine, split, rotate, reorder (n-up, booklet), watermark, edit bookmarks/fileinfo/pagetransition, compress, encrypt, decrypt, sign, repair, edit attachments and more.
    Leader badge
    Downloads: 75 This Week
    Last Update:
    See Project
  • 19
    The Text Trix editor is an open-source, cross-platform text editor with a mind for user-friendly file and text navigation. See our new GitHub home for the latest and greatest Text Trix development: https://github.com/the4thchild/texttrix
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Stringy

    Stringy

    A PHP string manipulation library with multibyte support

    ...It offers a fluent interface for common string operations, including case conversion, trimming, and formatting. Stringy is designed to simplify string handling by providing a consistent and expressive API, making it a valuable tool for text processing in PHP applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    PSIworx

    Data processing for the PSI fluorometer

    PSIworxR (R) and PSIworx (MATLAB) are a collection of functions and scripts to analyze data from the PSI SuperHead Fast Fluorometer series (www.psi.cz). These fluorometers are used in limnology and oceanography research communities. The program retrieves parameters from single turnover induction and relaxation. Results are returned as text files and PDF figures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    360-FAAR  Firewall Analysis Audit Repair

    360-FAAR Firewall Analysis Audit Repair

    360-FAAR Analyze FW1 Cisco Netscreen Policy Offline Using Config/Logs

    360-FAAR (Firewall Analysis Audit and Repair) is an offline, command line, firewall policy manipulation tool to filter, compare to logs, merge, translate and output firewall commands for new policies, in Checkpoint dbedit, Cisco ASA or ScreenOS commands, and its one file! Read Policy and Logs for: Checkpoint FW1 (in odumper.csv / logexport format), Netscreen ScreenOS (in get config / syslog format), Cisco ASA (show run / syslog format), 360-FAAR compares firewall policies and uses...
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23

    cbrTekStraktor

    an application to automatically extract text from comic books.

    cbrTekStraktor is an application to automatically extract text from the text bubbles or speech balloons present in comic book reader files (CBR). Its prime goal is to perform analysis on the texts of comic books. cbrTekStraktor can however also be used for scanlation or similar purposes. The application also enables to manually define text areas in CBR files. The application comprises a simple graphical editor for further processing the extracted text. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    Ncurses based hex editor with vi/vim-like interface. Features include large file support, search highlight, multiple undo/redo, visual select, cut/paste, blob coloring, file tabs, and much more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    aeneas

    aeneas

    Automagically synchronize audio and text (aka forced alignment)

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment). aeneas automatically generates a synchronization map between a list of text fragments and an audio file containing the narration of the text. In computer science this task is known as (automatically computing a) forced alignment.
    Downloads: 4 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB