Search Results for "text processing" - Page 10

Showing 1237 open source projects for "text processing"

View related business solutions
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1

    modnlp

    Modular Suite of NLP Tools

    modnlp aims to provide a modular architecture and tools for natural language processing written (mainly) in Java. It provides an API and tools for (inverted) indexing, storage and retrieval of large amounts of text, with (XML-based) handling of meta-data, tools for text categorisation, including, functionality for XML parsing, term set reduction (and basic keyword extraction), probabilistic classifier induction, sample classification tools, and evaluation modules, a suite of corpus management, curation and distributed access tools. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Articlefox is a workflow system that can be used to prepare the articles of a small journal.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    biblatex
    Biblatex is a LaTeX package which provides full-featured bibliographic facilities
    Leader badge
    Downloads: 23 This Week
    Last Update:
    See Project
  • 4
    Script Echo Color

    Script Echo Color

    Terminal text colorizing and simplifies script coding.

    ScriptEchoColor simplifies Linux terminal text colorizing, formatting and several steps of script coding.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    symbols

    symbols

    Free JSON dataset of Unicode text symbols (keys, hex codes, names)

    ...The data can be used by developers in applications, websites, and programming projects where symbol handling is required. The JSON file makes it easy to integrate symbols into JavaScript, Python, or other programming environments. It can also be used for text-processing tools, font testing, and UI/UX design projects where special characters are needed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Bootleg Sound Processor

    Bootleg Sound Processor

    Software for processing audio files.

    Software for processing audio files. The files "Batch Processor.py" and "Duplicate remover.py" are meant to be used with the output of Bootleg Text Slicer (https://github.com/Northstrix/bootleg-text-slicer) placed into the "Unprocessed" folder, while "Single file processor.py" can be used with standalone files from arbitrary locations. GitHub repository: https://github.com/Northstrix/bootleg-sound-processor Made using Google AI Studio (https://aistudio.google.com/) and Perplexity (https://www.perplexity.ai/)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    threeddonut

    threeddonut

    3D donut. Example of frojasg1.com libraries usage

    The application shows a 3D donut, that can be rotated with two sliders in both axis. It is a simple example of what can be done with frojasg1.com platform libraries: - Zoom option for components - Multi language - Dark mode option - Automatic Undo-Redo for text components, with popup menu included - Text Search/Replace window prepared to be used. - Base components for auto-completion windows. - Automatic component relocation after redimensioning a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    GATE
    NOTE THAT THE SOURCE CODE AND ISSUE TRACKER HAVE NOW MOVED TO GITHUB. FIND US AT https://github.com/GateNLP/ GATE (General Architecture for Text Engineering) is an architecture, framework and development environment for developing, evaluating and embedding Human Language Technology. See http://gate.ac.uk for full details.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9

    xmlj

    XMLJ is a Java XML Editor and validator project.

    XMLJ is a Java XML Editor and validator project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    Unihanconver

    Unihanconver

    Traditional/Simplified Chinese conversion with CLI or GUI

    Tool to convert between Traditional/Simplified Chinese directly in Unicode (not GB/Big5 conversion). It is written in Perl and does not use any external libraries. It provides a command-line utility as well as a GTK+ interface for X Window.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    FOray

    Modular XSL-FO Implementation for Java.

    FOray is an open-source XSL-FO publishing system that is suitable for converting XML content into PDF and other document formats. Although not yet fully conformant with the XSL-FO standard, it is very useful for many applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    FileCut

    FileCut

    Simple cross-platform application to cut and join any text file.

    FileCut is a simple easy-to-use cross-platform application to cut to a given line and then join any text file. It is also possible to join in normal or reverse order. Works also from command-line interface, e.g. 'java -jar filecut.jar -c file.txt . 10', to cut 'file.txt' at line 10, and 'java -jar filecut.jar -j . >file.txt', to join files in the current directory in 'file.txt'. FileCut is portable, does not need installation and is developed in Java, so needs the Java Virtual Machine...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    SubLin

    SubLin

    Software tool to subtract lines of any text file from another.

    SubLin is a simple easy-to-use cross-platform application to subtract lines of any text file from another. It is also possible to keep or ignore case sensitive. Works also from command-line interface, e.g. "java -jar sublin.jar -s file1.txt file2.txt >new_file1.txt", to create output file "new_file1.txt", or "java -jar sublin.jar -s file1.txt file2.txt >>new_file1.txt", to create or append to output file "new_file1.txt". SubLin is portable, does not need installation and is developed in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    DupRem

    DupRem

    Simple application to remove duplicate and empty lines on text files.

    DupRem is a simple easy-to-use cross-platform application to remove duplicate and empty lines from any text file. It is also possible to keep or ignore case sensitive. Works also from command-line interface, e.g. "java -jar duprem.jar -r input_file.txt >output_file.txt", to create output file, or "java -jar duprem.jar -r input_file.txt >>output_file.txt", to create or append to output file. DupRem is portable, does not need installation and is developed in Java, so needs the Java Virtual...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    ChordSmith

    ChordSmith

    Chordpro editor that can display, transpose and print song sheets.

    ChordSmith is a chordpro editor that can display, transpose and print song sheets containing chords and lyrics. It can also edit and convert song sheet formats (including Harmonica tabs) between chordpro format (chords in square brackets in line with lyrics) and two-line format (chords above lyrics). You can find many free sources of song sheets in both formats on the Internet. More information at https://chordsmith.sourceforge.io/ Here are just a few of ChordSmith's many features:
    Downloads: 47 This Week
    Last Update:
    See Project
  • 17
    SingleFile

    SingleFile

    Web Extension for saving a copy of complete web page in a single file

    Web Extension for Firefox/Chrome/MS Edge and CLI tool to save a faithful copy of an entire web page in a single HTML file. SingleFile is a Web Extension (and a CLI tool) compatible with Chrome, Firefox (Desktop and Mobile), Microsoft Edge, Vivaldi, Brave, Waterfox, Yandex Browser, and Opera. It helps you to save a complete web page into a single HTML file. Wait until the page is fully loaded. Click on the SingleFile button in the extension toolbar to save the page. You can click again on the...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 18
    DOCX Document Converter

    DOCX Document Converter

    Convert .docx to .md/.txt and .html. Free, unlimited, fast.

    A simple, free, unlimited, secure web-based tool that converts Microsoft Word documents (.docx) into Markdown (.md/.txt) and HTML files. Perfect for developers, writers, and anyone who needs to transform .docx MS Office Word documents into web-friendly or AI context friendly formats. Unlike those other jerks on the web that charge many dollars per month for this, I made it free, unlimited and open source. This is a better version of 'convert docx to txt' since .md files can be opened...
    Downloads: 22 This Week
    Last Update:
    See Project
  • 19
    SciTECO

    SciTECO

    Advanced TECO dialect and interactive screen editor based on Scintilla

    SciTECO is an interactive TECO dialect, similar to Video TECO. It also adds features from classic TECO-11, as well as unique new ideas. Project development takes place here: https://git.fmsbw.de/sciteco The download archive is mirrored at Sourceforge, but for nightly builds check out: https://sciteco.fmsbw.de/downloads/nightly/
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20

    multinotes

    Text architecture for music theory.

    ...Furthermore, dynamic interactive documents can be useful for presenting complicated interdependencies to the reader more clearly, far beyond conventional paper publication. The mulitNotes text architecture and processing pipeline is based on d2d and standard technologies (XSLT, ECMAScript. LilyPond, PostScript, etc.) and addresses these issues. An overview about the software architecture and its operation is given in: Journal of the Text Encoding Initiative, Open Issue 18/2024: "Using d2d for Writing XML --- The multiNotes Text Architecture for Musical Analysis" https://doi.org/10.4000/132ex
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    PCSecrets

    PCSecrets

    Encrypt and manage secret text data

    PCSecrets is a PC application that holds secret text data - protected by a master password and strong encryption. Use it as a password manager or just somewhere to hold any text data securely in one place. It can hold a second, hidden set of secrets that is undetectable and plausibly deniable. The program is also a PC counterpart of the Secrets for Android app. It uses the same data structure and provides synchronization that allows easy transfer of secrets between the two. For those who...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Trayslate – Tray Translate

    Trayslate – Tray Translate

    Trayslate is a tray-based client for translation services. You can ent

    Trayslate is a tray-based client for translation services. You can enter text directly, translate clipboard content, or translate selected text in any application. You can also replace text in another app with its translation using a hotkey. The app lets you choose and fully configure the translation service you use.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    CSM (Conversational Speech Model)

    CSM (Conversational Speech Model)

    A Conversational Speech Generation Model

    The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 24
    Chord5

    Chord5

    A version of CHORD4 updated to cooperate with ChordSmith.

    CHORD5 is a ChordPro editor and renderer, useful for formatting and printing song sheets ("lead sheets"). This version of the CHORD program (based on CHORD4) has been modified to add functionality that enables it to cooperate with the ChordSmith program (available at https://sourceforge.net/projects/chordsmith/). This modified version has been renamed to CHORD5. Although it is revised to work with ChordSmith, it also works well as a standalone program. NOTE: If you have...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    Agena

    Agena

    Agena is an interpreted procedural programming language.

    Agena is an easy-to-learn procedural programming language designed for science, scripting, and many other applications. Binaries are available for Windows, Linux, Solaris, OS/2, Mac OS X, Raspberry Pi and DOS.
    Leader badge
    Downloads: 214 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB