Showing 1417 open source projects for "text processing"

  • 1
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    A collection of small utilities for: data extraction (text or binary files), data buffering, message queue control, column addition, date/time manipulation, and data recovery testing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Leseratte is a Java parser for written German. Currently, it contains a German lexicon (based on Wiktionary), inflexion rules, a grammar and a parser. A semantics component is planned. It is usable as a Java library and also provides a graphical UI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 5
    wordTabulator
    wordTabulator is a program for text analysis. It generates an index of word elements extracted from a defined set of texts. Word elements may be words, N-grams, or phrases (syntagms). The program can process texts both in an ordinary single-byte encoding (ANSI) and in multibyte UTF-8 encoding.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    fastText

    Library for fast text classification and representation

    FastText is an open-source, free, lightweight library that allows users to learn text representations and text classifiers. It works on standard, generic hardware. Models can later be reduced in size to fit even on mobile devices. Text classification is a core problem in many applications, such as spam detection, sentiment analysis, or smart replies. In this tutorial, we describe how to build a text classifier with the fastText tool. The goal of text classification is to assign documents (such as...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    EF-SKHR

    File editor

    EF-SKHR is a file editor developed by Loyto_SKHR in C++ with the Qt 5.9 library. Source code: https://github.com/Loyto-SKHR/EF-SKHR.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Program for generating HTML documents with embedded mathematical expressions, processed with Maxima (maxima.sourceforge.net).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Downloads: 0 This Week
    Last Update:
    See Project
  • 10
    Anaphraseus
    Anaphraseus is a CAT (Computer-Aided Translation) tool: an OpenOffice.org 2-3 macro set for OpenOffice/LibreOffice Writer, similar to the well-known Wordfast. It works with the Wordfast Translation Memory format (*.TXT) and supports text segmentation.
    Downloads: 20 This Week
    Last Update:
    See Project
  • 11
    LaTeX Reference Card Creator

    A Makefile-based build system for creating LaTeX reference cards

    LaTeX Reference Card Creator is a Makefile-based build system for creating reference cards. It compiles content into PDF, DjVu, TeX DVI, HTML and PostScript output formats and produces a three-column reference card. Features include batch image format conversion, spell checking, broken link checking, automatic backups, and building of .zip and .tar.gz distributions. It also provides many LaTeX examples that can be used to make a reference card.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    NOTEMAN2

    A small note-making tool for scholars to organise notes

    This is a small tool written in C# for note-making, suited to researchers and scholars. The application must be placed in the folder where you want to create notes. Typically the folder is named after an author or a topic.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    XSLT syntax highlighting

    Java-based XSLT processor extension for syntax highlighting

    Please note that the project has moved to GitHub: https://github.com/xmlark/xslthl This is an implementation of syntax highlighting as an extension module for XSLT processors (Xalan, Saxon). For example, if you have an article about programming written in DocBook, its code examples can be syntax highlighted automatically during the XSLT processing phase.
    Downloads: 82 This Week
    Last Update:
    See Project
  • 14
    Jed Modes Repository
    A collection of S-Lang extension scripts (modes) for the Jed text editor, contributed by Jed users. Browse the repository at http://jedmodes.sf.net/
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    bsed

    Simple SQL-like syntax on top of Perl text processing

    bsed is a stream editor that offers a simple SQL-like syntax for text processing tasks. Designed to replace basic uses of tools like sed, grep, AWK, and Perl, it allows users to perform complex text manipulations with intuitive commands.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Pgn2ltx is a simple converter that reads chess games in PGN format and outputs them to a LaTeX input file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ChessTask is a frontend for easily creating chess tasks with LaTeX. The tasks are stored in a list which can be exported, either to a LaTeX input file, or to HTML files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Fen2eps is a small console program that converts FEN (Forsyth-Edwards Notation) strings to EPS (Encapsulated PostScript) files containing the chess board diagram.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Sinhala Writer

    Best tool there is to type Sinhala Unicode characters

    Sinhala Writer is a fully portable phonetic Unicode character converter designed to generate Sinhala (සිංහල) characters whenever you need them. It is useful in many scenarios, such as messaging on social media networks, working in third-party applications like Photoshop and After Effects, and generating Sinhala characters for web pages and for computers that do not use Sinhala fonts. Simply put, if you want to quickly generate some...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Café is a yagte (*): a multipurpose, tabbed, scriptable, lightweight, fast, UTF-8 ready and full-featured text editor with IDE aspirations. Café uses Perl as its macro language. (*) yet another GTK+ text editor
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Win32Forth is an ANS-compatible Forth application development system with many tools: an interactive console, an integrated extensible debugger, a GUI file editor, hypertext rendering, and hyperlinked source files. Use VIEW <word-name> to explore the many files.
    Downloads: 57 This Week
    Last Update:
    See Project
  • 22
    Web Book Downloader

    Download websites as e-books: PDF, TXT, EPUB.

    This application lets the user download chapters from a website in three ways: from a table of contents; from a range (first chapter address to last chapter address); or by crawling from the first chapter to chapter n. In the settings you can customize the language and the input (website) encoding; for simplicity, the output uses the same encoding. To add your own language, add a new class to the strings package and new fields to the Settings class and the GUI menu (initialize method).
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    Common Litt

    Simple JavaScript library for automatic transliteration; an input tool.

    This project focuses on automatic conversion between language alphabets. Using the 'lit.js' library, you can currently convert between English, Tamil, and Sinhala scripts in either direction. This is useful when you need to know how to write something in another language. It is still at the development stage but works perfectly and is easy to customize. Live demo available at: http://commonlitt.42web.io/ The UI was built with Bootstrap and jQuery. For easy array...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    ConcatPDF

    PDF Concatenation Tool

    ConcatPDF is a tool for concatenating PDF files. It can concatenate, extract, encrypt, decrypt, and configure PDF files, and convert image files to PDF. Both GUI and CUI versions are available. iText.NET is a port of iText to the .NET Framework using J#. This library allows you to generate PDF, (X)HTML, XML, and RTF files on the Microsoft .NET Framework, including ASP.NET.
    Downloads: 27 This Week
    Last Update:
    See Project
  • 25

    SmGen

    Verilog Finite State Machine (FSM) Code Generator

    SmGen is a finite state machine (FSM) generator for Verilog. It is not, however, an FSM entry tool. The input is behavioral Verilog with clock boundaries specifically set by the designer. SmGen unrolls this behavioral code and generates an FSM from it in synthesizable Verilog. Because the designer provides the clock boundaries explicitly, there is good control over the expected timing.
    Downloads: 0 This Week
    Last Update:
    See Project