Showing 19 open source projects for "rxvt-unicode"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    Tesseract OCR

    Tesseract OCR

    Open Source OCR Engine

    ...Tesseract can recognize over 100 languages out-of-the-box, and can be trained to recognize other languages. It supports various output formats, including plain text, HTML, PDF and more. It also has unicode (UTF-8) support.
    Downloads: 2,078 This Week
    Last Update:
    See Project
  • 2
    SentencePiece

    SentencePiece

    Unsupervised text tokenizer for Neural Network-based text generation

    ...Purely data driven, sentencePiece trains tokenization and detokenization models from sentences. Pre-tokenization (Moses tokenizer/MeCab/KyTea) is not always required. SentencePiece treats the sentences just as sequences of Unicode characters. There is no language-dependent logic.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    NeMo Curator

    NeMo Curator

    Scalable data pre processing and curation toolkit for LLMs

    NeMo Curator is a Python library specifically designed for fast and scalable dataset preparation and curation for large language model (LLM) use-cases such as foundation model pretraining, domain-adaptive pretraining (DAPT), supervised fine-tuning (SFT) and paramter-efficient fine-tuning (PEFT). It greatly accelerates data curation by leveraging GPUs with Dask and RAPIDS, resulting in significant time savings. The library provides a customizable and modular interface, simplifying pipeline...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    minbpe

    minbpe

    Minimal, clean code for the Byte Pair Encoding (BPE) algorithm

    minbpe is a minimal, clean implementation of byte-level Byte Pair Encoding (BPE), the tokenization approach widely used in modern language models. It operates on UTF-8 encoded bytes rather than Unicode characters, which makes it robust to arbitrary text inputs and avoids needing a language-specific character vocabulary. The repository is structured as a teaching-oriented implementation that shows how to train a tokenizer by learning merge rules, then apply those merges to encode text into token IDs and decode tokens back into text. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    Burn To The Brim

    Burn To The Brim

    Utility for efficiently grouping files and folders together

    **Burn To The Brim** is a highly efficient archiving utility designed to solve the classic subset-sum (bin packing) optimization challenge. It intelligently selects and groups files and directories (documents, high-fidelity media, or raw back-ups) to optimally fill recordable Blu-Rays, USB drives or custom-capacity storage drives. By recursively scanning your designated folders, BTTB matches item sizes to your media capacity, finding a near-perfect selection in milliseconds and an...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    TXM

    TXM

    Unicode XML TEI text analysis platform

    TXM is a free and open-source cross-platform Unicode & XML based text analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. DOWNLOAD LATEST VERSION OF TXM : http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP full text search engine (http://cwb.sourceforge.net) and a range of statistical functions (factorial analysis, classification, cooccurrency analysis, etc.) based on R packages (http://www.r-project.org). ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 7
    Unitag is a language-independent Unicode-based part-of-speech tagging system. Written entirely in ANSI-compatible C, it should (in theory) compile on any OS, but has been tested on 32-bit Windows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    YouTokenToMe

    YouTokenToMe

    Unsupervised text tokenizer focused on computational efficiency

    YouTokenToMe is a fast and efficient unsupervised text tokenization library designed for training subword embeddings, particularly useful for NLP models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    CLISP - an ANSI Common Lisp
    CLISP is a portable ANSI Common Lisp implementation and development environment by Bruno Haible. Interpreter, compiler, debugger, CLOS, MOP, FFI, Unicode, sockets, CLX. UI in English, German, French, Spanish, Dutch, Russian, and Danish.
    Leader badge
    Downloads: 209 This Week
    Last Update:
    See Project
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • 10
    Program Q AIML is a C++ Qt-based library offering a simple API to parse AIML XML files and then interacting with a user input with Latin/Arabic support (Unicode). AIML is a technology permitting to have an AI chat bot. A sample application is provided.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    romanize

    Romanizing 9 Indian languages (Unicode) to English alphabets

    ...Romanization is normally done using ASCII and extended ASCII syllables, which is easy to process but difficult to work with. Romanize project converts the Indian languages in their unicode form to english alphabets. Compared to the existing schemes of romanizations, this project focuses on few main points - Readability, easy typability, English alphabet combinations only, incoporations with existing popular schemes, phonetically equivallent transliterations and most importantly non-ambiguity across the languages (9) using the same transliteration mapping set.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    The first Free Text-to-Speech (TTS) synthesis system in Malayalam based on festival speech synthesis system. It gives Orca - a free domain screen reading software, the additional functionality to read Unicode Malayalam documents and websites as well.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Our Objective is to create a GPU-based system that can accept scanned inputs of printed Devanagari texts, and produce outputs of the same in Unicode with a very high accuracy (>99.9%).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Collaborative development and distribution of Windows Speech Recognition (WSR) application macros to 1) improve the accessibility of personal computing for impaired users, and 2) improve the efficiency of personal computing for all users.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    ...Include: contour recognition; vectorisation; matrix letter feature recognition; auto page segmentation and detect rotation; SS3 ASM core; XML base; web-based GUI; 99,6% printed Unicode text recognition; letter base up to 1200 letters.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    NOTE: I couldn't keep up this project to align with latest Unicode spec. Not sure I may be continuing. You can try Myanmar3 from Myanmar NLP or WinUniInnwa or https://sourceforge.net/projects/prahita/ or something better compliant font. ~Victor --- [This is UniBurma - UniMM project workshop area. This project currently have two productions, UniBurma and UniMM. For more descriptive info about this project, please visit http://unimm.org/.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Auvai is a Java API and Java Swing based application for Text to Speech conversion of Unicode Tamil. Future direction of this API and application is to support Text to Speech conversion for all "Indic" languages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Kathak is a Bangla text to speech synthesizer which can produce the speech from unicode bangla text input. We are developing the system based on Festvox framework.The Festival Speech Synthesis System was used as a base for developing Kathak.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    K is a C++ library featuring Genetic Programming, cryptography, 64 bits arithmetic, cross-platform macros and unicode conversion routines. It compiles on NewtonOS, Mac < X, Unix and Windows and it makes little use of C++ templates.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo