23 projects for "text encoding" with 2 filters applied:

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    zpdf

    zpdf

    Zero-copy PDF text extraction library written in Zig

    ...It implements multiple PDF decompression filters and handles common font encoding pathways, which are essential for turning raw PDF content streams into readable text. It also understands both classic cross-reference tables and newer cross-reference streams, including PDF 1.5+ features, and it offers configurable strict vs permissive error handling depending on whether you prioritize correctness or robustness.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Ksoup

    Ksoup

    Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML

    Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    MIME Component

    MIME Component

    Allows manipulating MIME messages

    ...It is commonly used for handling email content and attachments in Symfony applications. The component supports building complex email structures, including multi-part messages, and correctly encoding text and binary data. Mime is an essential part of Symfony Mailer but can be utilized independently to handle MIME message generation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    International Components for Unicode

    International Components for Unicode

    The home of the ICU project source code

    ...ICU is released under a nonrestrictive open-source license that is suitable for use with both commercial software and with other open-source or free software. Convert text data to or from Unicode and nearly any other character set or encoding. ICU's conversion tables are based on charset data collected by IBM over the course of many decades and is the most complete available anywhere. Compare strings according to the conventions and standards of a particular language, region or country. ICU's collation is based on the Unicode Collation Algorithm plus locale-specific comparison rules from the Common Locale Data Repository, a comprehensive source for this type of data.
    Downloads: 11 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    BinEd Binary/Hex Editor

    BinEd Binary/Hex Editor

    Binary / hex editor and component written in Java

    Free and open source hex editor written in Java. This is standalone desktop app, library for Java applications as well as variants for Java IDEs are also available.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 6
    jPicEdt

    jPicEdt

    Another drawing editor for LaTeX with PSTricks & TikZ

    jPicEdt is an extensible internationalized vector-based drawing editor for LaTeX and related packages (TikZ, PsTricks,...), written in Java. It is also a library of reusable high-level graphic primitives.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7

    Change File Encoding

    Change encoding of text files.

    Change File Encoding is a utility that allows you to change the encoding of text files. For example, files saved in US-ASCII can be converted to UTF-8. Over 170 encodings are supported. Requires Java 1.8 or higher.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    towhee

    towhee

    Framework that is dedicated to making neural data processing

    Towhee is an open-source machine-learning pipeline that helps you encode your unstructured data into embeddings. You can use our Python API to build a prototype of your pipeline and use Towhee to automatically optimize it for production-ready environments. From images to text to 3D molecular structures, Towhee supports data transformation for nearly 20 different unstructured data modalities. We provide end-to-end pipeline optimizations, covering everything from data decoding/encoding, to model inference, making your pipeline execution 10x faster. Towhee provides out-of-the-box integration with your favorite libraries, tools, and frameworks, making development quick and easy. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Big List of Naughty Strings

    Big List of Naughty Strings

    List of strings which have a high probability of causing issues

    ...It exists so developers and QA engineers can easily test edge cases that normal test data would miss, such as zero-width characters, right-to-left marks, emojis, foreign alphabets, and long or malformed strings. By throwing these strings at forms, APIs, databases, and UIs, teams can discover encoding bugs, sanitizer gaps, rendering issues, and security oversights early. The list is language-agnostic and repository-friendly, meaning you can consume it from CI pipelines or local scripts with minimal setup. Because it’s crowdsourced, it reflects real issues practitioners have faced in production, not just theoretical cases. Using the list regularly helps harden applications against the fragile edges of text processing and user input.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 10
    Stringy

    Stringy

    A PHP string manipulation library with multibyte support

    Stringy is a PHP library that provides a set of string manipulation functions inspired by the String class in other programming languages. It offers a fluent interface for common string operations, including case conversion, trimming, and formatting. Stringy is designed to simplify string handling by providing a consistent and expressive API, making it a valuable tool for text processing in PHP applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    The `8conv` text encoding converter

    Converts quoted-printable, UTF-8, UTF-16BE, UTF16LE to 8-bit.

    Command-line tool to decode text which has undergone MIME-typical encoding into (ISO-8859) 8-bit --- without having or parsing the relevant MIME type declarations. Decodes quoted-printable sequences; handles UTF-8, UTF-16BE, UTF-16LE; optionally translates common characters from DOS or Windows code pages or HP-Roman8 into ISO-8859-1. UCS characters can be mapped to 8-bit translation strings.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    xorlisp

    Bit level lambda continuations and nothing else - Queue automata

    Not working yet. To deal with the Halting Problem, computing and data are navigated using debugger ops: linearForward and treeForward, which navigate an astronomically large bit string where 1 is ( and 0 is ). All pairs are derived from (). For example, true is represented as ((()())()), and false is (()(()())). It appears related to the church encoding of lambda where T chooses first parameter and F chooses second, of a pair. Continuations are nearly finished code and are represented as a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Hunspell is a spell checker and morphological analyzer library and program designed for languages with rich morphology and complex compounding or character encoding. Hunspell interfaces: Curses, Ispell compatible pipe interface, OpenOffice.org UNO module
    Leader badge
    Downloads: 308 This Week
    Last Update:
    See Project
  • 14
    notepas

    notepas

    Advanced multiplatform text editor for developers

    Notepas is a fast native multiplatform text editor written in Lazarus and can be compiled for multiple platforms and widget sets using the advanced native Free Pascal Compiler. Aimed towards developers it is equipped with some functions usually not found in other text editors and introduces a couple of new exclusive features.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Transliterator between any Language files - Map Fonts, Create Encoding Scheme, Input Phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish or Any World Language Keyboard. Ex: [Phonetic Input]-[Any World Language Output] or ViceVersa.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    gundercrypt [MOVED]

    A CLI text encrypting/ciphering program and library for java

    ** MOVED: to github.com/gundermanc/gundercrypt A CLI text ciphering program and library written in java that makes use of pseudo random numbers and a password (provided by the user) to generate encrypted text by selectively shifting each subsequent letter by a pseudo-random index, encoding it as a letter stored in a hidden position within the String, and then re shifting the encrypted String again by numeric values deduced from a password to provide a dual layer of security. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    XML Text Declaration

    Support for XML text declaration handling

    The purpose of this library is to determine the text encoding of an input, and dynamically insert a filter, which allows the processing program to assume UTF-8 input, regardless of the actual input encoding.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Codepage Converter
    Codepage Converter - Convert HTML/Text files to different encoding formats e.g. ANSI to UTF-8 or Unicode. Convert multiple files with 1 click. Works with all encodings. * The issue with UTF-8 has now been fixed.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 19
    JavaEmailer is a Java Classes Library that allows Java developers to easily integrate a mass mailer to their applications. JavaEmailer manages file attachments, SMTP servers, CCs, BCCs, plain text and/or html messages, and more...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Java classes that enable definition of new Charsets based on other existing Charsets, without additional programming. Includes a character set with Kamenik encoding.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    SandR is a Recursive Regex Search and Replacement utility. It works on files or directories recursively. It supports Java-style Regular Expression in Search terms; it supports auto-detection of character encoding of the files. SandR is written in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    EDGE (electronic document general encoding) is a SGML-based markup Language for general documents, incl. scientific papers, technical/computer documentation, prose, drama, etc. It aims to be less restricted than comparable DTDs (e.g. DocBook or TEI).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    The MetaText Java library provides an extensible abstraction of a generic text object using many text encoding format so that applications can ignore the underlying encoding. Supported encodings:Unicode,ASCII,Cp1252,ISO8859_1,ETSI GSM,XML/HTML,DOS CP437
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB