Showing 25 open source projects for "text encoding"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Text Encoding Initiative

    Text Encoding Initiative

    TEI produces the TEI Guidelines and associated software

    The TEI is an international and interdisciplinary standard used by libraries, museums, publishers, and academics to represent all kinds of literary and linguistic texts, using an encoding scheme that is maximally expressive and minimally obsolescent.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    biber
    Biber is a sophisticated bibliography processing backend for the LaTeX biblatex package. It supports a unsurpassed feature set for automated conformance to complex bibliography style requirements such as labelling, sorting and name handling. It has comprehensive Unicode support.
    Leader badge
    Downloads: 287 This Week
    Last Update:
    See Project
  • 3

    Change File Encoding

    Change encoding of text files.

    Change File Encoding is a utility that allows you to change the encoding of text files. For example, files saved in US-ASCII can be converted to UTF-8. Over 170 encodings are supported. Requires Java 1.8 or higher.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Web Book Downloader

    Web Book Downloader

    Download websites as e-book: pdf, txt, epub.

    This application allows user to download chapters from website in 3 ways: - from table of contents; - from range: first chapter address, last chapter address; - by crawling from first chapter to n; In settings you can customize language, input(website encoding) for simplicity output is in the same encoding. If you want your language add new class into strings package, and new fields into Settings class and GUI menu(initialize method).
    Downloads: 9 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5

    Ghawwas_V4

    An open source system for Arabic corpora processing

    Ghawwas (previously known as Khawas) is an open source system for Arabic corpora processing. Ghawwas V4.0 provides the following main functions: a. Frequency list for single word and N-Grams b. Concordance c. Collocation (MI, CHI Squared, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient) d. Lexical patterns search e. Two corpora frequency profile comparison based on MI, CHI, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient f. Accept Windows and UTF-8 character...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    TeXML is an XML vocabulary for TeX. The processor transforms the TeXML markup into the TeX markup, escaping special and out-of-encoding characters. The intended audience is developers who automatically generate [La]TeX or ConTeXt files.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techinques. A command line executeable is shipped that allows to sort documents by codepage.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 8
    Notepad3

    Notepad3

    Light-weight Scintilla-based text editor with syntax highlighting

    Notepad3 is a fast and light-weight Scintilla-based text editor with syntax highlighting. Notepad3 is an excellent replacement for the default Windows text editor. Notepad3 offers many extra features over Notepad. It has a small memory footprint, but is powerful enough to handle most programming jobs.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 9

    Character Encoding Translator

    Translate text from one encoding to another.

    Character Encoding Translator is an API and utility for translating text from one character encoding to another. A single JAR provides a GUI application, a console application, and an API.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    WinTailViewer++

    WinTailViewer++

    Tool that allows to tail log files

    WinTailViewer++ is a free Windows tool that allows you to tail log files and change the encoding by offering a bunch of features. You can set the window size while display it as top most. Also you can read the log while you're using another window by setting the window opacity. Description by Softpedia.com: "An intuitive and easy to use tool designed for reading log files, providing users with the possibility to make the window transparent and carry on with their work. WinTailViewer++...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    EncTool

    Command line tool to detect and convert files encoding.

    Command line tool to detect and convert files encoding. Works with files or directories. Can be used to add or remove UTF-8 BOM. Multi-platform. EncTool requires Java 1.5 or highter.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Transliterator between any Language files - Map Fonts, Create Encoding Scheme, Input Phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish or Any World Language Keyboard. Ex: [Phonetic Input]-[Any World Language Output] or ViceVersa.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    JTextEncoding
    Small Program which lets you convert the encoding of text files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Java classes that enable definition of new Charsets based on other existing Charsets, without additional programming. Includes a character set with Kamenik encoding.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    KBookReader is a Linux KDE twin-panel text files viewer specialized in reading e-books. It supports multiple bookmarks, custom fonts, encoding selection.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Provide a robust and efficient implementation of n-gram based classifiers to Java. N-Gram algorithms have shown to be surprisingly good at tasks like guessing the language/encoding from an arbitrary text file. And there are many more applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    TextSplitter is an open source GUI utility written in C# for .NET platform. Designed to split the text file into multiple files for a specified number of rows or the size of the resulting file. Encoding of input and output files can be selected.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Concrete Voice is a text to speech program. It can read the time, anounce weather, read text file, save text files to audio files, open any text file (supports all text encoding formats) and many more advance stuff!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    SandR is a Recursive Regex Search and Replacement utility. It works on files or directories recursively. It supports Java-style Regular Expression in Search terms; it supports auto-detection of character encoding of the files. SandR is written in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Have you ever encountered dynamic web page (php, asp.net...) problems as like being browsed full of question marks? This (currently) tiny utility is for you. It tries to correct the problem by changing the file encoding to unicode.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Simplest Qt based Text/Html editor, but able to autodetect many incomimg encodings (by 'enca' library) and import various file formats by external converters (wv, xlhtml, rtf2html, etc.). Just to have all in one GUI shell for sevearal well known tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    A Java application that attempts to make sense of the multiple encodings that exist for the representation of human languages on a computer. Helps ease the transition to Unicode as well as allow conversion among legacy formats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    The most powerful non-commercial translation memory software (TM tool) with enhanced capabilities, like networking/collaboration (http, rpc), encoding conversion, project management capabilities, email capability with attachments, file tree diff etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Old-Russian Ispell is a superset of A.I. Lebedev's rus-ispell package that enables spellchecking of texts in old Russian orthography (pre-1918). The KOI8-C encoding is used for representing old Russian characters. This package requires ispell 3.1.20+.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    EDGE (electronic document general encoding) is a SGML-based markup Language for general documents, incl. scientific papers, technical/computer documentation, prose, drama, etc. It aims to be less restricted than comparable DTDs (e.g. DocBook or TEI).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB