Biber is a sophisticated bibliography processing backend for the LaTeX biblatex package. It supports a unsurpassed feature set for automated conformance to complex bibliography style requirements such as labelling, sorting and name handling. It has comprehensive Unicode support.
Change File Encoding is a utility that allows you to change the encoding of text files. For example, files saved in US-ASCII can be converted to UTF-8. Over 170 encodings are supported. Requires Java 1.8 or higher.
TeXML is an XML vocabulary for TeX. The processor transforms the TeXML markup into the TeX markup, escaping special and out-of-encoding characters. The intended audience is developers who automatically generate [La]TeX or ConTeXt files.
cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techinques. A command line executeable is shipped that allows to sort documents by codepage.
Secure File Transfer for Windows with Cerberus by Redwood
Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.
Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
Transliterator between any Language files - Map Fonts, Create Encoding Scheme, Input Phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish or Any World Language Keyboard. Ex: [Phonetic Input]-[Any World Language Output] or ViceVersa.
Java classes that enable definition of new Charsets based on other existing Charsets, without additional programming. Includes a character set with Kamenik encoding.
TextSplitter is an open source GUI utility written in C# for .NET platform. Designed to split the text file into multiple files for a specified number of rows or the size of the resulting file. Encoding of input and output files can be selected.
SandR is a Recursive Regex Search and Replacement utility. It works on files or directories recursively. It supports Java-style Regular Expression in Search terms; it supports auto-detection of character encoding of the files. SandR is written in Java.
Have you ever encountered dynamic web page (php, asp.net...) problems as like being browsed full of question marks? This (currently) tiny utility is for you. It tries to correct the problem by changing the file encoding to unicode.
A Java application that attempts to make sense of the multiple encodings that exist for the representation of human languages on a computer. Helps ease the transition to Unicode as well as allow conversion among legacy formats.
The most powerful non-commercial translation memory software (TM tool) with enhanced capabilities, like networking/collaboration (http, rpc), encoding conversion, project management capabilities, email capability with attachments, file tree diff etc.
Old-Russian Ispell is a superset of A.I. Lebedev's rus-ispell package that enables spellchecking of texts in old Russian orthography (pre-1918). The KOI8-C encoding is used for representing old Russian characters. This package requires ispell 3.1.20+.
EDGE (electronic document general encoding) is a SGML-based markup Language for general documents, incl. scientific papers, technical/computer documentation, prose, drama, etc. It aims to be less restricted than comparable DTDs (e.g. DocBook or TEI).