Showing 32 open source projects for "pdf unicode"

  • 1
    Tesseract OCR

    Open Source OCR Engine

    ...Tesseract can recognize over 100 languages out-of-the-box, and can be trained to recognize other languages. It supports various output formats, including plain text, HTML, PDF and more. It also has Unicode (UTF-8) support.
    Downloads: 34,935 This Week
  • 2
    TeXworks

    A simple interface for working with TeX documents

    TeXworks is a free and simple working environment for authoring TeX (LaTeX, ConTeXt and XeTeX) documents. Inspired by Dick Koch's award-winning TeXShop program for Mac OS X, it makes entry into the TeX world easier for those using desktop operating systems other than OS X. It provides an integrated, easy-to-use environment for users on other platforms, particularly GNU/Linux and Windows, and features a clean, simple interface accessible to casual and non-technical users.
    Downloads: 73 This Week
  • 3
    fpdf2

    Simple PDF generation for Python

    fpdf2 is a library for simple and fast PDF document generation in Python. It is a fork and the successor of PyFPDF. Compared with other PDF libraries, fpdf2 is fast, versatile, and easy to learn and to extend. It is also entirely written in Python and has very few dependencies: Pillow, defusedxml, and fontTools.
    Downloads: 6 This Week
  • 4
    ripgrep

    Regex pattern directory search tool that respects your .gitignore

    ...By default, ripgrep will respect your .gitignore and skip hidden files or directories and binary files automatically. ripgrep has first class support on Windows, macOS and Linux, with binary downloads available for every release. ripgrep is similar to other popular search tools like The Silver Searcher, ack and grep. ripgrep supports arbitrary input preprocessing filters which could be PDF text extraction, less supported decompression, decrypting, automatic encoding detection and so on. In other words, use ripgrep if you like speed, filtering by default, fewer bugs and Unicode support.
    Downloads: 94 This Week
  • 5
    TEXminer

    Text Mining Classification for Texts in ASCII, Unicode and PDF Format.

    TEXminer uses generic Text Mining Methods to analyze Unicode Files as plain Text or PDF. The Text Database can be saved in XML where the original Text, the Sentence and Word Lists and additional Parameters (e.g. Abbreviations) are stored. TEXminer allows Language Detection by Letter Frequency Analysis, finding important Words by Cooccurrence Analysis, Determination of Central Expressions, Thematic Text Classification (also Semantic Groups), Fingerprint Comparison and Word Frequency. ...
    Downloads: 2 This Week
  • 6
    Chordii

    Easy lead sheets from text input

    ChordPro creates elegant, staffless lead sheets for musicians needing only chords and lyrics. It processes plain text input in ChordPro format and is a rewrite of the old though still popular Chord/Chordii programs.
    Downloads: 28 This Week
  • 7
    Decaleon

    Multilingual Esperanto Translator, Word Dictionary, Vocabulary Trainer

    Esperanto Translator - translates sentences from 12 languages to Esperanto - VB.NET project based on XML - 30-60,000 words in each language: extensible Vocabulary for English, French, Spanish, Portuguese, Italian, German, Danish, Swedish, Greek, Russian, Polish and Croatian including context and synonym data as well as semantic group information. It includes an any-to-any multilingual Dictionary and Vocabulary Trainer (learning levels/logic themes/semantic groups) and uses UTF8/Unicode Characters; Export into Text, HTML, TeX, PDF Files; Text files may be imported into other Vocabulary Training Software. Version 6.0 adds many standard words and supports another 23 Languages; a small Vocabulary for: Albanian, Bulgarian, Czech, Dutch, Finnish, Hungarian, Norwegian, Romanian, Serbian, Slovak, Slovene, Turkish, Ukrainian, Interslavic, Arabic and Asian Languages. ...
    Downloads: 0 This Week
  • 8
    Msc-generator

    Draws signalling charts, block diagrams and graphs from text input.

    NOTE! We have moved to https://gitlab.com/msc-generator/msc-generator and all development happens there. Also, download new releases and submit issues there. A tool to draw various charts from textual descriptions. Currently, three types of charts are supported: Message Sequence Charts, generic Graphs, and Block Diagrams, with more to be added in the future. There is a command-line version for Linux and Mac (replacing mscgen), which now sports a GUI as well. Msc-generator allows fine...
    Downloads: 21 This Week
  • 9
    Elementary Algorithms

    Book of elementary algorithms and data structures

    This book introduces elementary algorithms and data structures. It includes side-by-side comparisons of purely functional realizations and their imperative counterparts. From 2020/12, I started re-writing this book. The PDF can be downloaded for preview (EN, 中文). The 1st edition in Chinese (中文) was published in 2017. I recently switched my focus to the Mathematics of programming; the new book is also available in (github). To build the book in PDF format from the sources, you need the following software pre-installed: TeXLive. The book is built with XeLaTeX, a Unicode friendly version of TeX. ...
    Downloads: 1 This Week
  • 10
    TextSeek

    Professional full-text desktop search tool

    TextSeek is a professional full-text desktop search tool. Unlike filename search tools such as Everything and Listary, TextSeek can search both filenames and file contents easily and quickly. It supports PDF, Word, Excel, PowerPoint, RTF and other formats. The software runs directly; no extra packages need to be installed.
    Downloads: 7 This Week
  • 11
    mPDF

    PHP library generating PDF files from UTF-8 encoded HTML

    mPDF is a PHP library that generates PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files ‘on-the-fly’ from his website, handling different languages. It is slower than the original scripts, e.g. HTML2FPDF, and produces larger files when using Unicode fonts, but support for CSS styles etc. has been much enhanced. Supports almost all languages including RTL (Arabic and Hebrew), and CJK (Chinese-Japanese-Korean).
    Downloads: 82 This Week
  • 12
    PoDoFo

    A PDF parsing, modification and creation library.

    The PoDoFo library is a free, portable C++ library. It can parse and modify existing PDF files and create new ones from scratch. It also includes several tools to work with PDF files. It features a unique approach which provides access to PDF documents via an object tree. Therefore, PDFs can be created and/or manipulated using a simple tree structure. Development of PoDoFo has moved to GitHub (https://github.com/podofo/podofo). Please raise new issues in the GitHub project.
    Downloads: 77 This Week
  • 13
    printpdf

    Rust / WASM library for reading, writing and rendering PDF

    ...It includes advanced typography capabilities such as character spacing, scaling, superscript, and subscript, as well as support for Unicode text. printpdf also offers optimization features like font subsetting to reduce file size, making generated PDFs more efficient. Experimental capabilities include rendering PDF pages to SVG and extracting text content, expanding its use cases beyond simple document generation.
    Downloads: 0 This Week
  • 14
    PHP Pdf creation - R&OS
    MOVED TO GITHUB https://github.com/ole1986/pdf-php
    Downloads: 2 This Week
  • 15
    multipractice

    Tool for practicing languages.

    Practice makes perfect. Panglossa MultiPractice is a tool to help you learn and practice languages. You can create your own courses, import courses created by others, and even export courses to HTML or PDF documents. The original project (for Lazarus) was basically a flashcard app. Now it is more like a platform for creating structured courses with text, images, audio and video content, as well as different types of exercises. Please keep in mind that this project is created and...
    Downloads: 0 This Week
  • 16
    TCPDF - PHP class for PDF

    PHP class for PDF

    TCPDF is a PHP class for generating PDF documents without requiring external extensions. TCPDF supports UTF-8, Unicode, RTL languages, XHTML, JavaScript, digital signatures, barcodes and much more. IMPORTANT: This version will soon be marked as deprecated and replaced by a new version currently under development: https://github.com/tecnickcom/tc-lib-pdf
    Downloads: 135 This Week
  • 17
    purePDF

    A complete actionscript PDF library

    purePDF is a complete PDF library for ActionScript designed as a port of the Java iText 1 library. It allows developers to both create and read PDF documents directly from running SWF files, enabling seamless integration of PDF functionality into Flash-based applications. The library supports a wide range of PDF features, including advanced text rendering, graphics, gradients, transparency, annotations, and interactive forms. It also provides robust support for images, fonts, metadata,...
    Downloads: 1 This Week
  • 18
    SICP PDF

    SICP PDF with Texinfo and LaTeX source

    ...This conversion allows for high-quality typesetting, improved design options, and the integration of OpenType and Unicode features through XeTeX. The repository contains both Texinfo and LaTeX sources, with automated scripts to keep them in sync during builds. Users can recompile the book locally with a recent TeX Live distribution and the necessary fonts, while Inkscape is required for SVG-to-PDF image conversions. The project is continuously refined to address formatting issues and ensure the text and figures render correctly across platforms.
    Downloads: 0 This Week
  • 19
    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Downloads: 105 This Week
  • 20
    suiviSIO2

    Personalized tracking tool for managing skills in the BTS SIO program

    This tool manages the cohorts, teachers, and students of the BTS SIO program, for formatting and tracking the work situations that correspond to the skills in the BTS SIO reference framework. 2017-02-02 SuiviSIO2 v2017.4 2017-01-22 SuiviSIO2 v2017.3
    Downloads: 0 This Week
  • 21
    Quick Hash GUI

    Linux, Windows and Apple Mac File Hashing GUI Tool

    This project has moved to www.quickhash-gui.org as of 2016-12-04. I kept v2.6.9.2 and below hosted here since Dec 16 but too many people were ignoring the fact that no updates were being posted here. For the latest QuickHash v2.8.4 release (Aug 28th 2017), go to www.quickhash-gui.org, and note that as of 29/12/16 a Debian package is also available
    Downloads: 0 This Week
  • 22
    moderncv

    A modern curriculum vitae class for LaTeX

    moderncv is a LaTeX class for producing clean, contemporary CVs and résumés with minimal effort and consistent typography. It ships with multiple style variants and color themes, plus high-level macros for entries (education, experience, skills, publications) so you can focus on content rather than layout. The class supports optional headshots, icons, sidebars, and fine-grained spacing controls to adapt the look from compact academic to more visual design-oriented profiles. Because it’s...
    Downloads: 0 This Week
  • 23
    xstdgreek

    A bug-free, idiot-proof Greek language environment for XeLaTeX/LuaLaTeX

    Provides a bug-free, idiot-proof Greek language environment for Unicode-enabled LaTeX engines like XeLaTeX and LuaLaTeX. * The project intends to be supported and controlled by its members as a typical FSF project at sourceforge.net. * This project intends to standardize the Greek macros and Greek usage in Unicode-enabled LaTeX (for example: the 'ano teleia' is the Greek 'semi colon' but there is no standard macro!). * A solution to any Greek related problem. Our goal is to fix any such...
    Downloads: 2 This Week
  • 24
    WikiDict English Sinhala Dictionary

    English Sinhala Dictionary

    English To Sinhala Translate, Sinhala To English Translate, Full Text Translate, Results To Speech, PDF Reader, Mini Dictionary Mode, Search Bookmark, Real Time Search Suggestion, Search Mode, Sinhala Unicode Keyboard, Single Word Search, Results To Mail, Over 200,000 word database
    Downloads: 0 This Week
  • 25
    Odia Unicode Converter
    This is a Unicode converter for text typed in Odia ISCII characters. A handy tool with a simple UI: just copy, click, paste. Currently it supports the OR-ttSarala font only. Most non-Unicode Odia articles are available in PDF format, and text copied directly from a PDF does not include font properties. As a result, handling bilingual documents with this software is tedious, because English text is also converted to irrelevant Odia Unicode text.
    Downloads: 0 This Week