Showing 44 open source projects for "pdf unicode"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Tesseract OCR

    Tesseract OCR

    Open Source OCR Engine

    ...Tesseract can recognize over 100 languages out-of-the-box, and can be trained to recognize other languages. It supports various output formats, including plain text, HTML, PDF and more. It also has unicode (UTF-8) support.
    Downloads: 31,492 This Week
    Last Update:
    See Project
  • 2
    TeXworks

    TeXworks

    A simple interface for working with TeX documents

    TeXworks is a free and simple working environment for authoring TeX (LaTeX, ConTeXt and XeTeX) documents. Inspired by Dick Koch's award-winning TeXShop program for Mac OS X, it makes entry into the TeX world easier for those using desktop operating systems other than OS X. It provides an integrated, easy-to-use environment for users on other platforms particularly GNU/Linux and Windows and features a clean, simple interface accessible to casual and non-technical users.
    Downloads: 64 This Week
    Last Update:
    See Project
  • 3
    fpdf2

    fpdf2

    Simple PDF generation for Python

    fpdf2 is a library for simple & fast PDF document generation in Python. It is a fork and the successor of PyFPDF. Compared with other PDF libraries, fpdf2 is fast, versatile, easy to learn and to extend (example). It is also entirely written in Python and has very few dependencies: Pillow, defusedxml, & fontTools. It is a fork and the successor of PyFPDF.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    ripgrep

    ripgrep

    Regex pattern directory search tool that respects your .gitignore

    ...By default, ripgrep will ignore your .gitignore and skip hidden files or directories and binary files automatically. ripgrep has first class support on Windows, macOS and Linux, with binary downloads available for every release. ripgrep is similar to other popular search tools like The Silver Searcher, ack and grep. ripgrep supports arbitrary input preprocessing filters which could be PDF text extraction, less supported decompression, decrypting, automatic encoding detection and so on. In other words, use ripgrep if you like speed, filtering by default, fewer bugs and Unicode support.
    Downloads: 92 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 5
    TEXminer

    TEXminer

    Text Mining Classification for Texts in ASCII, Unicode and PDF Format.

    TEXminer uses generic Text Mining Methods to analyze Unicode Files as plain Text or PDF. The Text Database can be saved in XML where the orginal Text, the Sentence and Word Lists and additional Parameters (e.g. Abbreviations) are stored. TEXminer allows Language Detection by Letter Frequency Analysis, finding important Words by Cooccurrence Analysis, Determination of Central Expressions, Thematic Text Classification (also Semantic Groups) Fingerprint Comparison and Word Frequency. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    shortcutnotes

    shortcutnotes

    copy but NO Paste and make presentations with PDF support.

    ...It features a clean, modern GUI with dark/light theme toggling and adjustable font sizes. Users can copy text from the clipboard, automatically add serial numbers, and organize content in a text area with support for Unicode, including Hindi text and emojis. The app allows saving notes as PPT, PDF, or TXT files, ensuring text formatting is preserved. It includes a slide preview feature for PowerPoint slides, enabling easy content management and organization. The project also incorporates settings management through a JSON configuration file for a personalized user experience.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Chordii

    Chordii

    Easy lead sheets from text input

    ChordPro creates elegant, stafless lead sheets for musicians needing only chords and lyrics. It processes plain text input in ChordPro format and it is a rewrite of the old though still popular Chord/Chordii programs.
    Leader badge
    Downloads: 25 This Week
    Last Update:
    See Project
  • 8
    Decaleon

    Decaleon

    Multilingual Esperanto Translator, Word Dictionary, Vocabulary Trainer

    Esperanto Translator - translates sentences from 12 languages to Esperanto - VB.NET project based on XML - 30-60,000 words in each language: extensible Vocabulary for English, French, Spanish, Portoguese, Italian, German, Danish, Swedish, Greek, Russian, Polish and Croatian including context and synonym data as well as semantic group information. It includes an any-to-any multilingual Dictionary and Vocabulary Trainer (learning niveaus/logic themes/semantic groups) and uses UTF8/Unicode Characters; Export into Text, HTML, TeX, PDF Files; Text files may be imported in other Vocabulary Training Software. Version 6.0 gives a big addition of standard words and supports another 23 Languages; a small Vocabulary for: Albanian, Bulgarian, Czech, Dutch, Finnish, Hungarian, Norwegian, Romanian, Serbian, Slovak, Slovene, Turkish, Ukrainian, Interslavic, Arabian and Asian Languages. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Msc-generator

    Msc-generator

    Draws signalling charts, block diagrams and graphs from text input.

    NOTE! We have moved to https://gitlab.com/msc-generator/msc-generator All development happens there. Also, download new releases & submit issues there. A tool to draw various charts from textual descriptions. Currently, three types of charts are supported: Message Sequence Charts, generic Graphs, and Block Diagrams, with more to be added in the future. There is a command-line version for Linux and Mac (replacing mscgen), which now sports a GUI, as well. Msc-generator allows fine...
    Leader badge
    Downloads: 20 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 10
    Elementary Algorithms

    Elementary Algorithms

    Book of elementary algorithms and data structures

    This book introduces elementary algorithms and data structure. It includes side-by-side comparison of purely functional realization and their imperative counterpart. From 2020/12, I started re-writing this book. The PDF can be downloaded for preview (EN, 中文). The 1st edition in Chinese (中文) was published in 2017. I recently switched my focus to the Mathematics of programming, the new book is also available in (github). To build the book in PDF format from the sources, you need the following software pre-installed, TeXLive, The book is built with XeLaTeX, a Unicode friendly version of TeX. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    TextSeek

    TextSeek

    Professional full-text desktop search tool

    TextSeek is a professional full-text desktop search tool. Unlike the filename search tool like Everything and Listary, TextSeek can search filename and file content easily and quickly. It supports PDF, Word, Excel, Powerpoint, RTF and other formats. The software can run directly, and no extra package is required to install.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12

    YAppaRi Seyali

    YAppaRi Seyali is a Tamil Prosidy Analyzer

    யாப்பறி செயலி This is a java interfaced UI for the Tamil Prosidy Analyzer It is in Beta version and expected to have some bugs.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 13
    mPDF

    mPDF

    PHP library generating PDF files from UTF-8 encoded HTML

    mPDF is a PHP library that generates PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files ‘on-the-fly’ from his website, handling different languages. It is slower than the original scripts e.g. HTML2FPDF and produces larger files when using Unicode fonts, but support for CSS styles etc. and has been much enhanced. Supports almost all languages including RTL (Arabic and Hebrew), and CJK (Chinese-Japanese-Korean).
    Downloads: 75 This Week
    Last Update:
    See Project
  • 14

    PoDoFo

    A PDF parsing, modification and creation library.

    The PoDoFo library is a free, portable C++ library. It can parse and modify existing PDF files and create new ones from scratch. It also includes several tools to work with PDF files. It features an unique approach which provides access to PDF documents via an object tree. Therefore, PDFs can be created and or manipulated using a simple tree structure. Development of PoDoFo has been moved to GitHub: https://github.com/podofo/podofo Please raise new issues in the GitHub project.
    Leader badge
    Downloads: 73 This Week
    Last Update:
    See Project
  • 15
    printpdf

    printpdf

    Rust / WASM library for reading, writing and rendering PDF

    ...It includes advanced typography capabilities such as character spacing, scaling, superscript, and subscript, as well as support for Unicode text. printpdf also offers optimization features like font subsetting to reduce file size, making generated PDFs more efficient. Experimental capabilities include rendering PDF pages to SVG and extracting text content, expanding its use cases beyond simple document generation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    PHP Pdf creation - R&OS
    MOVED TO GITHUB https://github.com/ole1986/pdf-php
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    multipractice

    multipractice

    Tool for practicing languages.

    Practice makes perfect. Panglossa MultiPractice is a tool to help you learn and practice languages. You can create your own courses, import courses created by others, and even export courses to HTML or PDF documents. The original project (for Lazarus) was basically a flashcard app. Now it is more like a platform for creating structured courses with text, images, audio and video content, as well as different types of exercises. Please keep in mind that this project is created and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    XL-Parser

    XL-Parser

    XL-Parser is a tool for data extraction and analysis.

    XL-Parser provides a bunch of functions for data extraction and analysis. It also provides web log analysis features like a tool for detection of suspicious activities. More details and screenshots on http://le-tools.com.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 19
    TCPDF - PHP class for PDF

    TCPDF - PHP class for PDF

    PHP class for PDF

    TCPDF is a PHP class for generating PDF documents without requiring external extensions. TCPDF Supports UTF-8, Unicode, RTL languages, XHTML, Javascript, digital signatures, barcodes and much more. IMPORTANT: This version will be soon marked as deprecated and replaced by a new version currently under development: https://github.com/tecnickcom/tc-lib-pdf
    Leader badge
    Downloads: 132 This Week
    Last Update:
    See Project
  • 20
    purePDF

    purePDF

    A complete actionscript PDF library

    purePDF is a complete PDF library for ActionScript designed as a port of the Java iText 1 library. It allows developers to both create and read PDF documents directly from running SWF files, enabling seamless integration of PDF functionality into Flash-based applications. The library supports a wide range of PDF features, including advanced text rendering, graphics, gradients, transparency, annotations, and interactive forms. It also provides robust support for images, fonts, metadata,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Maktab-e-Shamila is an online website and software which hosts thousands of Islamic books in Arabic in multiple formats including online searching, PDF and BOK formats This application will be able to take BOK file and will provide multiple output options. Initially it'll support export to SQL Server database. Later on it'll support HTML Book format (https://oreillymedia.github.io/HTMLBook/)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    SICP PDF

    SICP PDF

    SICP PDF with Texinfo and LaTeX source

    ...This conversion allows for high-quality typesetting, improved design options, and the integration of OpenType and Unicode features through XeTeX. The repository contains both Texinfo and LaTeX sources, with automated scripts to keep them in sync during builds. Users can recompile the book locally with a recent TeX Live distribution and the necessary fonts, while Inkscape is required for SVG-to-PDF image conversions. The project is continuously refined to address formatting issues and ensure the text and figures render correctly across platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Leader badge
    Downloads: 93 This Week
    Last Update:
    See Project
  • 24
    NOTE: The projects is now being developed here: https://github.com/eugmes/fntsample A program for making font samples that show Unicode coverage of the font and are similar in appearance to Unicode charts. Samples can be saved as PDF or PostScript files.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    Kindle English Sinhala Dictionary

    Kindle English Sinhala Dictionary

    Sinhala English Dictionary for kindle ebook readers

    මෙම ශබ්දකෝෂය සංස්කරණ දෙකකින් යුක්තය. 1) English Sinhala Dictionary_Uni.mobi (Unicode version) 2) English Sinhala Dictionary_PV.mobi (Singlish/phonetic version) යුනිකේත සංස්කරණය භාවිත කිරීමට නම් ඔබේ කින්ඩ්ල් රීඩරයට iskolapotha font ගොණුව ස්ථාපනය කල යුතු වේ. (නිවැරදි තාක්ෂණික දැනුමකින් තොරව එසේ කිරීම අනුමත කල නොහැක.) Singlish සංස්කරණය භාවිත කිරීමට එවැනි කිසිවක් කල යුතු නොවන අතර English Sinhala Dictionary_PV.mobi ගොණුව සාමාන්‍ය PDF වැනි ගොණුවක් ලෙසම කින්ඩ්ල් රීඩරයට කොපි කරන්න.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next