Search Results for "pdf language translator"

Showing 370 open source projects for "pdf language translator"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    XUnity Auto Translator

    XUnity Auto Translator

    Advanced translator plugin that can be used to translate Unity games

    XUnity.AutoTranslator is an advanced plugin framework designed to automatically translate text in Unity-based games in real time by intercepting and replacing in-game text during rendering. It works by injecting itself into the game process and leveraging external translation services such as Google Translate, DeepL, or custom APIs to dynamically convert text into a target language. The tool supports both automatic translation and manual translation workflows, allowing users to refine or...
    Downloads: 170 This Week
    Last Update:
    See Project
  • 3
    OpenDataLoader PDF

    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    OpenDataLoader PDF is an open-source document processing system designed to convert complex PDF files into structured, AI-ready formats such as Markdown, JSON, and HTML while preserving layout, hierarchy, and semantic meaning. It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes. The tool combines deterministic parsing...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    MarkPDFDown

    MarkPDFDown

    A high-quality PDF to Markdown tool based on large language model

    MarkPDFdown is an open-source document processing tool designed to convert PDF files into structured Markdown output that can be easily used for documentation, content pipelines, and AI processing workflows. The project focuses on extracting text, formatting, and structural information from complex PDF documents and transforming that information into clean Markdown that preserves the original hierarchy of headings, paragraphs, tables, and lists.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    fpdf2

    fpdf2

    Simple PDF generation for Python

    fpdf2 is a library for simple & fast PDF document generation in Python. It is a fork and the successor of PyFPDF. Compared with other PDF libraries, fpdf2 is fast, versatile, easy to learn and to extend (example). It is also entirely written in Python and has very few dependencies: Pillow, defusedxml, & fontTools. It is a fork and the successor of PyFPDF.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6
    PDFIO.jl

    PDFIO.jl

    PDF Reader Library for Native Julia.

    ...Every time, you get a new PDF file there is a possibility that it may not work to the best interpretation of the specification. A script-based language makes it easier for the consumers to quickly modify the code and enhance to their specific needs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    MetaScreener

    MetaScreener

    AI-powered tool for efficient abstract and PDF screening

    ...Instead of manually reviewing hundreds or thousands of documents, researchers can use MetaScreener to apply machine learning techniques that assist with classification and prioritization of candidate papers. The platform can analyze both abstracts and full PDF documents, enabling automated filtering based on research criteria defined by the user. By incorporating natural language processing techniques, the system can identify potentially relevant studies and reduce the workload associated with manual screening.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Glint Translator
    Glint Translator is a high-performance Windows application for real-time in-game and voice translation without interrupting gameplay. It supports 240+ languages using DeepL, Google, OpenAI, Azure, and Google Gemini models. The interface is available in 18 languages. Features • 3 Translation Modes: Fluent (parallel), Area (overlay), Full Screen (smart detection) • Speaker detection with color-coding • Glint AI custom terminology control • Game-based profile system • Advanced...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 9
    Easy DataSet

    Easy DataSet

    A powerful tool for creating datasets for LLM fine-tuning

    Easy DataSet is a comprehensive open-source tool designed to make creating high-quality datasets for large language model fine-tuning, retrieval-augmented generation (RAG), and evaluation as easy and automated as possible by providing intuitive interfaces and powerful parsing, segmentation, and labeling tools. It supports ingesting domain-specific documents in a wide range of formats — including PDF, Markdown, DOCX, EPUB, and plain text — and can intelligently segment, clean, and structure content into rich datasets tailored for downstream LLM training needs. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 10
    Sphinx

    Sphinx

    Main repository for the Sphinx documentation builder

    ...It was originally created for the Python documentation, and it has excellent facilities for the documentation of software projects in a range of languages. Of course, this site is also created from reStructuredText sources using Sphinx! HTML (including Windows HTML Help), LaTeX (for printable PDF versions), ePub, Texinfo, manual pages, plain text. Semantic markup and automatic links for functions, classes, citations, glossary terms and similar pieces of information. Easy definition of a document tree, with automatic links to siblings, parents and children. General index as well as a language-specific module index. Automatic highlighting using the Pygments highlighter. ...
    Downloads: 30 This Week
    Last Update:
    See Project
  • 11
    LandPPT

    LandPPT

    An LLM-based presentation generation platform

    LandPPT is an open-source AI platform that automatically generates professional presentation slides using large language models. The system allows users to create complete PowerPoint presentations simply by entering a topic or uploading source documents such as PDFs, Word files, or Markdown notes. Using natural language processing and structured content generation, the platform produces presentation outlines and converts them into fully formatted slide decks. The application integrates...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 12
    Cpp17

    Cpp17

    Chinese translation of C++17 The Complete Guide

    The repository is a Chinese translation of “C++17: The Complete Guide”, a comprehensive reference book originally by Nicolai M. Josuttis, adapted for learners and developers who prefer reading in Chinese. It includes a full LaTeX source, Markdown drafts, and compiled PDF/EPUB versions, allowing readers to either consume the translated text or regenerate formatted outputs themselves. The content is organized into multiple parts: basic language features (e.g. structured binding, inline variables, enhanced switch, lambdas), template and compile-time features (e.g. fold expressions, class template argument deduction, constexpr improvements), and the additions to the standard library (e.g. std::optional, std::variant, std::string_view, file system, concurrency, and parallel algorithms). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Asciidoc Editor based on JavaFX 20

    Asciidoc Editor based on JavaFX 20

    Asciidoc Editor and Toolchain written with JavaFX 19

    Asciidoc FX is a WYSIWYG editor for the Asciidoc markup language. You can build PDF, Epub, and HTML books, documents, and slides. Supported Operating Systems and Builds shows the list of available builds with links for reference. If you are looking for the very latest version, visit the link in the note above to be guaranteed of downloading the latest and greatest version of AsciidocFX. AsciidocFX converts documents via the AsciidoctorJ library.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Unredact

    Unredact

    A simple tool for reading in poorly redacted documents

    Unredact is a specialized tool that attempts to reconstruct redacted or obscured text in images, PDFs, or screenshots using a combination of image processing and generative AI inference to suggest plausible completions of blurred, black-boxed, or jumbled content. Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 15
    AnythingLLM

    AnythingLLM

    The all-in-one Desktop & Docker AI application with full RAG and AI

    A full-stack application that enables you to turn any document, resource, or piece of content into a context that any LLM can use as references during chatting. This application allows you to pick and choose which LLM or Vector Database you want to use as well as supporting multi-user management and permissions. AnythingLLM is a full-stack application where you can use commercial off-the-shelf LLMs or popular open-source LLMs and vectorDB solutions to build a private ChatGPT with no...
    Downloads: 133 This Week
    Last Update:
    See Project
  • 16
    Asciidoctor

    Asciidoctor

    A fast, open source text processor and publishing toolchain

    A fast text processor & publishing toolchain for converting AsciiDoc to HTML5, DocBook & more. Asciidoctor is a fast, open source, Ruby-based text processor for parsing AsciiDoc® into a document model and converting it to output formats such as HTML 5, DocBook 5, manual pages, PDF, EPUB 3, and other formats. Asciidoctor also has an ecosystem of extensions, converters, build plugins, and tools to help you author and publish content written in AsciiDoc. Asciidoctor reads the AsciiDoc source,...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    Xournal++

    Xournal++

    A handwriting notetaking software with PDF annotation support

    Xournal++ (/ˌzɚnl̟ˌplʌsˈplʌs/) is an open-source and cross-platform note-taking software that is fast, flexible, and functional. A modern rewrite and a more feature-rich version of the wonderful Xournal program. Support for pressure-sensitive stylus and drawing tablets (Wacom, Huion, XP-Pen, etc.) Robust and customizable pen, highlighter and eraser tools, allowing you to write how you want to write. Use layers to make complex notes that are still pleasant to work with. Keep track of the...
    Downloads: 60 This Week
    Last Update:
    See Project
  • 18
    Khoj

    Khoj

    An AI personal assistant for your digital brain

    Get more done with your open-source AI personal assistant. Khoj is a desktop application to search and chat with your notes, documents, and images. It is an offline-first, open-source AI personal assistant that is accessible from Emacs, Obsidian or your Web browser. Khoj is a thinking tool that is transparent, fun, and easy to engage with. You can build faster and better by using Khoj to search and reason across all your data sources. Khoj learns from your notes and documents to function as...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 19

    Alphabet Language Database

    Maps over 30 Hebrew Fonts in a one to many relationship.

    Alphabet Language Database or Hebrew Font Mapper Create an open database of all language's alphabets and words. The spreadsheet is based on the Hebrew Aleph Bet. The correct fonts must be installed on your system to view properly. Future Goals - Dictionary / Thesaurus / Translator - Better font integration with database https://alphabet-language-database.sourceforge.io
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    Eng2BN CSV Translator

    Translate English to Bangla using CSV file format and range wise.

    Eng2BN CSV Translator user-friendly Python tool that enables efficient translation of English text to Bangla within CSV files. The application supports large datasets and allows users to translate specific row ranges, making it ideal for batch processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    DB-GPT

    DB-GPT

    Revolutionizing Database Interactions with Private LLM Technology

    DB-GPT is an experimental open-source project that uses localized GPT large models to interact with your data and environment. With this solution, you can be assured that there is no risk of data leakage, and your data is 100% private and secure.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Typst LSP

    Typst LSP

    A brand-new language server for Typst, plus a VS Code extension

    A brand-new language server for Typst. Syntax highlighting, error reporting, code completion, and function signature help. Compiles to PDF on save (configurable to as-you-type, or can be disabled)
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Decaleon

    Decaleon

    Multilingual Esperanto Translator, Word Dictionary, Vocabulary Trainer

    Esperanto Translator - translates sentences from 12 languages to Esperanto - VB.NET project based on XML - 30-60,000 words in each language: extensible Vocabulary for English, French, Spanish, Portoguese, Italian, German, Danish, Swedish, Greek, Russian, Polish and Croatian including context and synonym data as well as semantic group information.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    Dasar Pemrograman Golang

    Dasar Pemrograman Golang

    Source Code E-book Dasar Pemrograman Golang

    Goal, or Go is a programming language that was born in 2009. The goal has many advantages, as evidenced by a large number of large companies using this language in the development of their products, up to the level of production of course. This ebook is one of the many references that can be used as learning material for Go programming. The topics provided vary greatly from basic ( from 0), to adventurous chapters. Basic Go Programming topics discussed are very basic, and suitable for people...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    text-extract-api

    text-extract-api

    Document (PDF, Word, PPTX ...) extraction and parse API

    text-extract-api is an open-source service designed to extract readable text from a wide variety of document formats through a simple API interface. The project focuses on converting complex files such as PDFs, images, scanned documents, and office files into structured plain text that can be processed by downstream applications or language models. Instead of requiring developers to integrate multiple document parsing libraries individually, the system centralizes text extraction...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB