Showing 18 open source projects for "pdf to text"

  • 1
    Zotero PDF Translate

    Translate PDF, EPub, webpage, metadata, annotations, notes

    Zotero PDF Translate is a plugin for Zotero that enhances the research workflow by enabling in-app translation of PDFs, EPUBs, webpages, and associated metadata directly within the Zotero interface. It integrates seamlessly with Zotero’s document reader, allowing users to select text and instantly receive translations in a pop-up or side panel without leaving the application.
    Downloads: 21 This Week
  • 2
    tinypdf

    Minimal PDF creation library

    ...It also supports clickable links so generated documents can include interactive URLs, and it can create multi-page documents with custom page sizes. A notable convenience is built-in markdown-to-PDF conversion for common structures like headers and lists, letting you go from formatted text to a PDF layout quickly.
    Downloads: 3 This Week
  • 3
    canvas-editor

    Canvas-based WYSIWYG rich text editor with advanced layout tools

    canvas-editor is a browser-based rich text editor that renders content using HTML5 Canvas and SVG instead of traditional DOM-based approaches. It is designed to provide a WYSIWYG editing experience similar to word processors, enabling precise control over layout, rendering, and document structure. canvas-editor supports a wide range of formatting and document features, including text styling, tables, images, and embedded elements, all managed through a structured data model. ...
    Downloads: 2 This Week
  • 4
    LibPDF

    A modern PDF library for TypeScript

    LibPDF-js/core is a modern, TypeScript-first PDF processing library that provides a comprehensive toolkit for parsing, modifying, and generating PDF documents with a clean, intuitive API designed to handle real-world files safely and robustly. Unlike many existing JavaScript PDF libraries, it emphasizes lenient parsing that can gracefully handle malformed structures and fallback strategies where typical parsers fail, making it useful for production environments that encounter unpredictable file formats. ...
    Downloads: 0 This Week
  • 5
    OmniTools

    Self-hosted collection of powerful web-based tools for everyday tasks

    ...A key design choice is that file processing happens entirely on the client side, meaning your data stays in your browser instead of being sent to the backend. The tool catalog spans both technical and non-technical needs, including image, video, audio, PDF, text, date/time, math, and data format utilities like JSON/CSV/XML helpers. It’s also packaged for straightforward self-hosting, with a lightweight Docker image and simple run commands, so it can be deployed quickly on a homelab or internal network.
    Downloads: 8 This Week
  • 6
    Saladict

    Pop-up dictionary and page translator with multiple search modes

    Saladict is an all-in-one pop-up dictionary and page translator that supports multiple search modes, page translation, a new-word notebook, and PDF selection search. Search modes can be mixed freely: single-click, double-click, pop-up icon, hover, or keyboard shortcuts. The combination of dictionaries adjusts dynamically according to the search text and the active Saladict profile. Although Saladict is built for looking up English words, it also supports many other languages. ...
    Downloads: 2 This Week
  • 7
    OCRBase

    Markdown/JSON document OCR and structured data extraction API

    OCRBase is a self-hostable document OCR and structured extraction system built to turn PDFs into machine-usable outputs at scale, aiming to bridge the gap between raw text extraction and production-ready pipelines. Instead of treating OCR as a one-off script, it presents an API-driven workflow where documents are submitted as jobs and processed through a queue-based architecture that can handle high throughput. The core output is designed for downstream automation, producing structured...
    Downloads: 0 This Week
  • 8
    Logseq

    A privacy-first, open-source platform for knowledge management

    Logseq is a privacy-first, open-source knowledge base that works on top of local plain-text Markdown and Org-mode files. Use it to write, organize and share your thoughts, keep your to-do list, and build your own digital garden. Logseq is a platform for knowledge management and collaboration. It focuses on privacy, longevity, and user control. The server will never store or analyze your private notes. Your data are plain text files and we currently support both Markdown and Emacs Org-mode...
    Downloads: 13 This Week
  • 9
    beautiful-mermaid

    Render Mermaid diagrams as beautiful SVGs or ASCII art

    ...The project includes presets for different aesthetic styles, customization options for branding or themes, and rendering pipelines that can export to multiple formats such as SVG, PNG, and PDF.
    Downloads: 0 This Week
  • 10
    PowerPoint-ist

    Web presentation editor replicating many PowerPoint features online

    PPTist is a web-based presentation editing application designed to replicate many of the commonly used features found in traditional slide presentation software. It allows users to create, edit, and present slide decks directly within a web browser while maintaining a desktop-like editing experience. PPTist is built with Vue 3 and TypeScript and focuses on providing a highly interactive slide editing environment with extensive customization and extension potential. PPTist supports a wide...
    Downloads: 4 This Week
  • 11
    Resume-Matcher

    Improve your resumes with Resume Matcher

    Resume-Matcher is a command-line application that compares resumes against job descriptions using natural language processing. It provides a compatibility score based on keyword relevance and highlights areas where the resume aligns—or doesn't—with the target role. Designed for job seekers and HR professionals, it helps improve resume tailoring and streamlines candidate screening.
    Downloads: 0 This Week
  • 12
    Dify

    One API for plugins and datasets, one interface for prompt engineering

    Dify is an easy-to-use LLMOps platform designed to empower more people to create sustainable, AI-native applications. With visual orchestration for various application types, Dify offers out-of-the-box, ready-to-use applications that can also serve as Backend-as-a-Service APIs. Unify your development process with one API for plugins and datasets integration, and streamline your operations using a single interface for prompt engineering, visual analytics, and continuous improvement....
    Downloads: 14 This Week
  • 13
    PandaWiki

    AI-powered open source platform for building intelligent wiki bases

    PandaWiki is an open source knowledge base system designed to help users build intelligent documentation platforms powered by large language models. It combines traditional wiki functionality with modern AI capabilities, allowing teams and individuals to create and manage product documentation, technical manuals, FAQs, and blog-style knowledge resources. PandaWiki provides tools for managing knowledge bases through an administrative interface while also generating public-facing wiki sites...
    Downloads: 0 This Week
  • 14
    Linkwarden

    Self-hosted collaborative bookmark manager

    Linkwarden is a self-hosted, open-source bookmark manager built to help individuals and teams collect, organize, and preserve important web content in a way that stays useful long after the original pages change or disappear. Instead of saving only a URL, it captures durable archived formats so your saved knowledge remains accessible even when link rot happens. The experience is designed to feel like a modern “read-it-later” tool, with a reader view that makes long articles easier to consume...
    Downloads: 0 This Week
  • 15
    Ainee

    Ainee - AI Notetaking and Learning Companion

    Ainee is your ultimate AI-powered notetaking and learning companion. Capture lecture notes in real-time and effortlessly transform audio, text, files, and YouTube videos into formatted notes, mindmaps, quizzes, flashcards, podcasts, and more. Explore our AI meeting note taker, AI notes, video transcript generator, PDF to AI converter, and AI flashcard maker. Enhance your learning with our AI voice recorder, article summarizer AI, and AI quiz generator. Additionally, share your knowledge base with others to foster the flow of information and help new users benefit from collective insights. ...
    Downloads: 0 This Week
  • 16
    Advanced Slides for Obsidian

    Create markdown-based reveal.js presentations in Obsidian

    Advanced Slides is a slide deck extension for Obsidian that creates reveal.js presentations from markdown.
    Downloads: 0 This Week
  • 17
    PDF-LIB

    Create and modify PDF documents in any JavaScript environment

    Create PDF documents from scratch, or modify existing PDF documents. Draw text, images, and vector graphics. Embed your own fonts. Even embed and draw pages from other PDFs. Written in TypeScript and compiled to pure JavaScript with no native dependencies. Works in any JavaScript runtime, including browsers, Node, Deno, and even React Native.
    Downloads: 0 This Week
  • 18
    Polar Bookshelf

    A personal knowledge repository for PDF and web content

    Polar is an integrated reading environment to build your knowledge base. Actively read, annotate, connect thoughts, create flashcards, and track progress. Save time in creating flashcards in one click from text highlights. We use OpenAI's GPT-3 to automate this tedious process. Review them with our spaced repetition algorithm or sync them to Anki. Manage and save for later all your PDFs, EPUBs, and web pages in one place. Use tags, reading progress, and detailed document information to stay...
    Downloads: 5 This Week