Showing 31 open source projects for "aoo-pdf-import"

  • 1
    Scribe.js

    JavaScript OCR and text extraction for images and PDFs

    Scribe.js is a JavaScript library that provides Optical Character Recognition (OCR) and text extraction capabilities for both images and PDF documents, aimed at developers who want to build OCR features directly into their applications. The library can take image files (such as PNG or JPEG) and recognize the text they contain, and it can also extract text from PDF files that either already contain text or are image-based scans, using modern web standards and WebAssembly under the hood. ...
    Downloads: 1 This Week
  • 2
    PandaWiki

    AI-powered open source platform for building intelligent wiki bases

    PandaWiki is an open source knowledge base system designed to help users build intelligent documentation platforms powered by large language models. It combines traditional wiki functionality with modern AI capabilities, allowing teams and individuals to create and manage product documentation, technical manuals, FAQs, and blog-style knowledge resources. PandaWiki provides tools for managing knowledge bases through an administrative interface while also generating public-facing wiki sites...
    Downloads: 0 This Week
  • 3
    Desktop Commander MCP

    AI-powered MCP server for desktop file and terminal automation

    ...It allows users to run terminal commands with streaming output, manage long-running processes, and even execute code in memory without saving files. It also supports working with structured and document formats such as Excel, PDF, and DOCX, enabling AI to read, modify, and generate these files directly.
    Downloads: 5 This Week
  • 4
    Jimp

    An image processing library written entirely in JavaScript for Node

    An image processing library for Node written entirely in JavaScript, with zero native dependencies. If you're using this library with TypeScript, the method of importing differs slightly from JavaScript. Instead of using require, you must import it with the ES6 default import syntax. If you're using a web bundler (webpack, rollup, parcel), you can benefit from using the module build of jimp. Using the module build will allow your bundler to understand your code better and exclude things you aren't using. If you're using webpack, you can set process.browser to true and your build of jimp will exclude certain parts, making it load faster. ...
    Downloads: 5 This Week
  • 5
    canvas-editor

    Canvas-based WYSIWYG rich text editor with advanced layout tools

    ...Its architecture is modular, allowing developers to extend functionality through plugins, custom commands, and event hooks. It includes support for page-based layouts with headers, footers, pagination, and print-ready output, including PDF generation. It also provides interactive components such as form controls and context menus, making it suitable for building complex document editing systems.
    Downloads: 2 This Week
  • 6
    Easy DataSet

    A powerful tool for creating datasets for LLM fine-tuning

    Easy DataSet is a comprehensive open-source tool designed to make creating high-quality datasets for large language model fine-tuning, retrieval-augmented generation (RAG), and evaluation as easy and automated as possible by providing intuitive interfaces and powerful parsing, segmentation, and labeling tools. It supports ingesting domain-specific documents in a wide range of formats — including PDF, Markdown, DOCX, EPUB, and plain text — and can intelligently segment, clean, and structure content into rich datasets tailored for downstream LLM training needs. The system includes automated question-generation capabilities, hierarchical label trees, and answer generation pipelines that use LLM APIs to produce coherent paired data with customizable templates. ...
    Downloads: 2 This Week
  • 7
    AnythingLLM

    The all-in-one Desktop & Docker AI application with full RAG and AI

    A full-stack application that enables you to turn any document, resource, or piece of content into a context that any LLM can use as references during chatting. This application allows you to pick and choose which LLM or Vector Database you want to use as well as supporting multi-user management and permissions. AnythingLLM is a full-stack application where you can use commercial off-the-shelf LLMs or popular open-source LLMs and vectorDB solutions to build a private ChatGPT with no...
    Downloads: 101 This Week
  • 8
    Open Semantic Search

    Open source semantic search and text analytics for large document sets

    ...Open Semantic Search includes an ETL framework that can ingest documents, process them through analysis steps, and enrich the data with extracted information such as named entities and metadata. It also supports optical character recognition to extract text from images and scanned documents, including images embedded inside PDF files. It integrates text mining and analytics capabilities that allow users to examine relationships, topics, and structured data within document collections.
    Downloads: 5 This Week
  • 9
    ChatLab

    Local-first AI chat analysis tool for insights from conversation data

    ChatLab is an open source desktop application designed to help users analyze and better understand their personal chat histories through structured data exploration and AI-assisted insights. It enables users to import chat exports from multiple messaging platforms and transform them into a unified data model for consistent analysis. By combining a flexible SQL engine with AI agents, the tool allows users to query, summarize, and explore conversation patterns in a more interactive and intelligent way. ChatLab emphasizes a local-first approach, meaning all chat data is processed and stored on the user’s device rather than being uploaded to external servers. ...
    Downloads: 4 This Week
  • 10
    canvas-constructor

    An ES6 utility for canvas with built-in functions and chained methods

    An ES6 utility for canvas with built-in functions and chained methods. Alternatively, you can import canvas-constructor/browser. That will create a canvas 300 pixels wide and 300 pixels high. Set the color to #AEFD54. Draw a rectangle with the previous color, covering all the pixels from (5, 5) to (290 + 5, 290 + 5). Set the color to #FFAE23. Set the font size to 28 pixels with font Impact. Write the text 'Hello World!'.
    Downloads: 0 This Week
  • 11
    LandPPT

    An LLM-based presentation generation platform

    LandPPT is an open-source AI platform that automatically generates professional presentation slides using large language models. The system allows users to create complete PowerPoint presentations simply by entering a topic or uploading source documents such as PDFs, Word files, or Markdown notes. Using natural language processing and structured content generation, the platform produces presentation outlines and converts them into fully formatted slide decks. The application integrates...
    Downloads: 1 This Week
  • 12
    Claude Code Config

    My personal Claude Code configuration

    Claude Code Config is a highly customizable personal configuration repository for Claude Code, containing tailored rules, hooks, agents, skills, and commands meant to enhance the coding assistant experience. The project centralizes configuration files that instruct Claude Code how to behave in different contexts, automating repetitive tasks and enforcing coding patterns across languages or project types. Its rulesets can apply path-scoped conventions (such as for TypeScript or test files),...
    Downloads: 0 This Week
  • 13
    myGPTReader

    AI Slack bot for reading, summarizing, and chatting with content

    myGPTReader is an AI-powered Slack bot designed to help users read, summarize, and interact with various types of digital content through conversational interfaces. It enables users to quickly understand web pages, documents, and even video content by transforming them into interactive discussions rather than static reading experiences. myGPTReader supports a wide range of file formats, including eBooks, PDFs, and text-based documents, making it flexible for both casual and professional use...
    Downloads: 1 This Week
  • 14
    DocsGPT

    Private AI platform for agents, enterprise search and RAG pipelines

    DocsGPT is an open-source AI platform for deploying private RAG pipelines, AI agents, and enterprise search on your own infrastructure. Connect any data source (PDFs, DOCX, CSV, Excel, HTML, audio, GitHub, databases, URLs) and get accurate, hallucination-free answers with source citations. Choose your LLM: OpenAI, Anthropic, Google Gemini, or local models. Works with Qdrant, MongoDB, Elasticsearch, and more. Deploy via Docker or Kubernetes with full data sovereignty. Build...
    Downloads: 4 This Week
  • 15
    AIGCPanel

    One-stop AI digital human system with video voice synthesis tools

    AIGCPanel is an open source desktop application designed as a comprehensive, all-in-one platform for creating AI-powered digital humans and media content. It integrates multiple capabilities such as video synthesis, voice synthesis, and voice cloning into a unified interface, allowing users to generate realistic audiovisual outputs with minimal setup. AIGCPanel focuses heavily on simplifying the management of local AI models by providing streamlined workflows for importing, configuring, and...
    Downloads: 0 This Week
  • 16
    MyBox

    Easy Tools of PDF, Image, File, Network, Data, and Medias

    Keywords: javafx-desktop-apps, pdf, image, ocr, icc, barcode, color-palette, text, bytes, markdown, html, archive, compress, digest, video, audio, editor, converter, media.
    https://github.com/Mararsh/MyBox
    Self-contained packages need neither a Java environment nor installation. Jar packages need Java 16 or higher.
    Downloads: 4 This Week
  • 17
    CodinIT.dev

    Free, local, open-source AI app builder

    CodinIT.dev is a free, local, open source AI app builder that lets you go from idea to full-stack application entirely on your machine, no coding required, just chat with AI. You can build unlimited apps with real-time previews, instant undo, and responsive, frictionless workflows. Deep Supabase integration means you can create UI and backend logic in one cohesive environment, while the model-agnostic architecture lets you connect to any AI, whether cloud-based (Gemini 3 Pro, GPT-5,...
    Downloads: 1 This Week
  • 18
    Langdesk

    Windows application to search multiple pdfs and chat with them

    Langdesk is a desktop application for Windows that lets the user assemble a knowledge base from multiple PDF documents, retrieve information from them, and chat with the retrieved content. Feel free to contact us with any request at info@tecnoesis.gr. Currently open to customizations, additions, and cooperation.
    Downloads: 2 This Week
  • 19
    BingGPT

    Desktop application of Bing's AI-powered chat (Windows, Mac, Linux)

    Desktop application of new Bing's AI-powered chat.
    1. Get access to the early preview of new Bing
    2. Sign in to your Microsoft account
    3. Start chatting
    Downloads: 0 This Week
  • 20
    pdf-extractor

    Node.js module for rendering pdf pages to images, svgs and HTML files

    pdf-extractor is a wrapper around pdf.js that generates images, SVGs, HTML files, text files, and JSON files from a PDF on Node.js. A DOM Canvas is used to render and export the graphical layer of the PDF. Canvas exports .png by default but can be extended to export other file types such as .jpg. PDF objects are converted to SVG using the SVGGraphics parser of pdf.js.
    Downloads: 0 This Week
  • 21
    KoboldAI

    Your gateway to GPT writing

    This is a browser-based front-end for AI-assisted writing with multiple local and remote AI models. It offers the standard array of tools, including Memory, Author's Note, World Info, Save & Load, adjustable AI settings, formatting options, and the ability to import existing AI Dungeon adventures. You can also turn on Adventure mode and play the game like AI Dungeon Unleashed. Stories can be played like a novel or a text adventure game, or used as a chatbot, with easy toggles to switch between the gameplay styles. This makes KoboldAI a writing assistant, a game, and a platform for so much more. ...
    Downloads: 152 This Week
  • 22
    Sklearn TensorFlow

    Sklearn and TensorFlow: A Practical Guide to Machine Learning

    ...It aims to make practical machine learning education more accessible to Chinese-speaking learners by translating the technical explanations, examples, and exercises from the original English material. The repository organizes the content as structured documentation that can be compiled into multiple formats such as HTML, PDF, EPUB, and MOBI, allowing users to read the material both online and offline. It focuses on teaching core machine learning concepts using Python while demonstrating practical workflows with popular libraries like Scikit-Learn and TensorFlow. The material covers topics ranging from basic machine learning theory to deep learning techniques and model evaluation, enabling learners to build and experiment with models step by step.
    Downloads: 0 This Week
  • 23
    neurojs

    A JavaScript deep learning and reinforcement learning library

    neurojs is a JavaScript framework designed to enable deep learning and reinforcement learning directly within web environments. The library provides a full machine learning framework implemented in JavaScript that can run inside browsers or Node.js environments. It focuses particularly on reinforcement learning algorithms, enabling developers to create intelligent agents that learn through interaction with simulated environments. The framework supports neural network architectures and...
    Downloads: 0 This Week
  • 24
    Universal Data Tool

    Collaborate & label any type of data, images, text, or documents etc.

    ...Use the Universal Data Tool directly from a web browser or with a Windows, Mac, or Linux desktop application. Join a collaborative session via a link and watch team members complete dataset samples in real time. Import from your S3 buckets easily with IAM or Cognito authentication. Working together, we can accomplish more. The Universal Data Tool was built to bring together the best ideas from different machine learning communities. Upload your dataset to Courses to create a training course. Testing and exercises validate that your workforce knows exactly how the data should be labeled. ...
    Downloads: 0 This Week
  • 25
    Parsr

    Transforms PDF, Documents and Images into Enriched Structured Data

    Parsr is an open-source document parsing tool that converts PDFs, scanned images, and other structured documents into structured, machine-readable data formats.
    Downloads: 0 This Week