Showing 13 open source projects for "pdf parser"

View related business solutions
  • Top-Rated Free CRM Software Icon
    Top-Rated Free CRM Software

    216,000+ customers in over 135 countries grow their businesses with HubSpot

    HubSpot is an AI-powered customer platform with all the software, integrations, and resources you need to connect your marketing, sales, and customer service. HubSpot's connected platform enables you to grow your business faster by focusing on what matters most: your customers.
  • Translate docs, audio, and videos in real time with Google AI Icon
    Translate docs, audio, and videos in real time with Google AI

    Make your content and apps multilingual with fast, dynamic machine translation available in thousands of language pairs.

    Google Cloud’s AI-powered APIs help you translate documents, websites, apps, audio files, videos, and more at scale with best-in-class quality and enterprise-grade control and security.
  • 1
    pdf-editor

    pdf-editor

    Edit your PDFs without needing a subscription or creating accounts

    Edit your PDFs without needing a subscription or creating accounts. Add a GUI/Turn it into a web application. Add a parser for the command line to do multiple commands at once e.g. merge (cut pdf1) pdf2. Tested working with Python 3.8.5. Install venv (py -3.8 -m pip install virtualenv). PDF and Word documents are binary files, which makes them much more complex than plaintext files. In addition to text, they store lots of font, color, and layout information. If you want your programs to read...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    pdf-extractor

    pdf-extractor

    Node.js module for rendering pdf pages to images, svgs and HTML files

    Pdf-extractor is a wrapper around pdf.js to generate images, svgs, html files, text files and json files from a pdf on node.js. A DOM Canvas is used to render and export the graphical layer of the pdf. Canvas exports *.png as a default but can be extended to export to other file types like .jpg. Pdf objects are converted to svg using the SVGGraphics parser of pdf.js. Pdf text is converted to HTML. This can be used as a (transparent) layer over the image to enable text selection. Pdf text...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Jupyter Notebook Tools for Sphinx

    Jupyter Notebook Tools for Sphinx

    Sphinx source parser for Jupyter notebooks

    nbsphinx is a Sphinx extension that provides a source parser for *.ipynb files. Custom Sphinx directives are used to show Jupyter Notebook code cells (and of course their results) in both HTML and LaTeX output. Un-evaluated notebooks – i.e. notebooks without stored output cells – will be automatically executed during the Sphinx build process.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Publish.jl

    Publish.jl

    A universal document authoring package for Julia

    A universal document authoring package for Julia. This is a package for Julia that provides a general framework for writing prose, technical documentation is its focus, though it is general enough to be applied to any kind of written document.
    Downloads: 0 This Week
    Last Update:
    See Project
  • ConnectWise Cybersecurity Management for MSPs Icon
    ConnectWise Cybersecurity Management for MSPs

    Software and support solutions to protect your clients’ critical business assets

    ConnectWise SIEM (formerly Perch) offers threat detection and response backed by an in-house Security Operations Center (SOC). Defend against business email compromise, account takeovers, and see beyond your network traffic. Our team of threat analysts does all the tedium for you, eliminating the noise and sending only identified and verified treats to action on. Built with multi-tenancy, ConnectWise SIEM helps you keep clients safe with the best threat intel on the market.
  • 5
    Swagger2Markup

    Swagger2Markup

    Swagger to AsciiDoc or Markdown converter

    ... file into several AsciiDoc or GitHub Flavored Markdown documents which can be combined with hand-written documentation. The Swagger source file can be located locally or remotely via HTTP. Swagger2Markup supports the Swagger 1.2 and 2.0 specifications. Internally it uses the official swagger-parser and my markup-document-builder. You can use Swagger2Markup to convert your contract-first Swagger YAML file into a human-readable format.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Delphi : VRCalc++ OOSL (Script) and more

    Delphi : VRCalc++ OOSL (Script) and more

    Delphi : VRCalc++ OOSL & + (Paged List, TextEditor, VRAstroVision ...)

    Vincent Radio {Adrix.NT} Sources Library & Applications : Delphi C++ Java VRCalc++ C# VRCalc++ Object Oriented Scripting Language - Engine Source Pascal Code - Delphi Packages Build Prjs - VRCalc++ Scripted System Std RT Library - Guides & Docs (CHM, PDF, DOCX) - VCL & FMX (FireMonkey) Support - Script Test Code (Lang RTL VCL FMX) - Visual Stage Project : VCL & FMX Paged Lists & Iterators : Delphi C++ Java C# Multi-Dim Arrays & Direct Graph Classes : Delphi C++ Java VRCalc++ C# Delphi Drag...
    Leader badge
    Downloads: 13 This Week
    Last Update:
    See Project
  • 7

    pdfsummary

    Summarize PDF file contents by page.

    Uses a modified form of Didier Stevens PDF parser to get object descriptions by page and then summarizes them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    CaLi2CoPi is a multiplatform PDF parser library programmed in PostScript. Works with several specialized switch in order to verify, add, extract or change any PDF content. Also supports online execution on web based user interface via Ghostscript.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Andoffline

    A toolkit for some Android sms/call Apps, base64 encoder, vcf parser a

    MOVED TO: https://github.com/fulvio999/Andoffline Feature: Browser for exported SMS, CALL and CONTACT from Android Phone Save to PDF file for exported SMS, CALL and CONTACT, VCF parser Support tool for: http://android.riteshsahu.com/apps/sms-backup-restore http://android.riteshsahu.com/apps/call-logs-backup-restore Image base64 encoder/decoder ** Allow to execute job/script execution from SMS sent from remote phone (without internet connection): - connect the phone to PC with usb...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Engage for Amazon Connect, the Pre-built Contact Center Platform Icon
    Engage for Amazon Connect, the Pre-built Contact Center Platform

    Utilizing the power of AWS and Generative AI, Engage provides your customers with highly personalized, exceptional experiences.

    Engage is a pre-built, intelligent contact center platform that transforms customer service.
  • 10

    cextools

    Command line helpers for Conexp files.

    Some small command line programs and a file parser for Concept Explorer (conexp) written in C++. Currently features include: Converters from concept explorer into PDF, PostScript, SVG and PovRay, a modified 3D Freese layout.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    ScientificPdfParser

    Parses scientific articles from PDF and marks the meta data.

    .... The project contains three runnable classes that can work on given PDFs in batch mode via threading: a) BatchHeuristic: A parser that uses defined heuristics and rules. Especially applicable for articles with a broad set of layouts (e.g. PeDocs, http://www.pedocs.de/). b) BatchHybrid: A parser that uses machine learning (Naive Bayes) to find the correct element. Useful for e.g. ACL. c) ModelGenerator: Generates a training model, used by BatchHybrid, from given PDF and XML file
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    ** Guys I have built a much more powerful Fully Featured CMS system at: https://github.com/MacdonaldRobinson/FlexDotnetCMS Macs CMS is a Flat File ( XML and SQLite ) based AJAX Content Management System. It focuses mainly on the Edit In Place editing concept. It comes with a built in blog with moderation support, user manager section, roles manager section, SEO / SEF URL
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    QuickDoc is a java document parser that reads documents from plain text files using a simple language and exports the document to other formats like PDF, HTML, Java Help and XML.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next