Showing 82 open source projects for "pdf"

View related business solutions
  • Automate contact and company data extraction Icon
    Automate contact and company data extraction

    Build lead generation pipelines that pull emails, phone numbers, and company details from directories, maps, social platforms. Full API access.

    Generate leads at scale without building or maintaining scrapers. Use 10,000+ ready-made tools that handle authentication, pagination, and anti-bot protection. Pull data from business directories, social profiles, and public sources, then export to your CRM or database via API. Schedule recurring extractions, enrich existing datasets, and integrate with your workflows.
    Explore Apify Store
  • Cloud-based help desk software with ServoDesk Icon
    Cloud-based help desk software with ServoDesk

    Full access to Enterprise features. No credit card required.

    What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.
    Try ServoDesk for free
  • 1
    TeXworks

    TeXworks

    A simple interface for working with TeX documents

    TeXworks is a free and simple working environment for authoring TeX (LaTeX, ConTeXt and XeTeX) documents. Inspired by Dick Koch's award-winning TeXShop program for Mac OS X, it makes entry into the TeX world easier for those using desktop operating systems other than OS X. It provides an integrated, easy-to-use environment for users on other platforms particularly GNU/Linux and Windows and features a clean, simple interface accessible to casual and non-technical users.
    Downloads: 118 This Week
    Last Update:
    See Project
  • 2
    Scribus

    Scribus

    Powerful desktop publishing software

    ...Underneath a modern and user-friendly interface, Scribus supports professional publishing features, such as color separations, CMYK and spot colors, ICC color management, and versatile PDF creation.
    Leader badge
    Downloads: 21,969 This Week
    Last Update:
    See Project
  • 3

    FOray

    Modular XSL-FO Implementation for Java.

    FOray is an open-source XSL-FO publishing system that is suitable for converting XML content into PDF and other document formats. Although not yet fully conformant with the XSL-FO standard, it is very useful for many applications.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 4
    OmegaT - multiplatform CAT tool

    OmegaT - multiplatform CAT tool

    The free computer aided translation (CAT) tool for professionals

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
    Leader badge
    Downloads: 1,824 This Week
    Last Update:
    See Project
  • D&B Hoovers is Your Sales Accelerator Icon
    D&B Hoovers is Your Sales Accelerator

    For sales teams that want to accelerate B2B sales with better data

    Speed up sales prospecting with the rich audience targeting capabilities of D&B Hoovers so you can spend more sales time closing.
    Learn More
  • 5
    Ltxshell
    Shell for calling LaTeX and accompanying tools (dvi-viewer, dvips, ps-file-viewer, pdf-file-viewer) on Linux and Windows-systems
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    ant4docbook

    ant4docbook

    ANT4DOCBOOK is an ANT task for DOCBOOK

    ANT4DOCBOOK is an ANT task for DOCBOOK, a semantic markup language for technical documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 8
    RTextDoc

    RTextDoc

    An editor for structured documents

    ...RTextDoc has syntax highlighting, bracket matching, folding, document structure browser for sections and labels, bookmarks, manager for LaTeX symbols, an editor for mathematical equations,integrated BibTeX database manager and several tools to convert LaTeX to HTML and back. AsciiDoc files can be converted to DocBook, HTML and PDF files.
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    The goal of this tool is to simplify and accelerate the process of creating bookmarks for DjVu and PDF documents. You can see additional information on the project page.
    Leader badge
    Downloads: 59 This Week
    Last Update:
    See Project
  • Network Management Software and Tools for Businesses and Organizations | Auvik Networks Icon
    Network Management Software and Tools for Businesses and Organizations | Auvik Networks

    Mapping, inventory, config backup, and more.

    Reduce IT headaches and save time with a proven solution for automated network discovery, documentation, and performance monitoring. Choose Auvik because you'll see value in minutes, and stay with us to improve your IT for years to come.
    Learn More
  • 10

    SimpleTextFormatter

    STF automatically generates documentation

    STF is a system of automatically generating documentation under control of a program or a script. It is frequently used to automatically generate test reports. STF is also used to clean up the output of a process and turn it into a nice looking report.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    LaTeX Reference Card Creator

    LaTeX Reference Card Creator

    A Makefile based build system for creating LaTeX reference cards

    LaTeX Reference Card Creator is a Makefile based build system for creating reference cards. LaTeX Reference Card Creator compiles content into PDF, DjVu, TEX DVI, HTML and PostScript output formats. A three column reference card will be created. Features include batch image format conversions, spell checking, broken link checking, automatic backups and .zip and .tar.gz distribution building. LaTeX Reference Card Creator provides many LaTeX examples which can be used to make a reference card.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Web Book Downloader

    Web Book Downloader

    Download websites as e-book: pdf, txt, epub.

    This application allows user to download chapters from website in 3 ways: - from table of contents; - from range: first chapter address, last chapter address; - by crawling from first chapter to n; In settings you can customize language, input(website encoding) for simplicity output is in the same encoding. If you want your language add new class into strings package, and new fields into Settings class and GUI menu(initialize method).
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Leader badge
    Downloads: 40 This Week
    Last Update:
    See Project
  • 14
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. ...
    Leader badge
    Downloads: 171 This Week
    Last Update:
    See Project
  • 15
    mbFXWords

    mbFXWords

    Analyze text. Diagonal read subject, predicate, obj. Search other pdf.

    ... - Divide plain text: subject, predicate, object. - Count words: stemming. - Search for similar content: pdf's. Gives out subject, predicate and object of sentences of pdf and plain text files. Provides comfortable GUI. Automatic language detection.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A Swiss Army Knife GUI application for PDF documents: combine, split, rotate, reorder (n-up, booklet), watermark, edit bookmarks/fileinfo/pagetransition, compress, encrypt, decrypt, sign, repair, edit attachments and more.
    Leader badge
    Downloads: 123 This Week
    Last Update:
    See Project
  • 17
    Texinfo Web Publisher

    Texinfo Web Publisher

    Multi-format web publishing system based on Texinfo

    Texinfo Web Publisher is a Makefile based publishing system featuring simultaneous con- tent creation into HTML, non-split HTML, Framed HTML, HTML Zip, XML, DocBook, PDF, DjVu, PostScript, DVI, Plain text, Info and EPUB book formats. All Texinfo Web Publisher output formats are from a single source. Texinfo Web Publisher can be used for website creation has FTP deployment capabilities and supports Cascading Style Sheets (CSS). Texinfo Web Publisher is a low maintenance solution for publishing websites with multi- format matching content. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    PDF Clown

    PDF Clown

    General-Purpose PDF Library for Java and .NET

    PDF Clown is a general-purpose Java and .NET library for manipulating PDF files through multiple abstraction layers, rigorously adhering to PDF 1.7 specification (ISO 32000-1). This project aims to provide a universal access to PDF files (creation, reading, editing, rendering...) through an accurate and elegant object-oriented API.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 19
    This project has moved to https://github.com/workinghard/GuitarTeX2 GuitarTeX2 is based on the idea of Chord. It takes a Chord file containing Chordpro directives to produce good-looking and easy-to-play song sheets for guitarists in PostScript or PDF format. GuitarTeX2 is a further development of GuitarTeX.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    LaTeX Web Publisher

    LaTeX Web Publisher

    LaTeX Web Publisher is a Makefile based Web publishing system

    LaTeX Web Publisher is a Makefile based Web publishing system featuring content creation into HTML, non-split HTML, HTML Zip, PDF, DjVu, PostScript, DVI and Plain text formats. All LaTeX Web Publisher output formats are from a single LaTeX source and have indices. LaTeX Web Publisher can be used for website creation and has FTP deployment capabilities. A website created with LaTeX Web Publisher will have HTML, non-split HTML and PDF content formats. The website will have complete HTML contents available as downloadible zip file and the websites source available as gzipped tar file. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    seppdflatex

    seppdflatex

    Build a large LaTeX book with separate linked chapters

    seppdflatex is a Perl script which automates a lot of the tasks needed to compile PDF documents from LaTeX source for a multi-volume book, or a book with many huge chapters which you may not want as a single document, but which should all be unified by cross-references and external hyperlinks, so a PDF reader will open a link to an external chapter PDF file. The Table of Contents and LOF and LOT are made for all chapters, and the TOC, LOF, LOT are all hyperlinked to the correct external PDF chapter files using zr-ref and xr packages from CTAN. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    Detexter

    Detexter is an app designed to extract text from PDF files.

    Detexter lets you extract text from multiple PDF files. Detexter uses the PDFBox library for its text extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    ...It reads a high-level description of a document similar in style to LaTeX and produces a PostScript file which can be printed on most laser printers. Plain text and PDF output are also available.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    jPod is a rich PDF manipulation and rendering framework. A complete rendering library based on jPod is available here at "jPodRenderer". To see jPod & jPodRenderer at work, have a look at www.cabaret-solutions.com
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    Thunderpad (formerly Textpad)

    Thunderpad (formerly Textpad)

    Simple and lightweight text editor

    Thunderpad is a simple, general-purpose and cross-platform text editor written in C++ using the Qt libraries. Thunderpad aims to be faster and more lightweight than most text editors, but as useful as them. Additionally, Thunderpad supports syntax highlighting, word count, line count and comes with various color schemes.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next