Showing 17 open source projects for "pdf data mining"

View related business solutions
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 1
    Data Crow

    Data Crow

    The ultimate cataloguer

    Data Crow allows you to use the standard movie & video (divx, xvid, DVD, Blu-ray, etc), book (and eBooks), images, board games, comic books, games & software, music (mp3 and other music files) cataloguing modules. Besides these modules, which you can change to fit your requirements, you can create new modules (want to catalogue your stamps, equipment, or anything else?). The GUI is skinnable. Reporting (using JasperReports and their community edition JasperSoft Developer Studio ), loan...
    Leader badge
    Downloads: 264 This Week
    Last Update:
    See Project
  • 2
    ArchiveBox

    ArchiveBox

    Open source self-hosted web archiving

    ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually disappears or degrades. Archive.org does a great job as a centralized service, but saved URLs have to be public, and they can't save every type of content. ArchiveBox is an open source tool that lets organizations & individuals archive both public & private web content while retaining control over their data....
    Downloads: 10 This Week
    Last Update:
    See Project
  • 3
    File Converter

    File Converter

    Simple tool which allows you to convert and compress files

    File Converter is a minimalist open‑source tool (GPL‑3.0) that lets users convert and compress one or multiple files directly via the Windows Explorer context menu. It integrates with powerful back-end utilities—FFmpeg, ImageMagick, Ghostscript—to handle a broad range of media and document transformations. File Converter is a personal open source project started in 2014. I have put hundreds of hours adding, refining and tuning File Converter with the goal of making the conversion and...
    Downloads: 39 This Week
    Last Update:
    See Project
  • 4
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 5 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    AI File Sorter

    AI File Sorter

    Local AI file organization with categorization and rename suggestions

    AI File Sorter is a cross-platform desktop application that uses AI (local LLMs run on your computer) to organize files and suggest meaningful file names based on real content, not just filenames or extensions. The app can analyze images locally and propose descriptive rename suggestions (for example, IMG_2048.jpg → clouds_over_lake.jpg). It can also analyze document text to improve categorization and renaming. Supported formats include PDF, DOCX, XLSX, PPTX, ODT, ODS, ODP, and common...
    Leader badge
    Downloads: 227 This Week
    Last Update:
    See Project
  • 6
    Monitoring Storage,  SAN,  LAN

    Monitoring Storage, SAN, LAN

    Storage, SAN, LAN Performance Monitoring: IBM,NetApp,Hitachi,HPE,EMC

    The tool offers you end-to-end views of your storage environment including LAN and SAN and can save you significant money in operation monitoring and by predicting utilization bottlenecks in your virtualized environment. You can also generate policy-based alerts, view overall health status of your systems, reduce service downtime, use capacity and forecasting data - Real-time storage performance visibility in LAN and SAN multi-vendor environment - Historical reporting (graph, CSV, PDF) - Alerting based on performance thresholds - Storage event monitoring The tool supports enterprise class level storage devices from major storage vendors like: IBM, Dell EMC, NetAPP, HPE, Hitachi, Lenovo, Pure Storage, Huawei, Dot Hill, INFINIDAT, Fujitsu, DataCore, Quantum, QNAP, FalconStor,Ceph, Synology, RAIDIX, Qumulo,Inspur,Veritas Monitoring of the SAN supports Brocade, Qlogic and Cisco SAN switches Monitoring of the LAN network Try demo at http://demo.stor2rrd.com
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 7
    DynaQ

    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system to explore the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop)search engine with enhanced functionality for file, email and blog search. Look at our GitLab homepage for sourcecode and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    FixSit

    FixSit

    Fix Mac Type/Creator data for MacOS on Linux

    This little tool will scan your .HFS filesystem for files with "????UNIX" codes and tries to correct them. Currently supports .sit, .zip, and .pdf
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Xena - Digital Preservation Software

    Xena - Digital Preservation Software

    Xena transforms files into open data formats

    Xena transforms files into open data formats for long-term digital preservation, encodes content in Base64 and wraps in XML metadata. Formats supported include MBOX, PST, MSG, DOC, XLS, PPT, RTF, PNG, XML, PDF, JPG, TIFF, PCX, WAV, MP3 and more. NO LONGER MAINTAINED, NO LONGER SUPPORTED
    Downloads: 2 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 10

    msgpdf

    Convert Outlook MSG Files into Adobe PDF Format with Attachments

    MSG file primarily associated with Outlook having .msg file is known MSG format. Such file contains of plain ASCII text for headers, body messages, hyperlinks, footer and attachments. In present time, PDF files has become standard format for moving documents from one user to another. Occasionally, it is tough for users to access of MSG file because it is an email application reliant file. So, for viewing MSG files there is a general demand for converting data of MSG to PDF format. This approach will convert the attachments of MSG Files and other formatting as well.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Crème Fraiche

    Crème Fraiche

    eml2pdf converter

    I DO NO LONGER CLAIM PLATFORM-INDEPEDENCE FOR Crème Fraiche. THIS PROGRAM RUNS ON LINUX. Crème Fraiche transforms EML-files, as they are created by email-clients, to PDF. PSE see the rubygems.org site for updates or use the gem-tool right away to install Crème Fraiche: ~$ gem install cremefraiche
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Remove Log Files

    Remove Log Files

    Log file remover oriented for Windows Servers (x86 & x64)

    AutoDeletions is a free, smart and very performant Windows tool, oriented to the IT Specialists and/or Server Administrators, who have to face the problem of cleaning their file systems from old or obsolete files (typically log or old data files), that are systematically produced by the applications running on their company servers. Configuring this tool is very easy and flexible thanks to a single central INI file. AutoDeletions is ideal for batch processing your deletions. You could...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Tool to recover data from corrupted ZIP archives and DEFLATE-compressed streams from other files and disk images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    PDFcat will be a platform-independent JAVA application, that helps to manage thousands of books, articles, lecture notes, music sheets in PDF format. I also want to support txt, djvu, and zipped pacgages. I will use sqlite for the sake of portability
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Zonomania
    Zonomania is a webbased application to document a SAN (Storage Area Network) environment. It is powerful in the hand of a SAN Administrator to manage, engineer and plan a SAN.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Knowledge management tool which is primarily designed for pdf files. The tool will let the documents get tagged either automatically or manually. Within the pdf archieve, a user will be able to retrieve desired documents quickly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Visual xsltproc is a tool which help to write xslt file, and debug it to find errors. It writes xml, and generates xml (Syntax highlighting of XML & line Nr.). Finally if the result is XSL-FO it generates the pdf on Apache FOP java. Build on QT4.2.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next