Showing 47 open source projects for "pdf data mining"

  • 1
    AWESOME DATA SCIENCE

    Awesome Data Science repository to learn and apply for real world

    An open source Data Science repository to learn and apply towards solving real world problems. This is a shortcut path to start studying Data Science. Just follow the steps to answer the questions, "What is Data Science and what should I study to learn Data Science?" Data Science is one of the hottest topics on the Computer and Internet farmland nowadays. People have gathered data from applications and systems until today and now is the time to analyze them. The next steps are producing...
    Downloads: 6 This Week
  • 2
    KOReader

    An ebook reader application supporting PDF, DjVu, EPUB, FB2, etc.

    KOReader is a document viewer for E Ink devices. Supported file formats include EPUB, PDF, DjVu, XPS, CBT, CBZ, FB2, PDB, TXT, HTML, RTF, CHM, DOC, MOBI and ZIP files. It’s available for Kindle, Kobo, PocketBook, Android and desktop Linux. Runs on embedded devices (Cervantes, Kindle, Kobo, PocketBook, reMarkable), Android and Linux computers. Developers can run a KOReader emulator in Linux and macOS. Multi-lingual user interface with a highly customizable reader view and many typesetting...
    Downloads: 95 This Week
  • 3
    Ray Tracing in One Weekend Book Series

    The Ray Tracing in One Weekend series of books

    The Ray Tracing in One Weekend series of books are now available to the public for free online. They are now released under the CC0 license. This means that they are as close to public domain as we can get. (While that also frees you from the requirement of providing attribution, it would help the overall project if you could point back to this web site as a service to other users.) These books are formatted for printing directly from your browser, where you can also (on most browsers) save...
    Downloads: 7 This Week
  • 4
    Jmol

    An interactive viewer for three-dimensional chemical structures.

    Over 1,000,000 page views per month. Jmol/JSmol is a molecular viewer for 3D chemical structures that runs in four independent modes: an HTML5-only web application utilizing jQuery, a Java applet, a stand-alone Java program (Jmol.jar), and a "headless" server-side component (JmolData.jar). Jmol can read many file types, including PDB, CIF, SDF, MOL, PyMOL PSE files, and Spartan files, as well as output from Gaussian, GAMESS, MOPAC, VASP, CRYSTAL, CASTEP, QuantumEspresso, VMD, and many other...
    Downloads: 587 This Week
  • 5
    qvge

    Qt Visual Graph Editor

    qvge is a multiplatform graph editor written in C++/Qt. Its main goal is to make it possible to visually edit two-dimensional graphs in a simple and intuitive way. Please note that qvge is not a replacement for software such as Gephi, Graphviz, Dot, yEd, Dia and so on. It is neither a tool for "big data analysis" nor a math application. It is really just a simple graph editor :)
    Downloads: 8 This Week
  • 6
    Kiwix

    Wikipedia offline & more

    Kiwix is an offline reader for Web content. It's especially intended to make Wikipedia available offline. With Kiwix, you can enjoy Wikipedia on a boat, in the middle of nowhere... or in jail. Kiwix manages to do that by reading ZIM files, a highly compressed open format with additional metadata.
    Downloads: 245 This Week
  • 7
    Science Olympiad Scoring System

    Excel based scoring system for Science Olympiad tournaments

    ...Click on the green button above to download the current Div C version, or on the files link to get the Div B version (be sure to save to disk first, then open the files). There is also a version with test data pre-populated and numerous settings / errors to demonstrate the functionality of the program. NOTE: Excel 2008 for Mac does NOT support macros at all, thus many parts of this system won't work. Virtually any other version of Office will work. Be sure to sign up for the mailing list to be informed of updates. Note that a 'SO Scoring Best Practices' PDF is available with tips and tricks used at the National Tournament.
    Downloads: 11 This Week
  • 8
    WIKINDX

    Virtual Research Environment / On-line Bibliography Manager

    Reference management, bibliography management, citations and a whole lot more. Designed by academics for academics, under continuous development since 2003, and used by both individuals and major research institutions worldwide, WIKINDX is a Virtual Research Environment (an enhanced on-line bibliography manager) storing searchable references, notes, files, citations, ideas, and more. An integrated WYSIWYG word processor exports formatted articles to RTF, DOCX, and HTML. Plugins include...
    Downloads: 70 This Week
  • 9
    Quiz/Survey/Test - QST

    A Free, complete, open source universal assessment/exam platform

    QST, the world's unparalleled open source, multi-tenant, online/LAN assessment software. From a quick quiz on your phone to very large scale, high stakes, proctored desktop testing, we make it easy/secure/economical. Our intuitive design contains features (Immediate detailed results, Create/Export/Import/Convert Questions, WYSIWYG/Math-Chemistry/Basic Editors, Question/Item Bank, Multiple Question Types, Multiple Delivery Styles, Multiple Delivery/Results Options, Adaptive/Branching...
    Downloads: 41 This Week
  • 10
    Mastering Bitcoin

    Mastering Bitcoin 3rd Edition - Programming the Open Blockchain

    The bitcoinbook repository contains the source code for Mastering Bitcoin, the authoritative open-source book by Andreas M. Antonopoulos on Bitcoin and cryptocurrency technologies. Written in a collaborative and continuously updated format using Markdown and AsciiDoc, the book serves as a comprehensive technical guide for developers, engineers, and system architects who want to understand how Bitcoin works. It covers the protocol, cryptography, peer-to-peer architecture, wallets, mining, and...
    Downloads: 9 This Week
  • 11
    General Knowledge Machine Project

    Intellect Modeling Kit: assisting research, diagnostics, consulting

    ...Intellect Modeling Kit (IMK) is intended to build knowledge machines (KM) that assist experts in these steps of activity: observation; producing propositions based on knowledge; elimination of impossible propositions; selection and verification of the most appropriate propositions; memorizing (creating new knowledge items); and abstraction (building objects that represent the typical signs of groups of similar objects, data mining). A KM is not intended to replace human experts; it is built to multiply their abilities. The machine should not be responsible for decisions. The IMK is designed to create ready-to-use software applications from simple text files. Any human knowledge can be uploaded to a KM by an expert who is not familiar with software coding. Demos are included in the kit. ...
    Downloads: 0 This Week
  • 12
    Elementary Algorithms

    Book of elementary algorithms and data structures

    This book introduces elementary algorithms and data structures. It includes side-by-side comparisons of purely functional realizations and their imperative counterparts. From 2020/12, I started re-writing this book. The PDF can be downloaded for preview (EN, 中文). The 1st edition in Chinese (中文) was published in 2017. I recently switched my focus to the mathematics of programming; the new book is also available on GitHub.
    Downloads: 1 This Week
  • 13
    TemaTres: controlled vocabulary server

    Manage, Publish and Share Ontologies, Taxonomies, Thesauri, Glossaries

    Web application for managing formal representations of knowledge, thesauri, taxonomies and multilingual vocabularies. For the latest version of the code: https://github.com/tematres/TemaTres-Vocabulary-Server
    Downloads: 8 This Week
  • 14
    R4DS (R for Data Science)

    R for data science: a book

    “R for Data Science” (r4ds) is the source material (book + examples) by Hadley Wickham et al., intended to teach data science using R and the tidyverse. It covers the workflow from importing data, tidying, transforming, visualizing, modelling, communicating results, and programming in R. The repository contains the source files (Quarto / RMarkdown), example datasets, visualizations, exercises, and all content needed to build the book. Includes many example datasets, diagrams, code samples,...
    Downloads: 1 This Week
  • 15
    hubs-research-acm-chi-2021

    Supplemental code and dataset for the ACM CHI 2021 paper

    Supplemental code and dataset for the ACM CHI 2021 paper on "Proxemics and Social Interactions in an Instrumented Virtual Reality Workshop". In this research paper we instrumented Mozilla Hubs Cloud to record where participants were during the event. From there, we measured proxemics and plotted the activity along with some semi-structured interviews. Virtual environments (VEs) can create collaborative and social spaces, which are increasingly important in the face of remote work and travel...
    Downloads: 0 This Week
  • 16
    multipractice

    Tool for practicing languages.

    Practice makes perfect. Panglossa MultiPractice is a tool to help you learn and practice languages. You can create your own courses, import courses created by others, and even export courses to HTML or PDF documents. The original project (for Lazarus) was basically a flashcard app. Now it is more like a platform for creating structured courses with text, images, audio and video content, as well as different types of exercises. Please keep in mind that this project is created and...
    Downloads: 0 This Week
  • 17

    jpeg2pdf

    Create PDF from JPEG scans and photos

    Cross-platform command-line tool for creation of PDF documents from scans/photos of pages in JPEG (.jpg) format and the lightest weight ANSI C library to put multiple JPEG files into one PDF file. You can add handwritten comments to PDF scans (over original images) with xournal: http://xournal.sourceforge.net/ It supports graphics tablets and saves comments to PDFs as vector data.
    Downloads: 25 This Week
  • 18
    Category Theory for Programmers

    'Category Theory for Programmers' unofficial PDF and LaTeX source

    This is an unofficial PDF version of "Category Theory for Programmers" by Bartosz Milewski, converted from his blogpost series (with permission). Category Theory for Programmers was published 12 August, 2019. Based off release tag v1.3.0. See errata-1.3.0 for changes and fixes since print. Scala Edition is now available in paperback. Based off release tag v1.3.0. See errata-scala for changes and fixes since print. The best way to build the book is using the Nix package manager....
    Downloads: 0 This Week
  • 19

    TreeForm Syntax Tree Drawing Software

    Syntax Tree Drawing Software (Linguistics)

    TreeForm Syntax tree drawing software is a Linguistic Syntax/Semantics tree drawing editor. Designed for graphical n-ary tree drawing. Mac users can install the software through the new package, but must give authority through "System Preferences" > "Security & Privacy". Windows and Linux users can run the software through the JAR file directly. All users must have Java 8 or higher installed. https://java.com/en/download/
    Downloads: 60 This Week
  • 20
    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, MIDI playback of notes, chord markings, lyrics, and import/export filters for formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond.
    Downloads: 27 This Week
  • 21
    Xena - Digital Preservation Software

    Xena transforms files into open data formats

    Xena transforms files into open data formats for long-term digital preservation, encodes content in Base64 and wraps in XML metadata. Formats supported include MBOX, PST, MSG, DOC, XLS, PPT, RTF, PNG, XML, PDF, JPG, TIFF, PCX, WAV, MP3 and more. NO LONGER MAINTAINED, NO LONGER SUPPORTED
    Downloads: 3 This Week
  • 22
    Contestant

    Helps to manage contestant/participant applications

    The contestant project is an idea and an example of how to deal with a large number of participant applications, especially when those applications are sent to the organizer in paper form. In short, the participant fills out a form on a web site, which generates a PDF file with a QR code. It can be printed, signed and stamped by the participant's organization. The organizer gets the application form by email or by post, scans the QR code from the form and pastes the data into a table without needing to retype it. All with open-source tools.
    Downloads: 0 This Week
  • 23

    JSentiWordNet

    A wrapper for the famous SentiWordNet, a resource for opinion mining

    This project aims to provide a wrapper around SentiWordNet, a lexical resource for opinion mining. As defined by the authors: SentiWordNet assigns to each synset of WordNet three sentiment scores: positivity, negativity, objectivity. You can find additional information about the creation of SentiWordNet here: http://nmis.isti.cnr.it/sebastiani/Publications/LREC06.pdf SentiWordNet (available here: https://drive.google.com/open?
    Downloads: 0 This Week
  • 24
    MarkList

    Collection of Mark lists from Mobile App and Print Progress Cards

    Teachers can submit mark lists from the mobile app. Mark lists are bulk printed. PDFs can be printed individually. Mark lists are compiled into one data sheet. Progress cards can be viewed or printed directly. You can write your own code to print progress cards. Data is collected in text format so it can be accessed and interpreted easily. Mobile app Google Play Store link: https://play.google.com/store/apps/details?
    Downloads: 0 This Week
  • 25
    ZeGrapher

    Fast and easy to use plotting software.

    ZeGrapher is free and open source software for plotting mathematical objects (functions, sequences, parametric equations) and experimental data on the plane. It is intended to be as easy to use as possible and is suitable for high school students.
    Downloads: 3 This Week