Showing 135 open source projects for "pdf data mining"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 1
    Orange Data Mining

    Orange Data Mining

    Orange: Interactive data analysis

    ...When teaching data mining, we like to illustrate rather than only explain.
    Downloads: 52 This Week
    Last Update:
    See Project
  • 2
    Dawarich

    Dawarich

    Self-hostable alternative to Google Timeline

    Dawarich is a command-line tool (likely Ruby-based) for transforming and analyzing Arabic text data with normalization, diacritic handling, segmentation, and morphological tokenization. Designed for text mining and NLP workflows in Arabic-language contexts.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    Skim

    Skim

    A PDF Reader and Note-taker for OS X

    Skim is a PDF reader and note-taker for OS X. It is designed to help you read and annotate scientific papers in PDF, but is also great for viewing any PDF file. Skim requires OS X 10.10 or higher.
    Leader badge
    Downloads: 12,863 This Week
    Last Update:
    See Project
  • 4
    PaperQA2

    PaperQA2

    High accuracy RAG for answering questions from scientific documents

    PaperQA2 is a package for doing high-accuracy retrieval augmented generation (RAG) on PDFs or text files, with a focus on the scientific literature. See our recent 2024 paper to see examples of PaperQA2's superhuman performance in scientific tasks like question answering, summarization, and contradiction detection. In this example we take a folder of research paper PDFs, magically get their metadata - including citation counts and a retraction check, then parse and cache PDFs into a...
    Downloads: 8 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 5
    dxf2gcode

    dxf2gcode

    DXF2GCODE: converting 2D dxf drawings to CNC machine compatible G-Code

    DXF2GCODE is a tool for converting 2D (dxf, pdf, ps) drawings to CNC machine compatible GCode. Windows, Linux, and Mac support by using python scripting language.
    Leader badge
    Downloads: 318 This Week
    Last Update:
    See Project
  • 6
    GeoDMA

    GeoDMA

    Geographic feature extraction and data mining

    GeoDMA is a plugin for TerraView software, used for geographical data mining. With a single image, the user can perform segmentation, attributes extraction, normalization and classification.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7

    TabuVis

    An Interactive Visualisation for Tabular Data

    TabuVis is a comprehensive visual analysis tool that provides a flexible, customizable and interactive visualization for tabular (or multidimensional) data. It utilizes scatter-plot visualization approach to provide comprehensive and interactive views for different attribute mappings. It provides single scatter-plot and multiple scatter-plots together with Map (GIS) capability. The project is available at: http://staff.scem.uws.edu.au/~vinh/projects/TabuVis/ Related publications: https://www.sciencedirect.com/science/article/pii/S2468502X20300358 https://dl.acm.org/doi/pdf/10.1145/2968220.2968248 https://link.springer.com/content/pdf/10.1007/s11432-013-4870-1.pdf https://dl.acm.org/doi/pdf/10.1145/2397696.2397705
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Jmol

    Jmol

    An interactive viewer for three-dimensional chemical structures.

    Over 1,000,000 page views per month. Jmol/JSmol is a molecular viewer for 3D chemical structures that runs in four independent modes: an HTML5-only web application utilizing jQuery, a Java applet, a stand-alone Java program (Jmol.jar), and a "headless" server-side component (JmolData.jar). Jmol can read many file types, including PDB, CIF, SDF, MOL, PyMOL PSE files, and Spartan files, as well as output from Gaussian, GAMESS, MOPAC, VASP, CRYSTAL, CASTEP, QuantumEspresso, VMD, and many other...
    Leader badge
    Downloads: 587 This Week
    Last Update:
    See Project
  • 9
    A Matlab toolkit for all aspects of EEG/ERP analysis, especially PCA. If you run into a problem, please send me a note and I'll fix it. The tutorial pdf is in the documentation folder and the tutorial data can be downloaded from a link provided by the pdf. The tutorial file has full install instructions. For all platforms supported by Matlab.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 10
    JaxoDraw

    JaxoDraw

    JaxoDraw: Feynman Diagrams made easy!

    JaxoDraw is a tool to generate Feynman diagrams in a mouse click-and-drag fashion. Graphs can be exported to a variety of graphics formats and arbitrary latex code can be used for labels to produce high-quality publishing-style figures.
    Leader badge
    Downloads: 36 This Week
    Last Update:
    See Project
  • 11
    AuroreNR
    Software developed for the analysis of Neutron Reflectivity data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    QXRD is software for the acquisition and analysis of X-ray data taken with 2 dimensional detectors. The software can drive a Perkin Elmer XRD series flat panel detector and can be remote-controlled via a socket interface, or directly from SPEC
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    WP 34s

    WP 34s

    Scientific/engineering firmware repurposing HP business calculators!

    This project has created scientific firmware for the HP-20b and HP-30b business calculators. WP 34S turns either of these calculators into a powerful keystroke programmable scientific device. According to our customers, it's the most powerful and fastest RPN scientific pocket calculator ever built. WP 34S is alive and stable since 2011. We have succeeded in satisfying the most picky users - read about their experiences on http://www.hpmuseum.org/forum/forum-8.html. Since 2014, WP...
    Leader badge
    Downloads: 19 This Week
    Last Update:
    See Project
  • 14
    ...However, it also offers extra functionality via applications and utilities developed by others. Electronic Design Automation (EDA) tool suites are used to provide schematic capture and editing, and schematic to netlist conversion. Waveform data viewers are used to display simulation results and PDF viewers to display user manuals.
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • 15
    To give users the full control over the running application. This means that an application is working according to its purpose but the control over the whole interface is taken from developer and given to users. While an application is running, users can move, resize, and tune all the screen objects through which the communication with an application is going. Set of files includes the book (both in DOC and PDF formats), a big demonstration project with all its files available (all the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    stkpp

    stkpp

    C++ Statistical ToolKit

    ...At a convenience, we propose the source packages on sourceforge. The library offers a dense set of (mostly) template classes in C++ and is suitable for projects ranging from small one-off projects to complete data mining application suites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    UnBBayes

    UnBBayes

    Framework & GUI for Bayes Nets and other probabilistic models.

    UnBBayes is a probabilistic network framework written in Java. It has both a GUI and an API with inference, sampling, learning and evaluation. It supports Bayesian networks, influence diagrams, MSBN, OOBN, HBN, MEBN/PR-OWL, PRM, structure, parameter and incremental learning. Please, visit our wiki (https://sourceforge.net/p/unbbayes/wiki/Home/) for more information. Check out the license section (https://sourceforge.net/p/unbbayes/wiki/License/) for our licensing policy.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 18
    Periodic Table Of Elements 4 Mathematica

    Periodic Table Of Elements 4 Mathematica

    generates nice period table from given data

    PTE-1.5 is for Mathematica 11.1 (not mm14 - needs ver change fixing) PTE-1.4 and prev for Mathematica 4.0 linux PTE generates periodic tables from variable data (the data and layout can be changed for a particular subject). Includes Pauli Exclusion makers and some data. Charts can be browsed with an iPhone nicely (may need .jpg, .gif nicer). It's printable at a small size. Layout change un-restricted but is non-interactive. which is good and bad: it does the job. CreateModule...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    miRDeep*

    miRDeep*

    MiRDeep*

    Please cite: An, J., Lai, J., Lehman, M.L. and Nelson, C.C. (2013) miRDeep*: an integrated application tool for miRNA identification from RNA sequencing data. Nucleic Acids Res, 41, 727-737. We will create index for you if you tell us your interested species (j.an@qut.edu.au). download command line version "MDS_command_line_Vxx.zip" clicking "Browse All Files" please find miRPlant in sourceforge for plant miRNA prediction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    DataMelt

    DataMelt

    Computation and Visualization environment

    DataMelt (or "DMelt") is an environment for numeric computation, data analysis, computational statistics, and data visualization. This Java multiplatform program is integrated with several scripting languages such as Jython (Python), Groovy, JRuby, BeanShell. DMelt can be used to plot functions and data in 2D and 3D, perform statistical tests, data mining, numeric computations, function minimization, linear algebra, solving systems of linear and differential equations. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 21

    BitMagic Library

    Compressed bit-sets, sparse bit matrices and algorithms

    BitMagic - C and C++ library implementing dynamic bitvectors and bit-set algorithms with several types of on-the-fly, adaptive compression. Designed for use in databases, search systems, data-mining algorithms, scientific projects. The core of the library is C++, but it provides C-compatibility wrappers and can be compiled without C++ runtime. Optimizations for Intel SSE2, SSE4.2 and AVX2.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22

    Divide and Conquer Treemaps

    Divide and Conquer Treemap Visualisation

    The project develops new new techniques and algorithms for quickly partitioning and visualizing very large hierarchical structures within a variety of arbitrary shapes and space using Divide and Conquer approach for Treemaps. Related publications: https://www.sciencedirect.com/science/article/pii/S1045926X1500066X https://dl.acm.org/doi/pdf/10.1145/2493102.2493112
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    OpenVigil

    Open pharmacovigilance data extraction, mining and analysis tool

    OpenVigil provides a webinterface to analyse pharmacovigilance data, i.e., spontanous or systematic collections of treatments (drugs) and observed adverse events ("drug side effects"). FDA Adverse Event Reporting System (AERS) and other pharmacovigilance data (e.g., Canadian or German) are supported. The OpenVigil web-based analysis tools offer several analysis modes like extraction, filtering, mining of data and analyses via measurements of disproportionality like proportional reporting ratio or reporting odds ratios) and export to spreadsheets programs like Microsoft Excel or statistics programs like R. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 24
    OmicSelector

    OmicSelector

    Feature selection and deep learning modeling for omic biomarker study

    OmicSelector is an environment, Docker-based web application, and R package for biomarker signature selection (feature selection) from high-throughput experiments and others. It was initially developed for miRNA-seq (small RNA, smRNA-seq; hence the name was miRNAselector), RNA-seq and qPCR, but can be applied for every problem where numeric features should be selected to counteract overfitting of the models. Using our tool, you can choose features, like miRNAs, with the most significant...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    MANTI

    MANTI

    MANTI - Mastering Advanced N-Termini Interpretation

    ...For a very detailed explanation of script parameters and the evaluation strategy, please consult the extensive manual PDF
    Downloads: 1 This Week
    Last Update:
    See Project