Showing 161 open source projects for "pdf data mining"

View related business solutions
  • Your monitoring isn't a stack. It's a pile. Fix that. Icon
    Your monitoring isn't a stack. It's a pile. Fix that.

    Errors, performance, logs, uptime. One install, one invoice, one UI.

    Replace Datadog, New Relic, and Sentry without adding three more dashboards.
    Free 30 days.
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 1
    Orange Data Mining

    Orange Data Mining

    Orange: Interactive data analysis

    ...When teaching data mining, we like to illustrate rather than only explain.
    Downloads: 53 This Week
    Last Update:
    See Project
  • 2
    Dawarich

    Dawarich

    Self-hostable alternative to Google Timeline

    Dawarich is a command-line tool (likely Ruby-based) for transforming and analyzing Arabic text data with normalization, diacritic handling, segmentation, and morphological tokenization. Designed for text mining and NLP workflows in Arabic-language contexts.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    ProM is the comprehensive, extensible framework for process mining. Process Mining deals with the a-posteriori analysis of (business) processes using enactment logs.
    Leader badge
    Downloads: 45 This Week
    Last Update:
    See Project
  • 4
    PaperQA2

    PaperQA2

    High accuracy RAG for answering questions from scientific documents

    PaperQA2 is a package for doing high-accuracy retrieval augmented generation (RAG) on PDFs or text files, with a focus on the scientific literature. See our recent 2024 paper to see examples of PaperQA2's superhuman performance in scientific tasks like question answering, summarization, and contradiction detection. In this example we take a folder of research paper PDFs, magically get their metadata - including citation counts and a retraction check, then parse and cache PDFs into a...
    Downloads: 5 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 5
    dxf2gcode

    dxf2gcode

    DXF2GCODE: converting 2D dxf drawings to CNC machine compatible G-Code

    DXF2GCODE is a tool for converting 2D (dxf, pdf, ps) drawings to CNC machine compatible GCode. Windows, Linux, and Mac support by using python scripting language.
    Leader badge
    Downloads: 353 This Week
    Last Update:
    See Project
  • 6
    GeoDMA

    GeoDMA

    Geographic feature extraction and data mining

    GeoDMA is a plugin for TerraView software, used for geographical data mining. With a single image, the user can perform segmentation, attributes extraction, normalization and classification.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Jmol

    Jmol

    An interactive viewer for three-dimensional chemical structures.

    Over 1,000,000 page views per month. Jmol/JSmol is a molecular viewer for 3D chemical structures that runs in four independent modes: an HTML5-only web application utilizing jQuery, a Java applet, a stand-alone Java program (Jmol.jar), and a "headless" server-side component (JmolData.jar). Jmol can read many file types, including PDB, CIF, SDF, MOL, PyMOL PSE files, and Spartan files, as well as output from Gaussian, GAMESS, MOPAC, VASP, CRYSTAL, CASTEP, QuantumEspresso, VMD, and many other...
    Leader badge
    Downloads: 582 This Week
    Last Update:
    See Project
  • 8
    A Matlab toolkit for all aspects of EEG/ERP analysis, especially PCA. If you run into a problem, please send me a note and I'll fix it. The tutorial pdf is in the documentation folder and the tutorial data can be downloaded from a link provided by the pdf. The tutorial file has full install instructions. For all platforms supported by Matlab.
    Leader badge
    Downloads: 12 This Week
    Last Update:
    See Project
  • 9
    JaxoDraw

    JaxoDraw

    JaxoDraw: Feynman Diagrams made easy!

    JaxoDraw is a tool to generate Feynman diagrams in a mouse click-and-drag fashion. Graphs can be exported to a variety of graphics formats and arbitrary latex code can be used for labels to produce high-quality publishing-style figures.
    Leader badge
    Downloads: 33 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 10
    QXRD is software for the acquisition and analysis of X-ray data taken with 2 dimensional detectors. The software can drive a Perkin Elmer XRD series flat panel detector and can be remote-controlled via a socket interface, or directly from SPEC
    Downloads: 8 This Week
    Last Update:
    See Project
  • 11
    AuroreNR
    Software developed for the analysis of Neutron Reflectivity data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    WP 34s

    WP 34s

    Scientific/engineering firmware repurposing HP business calculators!

    This project has created scientific firmware for the HP-20b and HP-30b business calculators. WP 34S turns either of these calculators into a powerful keystroke programmable scientific device. According to our customers, it's the most powerful and fastest RPN scientific pocket calculator ever built. WP 34S is alive and stable since 2011. We have succeeded in satisfying the most picky users - read about their experiences on http://www.hpmuseum.org/forum/forum-8.html. Since 2014, WP...
    Leader badge
    Downloads: 21 This Week
    Last Update:
    See Project
  • 13
    ...However, it also offers extra functionality via applications and utilities developed by others. Electronic Design Automation (EDA) tool suites are used to provide schematic capture and editing, and schematic to netlist conversion. Waveform data viewers are used to display simulation results and PDF viewers to display user manuals.
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • 14
    To give users the full control over the running application. This means that an application is working according to its purpose but the control over the whole interface is taken from developer and given to users. While an application is running, users can move, resize, and tune all the screen objects through which the communication with an application is going. Set of files includes the book (both in DOC and PDF formats), a big demonstration project with all its files available (all the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    stkpp

    stkpp

    C++ Statistical ToolKit

    ...At a convenience, we propose the source packages on sourceforge. The library offers a dense set of (mostly) template classes in C++ and is suitable for projects ranging from small one-off projects to complete data mining application suites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    UnBBayes

    UnBBayes

    Framework & GUI for Bayes Nets and other probabilistic models.

    UnBBayes is a probabilistic network framework written in Java. It has both a GUI and an API with inference, sampling, learning and evaluation. It supports Bayesian networks, influence diagrams, MSBN, OOBN, HBN, MEBN/PR-OWL, PRM, structure, parameter and incremental learning. Please, visit our wiki (https://sourceforge.net/p/unbbayes/wiki/Home/) for more information. Check out the license section (https://sourceforge.net/p/unbbayes/wiki/License/) for our licensing policy.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    Periodic Table Of Elements 4 Mathematica

    Periodic Table Of Elements 4 Mathematica

    generates nice period table from given data

    PTE-1.5 is for Mathematica 11.1 (not mm14 - needs ver change fixing) PTE-1.4 and prev for Mathematica 4.0 linux PTE generates periodic tables from variable data (the data and layout can be changed for a particular subject). Includes Pauli Exclusion makers and some data. Charts can be browsed with an iPhone nicely (may need .jpg, .gif nicer). It's printable at a small size. Layout change un-restricted but is non-interactive. which is good and bad: it does the job. CreateModule...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    IEC 104 Protocol Linux arm code download

    IEC 104 Protocol Linux arm code download

    IEC 104 Source code Library for Linux c c++ posix arm

    v21.06.018 Complete implementation of IEC 104 including file transfer. Supports Embedded Linux (ARM, Coldfire, Power PC), Ubuntu Linux(X86, X86-64), Fedora, CentOS, Red Hat. FreyrSCADA IEC 60870-5-104 Linux SDK contains Shared Library(.a) (precompiled library), Static link lib, Demo IEC104 Server & Client programs, CodeBlock Demo console project files, Doxygen . Get free Evaluation Kit - IEC 60870-5-104 Development Bundle In the Development Bundle, We included IEC 60870-5-104...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 19
    miRDeep*

    miRDeep*

    MiRDeep*

    Please cite: An, J., Lai, J., Lehman, M.L. and Nelson, C.C. (2013) miRDeep*: an integrated application tool for miRNA identification from RNA sequencing data. Nucleic Acids Res, 41, 727-737. We will create index for you if you tell us your interested species (j.an@qut.edu.au). download command line version "MDS_command_line_Vxx.zip" clicking "Browse All Files" please find miRPlant in sourceforge for plant miRNA prediction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    DataMelt

    DataMelt

    Computation and Visualization environment

    DataMelt (or "DMelt") is an environment for numeric computation, data analysis, computational statistics, and data visualization. This Java multiplatform program is integrated with several scripting languages such as Jython (Python), Groovy, JRuby, BeanShell. DMelt can be used to plot functions and data in 2D and 3D, perform statistical tests, data mining, numeric computations, function minimization, linear algebra, solving systems of linear and differential equations. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 21

    BitMagic Library

    Compressed bit-sets, sparse bit matrices and algorithms

    BitMagic - C and C++ library implementing dynamic bitvectors and bit-set algorithms with several types of on-the-fly, adaptive compression. Designed for use in databases, search systems, data-mining algorithms, scientific projects. The core of the library is C++, but it provides C-compatibility wrappers and can be compiled without C++ runtime. Optimizations for Intel SSE2, SSE4.2 and AVX2.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22

    OpenVigil

    Open pharmacovigilance data extraction, mining and analysis tool

    OpenVigil provides a webinterface to analyse pharmacovigilance data, i.e., spontanous or systematic collections of treatments (drugs) and observed adverse events ("drug side effects"). FDA Adverse Event Reporting System (AERS) and other pharmacovigilance data (e.g., Canadian or German) are supported. The OpenVigil web-based analysis tools offer several analysis modes like extraction, filtering, mining of data and analyses via measurements of disproportionality like proportional reporting ratio or reporting odds ratios) and export to spreadsheets programs like Microsoft Excel or statistics programs like R. ...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 23
    OmicSelector

    OmicSelector

    Feature selection and deep learning modeling for omic biomarker study

    OmicSelector is an environment, Docker-based web application, and R package for biomarker signature selection (feature selection) from high-throughput experiments and others. It was initially developed for miRNA-seq (small RNA, smRNA-seq; hence the name was miRNAselector), RNA-seq and qPCR, but can be applied for every problem where numeric features should be selected to counteract overfitting of the models. Using our tool, you can choose features, like miRNAs, with the most significant...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    MANTI

    MANTI

    MANTI - Mastering Advanced N-Termini Interpretation

    ...For a very detailed explanation of script parameters and the evaluation strategy, please consult the extensive manual PDF
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    QuickPlot

    QuickPlot

    Simple user interface for gnuplot aimed for reflectometry data

    Graphical user interface for gnuplot to create publication quality figure very quickly. It supports templates for fast formatting of graphics, different plot styles, insets, axis and label options. One important feature is storing metadata in png and pdf files that can be used to reload any graph saved with QuickPlot.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next