Showing 90 open source projects for "python data analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    google-java-format

    google-java-format

    Reformats Java source code to comply with Google Java Style

    google-java-format is a program that reformats Java source code to comply with Google Java Style. The formatter can act on whole files, on limited lines, on specific offsets, passing through to standard-out (default) or altered in-place. There is no configurability as to the formatter's algorithm for formatting. This is a deliberate design decision to unify our code formatting on a single format. The formatter can be used in software that generates java to output more legible java code. Just...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 2
    OpenDataLoader PDF

    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    OpenDataLoader PDF is an open-source document processing system designed to convert complex PDF files into structured, AI-ready formats such as Markdown, JSON, and HTML while preserving layout, hierarchy, and semantic meaning. It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes. The tool combines deterministic parsing...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 3
    Framework Benchmarks

    Framework Benchmarks

    Source for the TechEmpower Framework Benchmarks project

    If you're new to the project, welcome! Please feel free to ask questions here. We encourage new frameworks and contributors to ask questions. We're here to help! This project provides representative performance measures across a wide field of web application frameworks. With much help from the community, coverage is quite broad and we are happy to broaden it further with contributions. The project presently includes frameworks on many languages including Go, Python, Java, Ruby, PHP, C#,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    ProM is the comprehensive, extensible framework for process mining. Process Mining deals with the a-posteriori analysis of (business) processes using enactment logs.
    Leader badge
    Downloads: 30 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    Grassroots DICOM

    Grassroots DICOM

    Cross-platform DICOM implementation

    Grassroots DiCoM is a C++ library for DICOM medical files. It is accessible from Python, C#, Java and PHP. It supports RAW, JPEG, JPEG 2000, JPEG-LS, RLE and deflated transfer syntax. It comes with a super fast scanner implementation to quickly scan hundreds of DICOM files. It supports SCU network operations (C-ECHO, C-FIND, C-STORE, C-MOVE). PS 3.3 & 3.6 are distributed as XML files. It also provides PS 3.15 certificates and password based mecanism to anonymize and de-identify DICOM datasets.
    Leader badge
    Downloads: 119 This Week
    Last Update:
    See Project
  • 6

    xsd2pgschema

    Relational database replication tool based on XML Schema

    ...Obtained PostgreSQL database can be optimized at user's discretion. Moreover, differential update, and XPath query evaluation over PostgreSQL are possible. PgSchema server, serialized relational data model server, can be used to speed up the analysis of complex XML Schema. Large XML file can be split through xmlsplitter, a flexible XML splitter based on XPath and StAX.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    PoJamas aims to provide a Python and tools for loading, processing, and producing .cr2, pz3 (crz, pzz) files compatible with the SmithMicro (e-frontier) Poser character animation application. PoJamas is composed of: - Python library - Python Wavefront (.obj) 3D viewer based on GLFW - LibreOffice/Python Application (to ease the library and the viewer usage) As of 2020, the project is ported in Python3 As of 2021 this project proposes a 3D viewer for Wavefront files...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    javahexeditor Java Hex Editor

    javahexeditor Java Hex Editor

    A hex editor Eclipse plugin and multi-platform desktop application

    You can install the latest Eclipse plugin version from the update site https://javahexeditor.sourceforge.io/update or the Eclipse Marketplace https://marketplace.eclipse.org/content/java-hex-editor. Older versions of the Eclipse plugin are available via the update site for the version, e.g. https://javahexeditor.sourceforge.io/update/0.5.1 You can download the latest stand-alone version and older versions on the "Files" tab.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    ebfformat

    ebfformat

    An Efficient Binary data Format

    ...It is also designed to simplify the programming of input output routines in different programming languages. In a nutshell an EBF file is a collection of data objects. Each data object is specified by a unique name and a single file can have multiple data objects. Each data object is preceded by a meta-data or header which describes the binary data associated with it. Among other things, this header allows the files to be portable across systems with different endianess.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    FlexibleParser_Java

    FlexibleParser_Java

    Group of multi-purpose Java-converted-from-C# parsing libraries.

    FlexibleParser (Java) is currently formed by the following independent libraries/parts: - UnitParser. It allows to easily deal with a wide variety of situations involving units of measurement. - NumberParser. It provides a common framework for all the .NET numeric types. AUTHORSHIP & COPYRIGHT I, Alvaro Carballo Garcia (varocarbas), am the sole author of each single bit of this code. All the contents of this repository can be considered public domain. For more information about my...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    FilteredPush
    Network software for annotation and quality control of distributed data. Supported by NSF: DBI 0646266 and NSF: DBI 0960535. http://wiki.filteredpush.org
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    TraceInspector

    Offline Wily Transaction Inspector 0.16.7.3 BETA

    Offline Wily Transaction Inspector is a Java standalone application with which one can analyze trace files offline generated by Wily Introscope Workstation or Webview.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    EXMARaLDA
    EXMARaLDA stands for "Extensible Markup Language for Discourse Annotation". It's a system of concepts, data formats and tools for the computer assisted transcription and annotation of spoken language, and the analysis of spoken language corpora. This project's source code has moved to https://github.com/Exmaralda-Org/exmaralda
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    MathOCR

    MathOCR

    A scientific document recognition system

    MathOCR is a printed scientific document recognition system. MathOCR is still in the pre-alpha stage, recognition result may not be good enough for practical purposes. MathOCR is a printed scientific document recognition system written in pure Java. MathOCR has the functionality of image preprocessing, layout analysis and character recognition, especially the ability to recognize mathematical expression. MathOCR can work do not depends on other OCR software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    OpenSearchServer Extractor

    OpenSearchServer Extractor

    A RESTFul/JSON Web Service for text and metata extraction

    An open source RESTFul Web Service for text , meta-data extraction and analysis. oss-text-extractor supports various binary formats: Word processor (doc, docx, odt, rtf) Spreadsheet (xls, xlsx, ods) Presentation (ppt, pptx, odp) Publishing (pdf, pub) Web (rss, html/xhtml) Medias (audio, images) Others (vsd, text)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    DoCookBook

    DoCookBook

    Cookbook Style Document for DocBook Customizations

    This project has been moved to GitHub: https://github.com/tomschr/dbcookbook/ The DoCookBook project aims to create an open source book about DocBook and the DocBook XSL stylesheets written as a cookbook and released under a Creative Commons license.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Human Speakable Programming Language

    Human Speakable Programming Language

    foundation of the General Intelligence Operating System

    HSPL is Human Speakable Programming Language, allowing for communication between human-to-computer and human-to-human in the same language. This project has moved to http://sourceforge.net/p/spel We are currently working on human-to-computer programming-language with mostly English base vocabulary. Though once we have that, we plan to add support for other world Languages, including Chinese, Spanish, Russian, Arabic, Hindi, among others. Eventually HSPL shall be the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    picard

    A set of tools for working with high-throughput sequencing data

    A set of tools (in Java) for working with next generation sequencing data in the SAM/BAM format. Note that development has moved to GitHub at https://github.com/broadinstitute/picard and support is available on the GATK forum at http://gatkforums.broadinstitute.org/categories/ask-the-team
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    Darkfall Emulator

    Darkfall Emulator

    Develop an emulated Darkfall Online Server.

    This project is the development of an open source fully emulated Darkfall Online Server and reverse engineering the proprietary game engine that DFO uses, the SF3D engine as well as its source code.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    The National Library of New Zealand's Metadata Extraction Tool automatically extracts preservation-related metadata from digital files, then output that metadata in XML formats. It can be used through a graphical user interface or command-line interface. Please take the latest code from 'https://github.com/DIA-NZ/Metadata-Extraction-Tool.git'. The code on source forge will not be updated henceforth as it is moved to github.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23

    XML Content Provider

    Plugin to connect XML data sources to the GIN Server

    The GIN Server is a semantic middleware for efficient "bottom-up" data integration and automized semantic analysis for dynamically linked data. The XML Content Provider is a configurable plugin for the GIN Server to integrate any XML data source with a simple structure.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    RDF Content Provider for iQser GIN

    Plugin to connect RDF sources with the GIN Server

    GIN Server is a semantic middleware for easy data integration and automized analysis. The extendable architecture allows to plugin in data sources, analytics and event handling. This RDF Content Provider enables access to Semantic Web Content as an RDF file or SPAEQL endpoint.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    KMLCSV Converter
    Converts Google Earth KML file to a CSV file, and vice versa. This enables you to plot all custom POIs with Google Earth first before transferring them into your Garmin GPS using the POI Loader.
    Downloads: 32 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
MongoDB Logo MongoDB