Showing 520 open source projects for "python data analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1

    iMir

    Integrated pipeline for HT miRNA-Seq data analysis

    ...To allow a flexible and comprehensive analysis of smallRNA-Seq data we designed a novel modular pipeline, called iMir, integrating multiple open source modules and resource in an automated workflow, devising different statistical approaches to analyze data rigorously. iMir comprises also a Graphical User Interface (GUI), so that the pipeline is particularly suited for biologist and early stage bioinformaticians and produces both graphics and text outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    MZmine 2

    MZmine 2

    A framework for differential analysis of mass spectrometry data

    MZmine 2 is a framework written in Java for differential analysis of mass spectrometry data.
    Leader badge
    Downloads: 25 This Week
    Last Update:
    See Project
  • 3
    GESPA

    GESPA

    Accurately predicts disease association of single nucleotide mutations

    GESPA (GEnomic Single nucleotide Polymorphism Analyzer) is a bioinformatics tool for classifying Nonsynonymous Single Nucleotide Polymorphisms (nsSNPs). GESPA predicts if a nsSNP is pathogenic using reports from literature and various algorithms to assess conservation in orthologous and paralogous protein alignments. Using reports from literature, GESPA is also able to predict the phenotype of a nsSNP with high accuracy. The software can be used clinically to determine if observed nsSNPs are...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Biological Network Analyzer
    Workbench for visualizing biological networks (regulatory, interaction, and metabolic). Can be used as a front-end for BNDB datawarehouses, but also as stand-alone tool.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5

    ProC 3.0

    smart Workflow Engine

    ProC 3.0 is a scientific workflow engine to build, manage and execute workflows (pipelines) in heterogeneous environments, supporting GRID and other means of parallel processing. It includes a data management component (DMC) to transparently access databases for storage of results and automatically adds metadata to track the processing of data products, so that at every time a full processing history is available. The software was developed and used within the ESA Planck satellite mission.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    openSEQ

    NGS compute distro proloaded with pipeline analysis software

    Forked Ubuntu 14.04 minimal install with XOrg and FluxBox desktop environment for web-based pipeline construction and job submission. HOWTO: 1) Set VM network settings to a bridged adapter type. 2) Load VM to client 3) From host: $ ssh -X ubuntu@<foobar> 'firefox && butterfly.server.py --unsecure' --- passwd = 616287xx 5) or ssh into host with X forwarding, and type runme This will launch an X-session of Firefox on the client, which will then be forwarded to the host. The...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    OARS Android IMU & GPS Data Logger

    OARS Android IMU & GPS Data Logger

    OARS - Orientational Awareness Reaction Systems

    The OARS app is a data logger developed to run on Android OS and devices containing an internal GPS ,along with micro-electro-mechanical systems (MEMS) chipset. Specifically the Invensence MPU-6000 or newer. This chip is a highly accurate inertial measurement unit, or (IMU). Contained within this small IMU is a 3-axis Accelerometer, 3-axis Gyro, and 3-axis Magnetometer. Along with a built in processor, it can calculate movements along 9-axis. Depending on platform and app setting, OARS can...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Text Expander, Inverse summarizer

    Text Expander, Inverse summarizer

    Expand text, inverse summarizer

    IT WILL WORK WITH A JAVA DEVELOPMENT KIT 1.7 ONLY !!! This program is a data-miner and a knowledge-miner. It does exactly the opposite of what the text summarizers do. A text summarizer produces a shortened text given some text as an input. An inverse summarizer takes the shortened input, a similar or a same text and does the process in reverse. This results in an expanded text. It can be used with any text or notes that have the knowledge gaps. It is a great aid to any creative...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Chordalysis

    Log-linear analysis (data modelling) for high-dimensional data

    ===== Project moved to https://github.com/fpetitjean/Chordalysis ===== Log-linear analysis is the statistical method used to capture multi-way relationships between variables. However, due to its exponential nature, previous approaches did not allow scale-up to more than a dozen variables. We present here Chordalysis, a log-linear analysis method for big data. Chordalysis exploits recent discoveries in graph theory by representing complex models as compositions of triangular structures, also known as chordal graphs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    VANTED
    VANTED - Visualization and Analysis of NeTworks containing Experimental Data At SourceForge the VANTED development history is preserved, only limited amount of development will proceed here. Please head on to the most recent developments, which can be observed at www.vanted.org.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    SoNIA (Social Network Image Animator)

    Social Network Image Animator creates animations of network data

    SoNIA (Social Network Image Animator) is a Java-based package for visualizing dynamic or longitudinal network data and rendering it out as animations. The home page with examples is located at http://sonia.stanford.edu Documentation is at https://sourceforge.net/p/sonia/wiki/Main_Page/
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    e-Science Central

    e-Science Central

    Cloud based scientific data analysis

    e-Science Central has moved to BitBucket, please visit: https://bitbucket.org/digitalinstitute/esciencecentral/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Libraries, documentation, examples & drivers for Eagle Technology South Africa's Data Acquisition products. These include ISA, PCI, PCI Express, USB, Serial & Ethernet. Supported languages will be C/C++, JAVA, Perl, Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    picard

    A set of tools for working with high-throughput sequencing data

    A set of tools (in Java) for working with next generation sequencing data in the SAM/BAM format. Note that development has moved to GitHub at https://github.com/broadinstitute/picard and support is available on the GATK forum at http://gatkforums.broadinstitute.org/categories/ask-the-team
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15

    Deem

    Analyze time-course data with significance tests, clustering, modeling

    Use statistical methods to analyze time-course data (gene expression microarray and RNA-seq data in particular, but not limited to). Apply significance tests to filter out only significant genes or time series. Cluster time series into similar groups. Generate network models, including linear or non-linear models. Variable selection and optimization routines included. Written in Scala and R. The application is a cross-platform desktop app with a simple GUI and is fully functional...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    microtransat
    A project to host code developed for the Microtransat challenge, a race between autonomous sailing robots. This project is intended for both supporting code such as tracking systems and robot control systems or parts of robot control systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    phenotemp

    phenotemp

    phenological trends using NOAA AVHRR time series

    Starting from GIMMS values as ascii data (each line will be treated as individual time series), the application can smooth the ts - values by applying different algorithm that are based on Fouriertransformation. Separation of seasonal figure as well as the detection of linear trends is possible. Possible breaks in longterm mean can be detected with a change-point analysis using CuSum algorithms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    vipR is a program to screen for sequence variants (SNPs, deletions) in sequence data generated by high-throughput-sequencing platforms. Information on this and other projects can be found on: http://www.altmann.eu
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    KeplerDB

    KeplerDB

    Timeseries databases management system

    KeplerDB is a temporal database to store time/value entries where the type of value could be integer, float/double, boolean and string. KeplerDB is dedicated to be scalable and to create clusters of server allowing the user to analyse and store massing amount of data to monitor systems like computers, clusters, building and captors or financial systems like markets and accounts. The user can use KeplerDB to make data analysis on enormous amount of data (statistics and modelling).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Graph-RAT
    Graph-RAT is a database abstraction layer designed to make it easy to use a large library of graph-analysis routines on a database as well as add new kinds of algorithms to data mining.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    The National Library of New Zealand's Metadata Extraction Tool automatically extracts preservation-related metadata from digital files, then output that metadata in XML formats. It can be used through a graphical user interface or command-line interface. Please take the latest code from 'https://github.com/DIA-NZ/Metadata-Extraction-Tool.git'. The code on source forge will not be updated henceforth as it is moved to github.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 22
    Graphical Grammar Studio

    Graphical Grammar Studio

    An user friendly grammar tool for natural language processing tasks

    Full documentation with tutorials is included in the download package. Graphical Grammar Studio is a tool for applying grammars which behave as words acceptors/consumers and annotators. GGS grammars can be used to find and annotate sequences of words which respect certain conditions, in a given input. Its purpose is for creating NLP tools like phrase chunkers, named entity finders, pronoun co-reference solvers etc. A grammar is represented by a state machine which can be visualized, edited...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    PlaTypUS

    PlaTypUS

    Plasmodium Typing Utility Software

    The goal of PlaTypUS is to establish a unified Plasmodium whole genome analysis tool, that aligns short read sequences in an agreed upon manner, with many quality control steps, and calls both SNVs and CNVs in a community agreed standard way. We also seek to provide this analysis in a stand-alone graphic user interface so that labs around the world can analyze their own whole genome sequencing data, rather than relying on outside institutes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    TriageTools

    Tools for partitioning and prioritizing fastq data

    TriageTools is a collection of tools for partitioning raw data (fastq reads) from high-throughput sequencing projects. The tools are designed for basic data management as well for prioritizing analysis of certain subsets. The project wiki contains usage information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ActiveFolder

    ActiveFolder

    ActiveFolder can be integrated with existed scientific computation.

    Welcome to the ActiveFolder project, the open source for management simulataion data and reproducing. ActiveFolder provide data reproduction for comparative study and searching reproduction data, by storing data coupled with provenance information. ActiveFolder is integrated existed grid computing service and cloud services. And also, ActiveFolder can be integrated with existed scientific computation, analysis and visualization software without modification.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB