Showing 481 open source projects for "data science"

  • 1
    plot.py

    direct data plotting and evaluation

    The Plot.py project aims to provide a framework for visualizing and treating measurement data that is easy to use while leaving advanced users free to run additional data treatment algorithms. Plotting is done via gnuplot, and the script used to produce the graphs can be exported for later use or changes. Many raw experimental data types (mostly from x-ray and neutron scattering experiments) are supported, with more to be added on user request. The data treatment includes...
    Downloads: 0 This Week
  • 2

    reditools

    RNA editing detection by NGS data

    REDItools are Python scripts developed to study RNA editing at genomic scale using next-generation sequencing data. RNA editing is a post-transcriptional phenomenon involving the insertion/deletion or substitution of specific bases at precise RNA locations. In humans, RNA editing occurs by deamination of cytosine to uridine (C-to-U) or, more commonly, by adenosine-to-inosine (A-to-I) conversion through ADAR enzymes. A-to-I substitutions may have profound functional consequences and...
    Downloads: 1 This Week
  • 3
    PyTom

    http://www.sciencedirect.com/science/article/pii/S1047847711003492

    PyTom is a toolbox developed for interpreting cryo electron tomography data. All steps, from reconstruction and localization to alignment and classification, are covered with standard and improved methods. Please sign up to our mailing list to keep up with the most recent updates and versions.
    Downloads: 0 This Week
  • 4

    YFitter

    Fitting Y chromosome haplogroups by maximum likelihood

    Yfitter is a program for assigning Y chromosome haplogroups to individuals sequenced at low coverage. It is designed to be used in a samtools/bcftools pipeline. Yfitter also supports haplogrouping using chip genotype data.
    Downloads: 0 This Week
  • 5

    CIF2Cell

    Generating cells for electronic structure calculations from CIF files

    CIF2Cell is a tool to generate the geometrical setup for various electronic structure codes from a CIF (Crystallographic Information Framework) file. The program currently supports output for a number of popular electronic structure programs, including ABINIT, ASE, CASTEP, CP2K, CPMD, CRYSTAL09, Elk, EMTO, Exciting, Fleur, FHI-aims, Hutsepot, MOPAC, Quantum Espresso, RSPt, Siesta, SPR-KKR, VASP. Also exports some related formats like .coo, .cfg and .xyz-files. The program has been published...
    Downloads: 10 This Week
  • 6
    Awesome Math

    This is the Curriculum for "How to Learn Mathematics Fast"

    This repository is a curated roadmap for learning the core mathematics used in computer science, machine learning, and data science without getting lost in unnecessary detours. It organizes topics like algebra, calculus, linear algebra, probability, and statistics into a pragmatic sequence that favors intuition and problem-solving over purely formal proofs. The materials emphasize short, high-leverage resources—video lectures, concise notes, and hands-on exercises—that help you build momentum quickly. ...
    Downloads: 0 This Week
  • 7
    SpacePy
    Now maintained at github.com/spacepy/spacepy. Space science library for Python: contains superposed epoch classes, drift shell tracing, access to magnetic field models, streamline tracing, bootstrap confidence limits, time and coordinate conversions, etc.
    Downloads: 0 This Week
  • 8
    GDL - GNU Data Language, a free IDL (Interactive Data Language, see http://ittvis.com/idl/) compatible incremental compiler.
    Downloads: 2 This Week
  • 9
    TEACUP

    TCP Experiment Automation Controlled Using Python

    TEACUP automates many aspects of running TCP performance experiments in a specially-constructed physical testbed. TEACUP enables repeatable testing of different TCP algorithms over a range of emulated network path conditions, bottleneck rate limits and bottleneck queuing disciplines. TEACUP utilises a text-based configuration file to define experiments as combinations of parameters specifying desired network path and end host conditions. When multiple values are provided (e.g. for TCP...
    Downloads: 0 This Week
  • 10

    lr2rmats

    Long read to rMATS

    lr2rmats is a lightweight Snakemake-based pipeline that uses both third-generation long-read and second-generation short-read RNA-seq data to generate an enhanced gene annotation file. The newly generated annotation file can be provided to rMATS for differential alternative splicing analysis. More information can be found at https://sourceforge.net/p/lr2rmats/wiki/Home/
    Downloads: 0 This Week
  • 11

    DualTranscriptDiscovery

    Transcript-discovery approach for gene feature delimitation by RNA-seq

    This project contains Python scripts usable for a dual transcript-discovery approach that improves the delimitation of gene features from RNA-seq data in the chicken model. Documentation: http://bio.biologists.org/content/biolopen/suppl/2018/01/17/bio.028498.DC1/BIO028498supp.pdf Citation: Orgeur M., Martens M., Börno S. T., Timmermann B., Duprez D. and Stricker S. (2018). A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the...
    Downloads: 0 This Week
  • 12
    multiplierz
    Open-source Python software library and GUI desktop environment for direct bioinformatic analysis of mass-spectrometry data through powerful scripting tools and interfaces to many machine data formats, database search engines, and peptide data formats. For a copy of the source code, see our GitHub repositories: mzDesktop (https://github.com/MaxAlex/mzDesktop) and multiplierz (https://github.com/MaxAlex/multiplierz).
    Downloads: 0 This Week
  • 13
    Spectral-SpecPro

    Spectral - software for manipulating optical spectroscopy data

    Spectral-SpecPro helps with the manipulation of optical spectroscopy data. Spectral takes files produced by Jasco instruments (uv-vis absorbance, fluorescence, circular dichroism readings as a function of wavelength, temperature, or time) and facilitates basic operations such as unit conversion (CD spectra), conversion into the format taken by CDPro (estimation of secondary structure; CD spectra), scatter correction (absorbance spectra), smoothing, re-sampling of the x-axis, and generic...
    Downloads: 4 This Week
  • 14

    MethyMer

    Design of specific primer combinations for bisulfite sequencing

    MethyMer is a Python-based tool for selecting specific primers for amplification of complete CpG islands. Selecting appropriate primers for these regions is difficult because of their low complexity, polyN stretches, CG-richness, etc. MethyMer has a flexible scoring system capable of selecting primers in problematic regions (e.g. CpG islands) and includes a specificity test (based on bowtie alignment against a bisulfite-treated genome). It also incorporates TCGA CpG methylation...
    Downloads: 0 This Week
  • 15

    Collaborative Computing Project for NMR

    Collaborative Computing Project for NMR (CCPN)

    The Collaborative Computational Project for NMR (CCPN) provides tools and knowledge to maximise the impact of biological NMR studies. The CCPN software facilitates data analysis and software integration. The project actively promotes the exchange of knowledge and provides training and best practices for the NMR community. CCPN also has a leading role in the development of an NMR data-sharing standard and in the coordination of NMR instrumentation proposals for RCUK and BIS. The 28 partners of CCPN jointly cover all aspects of biomolecular NMR and together promote excellence in science in their respective fields.
    Downloads: 0 This Week
  • 16
    nmr-nessy

    NMR relaxation dispersion spectroscopy analysis software

    NESSY is open-source software for analysing NMR relaxation dispersion data from either CPMG or R1p (R1rho) dispersion experiments. The graphical interface enables simple management of large experimental data sets and simple, automated analysis. NESSY automatically calculates the effective transverse relaxation rate (R2eff) and performs model selection between different relaxation dispersion models.
    Downloads: 0 This Week
  • 17

    PyVE

    PyVE is an image analysis and visualization environment

    PyVE is an image analysis and visualization environment aimed at clinical use. At its core is a powerful viewer, based on VTK, for displaying 3D datasets (MRI, PET, CT). It comes precompiled, allowing painless access to Python (2.x), the ITK toolkit for image analysis, numpy/scipy for numerical calculations, and Qt and PyQt4 for developing graphical user interfaces. It is what you need for fast prototyping and development of more complex projects.
    Downloads: 0 This Week
  • 18
    composight
    Composight is a cross-platform toolkit for 3D-image processing in the domain of composite materials science. It is written in C++ and provides small, problem-specific applications for viewing, filtering and segmentation of volumetric data such as micro-CT scans. The main objective is not to provide yet another complex application for volume data visualization and medical image processing. Instead, Composight is a collection of small and simple apps that have already been successfully used to solve various problems in materials science.
    Downloads: 0 This Week
  • 19
    MuPIF

    Multi-Physics Integration Framework (MuPIF)

    Multi-Physics Integration Framework (MuPIF) is an integration framework that facilitates the implementation of multi-physics and multi-level simulations built from independently developed components. The principal role of the framework is to steer individual components (applications) and to provide high-level data-exchange services. Each application should implement an interface that allows the framework to steer the application and execute data requests. The design supports various coupling strategies,...
    Downloads: 0 This Week
  • 20

    SnowyOwl

    RNA-Seq based gene prediction pipeline for fungal genomes

    SnowyOwl is a gene prediction pipeline that uses RNA-Seq data to train and provide hints for the generation of Hidden Markov Model (HMM)-based gene predictions, and to evaluate the resulting models. The pipeline has been validated and streamlined by comparing its predictions to manually curated gene models in three fungal genomes, and its results show substantial increases in sensitivity and selectivity over previous gene predictions. Sensitivity is gained by repeatedly running the HMM gene...
    Downloads: 0 This Week
  • 21
    This is a repository for software that is needed to communicate with the data loggers of CORK borehole observatories (mlterm) and to convert and post-process the data. Have a look at http://www.corkobservatories.org/ for an introduction to CORKs.
    Downloads: 0 This Week
  • 22
    PBSuite

    Software for Long-Read Sequencing Data from PacBio

    This project currently hosts two tools created and maintained by Adam English: PBJelly, the genome upgrading tool, and PBHoney, the structural variation discovery tool. Both are contained within the PBSuite code found in downloads. ----- PBJelly ----- Read the paper: http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0047768 PBJelly is a highly automated pipeline that aligns long sequencing reads (such as PacBio RS reads or long 454 reads in fasta format) to...
    Downloads: 1 This Week
  • 23

    Gene-Environment iNteraction Simulator 2

    A tool able to simulate gene-environment and gene-gene interactions.

    Gene-Environment iNteraction Simulator 2 (GENS2) simulates interactions between two genetic factors and one environmental factor and also allows for epistatic interactions. GENS2 is based on data with realistic patterns of linkage disequilibrium, and imposes no limitations either on the number of individuals to be simulated or on the number of non-predisposing genetic/environmental factors to be considered. The GENS2 tool is able to simulate gene-environment and gene-gene interactions. To make the...
    Downloads: 0 This Week
  • 24
    This project houses software to analyze data acquired from electrophysiology experiments. Currently, we have an Octave/MATLAB program to analyze electroneurogram traces of coupled oscillators, and a Perl library for the analysis of voltage trace data.
    Downloads: 0 This Week
  • 25
    Taurus

    A Python-based user interface library.

    The Taurus Project has *moved* to: https://github.com/taurus-org/taurus This SourceForge page is *outdated* and kept for historical reference only. Taurus is a Python framework for control and data acquisition CLIs and GUIs in scientific/industrial environments. It supports multiple control systems or data sources: Tango, EPICS, ... New control system libraries and data sources can be integrated through plugins.
    Downloads: 0 This Week