Showing 57 open source projects for "python data analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    Stanza

    Stanza

    Stanford NLP Python library for many human languages

    Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of speech and morphological features, to give a syntactic structure dependency parse, and to recognize named entities. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    AutoResearchClaw

    AutoResearchClaw

    Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞

    AutoResearchClaw is an open-source framework designed to automatically generate full academic research papers from a single idea or topic. Built in Python, it orchestrates a multi-stage research pipeline that gathers literature, formulates hypotheses, runs experiments, analyzes results, and writes the final paper. The system retrieves real academic references from sources such as arXiv and Semantic Scholar to ensure credible citations. It can automatically generate code for experiments, run...
    Downloads: 31 This Week
    Last Update:
    See Project
  • 3
    Recommenders

    Recommenders

    Best practices on recommendation systems

    The Recommenders repository provides examples and best practices for building recommendation systems, provided as Jupyter notebooks. The module reco_utils contains functions to simplify common tasks used when developing and evaluating recommender systems. Several utilities are provided in reco_utils to support common tasks such as loading datasets in the format expected by different algorithms, evaluating model outputs, and splitting training/test data. Implementations of several...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Archivematica

    Archivematica

    Free and open-source digital preservation system

    Archivematica is a web- and standards-based, open-source application which allows your institution to preserve long-term access to trustworthy, authentic, and reliable digital content. Our target users are archivists, librarians, and anyone working to preserve digital objects. You are free to copy, modify, and distribute Archivematica with attribution under the terms of the AGPLv3 license. Archivematica is an open-source application based on recognized standards that makes it possible to...
    Downloads: 10 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    nanoGPT

    nanoGPT

    The simplest, fastest repository for training/finetuning models

    NanoGPT is a minimalistic yet powerful reimplementation of GPT-style transformers created by Andrej Karpathy for educational and research use. It distills the GPT architecture into a few hundred lines of Python code, making it far easier to understand than large, production-scale implementations. The repo is organized with a training pipeline (dataset preprocessing, model definition, optimizer, training loop) and inference script so you can train a small GPT on text datasets like Shakespeare...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    AI Researcher

    AI Researcher

    An autonomous AI researcher

    AI Researcher is an experimental open-source project that demonstrates how multiple AI agents can collaborate to conduct complex research tasks from start to finish with minimal human intervention. It orchestrates agents that can generate research questions, perform literature reviews, execute experiments, analyze results, and synthesize findings into structured outputs like reports or code. Each agent operates with clear roles — such as researcher, analyst, and summarizer — and they...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    Kubernetes Handbook

    Kubernetes Handbook

    Cloud native application architecture practice handbook

    Cloud native is a behavioral method and design concept. In its essence, all behaviors or methods that can improve resource utilization and application delivery efficiency on the cloud are cloud-native. The history of cloud computing is a history of cloud native. Kubernetes opened the prelude to cloud native 1.0. The emergence of service mesh Istio led to microservices in the post-Kubernetes era. The rise of serverless has enabled cloud native to advance from the infrastructure layer to the...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Megatron-LM

    Megatron-LM

    Ongoing research training transformer models at scale

    Megatron-LM is a GPU-optimized deep learning framework from NVIDIA designed to train extremely large transformer-based language models efficiently at scale. The repository provides both a reference training implementation and Megatron Core, a composable library of high-performance building blocks for custom large-model pipelines. It supports advanced parallelism strategies including tensor, pipeline, data, expert, and context parallelism, enabling training across massive multi-GPU and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    JSONLab

    JSONLab

    JSONLab: compact, portable, robust JSON/binary-JSON encoder

    JSONLab is a free and open-source JSON/UBJSON/MessagePack encoder and decoder written in the native MATLAB language. It can be used to convert a MATLAB data structure (array, struct, cell, struct array, cell array, and objects) into JSON/UBJSON/MessagePack formatted strings and files, or to parse a JSON/UBJSON/MessagePack file into MATLAB data structure. JSONLab supports nearly all versions of MATLAB and GNU Octave (a free MATLAB clone). The development of JSONLab is currently funded by the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    xrayutilities

    xrayutilities

    a package with useful scripts for X-ray diffraction physicists

    xrayutilities is a python package used to analyze x-ray diffraction data. It can support with performing diffraction experiments and used for common steps in the data analysis. It can read experimental data from several data formats (spec, edf, xrdml, ...); convert them to reciprocal space for arbitrary goniometer geometries and different detector systems (point, linear as well as area detectors); for further processing the data can be gridded (transformed to a regular grid). ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Libro

    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus which scans a whole text file (in plain text, HTML, EPUB, or ODT formats) and ranks all used words according to frequency, performing a quantitative analysis of the text using Shannon-Weaver information statistic and Zipf power law function. It counts words, sentences, chars, spaces, and syllables.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    MathGL

    MathGL

    A library for scientific data visualization

    A free cross-platform library of fast C++ routines for the plotting of up to 3-ranged data. It can export to bitmap and vector EPS/SVG files. There are window interfaces (GLUT/FLTK/Qt) and console tools. MathGL can be used from C/Fortran/Python/Octav/Lua
    Leader badge
    Downloads: 41 This Week
    Last Update:
    See Project
  • 13
    Fuzzy machine learning framework

    Fuzzy machine learning framework

    A library and a GUI front-end for fuzzy machine learning

    Fuzzy machine learning framework is a library and a GUI front-end for machine learning using intuitionistic fuzzy data. The approach is based on the intuitionistic fuzzy sets and the possibility theory. Further characteristics are fuzzy features and classes; numeric, enumeration features and features based on linguistic variables; user-defined features; derived and evaluated features; classifiers as features for building hierarchical systems; automatic refinement in case of dependent...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    CiteSpace

    CiteSpace

    A widely used tool for visual exploration of scientific literature.

    Visit the new site: https://citespace.podia.com CiteSpace generates interactive visualizations of structural and temporal patterns and trends of a scientific field. It facilitates a systematic review of a knowledge domain through an in-depth visual analytic process. It can process citation data from popular sources such as the Web of Science, Scopus, Dimensions, and the Lens. CiteSpace also supports basic visual analytic functions for datasets without citation-related information, for...
    Leader badge
    Downloads: 2,097 This Week
    Last Update:
    See Project
  • 15
    Artifact GeoMorph Toolbox 3D 3.1

    Artifact GeoMorph Toolbox 3D 3.1

    A toolbox for 3DGM shape analysis of archaeological artifacts

    The Artifact Geomorph Toolbox 3D software is designed to provide the archaeologist interested in artifact shape variability with a toolbox to allow the acquisition, analysis and results exploration of homologous 3D landmark-based geometric morphometric data. As such, the toolbox contains an automated item and semi-landmarks positioning procedure and the fundamental statistical analyses and procedures to allow the processing and analysis of the data. It is designed to be easy to use and straightforward with a simple graphical user interface. ...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 16
    ACORBA

    ACORBA

    Automated approach to measure root tip angles of Arabidopsis thaliana

    Gravitropic response is studied in most of the laboratories working with Arabidopsis thaliana, for example, to detect new phenotypes in mutants. However, manual analysis of images and microscopy data are known to be subjected to human bias. This is particularly the case for manual measurements of root bending as the angle is set subjectively. In this context, it is essential to develop and use automated or semi-automated image analysis to produce faster, reproducible, and unbiased data. In this context, we developped ACORBA (Automatic Calculation Of Root Bending Angles), a fully automated software to measure root bending angle over time.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    DIG

    DIG

    A library for graph deep learning research

    The key difference with current graph deep learning libraries, such as PyTorch Geometric (PyG) and Deep Graph Library (DGL), is that, while PyG and DGL support basic graph deep learning operations, DIG provides a unified testbed for higher level, research-oriented graph deep learning tasks, such as graph generation, self-supervised learning, explainability, 3D graphs, and graph out-of-distribution. If you are working or plan to work on research in graph deep learning, DIG enables you to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Brain Tokyo Workshop

    Brain Tokyo Workshop

    Experiments and code from Google Brain’s Tokyo research workshop

    The Brain Tokyo Workshop repository hosts a collection of research materials and experimental code developed by the Google Brain team based in Tokyo. It showcases a variety of cutting-edge projects in artificial intelligence, particularly in the areas of neuroevolution, reinforcement learning, and model interpretability. Each project explores innovative approaches to learning, prediction, and creativity in neural networks, often through unconventional or biologically inspired methods. The...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Catalyst

    Catalyst

    Accelerated deep learning R&D

    Catalyst is a PyTorch framework for accelerated Deep Learning research and development. It allows you to write compact but full-featured Deep Learning pipelines with just a few lines of code. With Catalyst you get a full set of features including a training loop with metrics, model checkpointing and more, all without the boilerplate. Catalyst is focused on reproducibility, rapid experimentation, and codebase reuse so you can break the cycle of writing another regular train loop and make...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Ceka

    Ceka

    Crowd Environment and its Knowledge Analysis

    A knowledge analysis tool for crowdsourcing based on Weka. We also have a Python version of Crowdsourcing Learning: CrowdwiseKit on GitHub (https://github.com/tssai-lab/CrowdwiseKit).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    wav2letter++

    wav2letter++

    Facebook AI research's automatic speech recognition toolkit

    ...At least one of LZMA, BZip2, or Z is required for LM compression with KenLM. It is highly recommended to build KenLM with position-independent code (-fPIC) enabled, to enable python compatibility. After installing, run export KENLM_ROOT_DIR=... so that wav2letter++ can find it. This is needed because KenLM doesn't support a make install step.wav2letter++ expects audio and transcription data to be prepared in a specific format so that they can be read from the pipelines. Each dataset (test/valid/train) needs to be in a separate file with one sample per line. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Data Science at the Command Line

    Data Science at the Command Line

    Data science at the command line

    ...To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools, useful whether you work with Windows, macOS, or Linux. You’ll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you’re comfortable processing data with Python or R, you’ll learn how to greatly improve your data science workflow by leveraging the command line’s power.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    This project is intended to provide code to be used with MySQL and Python to create a database of major league baseball game events which are freely provided by the mlb.com Gameday application. Older version also support creating a retrosheet.org database but that is no longer supported. All major and minor league pitch location and game statistic data can be downloaded using BBOS. Installation Videos!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    NiftyNet

    NiftyNet

    An open-source convolutional neural networks platform for research

    ...Adapt existing networks to your imaging data. Quickly build new solutions to your own image analysis problems. NiftyNet currently supports medical image segmentation and generative adversarial networks. NiftyNet is not intended for clinical use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Pydicom by examples

    Pydicom by examples

    Basic and intermediate examples of DICOM library with Jupyter

    Basic and intermediate examples to read, modify and write DICOM files with Python code using Jupyter - To install Jupyter - https://jupyter.org/install ====== All examples are based on Pydicom. An open source library - https://pydicom.github.io/
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB