Showing 283 open source projects for "python data analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    Pythonidae

    Pythonidae

    Curated decibans of scientific programming resources in Python

    ...While the primary focus is on Python, some entries also highlight resources implemented in other languages like Julia, R, Go, and Java. The project emphasizes open contribution, allowing the community to continuously expand and refine the index. By gathering these resources in one place, Pythonidae acts as a central hub for scientific and data-driven programming with Python.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    TensorFlow Examples

    TensorFlow Examples

    TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

    TensorFlow Examples is a comprehensive repository of example implementations, tutorials, and reference code intended to help newcomers and intermediate learners dive into TensorFlow quickly. It contains both Jupyter notebooks and raw source code, covering a broad range of tasks: from basic machine-learning and neural-network models to more advanced use cases, using both TensorFlow v1 and v2 APIs. For clarity and educational value, each example is accompanied by explanatory comments or...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    PyTorchVideo

    PyTorchVideo

    A deep learning library for video understanding research

    ...The library includes efficient implementations of state-of-the-art architectures such as SlowFast, X3D, and MViT, optimized for both research prototyping and production inference. It supports video I/O pipelines, data augmentation, distributed training, and mixed precision computation for large-scale experiments. PyTorchVideo also connects seamlessly with other Meta AI tools such as Detectron2 and PyTorch3D for multimodal video analysis. Designed to accelerate research and deployment, it serves as a unified framework for reproducible, high-performance video AI development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Pandas TA

    Pandas TA

    Python 3 Pandas Extension with 130+ Indicators

    Technical Analysis Indicators - Pandas TA is an easy-to-use Python 3 Pandas Extension with 130+ Indicators. Pandas Technical Analysis (Pandas TA) is an easy-to-use library that leverages the Pandas package with more than 130 Indicators and Utility functions and more than 60 TA Lib Candlestick Patterns. Many commonly used indicators are included, such as: Candle Pattern(cdl_pattern), Simple Moving Average (sma) Moving Average Convergence Divergence (macd), Hull Exponential Moving Average (hma), Bollinger Bands (bbands), On-Balance Volume (obv), Aroon & Aroon Oscillator (aroon), Squeeze (squeeze) and many more.
    Downloads: 388 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    libpython-clj

    libpython-clj

    Python bindings for Clojure

    libpython-clj is a deep interop library enabling you to load and use Python modules from within Clojure as if they were native namespaces—and even extend Python objects from Clojure. It bridges to the Python C API, preserving REPL‑based workflows. Bridge between JVM objects and Python objects easily; use Python in your Java and use some Java in your Python. Python objects are linked to the JVM GC such that when they are no longer reachable from the JVM, their references are released....
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    ReinventCommunity

    ReinventCommunity

    Jupyter Notebook tutorials for REINVENT 3.2

    This repository is a collection of useful jupyter notebooks, code snippets and example JSON files illustrating the use of Reinvent 3.2.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Arraymancer

    Arraymancer

    A fast, ergonomic and portable tensor library in Nim

    Arraymancer is a tensor and deep learning library for the Nim programming language, designed for high-performance numerical computations and machine learning applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    libpostal

    libpostal

    A C library for parsing/normalizing street addresses around the world

    A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data. libpostal is a C library for parsing/normalizing street addresses around the world using statistical NLP and open data. The goal of this project is to understand location-based strings in every language, everywhere. Addresses and the locations they represent are essential for any application dealing with maps (place search, transportation, on-demand/delivery services,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    BMC

    BMC

    Notes on Scientific Computing for Biomechanics

    This repository is a collection of lecture notes and code on scientific computing and data analysis for Biomechanics and Motor Control.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 10
    earthengine-py-notebooks

    earthengine-py-notebooks

    A collection of 360+ Jupyter Python notebook examples

    earthengine-py-notebooks is a comprehensive collection of hundreds of Jupyter Python notebooks that serve as examples and tutorials for using the Google Earth Engine Python API. These notebooks are organized into thematic areas such as image processing, machine learning, visualization, filtering, and asset management, exposing users to real geospatial analysis tasks. The repository makes it easier to explore Earth Engine’s large geospatial data catalog, interactively display map layers, and generate visual insights without the need for external GIS software by leveraging interactive widgets and mapping libraries. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Big List of Naughty Strings

    Big List of Naughty Strings

    List of strings which have a high probability of causing issues

    The Big List of Naughty Strings is a community-maintained catalog of “gotcha” inputs that commonly break software, from unusual Unicode to SQL and script injection payloads. It exists so developers and QA engineers can easily test edge cases that normal test data would miss, such as zero-width characters, right-to-left marks, emojis, foreign alphabets, and long or malformed strings. By throwing these strings at forms, APIs, databases, and UIs, teams can discover encoding bugs, sanitizer...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    CodeQL

    CodeQL

    Libraries and queries that power security researchers

    CodeQL is a semantic code analysis engine that treats programs as queryable databases, enabling users to write expressive queries that identify security vulnerabilities, logic bugs, and code quality issues across large codebases. Instead of just pattern matching text, CodeQL ingests source code, builds rich representations of structure and data flow, and allows queries that reason about control flow, type systems, and interprocedural relationships.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Brain AutoML

    Brain AutoML

    Google Brain AutoML

    This repository contains a list of AutoML related models and libraries. Google Brain researchers have introduced a new way of programming automated machine learning (AutoML) based on symbolic programming. The researchers also proposed PyGlove, a general symbolic programming library for Python, to implement the symbolic formulation of AutoML. AutoML, designed to fill the machine learning industry’s talent gap, is gaining traction among various organizations.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    BeaEngine 5

    BeaEngine 5

    BeaEngine disasm project

    BeaEngine is a C library designed to decode instructions from 16-bit, 32-bit and 64-bit intel architectures. It includes standard instructions set and instructions set from FPU, MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, VMX, CLMUL, AES, MPX, AVX, AVX2, AVX512 (VEX & EVEX prefixes), CET, BMI1, BMI2, SGX, UINTR, KL, TDX and AMX extensions. If you want to analyze malicious codes and more generally obfuscated codes, BeaEngine sends back a complex structure that describes precisely the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    CNN for Image Retrieval
    cnn-for-image-retrieval is a research-oriented project that demonstrates the use of convolutional neural networks (CNNs) for image retrieval tasks. The repository provides implementations of CNN-based methods to extract feature representations from images and use them for similarity-based retrieval. It focuses on applying deep learning techniques to improve upon traditional handcrafted descriptors by learning features directly from data. The code includes training and evaluation scripts that...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    fastNLP

    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    fastNLP is a lightweight framework for natural language processing (NLP), the goal is to quickly implement NLP tasks and build complex models. A unified Tabular data container simplifies the data preprocessing process. Built-in Loader and Pipe for multiple datasets, eliminating the need for preprocessing code. Various convenient NLP tools, such as Embedding loading (including ELMo and BERT), intermediate data cache, etc.. Provide a variety of neural network components and recurrence models (covering tasks such as Chinese word segmentation, named entity recognition, syntactic analysis, text classification, text matching, metaphor resolution, summarization, etc.). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    NLP Architect

    NLP Architect

    A model library for exploring state-of-the-art deep learning

    NLP Architect is an open-source Python library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing and Natural Language Understanding neural networks. The library includes our past and ongoing NLP research and development efforts as part of Intel AI Lab. NLP Architect is designed to be flexible for adding new models, neural network components, data handling methods, and for easy training and running models. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    gradslam

    gradslam

    gradslam is an open source differentiable dense SLAM library

    gradslam is an open-source framework providing differentiable building blocks for simultaneous localization and mapping (SLAM) systems. We enable the usage of dense SLAM subsystems from the comfort of PyTorch. The question of “representation” is central in the context of dense simultaneous localization and mapping (SLAM). Newer learning-based approaches have the potential to leverage data or task performance to directly inform the choice of representation. However, learning representations...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Zipline

    Zipline

    Zipline, a Pythonic algorithmic trading library

    ...Zipline is currently used in production as the backtesting and live-trading engine powering Quantopian -- a free, community-centered, hosted platform for building and executing trading strategies. Quantopian also offers a fully managed service for professionals that includes Zipline, Alphalens, Pyfolio, FactSet data, and more. Installing Zipline is slightly more involved than the average Python package. For a development installation (used to develop Zipline itself), create and activate a virtualenv, then run the etc/dev-install script. Please note that Zipline is not a community-led project. Zipline is maintained by the Quantopian engineering team, and we are quite small and often busy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    StructPie

    StructPie

    A set of C libraries to implement data structures and algorithms

    ...In the "hash_table" directory, the hash table implementation uses linked lists. While in "HashBSTree" directory, a hash table with a binary tree in each index is implemented for faster lookup in large data. The stack, the tree and the hash table accept int, float and char* data type To look at the python library : https://github.com/mnoorfawi/struct-pie
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    ebfformat

    ebfformat

    An Efficient Binary data Format

    ...It is also designed to simplify the programming of input output routines in different programming languages. In a nutshell an EBF file is a collection of data objects. Each data object is specified by a unique name and a single file can have multiple data objects. Each data object is preceded by a meta-data or header which describes the binary data associated with it. Among other things, this header allows the files to be portable across systems with different endianess.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Forecasting Best Practices

    Forecasting Best Practices

    Time Series Forecasting Best Practices & Examples

    ...Rather than creating implementations from scratch, we draw from existing state-of-the-art libraries and build additional utilities around processing and featuring the data, optimizing and evaluating models, and scaling up to the cloud. The examples and best practices are provided as Python Jupyter notebooks and R markdown files and a library of utility functions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    PyCBC

    PyCBC

    Learn how to use PyCBC to analyze gravitational-wave data

    PyCBC is a software developed by a collaboration of LIGO, Virgo, and independent scientists. It is open source and freely available. We use PyCBC in the detection of gravitational waves from binary mergers such as GW150914. These examples explore how to analyze gravitational wave data, how we find potential signals and learn about them. Many of these tutorials will require you to make edits to config files as part of their exercises. At the moment this isn't easy to do on services like...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    PerfKit Benchmarker

    PerfKit Benchmarker

    PerfKit Benchmarker (PKB) contains a set of benchmarks

    PerfKitBenchmarker is an open-source benchmarking framework designed to measure and compare the performance of cloud infrastructure across multiple providers in a consistent and reproducible way. It allows users to evaluate metrics such as latency, throughput, provisioning time, and system performance using a standardized set of benchmarks. The tool supports a wide range of environments, including major cloud platforms, Kubernetes clusters, and even local hardware, making it highly versatile...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Brand new cheatsheets and handouts

    Brand new cheatsheets and handouts

    Matplotlib 3.1 cheat sheet

    The Brand new cheatsheets and handouts repo is a compact, quick-reference summary of the most commonly used plotting commands and configurations in Matplotlib, intended to serve as a handy reference for experienced users who want to recall syntax or find the right function without digging into full documentation. It lays out common use cases (plot types, styling, figure configuration, saving/exporting, subplot layout, etc.) in a concise and organized format — often serving as a “cheat sheet”...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB