Showing 171 open source projects for "python data analysis"

  • 1
    AllenNLP

    An open-source NLP research library, built on PyTorch

    AllenNLP makes it easy to design and evaluate new deep learning models for nearly any NLP problem, along with the infrastructure to easily run them in the cloud or on your laptop. AllenNLP includes reference implementations of high quality models for both core NLP problems (e.g. semantic role labeling) and NLP applications (e.g. textual entailment). AllenNLP supports loading "plugins" dynamically. A plugin is just a Python package that provides custom registered classes or additional...
    Downloads: 0 This Week
  • 2
    DeepMind Educational Resources

    DeepMind's repo of educational notebooks for learning AI and research

    ...The repository provides hands-on, beginner-friendly resources that introduce essential AI concepts through Google Colab notebooks, combining intuitive explanations with executable code. The tutorials cover a broad range of topics—from foundational Python programming and data handling to supervised, unsupervised, and reinforcement learning, as well as graph neural networks and scientific reasoning. Specialized notebooks also explore creative AI applications, language modeling, generative models, and protein folding. Each tutorial is designed to be standalone and adaptable for self-study, classroom teaching, or use at summer schools and community workshops.
    Downloads: 3 This Week
  • 3
    Haircut_EQ

    Fetches the historical Haircut data (%) of Equities and Gold Bonds

    A haircut is the margin deducted when Equities / Mutual Funds / Bonds are pledged for trading. A haircut value of 13% means that if a share worth Rs. 100 is pledged for trading, Rs. 13 is deducted and Rs. 87 is given as collateral margin. It is based on the previous closing price. This program is used to identify the real value of a share on a given date. It fetches the historical Haircut data (%) and Price of an equity listed on NSE, India at a...
    Downloads: 0 This Week
  • 4
    ety

    A Python module to discover the etymology of words

    ety is a Python library and command-line tool designed to explore and retrieve the etymological origins of words by analyzing linguistic data and relationships between languages. It allows users to query a word and obtain its historical roots, including intermediate forms across different languages and time periods. The tool can generate recursive etymology chains as well as tree structures that visually represent how a word evolved over time.
    Downloads: 0 This Week
  • 5
    Brain Tokyo Workshop

    Experiments and code from Google Brain’s Tokyo research workshop

    The Brain Tokyo Workshop repository hosts a collection of research materials and experimental code developed by the Google Brain team based in Tokyo. It showcases a variety of cutting-edge projects in artificial intelligence, particularly in the areas of neuroevolution, reinforcement learning, and model interpretability. Each project explores innovative approaches to learning, prediction, and creativity in neural networks, often through unconventional or biologically inspired methods. The...
    Downloads: 0 This Week
  • 6
    Catalyst

    Accelerated deep learning R&D

    Catalyst is a PyTorch framework for accelerated Deep Learning research and development. It allows you to write compact but full-featured Deep Learning pipelines with just a few lines of code. With Catalyst you get a full set of features including a training loop with metrics, model checkpointing and more, all without the boilerplate. Catalyst is focused on reproducibility, rapid experimentation, and codebase reuse so you can break the cycle of writing another regular train loop and make...
    Downloads: 0 This Week
  • 7
    Binarytree

    Python library for studying Binary Trees

    Binarytree is a Python library that lets you generate, visualize, inspect and manipulate binary trees. Skip the tedious work of setting up test data, and dive straight into practicing algorithms. Heaps and BSTs (binary search trees) are also supported. Binarytree also supports an alternative representation that is more compact but lacks the indexing properties.
    Downloads: 0 This Week
  • 8
    Haircut_MF

    Fetches the historical Haircut data (%) of Mutual Funds and Bonds

    A haircut is the margin deducted when Equities / Mutual Funds / Bonds are pledged for trading. A haircut value of 13% means that if a share worth Rs. 100 is pledged for trading, Rs. 13 is deducted and Rs. 87 is given as collateral margin. It is based on the previous closing price (NAV). This program is used to identify the real value of an equity / share / mutual fund / Bonds / SGB on a given date. This program fetches the historical Haircut data (%)...
    Downloads: 0 This Week
  • 9
    Statistics for Data Scientists

    "Statistics for Data Scientists: 50 Essential Concepts"

    The “statistics-for-data-scientists” repository is a pedagogical resource designed to bridge rigorous statistics theory and practical data science workflows. The code and materials are intended to help data scientists and analysts grasp statistical principles (e.g. inference, regressions, hypothesis testing, probability, confidence intervals) in contexts relevant to real data analysis tasks.
    Downloads: 0 This Week
  • 10
    Linha do Texto is a web-based game for classifying semiotic text content from user input data, with 1 to 4 parameters predefined in each section. It can be used to discuss discrete and continuous semantic categories in scientific and educational fields.
    Downloads: 0 This Week
  • 11
    PersonGen

    A minor project in Python that uses the RandomUser API.

    A small Python program that uses the RandomUser API to generate random person data.
    Downloads: 0 This Week
  • 12
    Showtime

    A minor project made in Python using the OMDb API, with Tkinter for the frontend

    A minor project made in Python, with a Tkinter frontend, that fetches data about movies and TV series from an online database. It uses the OMDb REST API and pyImdb to show information about movies. GitHub: https://github.com/Cyborg117/Showtime
    Downloads: 0 This Week
  • 13
    EduData

    Datasets in education and a convenient interface for them

    Datasets in education and a convenient interface for downloading and preprocessing them. The CLI tools quickly convert the "raw" data of a dataset into "mature" data for the knowledge tracing task. The "mature" data is in json sequence format and can be modeled by XKT and TKT (TBA). The dataset analysis tool supports only the json sequence format and is used to check statistical indexes of the dataset. To better verify the effectiveness of a model, the dataset is usually divided into train/valid/test or split using the k-fold method. ...
    Downloads: 0 This Week
  • 14
    Stats With Julia Book

    Collection of runnable Julia code examples for a statistics book

    StatsWithJuliaBook is the companion code repository for the book Statistics with Julia: Fundamentals for Data Science, Machine Learning and Artificial Intelligence. It contains over 200 code blocks that correspond to the book’s ten chapters and three appendices, covering topics from probability theory and data summarization to regression analysis, hypothesis testing, and machine learning basics. The repository is designed for Julia users and provides ready-to-run examples that reinforce theoretical concepts with practical implementation. ...
    Downloads: 2 This Week
  • 15
    Ceka

    Crowd Environment and its Knowledge Analysis

    A knowledge analysis tool for crowdsourcing based on Weka. We also have a Python version of Crowdsourcing Learning: CrowdwiseKit on GitHub (https://github.com/tssai-lab/CrowdwiseKit).
    Downloads: 1 This Week
  • 16
    Gato (Graph Animation Toolbox): Animate graph algorithms, for example those for computing shortest paths, minimal spanning trees, maximum flows, or maximal cardinality or weight matchings. Create your own animations using the Animated Data Structures (ADS).
    Downloads: 1 This Week
  • 17
    Tensorflow 2017 Tutorials

    Tensorflow tutorial from basic to hard

    Tensorflow 2017 Tutorials is a structured set of tutorials that introduce developers to TensorFlow, starting with basic neural network constructs and progressing to sophisticated model architectures and training techniques. This repository covers essential building blocks like sessions (for older TF versions), placeholders, variables, activation functions, and optimizers, before guiding learners through building end-to-end models for regression, classification, and data pipelines. Beyond the...
    Downloads: 0 This Week
  • 18
    wav2letter++

    Facebook AI Research's automatic speech recognition toolkit

    ...At least one of LZMA, BZip2, or Z is required for LM compression with KenLM. It is highly recommended to build KenLM with position-independent code (-fPIC) enabled, to enable Python compatibility. After installing, run export KENLM_ROOT_DIR=... so that wav2letter++ can find it. This is needed because KenLM doesn't support a make install step. wav2letter++ expects audio and transcription data to be prepared in a specific format so that they can be read from the pipelines. Each dataset (test/valid/train) needs to be in a separate file with one sample per line. ...
    Downloads: 1 This Week
  • 19
    Quantitative-Notebooks

    Educational notebooks on quantitative finance, algorithmic trading

    Quantitative-Notebooks is a curated set of Jupyter notebooks focused on quantitative finance, algorithmic investing, and data-driven portfolio analysis. While each individual notebook is aimed at practical finance workflows, the overall repository helps practitioners and learners use Python, pandas, and numerical libraries to build, test, and evaluate financial strategies using historical market data. The notebooks typically showcase how to perform backtesting, factor analysis, risk assessment, and other quantitative workflows in a reproducible, exploratory format. ...
    Downloads: 0 This Week
  • 20
    palmerpenguins

    A great intro dataset for data exploration & visualization

    palmerpenguins is an R package offering real-world ecological data from the Palmer Archipelago penguin species—Adélie, Chinstrap, and Gentoo. Designed as a more engaging alternative to the classical iris dataset, it provides size measurements, clutch information, and blood isotope data for teaching, visualization, and analytics practice.
    Downloads: 1 This Week
  • 21
    interactive-coding-challenges

    120+ interactive Python coding interview challenges

    Interactive Coding Challenges is a collection of practice problems designed to strengthen data structures, algorithms, and problem-solving skills. The repository emphasizes a learn-by-doing approach: you read a prompt, attempt a solution, and verify behavior with tests, often within notebooks or scripts. Problems span arrays, strings, stacks, queues, linked lists, trees, graphs, dynamic programming, and more, mirroring common interview themes. Many challenges include hints and reference...
    Downloads: 0 This Week
  • 22
    Data Science at the Command Line

    Data science at the command line

    ...To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools, useful whether you work with Windows, macOS, or Linux. You’ll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you’re comfortable processing data with Python or R, you’ll learn how to greatly improve your data science workflow by leveraging the command line’s power.
    Downloads: 0 This Week
  • 23
    Python4Proteomics Course

    Python course for Proteomics analysis

    Python course (in Spanish) for Proteomics analysis, based mainly on Jupyter notebooks. For more information, see the readme.md file in the source code tree: https://sourceforge.net/p/lp-csic-uab/p4p/code/ci/default/tree/readme.md
    Downloads: 6 This Week
  • 24
    VStar

    Multi-platform Variable Star Visualisation and Analysis

    VStar is a multi-platform, easy-to-use variable star observation visualisation and analysis tool. Data can be read from a file or the AAVSO database, light curves and phase plots created, period analysis performed, and filters applied. Plugins can be developed, e.g. to make additional observation sources available. See also: 1. http://www.aavso.org/vstar-overview 2. http://www.aavso.org/forums/about-aavso/vstar 3. http://dbenn.wordpress.com/category/astronomy-science/vstar 4. http://www.citizensky.org/teams/vstar-software-development
    Downloads: 1 This Week
  • 25
    StepIntoChinese

    Chinese language tool

    Explore Chinese. The application uses a data structure with over 26,000 words/concepts and 8,300 characters.
    Downloads: 0 This Week
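
The haircut arithmetic described in the Haircut_EQ and Haircut_MF entries (a 13% haircut on Rs. 100 pledged leaves Rs. 87 of collateral margin) can be sketched in Python. This is only an illustration of the calculation, not code from either project; the function name `collateral_after_haircut` and its parameters are hypothetical.

```python
from decimal import Decimal


def collateral_after_haircut(pledged_value, haircut_percent):
    """Return (deduction, collateral_margin) for a pledged value and a haircut %.

    Hypothetical helper for illustration; not part of Haircut_EQ or Haircut_MF.
    """
    value = Decimal(str(pledged_value))
    percent = Decimal(str(haircut_percent))
    if not (Decimal("0") <= percent <= Decimal("100")):
        raise ValueError("haircut_percent must be between 0 and 100")
    # The haircut is the share of the pledged value that is deducted.
    deduction = value * percent / Decimal("100")
    # Whatever remains is available as collateral margin.
    return deduction, value - deduction


# The example from the listing: Rs. 100 pledged at a 13% haircut.
deduction, margin = collateral_after_haircut(100, 13)
print(f"Deducted: Rs. {deduction}, collateral margin: Rs. {margin}")
```

Decimal is used here instead of float so that percentage arithmetic on currency amounts does not pick up binary rounding error.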