Showing 1214 open source projects for "python data analysis"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    iX

    iX

    Autonomous GPT-4 agent platform

    IX is a platform for designing and deploying autonomous and [semi]-autonomous LLM-powered agents and workflows. IX provides a flexible and scalable solution for delegating tasks to AI-powered agents. Agents created with the platform can automate a wide variety of tasks while running in parallel and communicating with each other.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 2
    Prompt Engineering Interactive Tutorial

    Prompt Engineering Interactive Tutorial

    Anthropic's Interactive Prompt Engineering Tutorial

    Prompt-eng-interactive-tutorial is a comprehensive, hands-on tutorial that teaches the craft of prompt engineering with Claude through guided, executable lessons. It starts with the anatomy of a good prompt and moves into techniques that deliver the “80/20” gains—separating instructions from data, specifying schemas, and setting evaluation criteria. The course leans heavily on realistic failure modes (ambiguity, hallucination, brittle instructions) and shows how to iteratively debug prompts...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    TensorFlow Privacy

    TensorFlow Privacy

    Library for training machine learning models with privacy for data

    Library for training machine learning models with privacy for training data. This repository contains the source code for TensorFlow Privacy, a Python library that includes implementations of TensorFlow optimizers for training machine learning models with differential privacy. The library comes with tutorials and analysis tools for computing the privacy guarantees provided.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    TextAttack

    TextAttack

    Python framework for adversarial attacks, and data augmentation

    Generating adversarial examples for NLP models. TextAttack is a Python framework for adversarial attacks, data augmentation, and model training in NLP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    Machine Learning Zoomcamp

    Machine Learning Zoomcamp

    Learn ML engineering for free in 4 months

    Machine Learning Zoomcamp is an open-source repository containing the materials for a comprehensive course that teaches machine learning engineering from fundamentals to deployment. The project is designed to guide learners through the complete lifecycle of developing machine learning systems, starting with data preparation and model training and ending with production deployment. Participants learn how to build regression and classification models using Python libraries such as NumPy, Pandas, and Scikit-learn. The course also introduces more advanced topics including decision trees, ensemble methods, and neural networks. Later modules focus on practical engineering topics such as containerization with Docker, API development with FastAPI, and scaling machine learning services using Kubernetes and cloud platforms. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    handson-ml

    handson-ml

    Teaching you the fundamentals of Machine Learning in python

    handson-ml hosts the notebooks for the first edition of the same hands-on ML book, reflecting the tooling and idioms of its time while teaching durable concepts. It walks through supervised and unsupervised learning with scikit-learn, then introduces deep learning using the earlier TensorFlow 1 graph-execution style. The examples underscore fundamentals like bias-variance trade-offs, regularization, and proper validation, grounding learners before they move to deep nets. Even though the deep...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    local-llm

    local-llm

    Run LLMs locally on Cloud Workstations

    local-llm is a development framework that enables developers to run large language models locally within Google Cloud Workstations or standard environments without requiring GPU hardware. It focuses on making generative AI development more accessible by leveraging quantized models and CPU-based execution, eliminating the dependency on expensive GPU infrastructure. The repository includes tools, Docker configurations, and command-line utilities that simplify the process of downloading,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    handson-ml3

    handson-ml3

    Fundamentals of Machine Learning and Deep Learning

    handson-ml3 contains the Jupyter notebooks and code for the third edition of the book Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlow. It guides readers through modern machine learning and deep learning workflows using Python, with examples spanning data preparation, supervised and unsupervised learning, deep neural networks, RL, and production-ready model deployment. The third edition updates the content for TensorFlow 2 and Keras, introduces new chapters (for example on reinforcement learning or generative models), and offers best-practice code that reflects current ecosystems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    FlashMLA

    FlashMLA

    FlashMLA: Efficient Multi-head Latent Attention Kernels

    FlashMLA is a high-performance decoding kernel library designed especially for Multi-Head Latent Attention (MLA) workloads, targeting NVIDIA Hopper GPU architectures. It provides optimized kernels for MLA decoding, including support for variable-length sequences, helping reduce latency and increase throughput in model inference systems using that attention style. The library supports both BF16 and FP16 data types, and includes a paged KV cache implementation with a block size of 64 to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Add Two Lines of Code. Get Full APM. Icon
    Add Two Lines of Code. Get Full APM.

    AppSignal installs in minutes and auto-configures dashboards, alerts, and error tracking.

    Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.
    Start Free
  • 10
    snorkel

    snorkel

    A system for quickly generating training data with weak supervision

    The Snorkel team is now focusing their efforts on Snorkel Flow, an end-to-end AI application development platform based on the core ideas behind Snorkel. The Snorkel project started at Stanford in 2016 with a simple technical bet: that it would increasingly be the training data, not the models, algorithms, or infrastructure, that decided whether a machine learning project succeeded or failed. Given this premise, we set out to explore the radical idea that you could bring mathematical and...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 11
    sketch

    sketch

    AI code-writing assistant that understands data content

    ...The tool integrates directly into pandas dataframes through an extension, making it easy to use within existing Python workflows without requiring additional IDE plugins. Sketch supports a variety of tasks including data cleaning, feature engineering, visualization, and exploratory analysis, all driven by simple natural language prompts. It also includes advanced capabilities for generating structured outputs and applying transformations directly to datasets, reducing the need for manual coding.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Datapipe

    Datapipe

    Real-time, incremental ETL library for ML with record-level depend

    Datapipe is a real-time, incremental ETL library for Python with record-level dependency tracking. Datapipe is designed to streamline the creation of data processing pipelines. It excels in scenarios where data is continuously changing, requiring pipelines to adapt and process only the modified data efficiently. This library tracks dependencies for each record in the pipeline, ensuring minimal and efficient data processing.
    Downloads: 128 This Week
    Last Update:
    See Project
  • 13
    scikit-learn-videos

    scikit-learn-videos

    Jupyter notebooks from the scikit-learn video series

    ...Each video corresponds to a notebook that walks through the code step by step, allowing students to see both the theoretical explanation and its practical implementation. The project emphasizes accessibility and beginner-friendly explanations, making it suitable for learners who are new to data science or machine learning programming. The tutorials collectively span several hours of instructional content and demonstrate how to build predictive models using Python tools commonly used in the data science ecosystem.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    PyDenseCRF

    PyDenseCRF

    Python wrapper to Philipp Krähenbühl's dense (fully connected) CRFs

    PyDenseCRF is a Python library that provides a wrapper around the implementation of fully connected Conditional Random Fields (CRFs) developed by Philipp Krähenbühl and Vladlen Koltun. The project allows developers and researchers to integrate Dense CRF inference into Python-based machine learning pipelines, particularly for computer vision tasks such as image segmentation and labeling. Conditional Random Fields are probabilistic graphical models used to model contextual relationships...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Improved Diffusion

    Improved Diffusion

    Release for Improved Denoising Diffusion Probabilistic Models

    improved-diffusion is an open source implementation of diffusion probabilistic models created by OpenAI. These models, also known as score-based generative models, are a class of generative models that have shown strong performance in producing high-quality synthetic data such as images. The repository provides code for training and sampling diffusion models with improved techniques that enhance stability, efficiency, and output fidelity. It includes scripts for setting up training runs,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    LIDA

    LIDA

    Automatic Generation of Visualizations and Infographics using LLMs

    LIDA is an open-source library developed to automate the process of creating data visualizations and infographics using large language models. The system treats visualizations as executable code and uses AI to generate, modify, and interpret that code in order to transform raw datasets into meaningful charts and graphical explanations. Instead of requiring users to manually explore datasets and write plotting scripts, LIDA analyzes the data and automatically proposes visualization goals and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Obsei

    Obsei

    Obsei is a low code AI powered automation tool

    Obsei is an automated no-code/low-code AI-powered text observation and analysis framework, designed for extracting insights from unstructured text data such as social media, reviews, and logs.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    SMILI

    SMILI

    Scientific Visualisation Made Easy

    The Simple Medical Imaging Library Interface (SMILI), pronounced 'smilie', is an open-source, light-weight and easy-to-use medical imaging viewer and library for all major operating systems. The main sMILX application features for viewing n-D images, vector images, DICOMs, anonymizing, shape analysis and models/surfaces with easy drag and drop functions. It also features a number of standard processing algorithms for smoothing, thresholding, masking etc. images and models, both with...
    Leader badge
    Downloads: 58 This Week
    Last Update:
    See Project
  • 19
    Automated Interpretability

    Automated Interpretability

    Code for Language models can explain neurons in language models paper

    The automated-interpretability repository implements tools and pipelines for automatically generating, simulating, and scoring explanations of neuron (or latent feature) behavior in neural networks. Instead of relying purely on manual, ad hoc interpretability probing, this repo aims to scale interpretability by using algorithmic methods that produce candidate explanations and assess their quality. It includes a “neuron explainer” component that, given a target neuron or latent feature,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    SPPAS

    SPPAS

    SPPAS - the automatic annotation and analyses of speech

    ...Available for free, with open source code, there is simply no other package for linguists to simple use in the automatic annotations of speech, the analyses of any kind of annotated data and the conversion of annotated files. SPPAS is able to produce automatically speech annotations from a recorded speech sound and its orthographic transcription. SPPAS is helpful for the analysis of any annotated data: estimate statistical distributions, make requests, manage files, visualize annotations. SPPAS offers a file converter from/to a wide range of formats: xra, TextGrid, eaf, trs... ...
    Downloads: 30 This Week
    Last Update:
    See Project
  • 21
    Eventer

    Eventer

    Rapid, unbiased, reproducible analysis of synaptic events

    ...The software combines deconvolution for detection, and variable length template matching approaches for screening out false positive events. Eventer also includes a machine learning-based approach allowing users to train a model to implement their ‘expert’ selection criteria across data sets without bias. Sharing models allows users to implement consistent analysis procedures. The software is coded in MATLAB, but has been compiled as standalone applications for Windows, Mac and Linux. Please visit the official Eventer website for more info https://eventerneuro.netlify.app/ While the paper is in preparation, please cite as; Winchester, G., Liu, S., Steele, O.G., Aziz, W. and Penn, A.C. (2020) Eventer. ...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 22
    DeepSeek AIO

    DeepSeek AIO

    Access and use all DeepSeek AI models in one program.

    DeepSeek AIO is a simple program that allows you to interact with all DeepSeek large language models in one place. It supports text-based chats, data analysis, code generation, language translation, and more. The program is designed to make it easy for users to use DeepSeek's AI tools for different purposes without switching between multiple platforms.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 23
    Amphion

    Amphion

    Toolkit for audio, music, and speech generation

    Amphion is a toolkit from OpenMMLab dedicated to audio, music, and speech generation, aimed at both reproducible research and helping newcomers get started in generative audio. It provides standardized implementations and recipes for classic and state-of-the-art generative models in audio, including TTS, music generation, and voice conversion. A distinctive feature of Amphion is its emphasis on visualization: it offers interactive visualizations of model architectures and generation...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    Arthur Bench

    Arthur Bench

    Bench is a tool for evaluating LLMs for production use cases

    Bench is a tool for evaluating LLMs for production use cases. Whether you are comparing different LLMs, considering different prompts, or testing generation hyperparameters like temperature and # tokens, Bench provides one touch point for all your LLM performance evaluation.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25
    Computer vision projects

    Computer vision projects

    computer vision projects | Fun AI projects related to computer vision

    Computer vision projects is an open-source collection of computer vision projects and experiments that demonstrate practical applications of modern AI techniques in image processing, robotics, and real-time visual analysis. The repository includes multiple demonstration systems implemented using languages such as Python and C++, covering topics ranging from object detection to embedded vision systems. Many of the projects illustrate how computer vision algorithms can interact with hardware platforms, including robotics systems and edge computing devices. ...
    Downloads: 2 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB