Showing 1158 open source projects for "python data analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    X's Recommendation Algorithm

    X's Recommendation Algorithm

    Source code for the X Recommendation Algorithm

    The Algorithm is Twitter’s open source release of the core ranking system that powers the platform’s home timeline. It provides transparency into how tweets are selected, prioritized, and surfaced to users, reflecting Twitter’s move toward openness in recommendation algorithms. The repository contains the recommendation pipeline, which incorporates signals such as engagement, relevance, and content features, and demonstrates how they combine to form ranked outputs. Written primarily in...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    aws-cli

    aws-cli

    Universal Command Line Interface for Amazon Web Services

    The AWS CLI is the universal command-line interface for managing AWS services, automating tasks, and scripting cloud workflows. It exposes nearly every public API from EC2 and S3 to IAM, Lambda, and beyond, providing parity with the service SDKs in a tool you can run anywhere. Profiles, regions, single-sign-on, and credential helpers make it straightforward to switch contexts securely across accounts and environments. Its output controls and JMESPath querying let you slice, filter, and...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    GNU Emacs / N Λ N O

    GNU Emacs / N Λ N O

    Emacs made simple

    ...This makes it appealing for users who like the power of Emacs (or similar advanced editors) but want a fast, responsive, and easy-to-maintain editor. Nano-Emacs can be especially useful for scripting, quick edits, data-analysis workflows, or writing in scientific/research contexts, especially when you want something more powerful than a simple text editor but lighter than a full IDE.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    DocsGPT

    DocsGPT

    Private AI platform for agents, enterprise search and RAG pipelines

    DocsGPT is an open-source AI platform for deploying private RAG pipelines, AI agents, and enterprise search on your own infrastructure. Connect any data source (PDFs, DOCX, CSV, Excel, HTML, audio, GitHub, databases, URLs) and get accurate, hallucination-free answers with source citations. Choose your LLM: OpenAI, Anthropic, Google Gemini, or local models. Works with Qdrant, MongoDB, and Elasticsearch and more. Deploy via Docker or Kubernetes with full data sovereignty. Build...
    Downloads: 3 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    Anomalib

    Anomalib

    An anomaly detection library comprising state-of-the-art algorithms

    Anomalib is an open-source deep learning library focused on anomaly detection and localization tasks, collecting state-of-the-art algorithms and tools under one modular framework. It provides implementations of leading anomaly detection methods drawn from current research, as well as a full set of utilities for training, evaluating, benchmarking, and deploying these models on both public and private datasets. Anomalib emphasizes flexibility and reproducibility: you can use its simple APIs to...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    fmt

    fmt

    Formatting library as an alternative to C stdio and C++ iostreams

    {fmt} is an open-source formatting library providing a fast and safe alternative to C stdio and C++ iostreams. The format API is similar in spirit to the C printf family of function but is safer, simpler and several times faster than common standard library implementations. The format string syntax is similar to the one used by str.format in Python. The library is fully type safe, automatic memory management prevents buffer overflow, errors in format strings are reported using exceptions or...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 7
    Prompt Declaration Language

    Prompt Declaration Language

    Prompt Declaration Language is a declarative prompt programming lang

    LLMs will continue to change the way we build software systems. They are not only useful as coding assistants, providing snipets of code, explanations, and code transformations, but they can also help replace components that could only previously be achieved with rule-based systems. Whether LLMs are used as coding assistants or software components, reliability remains an important concern. LLMs have a textual interface and the structure of useful prompts is not captured formally. Programming...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    Papis

    Papis

    Powerful and highly extensible command-line based document

    Papis is a powerful and highly extensible CLI document and bibliography manager. With Papis, you can search your library for books and papers, add documents and notes, import and export to and from other formats, and much much more. Papis uses a human-readable and easily hackable .yaml file to store each entry's bibliographical data. It strives to be easy to use while providing a wide range of features. And for those who still want more, Papis makes it easy to write scripts that extend its...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    TurboGears

    TurboGears

    Python web framework with full-stack layer

    TurboGears is a hybrid web framework able to act both as a Full Stack framework or as a Microframework. TurboGears helps you get going fast and gets out of your way when you want it! TurboGears can be used both as a full stack framework or as a microframework in single-file mode. TurboGears 2 is built on top of the experience of several next-generation web frameworks including TurboGears 1 (of course), Django, and Rails. All of these frameworks had limitations that frustrated us, and TG2 was...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 10
    kapture

    kapture

    Tools for manipulating datasets

    Kapture is a pivot file format, based on text and binary files, used to describe SfM (Structure From Motion) and more generally sensor-acquired data.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    ccapi

    ccapi

    A header-only C++ library for interacting with crypto exchanges

    A header-only C++ library for streaming market data and executing trades directly from cryptocurrency exchanges (i.e. the connections are between your server and the exchange server without anything in-between).
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    Kornia

    Kornia

    Open Source Differentiable Computer Vision Library

    Kornia is a differentiable computer vision library for PyTorch. It consists of a set of routines and differentiable modules to solve generic computer vision problems. At its core, the package uses PyTorch as its main backend both for efficiency and to take advantage of the reverse-mode auto-differentiation to define and compute the gradient of complex functions. Inspired by existing packages, this library is composed by a subset of packages containing operators that can be inserted within...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 13
    CodeGuide

    CodeGuide

    This repository is a collection of the author, Xiao Fuge

    CodeGuide is a curated collection of Java development resources built from years of real-world engineering experience by the author, who has worked extensively in large-scale internet companies. The project serves as a structured and comprehensive learning path for developers who want to strengthen their understanding of Java fundamentals and core programming practices. It includes detailed explanations of design patterns, source code analysis, frameworks, algorithms, and real-world project...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Chronicle Queue

    Chronicle Queue

    Micro second messaging that stores everything to disk

    ...With millions of users worldwide, Chronicle Queue is a leading choice for organizations of all sizes seeking a low-latency, high-performance messaging framework. The Chronicle Queue Enterprise version eliminates compatibility issues and allows for the seamless integration and sharing of data between Java, Rust, Python, and C++ libraries. Chronicle Queue is available as a fully native Java, Rust or C++ library, and also supports a Python API. All versions are fully binary compatible.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Jina

    Jina

    Build cross-modal and multimodal applications on the cloud

    Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Synapse Machine Learning

    Synapse Machine Learning

    Simple and distributed Machine Learning

    SynapseML (previously MMLSpark) is an open source library to simplify the creation of scalable machine learning pipelines. SynapseML builds on Apache Spark and SparkML to enable new kinds of machine learning, analytics, and model deployment workflows. SynapseML adds many deep learning and data science tools to the Spark ecosystem, including seamless integration of Spark Machine Learning pipelines with the Open Neural Network Exchange (ONNX), LightGBM, The Cognitive Services, Vowpal Wabbit,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Lightweight' GAN

    Lightweight' GAN

    Implementation of 'lightweight' GAN, proposed in ICLR 2021

    Implementation of 'lightweight' GAN proposed in ICLR 2021, in Pytorch. The main contribution of the paper is a skip-layer excitation in the generator, paired with autoencoding self-supervised learning in the discriminator. Quoting the one-line summary "converge on single gpu with few hours' training, on 1024 resolution sub-hundred images". Augmentation is essential for Lightweight GAN to work effectively in a low data setting. You can test and see how your images will be augmented before...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    XLS

    XLS

    XLS: Accelerated HW Synthesis

    XLS is an open-source toolkit for building high-level hardware with a modern compiler stack that spans from a functional DSL to optimized IR and hardware generation. At the front end, DSLX lets you describe algorithms with strong typing and familiar control flow while remaining synthesis-friendly. The compiler lowers DSLX into a rich intermediate representation, applies aggressive optimization and scheduling passes, and can either JIT the design for software simulation or emit Verilog for...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 19
    FairChem

    FairChem

    FAIR Chemistry's library of machine learning methods for chemistry

    FAIRChem is a unified library for machine learning in chemistry and materials, consolidating data, pretrained models, demos, and application code into a single, versioned toolkit. Version 2 modernizes the stack with a cleaner core package and breaking changes relative to V1, focusing on simpler installs and a stable API surface for production and research. The centerpiece models (e.g., UMA variants) plug directly into the ASE ecosystem via a FAIRChem calculator, so users can run relaxations,...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 20
    PAL MCP

    PAL MCP

    The power of Claude Code / GeminiCLI / CodexCLI

    PAL MCP is an open-source Model Context Protocol (MCP) server designed to act as a powerful middleware layer that connects AI clients and tools—like Claude Code, Codex CLI, Cursor, and IDE plugins—to a broad range of underlying AI models, enabling collaborative multi-model workflows rather than relying on a single model. It lets developers orchestrate interactions across multiple models (including Gemini, OpenAI, Grok, Azure, Ollama, OpenRouter, and custom/self-hosted models), preserving...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    TensorStore

    TensorStore

    Library for reading and writing large multi-dimensional arrays

    ...The library is engineered for scalability—background caching, chunk sharding, and retryable operations keep throughput high even over unreliable networks. With language bindings, it fits into Python-heavy analysis pipelines while retaining a fast C++ core.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Tunix

    Tunix

    A JAX-native LLM Post-Training Library

    Tunix is a JAX-native library for post-training large language models, bringing supervised fine-tuning, reinforcement learning–based alignment, and knowledge distillation into one coherent toolkit. It embraces JAX’s strengths—functional programming, jit compilation, and effortless multi-device execution—so experiments scale from a single GPU to pods of TPUs with minimal code changes. The library is organized around modular pipelines for data loading, rollout, optimization, and evaluation,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Public APIs

    Public APIs

    A collective list of free APIs

    public-apis is a collaboratively maintained repository that provides an extensive, categorized list of publicly available APIs for developers. Curated by community contributors and the team at APILayer, it serves as a centralized resource for discovering APIs across a wide range of domains, including data, machine learning, weather, entertainment, and finance. The project aims to make API exploration and integration more accessible by offering a single, organized index of open and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Fast Binary Encoding (FBE)

    Fast Binary Encoding (FBE)

    Fast Binary Encoding is fast and universal serialization solution

    Fast Binary Encoding is ultra fast and universal serialization solution for C++, C#, Go, Java, JavaScript, Kotlin, Python, Ruby, and Swift. Fast Binary Encoding allows to description of any domain models, business objects, complex data structures, client/server requests & responses and generate native code for different programming languages and platforms.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    Django Bootstrap Modal Forms

    Django Bootstrap Modal Forms

    A Django plugin for creating AJAX driven forms in Bootstrap modal

    A Django plugin for creating AJAX-driven forms in Bootstrap modal. This repository includes Dockerfile and docker-compose.yml files so you can easily setup and start to experiment with django-bootstrap-modal-forms running inside of a container on your local machine. Any changes you make in bootstrap_modal_forms, examples and test folders are reflected in the container (see docker-compose.yml) and the data stored in the sqlite3 database are persistent even if you remove the stopped container....
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB