Showing 81 open source projects for "data analysis"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    all AI news

    all AI news

    A list of online news & info sources in the AI/ML/Data Science space

    all AI news is a curated repository that aggregates and organizes sources for AI-related news and information. It serves as a centralized collection of feeds, links, and resources that can be used to build news aggregation systems or stay updated on developments in artificial intelligence. The project is designed to be easily extendable, allowing users to add new sources or customize the dataset for their specific needs. It is particularly useful for developers building AI news platforms,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Librelancer

    Librelancer

    A re-implementation of Freelancer

    Librelancer is an open-source reimplementation of the engine behind Freelancer, the iconic 2003 space trading and combat simulation game by Microsoft. Written in C# and powered by OpenGL and modern game development practices, Librelancer faithfully replicates the original gameplay while providing a flexible codebase for modding, analysis, and future enhancements. It supports original assets and is designed to be both a preservation project and a foundation for new space simulation games...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    PyOpenCL

    PyOpenCL

    OpenCL integration for Python, plus shiny features

    ...PyOpenCL also includes convenient features for managing memory, compiling kernels, and interfacing with NumPy, making it a preferred choice in scientific computing, data analysis, and machine learning workflows that demand acceleration.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Peroxide

    Peroxide

    Rust numeric library with high performance and friendly syntax

    Rust numeric library contains linear algebra, numerical analysis, statistics and machine learning tools with R, MATLAB, Python-like macros. Peroxide uses a 1D data structure to represent matrices, making it straightforward to integrate with BLAS (Basic Linear Algebra Subprograms). This means that Peroxide can guarantee excellent performance for linear algebraic computations by leveraging the optimized routines provided by BLAS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    Logbook

    Logbook

    An extensible Java library for HTTP request and response logging

    Logbook is an extensible Java library to enable complete request and response logging for different client- and server-side technologies. It satisfies a special need by a) allowing web application developers to log any HTTP traffic that an application receives or sends b) in a way that makes it easy to persist and analyze it later. This can be useful for traditional log analysis, meeting audit requirements or investigating individual historic traffic issues. Logbook is ready to use out of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    LangExtract

    LangExtract

    A Python library for extracting structured information

    LangExtract is a Python library developed by Google that leverages large language models (LLMs) to extract structured information from unstructured text—such as clinical notes, research papers, or literary works—based on user-defined instructions. It is designed to transform free-form text into reliable, schema-constrained data while maintaining traceability back to the source material. Each extracted entity is precisely grounded in its original context, allowing visual inspection and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Tokenize.jl

    Tokenize.jl

    Tokenization for Julia source code

    Tokenize is a Julia package that serves a similar purpose and API as the tokenize module in Python but for Julia. This is to take a string or buffer containing Julia code, perform lexical analysis and return a stream of tokens.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Statistical Rethinking 2024

    Statistical Rethinking 2024

    This course teaches data analysis

    The 2024 repository is the most recent version of the course, reflecting ongoing refinements in pedagogy, statistical modeling techniques, and coding practices. It provides updated notebooks, R scripts, and model examples, some streamlined and restructured compared to previous years. The 2024 repo also highlights the transition toward more robust Stan models and integration with newer Bayesian workflow practices, continuing to emphasize accessibility for learners while modernizing the tools....
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    Statistical Rethinking 2023

    Statistical Rethinking 2023

    Statistical Rethinking Course for Jan-Mar 2023

    ...It continues to provide scripts for lectures and tutorials, while integrating refinements to examples, notation, and computational workflows introduced that year. Compared with 2022, some models are rewritten for clarity, and teaching materials reflect refinements in McElreath’s evolving presentation of Bayesian data analysis. Students following the 2023 lecture videos use this repository as their coding reference. There are 10 weeks of instruction. Links to lecture recordings will appear in this table. Weekly problem sets are assigned on Fridays and due the next Friday, when we discuss the solutions in the weekly online meeting.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    CARTOframes

    CARTOframes

    CARTO Python package for data scientists

    A Python package for integrating CARTO maps, analysis, and data services into data science workflows. Python data analysis workflows often rely on the de facto standards pandas and Jupyter notebooks. Integrating CARTO into this workflow saves data scientists time and energy by not having to export datasets as files or retain multiple copies of the data. Instead, CARTOframes give the ability to communicate reproducible analysis while providing the ability to gain from CARTO's services like hosted, dynamic or static maps and Data Observatory augmentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    CSI-Math-Notation-PostfixInfix

    CSI-Math-Notation-PostfixInfix

    Perl Lib Math Notation

    ... - The library provide: - Convert INFIX expressions to POSTFIX; - Convert POSTFIX expressions to INFIX and; - Perform POSTFIX context validations. - Context validation can be implemented in item selection routines or data context validation, when it is possible to identify data to be selected or ignored in some data analysis process. * NOTE: - Before any implementation, we recommend details in WIKI (https://sourceforge.net/p/csi-math-notation-postfixinfix/wiki/) or the CPAN Perl Modules, see: https://metacpan.org/pod/Math::Notation::PostfixInfix * Support: The Support Service is FREE. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    quantitative

    quantitative

    Quantized transactions python3

    ...The repo is evidently tied to a popular video series (on Bilibili) that reportedly drew substantial attention, suggesting the material is meant to be both educational and hands-on. The README and associated lessons walk the user through implementing algorithms, likely covering data handling, backtesting, and maybe simple trading logic. As an open-source educational resource, it’s designed for Python users interested in automatic trading, algorithmic strategies, and financial data analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Google IPs

    Google IPs

    Public IP address ranges associated with Google infrastructure

    Google-IPs aggregates public IP address ranges that are associated with Google’s infrastructure, collecting them in machine-readable formats useful for routing and firewall rules. The list is helpful for administrators who need to whitelist Google endpoints, analyze traffic, or tune proxies and CDN configurations. By centralizing ranges that are otherwise spread across announcements and registries, it saves time and reduces misconfiguration risk. The repository typically includes CIDR blocks...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Sandia Cyber Omni Tracker

    Sandia Cyber Omni Tracker

    Sandia Cyber Omni Tracker (SCOT)

    SCOT is an open-source cybersecurity incident response and threat intelligence management system developed by Sandia National Laboratories. It centralizes alert collection, facilitates team collaboration, and enriches data with automated analysis, enhancing the efficiency and effectiveness of security operations teams.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Julia.jl

    Julia.jl

    Curated decibans of Julia programming language

    Julia.jl is a curated collection of knowledge resources for the Julia programming language, designed to support high-performance numerical analysis and computational science. The repository aggregates diverse content across domains such as mathematics, physics, data science, optimization, machine learning, and supercomputing. It functions as a structured index, helping developers, researchers, and learners easily find materials to deepen their understanding of Julia’s ecosystem. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    CoolplaySpark

    CoolplaySpark

    Spark Cool Play: Spark source code analysis, Spark class library, etc.

    CoolplaySpark is a learning and practice repository designed to help users understand and work with Apache Spark. It serves as a companion resource for the book 深入理解Spark核心思想与源码分析 (In-Depth Understanding of Spark’s Core Concepts and Source Code Analysis). The project contains annotated examples, explanations, and exercises that guide learners through Spark’s architecture, execution model, and source code internals. It is particularly valuable for developers who want to strengthen their understanding of Spark by not only using it as a data processing engine but also exploring how its internals function. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    Unet

    Unet

    Source code for unet-pytorch, which can train its own model

    ...It is useful for developers and students who want a clear U-Net implementation for segmentation experiments, custom masks, and biomedical-style image analysis.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    waybackurls

    waybackurls

    Fetch all the URLs that the Wayback Machine knows about for a domain

    ...The tool is particularly valuable in security research, bug bounty hunting, and penetration testing, as it uncovers endpoints that may no longer be publicly linked but still exist on servers. By leveraging archived data, waybackurls helps identify hidden attack surfaces, legacy APIs, and forgotten resources that could be vulnerable. Its design is intentionally simple and efficient, focusing on delivering large volumes of URLs quickly with minimal configuration. The output can be combined with other tools for further analysis, such as filtering parameters or probing endpoints.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Statistical Rethinking 2022

    Statistical Rethinking 2022

    Statistical Rethinking course winter 2022

    This repository hosts the 2022 version of the Statistical Rethinking course. It contains course materials such as R scripts, notebooks, and worked examples aligned with McElreath’s textbook. The code emphasizes Bayesian data analysis using R, the rethinking package, and Stan models. It includes lecture code files, example datasets, and structured exercises that parallel the topics covered in the lectures (probability, regression, model comparison, Bayesian updating). The repo functions as a direct hands-on reference for students following the 2022 recorded lecture series. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Interpret-Text

    Interpret-Text

    State-of-the-art explainers for text-based machine learning models

    A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard. Interpret-Text builds on Interpret, an open source python package for training interpretable models and helping to explain blackbox machine learning systems. We have added extensions to support text models. Interpret-Text incorporates community-developed interpretability techniques for NLP models and a visualization dashboard to view the results....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    PyTorchVideo

    PyTorchVideo

    A deep learning library for video understanding research

    ...The library includes efficient implementations of state-of-the-art architectures such as SlowFast, X3D, and MViT, optimized for both research prototyping and production inference. It supports video I/O pipelines, data augmentation, distributed training, and mixed precision computation for large-scale experiments. PyTorchVideo also connects seamlessly with other Meta AI tools such as Detectron2 and PyTorch3D for multimodal video analysis. Designed to accelerate research and deployment, it serves as a unified framework for reproducible, high-performance video AI development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    BMC

    BMC

    Notes on Scientific Computing for Biomechanics

    This repository is a collection of lecture notes and code on scientific computing and data analysis for Biomechanics and Motor Control.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    earthengine-py-notebooks

    earthengine-py-notebooks

    A collection of 360+ Jupyter Python notebook examples

    ...These notebooks are organized into thematic areas such as image processing, machine learning, visualization, filtering, and asset management, exposing users to real geospatial analysis tasks. The repository makes it easier to explore Earth Engine’s large geospatial data catalog, interactively display map layers, and generate visual insights without the need for external GIS software by leveraging interactive widgets and mapping libraries. Many of the notebooks integrate with tools like folium, ipyleaflet, and geemap to bridge Earth Engine data with Python’s rich ecosystem for plotting and analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    CodeQL

    CodeQL

    Libraries and queries that power security researchers

    CodeQL is a semantic code analysis engine that treats programs as queryable databases, enabling users to write expressive queries that identify security vulnerabilities, logic bugs, and code quality issues across large codebases. Instead of just pattern matching text, CodeQL ingests source code, builds rich representations of structure and data flow, and allows queries that reason about control flow, type systems, and interprocedural relationships.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo