Showing 135 open source projects for "ai data analyst"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    AugLy

    AugLy

    A data augmentations library for audio, image, text, and video

    AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Pythonidae

    Pythonidae

    Curated decibans of scientific programming resources in Python

    Pythonidae is a curated collection of scientific programming resources in Python, designed to support research and development across a wide range of disciplines. The repository organizes tools and libraries into domain-specific categories, including mathematics, statistics, machine learning, artificial intelligence, biology, chemistry, physics, earth sciences, and supercomputing. It also covers practical areas such as build automation, databases, APIs, computer graphics, and utilities,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    PyTorchVideo

    PyTorchVideo

    A deep learning library for video understanding research

    ...The library includes efficient implementations of state-of-the-art architectures such as SlowFast, X3D, and MViT, optimized for both research prototyping and production inference. It supports video I/O pipelines, data augmentation, distributed training, and mixed precision computation for large-scale experiments. PyTorchVideo also connects seamlessly with other Meta AI tools such as Detectron2 and PyTorch3D for multimodal video analysis. Designed to accelerate research and deployment, it serves as a unified framework for reproducible, high-performance video AI development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    ML for Trading

    ML for Trading

    Code for machine learning for algorithmic trading, 2nd edition

    On over 800 pages, this revised and expanded 2nd edition demonstrates how ML can add value to algorithmic trading through a broad range of applications. Organized in four parts and 24 chapters, it covers the end-to-end workflow from data sourcing and model development to strategy backtesting and evaluation. Covers key aspects of data sourcing, financial feature engineering, and portfolio management. The design and evaluation of long-short strategies based on a broad range of ML algorithms,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    DeepCode for Visual Studio Code

    DeepCode for Visual Studio Code

    DeepCode extension for Visual Studio Code

    DeepCode AI has always been the backbone of Snyk code, which is why it's the fastest, most accurate SAST on the market. DeepCode AI, powering the Snyk platform, utilizes multiple AI models, is trained on security-specific data, and is all curated by top security researchers to give you all the power of AI without any of the drawbacks. With 11 supported languages, and multiple AI models, Snyk's DeepCode AI was designed to find and fix vulnerabilities and manage tech debt. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    NLP Architect

    NLP Architect

    A model library for exploring state-of-the-art deep learning

    NLP Architect is an open-source Python library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing and Natural Language Understanding neural networks. The library includes our past and ongoing NLP research and development efforts as part of Intel AI Lab. NLP Architect is designed to be flexible for adding new models, neural network components, data handling methods, and for easy training and running models. NLP Architect is a model-oriented library designed to showcase novel and different neural network optimizations. The library contains NLP/NLU-related models per task, different neural network topologies (which are used in models), procedures for simplifying workflows in the library, pre-defined data processors and dataset loaders and misc utilities. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    HyperGAN

    HyperGAN

    Composable GAN framework with api and user interface

    A composable GAN built for developers, researchers, and artists. HyperGAN builds generative adversarial networks in PyTorch and makes them easy to train and share. HyperGAN is currently in pre-release and open beta. Everyone will have different goals when using hypergan. HyperGAN is currently beta. We are still searching for a default cross-data-set configuration. Each of the examples supports search. Automated search can help find good configurations. If you are unsure, you can start with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    RobotsDisallowed

    RobotsDisallowed

    A curated list of the most common and most interesting robots.txt

    RobotsDisallowed is a public catalog that tracks websites and organizations explicitly blocking AI and web-scraping crawlers in their robots.txt or related mechanisms. It focuses on documenting the growing trend of content owners asserting control over how their data is used for model training and automated harvesting. The project aggregates domains, notes the targeted bots or user agents, and surfaces patterns for researchers, policymakers, and tool builders.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    TGAN

    TGAN

    Generative adversarial training for generating synthetic tabular data

    We are happy to announce that our new model for synthetic data called CTGAN is open-sourced. The new model is simpler and gives better performance on many datasets. TGAN is a tabular data synthesizer. It can generate fully synthetic data from real data. Currently, TGAN can generate numerical columns and categorical columns. TGAN has been developed and runs on Python 3.5, 3.6 and 3.7. Also, although it is not strictly required, the usage of a virtualenv is highly recommended in order to avoid...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • 10
    Functional, Data Science Intro To Python

    Functional, Data Science Intro To Python

    [tutorial]A functional, Data Science focused introduction to Python

    The first section is an intentionally brief, functional, data science-centric introduction to Python. The assumption is a someone with zero experience in programming can follow this tutorial and learn Python with the smallest amount of information possible. The sections after that, involve varying levels of difficulty and cover topics as diverse as Machine Learning, Linear Optimization, build systems, command line tools, recommendation engines, Sentiment Analysis and Cloud Computing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Zhao

    Zhao

    A compilation of "The Princely Party Relationship Network"

    zhao is a repository that consolidates research, data, and insights related to Zhao, which is likely an individual’s research collection, notes, or curated resources on deep learning, AI, or computational topics (name and content context suggest specialized study). The project may include code examples, experiment results, references to academic papers, mathematical notes, and supporting scripts to explore specific ML methods, benchmarks, or theoretical findings.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Goldfish

    Goldfish

    A HashiCorp Vault UI written with VueJS and Vault native Go API

    ...Create a web application for admin or control your mobile apps with Drag&Drop visual builder. Create your data models with fields, everything else will be built in seconds by platform AI. Visually create business processes of any complexity and without hassle.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Edward

    Edward

    A probabilistic programming language in TensorFlow

    A library for probabilistic modeling, inference, and criticism. Edward is a Python library for probabilistic modeling, inference, and criticism. It is a testbed for fast experimentation and research with probabilistic models, ranging from classical hierarchical models on small data sets to complex deep probabilistic models on large data sets. Edward fuses three fields, Bayesian statistics and machine learning, deep learning, and probabilistic programming. Edward is built on TensorFlow. It...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Spring PetClinic

    Spring PetClinic

    A sample Spring-based application

    The Spring Petclinic is the canonical reference sample application built with Spring Boot and Thymeleaf, demonstrating best practices in building Spring-based web applications; forks also exist showcasing implementations with modern architectures like microservices, reactive stacks, and AI integrations. In its default configuration, Petclinic uses an in-memory database (H2) which gets populated at startup with data. The h2 console is exposed and it is possible to inspect the content of the database using the jdbc:h2:mem:<uuid> URL. The UUID is printed at startup to the console. One of the best parts about working on the Spring Petclinic application is that we have the opportunity to work in direct contact with many Open Source projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    The Common Lisp Reasoner extends the Common Lisp Object System (CLOS) to incorporate a powerful rule language suitable for all kinds of reasoning tasks, vanilla XML and RDF/XML interfaces, and support for a variety of AI-related applications, such as scheduling, planning and diagnosis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Grenade

    Grenade

    Deep Learning in Haskell

    Grenade is a composable, dependently typed, practical, and fast recurrent neural network library for concise and precise specifications of complex networks in Haskell. Because the types are so rich, there's no specific term level code required to construct this network; although it is of course possible and easy to construct and deconstruct the networks and layers explicitly oneself. Networks in Grenade can be thought of as a heterogeneous list of layers, where their type includes not only...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    JCLTP

    A Java Class Library for Text Processing

    JCLTP is a class library designed for processing text. JCLTP is free, open source and developed with the Java programming language. JCLTP is distributed under the GNU license. It incorporates several technologies that enable process information while applying AI techniques, in order to build predictive models for text classification. Through a flexible structure of interfaces and classes, the opportunity to extend, adapt and add functionality JCLTP is provided. Thus, analysis of new types...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Kohonen neural network library is a set of classes and functions for design, train and use Kohonen network (self organizing map) which is one of AI algorithms and useful tool for data mining and discovery knowledge in data (http://knnl.sf.net).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Skill Map

    Skill Map

    A visualization of programmer skill maps

    Skill‑Map is an open-source, collaborative project—originating from Geekbang—offering a structured visualization of programmer skill maps across domains like AI, front-end, backend, architecture, DevOps, and more. It serves as a navigable resource to organize learning paths and essential knowledge areas. Covers areas like AI, big data, architecture, frontend, backend, DevOps, testing, etc. Visual representation of programming and IT skill domains. Encourages community collaboration and feedback via GitHub Issues. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Human AI Net

    Human AI Net

    a Human and Artificial Intelligence Network

    ...The main data format is, from xorlisp which is also in progress, immutable binary forest nodes, so if millions of people build that together nobody can damage or change anyone else's data since its all constant. You dont change variables. You create new data that points at existing constant data, as deep as you need it. I have mindmap lists, definitions, and 2 editable properties working that way with 2 kinds of event listeners that work locally.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    JCLALtext

    Text processing module for JCLAL

    JCLALtext is a class library designed to extend the framework JCLAL text tasks. JCLALtext is free, open source and developed with the Java programming language. JCLALtext is distributed under the GNU license. The researcher can use the class library by adding it to your project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Vector File Parser is a library for parsing the data from swf, svg, ai, and possibly other vector files. By sub-classing the base render class, the data can be passed to any rendering system for display.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    tiny-AI Library

    tiny-AI Library

    small and fast C++ library dealing with artificial intelligence

    A fast artificial intelligence library which currently supports: kNN (k-Nearest Neighbor algorithm) MLP (Multilayer-Perceptron)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    MASyV (Multi-Agent System Visualization) enables one to write agent-based models/cellular automata, eg. in C, visualize them in real time & capture to movie file with MASyVs GUI & message passing lib. Includes examples: Hello World, ants, viral infection
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    breve is a free, open-source 3D multi-agent simulation package designed for simulating decentralized systems and artificial life.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB