Showing 464 open source projects for "ai data analyst"

  • 1
    AI Researcher

    An autonomous AI researcher

    ...Each agent operates with clear roles — such as researcher, analyst, and summarizer — and they communicate through a task-management interface that ensures progress tracking and iterative refinement. The system emphasizes modularity, so teams can swap in new reasoning modules, data retrieval strategies, or domain knowledge bases depending on the research topic. Through self-supervised feedback loops, agents adjust their strategies based on prior outcomes, improving both the quality and relevance of results over time.
    Downloads: 4 This Week
  • 2
    Streamline Analyst

    AI agent that streamlines the entire process of data analysis

    Streamline Analyst is an open-source application powered by Large Language Models (LLMs) for data analysis. This Data Analysis Agent automates tasks such as data cleaning and preprocessing, as well as more complex operations like identifying target objects, partitioning test sets, and selecting the best-fit models based on your data.
    Downloads: 0 This Week
  • 3
    AI Data Science Team

    An AI-powered data science team of agents

    AI Data Science Team is a Python library and agent ecosystem designed to accelerate and automate common data science workflows by modeling them as specialized AI “agents” that can be orchestrated to perform tasks like data cleaning, transformation, analysis, visualization, and machine learning. It provides a modular agent framework where each agent focuses on a step in the typical data science pipeline — for example, loading data from CSV/Excel files, cleaning and wrangling messy datasets, engineering predictive features, building models with AutoML, connecting to SQL databases, and producing visual outputs — all driven by natural language or programmatic instructions. ...
    Downloads: 4 This Week
  • 4
    Agentic Data Scientist

    An end-to-end Data Scientist

    Agentic Data Scientist is an experimental AI-driven research framework that orchestrates data science workflows through autonomous agents that can reason, plan, and execute complex analytics tasks. Unlike traditional scripted pipelines, this project lets AI agents break down high-level research goals into sub-tasks such as data acquisition, cleaning, modeling, evaluation, and reporting, with minimal human direction.
    Downloads: 1 This Week
  • 5
    PandasAI

    PandasAI is a Python library that integrates generative AI

    PandasAI is a Python library that adds Generative AI capabilities to pandas, the popular data analysis and manipulation tool. It is designed to be used in conjunction with pandas, and is not a replacement for it. PandasAI makes pandas (and the most widely used data analysis libraries) conversational, allowing you to ask questions of your data in natural language. For example, you can ask PandasAI to find all the rows in a DataFrame where the value of a column is greater than 5, and it will return a DataFrame containing only those rows.
    Downloads: 5 This Week
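The filter named in the PandasAI entry above (rows where a column exceeds 5) is the kind of operation a natural-language layer has to arrive at. Below is a minimal sketch of that target operation, written in plain Python so it runs without dependencies. The column name `value`, the helper `filter_greater_than`, and the sample rows are hypothetical, and nothing here uses or reproduces PandasAI's API. In pandas itself the equivalent is boolean indexing, `df[df["value"] > 5]`.

```python
# Hypothetical sample data standing in for the rows of a DataFrame.
rows = [
    {"id": 1, "value": 3},
    {"id": 2, "value": 7},
    {"id": 3, "value": 5},
    {"id": 4, "value": 9},
]


def filter_greater_than(records, column, threshold):
    """Return the records whose `column` is strictly greater than `threshold`."""
    return [r for r in records if r[column] > threshold]


# "Find all rows where value is greater than 5."
matches = filter_greater_than(rows, "value", 5)
print([r["id"] for r in matches])
```

Note that the comparison is strict, so the row with `value` equal to 5 is excluded.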
  • 6
    Dash Data Agent

    Self-learning data agent that grounds its answers in layers of context

    Dash is a self-learning data agent built by the Agno AI community that generates grounded answers to English queries over structured data by synthesizing SQL and reasoning based on six layers of context, improving automatically with each run. It sidesteps common limitations of simple text-to-SQL agents by incorporating multiple context layers — including schema structure, human annotations, known query patterns, institutional knowledge from docs, machine-discovered error patterns, and live runtime context — to generate SQL queries that are both technically correct and semantically meaningful. ...
    Downloads: 2 This Week
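One way to picture the layered context described in the Dash entry above is as a prompt assembled from named sections before the question is handed to a model. The sketch below is an illustration under that assumption only: the layer names paraphrase the listing, while `build_prompt`, the section format, and the sample contents are hypothetical and are not taken from Dash's code.

```python
# Layer names paraphrase the listing; the order and format are assumptions.
LAYER_ORDER = [
    "schema structure",
    "human annotations",
    "known query patterns",
    "institutional knowledge",
    "error patterns",
    "runtime context",
]


def build_prompt(question, layers):
    """Join the non-empty context layers, in a fixed order, ahead of the question."""
    sections = []
    for name in LAYER_ORDER:
        text = layers.get(name, "").strip()
        if text:  # skip layers that have nothing to contribute
            sections.append(f"## {name}\n{text}")
    sections.append(f"## question\n{question}")
    return "\n\n".join(sections)


# Hypothetical layer contents for a toy sales database.
layers = {
    "schema structure": "orders(id, customer_id, total, created_at)",
    "human annotations": "total is stored in cents",
    "error patterns": "created_at is UTC; do not compare with local dates",
}

prompt = build_prompt("What was total revenue last month?", layers)
print(prompt)
```

Keeping the layers separate and ordered makes it easy to see which piece of context a generated query relied on.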
  • 7
    AI-Media2Doc

    AI tool converting video/audio into structured documents instantly

    AI-Media2Doc emphasizes privacy by processing media locally in the browser using WebAssembly-based ffmpeg, ensuring that original video files are not uploaded externally. It separates client-side media handling from backend AI processing, reducing data exposure while still enabling transcription and document generation. AI-Media2Doc supports flexible customization through prompts, allowing users to tailor output styles based on their needs.
    Downloads: 6 This Week
  • 8
    Flowly AI

    Flowly is 100x faster than OpenClaw

    Flowly is an open-source personal AI assistant that runs locally on your machine and connects to multiple communication platforms like Telegram, WhatsApp, Discord, and Slack. It acts as a centralized AI system that can perform tasks such as web browsing, file management, command execution, scheduling, and more—all while keeping your data private. Designed for flexibility, Flowly supports multiple AI providers and models through LiteLLM, allowing users to customize how their assistant behaves. ...
    Downloads: 2 This Week
  • 9
    Quark Agent

    Quark Agent - Your AI-powered Android APK Analyst

    With Quark Agent, you can perform analyses using only natural language. It creates Quark Script code following your ideas and adjusts the code promptly as you provide feedback.
    Downloads: 5 This Week
  • 10
    CAMEL AI

    Finding the Scaling Law of Agents. A multi-agent framework

    The rapid advancement of conversational and chat-based language models has led to remarkable progress in complex task-solving. However, their success heavily relies on human input to guide the conversation, which can be challenging and time-consuming. This paper explores the potential of building scalable techniques to facilitate autonomous cooperation among communicative agents and provide insight into their "cognitive" processes. To address the challenges of achieving autonomous...
    Downloads: 6 This Week
  • 11
    AI Runner

    Offline inference engine for art, real-time voice conversations

    AI Runner is an offline inference engine designed to run a collection of AI workloads on your own machine, including image generation for art, real-time voice conversations, LLM-powered chatbots and automated workflows. It is implemented as a desktop-oriented Python application and emphasizes privacy and self-hosting, allowing users to work with text-to-speech, speech-to-text, text-to-image and multimodal models without sending data to external services.
    Downloads: 8 This Week
  • 12
    Datapizza AI

    Build reliable Gen AI solutions without overhead

    Datapizza AI is a lightweight framework for building modular, multi-agent AI systems that collaborate to solve complex tasks through orchestration and tool usage. The project focuses on simplicity and transparency, enabling developers to construct agent-based workflows without the heavy abstractions and dependencies often found in larger AI frameworks.
    Downloads: 4 This Week
  • 13
    Ludwig AI

    Low-code framework for building custom LLMs, neural networks

    Declarative deep learning framework built for scale and efficiency. Ludwig is a low-code framework for building custom AI models like LLMs and other deep neural networks. A declarative YAML configuration file is all you need to train a state-of-the-art LLM on your data. It supports multi-task and multi-modality learning. Comprehensive config validation detects invalid parameter combinations and prevents runtime failures. Other features include automatic batch size selection, distributed training (DDP, DeepSpeed), parameter-efficient fine-tuning (PEFT), 4-bit quantization (QLoRA), and larger-than-memory datasets. ...
    Downloads: 4 This Week
  • 14
    Pruna AI

    Pruna is a model optimization framework built for developers

    ...It is ideal for companies or teams looking to reduce reliance on external APIs while maintaining speed, cost-efficiency, and full control over their data and AI stack. With a focus on extensibility and observability, Pruna empowers engineers to scale LLM applications from prototype to production securely and reliably.
    Downloads: 3 This Week
  • 15
    AI Hedge Fund

    An AI Hedge Fund Team

    This repository demonstrates how to build a simplified, automated hedge fund strategy powered by AI/ML. It integrates financial data collection, preprocessing, feature engineering, and predictive modeling to simulate decision-making in trading. The code shows workflows for pulling stock or market data, applying machine learning algorithms to forecast trends, and generating buy/sell/hold signals based on the predictions. Its structure is educational: intended more as a proof-of-concept than a ready-to-use financial product, giving learners insight into the mechanics of quantitative finance automation. ...
    Downloads: 0 This Week
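The last step named in the AI Hedge Fund entry above, turning predictions into buy/sell/hold signals, can be sketched with a simple threshold rule. This is a toy illustration, not the repository's method: the function `signal`, the 2% thresholds, the tickers, and the forecast values are all hypothetical.

```python
def signal(predicted_return, buy_above=0.02, sell_below=-0.02):
    """Map a forecast return to a signal using fixed, assumed thresholds."""
    if predicted_return > buy_above:
        return "buy"
    if predicted_return < sell_below:
        return "sell"
    return "hold"


# Hypothetical model forecasts of next-period return per ticker.
forecasts = {"AAA": 0.05, "BBB": -0.03, "CCC": 0.01}

signals = {ticker: signal(r) for ticker, r in forecasts.items()}
print(signals)
```

A forecast inside the band between the two thresholds yields "hold", which keeps small, noisy predictions from triggering trades.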
  • 16
    ai-cookbook

    Examples and tutorials to help developers build AI systems

    ...The repository contains examples that demonstrate how to build AI workflows using modern tools such as large language models, autonomous agents, and external APIs. Developers can learn how to construct applications like intelligent assistants, automation pipelines, and AI-powered data analysis tools through step-by-step tutorials and ready-to-run scripts. The code examples are designed to emphasize practical architecture patterns that are commonly used in production environments, helping developers understand how to integrate AI services into software products.
    Downloads: 0 This Week
  • 17
    Pathway AI Pipelines

    Ready-to-run cloud templates for RAG

    Pathway AI Pipelines is a collection of ready-to-deploy AI pipeline templates designed to help developers rapidly build production-grade retrieval-augmented generation and enterprise search applications. The project provides end-to-end examples that connect live data sources to LLM workflows, enabling applications to stay synchronized with continuously changing information.
    Downloads: 0 This Week
  • 18
    AI Engineer Headquarters

    A collection of scientific methods, processes, algorithms

    ...Rather than focusing only on theoretical knowledge, the repository emphasizes applied learning and encourages engineers to build real systems that incorporate machine learning, large language models, data pipelines, and AI infrastructure. The curriculum includes a progression of topics such as foundational AI engineering skills, machine learning systems design, large language model usage, retrieval-augmented generation systems, model fine-tuning, and autonomous AI agents. It also promotes disciplined learning routines and project-based practice so learners can develop practical experience and build deployable solutions.
    Downloads: 0 This Week
  • 19
    AI-Tutorials/Implementations Notebooks

    Code and Notebooks for AI Projects

    The AI-Tutorials/Implementations Notebooks repository is a comprehensive collection of artificial intelligence tutorials and implementation examples intended for developers, students, and researchers who want to learn by building practical AI projects. The repository contains numerous Jupyter notebooks and code samples that demonstrate modern techniques in machine learning, deep learning, data science, and large language model workflows.
    Downloads: 10 This Week
  • 20
    The AI Scientist-v2

    Workshop-Level Automated Scientific Discovery via Agentic Tree Search

    AI-Scientist-v2 is an advanced autonomous research system designed to perform end-to-end scientific discovery using large language models and agent-based orchestration. The platform is capable of generating original research ideas, designing and executing experiments, analyzing and visualizing results, and producing full academic papers without direct human intervention. It introduces a generalized framework that removes reliance on predefined templates, enabling broader applicability across...
    Downloads: 2 This Week
  • 21
    Wanwu AI Agent Platform

    Enterprise AI agent platform for workflows, models, and RAG apps

    ...It includes comprehensive model lifecycle management capabilities, enabling users to configure, monitor, and manage different models efficiently. Wanwu also supports knowledge base construction, allowing organizations to incorporate structured and unstructured data into their AI applications. With a focus on openness and extensibility, it encourages developers to build on top of its ecosystem while maintaining a secure and compliant architecture for business use cases.
    Downloads: 9 This Week
  • 22
    Synthetic Data Vault (SDV)

    Synthetic Data Generation for tabular, relational and time series data

    The Synthetic Data Vault (SDV) is an ecosystem of synthetic data generation libraries that lets users learn from single-table, multi-table, and time series datasets and then generate new synthetic data with the same format and statistical properties as the original dataset. Synthetic data can then be used to supplement, augment, and in some cases replace real data when training Machine Learning models. Additionally, it enables the testing of Machine Learning or other data dependent...
    Downloads: 2 This Week
  • 23
    Superduper

    Superduper: Integrate AI models and machine learning workflows

    ...This allows developers to completely avoid implementing MLOps, ETL pipelines, model deployment, data migration, and synchronization. Using Superduper is simply "CAPE": Connect to your data, apply arbitrary AI to that data, package and reuse the application on arbitrary data, and execute AI-database queries and predictions on the resulting AI outputs and data.
    Downloads: 8 This Week
  • 24
    cognee

    Deterministic LLM Outputs for AI Applications and AI Agents

    We build for developers who need a reliable, production-ready data layer for AI applications. Cognee implements scalable, modular data pipelines that allow for creating the LLM-enriched data layer using graph and vector stores. Cognee acts as a semantic memory layer, unveiling hidden connections within your data and infusing it with your company's language and principles. This self-optimizing process ensures ultra-relevant, personalized, and contextually aware LLM retrievals. ...
    Downloads: 12 This Week
  • 25
    DeerFlow

    Deep Research framework, combining language models with tools

    DeerFlow is an open-source, community-driven “deep research” framework / multi-agent orchestration platform developed by ByteDance. It aims to combine the reasoning power of large language models (LLMs) with automated tool-use — such as web search, web crawling, Python execution, and data processing — to enable complex, end-to-end research workflows. Instead of a monolithic AI assistant, DeerFlow defines multiple specialized agents (e.g. “planner,” “searcher,” “coder,” “report generator”) that collaborate in a structured workflow, allowing tasks like literature reviews, data gathering, data analysis, code execution, and final report generation to be largely automated. ...
    Downloads: 54 This Week
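The division of labor described in the DeerFlow entry above (planner, searcher, coder, report generator) can be pictured as a chain of steps that each read and extend a shared state. The sketch below is a toy illustration under that assumption: every function is a hypothetical stub, no language model or web tool is called, and it does not reflect DeerFlow's actual architecture or API.

```python
def planner(state):
    """Break the task into ordered steps (fixed here; a real planner would use an LLM)."""
    state["plan"] = [f"search: {state['task']}", "analyze results", "write report"]
    return state


def searcher(state):
    """Stand-in for web search or crawling; returns fixed notes."""
    state["notes"] = ["note A", "note B"]
    return state


def coder(state):
    """Stand-in for code execution over the gathered notes."""
    state["analysis"] = {"note_count": len(state["notes"])}
    return state


def report_generator(state):
    """Summarize the earlier agents' outputs into a final report string."""
    count = state["analysis"]["note_count"]
    state["report"] = f"Report on {state['task']}: {count} sources reviewed."
    return state


PIPELINE = [planner, searcher, coder, report_generator]


def run(task):
    """Pass a shared state dict through each agent in order."""
    state = {"task": task}
    for agent in PIPELINE:
        state = agent(state)
    return state


result = run("open source data agents")
print(result["report"])
```

Because each stage only touches the shared state, a stage can be swapped out or reordered without changing the others.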