34 projects for "data analytics" with 2 filters applied:

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    cracking-the-data-science-interview

    cracking-the-data-science-interview

    A Collection of Cheatsheets, Books, Questions, and Portfolio

    ...In addition to conceptual study materials, the project includes interview question banks and case study prompts that simulate real hiring scenarios. The resource is particularly useful for candidates preparing for technical interviews in data science, machine learning, or analytics roles.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Agentic Data Scientist

    Agentic Data Scientist

    An end-to-end Data Scientist

    Agentic Data Scientist is an experimental AI-driven research framework that orchestrates data science workflows through autonomous agents that can reason, plan, and execute complex analytics tasks. Unlike traditional scripted pipelines, this project lets AI agents break down high-level research goals into sub-tasks such as data acquisition, cleaning, modeling, evaluation, and reporting, with minimal human direction.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    trench

    trench

    Open-Source Analytics Infrastructure

    Trench is an open-source analytics infrastructure designed for tracking events and performing real-time analysis of application data at scale. The system is built on top of high-performance data technologies including Apache Kafka and ClickHouse, which allows it to ingest and process very large volumes of events while maintaining fast query performance. It was originally developed to solve scaling challenges in product analytics systems where traditional relational databases become inefficient as event tables grow. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Rill

    Rill

    Fast SQL-based BI tool for real-time dashboards and analytics

    ...Its BI-as-code model combines SQL, YAML configuration, Git version control, and CLI tools, allowing teams to build, manage, and deploy analytics workflows efficiently. Automatic data profiling and responsive query updates help users understand datasets instantly. Interactive dashboards come with opinionated defaults, so teams can focus on insights instead of setup, while metrics layers standardize business logic for consistent reporting across dashboards, APIs, and AI systems.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    Machine Learning and Data Science Apps

    Machine Learning and Data Science Apps

    A curated list of applied machine learning and data science notebooks

    ...Most examples are written in Python and frequently use Jupyter notebooks to present practical implementations and experiments. The project encourages contributions from data scientists and domain experts who want to share applied analytics projects and techniques that address real business challenges.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Open Semantic Search

    Open Semantic Search

    Open source semantic search and text analytics for large document sets

    ...It also supports optical character recognition to extract text from images and scanned documents, including images embedded inside PDF files. It integrates text mining and analytics capabilities that allow users to examine relationships, topics, and structured data within document collections.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    InvestBrain

    InvestBrain

    LLM-enabled investment tracker that consolidates market performance

    InvestBrain is a financial portfolio management and investment insight platform designed to help individual investors track assets, analyze performance, and explore data-driven insights across markets. It provides tools to import financial data such as stocks, cryptocurrencies, or ETFs, maintain watchlists, and view performance summaries that highlight gains, losses, allocations, and historical trends. The interface blends real-time or near-real-time market data with personalized analytics, so users can assess portfolio health, diversification, and risk exposure with intuitive charts and tables. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    HeavyDB

    HeavyDB

    HeavyDB (formerly MapD/OmniSciDB)

    ...Its architecture allows users to query datasets containing billions of rows in milliseconds without requiring traditional indexing, pre-aggregation, or sampling techniques. HeavyDB was originally developed as part of the OmniSci platform (formerly MapD) and is commonly used for large-scale analytics and geospatial data processing. The database compiles queries into optimized machine code that executes efficiently on GPU hardware, significantly accelerating analytical workloads. It supports hybrid deployment environments where queries can run on both CPU and GPU architectures depending on the available resources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Apache Hamilton

    Apache Hamilton

    Helps data scientists define testable self-documenting dataflows

    Apache Hamilton is an open-source Python framework designed to simplify the creation and management of dataflows used in analytics, machine learning pipelines, and data engineering workflows. The framework enables developers to define data transformations as simple Python functions, where each function represents a node in a dataflow graph and its parameters define dependencies on other nodes. Hamilton automatically analyzes these functions and constructs a directed acyclic graph representing the pipeline, allowing the system to execute transformations in the correct order. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    AI-Crawler

    AI-Crawler

    Crawl a website starting from a URL, find relevant pages

    AI Crawler is an experimental AI-powered web crawling and data extraction tool that uses natural language prompts to guide the discovery and retrieval of relevant information across websites. Unlike traditional web scrapers that rely on static selectors and manual scripting, it uses AI to dynamically identify and prioritize pages based on user intent, making it more flexible and resilient to changes in website structure. Users can define their data requirements in plain English, and the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    SEO Machine

    SEO Machine

    A specialized Claude Code workspace for creating long-form

    ...The system uses specialized commands and agents to perform tasks such as keyword research, competitor analysis, content drafting, and optimization. It incorporates real data sources like Google Analytics and Search Console to guide decision-making and improve content effectiveness. The architecture emphasizes context-awareness, using brand voice, style guides, and keyword strategies to maintain consistency across outputs. It also includes performance evaluation tools that score content and suggest improvements before publishing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    TabPFN

    TabPFN

    Foundation Model for Tabular Data

    ...The system supports a variety of tabular machine learning tasks and is designed to handle structured datasets commonly found in spreadsheets, databases, and business analytics systems.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    Pixeltable

    Pixeltable

    Data Infrastructure providing an approach to multimodal AI workloads

    ...Developers define data transformations and AI operations using computed columns on tables, allowing pipelines to evolve incrementally as new data or models are added. The framework supports multimodal content including images, video, text, and audio, enabling applications such as retrieval-augmented generation systems, semantic search, and multimedia analytics.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Vanna 2.0

    Vanna 2.0

    Chat with your SQL database

    ...The framework uses a retrieval-augmented generation architecture that learns from database schemas, documentation, and past query examples to generate accurate queries tailored to a specific dataset. Vanna can be integrated into many environments, including notebooks, web applications, messaging platforms, and data dashboards, making it flexible for analytics and data exploration workflows. The system streams query results, visualizations, and summaries directly to user interfaces, allowing non-technical users to interact with complex data systems through conversational queries. It also includes enterprise-grade features such as user-aware security, permission enforcement, and query auditing for production deployments.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Quantitative Trading System

    Quantitative Trading System

    A comprehensive quantitative trading system with AI-powered analysis

    Quantitative Trading System is a comprehensive quantitative trading platform that integrates artificial intelligence, financial data analysis, and automated strategy execution within a unified software system. The project is designed to provide an end-to-end infrastructure for building and operating algorithmic trading strategies in financial markets. It includes tools for collecting and processing market data from multiple sources, performing statistical and machine learning analysis, and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Netflix Maestro

    Netflix Maestro

    Netflix’s Workflow Orchestrator

    Maestro is a large-scale workflow orchestration platform originally developed by Netflix to coordinate complex data processing and machine learning workflows across distributed systems. The system acts as a general-purpose workflow orchestrator that manages the execution, scheduling, monitoring, and recovery of large pipelines used for analytics and AI operations. It was designed to support the demanding internal infrastructure of Netflix, where thousands of workflows must process massive volumes of data reliably and efficiently every day. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    DenchClaw

    DenchClaw

    Fully Managed OpenClaw Framework for all knowledge work ever

    DenchClaw is a local-first AI-powered CRM and productivity platform built on top of the OpenClaw framework, designed to transform a user’s entire computer into a programmable, agent-driven workspace. Unlike traditional cloud-based CRMs or AI tools, it runs entirely on the user’s machine and exposes a web interface locally, allowing full control over data, workflows, and automation without relying on external servers. The system combines database management, browser automation, and AI...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    Pydantic Logfire

    Pydantic Logfire

    Python observability platform for tracing apps, metrics, and logs

    Pydantic Logfire is an observability platform designed to help developers monitor, analyze, and understand the behavior of their applications in real time. It is built by the team behind Pydantic and follows a philosophy of combining powerful capabilities with ease of use, making it accessible to entire engineering teams. Pydantic Logfire provides deep visibility into application performance by capturing traces, metrics, and logs through an OpenTelemetry-based architecture. It is...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    Habit Tracker

    Habit Tracker

    Habit Tracker for the AI Coding Workshop

    Habit Tracker is a personal habit-tracking web application designed to help users build and maintain daily habits through intuitive UI and analytics that visualize progress over time. It runs locally with a FastAPI backend (Python) and a React frontend, storing all data in a lightweight SQLite database so there’s no need for user accounts or cloud storage, which keeps habit data fully private and self-contained. The app provides streak tracking and completion rates for each habit, giving users feedback on consistency and motivation by showing how often habits are completed and where they may be lagging. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    NVIDIA FLARE

    NVIDIA FLARE

    NVIDIA Federated Learning Application Runtime Environment

    NVIDIA Federated Learning Application Runtime Environment NVIDIA FLARE is a domain-agnostic, open-source, extensible SDK that allows researchers and data scientists to adapt existing ML/DL workflows(PyTorch, TensorFlow, Scikit-learn, XGBoost etc.) to a federated paradigm. It enables platform developers to build a secure, privacy-preserving offering for a distributed multi-party collaboration. NVIDIA FLARE is built on a componentized architecture that allows you to take federated...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    SALMONN family

    SALMONN family

    A suite of advanced multi-modal LLMs

    SALMONN is a family of advanced multi-modal large language models (LLMs) developed by ByteDance — designed to handle and integrate multiple data modalities (e.g. text, audio, video) rather than just plain text. The repository bundles different branches targeting specialized tasks (e.g. video-SALMONN, speech-quality assessment, general multimodal tasks), suggesting that the project is modular and extensible across domains. SALMONN aims to push the frontier of multi-modal AI by allowing models to process and reason over diverse inputs, which can be useful for applications such as video understanding, speech analytics, cross-modal retrieval, and general AI capable of interpreting rich, multi-sensory data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    XIAOJUSURVEY

    XIAOJUSURVEY

    Powerful survey system for creating, managing, and analyzing forms

    Xiaoju Survey is an open source survey platform developed to support the creation, distribution, and analysis of questionnaires at scale. It provides a comprehensive system for designing surveys with various question types, allowing users to build flexible and structured forms suited for different data collection needs. Xiaoju Survey includes tools for managing survey workflows, enabling teams to organize responses and monitor participation efficiently. It also focuses on data analysis...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 23
    MOA - Massive Online Analysis

    MOA - Massive Online Analysis

    Big Data Stream Analytics Framework.

    A framework for learning from a continuous supply of examples, a data stream. Includes classification, regression, clustering, outlier detection and recommender systems. Related to the WEKA project, also written in Java, while scaling to adaptive large scale machine learning.
    Downloads: 50 This Week
    Last Update:
    See Project
  • 24
    Mars Framework

    Mars Framework

    Mars is a tensor-based unified framework for large-scale data

    ...Its architecture automatically divides large computational tasks into smaller chunks that can be executed across multiple nodes in a cluster, allowing complex analytics, machine learning workflows, and data transformations to run efficiently at scale. Mars is particularly useful for workloads that exceed the memory capacity of a single machine or require high levels of parallel processing.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    Guia do Cientista de Dados das Galáxias

    Guia do Cientista de Dados das Galáxias

    Repository for gathering information on study materials

    Guia do Cientista de Dados das Galáxias is an open-source community repository that aggregates educational resources, tools, and references related to data science, machine learning, and analytics. The project was created by the Pizza de Dados community with the goal of organizing useful materials for people interested in learning or working in the data science ecosystem. The repository collects links to books, podcasts, tutorials, datasets, communities, and study groups that can help learners navigate the field of data science more efficiently. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB