Search Results for "data analytics" - Page 6

Showing 629 open source projects for "data analytics"

  • 1
    Apache AGE

    Graph database optimized for fast analysis

    Apache AGE (A Graph Extension) is a PostgreSQL extension that adds support for graph database functionality using the openCypher query language. AGE allows users to store and query graph-structured data alongside traditional relational data in a single database. Built for performance and scalability, it brings the power of graph queries to the robustness of PostgreSQL, making it suitable for use cases like social networks, recommendation systems, and fraud detection.
    Downloads: 1 This Week
  • 2
    AI-Crawler

    Crawl a website starting from a URL, find relevant pages

    AI Crawler is an experimental AI-powered web crawling and data extraction tool that uses natural language prompts to guide the discovery and retrieval of relevant information across websites. Unlike traditional web scrapers that rely on static selectors and manual scripting, it uses AI to dynamically identify and prioritize pages based on user intent, making it more flexible and resilient to changes in website structure. Users can define their data requirements in plain English, and the...
    Downloads: 1 This Week
  • 3
    SEO Machine

    A specialized Claude Code workspace for creating long-form

    ...The system uses specialized commands and agents to perform tasks such as keyword research, competitor analysis, content drafting, and optimization. It incorporates real data sources like Google Analytics and Search Console to guide decision-making and improve content effectiveness. The architecture emphasizes context-awareness, using brand voice, style guides, and keyword strategies to maintain consistency across outputs. It also includes performance evaluation tools that score content and suggest improvements before publishing.
    Downloads: 0 This Week
  • 4
    BettaFish

    Public opinion analysis system

    ...Unlike simpler analytics tools, BettaFish employs agent collaboration and a “forum” style internal mechanism to combine diverse model outputs, making the analysis richer and more robust. It also integrates multimodal processing, enabling it to parse images and video alongside text.
    Downloads: 0 This Week
  • 5
    IronClaw

    IronClaw is OpenClaw inspired but focused on privacy & security

    IronClaw is a security-first, open-source personal AI assistant built in Rust and designed to keep your data fully under your control. It operates on the principle that your AI should work for you, not external vendors, ensuring all data is stored locally, encrypted, and never shared. The platform emphasizes transparency, offering auditable code with no hidden telemetry or data harvesting. IronClaw runs untrusted tools inside isolated WebAssembly (WASM) sandboxes with strict capability-based...
    Downloads: 58 This Week
  • 6
    Kotlin Dataframe

    Structured data processing in Kotlin

    ...The Kotlin Dataframe library is an idiomatic Kotlin DSL defining such operations. Working with data frames is often called data wrangling: transforming and mapping data from a "raw" form into a format better suited to analytics and visualization. The goal of data wrangling is to ensure that the data is high-quality and useful.
    Downloads: 1 This Week
  • 7
    ConcourseDB

    Distributed database warehouse for transactions, search and analytics

    ConcourseDB is a distributed, self-tuning database designed for real-time applications, offering strong consistency and ACID compliance without requiring complex configurations. It provides dynamic schema support and automatic indexing.
    Downloads: 0 This Week
  • 8
    redis-py

    Redis Python client

    redis-py is the official Python client for interacting with Redis, the in-memory data structure store. It supports all Redis commands and data types, making it easy to build caching, messaging, or real-time analytics features in Python applications. With both synchronous and asyncio support, redis-py is suited for modern Python projects and integrates smoothly into web frameworks, task queues, and backend services.
    Downloads: 0 This Week
  • 9
    Lithops

    A multi-cloud framework for big data analytics

    Lithops is an open-source serverless computing framework that enables transparent execution of Python functions across multiple cloud providers and on-prem infrastructure. It abstracts cloud providers like IBM Cloud, AWS, Azure, and Google Cloud into a unified interface and turns your Python functions into scalable, event-driven workloads. Lithops is ideal for data processing, ML inference, and embarrassingly parallel workloads, giving you the power of FaaS (Function-as-a-Service) without...
    Downloads: 1 This Week
  • 10
    Cassandra Spark Connector

    Apache Spark to Apache Cassandra connector

    The Apache Cassandra Spark Connector allows Spark jobs (RDDs or DataFrames/Datasets) to read from and write to Cassandra tables. Compatible with Apache Cassandra (v2.1+), Spark 1.0–3.5, and Scala 2.11–2.13, it supports mapping Cassandra rows to Scala case classes, saving results back to Cassandra, and executing arbitrary CQL within Spark applications.
    Downloads: 0 This Week
  • 11
    Apache Hudi

    Upserts, Deletes And Incremental Processing on Big Data

    Apache Hudi (pronounced "Hoodie") stands for Hadoop Upserts Deletes and Incrementals. Hudi manages the storage of large analytical datasets on DFS (cloud stores, HDFS, or any Hadoop FileSystem-compatible storage). Apache Hudi is a transactional data lake platform that brings database and data warehouse capabilities to the data lake. Hudi replaces slow, old-school batch data processing with an incremental processing framework for low-latency, minute-level analytics. Hudi provides efficient upserts by mapping a given hoodie key (record key + partition path) consistently to a file id via an indexing mechanism. ...
    Downloads: 0 This Week
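    The upsert routing described in the Apache Hudi entry (a hoodie key mapped consistently to a file id through an index) can be illustrated with a toy in-memory model. This is only a sketch of the idea, not Hudi's API: the `ToyIndex` class, its `upsert` method, and the file id naming scheme are all hypothetical.

```python
from typing import Dict, Tuple

# A "hoodie key" is modeled here as (record_key, partition_path).
HoodieKey = Tuple[str, str]


class ToyIndex:
    """Hypothetical in-memory stand-in for a key -> file id index."""

    def __init__(self) -> None:
        self.key_to_file: Dict[HoodieKey, str] = {}
        self.files: Dict[str, Dict[HoodieKey, dict]] = {}

    def upsert(self, record_key: str, partition_path: str, payload: dict) -> str:
        key = (record_key, partition_path)
        file_id = self.key_to_file.get(key)
        if file_id is None:
            # New key: assign it to this partition's file (assumed naming scheme).
            file_id = f"{partition_path}/file-0"
            self.key_to_file[key] = file_id
        # Known key: the index sends the write to the same file, so the
        # existing record is overwritten instead of duplicated.
        self.files.setdefault(file_id, {})[key] = payload
        return file_id


index = ToyIndex()
first = index.upsert("user-1", "2024/01", {"clicks": 1})
second = index.upsert("user-1", "2024/01", {"clicks": 5})
```

    Because both writes resolve to the same file id, the second call updates the record in place, which is the property that makes an upsert cheaper than rewriting the whole dataset.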
  • 12
    Milvus

    Vector database for scalable similarity search and AI applications

    ...Rich APIs designed for data science workflows. Consistent user experience across laptop, local cluster, and cloud. Embed real-time search and analytics into virtually any application. Milvus’ built-in replication and failover/failback features ensure data and applications can maintain business continuity in the event of a disruption. Component-level scalability makes it possible to scale up and down on demand.
    Downloads: 2 This Week
  • 13
    FastGPT

    FastGPT is a knowledge-based platform built on LLMs

    FastGPT is a knowledge-based platform built on LLMs. It offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you develop and deploy complex question-answering systems without extensive setup or configuration.
    Downloads: 1 This Week
  • 14
    Greenplum Database

    Massively parallel data platform for analytics, machine learning and AI

    Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas. With its unique cost-based query optimizer designed for large-scale data workloads, Greenplum scales interactive and batch-mode analytics to large datasets in the petabytes without degrading query performance and throughput. Based on PostgreSQL, Greenplum provides you with more control over the software you deploy, reducing vendor lock-in, and allowing open influence on product direction. Greenplum reduces data silos by providing you with a single, scale-out environment for converging analytic and operational workloads, like streaming ingestion. ...
    Downloads: 7 This Week
  • 15
    TabPFN

    Foundation Model for Tabular Data

    ...The system supports a variety of tabular machine learning tasks and is designed to handle structured datasets commonly found in spreadsheets, databases, and business analytics systems.
    Downloads: 2 This Week
  • 16
    KIS Open API

    Korea Investment & Securities Open API Github

    The open-trading-api repository from Korea Investment & Securities provides sample code and developer resources for interacting with the KIS Developers Open Trading API, which enables programmatic access to financial market data and automated trading functionality. The project is designed primarily for Python developers and AI automation environments that want to build investment applications, algorithmic trading systems, or financial analytics tools using the brokerage’s infrastructure. It includes example scripts that demonstrate how to authenticate with the service, retrieve financial data, and execute trading operations through REST and WebSocket interfaces. ...
    Downloads: 3 This Week
  • 17
    Apache Hamilton

    Helps data scientists define testable self-documenting dataflows

    Apache Hamilton is an open-source Python framework designed to simplify the creation and management of dataflows used in analytics, machine learning pipelines, and data engineering workflows. The framework enables developers to define data transformations as simple Python functions, where each function represents a node in a dataflow graph and its parameters define dependencies on other nodes. Hamilton automatically analyzes these functions and constructs a directed acyclic graph representing the pipeline, allowing the system to execute transformations in the correct order. ...
    Downloads: 0 This Week
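    The function-as-node idea in the Apache Hamilton entry can be sketched with the standard library alone. This is a minimal illustration of the technique, not Hamilton's actual API: `build_graph`, `execute`, and the example functions are hypothetical names, and the sketch assumes the graph has no cycles.

```python
import inspect
from typing import Any, Callable, Dict, List, Optional, Tuple

Graph = Dict[str, Tuple[Callable[..., Any], List[str]]]


def build_graph(funcs: List[Callable[..., Any]]) -> Graph:
    # Each function is a node; its parameter names are its dependencies.
    return {f.__name__: (f, list(inspect.signature(f).parameters)) for f in funcs}


def execute(graph: Graph, target: str, inputs: Dict[str, Any],
            cache: Optional[Dict[str, Any]] = None) -> Any:
    # Resolve dependencies depth-first, so every node runs after its inputs.
    cache = {} if cache is None else cache
    if target in inputs:
        return inputs[target]
    if target not in cache:
        func, deps = graph[target]
        args = {d: execute(graph, d, inputs, cache) for d in deps}
        cache[target] = func(**args)
    return cache[target]


def spend_per_signup(spend: float, signups: int) -> float:
    return spend / signups


def cost_label(spend_per_signup: float) -> str:
    return f"{spend_per_signup:.2f}"


graph = build_graph([spend_per_signup, cost_label])
result = execute(graph, "cost_label", {"spend": 100.0, "signups": 20})
```

    Here `cost_label` depends on `spend_per_signup` only because of its parameter name, so the execution order falls out of the function signatures rather than from any explicit wiring.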
  • 18
    Stripe Sync Engine

    Sync your Stripe account to your Postgres database

    stripe-sync-engine is a tool by Supabase that continuously syncs Stripe data into a Postgres database using webhooks. It ensures that billing-related Stripe objects like customers, subscriptions, and invoices are always up to date in your local database. This makes it easy to run analytics, reporting, or custom business logic using SQL without hitting Stripe’s API repeatedly.
    Downloads: 1 This Week
  • 19
    Vanna 2.0

    Chat with your SQL database

    ...The framework uses a retrieval-augmented generation architecture that learns from database schemas, documentation, and past query examples to generate accurate queries tailored to a specific dataset. Vanna can be integrated into many environments, including notebooks, web applications, messaging platforms, and data dashboards, making it flexible for analytics and data exploration workflows. The system streams query results, visualizations, and summaries directly to user interfaces, allowing non-technical users to interact with complex data systems through conversational queries. It also includes enterprise-grade features such as user-aware security, permission enforcement, and query auditing for production deployments.
    Downloads: 2 This Week
  • 20
    Fabric

    Fabric is an open-source framework for augmenting humans using AI

    Fabric is a command-line framework that turns a curated library of prompt “patterns” into reusable, automatable workflows for large language models. Instead of repeatedly crafting ad-hoc prompts, you pick a pattern (for research, summarization, brainstorming, code review, and more) and feed it inputs or files. The tool standardizes execution with configuration files and environment variables, enabling reproducible runs across different models and providers. Patterns can be customized with...
    Downloads: 8 This Week
  • 21
    Site Kit for WordPress

    Site Kit is a one-stop solution for WordPress users

    ...Widgets and contextual panels put data where it’s needed—for example, on the post list or edit screen—reducing tab-switching. For developers and agencies, Site Kit simplifies client onboarding and standardizes reporting across many sites.
    Downloads: 4 This Week
  • 22
    Deequ

    Deequ is a library built on top of Apache Spark

    ...It also includes a little domain-specific language called DQDL (Data Quality Definition Language) which allows declarative specification of quality rules. Users typically run Deequ before feeding data downstream (to ML pipelines, analytics, or production systems), enabling early detection and isolation of data errors. There is also a Python wrapper, PyDeequ, for users who prefer working from Python environments.
    Downloads: 1 This Week
  • 23
    Siddhi Core Libraries

    Stream Processing and Complex Event Processing Engine

    Fully open source, cloud-native, scalable, micro streaming, and complex event processing system capable of building event-driven applications for use cases such as real-time analytics, data integration, notification management, and adaptive decision-making. Event processing logic can be written using Streaming SQL queries via graphical and source editors, to capture events from diverse data sources, process and analyze them, integrate with multiple services and data stores, and publish output to various endpoints in real time. ...
    Downloads: 1 This Week
  • 24
    OpenSearch

    Open source distributed and RESTful search engine

    OpenSearch is a distributed search and analytics engine based on Apache Lucene. After adding your data to OpenSearch, you can perform full-text searches on it with all of the features you might expect: search by field, search multiple indices, boost fields, rank results by score, sort results by field, and aggregate results. Unsurprisingly, people often use search engines like OpenSearch as the backend for a search application, such as Wikipedia or an online store.
    Downloads: 3 This Week
  • 25
    Elide

    Elide is a Java library that lets you stand up a GraphQL/JSON-API

    ...And that's it: you are ready to deploy and query your data with JSON or GraphQL requests. Quickly build and deploy production-quality web services that expose your data as a service. Elide APIs support complex filtering rules, sorting, pagination, subscriptions, and text search. Controlling access to your data is as simple as defining your rules and annotating your models.
    Downloads: 3 This Week