Showing 38 open source projects for "ai data analyst"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    watercrawl

    watercrawl

    AI-ready web crawler that extracts and structures website content

    ...WaterCrawl supports customizable extraction rules so users can focus only on relevant elements while ignoring unnecessary page components. WaterCrawl also offers real-time monitoring capabilities, allowing users to track crawling progress, performance metrics, and errors during large data collection jobs. Developers can integrate the tool into applications through a REST API and multiple client SDKs, enabling automated data pipelines and AI data preparation workflows.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    Postiz

    Postiz

    The ultimate social media scheduling tool, with a bunch of AI

    ...Easily manage multiple client accounts for increased productivity and better results. Schedule, analyze, and engage with your audience. Cross-post your social media posts into multiple channels. Improve your content creation process with an AI agent that performs all tasks for you. Use a Canva-like tool to create stunning visuals for your social media posts and generate pictures with AI. Manage your social media channels with ease. Collaborate with your team and delegate tasks. Expose your brand to a wider audience by connecting with influencers and brands. Learn from your data and improve your social media strategy. ...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 3
    BrowserOS

    BrowserOS

    Agentic browser; privacy-first alternative to ChatGPT Atlas

    BrowserOS is an open-source, agentic web browser built on a Chromium base that integrates AI agents directly into the browsing experience. Rather than just doing standard browsing, it places AI intelligence at the core: you can connect your own API keys (for e.g., OpenAI, Anthropic, Google Gemini) or run local models (via e.g., Ollama) so that your browsing data and automation stay on your machine — privacy and control are emphasized throughout.
    Downloads: 36 This Week
    Last Update:
    See Project
  • 4
    kimuraframework

    kimuraframework

    AI-first Ruby framework for building fast, flexible web scraping spide

    Kimurai is an open source web scraping framework written in Ruby that simplifies the process of building automated data extraction tools. It provides a clean domain-specific language that allows developers to define scraping logic and data schemas with minimal boilerplate code. Kimurai can use AI-assisted extraction to identify where data resides in HTML pages, automatically generating selectors that are cached for future use so subsequent scraping runs operate with pure Ruby performance. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Scira

    Scira

    AI-powered search engine that helps you find information

    ...Scira emphasizes speed, clean UI design, and extensibility so teams can customize data sources, models, and ranking logic. The architecture typically supports real-time querying, streaming responses, and modular backend components. Overall, Scira targets builders who want a self-hosted AI search experience focused on transparency and customization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    ai-scrapper
    🚀 Discover AI Web Scraper! 🚀 Tired of copying and pasting data from websites? I developed a desktop application with Electron and Gemini AI to extract structured data easily and efficiently! 🤖✨
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    Ghostery

    Ghostery

    Ghostery Browser Extension for Firefox, Chrome, Opera and Edge

    Ghostery helps you browse smarter by giving you control over ads and tracking technologies to speed up page loads, eliminate clutter, and protect your data. This is the unified code repository for the Ghostery browser extensions in Chrome, Firefox, Opera and Edge. Browse the web safer, faster & with less annoying ads. Equipped with award-winning AI anti-tracking technology to browse the websafe and quickly. Ghostery helps you stay informed about what companies are tracking you by listing the trackers on each website you visit. ...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 8
    SEO Machine

    SEO Machine

    A specialized Claude Code workspace for creating long-form

    ...It incorporates real data sources like Google Analytics and Search Console to guide decision-making and improve content effectiveness. The architecture emphasizes context-awareness, using brand voice, style guides, and keyword strategies to maintain consistency across outputs. It also includes performance evaluation tools that score content and suggest improvements before publishing.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Crawl4AI

    Crawl4AI

    Open-source LLM Friendly Web Crawler & Scraper

    Crawl4AI is a high-performance, AI‑ready web crawler tailored for LLM data ingestion and RAG pipelines. It supports adaptive crawling heuristics (stopping when enough info is gathered), structured markdown output, and high-speed parallel execution. Designed to operate at scale with optional Docker deployment and framework integrations.
    Downloads: 1 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    YourInfo

    YourInfo

    Real-time browser fingerprinting demo with cross-browser tracking

    YourInfo is a personal information management tool designed to let users securely store, structure, and retrieve their key data — such as contacts, credentials, personal notes, and preferences — while also enabling AI-assisted queries or reminders using that data. The platform prioritizes privacy by focusing on local storage or user-controlled databases, ensuring sensitive data stays under the user’s control rather than in third-party servers. Users can define types of information, tag entries for quick categorization, and perform intuitive searches when they need to recall something like a phone number, address, or secret detail. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    CyberScraper 2077

    CyberScraper 2077

    A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

    CyberScraper 2077 is not just another web scraping tool – it's a glimpse into the future of data extraction. Born from the neon-lit streets of a cyberpunk world, this AI-powered scraper uses OpenAI, Gemini and LocalLLM Models to slice through the web's defenses, extracting the data you need with unparalleled precision and style.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Alluxio

    Alluxio

    Open Source Data Orchestration for the Cloud

    Alluxio is the world’s first open source data orchestration technology for analytics and AI for the cloud. It bridges the gap between computation frameworks and storage systems, bringing data from the storage tier closer to the data driven applications. This enables applications to connect to numerous storage systems through a common interface. It makes data local, more accessible and as elastic as compute.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    PostHog

    PostHog

    PostHog provides open-source web & product analytics

    PostHog is an all‑in‑one open‑source platform for product and web analytics—offering event-based analytics, session recording, feature flagging, A/B testing, cohorts, and more—that you can self‑host, with full support for data privacy and enterprise compliance. Sync data from external tools like Stripe, Hubspot, your data warehouse, and more. Query it alongside your product data. Run custom filters and transformations on your incoming data. Send it to 25+ tools or any webhook in real time or...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 14
    KaraKeep

    KaraKeep

    A self-hostable bookmark-everything app

    KaraKeep is a self-hostable “bookmark everything” application that lets users save and organize links, notes, images, and documents in one place with rich metadata and AI-assisted tagging. Built with a focus on self-hosting and personal control, Karakeep integrates full-text search and AI-based automatic tagging and summarization (supporting local models like Ollama) to quickly make large collections navigable. Users can group saved items into lists, collaborate with others, and organize content with custom tags and filters. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    UTMStack

    UTMStack

    Customizable SIEM and XDR powered by Real-Time correlation

    Welcome to the UTMStack open-source project! UTMStack is a unified threat management platform that merges SIEM (Security Information and Event Management) and XDR (Extended Detection and Response) technologies. Our unique approach allows real-time correlation of log data, threat intelligence, and malware activity patterns from multiple sources, enabling the identification and halting of complex threats that use stealthy techniques. UTMStack stands out in threat prevention by surpassing the...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 16
    Playwright Skill for Claude Code

    Playwright Skill for Claude Code

    Claude Code Skill for browser automation with Playwright

    Playwright Skill is an open-source plugin designed for Claude Code that enables dynamic browser automation using Playwright through natural language instructions. The tool allows an AI agent to generate, execute, and manage browser automation scripts on demand, rather than relying on predefined workflows or static test scripts. It is structured as a modular skill within the Claude ecosystem, meaning it can be installed as a plugin and invoked automatically when browser automation tasks are required. The system supports a wide range of use cases, including testing web applications, validating user interfaces, automating workflows, and extracting data from websites. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    yudao-cloud

    yudao-cloud

    New Cloud version of Ruoyi-Vue-Pro optimized to refactor all features

    yudao-cloud is the cloud-native evolution of the popular ruoyi-vue-pro backend system, rebuilt around Spring Cloud Alibaba and a microservice architecture. It delivers a full-stack solution that combines a Spring-based backend, MyBatis Plus for data access, and a Vue + Element-based admin front-end, along with user-facing mini-programs. The system targets enterprise scenarios and includes modules for RBAC-based dynamic permissions, multi-tenant SaaS capabilities, data permissions, and workflow/process engines. On top of the core platform, it provides integrated subsystems such as third-party login, payment, SMS, e-commerce, CRM, ERP and even AI large-model integrations. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Scrapling

    Scrapling

    An adaptive Web Scraping framework

    ...The framework includes advanced fetchers capable of bypassing anti-bot protections such as Cloudflare Turnstile using stealth and browser automation techniques. Its powerful spider system supports multi-session crawling, pause and resume functionality, and real-time streaming of scraped data. Scrapling combines high performance, memory efficiency, and extensive async support to deliver blazing-fast scraping workflows. With a developer-friendly API, CLI tools, MCP server integration for AI-assisted extraction, and Docker support, it offers a complete solution for modern web scrapers.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    TinyGSM

    TinyGSM

    A small Arduino library for GSM modules, that just works

    A small Arduino library for GSM modules that just works. This library is easy to integrate with lots of sketches that use Ethernet or WiFi. PubSubClient (MQTT), Blynk, HTTP Client, and File Download examples are provided. Arduino GSM library uses 15868 bytes (49%) of Flash and 1113 bytes (54%) of RAM in a similar scenario. TinyGSM also pulls data gently from the modem (whenever possible), so it can operate on very little RAM. Now, you have more space for your experiments. TCP (HTTP, MQTT,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    AppFlowy

    AppFlowy

    Bring projects, wikis, and teams together with AI.

    AppFlowy is an AI collaborative workspace where you can achieve more without losing control of your data. It is the best open source alternative to Notion, offering a 100% offline mode and self-hosting with a cloud service of your choice. Build a centralized workspace for your wiki, projects, and notes with AppFlowy. It allows you to organize and visualize your data in tables, Kanban boards, calendars, and more.
    Downloads: 52 This Week
    Last Update:
    See Project
  • 21
    UC Browser

    UC Browser

    Faster, safer, more private browser with VPN

    ...UC Browser automatically detects online videos for quick one-click downloads from websites and social platforms. It offers a generous 20GB of free cloud storage for backing up and syncing your data securely. The browser includes AI-driven translation, supporting multiple languages and search engines for easy navigation. Since its launch in 2004, UC Browser has grown to over 1 billion users worldwide, becoming one of the top mobile browsers globally. Its user-friendly interface and innovative tools make it a popular choice for fast and efficient web browsing on mobile devices.
    Downloads: 39 This Week
    Last Update:
    See Project
  • 22
    Catbird Linux

    Catbird Linux

    Linux for content creation, web scraping, coding, and data analysis.

    Catbird Linux is a USB pluggable Live Linux operating system built for media creation, web scraping, and software coding. It is the daily driver you want for retrieving data, making videos or podcasts, and making software tools to automate the repetitive tasks. It is ready for work in Python, Lua, and Go languages, with numerous packages for web scraping or downloading data via API calls. Using Catbird Linux, it is possible to accomplish in depth stock market analysis, track weather...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23

    uweb browser: unlimited power

    minimal suckless android web browser with unlimited power

    ...and .js files as commands). - user-defined site-specific JS/CSS/HTML/preprocessing. - Online play/preview/preprocess for downloadable resources. - Multiple type profiles: switch any data including logins/config orthogonally - web automation, crontab (alarm clock)
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    NSFW Filter

    NSFW Filter

    Google Chrome extension that blocks NSFW images

    A Google Chrome extension that blocks NSFW images from the web pages that you load using TensorFlow JS. NSFW Filter web extension blocks NSFW content using AI. NSFW Filter allows you to block inappropriate, Not-Safe-For-Work content, protecting you online. A browser extension that blocks NSFW images from the web pages that you load using TensorFlowJS. When a web page is loaded, all the images remain hidden until they are found to be NSFW or not. If they are found to be NSFW, they remain...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    scraper-with-chatgpt
    It is a powerful data scraping tool that helps you extract information from various online sources. Easily collect data from Google SERP, Maps, Shopify, Zillow, and more. With a user-friendly interface, you can scrape and save data in JSON or Excel formats. Unlock insights from the web effortlessly with scrape-it.cloud API.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB