Showing 11 open source projects for "extraction"

  • 1
    HeadlessX

    The undetected self-hosted browser automation platform

    ...One of the platform’s goals is to bypass common bot-detection systems by implementing advanced fingerprint spoofing and stealth techniques. The tool can perform tasks such as HTML extraction, screenshot generation, content parsing, and search result scraping while appearing like a normal user browser. Because it is self-hosted, organizations can run the platform on their own infrastructure to maintain privacy and control over automation workflows.
    Downloads: 5 This Week
  • 2
    BrowserOS

    Agentic browser; privacy-first alternative to ChatGPT Atlas

    BrowserOS is an open-source, agentic web browser built on a Chromium base that integrates AI agents directly into the browsing experience. Rather than offering only standard browsing, it places AI at the core: you can connect your own API keys (e.g., for OpenAI, Anthropic, or Google Gemini) or run local models (e.g., via Ollama) so that your browsing data and automation stay on your machine; privacy and control are emphasized throughout. The interface remains familiar to users of...
    Downloads: 35 This Week
  • 3
    Symfony Panther

    A browser testing and web crawling library for PHP and Symfony

    Symfony Panther is a browser testing and web scraping tool that allows developers to interact with websites programmatically. It uses headless Chrome or Firefox to automate browser tasks, making it suitable for end-to-end testing and data extraction. Panther integrates well with Symfony and PHPUnit, allowing developers to write comprehensive tests for web applications.
    Downloads: 1 This Week
  • 4
    Kuma UI

    A Headless, Utility-First, and Zero-Runtime UI Component Library

    Kuma UI is an open-source styling and component library that focuses on providing a headless, utility-first approach to building modern web interfaces. The framework emphasizes performance by extracting CSS at build time, allowing developers to create fast websites without requiring runtime styling engines in the browser. By combining utility-first styling with headless component patterns, Kuma UI allows developers to fully customize visual appearance while relying on reusable component...
    Downloads: 10 This Week
  • 5
    Browserless

    The headless Chrome/Chromium driver on top of Puppeteer

    Browserless is an open-source headless browser automation library and service built on top of Puppeteer that simplifies the process of running and scaling Chromium-based browser tasks in production environments. It provides a high-level API for interacting with headless Chrome, allowing developers to perform operations such as generating PDFs, capturing screenshots, extracting text or HTML, and automating web navigation. The project is designed to act as a production-ready abstraction layer...
    Downloads: 6 This Week
  • 6
    OmniPull

    Just pull anything

    OmniPull is a powerful, cross-platform download manager built with Python and PySide6. It provides a modern, intuitive interface for managing downloads with advanced features like multi-threading, queue management, and media extraction.
    Downloads: 7 This Week
  • 7
    Web Spider, Web Crawler, Email Extractor

    Free tool that extracts emails, phone numbers, and custom text from the web using Java regex

    ...Extracts information from the web by parsing millions of pages. Data is stored in a Derby or MySQL database and is not lost if the spider is force-closed. Features: free web spider, parser, extractor, and crawler; extraction of emails, phone numbers, and custom text from the web; export to an Excel file; data saved into a Derby database; written in Java and cross-platform. See also the Free Email Sender: https://sourceforge.net/projects/gitst-free-email-ender/ To start the application, first install Microsoft OpenJDK: https://www.microsoft.com/openjdk
    Downloads: 0 This Week
  • 8
    OCR Web based

    Web-based OCR for the Firefox browser and PC

    ...id=com.ulm.ocr Add-on for Opera: http://bit.ly/1F0E0wP Release 1.0.1: For safety reasons, importing an image from a URL has been disabled. You can also write or draw directly on the canvas and then run character recognition and text extraction on the result.
    Downloads: 0 This Week
  • 9
    Turbo Download Manager

    A portable modern multi-threading download manager for all platforms

    A modern multi-threaded download manager for Windows, Linux, Mac OS, Firefox, Chrome, Opera, and Android devices. For bug reports, visit https://github.com/inbasic/turbo-download-manager/issues For FAQs, visit http://add0n.com/turbo-download-manager.html Turbo Download Manager is a stand-alone application without any dependencies. It should run out of the box. Just set the download location when adding the first job request. If you have a browser and would like to integrate this...
    Downloads: 86 This Week
  • 10
    Please visit http://imgv.sf.net/ - IMGV is a cross-platform image viewer. Features include slideshows, EXIF viewing, histograms, gamma correction, adjustable thumbnails, playlists, website image extraction, multi-directory loading, movies, and much more.
    Downloads: 1 This Week
  • 11
    Marquee Plus
    This is an add-on plugin/module for WordPress and Joomla that helps you place marquees in your WordPress blog or Joomla site. Visit http://www.etdsonline.com/technical/plugins-and-modules for more information on these plugins and modules.
    Downloads: 0 This Week