Showing 52 open source projects for "extraction"

View related business solutions
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    chrome-cdp

    chrome-cdp

    Give your AI agent access to your live Chrome session

    chrome-cdp-skill is a specialized integration that enables AI agents to control and interact with web browsers through the Chrome DevTools Protocol (CDP). It allows agents to perform tasks such as navigating pages, extracting data, interacting with elements, and executing scripts in a browser environment. The project is designed to extend the capabilities of AI systems beyond static knowledge by giving them real-time access to web content and interactive interfaces. Its architecture likely...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Open Semantic Search

    Open Semantic Search

    Open source semantic search and text analytics for large document sets

    Open Semantic Search is an open source research and analytics platform designed for searching, analyzing, and exploring large collections of documents using semantic search technologies. It provides an integrated search server combined with a document processing pipeline that supports crawling, text extraction, and automated analysis of content from many different sources. Open Semantic Search includes an ETL framework that can ingest documents, process them through analysis steps, and enrich the data with extracted information such as named entities and metadata. It also supports optical character recognition to extract text from images and scanned documents, including images embedded inside PDF files. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    spider_collection

    spider_collection

    Collection of Python web scraping scripts for data extraction tasks

    spider_collection is a collection of Python web crawler scripts created primarily for experimentation, learning, and practical scraping tasks. spider_collection gathers multiple independent spiders designed to collect data from different platforms and services, demonstrating a variety of scraping techniques and workflows. These crawlers make use of common Python scraping tools such as requests, parsel, BeautifulSoup, and the Scrapy framework to extract structured information from web pages....
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    MCiSEE

    MCiSEE

    All of Minecraft, EASILY get Minecraft resources

    MCiSEE is an open-source project designed to integrate Minecraft with computer vision and artificial intelligence experiments. The system focuses on capturing visual information from the game environment and exposing it to external programs for analysis or machine learning research. By converting gameplay data into visual or structured formats, MCiSEE enables researchers and developers to build AI agents capable of interacting with the Minecraft environment. The project can be used as a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    browserable

    browserable

    Open source and self-hostable browser automation library for AI agents

    Browserable is an open-source browser automation framework designed specifically for AI agents that need to interact with web interfaces in a human-like way. The project provides tools that allow automated agents to navigate websites, click buttons, fill out forms, and extract information from pages without manual scripting of each step. Built primarily in JavaScript, the framework offers both a developer-friendly SDK and a REST API that allow integration with AI applications and automation...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    matcik svg editor

    matcik svg editor

    svg editor: Easily create documents for math, physics, chemistry...

    Easily create documents for math, physics, chemistry, biology and other sciences... Book, exam and activity preparation program Easily import application vectors such as geogebra, chart, mathlive, ckeditor, desmos, into your document, or import vectors from pdf files into your document, save and rearrange your file. if you want You can use the documents you have prepared on your website or convert them to pdf. Matcik advanced online svg editor. Matematik, fizik, kimya, biyoloji ve...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    MBR Bulk WP Detector

    MBR Bulk WP Detector

    A free WP plugin that lets you check unlimited URLs

    MBR Bulk WP Detector is a free WordPress plugin that lets you check unlimited URLs right from your own dashboard. No subscriptions, no URL limits, and your data stays completely private on your server. What Can You Do With It? The basics are simple: Paste a list of URLs (or upload a CSV file), click a button, and boom—you’ve got a clear breakdown of which sites are running WordPress and which aren’t. But it gets better… Turn on Deep Scan mode, and you’ll also discover what...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    AdBlock Classic

    AdBlock Classic

    AdBlock Classic blocks ads in K-Meleon.

    ...AdBlock Classic supports AdBlock Plus filters and rules (including AdBlock Locus's Fallback subscription). The extension is made available under the MPL 2.0. For source code, extract the xpi with any extraction tool and open the files in any text editor. AdBlock Classic is distributed freely, as-is, and without warranty. That said, you're welcome to visit the K-Meleon forums if you have questions or issues: http://kmeleonbrowser.org/forum/read.php?19,155958
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Exifr

    Exifr

    The fastest and most versatile JS EXIF reading library

    Exifr is a fast and very versatile JavaScript EXIF reading library that works everywhere, parses everything and handles just about anything you throw at it. It can handle any input: buffers, url, <img> tag and more; .jpg, .tif, and .heic files; and TIFF (EXIF, GPS, etc.), XMP, ICC, IPTC, JFIF segments. It skips parsing tags you don’t need, and reads only the first few bytes. There’s no need to read the whole file to see if there’s an EXIF file in it, or extract all the data when you just...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    Laravel Mix

    Laravel Mix

    The power of webpack, distilled for the rest of us

    Laravel Mix provides a clean, fluent API for defining basic webpack build steps for your applications. Mix supports several common CSS and JavaScript pre-processors. If you've ever been confused about how to get started with module bundling and asset compilation, you will love Laravel Mix! Webpack is an incredibly powerful module bundler that prepares your JavaScript and assets for the browser. The only understandable downside is that it requires a bit of a learning curve. In an effort to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    NLP.js

    NLP.js

    An NLP library for building bots

    ...NLP Manager, a tool able to manage several languages, the Named Entities for each language, the utterances, and intents for the training of the classifier, and for a given utterance return the entity extraction, the intent classification and the sentiment analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Marathon -GUI Test Runner Web, Swing, FX

    Marathon -GUI Test Runner Web, Swing, FX

    Marathon supports testing of Java/Swing and Java/Fx applications.

    ...Currently, Marathon supports JRuby script models for recording the test scripts.Marathon test runner generates Allure test reports.Marathon allows for grouping of test cases.It also has an option of inserting modules while recording(in MarathonITE while recording). It allows tester to inset checklist while recording and also takes screen capture and annotate it in playing the test cases. Extraction of code into modules and inserting it is very easy using marathon.It has different themes of UI which attracts tester to do work on it. Object mapping is quite simple and configuring is very easy for tester to understand and do the work. Debugging of test scripts is also easy and has script console which takes selenium commands. ...
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 13
    JavoueJapasse_gl02

    JavoueJapasse_gl02

    LOGICIEL D'ANALYSE D'EMAILS

    Vous souhaitez analyser un programme permettant d'analyser simplement et efficacement tous vos emails ? Bienvenue chez JavoueJapasse ! Le cabinet de conseil UIConsult souhaite se doter d'un outil d'aide à l'analyse des communications et des expertises à l'intérieur de ses équipes de collaborateurs. L'enjeu est pour les responsables de secteur de pouvoir produire un rapport d'analyse des échanges par emails des collaborateurs, appelé "RCom". Ces rapport sont réalisés à l'échelle...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Easy File Compression

    Easy File Compression

    Client side solution for PC & Browser Firefox to create zip archive

    All client-side solution for PC & Browser Firefox to create zip archive and extract zip and rar archive files with API HTML5 file system. == Browser Extensions == Add-on Firefox: http://mzl.la/1Kd7fiD OS requirements - Windows 7 and later are supported, older operating systems are not supported (and do not work). Both x86 and amd64 (x64) binaries are provided for - - Windows. Please note, the ARM version of Windows is not supported for now. == Installation and Activation == 1....
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    OCR Web based

    OCR Web based

    OCR web based for Browser Firefox & PC

    ...id=com.ulm.ocr ========= Add-on for Opera: http://bit.ly/1F0E0wP ========= Release 1.0.1 For safety reasons, I disabled the possibility to import an image from url. Finally, I wish to inform you that you can write or draw directly on the canvas to get the subsequent character recognition and text extraction
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Turbo Download Manager

    Turbo Download Manager

    A portable modern multi-threading download manager for all platforms

    A modern multi thread download manager for Windows, Linux, Mac OS, Firefox, Chrome, Opera and Android devices. For bug reports visit: https://github.com/inbasic/turbo-download-manager/issues For FAQs visit: http://add0n.com/turbo-download-manager.html Turbo Download Manager is an stand-alone application without any dependencies. It should run out of the box. Just set the download location while adding the first job request. If you have a browser and would like to integrate this...
    Leader badge
    Downloads: 86 This Week
    Last Update:
    See Project
  • 17
    Roomba

    Roomba

    A Node.js tool to examine the correctness of Open Data Metadata

    Linked Open Data (LOD) has emerged as one of the largest collection of interlinked datasets on the web. Benefiting from this mine of data requires the existence of descriptive information about each dataset in the accompanying metadata. Such meta information is currently very limited to few data portals where they are usually provided manually thus giving little or bad quality insights. To address this issue, we propose a scalable automatic approach for extracting, validating and generating...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    cdr-view

    cdr-view

    Webface witch extract CDR (Call Detail Records) from MySQL base.

    Webface witch extract CDR (Call Detail Records ) directly from MySQL base. May be used as SOHO billing system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Pumilio
    Pumilio is a web-based sound analysis and archive system for almost any kind of sound file with tools to see the spectrogram of the sound, select regions for further analysis and insertion in a database, filtering, and many other manipulations.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    Customizable browser based (text/web(WYSIWYG)) file editors environment in PHP (GPL Licensed) with loads of features. (tested only in firefox)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Lioness (Languages Interop Framework)
    Framework for making Windows applications that are one .exe file in AutoHotKey_L,C++,C#, VB.NET,Java,Groovy,Common Lisp,Nemerle,Ruby,Python,PHP,Lua,Tcl,Perl,Jint,S#,WSH VBScript,HTML/JavaScript/CSS,COM, PowerShell without compiling . For .NET 4.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    This project develops a web-based (JSP) Fuzzy Rule-Based Expert System for analyzing ECG (electro cardio gram) signals & diagnosing Tachi-Arrhythmias. Proj. main blocks: inference engine, knowledgebase, KB editor, explanation, and feature extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Web Time Entry
    Web Time Entry is a Web based project time entry and client billing system. It can track, approve, and manage employee time incurred for projects and clients you define. It can schedule projects to employees and track time incurred on project schedul
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    The Cornell Web Lab Collaboration Server is a suite of tools and services for GUI-based extraction, analysis and sharing of archived web data. See http://weblab.infosci.cornell.edu/ and http://www.cs.cornell.edu/~weigel for details about the project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Software for web pages data extraction.
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB