Showing 24 open source projects for "extraction"

  • 1
    newpipeextractor

    Library for extracting streaming site data without official APIs

    NewPipeExtractor is an open source Java library designed to extract data from streaming platforms by analyzing their web interfaces instead of relying on official APIs. It serves as the core extraction component used by the NewPipe Android application, but it is built as a standalone library that can also be integrated into other software projects. NewPipeExtractor provides a unified framework for retrieving information such as video streams, playlists, channels, and search results from supported streaming services. It handles many low-level tasks involved in web data extraction, including parsing responses, managing platform-specific logic, and handling errors, allowing developers to focus on implementing application features rather than scraping mechanics. ...
    Downloads: 5 This Week
  • 2
    Mobile Next

    Model Context Protocol Server for Mobile Automation and Scraping

    ...It abstracts away platform-specific complexities, allowing developers and AI agents to interact with mobile devices using a consistent set of commands regardless of operating system. The system supports real devices, emulators, and simulators, making it suitable for testing, automation, and data extraction workflows in diverse development setups. One of its key innovations is its hybrid interaction model, which combines structured accessibility data with fallback screenshot-based analysis to ensure reliable automation even in complex UI scenarios. It is built to integrate seamlessly with modern AI agents, enabling multi-step workflows such as automated testing, form filling, and user journey simulation.
    Downloads: 0 This Week
  • 3
    X-osint

    Open source OSINT tool for gathering data on emails, phones, and IPs

    X-osint is an open source intelligence framework designed to collect and analyze publicly available information from multiple sources. It focuses on gathering useful and credible data about entities such as phone numbers, email addresses, and IP addresses using a range of automated OSINT techniques. It provides investigators and researchers with a centralized interface for running information-gathering tasks that would normally require multiple separate tools. X-osint can also perform...
    Downloads: 34 This Week
  • 4
    Droidrun

    Powerful framework for controlling Android and iOS devices

    Droidrun is a native mobile agent platform that gives users natural-language control over real Android devices to automate any mobile app workflow, from logins and bookings to purchases and data extraction, including access to mobile-only content behind app logins, rate limits, or platform restrictions. Its cloud offering lets users spin up agents in seconds with preinstalled apps, run tasks in parallel across multiple devices, and compose complex, multi-step conditional workflows using conversational commands; recorded workflows can be auto-replayed at high speed. ...
    Downloads: 9 This Week
  • 5
    Recorder

    HTML5 JavaScript audio recording in mp3, wav, ogg, webm, and amr formats

    Supports microphone recording and real-time processing in most mobile and PC browsers that implement getUserMedia, mainly including Chrome, Firefox, Safari, iOS 14.3+, Android WebView, Tencent Android X5 kernel (QQ, WeChat, Mini Program WebView), uni-app (App, H5), and the built-in browsers of most Android phones updated after 2021. Not supported: UC-based kernels (typically Alipay), the built-in browsers of most older domestic mobile phones that have not been updated, and any other...
    Downloads: 7 This Week
  • 6
    Emerge

    Browser-based interactive codebase and dependency visualization tool

    Emerge (or emerge-viz) is an interactive code analysis tool to gather insights about source code structure, metrics, dependencies, and complexity of software projects. You can scan the source code of a project, calculate metric results and statistics, generate an interactive web app with graph structures (e.g. a dependency graph or a filesystem graph), and export the results in some file formats. Emerge currently has parsing support for the following languages: C, C++, Groovy, Java,...
    Downloads: 1 This Week
  • 7
    MiniCPM-o

    A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming

    MiniCPM-o 2.6 is a cutting-edge multimodal large language model (MLLM) designed for high-performance tasks across vision, speech, and video. Capable of running on end-side devices such as smartphones and tablets, it provides powerful features like real-time speech conversation, video understanding, and multimodal live streaming. With 8 billion parameters, MiniCPM-o 2.6 surpasses its predecessors in versatility and efficiency, making it one of the most robust models available. It supports...
    Downloads: 0 This Week
  • 8
    ffsend

    Easily and securely share files from the command line

    Easily and securely share files and directories from the command line through a safe, private and encrypted link using a single simple command. Files are shared using the Send service and may be up to 1GB. Others are able to download these files with this tool, or through their web browser. All files are always encrypted on the client, and secrets are never shared with the remote host. An optional password may be specified, and a default file lifetime of 1 (up to 20) download or 24 hours is...
    Downloads: 1 This Week
  • 9
    shuyuan

    Reading book source

    shuyuan is a project oriented around reading and knowledge consumption, especially targeting large-scale text content such as books, articles, or educational material. The name suggests “academy” or “study hall,” and the tool aims to help users ingest, organize, and manage reading content — possibly offering features like text parsing, annotation, metadata generation, translation, or storage for later reference. The repository is set up to support document ingestion, indexing, and maybe some...
    Downloads: 0 This Week
  • 10
    Ghost Commander
    Dual-panel Android file manager (like Norton or Total Commander). Besides /sdcard, it works with FTP servers, Windows (Samba) network shares (with a plug-in), ZIP archives, etc. Many more features and customization settings are available. Feel free to add a feature request here: http://tinyurl.com/gc-user Report bugs and issues: http://tinyurl.com/gc-issue
    Downloads: 117 This Week
  • 11

    YoungerSibling

    YoungerSibling: Cross-platform OSINT tool for quick data gathering.

    YoungerSibling is a Python-based terminal utility script designed for educational purposes. It provides a set of useful tools to perform tasks like searching the web, performing lookups (Google search, IP lookup, username lookup, etc.), and extracting metadata from images, directly from the terminal. This project aims to help students, developers, and hobbyists learn about web scraping, API usage, and terminal interaction with Python.
    Downloads: 7 This Week
  • 12
    Cyphernight-MAX SEC
    ...Unique features include per-chunk authentication, single-use uniqueness markers, encrypted key vaults, forced external archive loading for safe inspection of ZIP/RAR/ISO files, universal forensic archive extraction, and steganography (LSB hiding of files inside images). Additional highlights: a serverless P2P Wormhole for local network file transfer and customizable UI themes (Dark, Light, Matrix). support@dexmond.com
    Downloads: 1 This Week
  • 13
    MBR Bulk WP Detector

    A free WP plugin that lets you check unlimited URLs

    MBR Bulk WP Detector is a free WordPress plugin that lets you check unlimited URLs right from your own dashboard. No subscriptions, no URL limits, and your data stays completely private on your server. What Can You Do With It? The basics are simple: Paste a list of URLs (or upload a CSV file), click a button, and boom—you’ve got a clear breakdown of which sites are running WordPress and which aren’t. But it gets better… Turn on Deep Scan mode, and you’ll also discover what...
    Downloads: 0 This Week
  • 14
    SmartOpenCV

    OpenCV enhancement library for Android

    SmartOpenCV is an enhanced computer vision toolkit built on top of OpenCV that aims to simplify and extend common image processing and computer vision tasks through higher-level abstractions and utilities. It provides pre-built modules and optimized pipelines for tasks such as object detection, image transformation, and feature extraction, reducing the need for low-level implementation. The framework is designed to be more developer-friendly than raw OpenCV by offering cleaner APIs and reusable components that accelerate development. It supports integration with machine learning models, enabling hybrid workflows that combine traditional computer vision techniques with modern AI approaches. ...
    Downloads: 0 This Week
  • 15
    Simd

    High performance image processing library in C++

    The Simd Library is a free open source image processing library designed for C and C++ programmers. It provides many useful high performance algorithms for image processing, such as pixel format conversion, image scaling and filtration, extraction of statistical information from images, motion detection, object detection (HAAR and LBP classifier cascades) and classification, and neural networks. The algorithms are optimized using different SIMD CPU extensions. In particular, the library supports the following CPU extensions: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX-512 for x86/x64; VMX (Altivec) and VSX (Power7) for PowerPC; NEON for ARM. ...
    Downloads: 15 This Week
  • 16

    Semantic Assistants

    Natural Language Processing (NLP) for the Masses

    Semantic Assistants support users in content retrieval, analysis, and development, by offering context-sensitive NLP services directly integrated in standard desktop clients, like a word processor, and web information systems, like a wiki.
    Downloads: 0 This Week
  • 17
    Turbo Download Manager

    A portable modern multi-threading download manager for all platforms

    A modern multi-threaded download manager for Windows, Linux, Mac OS, Firefox, Chrome, Opera and Android devices. For bug reports visit: https://github.com/inbasic/turbo-download-manager/issues For FAQs visit: http://add0n.com/turbo-download-manager.html Turbo Download Manager is a stand-alone application without any dependencies. It should run out of the box. Just set the download location while adding the first job request. If you have a browser and would like to integrate this...
    Downloads: 86 This Week
  • 18
    CaesarIA (openCaesar3)

    Remake of the classic Caesar III game

    CaesarIA is an open source remake of the game Caesar III, released by Impressions Games in 1998. It aims to expand the possibilities of classic city-building simulators and to add new features showing city life. The game now runs on Windows, Linux, Mac, Haiku and Android. The original Caesar3 game is needed to play openCaesar3. We have started a crowdfunding campaign to renew the graphics set at www.igg.me/at/caesaria-game. Help us set a new stage for good old Caesar.
    Downloads: 51 This Week
  • 19
    android-activity-miner

    Activity-Miner for Android

    A mobile application to create accelerometer-based activity recognition models directly on the phone. Configuring the segmentation and feature extraction process chain requires expert knowledge. The prototype was developed in 2012 as part of a bachelor thesis at the University of Kassel and was optimized and enhanced for an experiment in 2015.
    Downloads: 0 This Week
  • 20
    ePUBator

    Minimal offline PDF to ePUB converter for Android

    Minimal offline PDF to ePUB converter for Android - ©2011 Ezio Querini. ePUBator extracts text from a PDF file and puts it in a well-formed (epubcheck compliant) ePUB file. PDF extraction is based on the iText library <http://itextpdf.com/> released under the AGPL license. - ePUBator IS DESIGNED FOR BOOKS (NOT FOR EVERY TYPE OF PDF); IF YOU NEED A BETTER RESULT, TRY SOMETHING ELSE LIKE CALIBRE. - ePUBator doesn't need an internet connection (doesn't send your docs somewhere on the net, doesn't have ads). - ePUBator extracts text (no text from pictures or from raster PDF)...
    Downloads: 31 This Week
  • 21

    ezSIFT

    An easy-to-use standalone SIFT library written in C/C++

    Updated 06/28/2018: The ezSift project has moved to https://github.com/robertwgh/ezSIFT. The SIFT (scale-invariant feature transform) algorithm is considered to be one of the most robust local feature detection and description methods. Most of the open-source SIFT implementations rely on some 3rd-party libraries. Some of them even rely...
    Downloads: 0 This Week
  • 22

    Handwritten Signature SDK

    Java SDK for developing handwritten signature verification apps

    Handwritten Signature SDK supports dynamic time warping (DTW), extended regression (ER), signature sampling, direction hash, and feature extraction; see the wiki for more info.
    Downloads: 2 This Week
  • 23
    HotelsReservations

    Android Application Integrated with a Web Application (Beta Version)

    Android application for hotel reservations in Lebanon (rooms, suites) that exchanges data with a web application using PHP/MySQL.
    Downloads: 1 This Week
  • 24
    SparePartsShop

    Android Application Integrated with a Web Application (Beta Version)

    Android application for a spare parts shop that exchanges data with a web application using PHP/MySQL.
    Downloads: 0 This Week