Showing 774 open source projects for "extraction"

  • 1

    Ventoy

    A New Bootable USB Solution

    Ventoy is an open source tool to create a bootable USB drive for ISO/WIM/IMG/VHD(x)/EFI files. With Ventoy, you don't need to format the disk over and over; you just copy the ISO/WIM/IMG/VHD(x)/EFI files to the USB drive and boot them directly. You can copy many files at a time and Ventoy will give you a boot menu to select them. You can also browse ISO/WIM/IMG/VHD(x)/EFI files in local disks and boot them. x86 Legacy BIOS, IA32 UEFI, x86_64 UEFI, ARM64 UEFI and MIPS64EL...
    Downloads: 210,399 This Week
  • 2

    cli.renamed.to

    AI-powered CLI for smart file renaming, document extraction, and PDF splitting

    Downloads: 0 This Week
  • 3

    librsb

    A shared memory parallel sparse matrix library including Sparse BLAS.

    ...The most common operations needed by iterative solvers are available, e.g.: matrix-vector multiplication, triangular solution, row/column scaling, diagonal extraction / setting, block extraction, norm computation, format conversion. The RSB format is especially well suited for symmetric and transposed multiplication variants. On these variants, librsb has been found to be faster than Intel MKL's implementation for CSR. Most numerical kernel code is auto-generated, and the supported numerical types can be chosen by the user at build time. ...
    Downloads: 106 This Week
  • 4

    pwwAutoVision

    Non-standard automated vision software

    The only software that integrates vision and motion, as well as interfaces and reports, is developed with zero code and comes with online debugging. Link: https://pan.baidu.com/s/1vsTptn_pvtbK2sDhWVCZJg code: 1234
    Downloads: 0 This Week
  • 5

    iX

    Autonomous GPT-4 agent platform

    IX is a platform for designing and deploying autonomous and [semi]-autonomous LLM-powered agents and workflows. IX provides a flexible and scalable solution for delegating tasks to AI-powered agents. Agents created with the platform can automate a wide variety of tasks while running in parallel and communicating with each other.
    Downloads: 7 This Week
  • 6

    java-pdf-table-extractor-lib

    Java Pdf Table extraction library

    The command line application is an example of usage of the Java library. The library is based on the pdfbox library and works by examining the layout of each selected PDF page and looking for table structure patterns. After calling the library (passing the PDF filename and the page range), the result is a List<PdfTextElement>. PdfTextElement is an interface that has two implementations: a basic text element (outside the tables) and PdfTextTabulaElement, for table structures. That...
    Downloads: 1 This Week
  • 7

    GeoDMA

    Geographic feature extraction and data mining

    GeoDMA is a plugin for the TerraView software, used for geographical data mining. With a single image, the user can perform segmentation, attribute extraction, normalization and classification.
    Downloads: 2 This Week
  • 8
    Radio Transcription Tool v3.1

    A professional Python application for recording and transcribing Dutch and Belgian radio streams using OpenAI Whisper API, with advanced keyword extraction powered by KeyBERT.

    🎯 Features
    - Live Radio Recording: Record streams from 40+ Dutch and Belgian radio stations
    - Live Stream Listening: Listen to radio streams without recording
    - AI Transcription: High-quality transcription using OpenAI Whisper API
    - Smart Keyword Extraction: Advanced phrase analysis with KeyBERT
    - Professional UI: Modern Tkinter interface with Bluvia branding
    - Organized Output: Timestamped folders with MP3 recordings and transcriptions
    - API Key Management: Built-in OpenAI API key configuration
    Downloads: 0 This Week
  • 9

    SunnyCapturer

    A simple and beautiful cross-platform screenshot software

    SunnyCapturer is a simple and beautiful cross-platform screenshot software tool that supports OCR to extract text from images, image translation, and pinning images to the screen, among other features.
    Downloads: 19 This Week
  • 10

    WFDownloader App

    Free batch downloader for image, wallpaper, video, audio, document,

    ...Also use it to download sequential website URLs that follow a certain pattern (e.g. image01.png to image100.png). You can also use the app's built-in site crawler for advanced link search or extraction. There is also special support for forum media and open directory downloading. It's a programmable downloader and also works with password-protected sites. Say goodbye to downloading one by one. Go to the Help menu or check out the website to get started. Note that this cross-platform version requires Java (minimum version Java 8) to be installed on your operating system. ...
    Downloads: 451 This Week
  • 11

    FFmpeg Batch AV Converter

    Free all in one audio/video ffmpeg batch encoder

    FFmpeg Batch AV Converter is a free universal audio and video encoder for Windows and Linux (via Wine) that lets you use the full potential of the ffmpeg command line with a few mouse clicks, in a convenient GUI with drag and drop and progress information. Some fancy wizards make things easy for non-experts. Thanks to its multi-file encoding feature, it may be the fastest a/v batch encoder available, since it maximizes system resources usage by launching as many simultaneous processes up to...
    Downloads: 2,872 This Week
  • 12

    rdmxtractor

    Pyradiomics extractor GUI utility

    This utility extracts radiomics features from a list of volumes and segmentations. It saves the result in a JSON file, can optionally save directly to a MongoDB database, and helps to automate the extraction task.
    Downloads: 0 This Week
  • 13

    chessPDFBrowser

    Chess application which allows working with chess PDF books and PGNs.

    Chess application which allows working with PDFs and PGNs. You can work with the chess games of the PDF and edit their tree of variants. Graphical environment. Standard PGN tags. PGN comments. OCR-like FEN string detection from chess board position images. Connection to UCI chess engines (like Stockfish). Position analysis, full game analysis. You can now play games against UCI engines. pdf2pgn command line command included. Detailed documentation. Multilanguage...
    Downloads: 46 This Week
  • 14

    Datapipe

    Real-time, incremental ETL library for ML with record-level depend

    Datapipe is a real-time, incremental ETL library for Python with record-level dependency tracking. Datapipe is designed to streamline the creation of data processing pipelines. It excels in scenarios where data is continuously changing, requiring pipelines to adapt and process only the modified data efficiently. This library tracks dependencies for each record in the pipeline, ensuring minimal and efficient data processing.
    Downloads: 174 This Week
  • 15

    CERCA

    CERCA – Citation Extraction & Reference Checking Assistant

    CERCA is an open-source research tool that supports the verification of bibliographic references in scientific manuscripts. It extracts references from PDF files and checks their existence and consistency against authoritative metadata sources, producing explainable diagnostics, audit logs, and reproducible reports. It is intended for:
    - Researchers performing final manuscript checks
    - Reviewers assessing reference consistency
    - Editors supporting editorial quality control
    -...
    Downloads: 14 This Week
  • 16

    OmniPull

    Just pull anything

    OmniPull is a powerful, cross-platform download manager built with Python and PySide6. It provides a modern, intuitive interface for managing downloads with advanced features like multi-threading, queue management, and media extraction.
    Downloads: 14 This Week
  • 17
    ai-scrapper
    🚀 Discover AI Web Scraper! 🚀 Tired of copying and pasting data from websites? I developed a desktop application with Electron and Gemini AI to extract structured data easily and efficiently! 🤖✨
    Downloads: 5 This Week
  • 18

    ttf-mscorefonts-installer

    Installer for Microsoft TrueType core fonts

    ttf-mscorefonts-installer is a Debian package that simplifies the installation of Microsoft TrueType Core Fonts on Debian-based systems. It provides an automated way to download and install widely used web fonts such as Arial, Times New Roman, and Verdana that are not shipped directly with Debian due to licensing restrictions. The package fetches the official font files from Microsoft over the internet and installs them in a system-compatible format. It is commonly used to ensure document...
    Downloads: 42 This Week
  • 19

    modnlp

    Modular Suite of NLP Tools

    ...It provides an API and tools for (inverted) indexing, storage and retrieval of large amounts of text, with (XML-based) handling of meta-data; tools for text categorisation, including functionality for XML parsing, term set reduction (and basic keyword extraction), probabilistic classifier induction, sample classification tools, and evaluation modules; and a suite of corpus management, curation and distributed access tools. If you use the tool please consider referencing it using the following article: Luz, S., & Sheehan, S. (2020). Methods and visualization tools for the analysis of medical, political and scientific concepts in Genealogies of Knowledge. ...
    Downloads: 7 This Week
  • 20

    YoungerSibling

    YoungerSibling: Cross-platform OSINT tool for quick data gathering.

    YoungerSibling is a Python-based terminal utility script designed for educational purposes. It provides a set of useful tools to perform tasks like searching the web, performing lookups (Google search, IP lookup, username lookup, etc.), and extracting metadata from images, directly from the terminal. This project aims to help students, developers, and hobbyists learn about web scraping, API usage, and terminal interaction with Python.
    Downloads: 10 This Week
  • 21

    Baymax toOls for x64dbg

    Feature code extraction and search plug-in for x64dbg

    BaymaxTools is a feature code extraction and search plug-in for the x64dbg debugger. It makes it convenient to extract the signature of a specified instruction and to find the number of occurrences of that signature in the searched module.
    Downloads: 2 This Week
  • 22

    LabPlot

    Data Visualization and Analysis

    LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.
    Downloads: 39 This Week
  • 23

    NeuroSuite

    Klusters, NeuroScope and NDManager

    ...NDManager (Neurophysiological Data Manager) is a simple graphical application designed to help neurophysiologists manage their experimental recording parameters (e.g., number of channels and sampling rate of the acquisition system) and process their data (data filtering, spike extraction, PCA, etc.)
    Downloads: 25 This Week
  • 24
    file-recovery
    File Recovery helps you extract JPEG, PNG, PDF, ZIP, and GIF data by scanning for file signatures (carving), or recover deleted files from a FAT32 volume image using the filesystem and FAT. A small Qt interface wraps the same command-line engine for easier use on Windows, macOS, and Linux.
    Downloads: 7 This Week
  • 25

    Digital Forensics Guide

    Learn all about Digital Forensics and Computer Forensics

    The Digital Forensics Guide repository is a comprehensive, structured reference for investigators, analysts, students, and cybersecurity professionals interested in digital forensics principles, tools, methodologies, and workflows. It organizes foundational topics such as evidence acquisition, disk and memory analysis, file system structures, network forensics, artifact extraction, timeline generation, and reporting into digestible modules that help build core competency. Alongside conceptual explanations, the guide includes practical examples with widely used tools (like Autopsy, Volatility, Sleuth Kit, and network analysis suites), illustrating how investigations proceed from initial data capture to final analysis. ...
    Downloads: 7 This Week