Showing 17 open source projects for "data extraction"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    FModel

    FModel

    Unreal Engine Archives Explorer

    FModel is a freeware application designed to explore Unreal Engine games archives, allowing users to delve into the assets and structures of games developed with Unreal Engine.
    Downloads: 66 This Week
    Last Update:
    See Project
  • 2
    DotnetSpider

    DotnetSpider

    Lightweight .NET framework for fast web crawling and data scraping

    DotnetSpider is a web crawling and data extraction framework built on the .NET Standard platform. It is designed to help developers create efficient and scalable crawlers for collecting structured data from websites. It provides a high-level API that simplifies the process of defining spiders, managing requests, and extracting content from web pages. Developers can create custom spiders by extending base classes and configuring pipelines that handle downloading, parsing, and storing collected data. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    PDFPatcher

    PDFPatcher

    A versatile toolkit for PDF manipulation

    PDFPatcher (aka “PDF补丁丁”) is a versatile toolkit for PDF manipulation—editing document metadata, bookmarks, page layout, content restrictions, rotation, compression, merging/splitting, image extraction, and more, all within an intuitive interface. Merge/split PDFs or images, preserve or add bookmarks, and set page dimensions. Batch style/color/target changes, regex/XPath search/replace, mid‑page positioning. Modify PDF metadata, page numbers, links, initial view mode, and remove open actions.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 4
    LosslessExtract

    LosslessExtract

    Lossless audio extraction tool for Bluray, DVD-Audio, SACD, MKV

    Lossless Extract for macOS and Windows is a tool for purists who demand perfect audio preservation. Designed for precision and simplicity, it effortlessly extracts high-resolution audio from Blu-ray, SACD, MKV or DVD=Audio sources. It handles Dolby TrueHD (with Atmos) and DTS-HD Master Audio preserving atmos object based meta data. Many tools decode immersive audio into PCM, which permanently destroys spatial metadata. Lossless Extract preserves the original audio stream so the immersive mix...
    Leader badge
    Downloads: 33 This Week
    Last Update:
    See Project
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 5
    PasteEx

    PasteEx

    Paste the contents of the clipboard into files

    ...Designed as portable software with minimal setup requirements, it integrates with the Windows context menu for quick access. Overall, PasteEx functions as an efficient clipboard-to-file automation tool for developers, designers, and power users working with frequent asset extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Abot

    Abot

    Fast and flexible C# framework for building customizable web crawlers

    Abot is an open source C# web crawler framework designed to help developers efficiently crawl and process web content. It focuses on speed, flexibility, and extensibility while handling the complex low-level tasks involved in web crawling. It manages essential components such as multithreading, HTTP requests, scheduling, and link parsing so developers can focus on processing the collected data. Abot follows a modular architecture that allows developers to customize nearly every stage of the...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    intoit-sra

    intoit-sra

    Find exact position of a web site or URL among search results

    IntoIT-SRA can easily find the position of your web site within search results of a search engine. It also includes an exclusion list that will help you remove unwanted sites. Includes web data extraction, a personal spreadsheet loader for data filtering, a handy alarm for appointments and world time clock. This is a Beta version. This project is looking for programmers who want to build a Linux based version of IntoIT-SRA or a plug-in for browsers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Skinner

    Skinner

    Special Effects with Skinned Mesh in Unity

    Skinner is a collection of real-time special effects for Unity that use vertices of an animating skinned mesh as emission points. Instead of duplicating mesh data on the CPU, it employs a replacement shader to stream vertex positions into GPU-friendly buffers, conserving memory and CPU cycles. With those GPU-side buffers, Skinner can drive effects like trails, particles, or geometry that react to the underlying skinned animation in sophisticated ways. The approach enables complex, performant...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    IBEX_MDA

    IBEX_MDA

    An open infrastructure software to facilitate radiomics research.

    ...The latest stand-alone version can be downloaded at http://bit.ly/IBEX_MDAnderson The latest source-code version can be downloaded at http://bit.ly/IBEXSrc_MDAnderson IBEX (imaging biomarker explorer) is an open infrastructure software platform that flexibly supports common radiomics workflow tasks such as multimodality image data import and review, development of feature extraction algorithms, model validation, and consistent data sharing among multiple institutions. IBEX software package was developed using the MATLAB and C/C++ programming languages. The software architecture deploys the modern model-view-controller, unit testing, and function handle programming concepts to isolate each quantitative imaging analysis task, to validate if their relevant data and algorithms are fit for use, and to plug in new modules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Catch Bugs Before Your Customers Do Icon
    Catch Bugs Before Your Customers Do

    Real-time error alerts, performance insights, and anomaly detection across your full stack. Free 30-day trial.

    Move from alert to fix before users notice. AppSignal monitors errors, performance bottlenecks, host health, and uptime—all from one dashboard. Instant notifications on deployments, anomaly triggers for memory spikes or error surges, and seamless log management. Works out of the box with Rails, Django, Express, Phoenix, Next.js, and dozens more. Starts at $23/month with no hidden fees.
    Try AppSignal Free
  • 10

    DMEAS

    DNA Methylation Entropy Analysis Software

    DMEAS (DNA Methylation Entropy Analysis Software) is written in C# language. It is a user-friendly DNA methylation analysis tool for DNA methylation pattern visualization and extraction, DNA methylation level calculation and DNA methylation entropy analysis. DMEAS progressively scans the mapping results of bisulfite sequencing reads to extract DNA methylation patterns for contiguous CpG dinucleotides. It was developed in order to assess the DNA methylation variations within a cell population for a given genomic locus or genome-wide methylation data.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    Lioness (Languages Interop Framework)
    Framework for making Windows applications that are one .exe file in AutoHotKey_L,C++,C#, VB.NET,Java,Groovy,Common Lisp,Nemerle,Ruby,Python,PHP,Lua,Tcl,Perl,Jint,S#,WSH VBScript,HTML/JavaScript/CSS,COM, PowerShell without compiling . For .NET 4.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    WSciParser (Web Science Parser) is a handy small library created in C#.NET to parse data from web science databases (ISI Web, NCBI, PubMed...)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    The FSSearchIndex Framework project provides a framework that allows application developers to write their own content based file search and indexing applications. It currently supports content extraction and indexing on Text,Word, Excel, PDF files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A toolkit for crawling information from web pages by combining different kinds of "actions". Actions are simple operations such as navigation to a specified url or extraction of text from the html. Also available is a graphic user interface.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    Tool that allows extraction and archiving of xbig files used in certain games like Ufo Extraterrestrials.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    dbTool has been designed to provide a uniform interface over heterogeneous data sources to enable simple data manipulation. The motivation for the tool was to simplify ad-hoc data population and extraction operation in a software development context.
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 17
    N-GRAM - tool for n-grams extraction from xml files Main features: (i) XPath expressions for nodes selection and stop patterns identification; (ii) custom xsl stylesheet to filter the n-gram data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB