Showing 161 open source projects for "extraction"

  • 1

    anti-copy-paster

    A plugin for IntelliJ IDEA for just-in-time code duplicates extraction

    The plugin monitors copying and pasting inside the IDE. As soon as a code fragment is pasted, the plugin checks whether it introduces code duplication. If it does, the plugin calculates a set of code metrics for the fragment and compares them against the currently selected thresholds. If the thresholds are exceeded, the plugin suggests that the developer perform the Extract Method refactoring and applies the refactoring if necessary.
    Downloads: 0 This Week
  • 2
    [ARCHIVAL] The central forum for the MWE community. Share your open-source data sets and MWE extraction tools, exchange ideas on evaluation strategies and further development of the tools, and discuss theoretical definitions and linguistic properties of MWEs.
    Downloads: 0 This Week
  • 3
    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20 AI-driven data processing tool, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, enabling efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. ...
    Downloads: 5 This Week
  • 4

    wx-c.so.recompile

    Recompile of wx-c and wxGTK-2.8.12 for x86_64

    wx-c and wxGTK are both packed in the file wx-c-0-9-0-2_x64_wxGTK2.8u.tar.gz. After extraction, the .so files should be placed under /usr/local/lib or elsewhere on the system library search path.
    Downloads: 1 This Week
  • 5
    Botpress

    Dev tools to reliably understand text and automate conversations

    ...We offer a complete dev-friendly platform that ships with all the tools you need to build, deploy, and manage production-grade chatbots in record time. Built-in Natural Language Processing tasks such as intent recognition, spell checking, entity extraction, and slot tagging (and many others). A visual conversation studio to design multi-turn conversations and workflows. An emulator and a debugger to simulate conversations and debug your chatbot. Support for popular messaging channels like Slack, Telegram, MS Teams, Facebook Messenger, and an embeddable web chat. An SDK and code editor to extend the capabilities. ...
    Downloads: 6 This Week
  • 6
    ldetool

    Code generator for fast log file parsers

    ldetool (Line Data Extraction Tool) is a command-line utility that generates Go code for fast log file parsing. By defining parsing rules, developers can produce efficient parsers tailored to specific log formats, outperforming traditional regex-based approaches. It's particularly useful for processing large volumes of log data.
    Downloads: 0 This Week
  • 7
    RetDec

    RetDec is a retargetable machine-code decompiler based on LLVM

    The decompiler is not limited to any particular target architecture, operating system, or executable file format. ELF, PE, Mach-O, COFF, AR (archive), Intel HEX, and raw machine code are supported. Supported architectures: 32-bit Intel x86, ARM, MIPS, PIC32, and PowerPC; 64-bit x86-64. Demangling of symbols from C++ binaries (GCC, MSVC, Borland). Reconstruction of functions, types, and high-level constructs. Output in two high-level languages: C and a Python-like language. Generation of call graphs,...
    Downloads: 28 This Week
  • 8
    NuxtJS

    The Intuitive Web Framework, based on Vue 3

    ...Utility, ease of use, and efficiency are key. Nuxt is built with a set of features that make this possible: code-splitting, tree-shaking, optimized cold start, link prefetching, and payload extraction, to name a few. Fast by default so you can focus on building. Choose the rendering strategy at the route level: SSR, SSG, CSR, ISR, ESR, SWR. Build any kind of website or web application with optimized performance in mind. By leveraging server-side rendering, ESM format, and optimized images, Nuxt websites are indexable by search engines while giving end users the feeling of an app.
    Downloads: 0 This Week
  • 9
    7-Zip-JBinding

    Java wrapper for 7z archiver engine

    Native (JNI) cross-platform library to extract (password protected, multi-part) 7z Zip Rar Tar Split Lzma Iso HFS GZip Cpio BZip2 Z Arj Chm Lhz Cab Nsis Deb Rpm Wim Udf archives and create 7z, Zip, Tar, GZip & BZip2 from Java.
    Downloads: 49 This Week
  • 10
    Specter

    Clojure(Script)'s missing piece

    Specter is a powerful Clojure (and ClojureScript) library that revolutionizes navigation and manipulation of deeply nested and recursive data structures through a flexible, high-performance API beyond what vanilla Clojure offers. Specter has an extremely simple core, just a single abstraction called "navigator". Queries and transforms are done by composing navigators into a "path" precisely targeting what you want to retrieve or change. Navigators can be composed with any other navigators,...
    Downloads: 0 This Week
  • 11
    EdiZon

    Homebrew save management, editing tool and memory trainer for Horizon

    A Homebrew save file dumper, injector, and on-console editor for Horizon, the OS of the Nintendo Switch. Please note that if you are using Atmosphere 0.10.0+, you must use the snapshot version of EdiZon. Older versions of EdiZon do not work on the latest Atmosphere. Extraction of game saves. Injection of extracted game saves (your own and your friends' save files). Uploading of save files directly. Batch extraction of all save files of all games on the system. Save file editing. Easy to use, scriptable, and easily expandable on-console save editing. Lua and Python script support. Built-in save editor updater. ...
    Downloads: 12 This Week
  • 12
    printpdf

    Rust / WASM library for reading, writing and rendering PDF

    printpdf is a Rust library for creating, reading, writing, and rendering PDF documents, providing developers with fine-grained control over document generation and layout. It supports a wide range of PDF features, including pages, layers, annotations, vector graphics, images, and embedded fonts, allowing the creation of complex and professional documents. The library emphasizes manual positioning of elements, giving developers precise control over layout and rendering rather than relying on...
    Downloads: 0 This Week
  • 13
    Exifr

    The fastest and most versatile JS EXIF reading library

    Exifr is a fast and very versatile JavaScript EXIF reading library that works everywhere, parses everything and handles just about anything you throw at it. It can handle any input: buffers, url, <img> tag and more; .jpg, .tif, and .heic files; and TIFF (EXIF, GPS, etc.), XMP, ICC, IPTC, JFIF segments. It skips parsing tags you don’t need, and reads only the first few bytes. There’s no need to read the whole file to see if there’s an EXIF file in it, or extract all the data when you just...
    Downloads: 3 This Week
  • 14
    Chatette

    A powerful dataset generator for Rasa NLU, inspired by Chatito

    Chatette is a Python-based tool for generating training datasets for Natural Language Understanding (NLU) models, particularly those used with Rasa NLU. It employs a domain-specific language to define templates, enabling the creation of diverse and extensive training examples for intent classification and entity recognition.
    Downloads: 0 This Week
  • 15
    WBBlades

    WBBlades is a tool set based on Mach-O file parsing

    WBBlades is a toolset based on Mach-O file parsing. It includes one-click detection for apps (supports OC and Swift), package size analysis (supports a single static or dynamic library), point-to-point crash analysis (analyzes system crash logs, with or without symbol files), automatic class extraction, and a hook capability based on Mach-O files. It mainly uses __TEXT assembly code analysis, architecture extraction, dSYM file stripping, symbol table stripping, and crash file (ips) analysis. It supports big method/small method parsing and dyld_chained_fixups processing on iOS 15 and above.
    Downloads: 0 This Week
  • 16
    SynWrite

    Advanced text editor for programmers and Notepad replacement.

    SynWrite is a free source code editor, HTML editor, and Notepad replacement for Windows systems. It is a full-featured alternative not only to simple Notepad, but also to professional editors.
    Downloads: 56 This Week
  • 17
    CNN for Image Retrieval
    cnn-for-image-retrieval is a research-oriented project that demonstrates the use of convolutional neural networks (CNNs) for image retrieval tasks. The repository provides implementations of CNN-based methods to extract feature representations from images and use them for similarity-based retrieval. It focuses on applying deep learning techniques to improve upon traditional handcrafted descriptors by learning features directly from data. The code includes training and evaluation scripts that...
    Downloads: 1 This Week
  • 18
    Laravel Mix

    The power of webpack, distilled for the rest of us

    Laravel Mix provides a clean, fluent API for defining basic webpack build steps for your applications. Mix supports several common CSS and JavaScript pre-processors. If you've ever been confused about how to get started with module bundling and asset compilation, you will love Laravel Mix! Webpack is an incredibly powerful module bundler that prepares your JavaScript and assets for the browser. The only understandable downside is that it requires a bit of a learning curve. In an effort to...
    Downloads: 0 This Week
  • 19
    Marathon -GUI Test Runner Web, Swing, FX

    Marathon supports testing of Java/Swing and Java/Fx applications.

    ...Currently, Marathon supports JRuby script models for recording test scripts. The Marathon test runner generates Allure test reports. Marathon allows grouping of test cases. It also has an option for inserting modules while recording (in MarathonITE). It allows the tester to insert a checklist while recording, and to take a screen capture and annotate it while playing the test cases. Extracting code into modules and inserting it is very easy using Marathon. It offers different UI themes, which testers may find appealing. Object mapping is quite simple, and configuration is easy for the tester to understand. Debugging of test scripts is also easy, and there is a script console that takes Selenium commands. ...
    Downloads: 7 This Week
  • 20
    FAR - Find And Replace
    Search and replace operations on file content across multiple files. Recursive operations within entire directory trees. FAR comes with support for regular expressions (regex) over multiple lines, automatic backup, and various character encodings. Run grep-like extractions to condense or rearrange sources, or perform bulk file renaming.
    Downloads: 33 This Week
  • 21

    PDFtk Bookmarks Editor

    GUI for updating PDF bookmarks using PDF Toolkit (PDFtk) on Windows

    Free and open source GUI application for updating bookmarks in a PDF document using the PDF Toolkit command line tool, PDFtk Server. User selects the PDF via drag and drop and then edits the bookmark entries in a text file using a simple, 1-line data format. Program handles everything else in response to a few user button clicks. OS: Windows. Author: David King. License: GPLv3.
    Downloads: 21 This Week
  • 22
    A collection of small utilities for: data extraction (text or binary files), data buffering, message queue control, column addition, date/time manipulation, and data recovery testing.
    Downloads: 1 This Week
  • 23
    SkyWater PDK

    Open source process design kit for usage with SkyWater Technology

    The SkyWater PDK is the first broadly available open-source process design kit for a commercial-grade CMOS node, enabling researchers, startups, and students to design real ASICs without proprietary NDAs. It provides the essential artifacts for digital and analog flows: SPICE models, DRC/LVS rules, extraction decks, and technology files for open tools like Magic and KLayout. Standard-cell libraries and IO pads are included so digital designers can use open synthesis and place-and-route to reach a manufacturable GDS. Because the PDK is open, it becomes a common target for community reference designs, open tapeouts, and teaching curricula. ...
    Downloads: 9 This Week
  • 24
    Laravel Stats Tracker

    Tracker gathers a lot of information from your requests to identify and store sessions and page views. Storing user tracking information in indexed and normalized database tables wastes less disk space and eases the extraction of valuable information about your application and business. As soon as you install and enable it, Tracker will start storing all the information you tell it to; you can then use the Tracker Facade in your application to access everything.
    Downloads: 0 This Week
  • 25
    AhoCorasickDoubleArrayTrie

    An extremely fast implementation of Aho Corasick algorithm

    AhoCorasickDoubleArrayTrie is a Java implementation of the Aho–Corasick multi-pattern matching algorithm that is optimized using a Double-Array Trie data structure. It is designed for fast keyword scanning across large texts, where you want to search for many patterns simultaneously and efficiently. The core idea is to build an automaton from a dictionary of patterns, then stream through input text to emit matches with minimal overhead. By using a double-array trie representation, the...
    Downloads: 0 This Week
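
The last entry's core idea (build an automaton from a dictionary of patterns, then stream through the text and emit matches) can be illustrated with a minimal sketch. This is not the AhoCorasickDoubleArrayTrie library or its API: it is plain Python, it stores transitions in ordinary dictionaries rather than a double-array trie, and the names `build_automaton` and `find_all` are hypothetical, chosen only for this illustration.

```python
from collections import deque


def build_automaton(patterns):
    """Build a dict-based Aho-Corasick automaton (illustrative, not a double-array trie)."""
    goto = [{}]   # goto[state][char] -> next state
    fail = [0]    # failure link for each state
    out = [[]]    # patterns that end at each state

    # Phase 1: insert every pattern into a trie.
    for pat in patterns:
        state = 0
        for ch in pat:
            nxt = goto[state].get(ch)
            if nxt is None:
                goto.append({})
                fail.append(0)
                out.append([])
                nxt = len(goto) - 1
                goto[state][ch] = nxt
            state = nxt
        out[state].append(pat)

    # Phase 2: breadth-first pass to compute failure links.
    queue = deque(goto[0].values())  # depth-1 states keep fail = 0
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            # Inherit matches that end at the failure target.
            out[nxt] = out[nxt] + out[fail[nxt]]
    return goto, fail, out


def find_all(text, automaton):
    """Stream through text once and return (start_index, pattern) pairs."""
    goto, fail, out = automaton
    state = 0
    matches = []
    for i, ch in enumerate(text):
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        for pat in out[state]:
            matches.append((i - len(pat) + 1, pat))
    return matches


automaton = build_automaton(["he", "she", "his", "hers"])
print(find_all("ushers", automaton))
```

The text is scanned in a single pass regardless of how many patterns are in the dictionary; a double-array representation, as the entry describes, is a more compact way to store the same transitions.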