Showing 225 open source projects for "extract"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    unipdf

    unipdf

    Golang PDF library for creating and processing PDF files (pure go)

    UniDoc UniPDF is a PDF library for Go (golang) with capabilities for creating and reading, processing PDF files. The library is written and supported by FoxyUtils.com, where the library is used to power many of its services. Every release of our libraries is automatically tested against known vulnerabilities and do not pass unless everything is remediated. All changes are carefully reviewed by our team.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Ethereum ETL

    Ethereum ETL

    Python scripts for ETL (extract, transform and load) jobs for Ethereum

    Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery. Ethereum ETL lets you convert blockchain data into convenient formats like CSVs and relational databases.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Jenkins Docker

    Jenkins Docker

    Docker jenkins repo

    ...Treat the jenkins_home directory as you would a database, in Docker you would generally put a database on a volume. If your volume is inside a container, you can use a Docker command to extract the data, or other options to find where the volume data is. Note that some symlinks on some OSes may be converted to copies (this can confuse jenkins with lastStableBuild links etc). You can define the number of executors on the Jenkins built-in node using a groovy script. By default it is set to 2 executors, but you can extend the image and change it to your desired number of executors.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    dategrep

    dategrep

    Print lines matching a time range

    dategrep is a command-line utility designed to extract lines from log files that fall within a specified time range. It efficiently processes large log files by performing a binary search to locate the relevant entries, making it a valuable tool for system administrators and developers analyzing time-specific events.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MyQ Print Management Software Icon
    MyQ Print Management Software

    SAVE TIME WITH PERSONALIZED PRINT SOLUTIONS

    Boost your digital or traditional workplace with MyQ’s secure print and scan solutions that respect your time and help you focus on what you do best.
    Learn More
  • 5
    Translate Toolkit

    Translate Toolkit

    Useful localization tools with Python API for building localization

    ...Allowing you and your translators to work on industry-standard translation formats. Search for pattern matches. Run tests that adapt to languages and source projects. Extract terminology. A large toolset to allow you to increase localization quality. The code is available for you to add new formats, project types, localization tests and language modules. Adapting the toolkit to your project and needs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    i18n ally

    i18n ally

    All in one i18n extension for VS Code

    Lokalise is the fastest growing language cloud technology made by developers, for developers. As a collaborative productivity platform, it helps structure and automate the translation and localization process for any company in the world. This extension itself supports i18n as well. It will be auto-matched to the display language you use in your VS Code editor. Supports multi-root workspaces. Supports remote development. Supports numerous popular frameworks. Supports linked locale messages....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Summarize

    Summarize

    Point at any URL/YouTube/Podcast or file

    Summarize is a toolset that lets you point at almost any content and quickly extract the gist, whether that content is a webpage, a YouTube video, a podcast, or a local file. It’s built around a CLI workflow so you can summarize from the terminal, but it also includes a Chrome extension so you can do the same thing directly while browsing. The project pairs an on-device “daemon” style background service with user-facing commands and extension UI, so summaries can feel immediate and repeatable once installed. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    jsoup

    jsoup

    Java library for working with real-world HTML

    jsoup is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers do. jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree. The parser will make...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    fx

    fx

    Command-line tool and terminal JSON viewer

    fx can work in two modes, cli and interactive. To start interactive mode pipe any JSON into fx. One of the frequent operations is mapping some function on an array. You can pass any number of anonymous functions for reducing JSON. fx provides a function save which will save everything in place and return saved object. This function can be only used with filename as first argument to fx command. Create .fxrc file in $HOME directory, and require any packages or define global functions. To be...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    Tailslayer

    Tailslayer

    Library for reducing tail latency in RAM reads

    Tailslayer is a cybersecurity and forensic analysis tool designed to extract and analyze artifacts from Tails OS environments. It focuses on uncovering traces of activity in systems that are designed to be privacy-preserving and ephemeral. The tool helps investigators identify residual data such as logs, configurations, or usage traces that may persist despite Tails’ security features. It is particularly relevant for digital forensics and research into privacy systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    goober

    goober

    A less than 1KB css-in-js solution

    ...You'll find as, forwarded, CSS, keyframes, styled and so much more. Easily access your common sizes, colors, and anything really with the use of a theme. On the server, you can easily extract the CSS for the current state with extractCss. The initial thought of goober was a CSS-in-js solution at the cost of peanuts. Hence the peanuts emoji. By using goober, you are practically getting back space in size to build more of the needed features.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Boneyard

    Boneyard

    Auto generated skeleton loading framework

    ...It functions as a sandbox where developers can store unfinished ideas, prototypes, or deprecated implementations without removing them entirely. The project provides a space for experimentation, allowing developers to revisit past work, extract useful components, or learn from previous attempts. It often contains a variety of code snippets or mini-projects that may not be production-ready but still hold educational or practical value. Boneyard reflects a development philosophy that values iteration and learning through experimentation rather than discarding work completely. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    s6-overlay

    s6-overlay

    s6 overlay for containers (includes execline, s6-linux-utils

    s6-overlay is an easy-to-install (just extract a tarball or two!) set of scripts and utilities allowing you to use existing Docker images while using s6 as a pid 1 for your container and process supervisor for your services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Echidna

    Echidna

    Ethereum smart contract fuzzer

    ...We designed Echidna with modularity in mind, so it can be easily extended to include new mutations or test specific contracts in specific cases. Optional corpus collection, mutation and coverage guidance to find deeper bugs. Powered by Slither to extract useful information before the fuzzing campaign. Source code integration to identify which lines are covered after the fuzzing campaign. Curses-based retro UI, text-only or JSON output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    pep484 stubs for Django

    pep484 stubs for Django

    PEP-484 stubs for Django

    ...You can show your support by liking the PR. This project does not affect your runtime at all. It only affects mypy type checking process. The current implementation uses Django's runtime to extract information about models, so it might crash if your installed apps or models.py are broken.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    NGX-Translate

    NGX-Translate

    The internationalization (i18n) library for Angular

    ...The main part of the library is named core. You can use it on its own, but it is usually a good idea to add a loader to load your translations into your application. You can also extract the strings from your code with the extractor. This makes it really easy to start and maintain your translations. By default, there is no loader available. You can add translations manually using setTranslation but it is better to use a loader. You can write your own loader, or import an existing one.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    LangExtract

    LangExtract

    A Python library for extracting structured information

    LangExtract is a Python library developed by Google that leverages large language models (LLMs) to extract structured information from unstructured text—such as clinical notes, research papers, or literary works—based on user-defined instructions. It is designed to transform free-form text into reliable, schema-constrained data while maintaining traceability back to the source material. Each extracted entity is precisely grounded in its original context, allowing visual inspection and validation via automatically generated interactive HTML visualizations. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    ytt

    ytt

    YAML templating tool that works on YAML structure instead of text

    ...This eliminates text insertion concerns like manual escaping and allows for easy structure reuse, thus, increasing readability. Weave ytt templating into your own configuration, externalize values to variables, extract repeated snippets to functions, and make whole sections conditional. Patch on top of any configuration using ytt overlays. Express precise structural edits and invariant declaratively. ytt execution environment is hermetic and side-effect free, with no access to the filesystem, network, time, randomness, or the operating system interfaces. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    linaria

    linaria

    Zero-runtime CSS in JS library

    ...Optionally use any CSS preprocessor such as Sass or PostCSS. Easily find where the style was defined with CSS source maps. Linaria currently supports webpack and Rollup to extract the CSS at build time. Optionally, add the @linaria preset to your Babel configuration at the end of the presets list to avoid errors when importing the components in your server code or tests. Linaria can be used with any framework, with additional helpers for React.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    BRAID

    BRAID

    Themeable design system for the SEEK Group

    Braid aims to make cross-brand UI development as fast as possible while maintaining a high level of quality and accessibility. In order to achieve this, Braid provides as a set of React components and CSS variable-based styling themes using vanilla-extract. As much as possible, we want Braid code to make sense to non-developers. We’re aggressively focused on the simplicity and composability of its API. Along with our work on Playroom, our goal is to empower designers and developers to iterate together in the same medium using the same components, reducing the need for high fidelity mock-ups before development starts. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    tsfresh

    tsfresh

    Automatic extraction of relevant features from time series

    tsfresh is a python package. It automatically calculates a large number of time series characteristics, the so called features. tsfresh is used to to extract characteristics from time series. Without tsfresh, you would have to calculate all characteristics by hand. With tsfresh this process is automated and all your features can be calculated automatically. Further tsfresh is compatible with pythons pandas and scikit-learn APIs, two important packages for Data Science endeavours in python. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Python-Spider

    Python-Spider

    Python3 web crawler practice

    Python-Spider is a repository intended to teach or provide examples for writing web spiders / crawlers in Python — part of a broader learning and resource collection by its author. The code and documentation are oriented toward beginners or intermediate learners who want to learn how to fetch, parse, and extract data from websites programmatically. As part of the author’s public learning-path repositories, python-spider likely includes examples of HTTP requests, HTML parsing, maybe concurrency or scheduling to crawl multiple pages, and techniques to handle common web-scraping issues. For people wanting to get hands-on with building scrapers, collecting data, or learning how to navigate web programming in Python, this repository acts as a didactic reference or starting point. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    .NET for Apache Spark

    .NET for Apache Spark

    A free, open-source, and cross-platform big data analytics framework

    .NET for Apache Spark provides high-performance APIs for using Apache Spark from C# and F#. With these .NET APIs, you can access the most popular Dataframe and SparkSQL aspects of Apache Spark, for working with structured data, and Spark Structured Streaming, for working with streaming data. .NET for Apache Spark is compliant with .NET Standard - a formal specification of .NET APIs that are common across .NET implementations. This means you can use .NET for Apache Spark anywhere you write...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    pmd

    pmd

    An extensible multilanguage static code analyzer

    PMD is a source code analyzer. It finds common programming flaws like unused variables, empty catch blocks, unnecessary object creation, and so forth. It supports Java, JavaScript, Salesforce.com Apex and Visualforce, PLSQL, Apache Velocity, XML, and XSL. Additionally, it includes CPD, the copy-paste-detector. CPD finds duplicated code in Java, C, C++, C#, Groovy, PHP, Ruby, Fortran, JavaScript, PLSQL, Apache Velocity, Scala, Objective C, Matlab, Python, Go, Swift and Salesforce.com Apex,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    FVim

    FVim

    Cross platform Neovim front-end UI, built with F# + Avalonia

    Cross platform Neovim front-end UI, built with F# + Avalonia. Download the latest release package for your system, extract and run FVim! For Windows 7 / Vista / 8.1 / Server 2008 R2 / Server 2012 R2, use the win7-x64 package. For Windows 10, use the win-x64 package -- this version has faster startup. For macOS, it's packaged as an app bundle, unzip and drag it to your applications folder. Use a Windows FVim frontend with a WSL neovim: fvim, wsl. Multi-grid support, try Ctrl-w ge to detach a window into a separate OS window!
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB