Showing 205 open source projects for "extract"

View related business solutions
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    Contour

    Contour

    Modern C++ Terminal Emulator

    contour is a modern and actually fast, modal, virtual terminal emulator, for everyday use. It is aimed at power users with a modern feature mindset. Available on all 4 major platforms, Linux, OS/X, FreeBSD, Windows. GPU-accelerated rendering. Font ligatures support (such as in Fira Code). Unicode: Emoji support (-: 🌈 💝 😛 👪 - including ZWJ, VS15, VS16 emoji :-) Unicode: Grapheme cluster support. Bold and italic fonts. High-DPI support. Vertical Line Markers (quickly jump to markers in your...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2
    Colly

    Colly

    Elegant Scraper and Crawler Framework for Golang

    Colly provides a clean interface to write any kind of crawler/scraper/spider. With Colly you can easily extract structured data from websites, which can be used for a wide range of applications, like data mining, data processing or archiving. Clean API. Fast (>1k request/sec on a single core) Manages request delays and maximum concurrency per domain. Automatic cookie and session handling. Sync/async/parallel scraping. Distributed scraping.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Ethereum ETL

    Ethereum ETL

    Python scripts for ETL (extract, transform and load) jobs for Ethereum

    Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery. Ethereum ETL lets you convert blockchain data into convenient formats like CSVs and relational databases.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    unipdf

    unipdf

    Golang PDF library for creating and processing PDF files (pure go)

    UniDoc UniPDF is a PDF library for Go (golang) with capabilities for creating and reading, processing PDF files. The library is written and supported by FoxyUtils.com, where the library is used to power many of its services. Every release of our libraries is automatically tested against known vulnerabilities and do not pass unless everything is remediated. All changes are carefully reviewed by our team.
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    Jenkins Docker

    Jenkins Docker

    Docker jenkins repo

    ...Treat the jenkins_home directory as you would a database, in Docker you would generally put a database on a volume. If your volume is inside a container, you can use a Docker command to extract the data, or other options to find where the volume data is. Note that some symlinks on some OSes may be converted to copies (this can confuse jenkins with lastStableBuild links etc). You can define the number of executors on the Jenkins built-in node using a groovy script. By default it is set to 2 executors, but you can extend the image and change it to your desired number of executors.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6
    dategrep

    dategrep

    Print lines matching a time range

    dategrep is a command-line utility designed to extract lines from log files that fall within a specified time range. It efficiently processes large log files by performing a binary search to locate the relevant entries, making it a valuable tool for system administrators and developers analyzing time-specific events.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Translate Toolkit

    Translate Toolkit

    Useful localization tools with Python API for building localization

    ...Allowing you and your translators to work on industry-standard translation formats. Search for pattern matches. Run tests that adapt to languages and source projects. Extract terminology. A large toolset to allow you to increase localization quality. The code is available for you to add new formats, project types, localization tests and language modules. Adapting the toolkit to your project and needs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    i18n ally

    i18n ally

    All in one i18n extension for VS Code

    Lokalise is the fastest growing language cloud technology made by developers, for developers. As a collaborative productivity platform, it helps structure and automate the translation and localization process for any company in the world. This extension itself supports i18n as well. It will be auto-matched to the display language you use in your VS Code editor. Supports multi-root workspaces. Supports remote development. Supports numerous popular frameworks. Supports linked locale messages....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Summarize

    Summarize

    Point at any URL/YouTube/Podcast or file

    Summarize is a toolset that lets you point at almost any content and quickly extract the gist, whether that content is a webpage, a YouTube video, a podcast, or a local file. It’s built around a CLI workflow so you can summarize from the terminal, but it also includes a Chrome extension so you can do the same thing directly while browsing. The project pairs an on-device “daemon” style background service with user-facing commands and extension UI, so summaries can feel immediate and repeatable once installed. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    jsoup

    jsoup

    Java library for working with real-world HTML

    jsoup is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers do. jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree. The parser will make...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    fx

    fx

    Command-line tool and terminal JSON viewer

    fx can work in two modes, cli and interactive. To start interactive mode pipe any JSON into fx. One of the frequent operations is mapping some function on an array. You can pass any number of anonymous functions for reducing JSON. fx provides a function save which will save everything in place and return saved object. This function can be only used with filename as first argument to fx command. Create .fxrc file in $HOME directory, and require any packages or define global functions. To be...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    goober

    goober

    A less than 1KB css-in-js solution

    ...You'll find as, forwarded, CSS, keyframes, styled and so much more. Easily access your common sizes, colors, and anything really with the use of a theme. On the server, you can easily extract the CSS for the current state with extractCss. The initial thought of goober was a CSS-in-js solution at the cost of peanuts. Hence the peanuts emoji. By using goober, you are practically getting back space in size to build more of the needed features.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Boneyard

    Boneyard

    Auto generated skeleton loading framework

    ...It functions as a sandbox where developers can store unfinished ideas, prototypes, or deprecated implementations without removing them entirely. The project provides a space for experimentation, allowing developers to revisit past work, extract useful components, or learn from previous attempts. It often contains a variety of code snippets or mini-projects that may not be production-ready but still hold educational or practical value. Boneyard reflects a development philosophy that values iteration and learning through experimentation rather than discarding work completely. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    s6-overlay

    s6-overlay

    s6 overlay for containers (includes execline, s6-linux-utils

    s6-overlay is an easy-to-install (just extract a tarball or two!) set of scripts and utilities allowing you to use existing Docker images while using s6 as a pid 1 for your container and process supervisor for your services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Echidna

    Echidna

    Ethereum smart contract fuzzer

    ...We designed Echidna with modularity in mind, so it can be easily extended to include new mutations or test specific contracts in specific cases. Optional corpus collection, mutation and coverage guidance to find deeper bugs. Powered by Slither to extract useful information before the fuzzing campaign. Source code integration to identify which lines are covered after the fuzzing campaign. Curses-based retro UI, text-only or JSON output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    pep484 stubs for Django

    pep484 stubs for Django

    PEP-484 stubs for Django

    ...You can show your support by liking the PR. This project does not affect your runtime at all. It only affects mypy type checking process. The current implementation uses Django's runtime to extract information about models, so it might crash if your installed apps or models.py are broken.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    NGX-Translate

    NGX-Translate

    The internationalization (i18n) library for Angular

    ...The main part of the library is named core. You can use it on its own, but it is usually a good idea to add a loader to load your translations into your application. You can also extract the strings from your code with the extractor. This makes it really easy to start and maintain your translations. By default, there is no loader available. You can add translations manually using setTranslation but it is better to use a loader. You can write your own loader, or import an existing one.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    LangExtract

    LangExtract

    A Python library for extracting structured information

    LangExtract is a Python library developed by Google that leverages large language models (LLMs) to extract structured information from unstructured text—such as clinical notes, research papers, or literary works—based on user-defined instructions. It is designed to transform free-form text into reliable, schema-constrained data while maintaining traceability back to the source material. Each extracted entity is precisely grounded in its original context, allowing visual inspection and validation via automatically generated interactive HTML visualizations. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ytt

    ytt

    YAML templating tool that works on YAML structure instead of text

    ...This eliminates text insertion concerns like manual escaping and allows for easy structure reuse, thus, increasing readability. Weave ytt templating into your own configuration, externalize values to variables, extract repeated snippets to functions, and make whole sections conditional. Patch on top of any configuration using ytt overlays. Express precise structural edits and invariant declaratively. ytt execution environment is hermetic and side-effect free, with no access to the filesystem, network, time, randomness, or the operating system interfaces. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    linaria

    linaria

    Zero-runtime CSS in JS library

    ...Optionally use any CSS preprocessor such as Sass or PostCSS. Easily find where the style was defined with CSS source maps. Linaria currently supports webpack and Rollup to extract the CSS at build time. Optionally, add the @linaria preset to your Babel configuration at the end of the presets list to avoid errors when importing the components in your server code or tests. Linaria can be used with any framework, with additional helpers for React.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    BRAID

    BRAID

    Themeable design system for the SEEK Group

    Braid aims to make cross-brand UI development as fast as possible while maintaining a high level of quality and accessibility. In order to achieve this, Braid provides as a set of React components and CSS variable-based styling themes using vanilla-extract. As much as possible, we want Braid code to make sense to non-developers. We’re aggressively focused on the simplicity and composability of its API. Along with our work on Playroom, our goal is to empower designers and developers to iterate together in the same medium using the same components, reducing the need for high fidelity mock-ups before development starts. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    tsfresh

    tsfresh

    Automatic extraction of relevant features from time series

    tsfresh is a python package. It automatically calculates a large number of time series characteristics, the so called features. tsfresh is used to to extract characteristics from time series. Without tsfresh, you would have to calculate all characteristics by hand. With tsfresh this process is automated and all your features can be calculated automatically. Further tsfresh is compatible with pythons pandas and scikit-learn APIs, two important packages for Data Science endeavours in python. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Python-Spider

    Python-Spider

    Python3 web crawler practice

    Python-Spider is a repository intended to teach or provide examples for writing web spiders / crawlers in Python — part of a broader learning and resource collection by its author. The code and documentation are oriented toward beginners or intermediate learners who want to learn how to fetch, parse, and extract data from websites programmatically. As part of the author’s public learning-path repositories, python-spider likely includes examples of HTTP requests, HTML parsing, maybe concurrency or scheduling to crawl multiple pages, and techniques to handle common web-scraping issues. For people wanting to get hands-on with building scrapers, collecting data, or learning how to navigate web programming in Python, this repository acts as a didactic reference or starting point. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    .NET for Apache Spark

    .NET for Apache Spark

    A free, open-source, and cross-platform big data analytics framework

    .NET for Apache Spark provides high-performance APIs for using Apache Spark from C# and F#. With these .NET APIs, you can access the most popular Dataframe and SparkSQL aspects of Apache Spark, for working with structured data, and Spark Structured Streaming, for working with streaming data. .NET for Apache Spark is compliant with .NET Standard - a formal specification of .NET APIs that are common across .NET implementations. This means you can use .NET for Apache Spark anywhere you write...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    pmd

    pmd

    An extensible multilanguage static code analyzer

    PMD is a source code analyzer. It finds common programming flaws like unused variables, empty catch blocks, unnecessary object creation, and so forth. It supports Java, JavaScript, Salesforce.com Apex and Visualforce, PLSQL, Apache Velocity, XML, and XSL. Additionally, it includes CPD, the copy-paste-detector. CPD finds duplicated code in Java, C, C++, C#, Groovy, PHP, Ruby, Fortran, JavaScript, PLSQL, Apache Velocity, Scala, Objective C, Matlab, Python, Go, Swift and Salesforce.com Apex,...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB