Showing 24 open source projects for "extract"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    mtail

    mtail

    Extract internal monitoring data from application logs

    Extract internal monitoring data from application logs for collection in a time-series database. mtail is a tool for extracting metrics from application logs to be exported into a timeseries database or timeseries calculator for alerting and dashboarding. It fills a monitoring niche by being the glue between applications that do not export their own internal state (other than via logs) and existing monitoring systems, such that system operators do not need to patch those applications to instrument them or writing custom extraction code for every such application. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    go-i18n

    go-i18n

    Translate your Go program into multiple languages

    ...Code and tests are automatically generated from CLDR data. Supports strings with named variables using text/template syntax. Supports message files of any format (e.g. JSON, TOML, YAML). Use goi18n extract to extract all i18n.Message struct literals in Go source files to a message file for translation. Create an empty message file for the language that you want to add (e.g. translate.es.toml). Run goi18n merge active.en.toml translate.es.toml to populate translate.es.toml with the messages to be translated. The goi18n command manages message files used by the i18n package.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Geziyor

    Geziyor

    Blazing fast Go framework for web crawling and data scraping tasks

    Geziyor is a high-performance web crawling and web scraping framework built for the Go programming language. It is designed to help developers crawl websites and extract structured information from web pages efficiently. It focuses on speed and scalability, allowing large numbers of requests to be processed concurrently. Geziyor supports use cases such as data mining, monitoring web content, and automated testing workflows. It provides a flexible architecture where developers define parsing functions that process responses and extract the desired data. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Alternative JSON parser for Go

    Alternative JSON parser for Go

    One of the fastest alternative JSON parser for Go

    It does not require you to know the structure of the payload (eg. create structs), and allows accessing fields by providing the path to them. It is up to 10 times faster than standard encoding/json package (depending on payload size and usage), allocates no memory. Originally made for a project that relies on a lot of 3rd party APIs that can be unpredictable and complex. Encoding/json requires you to know exactly your data structures, or if you prefer to use map[string]interface{} instead,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    Scriberr

    Scriberr

    Self-hosted AI audio transcription

    ...The application includes a polished user interface that simplifies the management of recordings, transcripts, and annotations, making it suitable for both casual users and professionals handling large volumes of audio. Beyond transcription, Scriberr also integrates features such as summarization, tagging, and interaction with language models, allowing users to extract insights from conversations or meetings efficiently.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    unipdf

    unipdf

    Golang PDF library for creating and processing PDF files (pure go)

    UniDoc UniPDF is a PDF library for Go (golang) with capabilities for creating and reading, processing PDF files. The library is written and supported by FoxyUtils.com, where the library is used to power many of its services. Every release of our libraries is automatically tested against known vulnerabilities and do not pass unless everything is remediated. All changes are carefully reviewed by our team.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Scope Sentry

    Scope Sentry

    Cyberspace asset mapping and vulnerability scanning platform

    ...ScopeSentry combines multiple reconnaissance and vulnerability assessment capabilities such as subdomain enumeration, port scanning, directory scanning, and sensitive information detection. ScopeSentry can automatically identify assets and services, extract URLs, and crawl websites to collect useful security data for further analysis. It also includes vulnerability scanning and subdomain takeover detection to help identify common security weaknesses across web infrastructure. It supports distributed scanning with multiple nodes, allowing large scanning tasks to be performed efficiently across different systems.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Kaniko

    Kaniko

    Build Container Images In Kubernetes

    kaniko is a tool to build container images from a Dockerfile, inside a container or Kubernetes cluster. kaniko doesn't depend on a Docker daemon and executes each command within a Dockerfile completely in userspace. This enables building container images in environments that can't easily or securely run a Docker daemon, such as a standard Kubernetes cluster.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Carvel ytt

    Carvel ytt

    YAML templating tool that works on YAML structure instead of text

    ...This eliminates text insertion concerns like manual escaping and allows for easy structure reuse, thus, increasing readability. Weave ytt templating into your own configuration, externalize values to variables, extract repeated snippets to functions, and make whole sections conditional. Patch on top of any configuration using ytt overlays. Express precise structural edits and invariant declaratively. ytt execution environment is hermetic and side-effect free, with no access to filesystem, network, time, randomness, or the operating system interfaces. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    crawley

    crawley

    The unix-way web crawler

    Crawls web pages and prints any link it can find. Fast HTML SAX-parser (powered by golang.org/x/net/html) Small (below 1500 SLOC), idiomatic, 100% test-covered codebase. Grabs most of useful resources URLs (pics, videos, audios, forms, etc...) Found URLs are streamed to stdout and guaranteed to be unique (with fragments omitted) Scan depth (limited by starting host and path, by default - 0) can be configured. Can crawl rules and sitemaps from robots.txt. Brute mode - scan HTML comments for...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Colly

    Colly

    Elegant Scraper and Crawler Framework for Golang

    Colly provides a clean interface to write any kind of crawler/scraper/spider. With Colly you can easily extract structured data from websites, which can be used for a wide range of applications, like data mining, data processing or archiving. Clean API. Fast (>1k request/sec on a single core) Manages request delays and maximum concurrency per domain. Automatic cookie and session handling. Sync/async/parallel scraping. Distributed scraping.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    paperless-gpt

    paperless-gpt

    Use LLMs and LLM Vision (OCR) to handle paperless-ngx

    ...It is designed to transform scanned or uploaded documents into structured, searchable, and intelligently categorized data without requiring manual tagging or sorting. The system uses OCR combined with LLM reasoning to extract text, classify documents, and generate metadata such as tags, titles, and categories automatically. It supports advanced workflows where documents can be analyzed contextually, enabling features like semantic search, summarization, and automated classification pipelines. The platform is particularly useful for individuals and organizations managing large volumes of paperwork, such as invoices, contracts, or records, as it reduces the need for manual data entry.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ytt

    ytt

    YAML templating tool that works on YAML structure instead of text

    ...This eliminates text insertion concerns like manual escaping and allows for easy structure reuse, thus, increasing readability. Weave ytt templating into your own configuration, externalize values to variables, extract repeated snippets to functions, and make whole sections conditional. Patch on top of any configuration using ytt overlays. Express precise structural edits and invariant declaratively. ytt execution environment is hermetic and side-effect free, with no access to the filesystem, network, time, randomness, or the operating system interfaces. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    GoCV

    GoCV

    Go package for computer vision using OpenCV 4 and beyond

    ...Computer Vision (CV) is the ability of computers to process visual information, and perform tasks normally associated with those performed by humans. CV software typically processes video images, then uses the data to extract information in order to do something useful. Since memory allocations for images in GoCV are done through C based code, the go garbage collector will not clean all resources associated with a Mat. As a result, any Mat created must be closed to avoid memory leaks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Apache DevLake

    Apache DevLake

    Apache DevLake is an open-source dev data platform

    Apache DevLake is an open-source dev data platform that ingests, analyzes, and visualizes the fragmented data from DevOps tools to extract insights for engineering excellence, developer experience, and community growth. Apache DevLake is designed for developer teams looking to make better sense of their development process and to bring a more data-driven approach to their own practices. You can ask Apache DevLake many questions regarding your development process. Just connect and query. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Portable Executable Parser

    Portable Executable Parser

    lightweight Go package to parse, analyze and extract metadata

    Saferwall PE is a lightweight Go package for parsing, analyzing, and extracting metadata from Portable Executable (PE) binaries. Designed with malware analysis in mind, it is robust against malformed PE files and provides detailed insights into executable structures.​
    Downloads: 16 This Week
    Last Update:
    See Project
  • 17

    anglolintergo

    lightweight rulebased grammar error checker in Golang

    standalone offline grammar error checker (English, German and Dutch) programmed in Golang V 0.3.8.22 Usage: anglolintergo mytext.txt -> check English text anglolintergo -fix mytext.txt -> check and autofix English text anglolintergo -de mytext_ge.txt -> check German text anglolintergo -nl mytext_nl.txt -> check Dutch text anlgolintergo -help -> show helpfile anglolintergo mytext.tex -> check English latex file License see license.txt (BSD) Build: - extract files from zip-archive - use build.bat (Windows) or build.sh (Linux/macOS) in directory or do in a console: go mod init main go mod tidy go build -o anglolintergo ./... Dependencies: whatlanggo package (MIT license) Example: Luisa and Luise is friends. -> Luisa and Luise are friends. Luisa und Luise ist Freunde. -> Luisa und Luise sind Freunde. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    alp

    alp

    Access Log Profiler

    alp is Access Log Profiler.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Openbooks

    Openbooks

    Search and Download eBooks

    Openbooks allows you to download ebooks from irc.irchighway quickly and easily. For a complete list of features use the --help flags on all subcommands. For example openbooks cli --help or openbooks cli download --help. There are two modes; Server or CLI. In CLI mode you interact and download books through a terminal interface. In server mode the application runs as a web application that you can visit in your browser. OpenBooks server doesn't have to be hosted at the root of your webserver....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    crawlergo

    crawlergo

    Headless Chrome crawler for collecting URLs for vulnerability scans

    ...It also automatically fills and submits forms, helping discover hidden routes or parameters that might otherwise be missed by traditional crawlers. crawlergo includes a built-in URL de-duplication system that removes repeated or pseudo-static links while maintaining fast crawling speeds for large websites. crawlergo also analyzes page content to extract links and resources from multiple sources, including JavaScript files, comments, and configuration files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    VideoSrt

    VideoSrt

    Windows-GUI

    ...It is suitable for business scenarios that quickly and batch generate Chinese/English subtitles and text files for media (video/audio). Recognize video/audio speech to generate subtitle files (support Chinese-English translation, bilingual subtitles) Extract speech text from video/audio. Batch translation, filter processing/encoding SRT subtitle files. Using the Alibaba Cloud speech recognition interface, the accuracy is high, and the standard Mandarin/English recognition rate is over 95%. Video recognition does not need to upload the original video, which is convenient, fast and time-saving.
    Downloads: 30 This Week
    Last Update:
    See Project
  • 22
    archiver

    archiver

    Easily create & extract archives, and compress & decompress files

    ...The core library APIs work pretty well but the command has not been implemented yet, nor have most automated tests. If you need the arc command, stick with v3 for now. Create and extract archive files. Walk or traverse into archive files. Extract only specific files from archives. Insert (append) into .tar files. Numerous archive and compression formats supported. Extensible (add more formats just by registering them) Cross-platform, static binary. Pure Go (no cgo). Multithreaded Gzip. Adjust compression levels. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Goku

    Goku

    A powerful HTTP API gateway in pure golang

    ...Goku API Gateway is a microservice gateway that runs on the boundaries of enterprise system services. When you build websites, apps, IoT, and even API transactions, Goku API Gateway can help you extract duplicate components from your internal system and place them on the Goku gateway, such as user authorization, access control, traffic monitoring, firewalls, data cache, data conversion and so on. Goku API Gateway CE provides a comprehensive usage guide and customization guide.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    dataflowkit

    Golang framework for scraping data from web pages

    Golang Web Scraper library for extracting data from web pages. Save results as CSV, JSON, XML
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB