Showing 44 open source projects for "extraction"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    Enlive

    Enlive

    Selector-based templating and transformation system for Clojure

    Enlive is a Clojure library for HTML templating, transformation, and scraping, supporting composable manipulation of HTML/XML in a functional style. It allows selecting, transforming, and generating HTML fragments using CSS selectors, and supports server-side template composition, dynamic pages, and content rewriting. By default selector-transformation pairs are run sequentially. When you know that several transformations are independent, you can now specify (as an optimization) to process...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Duckling (Old)

    Duckling (Old)

    Clojure library that parses text into structured data

    Duckling (the “old” archived version) is a natural language processing library (in Clojure) for parsing text to structured data — specifically, recognizing quantities such as dates, times, durations, measurements, currencies, etc., from free-form text. To use Duckling in your project, you just need two functions: load! to load the default configuration, and parse to parse a string. Duckling is a Clojure library that parses text into structured data. See our blog post announcement for more...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    MITIE

    MITIE

    MITIE: library and tools for information extraction

    This project provides free (even for commercial use) state-of-the-art information extraction tools. The current release includes tools for performing named entity extraction and binary relation detection as well as tools for training custom extractors and relation detectors. MITIE is built on top of dlib, a high-performance machine-learning library[1], MITIE makes use of several state-of-the-art techniques including the use of distributional word embeddings[2] and Structural Support Vector Machines[3]. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Simd

    Simd

    High performance image processing library in C++

    The Simd Library is a free open source image processing library, designed for C and C++ programmers. It provides many useful high performance algorithms for image processing such as: pixel format conversion, image scaling and filtration, extraction of statistic information from images, motion detection, object detection (HAAR and LBP classifier cascades) and classification, neural network. The algorithms are optimized with using of different SIMD CPU extensions. In particular the library supports following CPU extensions: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX-512 for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM. ...
    Leader badge
    Downloads: 32 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Leader badge
    Downloads: 131 This Week
    Last Update:
    See Project
  • 6
    Deeplearning-papernotes

    Deeplearning-papernotes

    Summaries and notes on Deep Learning research papers

    Deeplearning-papernotes is an implementation of Convolutional Neural Networks for sentence and text classification in TensorFlow, based on a well-known research paper that applies CNN architectures to natural language processing tasks with strong performance in sentiment analysis and similar classification problems. The repository provides the complete network definition, including an embedding layer to convert words into dense representations, convolution and max-pooling layers to extract...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Matlab/Octave Rotations Library

    Matlab/Octave Rotations Library

    Library for working with 3D rotations in Matlab/Octave

    ...The rotation representations that are supported are rotation matrices (Rotmat), Quaternions (Quat), intrinsic ZYX Euler angles (Euler), fused angles (Fused) and tilt angles (Tilt). Operations such as composition, inversion, ZYX yaw extraction, fused yaw extraction, random generation, equality detection, vector rotation and much more have been implemented for each rotation representation. Tilt vector addition and the relative and absolute tilt phase spaces are also supported. Unit tests for each function are provided to ensure that the library is performant.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Rotations Conversion Library

    Rotations Conversion Library

    Library for working with 3D rotations in C++

    The Rotations Conversion Library (RCL) is a collection of C++ functions that address common computations and numerical handling of rotations in 3D Euclidean space, including support for rotation matrices (`Rotmat`), Quaternions (`Quat`), intrinsic ZYX Euler angles (`Euler`), fused angles (`Fused`) and tilt angles (`Tilt`). In addition to the core competency of being able to convert between each of the representations, operations such as inversion, ZYX yaw extraction, fused yaw extraction, renormalisation, equality detection, vector rotation and much more have been implemented. Unit tests for each function are provided to ensure that the library performs exactly as it should. The implementation of the Rotations Conversion Library is based on the very related Matlab/Octave Rotations Library (MORL), which is a highly tested reference implementation for the required algorithms (see https://sourceforge.net/projects/rotationslib/).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    ttag

    ttag

    Simple approach for javascript localization

    Modern javascript i18n localization library based on ES6 tagged templates and the good old GNU gettext. Just tag your strings to make them translatable. Use a simple ttag-cli tool for translation extraction. Can be easily integrated with almost any workflow as it uses the babel-plugin for string extraction. Can be easily used with the typescript. Allows you to place translations into the sources on a build step. Gettext is a simple localization format with a rich ecosystem. Ttag has support for plurals, contexts, translator comments, and much more.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    aeneas

    aeneas

    Automagically synchronize audio and text (aka forced alignment)

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment). aeneas automatically generates a synchronization map between a list of text fragments and an audio file containing the narration of the text. In computer science this task is known as (automatically computing a) forced alignment.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 11

    BTCore

    A Collection of Useful Java/Swing Code

    NEWEST FEATURE ---------- Added one line ".zip" extraction to Util class! Added one line ".zip" extraction from URL (web) and one line file download from URL! BTCore is a library that was designed to be used with all of Banotech's software. It includes a tremendous amount of code snippets and classes that have been boiled down to allow ease of use by everyone. BTCore is, and always will be FREE.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    CRFasRNN

    CRFasRNN

    Semantic image segmentation method described in the ICCV 2015 paper

    CRF-RNN is a deep neural architecture that integrates fully connected Conditional Random Fields (CRFs) with Convolutional Neural Networks (CNNs) by reformulating mean-field CRF inference as a Recurrent Neural Network. This fusion enables end-to-end training via backpropagation for semantic image segmentation tasks, eliminating the need for separate, offline post-processing steps. Our work allows computers to recognize objects in images, what is distinctive about our work is that we also...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    CMIS Input plugin for Pentaho

    CMIS Input plugin for Pentaho

    Allows querying Content Management Systems that use the CMIS.

    ...Imagine using the information extracted for statistical purposes, for creating reports and, more generally, to analyse your document archives in a way unthinkable until now with the current tools available. All this is possible within the Pentaho Suite, the Open Source Business Intelligence platform, which is useful to the extraction and analysis of structured and semi-structured data. With this goal (the extraction and analysis of data) has been designed and developed the CMIS Input plugin for Pentaho Data Integration (Kettle) that allows querying Content Management Systems that use the CMIS interoperability standard. The data, once extracted, can be stored and analyzed and perhaps presented in customized reports be published in various formats for the end user (PDF, Excel, etc..).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    musicinformationretrieval.com

    musicinformationretrieval.com

    Instructional notebooks on music information retrieval

    musicinformationretrieval.com is a collection of instructional materials for music information retrieval (MIR). These materials contain a mix of casual conversation, technical discussion, and Python code. These pages, including the one you're reading, are authored using Colab notebooks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    cde4php - Cross Database Engine for PHP

    cde4php - Cross Database Engine for PHP

    Uniform Database Abstraction for PHP Development

    Debby has replaced CDE in the Tina4Stack, you may want to check it out at http://tina4.com CDE is a PHP class which implements the general database functions in PHP and provides a common SQL platform for php development where developers change their databases but not their code. Supports Firebird, MySQL,Oracle,SQLite, MSSQL(both drivers),CUBRID,ODBC. CDE now supports date uniformity, param passing & BLOB handling across all the databases supported. CDE is not a replacement for PDO,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    TextBlob

    TextBlob

    TextBlob is a Python library for processing textual data

    Simple, Pythonic, text processing, Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, translation, and more. TextBlob stands on the giant shoulders of NLTK and pattern, and plays nicely with both. Supports word inflection (pluralization and singularization) and lemmatization, as well as spelling correction. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ivantk

    ivantk

    Image-based Vascular Analysis Toolkit

    The Image-based Vascular Analysis Toolkit is a set of multiplatform C++ libraries for vascular analysis of (3D) medical images, typically CT or MRI. It can be considered as an extension of the Insight Toolkit (ITK) for vascular image analysis, with methods for detection, extraction and modeling of vascular structures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The purpose of the Metabrain library is to give developers a way to extract this information from the Internet without resorting to natural language parsing or other complex techniques, using instead statistical methods and patterns/trends analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    7-Zip-NBinding
    .NET wrapper for 7Zip, based on 7-Zip-JBinding, produced with IKVM. Extracts (password protected, multi-part) 7z Zip Rar Tar Split Lzma Iso HFS GZip Cpio BZip2 Z Arj Chm Lhz Cab Nsis Deb Rpm Udf archives. Archive creation/more formats coming soon.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB