Showing 20 open source projects for "extraction"

  • 1

    Kaldi

    kaldi-asr/kaldi is the official location of the Kaldi project

    ...Kaldi is designed for researchers who need a highly customizable environment to experiment with new algorithms, as well as for practitioners who want robust, production-ready ASR pipelines. It includes extensive tools for data preparation, feature extraction, acoustic and language modeling, decoding, and evaluation. With its modular design, Kaldi allows users to adapt the system to a wide range of languages and domains. As one of the most influential projects in speech recognition, it has become a foundation for much of the modern work in ASR.
    Downloads: 4 This Week
  • 2

    ldif-extract

    Extract selected entries from LDIF files, like grep

    ldif-extract is a small grep-like tool for extracting and converting data from LDIF files. It can be used standalone or in a pipe with other tools such as ldapsearch.
    Downloads: 0 This Week
  • 3

    Easyspider - Distributed Web Crawler

    Easy Spider is a distributed Perl Web Crawler Project from 2006

    Easy Spider is a distributed Perl Web Crawler Project from 2006. It includes code for crawling web pages, sending the results to a server, and generating XML files from them. The client side can be any computer (Windows or Linux), and the server stores all data. Websites that use EasySpider Crawling for Article Writing...
    Downloads: 0 This Week
  • 4

    KB1OIQ - aa-analyzer

    RigExpert AA-xxx antenna analyzer data extraction tool

    aa-analyzer is a Perl script written by KB1OIQ which collects data from a RigExpert AA-xxx antenna analyzer and outputs it as a CSV file, which can be imported into any spreadsheet program. Please note that this software is distributed within the "Andy's Ham Radio Linux" software collection.
    Downloads: 2 This Week
  • 5

    Perl Web Scraping Project

    Perl Web Scraping Project

    Web scraping (web harvesting or web data extraction) is data scraping used for extracting data from websites.[1] Web scraping software may access the World Wide Web directly using the Hypertext Transfer Protocol, or through a web browser. While web scraping can be done manually by a software user, the term typically refers to automated processes implemented using a bot or web crawler.
    Downloads: 0 This Week
  • 6

    EplSite ETL

    ETL Based on Perl With WEB Interface

    EplSite ETL is a tool that makes data migrations easy, performing extraction, transformation, validation, and loading quickly. It was built by people involved in data migrations, so it contains what is needed to carry out a migration (extraction, transformation, validation, and load) and do it well.
    Downloads: 0 This Week
  • 7
    clive is a command-line video extraction tool for YouTube and other similar video websites that require Adobe Flash for viewing the content.
    Downloads: 0 This Week
  • 8
    This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text.
    Downloads: 0 This Week
  • 9
    Batch extraction of RAR compressed archives
    Downloads: 0 This Week
  • 10
    openEAR is the Munich Open-Source Emotion and Affect Recognition Toolkit developed at the Technische Universität München (TUM). It provides efficient (audio) feature extraction algorithms implemented in C++, classifiers, and pre-trained models on well-known emotion databases. It is now maintained and supported by audEERING. Updates will follow soon.
    Downloads: 6 This Week
  • 11
    CDNsim is a GNU/Linux simulation tool for CDNs, written in C++ (core) and Python (GUI wizard). It models redirection policies, cache policies, TCP/IP, batch simulations, statistics extraction, and more. CDNsim uses the OMNeT++ library.
    Downloads: 1 This Week
  • 12
    MutationFinder is a biomedical natural language processing (NLP) system for extracting mentions of point mutations from free text. MutationFinder achieves high performance (99% precision, 81% recall on blind test data) as an information extraction system.
    Downloads: 0 This Week
  • 13
    Stem-Les (Lexicon Extraction Suite) extracts lexical chunks that are relevant in a corpus of documents. If the corpus is bilingual, Stem-Les also finds translation equivalents for the lexical solution selected by the user.
    Downloads: 0 This Week
  • 14
    mpftools is a collection of tools for manipulating mpf files, the Microsoft Media Package Files used by recent versions of Microsoft Office. Currently it contains the Perl script mpfextract, which extracts individual files from mpf files.
    Downloads: 0 This Week
  • 15
    An attempt at creating a music database management system with pluggable backends and frontends. The system should incorporate, as a base, a file system/metadata file data store, CD extraction, and a web/DAAP frontend for listening to music.
    Downloads: 0 This Week
  • 16
    This software allows you to read HP4145 Semiconductor Parameter Analyzer files on a Linux PC.
    Downloads: 0 This Week
  • 17
    ...Users create XML files that are used by PxDBTOFILE to export data from a database to any number of flat text files in variable formats. Great for systems that require database extraction to flat tex
    Downloads: 0 This Week
  • 18
    Mail::MboxParser offers object-oriented access to UNIX mailboxes. Basically, two types of objects exist: mailboxes and single messages. It focuses on easy extraction of MIME parts, parsed headers, and bodies. It is intended for read-only access.
    Downloads: 0 This Week
  • 19
    Administration tools intended for use on Caldera OpenLinux; some tools will be usable on multiple distributions. Current tools include Printer Administration, SMB Connection Administration, and RPM search and extraction tools.
    Downloads: 3 This Week
  • 20
    A solution for managing and tracking test cases and test suites via a web interface. Written in Perl (Practical Extraction and Report Language) with the Catalyst web framework.
    Downloads: 0 This Week