Search Results for "data matching" - Page 3

Showing 141 open source projects for "data matching"

  • 1
    AvaSattva

    Search replace files or pipe

    ... Search or replace text in files or piped input using plain-text or regex syntax. It also suits ETL-like work: load and filter files -> extract -> transform output. When replacing in files, you can preview and back up changes across multiple directories and files or piped input, using plain-text matching or the general regex syntax found in C++, C#, Java, and Scala. msr is a good tool for learning and testing regex, since it shows the matched groups captured by the pattern in different colors.
    Downloads: 0 This Week
  • 2
    sRNAWorkbench

    The UEA sRNA Workbench

    A suite of tools for analysing small RNA (sRNA) data from Next Generation Sequencing devices, including expression profiling of known micro RNA (miRNA), identification of novel miRNA in deep-sequencing data, and identification of other interesting landmarks within high-throughput genetic data.
    Downloads: 7 This Week
  • 3
    Sourcetrail

    Free and open-source interactive source explorer

    Sourcetrail is a free and open-source cross-platform source explorer that helps you get productive on unfamiliar source code. Building Sourcetrail requires several dependencies to be in place on your machine. However, our CMake based setup allows disabling indexing support for specific languages which reduces the number of dependencies to a minimum. Sourcetrail is an interactive source explorer that simplifies navigation in existing source code by indexing your code and gathering data about its...
    Downloads: 55 This Week
  • 4
    SubEthaEdit

    General purpose plain text editor for macOS

    ... Have multiple search result windows open at the same time. Constrain search to just preselected parts of your document. See the structure and depth of your documents in the line number and folding ribbon. Fold text away based on depth or however you want. Select lines and/or depth easily with a single click. Select to the matching brace with a simple double-click.
    Downloads: 0 This Week
  • 5
    The flibs project is a growing collection of (small) Fortran libraries and modules: from manipulating strings and pattern matching to abstract data types
    Downloads: 8 This Week
  • 6
    pcp - Peer Copy

    Command line peer-to-peer data transfer tool based on libp2p

    Command line peer-to-peer data transfer tool based on libp2p. There already exists a long list of file transfer tools, so why bother building another one? The problem I had with the existing tools is that they rely on a limited set of servers to orchestrate peer matching and data relaying which poses a centralization concern. Many of the usual centralization vs. decentralization arguments apply here, e.g. the servers are single points of failure, the service operator has the power over whom...
    Downloads: 0 This Week
  • 7
    MMMISA
    MMMISA: a free, user-friendly tool for single/dual-modality DICOM file analysis. MMMISA will be included as part of version 2 of DeepImageTranslator (https://sourceforge.net/projects/deepimagetranslator/). Citation: Ye RZ et al. DeepImageTranslator V2: analysis of multimodal medical images using semantic segmentation maps generated through deep learning. bioRxiv.
    Downloads: 0 This Week
  • 8
    Vector AI

    A platform for building vector based applications

    Vector AI is a framework designed to make building production-grade vector-based applications as quick and easy as possible. Create, store, manipulate, search and analyze vectors alongside JSON documents to power applications such as neural search, semantic search, personalized recommendations, etc. Image2Vec, Audio2Vec, etc. (any data can be turned into vectors through machine learning). Store your vectors alongside documents without having to do a db lookup for metadata about...
    Downloads: 0 This Week
  • 9
    drive

    Google Drive client for the commandline

    drive is a tiny program to pull or push Google Drive files. Make sure to set your GOPATH in your env, .bashrc or .bash_profile file. You can install scripts for automating major drive commands and syncing from drive-google wiki, also described in platform_packages.md. See file Makefile which currently supports cross compilation. Just run make and then inspect the binaries in directory bin. drive supports resource configuration files (.driverc) that you can place both globally (in your home...
    Downloads: 1 This Week
  • 10
    Evolutility-UI-jQuery

    Model-driven Web UI for CRUD using REST or localStorage

    Evolutility-UI-jQuery provides a set of generic Backbone Views to browse, edit, filter, export and chart Backbone models and collections of different structures. With it you can make web applications by configuring views with metadata instead of hand-coding templates, JavaScript and CSS.
    Downloads: 0 This Week
  • 11
    Drive Badger

    Open source platform for covert data exfiltration operations.

    Drive Badger is a software tool for data exfiltration, that is, for copying data from a computer to an external USB drive. Unlike many other tools in the IT security area, it is not a proof-of-concept tool that brings groundbreaking techniques. Everything Drive Badger does can also be run manually, step by step. What Drive Badger really does is do it all better, by putting the maximum focus on:
    Downloads: 0 This Week
  • 12
    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    fastNLP is a lightweight framework for natural language processing (NLP); its goal is to quickly implement NLP tasks and build complex models. A unified tabular data container simplifies data preprocessing. Built-in Loader and Pipe for multiple datasets, eliminating the need for preprocessing code. Various convenient NLP tools, such as embedding loading (including ELMo and BERT), intermediate data cache, etc. Provides a variety of neural network components and recurrence models...
    Downloads: 0 This Week
  • 13
    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real-time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing a high-performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data Creation...
    Downloads: 15 This Week
  • 14
    ChatterBot

    Machine learning, conversational dialog engine for creating chat bots

    ..., the machine-learning nature of ChatterBot allows an agent instance to improve its own knowledge of possible responses as it interacts with humans and other sources of informative data. An untrained instance of ChatterBot starts off with no knowledge of how to communicate. Each time a user enters a statement, the library saves the text that they entered and the text that the statement was in response to. As ChatterBot receives more input, the number of responses that it can reply with increases.
    Downloads: 6 This Week
  • 15
    HyperGAN

    Composable GAN framework with api and user interface

    A composable GAN built for developers, researchers, and artists. HyperGAN builds generative adversarial networks in PyTorch and makes them easy to train and share. HyperGAN is currently in pre-release and open beta. Everyone will have different goals when using HyperGAN. We are still searching for a default cross-data-set configuration. Each of the examples supports search. Automated search can help find good configurations. If you are unsure, you can start...
    Downloads: 0 This Week
  • 16

    diffpak

    A huge files block differential compressor

    A differential compressor (or a binary diff) for huge files. Unlike other tools, e.g. xdelta3, it searches for matching data through the whole source file, even if it weighs several gigabytes, while using much less memory (with the default configuration, approx. 25x less than the size of the source file). Output files are not compressed, so you can use any compressor you like with great results. It is quite fast for very similar files (about the speed of the HDD; however, it reads input files twice...
    Downloads: 0 This Week
  • 17
    Taxamatch

    Fuzzy name matching algorithm for scientific names of taxa (biology)

    "Taxamatch" is an algorithm designed for fuzzy matching of scientific names of taxa - genera alone, or binomials (genus+species) - in taxonomic databases. It utilises both character substitution (similar to Soundex) to catch phonetic errors, and a customised edit distance (ED) approach to catch non-phonetic ones, which can be up to 50% of all errors in real-world queries. Since ED-based queries are typically slow against large data sets, Taxamatch includes a range of optimisations to heavily...
    Downloads: 0 This Week
  • 18
    Jack LIRE (Linked Record)

    Quantum universal object system with referencing and genetics

    Jack LIRE (Linked Record) is a quantum database-backed life-long text record diary with referencing (linking) between records, and a data library for life-long keeping of files such as documents, photographs, music and more. The diary records are fully searchable. Files can be mentioned in diary records to preserve memories, document knowledge and ideas, and more. Massive numbers of diary records and files can be permanently kept. A database located on one computer can be accessed from another...
    Downloads: 0 This Week
  • 19
    ConvertAll is yet another unit converter. But it can combine the units any way you want.
    Downloads: 30 This Week
  • 20
    MatchZoo

    Facilitating the design, comparison and sharing of deep text models

    The goal of MatchZoo is to provide a high-quality codebase for deep text matching research, such as document retrieval, question answering, conversational response ranking, and paraphrase identification. With a unified data processing pipeline, simplified model configuration and automatic hyper-parameter tuning, MatchZoo is flexible and easy to use. Preprocess your input data in three lines of code, and keep track of parameters to be passed into the model. Make use of MatchZoo...
    Downloads: 0 This Week
  • 21
    JSVerbalExpressions

    JavaScript Regular expressions made easy

    ... VerbalExpression() syntax in ES6 classes as mandated by the spec. You usually wouldn’t need to call VerEx().constructor(). This method will not be accessible from outside the source file since VerEx() returns an instance of the class rather than the class itself. sanitize is a static method on the VerbalExpression class. However, it is also exposed as VerEx().sanitize. Enabling control multi-line matching would cause the expression to not look for matches beyond the first line.
    Downloads: 0 This Week
  • 22
    AQUATONE

    A tool for domain flyovers

    Aquatone is a tool for visual inspection of websites across a large number of hosts and is convenient for quickly gaining an overview of HTTP-based attack surface. Aquatone is designed to be as easy to use as possible and to integrate with your existing toolset with no or minimal glue. Aquatone is started by piping the output of a command into the tool. It doesn't really care how the piped data looks, as URLs, domains, and IP addresses will be extracted with regular expression pattern matching...
    Downloads: 1 This Week
  • 23
    libinstrument (ex libcsdbg)

    C/C++ function instrumentation (AOP, exception/stack tracing, GC)

    Libinstrument is a complete and easy way to do custom function instrumentation for C/C++. The project comes ready with classic applications of instrumentation, such as AOP (Aspect Oriented Programming), Concern and Side Effect injection, stack tracing, exception tracing, basic time profiling, callgraph profiling and others. A GC (Garbage Collector) module is also under development. Libinstrument is highly portable and decoupled. The project comes with a CMake build system, integration with...
    Downloads: 0 This Week
  • 24

    xMSanalyzer

    An R package for metabolomics data extraction and quality assessment

    xMSanalyzer comprises utilities that can be classified into the following main modules: 1) merging apLCMS or XCMS sample processing results from multiple sets of parameter settings, 2) evaluation of sample quality, feature consistency, and batch effect, 3) feature matching, 4) characterization of m/z using KEGG REST, and 5) batch-effect correction using ComBat.
    Downloads: 3 This Week
  • 25

    ENPG

    A tool to extract potential neuropeptides from protein sequence data.

    This tool is intended to extract potential neuropeptide regions from protein sequence data. The currently available version is dedicated to extracting peptides that show the structural hallmarks of cnidarian neuropeptides (C-terminal amidation, proline at N-terminus and pyro-Glutamate). The output FASTA file can be used as a target data set for peptide-spectrum matching to effectively narrow the search space for highly sensitive peptide identifications.
    Downloads: 0 This Week