Showing 5 open source projects for "duplicates"

  • 1

    lsdup

    List files with duplicate contents

    lsdup, short for "list duplicates", is a command-line tool that helps find and remove files with identical contents. Unlike many file deduplication tools, it has a "comparison mode": lsdup reads the files in one directory without treating any of them as duplicates, then deletes (or moves) any files in another directory that duplicate them.
    Downloads: 0 This Week
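    The "comparison mode" idea described above can be illustrated with a minimal Python sketch. This is not lsdup's implementation or its command-line interface; the function names, the use of SHA-256 digests as a stand-in for content comparison, and the two-directory layout are all assumptions made for illustration. The sketch only lists candidates and leaves deleting or moving them to the caller.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path, chunk_size: int = 1 << 16) -> str:
    """Return the SHA-256 hex digest of a file, read in fixed-size chunks."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates_of_reference(reference_dir: Path, candidate_dir: Path) -> list[Path]:
    """List files under candidate_dir whose contents match a file under reference_dir.

    Files in reference_dir are only read; they are never reported as duplicates.
    (Hypothetical helper: equal digests are treated as equal contents.)
    """
    reference_digests = {
        file_digest(p) for p in reference_dir.rglob("*") if p.is_file()
    }
    return sorted(
        p
        for p in candidate_dir.rglob("*")
        if p.is_file() and file_digest(p) in reference_digests
    )
```

    A caller could review the returned paths before removing anything, which keeps the read-only directory untouched by construction.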
  • 2
    A Linux/UNIX command-line tool to find duplicate files. The tool supports several criteria for treating a set of files as duplicates of each other (e.g. content, size, name).
    Downloads: 0 This Week
  • 3

    deduper

    A simple tool to merge duplicates in customer records

    Frustrated by multiple records for the same customer that differ only by a typo, an abbreviation, or a different representation of the same address? Try Deduper. Deduper is a simple command-line tool to merge duplicates in customer records. It is based on advanced string matching techniques and clustering. The technique, called blocked nearest neighbor clustering, is further optimized in this tool for the problem of customer merging. Give it a try; we would be happy to hear how it helped. See the wiki for more details on usage: https://sourceforge.net/p/deduper/wiki/Home/
    Downloads: 0 This Week
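    The general idea of blocking followed by string matching can be sketched in a few lines of Python. This is not Deduper's algorithm: it swaps in a simpler scheme (a cheap blocking key, pairwise similarity from the standard library's `difflib.SequenceMatcher`, and single-link grouping with union-find). The blocking key (first character), the 0.85 threshold, and all function names are hypothetical choices for illustration.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def normalize(text: str) -> str:
    """Lowercase, drop periods, turn commas into spaces, collapse whitespace."""
    return " ".join(text.lower().replace(".", "").replace(",", " ").split())


def cluster_records(records: list[str], threshold: float = 0.85) -> list[list[str]]:
    """Group records whose normalized text is similar, comparing only within blocks."""
    normalized = [normalize(r) for r in records]

    # Blocking: only records sharing a cheap key are ever compared.
    # Assumed key: the first character of the normalized record.
    blocks = defaultdict(list)
    for i, text in enumerate(normalized):
        blocks[text[:1]].append(i)

    # Union-find over record indices, with path halving in find().
    parent = list(range(len(records)))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for members in blocks.values():
        for a, b in combinations(members, 2):
            ratio = SequenceMatcher(None, normalized[a], normalized[b]).ratio()
            if ratio >= threshold:
                parent[find(a)] = find(b)  # link the two groups

    clusters = defaultdict(list)
    for i, record in enumerate(records):
        clusters[find(i)].append(record)
    return list(clusters.values())
```

    Blocking keeps the number of pairwise comparisons manageable, at the cost of missing duplicates that land in different blocks (for example, a typo in the first character under this key).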
  • 4

    duff

    Command-line tool for finding duplicate files

    Duff is a Unix command-line utility for quickly finding duplicates in a given set of files. Duff is written in C and should build and run on most modern Unices.
    Downloads: 9 This Week
  • 5
    A small command line tool to process streams of numbers and spit out the running total, bin the results or merge duplicates. Supports arbitrary precision numbers. ocdf is named for the cumulative distribution function in probability.
    Downloads: 0 This Week
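    The three operations mentioned above (running total, binning, merging duplicates) can be sketched in Python. This is not ocdf's interface or implementation; the function names are hypothetical, and `fractions.Fraction` is assumed here as one way to get exact, arbitrary-precision arithmetic on decimal input.

```python
import math
from collections import Counter
from fractions import Fraction
from itertools import accumulate


def parse_numbers(lines):
    """Parse one number per non-blank line into exact Fractions."""
    return [Fraction(line.strip()) for line in lines if line.strip()]


def running_total(values):
    """Return the cumulative sums of the input, in input order."""
    return list(accumulate(values))


def merge_duplicates(values):
    """Return sorted (value, count) pairs, one per distinct value."""
    return sorted(Counter(values).items())


def bin_counts(values, width):
    """Count values per bin; each bin is labeled by its lower edge.

    A value v falls in the bin starting at floor(v / width) * width.
    """
    width = Fraction(width)
    return sorted(Counter(math.floor(v / width) * width for v in values).items())
```

    Dividing each running total by the final total would give an empirical cumulative distribution, which is the connection to the name.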