Showing 60 open source projects for "duplicate linux"

  • 1
    Zingg

    Scalable master data management and identity resolution

    Zingg is an open-source entity resolution and master data management platform for finding duplicate, related, or matching records across large datasets. It uses machine learning to learn how records should be compared, reducing the need for brittle hand-written matching rules. The project is designed for data engineering and analytics teams working on customer 360, supplier 360, deduplication, fuzzy matching, data quality, and golden record workflows. Zingg runs on Apache Spark and can scale...
    Downloads: 8 This Week
  • 2
    Micronaut Data

    Ahead of Time Data Repositories

    Micronaut Data is a database access toolkit that uses Ahead of Time (AoT) compilation to pre-compute queries for repository interfaces that are then executed by a thin, lightweight runtime layer. Both GORM and Spring Data maintain a runtime meta-model that uses reflection to model relationships between entities. This model consumes significant memory and memory requirements grow as your application size grows. The problem is worse when combined with Hibernate which maintains its own...
    Downloads: 2 This Week
  • 3
    DupRem

    Simple application to remove duplicate and empty lines on text files.

    DupRem is a simple, easy-to-use, cross-platform application that removes duplicate and empty lines from any text file. Matching can be case-sensitive or ignore case. It also works from the command line: "java -jar duprem.jar -r input_file.txt >output_file.txt" creates the output file, and "java -jar duprem.jar -r input_file.txt >>output_file.txt" creates or appends to it. DupRem is portable, needs no installation, and is developed in Java, so needs the Java Virtual...
    Downloads: 0 This Week
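
The behavior described for DupRem (dropping duplicate and empty lines, with optional case-insensitive matching) can be illustrated with a minimal Python sketch. This is not DupRem's code: the function name `remove_duplicate_lines`, the choice to keep the first occurrence of each line, and treating whitespace-only lines as empty are all assumptions made for illustration.

```python
def remove_duplicate_lines(lines, ignore_case=False):
    """Drop empty lines and repeated lines, keeping first occurrences in order.

    Hypothetical helper for illustration; not part of DupRem itself.
    """
    seen = set()
    result = []
    for line in lines:
        text = line.rstrip("\r\n")
        if not text.strip():
            continue  # assumption: whitespace-only lines count as empty
        # casefold() gives a case-insensitive comparison key
        key = text.casefold() if ignore_case else text
        if key in seen:
            continue  # a line with this key was already kept
        seen.add(key)
        result.append(text)
    return result


if __name__ == "__main__":
    sample = ["apple\n", "\n", "Apple\n", "apple\n", "pear\n"]
    print(remove_duplicate_lines(sample))                    # ['apple', 'Apple', 'pear']
    print(remove_duplicate_lines(sample, ignore_case=True))  # ['apple', 'pear']
```

Using a set for the seen keys keeps the pass linear in the number of lines, at the cost of holding every distinct line in memory.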
  • 4
    File-Studio

    A tool that automates complex file operations.

    File-Studio is a tool that assists with complex file operations such as bulk renaming, organizing folders, and more.
    Downloads: 11 This Week
  • 5
    Picsimilar

    Search and compare similar and identical photos.

    Use reverse image search to find similar and duplicate images in your local photo collection, and use the comparison features to select the best images in a set of similar ones.
    Downloads: 2 This Week
  • 6
    cliper

    Java3d scenegraph editor

    Create 3D scenes linked together in short films through simple menus. Installation instructions are in readme.txt in the Files tab. 1-Scene/backstage: Open and close scenes. Choose a background, generate a terrain, set lights, fog, camera position, sky picture, and scene duration. 2-Draw/placement: Import objects (OBJ, C3D, 3DS), or create shapes (sphere, cone, etc.). Place them and set size and dimensions, duplicate, group. 3-Color/texture: Apply textures, colors, transparency...
    Downloads: 0 This Week
  • 7
    ChoiceMaker
    Record matching software
    Downloads: 0 This Week
  • 8
    MinPOS

    Stable POS software, free and developed from Openbravo POS

    Release: v4.02. Welcome to the MinPOS family. MinPOS is Point of Sale software licensed as free software. You can use it and update it for free, forever. It is developed from Openbravo POS, with many modifications. See the Wiki for installation instructions. Supports multiple languages (English, French, Vietnamese...). Runs well for: Retail, Market, Restaurant, Hotel, Motel, Karaoke, Nail, Billiard... In this release (4.02): - Updated the MySQL script, created for all versions of MySQL and MariaDB. - Updated to JDK 8. -...
    Downloads: 16 This Week
  • 9
    myjaphoo
    MyJaPhoO (My Java Photo Organizer) manages local photo and video collections.
    Downloads: 0 This Week
  • 10

    MarDRe

    MapReduce-based tool to remove duplicate DNA reads

    MarDRe is a de novo MapReduce-based parallel tool to remove duplicate and near-duplicate DNA reads through the clustering of single-end and paired-end sequences from FASTQ/FASTA datasets. This tool lets bioinformaticians avoid analyzing unnecessary reads, reducing the time of subsequent procedures with the dataset. MarDRe is the Big Data counterpart of ParDRe (link above), which employs HPC technologies (i.e., hybrid MPI/multithreading) to reduce runtime on multicore systems....
    Downloads: 0 This Week
  • 11
    SnapRAID duplicate file manager. Tool designed to aid the user in deleting duplicate files identified in a SnapRAID installation.
    Downloads: 0 This Week
  • 12

    Uniqueoid

    Simple & adorable program to eliminate copies of files.

    Uniqueoid is designed for searching for and deleting copies of files. It analyses target files and folders and offers to keep one of the duplicates and eliminate the rest. Search results can be saved and loaded, so you can clean a disk bit by bit. Uniqueoid keeps track of files that become unique even after the search has ended, so unique files can't be deleted accidentally. Uniqueoid can also automatically choose or ignore files by directory or path prefix. All these features in combine...
    Downloads: 0 This Week
  • 13
    DataCleaner

    Data quality analysis, profiling, cleansing, duplicate detection +more

    DataCleaner is a data quality analysis application and a solution platform for DQ solutions. Its core is a strong data profiling engine, which is extensible and thereby adds data cleansing, transformations, enrichment, deduplication, matching and merging. Website: http://datacleaner.github.io
    Downloads: 13 This Week
  • 14

    OovAide

    C++, Java IDE with auto class, sequence, zone, dependency, diagrams

    The OovAide project used to be named oovcde. Searching the web will bring up more information about oovcde at this time. The OovAide project is a C++ or Java analysis IDE for Windows or Linux with an automated multi-tasking build system, cross compiler support, an analysis tool based on Clang that creates UML class, component, sequence as well as zone and portion diagrams from C++ or Java source, static analysis and test coverage. The diagrams allow navigation through the source code,...
    Downloads: 0 This Week
  • 15

    BridgeBuddy

    Bridge duplicate tournament card play program

    Program for bridge card tournaments. Users set up tournaments by defining the tournament type (Howell or Mitchell), the number of tables (in each section), the number of sections, and Bridgemates, the possible use of a watch to oversee and follow the current tournament, and some other information related to results, HTML pages, and more. Users also have to decide the dates for the tournament, names and colors for sections, the sequence of pairs, etc. After each tournament date, users can mark double substitutes, get results...
    Downloads: 3 This Week
  • 16

    OJson

    Optimised JSON (JavaScript Object Notation)

    Optimise JSON by removing duplicate strings and arrays containing repeated object keys. Here you will find binary downloads and discussion (https://sourceforge.net/p/ojson/discussion/). The actual development and issue tracking can be found here: https://bitbucket.org/cryanfuse/ojson
    Downloads: 0 This Week
  • 17

    WebCorpus

    Hadoop framework for scalable processing of large web corpora

    WebCorpus is a Hadoop-based framework that enables you to calculate statistics on large web corpora extracted from web crawls.
    Downloads: 0 This Week
  • 18

    xorlisp

    Bit level lambda continuations and nothing else - Queue automata

    Not working yet. To deal with the Halting Problem, computing and data are navigated using debugger ops: linearForward and treeForward, which navigate an astronomically large bit string where 1 is ( and 0 is ). All pairs are derived from (). For example, true is represented as ((()())()), and false is (()(()())). It appears related to the Church encoding of lambda, where T chooses the first parameter and F chooses the second, of a pair. Continuations are nearly finished code and are represented as a...
    Downloads: 0 This Week
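
The bit mapping stated in the xorlisp description (1 is "(" and 0 is ")") can be made concrete with a short Python sketch. It only converts the two example strings quoted above into bits and checks that they are balanced; the function names `parens_to_bits` and `is_balanced` are hypothetical and nothing here reflects how xorlisp itself is implemented.

```python
def parens_to_bits(expr):
    """Map '(' to '1' and ')' to '0', as the description states."""
    bits = []
    for ch in expr:
        if ch == "(":
            bits.append("1")
        elif ch == ")":
            bits.append("0")
        else:
            raise ValueError(f"unexpected character: {ch!r}")
    return "".join(bits)


def is_balanced(expr):
    """Check that every ')' closes an earlier '(' and none is left open."""
    depth = 0
    for bit in parens_to_bits(expr):
        depth += 1 if bit == "1" else -1
        if depth < 0:
            return False  # a ')' appeared before its matching '('
    return depth == 0


if __name__ == "__main__":
    TRUE = "((()())())"   # quoted from the project description
    FALSE = "(()(()()))"  # quoted from the project description
    print(parens_to_bits(TRUE), is_balanced(TRUE))    # 1110100100 True
    print(parens_to_bits(FALSE), is_balanced(FALSE))  # 1101101000 True
```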
  • 19
    EZreclaim

    Reclaim space taken by duplicate files

    Reclaim space taken by duplicate files, with the option of deleting duplicates or replacing them with hard links. CAUTION: MAKE SURE YOU HAVE A BACKUP OF YOUR DATA BEFORE RUNNING THIS PROGRAM. NEVER RUN AGAINST SYSTEM FILES OR FOLDERS AS YOUR SYSTEM MAY BE PERMANENTLY DAMAGED.
    Downloads: 0 This Week
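
The idea of replacing duplicate files with hard links can be sketched in Python as follows. This is an illustration of the general technique, not EZreclaim's code: the function names, the use of SHA-256 to group candidates, and the byte-for-byte check before linking are all assumptions. As with the tool itself, try it only on a disposable directory with a backup in place.

```python
import filecmp
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Group regular files under root by digest; return groups of two or more."""
    by_digest = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            if os.path.isfile(path) and not os.path.islink(path):
                by_digest[file_digest(path)].append(path)
    return [paths for paths in by_digest.values() if len(paths) > 1]


def replace_with_hard_links(paths):
    """Keep the first path; turn the others into hard links to it."""
    keeper, *others = paths
    for dup in others:
        if os.path.samefile(keeper, dup):
            continue  # already the same underlying file
        if not filecmp.cmp(keeper, dup, shallow=False):
            continue  # digests matched but bytes differ; leave it alone
        tmp = dup + ".hardlink-tmp"  # hypothetical temporary name
        os.link(keeper, tmp)         # create the new link first
        os.replace(tmp, dup)         # then swap it over the duplicate


if __name__ == "__main__":
    import sys
    for group in find_duplicates(sys.argv[1]):
        replace_with_hard_links(group)
```

Creating the link under a temporary name and then renaming it over the duplicate means the duplicate's path never disappears part-way through. Hard links only work within one file system, and edits made through one linked path show up in all of them.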
  • 20

    CloneManager

    Find duplicate files

    Two goals: find duplicated files on a computer, and find files that haven't been backed up yet.
    Downloads: 0 This Week
  • 21

    Yum Package Repair

    Tool that fixes problems that Yum doesn't fix

    A tool that verifies your installed packages and reinstalls them if there are any problems. I plan to add duplicate package fixing to this tool the next time I get that problem. You should turn yum automatic updates off while running this utility. It calls rpm and yum in batches, so automatic updates may be able to grab the lock. Requires rpm, yum, bash, and Java 8 set as the default java.
    Downloads: 0 This Week
  • 22

    Fast Duplicate File Finder

    Superfast, lightweight, simple duplicate file finder

    A simple, super-fast, lightweight duplicate file finder written in Java. System requirements: Java 1.6+, 20 KB of disk space (lol :D)
    Downloads: 2 This Week
  • 23
    MP3 ID3 Java Tag Library Tool
    This Java software sets id3v1 and id3v2 tags on mp3 files, optionally in bulk. It helps you manage a song database and search for duplicate songs. It has a file rename utility, not only for mp3 files. It has a utility panel to create playlists and re-organize files in folders with the name format you prefer, and a utility to optimize CD disk space. This program was written in 2002, so I don't remember much about it ... but I used it a lot, as much as my close...
    Downloads: 0 This Week
  • 24

    ourfilesystem

    Peer-to-peer file-sharing system.

    The primary purpose is to provide a means of organizing, cataloging and sharing your files. You can add information about files and perform sophisticated search queries to find files of interest and open them. Furthermore, if you share a file with someone else, you'll benefit from any further information they share about the file, even if they have an independent copy. For example, someone else may add a preview to a file, and as long as the SHA512 digest of the file matches your copy,...
    Downloads: 0 This Week
  • 25

    Funnel Sort

    Command-line sort utility

    Moved to GitHub (https://github.com/fedups). Funnel is a command-line utility to sort files, large and small. It efficiently handles fixed-length and variable-length records, and easily handles ASCII (readable) data and binary data. There are many more features in Funnel. It is easy to use and very fast. All documentation is on the Wiki.
    Downloads: 0 This Week