Showing 152 open source projects for "duplicate files"

View related business solutions
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    Czkawka

    Czkawka

    Multi functional app to find duplicates, empty folders, similar images

    Czkawka (Polish for “hiccup”) is a lightning‑fast, multi‑purpose file cleaning tool written in Rust. It helps users declutter storage by finding duplicate files, similar images or audio, empty folders, and unusually large files through CPU‑efficient multithreading. Available with both GUI (GTK‑based) and CLI versions for flexible usage.
    Downloads: 481 This Week
    Last Update:
    See Project
  • 2
    FDUPES

    FDUPES

    FDUPES is a program for identifying or deleting duplicate files

    FDUPES is a lightweight command-line utility that helps users find and optionally delete duplicate files within specified directories by comparing file contents, which can be extremely useful for cleaning up storage clutter or organizing large collections of files. It works by scanning directories and subdirectories, identifying sets of files with identical content through size and hash comparisons, and then listing them together so users can examine duplicates. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 3
    Duplicate Agent

    Duplicate Agent

    Duplicate Files Finder and Cleaner

    This program was created to detect and delete unnecessary files on your computer that you've unknowingly created, copied, or backed up in some way. You can find the release notes on my Github address: https://github.com/shampuan/Duplicate-Agent/issues
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    SortPhotos

    SortPhotos

    SortPhotos is a Python script that organizes photos and videos

    ...SortPhotos includes options for copying versus moving files, recursive searches, silent or test modes, and customizable start times for when a “day” begins. It also prevents duplicate files by comparing content, with an option to keep duplicates if needed. With support for automation through launch agents or cron jobs, SortPhotos is well-suited for photographers, archivists, and anyone looking to streamline large personal or professional media collections.
    Downloads: 4 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    diskover-community

    diskover-community

    Open source file indexing & storage analytics powered by Elasticsearch

    ...Diskover also helps identify outdated or unused files, duplicate data, and inefficient storage usage that can waste resources or increase operational costs. A Python-based indexing engine performs the scanning and indexing tasks.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    supabase-py

    supabase-py

    Python Client for Supabase. Query Postgres from Flask, Django

    Python Client for Supabase. Query Postgres from Flask, Django, FastAPI. Python user authentication, security policies, edge functions, file storage, and realtime data streaming. Good first issue.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    RemoveDuplicate

    RemoveDuplicate

    A software that can helps you remove duplicate files

    RemoveDuplicate is a Windows application designed to help users find and remove duplicate files within folders. It uses MD5 hashing to identify identical files and provides a user-friendly interface to manage and delete duplicates while keeping one copy of each file.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 8
    Percona Toolkit

    Percona Toolkit

    A collection of advanced open source command-line tools

    Percona Toolkit is a collection of battle-tested command-line tools for MySQL and MariaDB that help diagnose performance, verify integrity, and perform online maintenance safely. Utilities such as pt-query-digest analyze slow logs and packet captures to surface hotspots and regressions, while pt-online-schema-change applies ALTERs with minimal blocking by copying and swapping tables. Consistency tools like pt-table-checksum and pt-table-sync detect and reconcile replication drift across...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Calibre-Web Automated

    Calibre-Web Automated

    Calibre-Web but Automated and with tons of New Features

    Calibre-Web-Automated (CWA) is an all-in-one, self-hosted solution for managing an ebook library that combines the modern, lightweight web UI style of Calibre-Web with the deeper tooling and conversion capabilities associated with Calibre. The goal is to reduce the common “two-service” setup where users run Calibre-Web for browsing and Calibre separately for conversions, metadata fixes, and automation, by packaging those workflows together in a single system. CWA keeps the familiar strengths...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    File Nesting Config for VS Code

    File Nesting Config for VS Code

    Config of File Nesting for VS Code

    ...Because the snippet is generated via a script (update.mjs), it is maintained for many languages, frameworks, and file types. The benefit is much cleaner project file trees, reduced noise, easier navigation, and less time wasted hunting through duplicate or generated files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    trackerslist

    trackerslist

    Updated list of public BitTorrent trackers

    trackerslist is a repository that provides continuously updated lists of public BitTorrent trackers, designed to improve torrent download performance and peer discovery. The project is maintained by an automated system that regularly checks tracker availability, removes duplicates, and ranks them based on reliability and latency. It offers multiple formats of tracker lists, including HTTP, HTTPS, WebSocket, and IP-based versions, making it compatible with a wide range of BitTorrent clients....
    Downloads: 12 This Week
    Last Update:
    See Project
  • 12
    tumblr-crawler

    tumblr-crawler

    Python crawler to download photos and videos from Tumblr blogs

    tumblr-crawler is an open source Python-based utility designed to download media content from Tumblr blogs. It provides a script that automatically retrieves photos and videos from specified Tumblr sites and saves them locally for offline access. Users can specify one or multiple blogs to crawl by editing a configuration file or by passing parameters through the command line. Once executed, the script fetches media from the Tumblr API and stores the downloaded files in folders named after...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Meta Package Manager

    Meta Package Manager

    Wraps all package managers with a unifying CLI

    Meta Package Manager wraps all package managers with a unifying CLI, and provides the MPM CLI, a wrapper around all package managers. MPM is like yt-dlp, but for package managers instead of videos. MPM solves XKCD #1654 - Universal Install Script. List installed packages. List duplicate installed packages. Search for packages. Install a package, remove a package, and list outdated packages. Sync local package infos. Upgrade all outdated packages. Backup list of installed packages to TOML file. Restore/install list of packages from TOML files. Pin-point commands to a subset of package managers (include/exclude selectors). ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 14

    RemoveDuplicateLinesFromTxt

    Remove Duplicate lines From Files in TXT

    This program remove duplicate lines in txt files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    FUSE filesystem over Google Drive

    FUSE filesystem over Google Drive

    FUSE filesystem over Google Drive

    google-drive-ocamlfuse is a FUSE-based file system for Google Drive, written in OCaml. It lets you mount your Google Drive on Linux. The project is hosted on GitHub, where you can find the latest development version. Project documentation is hosted in the website. There are Ubuntu packages in this PPA. Read-only access to Google Docs, Sheets, and Slides (exported to configurable formats). Read-ahead buffers when streaming. Accessing content shared with you (requires configuration). Team...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    parcel/css

    parcel/css

    A CSS parser, transformer, and minifier written in Rust

    A CSS parser, transformer, and minifier written in Rust. Parsing and minifying large files are completed in milliseconds, often with significantly smaller output than other tools. Many other CSS parsers treat property values as an untyped series of tokens. This means that each transformer that wants to do something with these values must interpret them itself, leading to duplicate work and inconsistencies. @parcel/css parses all values using the grammar from the CSS specification and exposes a specific value type for each property. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Useful Scripts

    Useful Scripts

    Useful scripts for making developer's everyday life easier

    ...It is used to quickly troubleshoot performance problems, automatically find out how many threads are consumed in the running process, and print out their thread stacks to determine the method calls that cause performance problems. Find out duplicate classes in jar files and class directories. Used to troubleshoot Javaclass conflicts.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    Swiss File Knife

    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file, find filenames fast using index files, rename many files using expressions, copy part of a file, change times of a file, set file time from filename, print colored text to terminal, convert csv to tab separated, download files from web, send http or udp requests, print tcp or udp traffic, create hexdump of files, join many text files into one, list nested .zip .tar .tar.gz .tar.bz2 archive contents. ...
    Leader badge
    Downloads: 445 This Week
    Last Update:
    See Project
  • 19
    gopkg

    gopkg

    Example for the go pkg's function

    gopkg is a large community-driven repository of examples for Go’s standard library packages, created to fill the gap left by the relatively sparse official examples. The project organizes content so that each package has its own directory and each function within that package gets its own Markdown file with code examples. The idea is that developers can quickly look up “how do I actually use this function?” without digging through source code or scattered blog posts. The maintainer provides...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    PeaZip

    PeaZip

    Free Zip software and Rar extractor

    ...The application provides an unified, natively portable, cross-platform file manager and archive manager GUI for many Open Source technologies like 7-Zip, FreeArc, PAQ, UPX. Create: 7Z, ARC, Brotl, BZip, GZip, PEA, TAR, WIM, XZ, ZPAQ, ZIP, Zstandard files and more Open and extract 200+ file types: ACE, CAB, DEB, ISO, RAR, ZIPX and more Features of PeaZip includes extract, create and convert multiple archives at once, create self-extracting archives (sfx), split files, strong encryption with two factor authentication, encrypted password manager, secure deletion, find duplicate files, calculate hashes, export task definition as command line script.
    Leader badge
    Downloads: 1,559 This Week
    Last Update:
    See Project
  • 21
    SIMILAR

    SIMILAR

    Compares Folders for Same Name and/or Size Files

    I developed this application because I was not happy with existing ones. The program analyzes two folders to determine whether they are identical or different by identifying files with the same name and/or size. The resulting list can be saved as a text file. Files with matching names and/or sizes can be deleted, but only those in the second directory will be removed. You can control which files are deleted through the textbox, as only the files displayed there will be deleted. Vary name...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    Bootleg Sound Processor

    Bootleg Sound Processor

    Software for processing audio files.

    Software for processing audio files. The files "Batch Processor.py" and "Duplicate remover.py" are meant to be used with the output of Bootleg Text Slicer (https://github.com/Northstrix/bootleg-text-slicer) placed into the "Unprocessed" folder, while "Single file processor.py" can be used with standalone files from arbitrary locations. GitHub repository: https://github.com/Northstrix/bootleg-sound-processor Made using Google AI Studio (https://aistudio.google.com/) and Perplexity (https://www.perplexity.ai/)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    SortMyShit

    A tool designed to help you organize and manage your files

    SortMyShit is an open-source Python project designed to help you organize and manage your files effortlessly. It provides customizable sorting rules to keep your directories clean and structured.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    logwitch

    a simple log file scanner for linux written in shell and lua

    logwitch checks single or rotated log files, either plain text or gzipped. It can work with log4j and gnu/linux system logs. You configure it to watch for lines in log files that interest you in by creating a simple text file in its config directory. It emails a daily report to you and timestamps logs so that you do not receive duplicate information. logwitch is GPL3 licensed
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    DupRem

    DupRem

    Simple application to remove duplicate and empty lines on text files.

    DupRem is a simple easy-to-use cross-platform application to remove duplicate and empty lines from any text file. It is also possible to keep or ignore case sensitive. Works also from command-line interface, e.g. "java -jar duprem.jar -r input_file.txt >output_file.txt", to create output file, or "java -jar duprem.jar -r input_file.txt >>output_file.txt", to create or append to output file. DupRem is portable, does not need installation and is developed in Java, so needs the Java Virtual...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB