Showing 232 open source projects for "duplicate"

  • 1
    Czkawka

    Multi-functional app to find duplicates, empty folders, and similar images

    Czkawka (Polish for “hiccup”) is a fast, multi-purpose file cleaning tool written in Rust. It helps users declutter storage by finding duplicate files, similar images or audio, empty folders, and unusually large files, using multithreading to make efficient use of the CPU. It is available in both GUI (GTK-based) and CLI versions.
    Downloads: 243 This Week
  • 2
    FDUPES

    FDUPES is a program for identifying or deleting duplicate files

    FDUPES is a lightweight command-line utility that helps users find and optionally delete duplicate files within specified directories by comparing file contents, which can be extremely useful for cleaning up storage clutter or organizing large collections of files. It works by scanning directories and subdirectories, identifying sets of files with identical content through size and hash comparisons, and then listing them together so users can examine duplicates.
    Downloads: 11 This Week
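The size-then-hash comparison described for FDUPES can be sketched roughly as follows. This is a minimal illustration, not FDUPES's actual implementation: the function names `file_digest` and `find_duplicates` are hypothetical, and the choice of SHA-256 is an assumption made for this sketch.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Return groups (lists of paths) under root whose contents hash identically."""
    # Pass 1: group regular files by size. Files with a unique size
    # cannot have a duplicate, so they never need to be read.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    # Pass 2: hash only the files that share a size with another file.
    by_hash = defaultdict(list)
    for size, paths in by_size.items():
        if len(paths) < 2:
            continue
        for path in paths:
            by_hash[(size, file_digest(path))].append(path)

    # Keep only groups that contain at least two files.
    return [sorted(paths) for paths in by_hash.values() if len(paths) > 1]
```

Calling `find_duplicates("some/directory")` returns one list per set of matching files, which mirrors the idea of listing duplicates together for review. Grouping by size first is a cheap filter that avoids hashing most files.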
  • 3
    Beets

    Open-source music library management system

    Beets catalogs your music collection with a variety of tools for manipulating and accessing music.
    Downloads: 8 This Week
  • 4
    Mihomo

    A simple Python Pydantic model for Honkai

    Mihomo is a Python client library leveraging Pydantic to model parsed Honkai: Star Rail user data from the Mihomo public API. It provides structured types, type hints, and convenience methods to fetch and transform player profiles, daily stats, and character details efficiently.
    Downloads: 65 This Week
  • 5
    gopkg

    Example for the go pkg's function

    ...The maintainer provides conventions for naming and formatting, and requires that submitted examples compile and run locally or on the Go Playground, which helps keep the collection reliable. A todo.md file coordinates contributions so collaborators do not duplicate work, and there is an associated chat group for discussion. Overall, gopkg acts like a practical cookbook or living “examples companion” for the Go standard library.
    Downloads: 1 This Week
  • 6
    supabase-py

    Python Client for Supabase. Query Postgres from Flask, Django

    Python Client for Supabase. Query Postgres from Flask, Django, FastAPI. Python user authentication, security policies, edge functions, file storage, and realtime data streaming. Good first issue.
    Downloads: 3 This Week
  • 7
    Syncpack

    Consistent dependency versions in large JavaScript Monorepos

    Syncpack is a command-line tool for managing and synchronizing dependency versions across JavaScript monorepos. It ensures consistency in package versions within a project, preventing conflicts caused by mismatched dependencies.
    Downloads: 4 This Week
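The kind of mismatch Syncpack is described as preventing can be illustrated with a short sketch. This is not Syncpack's implementation or API: `find_mismatches` and `DEPENDENCY_FIELDS` are hypothetical names, the manifests are assumed to be already-parsed package.json contents, and version ranges are compared as plain strings.

```python
from collections import defaultdict

# package.json fields that declare dependencies (assumed set for this sketch).
DEPENDENCY_FIELDS = ("dependencies", "devDependencies", "peerDependencies")


def find_mismatches(manifests):
    """Return {dependency: {version: [package names]}} for every dependency
    that is declared with more than one version string across the monorepo.

    `manifests` maps a workspace package name to its parsed package.json dict.
    """
    seen = defaultdict(lambda: defaultdict(list))
    for pkg_name, manifest in manifests.items():
        for field in DEPENDENCY_FIELDS:
            for dep, version in manifest.get(field, {}).items():
                seen[dep][version].append(pkg_name)
    return {
        dep: {version: sorted(users) for version, users in versions.items()}
        for dep, versions in seen.items()
        if len(versions) > 1
    }


if __name__ == "__main__":
    manifests = {
        "web": {"dependencies": {"react": "^18.2.0", "lodash": "^4.17.21"}},
        "admin": {"dependencies": {"react": "^17.0.2", "lodash": "^4.17.21"}},
    }
    print(find_mismatches(manifests))
```

In the example, only `react` is reported, because `lodash` is declared with the same version string in both packages.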
  • 8
    Calibre-Web Automated

    Calibre-Web but Automated and with tons of New Features

    Calibre-Web-Automated (CWA) is an all-in-one, self-hosted solution for managing an ebook library that combines the modern, lightweight web UI style of Calibre-Web with the deeper tooling and conversion capabilities associated with Calibre. The goal is to reduce the common “two-service” setup where users run Calibre-Web for browsing and Calibre separately for conversions, metadata fixes, and automation, by packaging those workflows together in a single system. CWA keeps the familiar strengths...
    Downloads: 1 This Week
  • 9
    Percona Toolkit

    A collection of advanced open source command-line tools

    ...Utilities such as pt-query-digest analyze slow logs and packet captures to surface hotspots and regressions, while pt-online-schema-change applies ALTERs with minimal blocking by copying and swapping tables. Consistency tools like pt-table-checksum and pt-table-sync detect and reconcile replication drift across replicas or shards. Other helpers find duplicate indexes, unused keys, skewed data distributions, and configuration issues that quietly erode performance. Each tool favors guardrails: dry runs, throttling, and resumable operations reduce the risk of harming production workloads. Operators and DBAs rely on the toolkit to bring observability and predictability to busy database fleets without hand-rolled scripts.
    Downloads: 0 This Week
  • 10
    Grabber

    Very customizable imageboard/booru downloader with powerful filenaming

    Very customizable imageboard/booru browser and mass downloader with powerful file naming features. You set a filename pattern and a save directory using the available tokens, and the program generates each filename from the image's information. This lets you store and manage your pictures in advanced directory structures and save images with custom filenames. Imageboard/booru downloader which can download thousands of images from...
    Downloads: 16 This Week
  • 11
    janitor

    Simple tools for data cleaning in R

    janitor provides simple, convenient tools for data cleaning, formatting, and exploration in R. It is especially useful for cleaning messy data frames, removing duplicates, formatting column names, and producing frequency tables in a tidy workflow.
    Downloads: 0 This Week
  • 12
    Laravel Gamify

    Gamify your Laravel app with Reputation Points & Achievements Badges

    Use qcod/laravel-gamify to quickly add reputation points & badges to your Laravel app. If you are installing on Laravel 5.4 or lower, you will need to manually register the Service Provider by adding it to the providers array in config/app.php.
    Downloads: 0 This Week
  • 13
    FramePack

    Let's make video diffusion practical

    FramePack explores compact representations for sequences of image frames, targeting tasks where many near-duplicate frames carry redundant information. The idea is to “pack” frames by detecting shared structure and storing differences efficiently, which can accelerate training or inference on video-like data. By reducing I/O and memory bandwidth, datasets become lighter to load while models still see the essential temporal variation. The repository demonstrates both packing and unpacking steps, making it straightforward to integrate into preprocessing pipelines. ...
    Downloads: 13 This Week
  • 14
    DefectDojo

    DefectDojo is a DevSecOps and vulnerability management tool

    DefectDojo is a security orchestration and vulnerability management platform. It lets you manage your application security program, maintain product and application information, triage vulnerabilities, and push findings to systems like JIRA and Slack. DefectDojo enriches and refines vulnerability data using a number of heuristic algorithms that improve the more you use the platform. DefectDojo integrates with 85+ security tools. DefectDojo has bi-directional integration with...
    Downloads: 5 This Week
  • 15
    Laravel Visits

    Laravel Visits is a counter that can be attached to any model

    Laravel Visits is a counter that can be attached to any model to track its visits with useful features like IP protection and lists caching.
    Downloads: 1 This Week
  • 16
    With Advisory Lock

    Advisory locking for ActiveRecord

    with_advisory_lock is a Ruby gem that provides a simple and thread-safe way to use PostgreSQL advisory locks within Ruby applications. It allows developers to wrap critical code sections with database-level locks to prevent race conditions across concurrent processes or threads. It’s especially useful in distributed systems, background job processing, or scheduled tasks where application-level mutexes aren't sufficient.
    Downloads: 0 This Week
  • 17
    Discord.SortedSet

    Elixir SortedSet backed by a Rust-based NIF

    SortedSet NIF is a performant and reliable sorted set data structure for Elixir, implemented in Rust using the Rustler crate to take advantage of native performance while maintaining seamless integration with the BEAM ecosystem. It provides ordering and uniqueness guarantees, with all terms stored according to Elixir’s built-in sorting rules. Internally, it uses a vector of vectors layout rather than a single vector to minimize costly reallocations, allowing efficient bucket pointer copying...
    Downloads: 4 This Week
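The "vector of vectors" layout mentioned for the SortedSet can be sketched in a few lines. This is an illustration of the general idea only, not the project's Rust code: the class name `BucketedSortedSet` and the `bucket_size` parameter are hypothetical, and Python's comparison rules stand in for Elixir's term ordering.

```python
import bisect


class BucketedSortedSet:
    """Sorted set stored as a list of small sorted buckets.

    An insert shifts elements only within one bucket, and an overfull
    bucket is split in two, so a single huge array never has to be
    reallocated and copied as a whole.
    """

    def __init__(self, bucket_size=4):
        self.bucket_size = bucket_size  # hypothetical tuning knob
        self.buckets = [[]]

    def _find_bucket(self, item):
        # Binary search for the first bucket whose last element is >= item;
        # falls back to the final bucket when item is larger than everything.
        lo, hi = 0, len(self.buckets) - 1
        while lo < hi:
            mid = (lo + hi) // 2
            if self.buckets[mid] and self.buckets[mid][-1] < item:
                lo = mid + 1
            else:
                hi = mid
        return lo

    def add(self, item):
        """Insert item; return True if added, False if already present."""
        idx = self._find_bucket(item)
        bucket = self.buckets[idx]
        pos = bisect.bisect_left(bucket, item)
        if pos < len(bucket) and bucket[pos] == item:
            return False  # uniqueness guarantee
        bucket.insert(pos, item)
        if len(bucket) > self.bucket_size:
            mid = len(bucket) // 2
            self.buckets[idx:idx + 1] = [bucket[:mid], bucket[mid:]]
        return True

    def __contains__(self, item):
        bucket = self.buckets[self._find_bucket(item)]
        pos = bisect.bisect_left(bucket, item)
        return pos < len(bucket) and bucket[pos] == item

    def to_list(self):
        """Return all items in sorted order."""
        return [x for bucket in self.buckets for x in bucket]
```

Adding items in any order and then calling `to_list()` yields them sorted and without duplicates, while `self.buckets` shows how the data is spread across several small lists.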
  • 18
    parcel/css

    A CSS parser, transformer, and minifier written in Rust

    ...Many other CSS parsers treat property values as an untyped series of tokens. This means that each transformer that wants to do something with these values must interpret them itself, leading to duplicate work and inconsistencies. @parcel/css parses all values using the grammar from the CSS specification and exposes a specific value type for each property. Built on the cssparser and selectors crates created by Mozilla and used by Firefox and Servo. These provide a solid general-purpose CSS-parsing foundation on top of which @parcel/css implements support for all specific CSS rules and properties. ...
    Downloads: 1 This Week
  • 19
    React Three Fiber

    A React renderer for Three.js

    Build your scene declaratively with re-usable, self-contained components that react to state, are readily interactive, and can tap into React's ecosystem. There is no hard dependency on a particular three.js version, and it does not wrap or duplicate a single three.js class. It merely expresses three.js in JSX: <mesh /> becomes new THREE.Mesh(), and that happens dynamically. There is no additional overhead. Components participate in a unified render loop outside of React. It outperforms three.js at scale due to React's scheduling abilities. You need to be versed in both React and three.js before rushing into this. ...
    Downloads: 1 This Week
  • 20
    Micronaut Data

    Ahead of Time Data Repositories

    ...This model consumes significant memory, and memory requirements grow as your application grows. The problem is worse when combined with Hibernate, which maintains its own meta-model, as you end up with duplicate meta-models. Micronaut Data instead moves this model into the compiler. Both GORM and Spring Data use regular expressions and pattern matching in combination with runtime-generated proxies to translate a method definition on a Java interface into a query at runtime. No such runtime translation exists in Micronaut Data; this work is carried out by the Micronaut compiler at compilation time.
    Downloads: 0 This Week
  • 21
    ydata-profiling

    Create HTML profiling reports from pandas DataFrame objects

    The primary goal of ydata-profiling is to provide a one-line Exploratory Data Analysis (EDA) experience that is consistent and fast. Like the handy pandas df.describe() function, ydata-profiling delivers an extended analysis of a DataFrame, and the analysis can be exported in formats such as HTML and JSON.
    Downloads: 0 This Week
  • 22
    Useful Scripts

    Useful scripts for making developer's everyday life easier

    ...It is used to quickly troubleshoot performance problems: it automatically finds how many threads are consumed in the running process and prints their thread stacks to determine the method calls that cause performance problems. Find duplicate classes in jar files and class directories. Used to troubleshoot Java class conflicts.
    Downloads: 0 This Week
  • 23
    Nano ID

    A secure, URL-friendly, unique string ID generator for JavaScript

    Nano ID is a library for generating random IDs. As with UUID, there is a probability of duplicate IDs. However, this probability is extremely small. Meanwhile, many projects generate only a small number of IDs. For those projects, the ID length could be reduced without risk. This calculator aims to help you determine how far the ID length can be reduced. Instead of using the unsafe Math.random(), Nano ID uses the crypto module in Node.js and the Web Crypto API in browsers. ...
    Downloads: 2 This Week
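The trade-off between ID length and duplicate probability described for Nano ID can be estimated with the birthday approximation. The sketch below is not Nano ID's code: it is written in Python, uses the standard `secrets` module as the cryptographically secure random source, and the names `generate_id` and `collision_probability` as well as the ordering of `ALPHABET` are assumptions for illustration.

```python
import math
import secrets

# A 64-character URL-friendly alphabet (ordering chosen for this sketch).
ALPHABET = "_-0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ"


def generate_id(size=21):
    """Return a random ID of `size` characters from a secure random source."""
    return "".join(secrets.choice(ALPHABET) for _ in range(size))


def collision_probability(num_ids, size=21, alphabet_len=len(ALPHABET)):
    """Approximate the chance of at least one duplicate among num_ids IDs.

    Birthday approximation: p ~= 1 - exp(-n * (n - 1) / (2 * N)),
    where N = alphabet_len ** size is the number of possible IDs.
    """
    space = alphabet_len ** size
    # expm1 keeps precision when the probability is very close to zero.
    return -math.expm1(-num_ids * (num_ids - 1) / (2 * space))


if __name__ == "__main__":
    print(generate_id())
    for size in (8, 12, 21):
        print(size, collision_probability(1_000_000, size=size))
```

Running the loop shows how quickly the estimated duplicate probability drops as the length grows, which is the kind of comparison a project generating only a few IDs could use to pick a shorter length.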
  • 24
    Tegon

    Tegon is an open-source, AI-first alternative to Jira, Linear

    Tegon is an AI-first, open-source issue tracker that uses AI to automate manual tasks and workflows and to provide engineers with more context for a given task. Manual efforts in task management, such as task triaging and backlog maintenance, can be time-consuming. Engineers often lose time navigating multiple platforms to gather task context, rather than accessing details within the task itself. Issue tracking tools serve as a task database, directing engineers on what to work...
    Downloads: 1 This Week
  • 25
    duf

    Disk usage and free utility, a better 'df' alternative

    ...You can simply start duf without any command-line arguments. If you supply arguments, duf will only list specific devices & mount points. If you want to list everything (including pseudo, duplicate, inaccessible file systems), you can. You can show and hide specific tables. You can also show and hide specific filesystems. List inode information instead of block usage. If duf doesn't detect your terminal's colors correctly, you can set a theme. Users of oh-my-zsh should be aware that it already defines an alias called duf, which you will have to remove in order to use duf.
    Downloads: 1 This Week