Identifies and processes duplicate files between a source and target directory.
-
Updated
May 31, 2024 - Python
Identifies and processes duplicate files between a source and target directory.
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
Utilities for KML files
The Panako acoustic fingerprinting system.
Nextcloud Media Duplicate Collector application
Interact, analyze and structure massive text, image, embedding, audio and video datasets
CLI utility to find near duplicate images and remove all but the best copy.
Duplicates Detector is a cross-platform GUI utility for finding duplicate files, allowing you to delete or link them to save space. Duplicate files are displayed and processed on two synchronized panels for efficient and convenient operation.
A plugin that does one thing only: Detect and manage duplicate items in Zotero.
JavaScript utility for suppressing duplicate AWS Lambda invocations
Finds duplicated files fast and efficiently
Search for duplicate files based on extension.
Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store them within a SQLite database; create additional columns to augment the output where useful; and query the SQLite database, outputting results in a readable form useful for analysis by researchers and archivist…
A uniquely crafted image viewer and editor with options to organize files, and maintain large lists of image files for slideshows, dupes detection or other purposes.
Language of Vectors (LangVec) is a simple Python library designed for transforming numerical vector data into a language-like structure using a predefined set of words (lexicon).
Remove Duplicate Messages
A Python library to scan a file system, find duplicated file etc.
Tasks for Advance Natural Language Processing Course @ ITMO University
A script for organising a photo & video library, featuring duplicate removal and a graphical interface.
Add a description, image, and links to the duplicate-detection topic page so that developers can more easily learn about it.
To associate your repository with the duplicate-detection topic, visit your repo's landing page and select "manage topics."