CLI utility to find near duplicate images and remove all but the best copy.
-
Updated
May 30, 2024 - Python
CLI utility to find near duplicate images and remove all but the best copy.
Sift duplicate whitespaces away!
Analyse 2 paths to found identical files and hard link them to save space
Golang structured logging (slog) deduplication and sorting for use with json logging
A CLI tool for images analysis: checking image integrity, images deduplication, image retrieval.
String deduplication package for Go
Yet another tool to find and remove duplicate files.
distill large scale web page text
Find (partial content) duplicate files.
Print FastCDC rolling hash chunks and checksums.
BenSP is a suite of parameterizable benchmarks for stream parallelism which is used to evaluate stream processing characteristics.
Parallel Patterns Implementation of PARSEC Benchmark Applications
Detect and optionally delete duplicate files in a directory tree
Remove local files that are duplicates of files in another path
Add a description, image, and links to the dedup topic page so that developers can more easily learn about it.
To associate your repository with the dedup topic, visit your repo's landing page and select "manage topics."