dedup
Here are 25 public repositories matching this topic...
CLI utility to find near-duplicate images and remove all but the best copy.
Updated May 29, 2024 - Python
Parallel Patterns Implementation of PARSEC Benchmark Applications
Updated Dec 29, 2021 - C++
BenSP is a suite of parameterizable benchmarks for stream parallelism which is used to evaluate stream processing characteristics.
Updated Aug 3, 2022 - C
Project to take two similar zip files and dedupe files that have the same timestamp in the older file.
Updated Oct 14, 2018 - Python
Distill large-scale web page text.
Updated Jul 29, 2023 - C++
Yet another tool to find and remove duplicate files.
Updated Nov 2, 2023 - Python
Print FastCDC rolling hash chunks and checksums.
Updated Nov 27, 2022 - Python
Find (partial content) duplicate files.
Updated Dec 10, 2022 - Python
A CLI tool for image analysis: checking image integrity, image deduplication, and image retrieval.
Updated Mar 27, 2024 - Rust
Python script to analyze dedup usage in btrfs.
Updated Sep 5, 2019 - Python
Detect and optionally delete duplicate files in a directory tree
Updated Jun 6, 2021 - Go
String deduplication package for Go
Updated Jan 10, 2024 - Go