Repository contains semantic equivalence models for duplicate detection.
Updated Mar 30, 2017 - Jupyter Notebook
Identifies and lists files with the same name in the given path.
Command Line Interface for deplicate
shazer - find and hardlink duplicate files based on checksums
Securely wipe files or folders and clean duplicated files
A machine learning model that identifies variations in names, resolves them to a unique person, and thereby deduplicates records coming from multiple sources
Detecting near-duplicate videos by aggregating features from intermediate CNN layers
This is an Android application that lists all duplicate numbers saved under different names in your phonebook.
🚀 Cloc & duplicate code checker tool
Data mining on Stack Overflow Q/A data to understand the landscape of languages and developers in computer science
Detect duplicate images and view the distinct ones in a web app
Random experiments for cg-autofix
Solution to different algorithm problems
Python script that tackles the problem of Duplicate Product Detection!
👯 Find similar objects and partial duplicates in collections
Delete duplicate copies of files in a directory
Compares two strings using the Levenshtein distance metric and weighted string decomposition. Returns the similarity of the two strings as a percentage.
Command line utility to remove exact duplicate files.
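Several of the tools listed above find exact duplicate files by comparing content checksums. The following is a minimal Python sketch of that general idea, not the implementation of any listed project; the function names `file_digest` and `find_duplicates` are hypothetical, and the choice of SHA-256 is an assumption.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Group regular files under `root` by content digest.

    Returns a list of groups; each group is a list of paths whose
    contents hash to the same digest (two or more files per group).
    """
    by_digest = defaultdict(list)
    for path in sorted(Path(root).rglob("*")):
        # Skip directories and symlinks; only hash regular files.
        if path.is_file() and not path.is_symlink():
            by_digest[file_digest(path)].append(path)
    return [paths for paths in by_digest.values() if len(paths) > 1]
```

A tool that deletes or hardlinks duplicates would typically keep one path from each returned group and act on the rest. A careful implementation might also compare file sizes first to avoid hashing files that cannot match.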