data-deduplication

A JAVA project that splits data using hashing techniques and removes duplicate blocks to save cloud storage. This project also uses the CloudSim framework for cloud storage simulation.

java cloud-storage data-deduplication cloudsim cloudsim-framework

Updated Jan 6, 2021
Java

dpc / rdedup

Star

Data deduplication engine, supporting optional compression and public key encryption.

backup encryption data-deduplication deduplication

Updated Aug 25, 2022
Rust

Anveshika06 / VIT-VTAS-TY-2022

Star

data-deduplication hashing-algorithm

Updated Jan 7, 2023
Python

jchristn / WatsonDedupe

Sponsor

Star

Self-contained C# library for data deduplication using Sqlite

compression storage nuget dedupe sqlite-database data-deduplication chunk compress deduplication chunk-data duplicate-data chunk-key

Updated Apr 7, 2023
C#

Zabuzard / FastCDC4J

Star

Fast and efficient content-defined chunking for data deduplication. Java implementation of FastCDC as library.

java library data-deduplication chunking cdc fastcdc content-defined-chunking

Updated Sep 21, 2023
Java

Jim-JMCD / Data_storage_network_deduplication_calculator

Star

A calculator for storage and transmission of deduplicated data presentation in charts and tables

data-deduplication deduplication deduplication-calculator storage-deduplication-calculator network-deduplication-calculator

Updated Sep 26, 2023

bevry / fellow

Star

Fellow is a package for creating people that can be unified by their shared values via a singleton list on the class

nodejs model data-deduplication client-side

Updated Jan 7, 2024
TypeScript

gagan3012 / PolyDeDupe

Sponsor

Star

PolyDeDupe: Multi-Lingual Data Deduplication

multilingual nlp data-deduplication

Updated May 27, 2024
Python

sail-sg / sailcraft

Star

Data Toolkit for Sailor Language Models

data-deduplication data-cleaning

Updated May 15, 2024
Python

Improve this page

Add a description, image, and links to the data-deduplication topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-deduplication topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data-deduplication

Here are 13 public repositories matching this topic...

baraverkstad / mixtape

imehar / data-deduplication

david-siqi-liu / sparklyclean

bmiller1009 / deduper

shubham-thakare / data-deduplication

dpc / rdedup

Anveshika06 / VIT-VTAS-TY-2022

jchristn / WatsonDedupe

Zabuzard / FastCDC4J

Jim-JMCD / Data_storage_network_deduplication_calculator

bevry / fellow

gagan3012 / PolyDeDupe

sail-sg / sailcraft

Improve this page

Add this topic to your repo