
【Storage Feature】Smart Data Deduplication Feature for Block-SmartDedupe

Latest reply: Apr 21, 2022 14:32:10


(Participants need to submit their answers in the required comments of each post; replies to the comments will be hidden during the activity.)

1. Definition


SmartDedupe and SmartCompression are features developed by Huawei that provide smart data deduplication and data compression.

SmartDedupe is a data downsizing technology that deletes duplicate data blocks in a storage system to save physical storage capacity, meeting growing data storage needs.


SmartCompression reorganizes data to reduce storage space consumption and improve the data transfer, processing, and storage efficiency without any data loss. 


The OceanStor V5 series storage system supports inline deduplication and compression. That is, only data written after SmartDedupe and SmartCompression are enabled is deduplicated and compressed.

When both SmartDedupe and SmartCompression are enabled, deduplication is performed before compression.
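The ordering can be sketched in Python as follows. This is only an illustration: SHA-256 and zlib stand in for the hash and compression algorithms, which the post does not name, and `write_blocks` is a hypothetical helper, not a Huawei API.

```python
import hashlib
import zlib

def write_blocks(blocks):
    """Deduplicate first, then compress only the blocks that survive."""
    stored = {}  # fingerprint -> compressed payload (hypothetical layout)
    for block in blocks:
        fp = hashlib.sha256(block).digest()  # stand-in fingerprint (assumption)
        if fp in stored:
            continue  # duplicate: dropped before compression ever runs
        # Unique block: compress it, then keep it.
        # (The byte-by-byte check is omitted here for brevity.)
        stored[fp] = zlib.compress(block)
    return stored

blocks = [b"A" * 4096, b"B" * 4096, b"A" * 4096]
stored = write_blocks(blocks)
print(len(blocks), "blocks written,", len(stored), "compressed and stored")
# prints: 3 blocks written, 2 compressed and stored
```

Because duplicates are removed first, the compression step in this sketch never spends work on a block that would be discarded anyway.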



2. Basic Concepts

Deduplication data block size: This determines deduplication granularity and is identical to the block size of a thin LUN. To set the block size of a thin LUN, run the create lun command on the command line interface (CLI) and set the grain_size parameter.

Compression data block size: This determines compression granularity. The storage system compresses data intelligently using the block size of newly written data in LUNs.

Fixed-length deduplication: The storage system deduplicates the data written into a LUN by the specified deduplication data block size.

Hash algorithm: This verifies the consistency of data blocks. It computes the fingerprint of a data block, which is a unique binary number with a fixed length. If the fingerprints of two data blocks are the same, the storage system considers them duplicates.

Byte-by-byte comparison policy: This is complementary to the hash algorithm. If two data blocks are found to have the same fingerprint, the storage system compares the data blocks byte by byte, ensuring that the deduplication is secure.

Deduplication metadata: This stores deduplication information, such as the fingerprints and storage locations of deduplicated data.
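As a rough illustration of how these concepts fit together, here is a minimal Python sketch. It assumes SHA-256 as the hash algorithm (the post does not say which one the storage system uses), and `split_fixed`, `fingerprint`, `is_duplicate`, and `BLOCK_SIZE` are hypothetical names chosen for the example.

```python
import hashlib

def split_fixed(data: bytes, block_size: int) -> list[bytes]:
    """Fixed-length deduplication: cut the data into block_size pieces."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]

def fingerprint(block: bytes) -> bytes:
    """Fixed-length fingerprint of a block (SHA-256 is an assumption)."""
    return hashlib.sha256(block).digest()

def is_duplicate(new_block: bytes, old_block: bytes) -> bool:
    """Compare fingerprints first, then confirm byte by byte."""
    if fingerprint(new_block) != fingerprint(old_block):
        return False
    return new_block == old_block  # guards against a hash collision

BLOCK_SIZE = 8  # tiny illustrative value, not a real grain_size
blocks = split_fixed(b"A" * 8 + b"B" * 8 + b"A" * 8, BLOCK_SIZE)
print(len(blocks))                         # 3
print(is_duplicate(blocks[0], blocks[2]))  # True
print(is_duplicate(blocks[0], blocks[1]))  # False
```

The byte-by-byte step matters only when two different blocks happen to share a fingerprint; in that rare case the comparison keeps them from being merged.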

3. Working Principle

Deduplication

After SmartDedupe is enabled for a LUN, the OceanStor storage system uses the hash algorithm to calculate the fingerprint of each new data block. It then compares this fingerprint with those of the existing data blocks in the LUN. If the new fingerprint is identical to an existing one, the new data block is deleted and its storage location is registered as that of the existing data block. If the fingerprint is unique, the new data block is written to disks.
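The write path described above can be modeled with a toy Python class. This is a sketch under stated assumptions, not the real implementation: `DedupLun` and its fields are hypothetical, SHA-256 stands in for the unnamed hash algorithm, a Python list stands in for the disks, and overwrites and space reclamation are ignored.

```python
import hashlib

class DedupLun:
    """Toy model of a LUN with inline deduplication (hypothetical layout)."""

    def __init__(self):
        self.physical = []  # stored unique blocks; list index = storage location
        self.fp_index = {}  # fingerprint -> storage location (dedup metadata)
        self.mapping = {}   # logical block address -> storage location

    def write(self, lba: int, block: bytes) -> bool:
        """Write one block; return True if it was deduplicated."""
        fp = hashlib.sha256(block).digest()  # assumed hash algorithm
        loc = self.fp_index.get(fp)
        # Same fingerprint and same bytes: point at the existing copy.
        if loc is not None and self.physical[loc] == block:
            self.mapping[lba] = loc
            return True
        # Unique block (or a hash collision): store the new data.
        self.physical.append(block)
        loc = len(self.physical) - 1
        self.fp_index.setdefault(fp, loc)
        self.mapping[lba] = loc
        return False

    def read(self, lba: int) -> bytes:
        """Return the block that the logical address currently points at."""
        return self.physical[self.mapping[lba]]

lun = DedupLun()
results = [lun.write(0, b"x" * 4096),
           lun.write(1, b"y" * 4096),
           lun.write(2, b"x" * 4096)]
print(results)            # [False, False, True]
print(len(lun.physical))  # 2
```

Three logical blocks are written, but only two are stored; the third write only adds a mapping entry that points at the first block.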

If neither SmartDedupe nor SmartCompression is enabled when you create a LUN, you cannot enable them after the LUN is created.

Figure 1-1 shows the deduplication process.

Figure 1-1 Deduplication process


Figure 1-2 shows the process of writing data to a storage system with SmartDedupe enabled and with it disabled. In this example, data blocks B and C have identical fingerprints, while data block D has a unique fingerprint.

Figure 1-2 Effects of using SmartDedupe






BAZ
MVE Author Created Jul 25, 2020 13:33:29
