Compression techniques for distributed data

JB Khan, SN Trika - US Patent App. 16/293,540, 2019 - Google Patents
In one example, uncompressed data is compressed and divided into chunks. Each chunk of
the compressed data stream is combined with state information to enable each chunk to be …

Deduplication and compression of data segments in a data storage system

J Swift - US Patent 10,678,435, 2020 - Google Patents
Techniques for performing data deduplication and compression in data storage systems.
Data deduplication is performed in a deduplication domain on a segment-by-segment basis …

Concurrently performing normal system operations and garbage collection

T Truong, M Arevalo, P Shilane, KR Lu… - US Patent …, 2022 - Google Patents
Systems and methods enabling garbage collection opera tions and normal system
operations concurrently. Concur rent operations are performed by configuring a similarity …

Removal of reference information for storage blocks in a deduplication system

L Aronovich, A Kredi - US Patent 10,248,656, 2019 - Google Patents
Various embodiments for managing data in a data storage having data deduplication. For a
back reference data structure incorporating reference information for at least one user data …

Scalable garbage collection for deduplicated storage

P Shilane, K Lu, J Brandt, N Noto, T Truong… - US Patent …, 2021 - Google Patents
Abstract Systems and methods for cleaning a storage system. A deduplicated storage
system is cleaned by identifying structures that include dead or unreferenced segments. This …

Method and system for dynamic compression module selection

GR Wallace, PN Shilane, F Douglis, J Luo - US Patent 9,843,802, 2017 - Google Patents
(57) ABSTRACT A computer-implemented method for compressing a data set, the method
comprising receiving a first data block of the data set, selecting automatically by a …

Estimating worker nodes needed for performing garbage collection operations

NA Noto, M Arevalo, P Shilane, JS Brandt - US Patent 10,872,037, 2020 - Google Patents
The number of workers can be determined based on the impacted similarity groups. More
specifically, the number of impacted similarity groups and/or workers can be evaluated in …

Conversion of forms of user data segment IDs in a deduplication system

L Aronovich - US Patent 9,965,487, 2018 - Google Patents
Various embodiments for managing data in a data storage having data deduplication. For a
back reference data structure incorporating reference information for at least one user data …

System and method for efficiently measuring physical space for an ad-hoc subset of files in protection storage filesystem with stream segmentation and data …

G Menezes, F Botelho, A Reza - US Patent 10,303,662, 2019 - Google Patents
In one example, a method for processing data includes receiving information that identifies
an ad-hoc group of size'n'of files F... Fm, each file F including a respective segment set S …

Marking impacted similarity groups in garbage collection operations in deduplicated storage systems

KR Lu, JS Brandt, NA Noto, T Truong… - US Patent …, 2022 - Google Patents
Abstract Systems and methods for marking similarity groups impacted by a garbage
collection operation are disclosed. Similarity groups are used to identify segments …