Search This Blog

March 01, 2021

Azure Data Lake Store vs Azure Blob Storage containers

 

CapabilityAzure Data Lake StoreAzure Blob Storage containers
PurposeOptimized storage for big data analytics workloadsGeneral purpose object store for a wide variety of storage scenarios
Use casesBatch, streaming analytics, and machine learning data such as log files, IoT data, click streams, large datasetsAny type of text or binary data, such as application back end, backup data, media storage for streaming, and general purpose data
StructureHierarchical file systemObject store with flat namespace
AuthenticationBased on Azure Active Directory IdentitiesBased on shared secrets Account Access Keys and Shared Access Signature Keys, and Azure role-based access control (Azure RBAC)
Authentication protocolOAuth 2.0. Calls must contain a valid JWT (JSON web token) issued by Azure Active DirectoryHash-based message authentication code (HMAC). Calls must contain a Base64-encoded SHA-256 hash over a part of the HTTP request.
AuthorizationPOSIX access control lists (ACLs). ACLs based on Azure Active Directory identities can be set file and folder level.For account-level authorization use Account Access Keys. For account, container, or blob authorization use Shared Access Signature Keys.
AuditingAvailable.Available
Encryption at restTransparent, server sideTransparent, server side; Client-side encryption
Developer SDKs.NET, Java, Python, Node.js.NET, Java, Python, Node.js, C++, Ruby
Analytics workload performanceOptimized performance for parallel analytics workloads, High Throughput and IOPSNot optimized for analytics workloads
Size limitsNo limits on account sizes, file sizes or number of filesSpecific limits documented here
Geo-redundancyLocally-redundant (LRS), globally redundant (GRS), read-access globally redundant (RA-GRS), zone-redundant (ZRS).Locally redundant (LRS), globally redundant (GRS), read-access globally redundant (RA-GRS), zone-redundant (ZRS). See here for more information

No comments: