What types of data can be stored in Azure Blob Storage?

Azure Blob Storage is designed for unstructured data, including text, images, videos, audio files, application backups, and log files. It supports any type of data that doesn't adhere to a specific schema.

What are the different access tiers in Azure Blob Storage?

Azure Blob Storage offers Hot, Cool, and Archive access tiers. Hot is for frequently accessed data, Cool for infrequently accessed data with lower storage costs but higher access costs, and Archive for rarely accessed, long-term data retention at the lowest storage cost but highest access costs and latency.

How does Azure Blob Storage ensure data durability?

It provides several redundancy options, including Locally Redundant Storage (LRS), Zone-Redundant Storage (ZRS), Geo-Redundant Storage (GRS), and Geo-Zone-Redundant Storage (GZRS), to protect data against various types of failures, from local hardware issues to regional disasters.

Can Azure Blob Storage be used for data lakes?

Yes, through its integration with Azure Data Lake Storage Gen2, Azure Blob Storage can function as a data lake. This integration provides a hierarchical namespace for improved compatibility with big data analytics frameworks like Hadoop and Spark.

Are there SDKs available for Azure Blob Storage?

Yes, Azure Blob Storage provides comprehensive SDKs for Python, JavaScript, .NET, Java, Go, C++, and Ruby, allowing developers to programmatically interact with the service.

What is the maximum object size supported by Azure Blob Storage?

Block Blobs support files up to 190.7 TiB, while Page Blobs support up to 8 TiB. Append Blobs are optimized for data appending up to 190.7 TiB.

Azure Blob Storage: Object Storage for Unstructured Data

Q: What is the free tier for Azure Blob Storage?

The free tier includes up to 5 GB of LRS hot blob storage, 20,000 read operations, 10,000 write operations, and 10,000 list operations per month for the first 12 months.

Overview

Azure Blob Storage is a component of Microsoft Azure's storage services, specializing in the storage of unstructured data as objects or 'blobs'. Unstructured data does not adhere to a specific data model or schema, encompassing items such as text files, images, video, audio, and application backups. The service is engineered for scalability, durability, and high availability, making it suitable for a range of applications from enterprise data archiving to serving rich media content for global web applications.

The service offers different blob types, each optimized for specific use cases. Block Blobs are designed for storing digital objects, such as documents, media files, and backups, optimized for parallel uploads and large file sizes up to 190.7 TiB. Page Blobs are engineered for random read/write operations and are primarily used as the backing storage for Azure Virtual Machine disks, supporting files up to 8 TiB. Append Blobs are optimized for append operations, making them ideal for logging scenarios where data is continuously added to the end of a file, such as audit trails or IoT device data streams.

Azure Blob Storage also integrates with Azure Data Lake Storage Gen2, extending its capabilities for big data analytics workloads. This integration provides a hierarchical namespace over Blob Storage, enhancing compatibility with Hadoop Distributed File System (HDFS) and optimizing performance for analytics frameworks like Apache Spark and Hadoop. This makes it a foundational component for constructing data lakes, enabling organizations to ingest, store, and analyze vast quantities of data for insights and machine learning. Its tiered storage options, including Hot, Cool, and Archive, allow organizations to optimize costs based on data access frequency, ensuring that frequently accessed data is readily available while rarely accessed data is stored more economically.

For developers, Azure Blob Storage provides comprehensive SDKs across multiple programming languages, including Python, JavaScript, .NET, Java, Go, C++, and Ruby, facilitating programmatic interaction. This allows for integration into custom applications, automation of storage tasks, and management of data lifecycles. The service is frequently chosen by organizations requiring a highly available and durable object storage solution that integrates with the broader Azure ecosystem for compute, networking, and security services. Its compliance certifications, such as SOC 2 Type II and GDPR, address enterprise security and regulatory requirements.

Key features

Blob Types: Supports Block Blobs for general purpose object storage, Page Blobs for random access patterns (e.g., VM disks), and Append Blobs for logging.
Access Tiers: Offers Hot, Cool, and Archive access tiers to optimize storage costs based on data access frequency. Hot for frequent access, Cool for infrequent, and Archive for long-term data retention with flexible retrieval options.
Data Redundancy: Provides various redundancy options including Locally Redundant Storage (LRS), Zone-Redundant Storage (ZRS), Geo-Redundant Storage (GRS), and Geo-Zone-Redundant Storage (GZRS) to ensure data durability and availability against local failures or regional disasters.
Hierarchical Namespace: Integration with Azure Data Lake Storage Gen2 provides a hierarchical namespace, improving performance and compatibility with big data analytics frameworks.
Security Features: Includes features like Azure Active Directory integration, role-based access control (RBAC), encryption at rest and in transit, shared access signatures (SAS), and Azure Private Link for secure data access.
Lifecycle Management: Policies to automate the transition of data between access tiers and deletion of data based on defined rules (e.g., age of blob).
Event Notifications: Integration with Azure Event Grid to trigger serverless functions or other services in response to blob events (e.g., creation, deletion).
Developer Tooling: Comprehensive SDKs for popular languages, Azure CLI, Azure PowerShell, and a REST API for programmatic access, along with the Azure Portal for GUI-based management.

Pricing

Azure Blob Storage pricing is structured around several factors, including the amount of data stored, the data redundancy option chosen, the data access tier, the number of operations performed, and data transfer out of the Azure region. As of May 2026, the pricing details are available on the Azure Blob Storage pricing page.

Service Component	Description	Example Price (LRS Hot, US East 2)
Storage Capacity	Cost per GB per month for stored data	$0.020 per GB
Data Operations	Cost per 10,000 operations (read, write, list)	Write: $0.002 per 10,000; Read: $0.0004 per 10,000
Data Retrieval (Cool/Archive)	Cost per GB to retrieve data from Cool or Archive tiers	Cool: $0.01 per GB; Archive: $0.02 per GB
Data Transfer Out	Cost per GB for data egress from Azure regions	First 100 GB free; then tiered, e.g., $0.087 per GB

A free tier is available, offering up to 5 GB of LRS hot blob storage, 20,000 read operations, 10,000 write operations, and 10,000 list operations per month for 12 months. This allows developers to experiment with the service before incurring charges.

Common integrations

Azure Virtual Machines: Page Blobs serve as the underlying storage for Azure VM disks, enabling persistent storage for virtualized environments.
Azure Functions: Integrate with Azure Functions to process blob events (e.g., new file uploads triggering serverless code). More details on Azure Functions blob trigger.
Azure Data Factory: Used as a source or sink for data pipelines, enabling data movement and transformation between various data stores. Refer to Azure Data Factory Blob Storage connector documentation.
Azure CDN: Seamlessly integrates with Azure Content Delivery Network (CDN) to cache blob content at edge locations, reducing latency for global users.
Azure Synapse Analytics: Blob storage, particularly with Data Lake Storage Gen2, acts as a data lake for large-scale data warehousing and analytics with Synapse.
Azure Kubernetes Service (AKS): Can be used for persistent storage volumes for containerized applications running on AKS, although Azure Files or Disk Storage are also common.
Third-party Tools: Many third-party data management and analytics tools offer connectors for Azure Blob Storage, allowing for broader ecosystem integration. For example, Apache Kafka can be configured to produce or consume data from Blob Storage through various connectors, enabling robust streaming data architectures as highlighted by Redpanda's S3 connector documentation, which demonstrates similar object storage integration patterns.

Alternatives

Amazon S3: A widely adopted object storage service from AWS, offering similar scalability, durability, and a range of storage classes.
Google Cloud Storage: Google's equivalent object storage solution, providing various storage classes and strong integration with Google Cloud services.
Cloudflare R2: An object storage service that emphasizes zero egress fees, designed for global applications and performance.

Getting started

To get started with Azure Blob Storage, you typically create a storage account, then a container within that account, and finally upload blobs to the container. The following Python example demonstrates how to upload a text file to a blob container using the Azure Blob Storage client library for Python. First, install the SDK:

pip install azure-storage-blob

Then, use the following Python code:

import os
from azure.storage.blob import BlobServiceClient, BlobClient, ContainerClient

def upload_blob_example():
    # Replace with your storage account connection string
    # You can find this in the Azure Portal under your storage account -> Access keys
    connect_str = os.getenv("AZURE_STORAGE_CONNECTION_STRING") 
    if not connect_str:
        raise ValueError("AZURE_STORAGE_CONNECTION_STRING environment variable not set.")

    # Create the BlobServiceClient object
    blob_service_client = BlobServiceClient.from_connection_string(connect_str)

    # Create a unique name for the container
    container_name = "my-container-" + str(uuid.uuid4())

    # Create the container
    container_client = blob_service_client.create_container(container_name)
    print(f"Container '{container_name}' created.")

    # Create a local file to upload
    local_file_name = "hello_world.txt"
    with open(local_file_name, "w") as file:
        file.write("Hello, Azure Blob Storage!")

    # Create a blob client using the local file name as the name for the blob
    blob_client = blob_service_client.get_blob_client(container=container_name, blob=local_file_name)

    # Upload the created file
    with open(file=local_file_name, mode="rb") as data:
        blob_client.upload_blob(data)
    print(f"Uploaded '{local_file_name}' to blob '{local_file_name}'.")

    # Clean up local file
    os.remove(local_file_name)

    # Optional: List blobs in the container
    print("Listing blobs...")
    for blob in container_client.list_blobs():
        print(f"\t{blob.name}")

    # Optional: Delete the container (uncomment to delete after testing)
    # container_client.delete_container()
    # print(f"Container '{container_name}' deleted.")

if __name__ == "__main__":
    import uuid
    upload_blob_example()

Ensure you set the AZURE_STORAGE_CONNECTION_STRING environment variable with your storage account's connection string before running the script. This connection string can be found in the Azure Portal under your storage account's "Access keys" section.

Azure Blob Storage: Object Storage for Unstructured Data

Overview

Key features

Pricing

Common integrations

Alternatives

Getting started

# frequently asked questions

## reviews

## comments

Overview

Key features

Pricing

Common integrations

Alternatives

Getting started

# frequently asked questions

# see also

## reviews

## comments