Cloud Reference

Microsoft Azure — Services Documentation

A comprehensive, organized reference for Microsoft's cloud platform — every major service category covered, with a deep focus on Azure AI, including Azure OpenAI Service, AI Foundry, AI Search, Machine Learning, and the full set of pre-built AI services.

200+
Services
60+
Regions worldwide
300+
Datacenters
95%
Fortune 500 use Azure

01 Overview

Azure is Microsoft's public cloud platform — a global network of datacenters offering compute, storage, networking, databases, analytics, AI, and SaaS that you consume on-demand and pay for as you use.

Azure organizes its 200+ services into a handful of well-defined categories: Compute, Containers, Networking, Storage, Databases, Analytics, AI & ML, Integration, Identity & Security, DevOps, IoT, Hybrid, Management, Web, Mobile, Migration, Media, and Quantum/HPC. This document walks each category in turn and surfaces the practical "what to use when."

Azure portal, CLI, PowerShell, SDKs, and ARM/Bicep all hit the same Resource Manager API.
Whatever you do in the portal can be scripted, templated, or version-controlled. Treat the portal as a discovery tool and Bicep / Terraform as your source of truth.

02 Foundations

Five concepts you must understand before placing anything in Azure: tenant, subscription, management group, resource group, and resource.

ConceptWhat it isWhy it matters
Microsoft Entra tenantThe identity boundary — your directory of users, groups, apps.One tenant per organization; everything authenticates against it.
SubscriptionThe billing & quota boundary that holds resources.Common pattern: separate subscriptions per environment (dev/test/prod) or business unit.
Management groupContainer for multiple subscriptions, used to apply policy and access at scale.Apply Azure Policy and RBAC once at a management group → inherits to all child subs.
Resource groupLogical container for resources sharing a lifecycle.Delete the RG → delete everything in it. Tag at the RG for cost rollups.
ResourceAny individually-managed object — a VM, a storage account, a database, a key vault.Every resource has a globally-unique resourceId; the API surface is uniform.

Azure Resource Manager (ARM)

The control plane. Every Azure operation — create, read, update, delete, tag, lock, role-assign — flows through ARM. Templates come in two flavors:

03 Global infrastructure

Azure runs from 60+ regions in 140 countries — more than any other major cloud. Where you deploy affects latency, residency, redundancy, price, and which services are available.

Region

A geography-bound set of datacenters (e.g. eastus, westeurope). Most resources are regional.

Availability Zone

Physically separate datacenters within a region with independent power, cooling, network. Use 3 zones for 99.99%+ SLAs.

Region pair

Geographically distant pair (e.g. East US ↔ West US) for cross-region replication and DR.

Edge zones

Smaller deployments at metropolitan or carrier edge for ultra-low-latency workloads (5G, real-time).

Sovereign clouds

Azure Government (US), Azure China (operated by 21Vianet), Azure for European Union Data Boundary.

Local zones / private MEC

For telcos and regulated industries that need Azure services on-premises or at the edge.

04 Compute

Run code on Azure. The choice spans raw VMs through fully managed serverless functions — pick the lowest-management option that satisfies your constraints.

Container Apps

Serverless containers built on Kubernetes + Dapr + KEDA. Scale to zero; ideal for microservices, event processing, and AI agents.

ServerlessContainers
Container Instances (ACI)

Run a single container without orchestration. Good for short-lived jobs and bursty workloads.

Serverless
Service Fabric

Microsoft's microservices/orchestration platform predating AKS. Still used for high-density stateful services.

PaaS
Batch

Large-scale parallel and HPC workloads — render farms, financial simulations, scientific computing.

HPC
Spring Apps

Managed Spring Boot platform built with VMware. Eliminates Spring infrastructure ops.

PaaSJava
Azure VMware Solution

Run native VMware (vSphere, vSAN, NSX) on Azure-managed bare metal — lift-and-shift VMware estates.

IaaS

VM family quick reference

SeriesTypeBest for
BBurstableDev/test, low-traffic web servers, build agents.
D / Dv5 / Dasv5General purposeMost production workloads — balanced CPU/memory.
F / Fasv6Compute-optimizedWeb tier, batch, analytics, gaming.
E / Easv5Memory-optimizedSQL/NoSQL DBs, in-memory caches, large analytics.
LStorage-optimizedNoSQL, big-data, transactional DBs needing local NVMe.
N (NC, ND, NV)GPUNC: ML training. ND: large model training (H100/H200). NV: visualization/inference.
HB / HCHPCMPI workloads, fluid dynamics, weather, molecular dynamics.
MMemory + CPU extremeSAP HANA, very large in-memory DBs.

05 Containers & Kubernetes

Five surfaces for running containers, ordered from "I just want a container to run" to "I need full Kubernetes control."

Container Instances (ACI)

Single containers / container groups, billed per second. Good for batch jobs, CI runners, ephemeral compute.

Container Registry (ACR)

Private Docker / OCI registry with geo-replication, content trust, image scanning, and tasks (build/patch on Azure).

Azure Red Hat OpenShift (ARO)

Jointly engineered with Red Hat — full OpenShift platform managed by both companies.

Service Fabric Managed Clusters

Reliable services + actor model. Microsoft's older microservices platform; powers many internal Azure services.

06 Networking

Azure networking is layered: a virtual network gives you private IP space; load balancers distribute traffic; gateways connect on-prem; CDN/Front Door accelerates global delivery.

CDN

Content delivery via Microsoft, Akamai, or Verizon (sunset). Static asset caching at the edge.

Traffic Manager

DNS-based global load balancing — geographic, weighted, performance-based, priority routing.

VPN Gateway

Site-to-site IPsec VPN; point-to-site for users. Up to 10 Gbps with VpnGw5 SKU.

ExpressRoute

Private dedicated circuit from your DC to Azure — bypasses the public internet. 50 Mbps to 100 Gbps.

Virtual WAN

Hub-and-spoke at global scale — unifies VPN, ExpressRoute, SD-WAN partners, secure hub with Firewall.

Azure Firewall

Stateful, fully managed network firewall. Premium tier adds TLS inspection and IDPS.

Network Security Groups (NSG)

Stateful packet filtering at the subnet or NIC level. Zero-cost, foundational segmentation.

DDoS Protection

Always-on, ML-driven volumetric and protocol attack mitigation. IP Protection (per-resource) and Network Protection (per-VNet) tiers.

Web Application Firewall

OWASP top-10 protection on Application Gateway and Front Door. Custom rules, bot protection, geo-filtering.

Bastion

SSH/RDP into VMs through the Azure portal — no public IPs required. Premium tier supports private connectivity.

Private Link / Private Endpoint

Reach Azure PaaS (Storage, SQL, Cosmos, Cognitive…) over a private IP in your VNet. The default for production.

NAT Gateway

Outbound-only NAT for VNets — large port pools, predictable SNAT exhaustion behavior.

Route Server

Dynamic routing (BGP) between Azure VNets and your network virtual appliances or ExpressRoute.

Network Watcher

Diagnostics: connection troubleshoot, packet capture, NSG flow logs, topology, IP flow verify.

07 Storage

Object, file, block, and queue storage — durable, replicated, and accessible globally.

Disks

Persistent block storage for VMs. Ultra Disk, Premium SSD v2, Premium SSD, Standard SSD, Standard HDD — pick by IOPS and throughput.

Queues

Simple FIFO message queue (REST). Use Service Bus for advanced messaging.

Tables

NoSQL key-value / wide-column. Cosmos DB Table API supersedes for new workloads.

NetApp Files

Enterprise NAS service powered by NetApp ONTAP — for SAP, EDA, HPC, demanding shared file workloads.

HPC Cache / Avere vFXT

File cache for HPC workloads pulling from on-prem NFS or blob.

Backup

Backup-as-a-service for VMs, SQL, SAP HANA, Files, Blobs. Cross-region, immutable vaults, soft-delete.

Site Recovery

DR orchestration — replicate VMs to another region or to Azure from on-prem. Failover testing without disruption.

StorSimple / Azure Storage Mover

Hybrid cloud storage and migration tooling for moving file shares to Azure.

Data Box

Physical appliances (Disk, Heavy, Box) for offline transfer of TB–PB-scale data into or out of Azure.

Replication options

SKUReplicasWhereDurability
LRS3 copiesSingle datacenter11 nines (annual)
ZRS3 copies3 availability zones in one region12 nines
GRS6 copiesPrimary region (LRS) + paired region (LRS)16 nines
GZRS6 copiesPrimary (ZRS) + paired (LRS)16 nines
RA-GRS / RA-GZRS6 copies+ read access to secondary16 nines

08 Databases

Managed relational and NoSQL databases — pick by workload, consistency model, and how much control you need.

SQL Server on Azure VMs

Bring your own SQL Server license; full OS control. Best when you need OS-level customization.

Database for MySQL — Flexible Server

Managed MySQL with HA, replicas, backups. Choose burstable / general / business-critical.

Database for MariaDB

Managed MariaDB. Note: in retirement — new workloads should use MySQL Flexible Server.

Cache for Redis

Managed Redis OSS and Redis Enterprise tiers (active geo-replication, modules, Redis on Flash, Search).

Database for Apache Cassandra (Managed Instance)

Managed Cassandra clusters; can hybrid-mesh with on-prem Cassandra.

Database Migration Service

Online migrations from SQL Server, Oracle, MySQL, Postgres, MongoDB to Azure equivalents.

SQL Edge

Lightweight SQL engine for IoT and edge — ARM and x64; sync with Azure SQL.

09 Analytics & big data

From data ingestion through transformation, warehousing, and BI. Microsoft Fabric is the umbrella for the modern analytics stack; the underlying services remain available individually.

Stream Analytics

SQL-based stream processing on Event Hubs / IoT Hub / Blob. Sub-second to second latency.

Event Hubs

Hyper-scale event ingestion (millions of events/sec). Kafka-protocol compatible. Capture to blob/ADLS for cold storage.

HDInsight

Managed open-source analytics: Hadoop, Spark, Kafka, HBase, Hive LLAP. Mature; new workloads usually go to Synapse / Fabric / Databricks.

Data Explorer (Kusto / ADX)

Time-series & telemetry analytics with KQL — billions of events/day, sub-second queries. Powers Log Analytics, Sentinel, App Insights.

Purview

Unified data governance — automatic scanning, classification, lineage, sensitivity labels across Azure, M365, on-prem, AWS, GCP.

Power BI

BI and analytics. Pro / Premium per user / Premium capacity / Embedded. Native semantic models, paginated reports, paginated reports.

Analysis Services

Tabular semantic models — being superseded by Power BI Premium / Fabric semantic models.

Data Share

Share datasets across orgs and tenants without copying.

10 AI services

Pre-built, REST-callable AI capabilities across vision, speech, language, and decision — what was branded "Cognitive Services" is now grouped as Azure AI Services. Most are accessed through a single multi-service endpoint or their dedicated resources.

Vision

Custom Vision

Train custom image classifiers and object detectors with a few hundred images. Export to ONNX, CoreML, TensorFlow.js for edge.

Face

Detection, verification, identification. Identification gated to approved customers for responsible-AI reasons.

Video Indexer (AI Video Indexer)

Extract metadata from video: speakers, faces, sentiment, OCR, topics, scenes, audio events.

Content Understanding

Newer multimodal extraction service that understands documents, audio, video, and images with natural-language schemas.

Speech

Speech Translation

Real-time translation between speech and speech / speech and text.

Speaker Recognition

Verify or identify a speaker by their voice (verification GA, identification gated).

Pronunciation Assessment

Score pronunciation quality of spoken words — for language learning apps.

Language

Language Understanding (CLU)

Conversational language understanding — intents, entities, slot filling. Successor to LUIS.

Question Answering

Custom Q&A from URLs/PDFs/Markdown. Successor to QnA Maker.

Immersive Reader

Embeddable reading aid — read-aloud, picture dictionary, line focus, syllabification — for accessibility and learning.

Decision & safety

Personalizer

Reinforcement-learning recommender — being retired in 2026; replace with custom RAG / RL approaches.

Anomaly Detector

Univariate and multivariate anomaly detection — being retired in 2026; replace with Stream Analytics ML or Azure ML.

Metrics Advisor

Time-series anomaly detection with root-cause hints — also being retired; migrate to AI Foundry / custom models.

Multi-service vs single-service resources.
Create a single Azure AI Services multi-service resource and call any of Vision, Language, Speech, Translator, Document Intelligence, and Content Safety from one endpoint and key. Use single-service resources when you need separate billing or stricter access boundaries.

11 Azure OpenAI & AI Foundry

Generative AI on Azure runs through two complementary surfaces: the Azure OpenAI Service (managed access to OpenAI models in your tenant) and Azure AI Foundry (the unified portal, SDK, and runtime for building AI apps and agents on top of those models).

Models as a Service (MaaS)

Pay-per-token serverless endpoints for non-OpenAI models — Llama, Mistral, Phi, Cohere, AI21, Mistral Large — without provisioning compute.

Provisioned Throughput Units (PTUs)

Reserved throughput for OpenAI models — guaranteed latency and capacity at predictable cost. Best for high-volume, latency-sensitive production.

Batch deployments

Submit large jobs at ~50% discount with 24-hour SLA. Ideal for offline embedding, classification, summarization at corpus scale.

Fine-tuning

SFT, DPO, and reinforcement fine-tuning on supported models. Adapter weights live in your tenant.

Prompt Shields & Content Filters

Layered defenses — input prompt-injection detection, output content filtering, groundedness checks, protected-material detection — applied to every request unless explicitly opted out.

AI red teaming agent

Automated adversarial probing of your AI app — generates jailbreaks, indirect injections, harmful inputs and reports failures.

Evaluation in Foundry

Built-in evaluators (groundedness, relevance, similarity, fluency, coherence, safety, custom) plus your own evaluators run in CI against datasets.

Working with Azure OpenAI

# Python — chat completion through Azure OpenAI
from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint="https://my-resource.openai.azure.com",
    api_key=os.environ["AOAI_KEY"],
    api_version="2025-04-01-preview",
)

resp = client.chat.completions.create(
    model="gpt-4o",           # your deployment name
    messages=[
        {"role": "system", "content": "You are a concise assistant."},
        {"role": "user", "content": "Summarise the EU AI Act in two sentences."},
    ],
    max_tokens=200,
)
print(resp.choices[0].message.content)

Model catalog highlights

FamilyExamplesUse for
OpenAI flagshipgpt-5, gpt-4o, gpt-4o-miniGeneral chat, RAG, agents, multimodal.
OpenAI reasoningo3, o4-miniComplex reasoning, math, code, analysis where latency budget allows.
OpenAI imagegpt-image-1, dall-e-3Image generation and editing.
OpenAI audiowhisper, gpt-4o-realtime, ttsSpeech-to-text, real-time voice agents, text-to-speech.
OpenAI embeddingstext-embedding-3-large / -small, text-embedding-ada-002RAG vectorization, clustering, classification.
Microsoftphi-4, phi-4-mini, phi-4-visionSmall, capable models — on-device and edge.
Metallama-4, llama-3.3-70bOpen-weight LLMs, fine-tuning, custom hosting.
Mistralmistral-large-2, codestralEuropean-built reasoning + code.
Coherecommand-r-plus, embed-v3, rerank-v3RAG-tuned generation, embeddings, rerankers.
NVIDIANIM containers, NeMo RetrieverGPU-optimized inference and RAG components.
Hugging Face20K+ modelsLong tail of open models with one-click managed deploy.

12 Machine learning

Custom ML — train, deploy, monitor your own models. Azure Machine Learning is the workspace; AI Foundry is the GenAI-first overlay built on top.

AutoML

Automated training and selection over classification, regression, forecasting, NLP, image, recommendation. Generates explainable code.

Designer

Drag-and-drop pipeline canvas for traditional ML. Useful for citizen data scientists and rapid prototyping.

Managed online endpoints

Production model serving with blue/green deployment, autoscale, A/B traffic split, App Insights tracing.

Batch endpoints

Async scoring of large datasets — pipeline-based, ideal for nightly inference.

Feature store (managed)

Feature definitions, materialization to online (Redis) and offline (ADLS) stores, point-in-time-correct training data.

Responsible AI dashboard

Explainability (SHAP), fairness, error analysis, counterfactuals, causal — single dashboard per model.

Prompt Flow

Visual + code authoring of LLM workflows; runs in AML and Foundry. Ideal bridge between research and production.

Model registry & lineage

Versioned models with MLflow integration; track training data, code, environment, metrics for every version.

14 Integration & messaging

Glue between systems. Logic Apps for workflows, API Management for the API edge, Service Bus / Event Grid / Event Hubs for the messaging spine.

Data Factory pipelines

(Listed under analytics — also serves as an integration tool for ETL between systems.)

Azure Relay

Hybrid Connections + WCF Relay — securely expose on-prem services to the cloud without firewall changes.

SignalR Service

Managed WebSocket service for real-time apps — chat, dashboards, collaborative editing.

Web PubSub

Cloud-native pub/sub over WebSockets, simpler than SignalR for greenfield real-time use cases.

15 Identity & security

Security in Azure starts with identity (Microsoft Entra) and layers on Defender for runtime protection and Sentinel for SIEM/SOAR.

Entra ID Governance

Entitlement management, access reviews, lifecycle workflows, PIM (Privileged Identity Management).

Entra External ID

Customer identity (B2C / CIAM) — sign-up flows, social IDs, custom branding. Replaces legacy Azure AD B2C.

Entra Verified ID

Decentralized identity — issue and verify W3C verifiable credentials (degrees, employment, KYC).

Entra Permissions Management

CIEM — least-privilege analysis across Azure, AWS, GCP. Right-size identities and clean up zombie permissions.

Managed Identities

Identities you give to Azure resources — no secrets in code; tokens injected by IMDS. System-assigned vs user-assigned.

Microsoft Purview Information Protection

Sensitivity labels, DLP, encryption, double-key encryption — apply to documents, emails, datasets across M365 and Azure.

Confidential Computing

VMs (DCsv3, DCasv5) and containers running in Trusted Execution Environments — Intel SGX, AMD SEV-SNP. Data encrypted in use.

Attestation

Verify the integrity of a TEE before releasing secrets to it. Used by confidential computing and confidential ledger.

Confidential Ledger

Tamper-evident, blockchain-backed log for high-value records — built on confidential computing.

16 DevOps

Azure DevOps and GitHub are both Microsoft properties. Most new projects standardize on GitHub + GitHub Actions + Azure; Azure DevOps remains for existing investments and government clouds.

GitHub Actions for Azure

First-party actions for OIDC federated login, deployment to App Service / Functions / AKS / Container Apps, Bicep deploy.

Bicep

Microsoft's IaC DSL that compiles to ARM JSON. First-class linter, modules, what-if, decompile from JSON.

ARM templates

Original declarative deployment language. Verbose; usually compiled from Bicep today.

Azure Deployment Environments

Self-service environments for dev teams — IaC templates, governed catalogs, costs by environment.

Azure Load Testing

Managed load testing built on Apache JMeter; integrates with App Insights for end-to-end correlation.

Azure Chaos Studio

Chaos engineering — inject faults (CPU pressure, network loss, AKS pod kill, region failure simulation) to validate resilience.

Azure Pipelines self-hosted agents

Run CI on your own VMs or AKS — for IP egress control, custom tools, GPU build steps.

17 IoT & edge

Connect, manage, and analyze devices and the data they produce — from low-power sensors to industrial PLCs.

IoT Hub

Bidirectional cloud gateway for millions of devices — device twins, direct methods, file upload, message routing. MQTT, AMQP, HTTPS.

IoT Central

App platform for IoT — UI, dashboards, rules, device templates. Lower-effort path than IoT Hub for standard scenarios.

Digital Twins

Graph model of physical environments (buildings, factories, cities). DTDL ontology, real-time queries.

IoT Edge

Container runtime that ships cloud workloads (Functions, ML models, Stream Analytics) to edge devices. Now largely subsumed by Azure IoT Operations / Arc.

Azure Sphere

Secure MCU + OS + cloud security service for greenfield secure IoT devices. End-of-support announced; see Azure IoT Operations roadmap.

Azure Maps

Location services — base maps, search, routing, traffic, geofencing, weather, mobility, indoor maps.

18 Hybrid & multi-cloud

Run Azure services on-premises, at the edge, or in other clouds — extend the management plane outward.

Azure Stack HCI

On-prem hyperconverged cluster validated by Microsoft + OEMs. Runs Azure services, AKS, AVD on your hardware.

Azure Local

2024 evolution of Stack HCI — distributed, Arc-managed Azure infrastructure for the edge.

Azure Stack Edge

Pre-built appliance shipped by Microsoft — local compute + storage + GPU + accelerators, managed from Azure portal.

Azure Stack Hub

Full Azure consistent cloud in your datacenter — disconnected operation supported. Niche but mature.

Arc-enabled data services

SQL MI and Postgres Hyperscale running on any Kubernetes — billed and managed through Azure.

19 Management & governance

Standards, policy, cost control, and operational tooling that apply across every service.

Azure Blueprints (deprecating)

Composable templates of policy + RBAC + ARM. Being replaced by Bicep + Deployment Stacks.

Deployment Stacks

Group resources to deploy / update / delete as a unit; protect from accidental deletion.

Azure Advisor

Personalized recommendations across cost, security, reliability, performance, operational excellence.

Azure Lighthouse

Cross-tenant management for MSPs and large enterprises — see and operate customer / subsidiary tenants from one portal.

Automation

Runbooks (PowerShell, Python), state config (DSC), update management (now Update Manager), shared modules.

Update Manager

Patch orchestration for Azure VMs, Arc-enabled servers, on-prem.

20 Monitoring & observability

One platform — Azure Monitor — with several feature surfaces stacked on top of a shared metrics, logs, and traces store.

Log Analytics (workspace)

Storage + KQL query engine for logs and traces. Single workspace pattern is recommended for most orgs.

Container Insights

Container and AKS observability — node, pod, deployment metrics; Live Data tail; Prometheus-managed scrape.

VM Insights

Per-VM CPU/memory/disk + dependency mapping using Azure Monitor Agent.

Network Insights

Topology, NSG flow, connection monitor, ExpressRoute monitoring.

Azure Managed Grafana

Managed Grafana with Entra SSO and built-in datasources for Monitor, Prometheus, ADX.

Managed Prometheus

Fully-managed PromQL-compatible metrics service — auto-scrape AKS, Arc clusters, public endpoints.

Workbooks & Dashboards

Curated views of metrics, logs, and KQL — share within team or pin to portal.

21 Web & mobile

Build and host web and mobile front-ends — from static sites to push-notification fan-out.

Communication Services

Voice, video, SMS, email, chat APIs powered by the Teams platform — bring multi-modal communication into your apps.

Notification Hubs

Cross-platform push notifications (APNs, FCM, WNS) at very large scale.

SignalR Service

WebSocket-based real-time messaging for web/mobile clients.

Azure Maps

Maps SDK for web, Android, iOS — routing, search, traffic, indoor.

Visual Studio App Center

Mobile DevOps — build, test, distribute, crash analytics, push for iOS/Android. Sunsetting in 2025; migrate to GitHub Actions + Codemagic / Bitrise.

22 Migration & backup

Move existing workloads to Azure and protect them once they arrive.

Azure Backup

Backup for VMs, SQL/SAP HANA in VMs, Files, Blobs, Azure Database for Postgres. Vaulted-only mode for ransomware protection.

Site Recovery

DR — replicate VMs to another region, fail over with one click, fail back when source recovers.

Database Migration Service

Online and offline DB migrations to managed Azure databases (SQL, Postgres, MySQL, MongoDB).

Data Box

Physical disks / appliances for offline bulk data transfer.

Azure Storage Mover

Migrate file shares from on-prem NAS to Azure Files / Blob.

23 Media & communication

Note: Azure Media Services retired June 2024. Replacements live in Communication Services and AI Services.

Communication Services — Calling & Video

WebRTC voice and video built on the Teams stack. Embed into apps; recording, captioning, live events.

Azure AI Video Indexer

(Listed under AI services.) Replaces Media Services for transcription, scene detection, OCR on video.

Azure Object Anchors / Remote Rendering

Mixed Reality services — ARM is winding down legacy mixed-reality offerings; check current availability.

24 Quantum & HPC

Frontier compute for research workloads.

Azure Quantum

Cloud access to multiple quantum hardware backends (IonQ, Quantinuum, Rigetti, PASQAL) and simulators. Q# language and quantum development kit.

Azure Quantum Elements

Cloud HPC + AI for chemistry and materials science — DFT at scale, generative models for molecules.

CycleCloud

Orchestration for HPC clusters — Slurm, PBS Pro, LSF, GridEngine on Azure compute.

Batch (HPC)

(Listed under compute.) Scale-out parallel batch processing.

AI Compute (ND H100/H200/B200, MI300X)

GPU and accelerator VMs — InfiniBand-clustered for distributed training; available in dedicated AI capacity zones.

25 Architectural patterns

Recurring shapes — what good Azure architectures tend to look like.

Hub-and-spoke networking

One hub VNet hosts shared services: Azure Firewall, ExpressRoute / VPN gateway, DNS, Bastion. Workload VNets ("spokes") peer to the hub. All north-south and inter-spoke traffic flows through the hub for inspection. Virtual WAN automates this at global scale.

Landing zone

The Microsoft Cloud Adoption Framework defines a "landing zone" — a pre-built, opinionated foundation: management groups, subscriptions, networking, identity, policy, monitoring. Two flavors: Azure Landing Zones (enterprise-scale) and Sovereign Landing Zones (regulated). Use the official Bicep / Terraform accelerator instead of building from scratch.

Three-tier web app

  1. Front Door + WAF at the global edge.
  2. App Service (or Container Apps / AKS) behind a Private Endpoint.
  3. SQL Database / Cosmos DB behind a Private Endpoint.
  4. Key Vault for secrets, accessed via Managed Identity.
  5. App Insights + Log Analytics for telemetry.

Event-driven serverless

Producer (e.g. Blob upload) → Event GridAzure FunctionService Bus → downstream worker (Function or Container App). Cosmos DB change feed or Event Hubs for higher volume.

Modern data platform (Fabric)

Sources → Data FactoryOneLake (lakehouse on ADLS) → Synapse Data Engineering (Spark) / Data WarehousePower BI semantic models. Governance via Purview.

Production RAG on Azure

  1. Ingest with Data Factory or Logic Apps; raw to Blob.
  2. Chunk & embed with AI Search integrated vectorization or a Function calling Azure OpenAI embeddings.
  3. Index in Azure AI Search (hybrid + semantic ranker) or Cosmos DB vector index.
  4. Generate via Azure OpenAI gpt-4o with the retrieved context.
  5. Safety: Content Safety + Prompt Shields + groundedness check on every response.
  6. Observability: trace prompts, retrievals, tools through App Insights and Foundry's eval.

Production agent on Azure

  1. AI Foundry Agent Service as the runtime.
  2. Tools: Logic Apps (workflows), Functions (custom logic), MCP servers, Cosmos DB, AI Search.
  3. Identity flows from the user via Entra ID (on-behalf-of); managed identities for downstream Azure resources.
  4. Memory in Cosmos DB (vector + scalar); short-lived state in Redis.
  5. Policy engine (APIM or custom) gates outbound tool calls.
  6. Traces to App Insights; evals and red-teaming in Foundry.

26 Pricing & cost optimization

Azure pricing varies by service. The general levers — which apply almost everywhere — are reservations, savings plans, spot, autoscale, right-sizing, and dev/test pricing.

LeverWhere it appliesTypical savings
Reserved instances (1y / 3y)VMs, SQL DB, Cosmos DB, Synapse, Cache for Redis40–70%
Savings plan for computeFlexible across VMs / Container Apps / Functions / Spring Apps~ same as RI, more flexibility
Spot VMsInterruptible workloads, batch, dev60–90%
Azure Hybrid BenefitBring Windows / SQL / RHEL / SUSE licenses30–80% on those SKUs
Provisioned Throughput UnitsAzure OpenAI high-volumePredictable cost vs PAYG
Batch deploymentsAzure OpenAI offline jobs (24h SLA)~50% off PAYG
Auto-scale + scale-to-zeroApp Service, Functions, Container Appsvaries — pay only when used
Storage tieringBlob (Hot → Cool → Cold → Archive)up to 95% on cold data
Right-sizingContinuous via Advisor recommendations10–40%
Dev/Test subscriptionVisual Studio subscribersDiscounted SKUs and OS licenses
Always tag.
Apply mandatory tags via Azure Policy: environment, cost-center, owner, workload. Without tags, Cost Management is useless after a few months.

27 Choosing services

Quick decision aids when faced with multiple services that overlap.

Run my code

Store my data

Build a GenAI app

Connect on-prem to Azure

Move data

When in doubt, start with the most managed option.
It's almost always cheaper and faster to start on App Service / Container Apps / SQL Database / AI Foundry, and only escalate to AKS / VMs / custom infrastructure when you have a concrete reason.