Security¶

Introduction¶

Security across the hybrid continuum requires adapting cloud security practices for environments with varying levels of connectivity, control, and compliance requirements. The expanded attack surface—spanning multiple networks, identity providers, and physical security boundaries—demands a defense-in-depth strategy grounded in Zero Trust principles.

The Azure Well-Architected Framework Security pillar emphasizes protecting confidentiality, integrity, and availability. In hybrid scenarios, we must apply these principles consistently even when the implementation tools differ between cloud, connected hybrid, and air-gapped environments.

This chapter covers security best practices that apply across all deployment models, with specific guidance for adapting controls to each position on the continuum.

Zero Trust Across the Continuum¶

Zero Trust is a security model based on three principles:

Verify explicitly: Always authenticate and authorize based on all available data points (identity, location, device, workload)
Use least-privilege access: Limit access with just-in-time and just-enough-access (JIT/JEA)
Assume breach: Minimize blast radius and segment access; verify end-to-end encryption

These principles apply universally, but implementation varies by deployment model:

Identity Verification Across Connectivity Models¶

Connected environments (Azure, Connected Azure Local): - Use Microsoft Entra ID as the central identity provider - Implement Multi-Factor Authentication (MFA) for all administrative access - Apply Conditional Access policies based on user, device, location, and risk - Use Privileged Identity Management (PIM) for just-in-time admin access - Federate with on-premises Active Directory using Entra Connect

Disconnected environments: - Deploy local identity providers (self-hosted Active Directory, Keycloak, or FreeIPA) - Implement equivalent controls: local MFA (TOTP via apps, hardware tokens), role-based access control - Synchronize identities during connection windows if periodic connectivity exists - Maintain audit logs of authentication events for compliance

Identity Challenges in Disconnected Environments

Without Entra ID, you lose cloud-powered risk detection (sign-in risk, user risk, anomaly detection). Compensate with enhanced logging, regular access reviews, and stricter password policies.

Network Micro-Segmentation¶

Apply network segmentation at multiple layers:

Azure (cloud): - Use Network Security Groups (NSGs) to control traffic between subnets - Implement Azure Firewall or NVAs for hub-spoke topologies - Use Private Link to access PaaS services over private IPs (no internet exposure) - Enable DDoS Protection for internet-facing resources

Azure Local: - Separate management, storage, and application traffic on different VLANs - Use Kubernetes Network Policies to enforce pod-to-pod communication rules:

apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: api-allow-frontend-only
spec:
  podSelector:
    matchLabels:
      app: api
  ingress:
  - from:
    - podSelector:
        matchLabels:
          app: frontend
    ports:
    - protocol: TCP
      port: 8080

- Implement service mesh (Istio, Linkerd) for mTLS between services

Air-gapped environments: - Physical network isolation with air-gap diodes (one-way data transfer) where required - Strict MAC address filtering and 802.1X authentication for network access - Intrusion Detection Systems (IDS) to monitor for anomalies

Continuous Validation and Monitoring¶

Connected environments: - Microsoft Defender for Cloud provides continuous security posture assessment and threat detection - Microsoft Sentinel aggregates security logs for SIEM (Security Information and Event Management) - Azure Policy enforces compliance (e.g., "All VMs must have disk encryption enabled")

Disconnected environments: - Deploy local SIEM solutions: Wazuh (open-source), Elastic Security, or commercial alternatives - Implement file integrity monitoring (FIM) to detect unauthorized changes - Use vulnerability scanners (OpenVAS, Nessus) on regular schedules - Export logs to write-once media for tamper-proof audit trails

Defense in Depth for Hybrid Architectures¶

Defense in depth applies multiple layers of security controls. If one layer fails, others provide protection.

Layer 1: Physical Security¶

On-premises and Azure Local: - Restricted access to data centers with badge readers, biometric scanners - Video surveillance with retention policies - Environmental monitoring: temperature, humidity, smoke detection - Secure disposal of decommissioned hardware (disk wiping, physical destruction)

Cloud: Physical security is Microsoft's responsibility. Focus on logical security controls.

Layer 2: Network Security¶

Perimeter security: - Firewalls at network boundaries (Azure Firewall, Palo Alto, Fortinet) - Web Application Firewalls (WAF) for HTTP/HTTPS applications (Azure Front Door WAF, ModSecurity) - DDoS mitigation for internet-facing services

Internal segmentation: - VLANs and subnets to separate workloads by sensitivity - Network policies in Kubernetes to enforce least-privilege communication - Service mesh for mTLS and traffic encryption within clusters

Layer 3: Identity and Access Security¶

Strong authentication: - Multi-Factor Authentication (MFA) for all users (Entra ID, or local TOTP/hardware tokens) - Passwordless authentication where possible (FIDO2, Windows Hello for Business) - Certificate-based authentication for service-to-service communication

Authorization: - Role-Based Access Control (RBAC) in Azure, Kubernetes, and applications - Attribute-Based Access Control (ABAC) for fine-grained policies - Just-In-Time (JIT) access via Privileged Identity Management - Service accounts with minimal permissions (Kubernetes service accounts, Azure Managed Identities)

Layer 4: Data Security¶

Encryption at rest: - Azure Disk Encryption using BitLocker (Windows) or dm-crypt (Linux) - Transparent Data Encryption (TDE) for SQL Server, PostgreSQL - Storage Spaces Direct encryption for Azure Local - Key management: Azure Key Vault (connected) or HashiCorp Vault (disconnected)

Encryption in transit: - TLS 1.3 for all external communication - mTLS (mutual TLS) for internal service-to-service communication - IPsec/MACsec for network-level encryption where required

Encryption in use: - Confidential computing with AMD SEV-SNP or Intel TDX for workloads processing highly sensitive data - Secure enclaves for cryptographic operations

Data classification and handling: - Classify data by sensitivity (public, internal, confidential, restricted) - Apply appropriate controls based on classification - Use Data Loss Prevention (DLP) to prevent exfiltration

Layer 5: Application Security¶

Secure development practices: - Threat modeling during design phase (STRIDE methodology) - Secure coding training for developers (OWASP Top 10) - Static Application Security Testing (SAST): SonarQube, Checkmarx - Dynamic Application Security Testing (DAST): OWASP ZAP, Burp Suite - Software Composition Analysis (SCA): Dependency-Track, Snyk

Input validation: - Validate all user input at application boundaries - Use parameterized queries to prevent SQL injection - Implement output encoding to prevent XSS (cross-site scripting) - Apply rate limiting to prevent abuse

API security: - Use OAuth 2.0 / OpenID Connect for authentication - Implement API gateways for centralized policy enforcement (Kong, Apigee, Azure API Management) - Apply request throttling and quotas

Layer 6: Operations Security¶

Logging and monitoring: - Enable audit logging for all administrative actions - Collect logs centrally (Azure Monitor, Elasticsearch, Splunk) - Implement alerting for security events (failed logins, privilege escalations) - Retain logs per compliance requirements (typically 1-7 years)

Vulnerability management: - Regular vulnerability scanning (Qualys, Tenable, Microsoft Defender Vulnerability Management) - Patch management processes with testing and rollout plans - Prioritize patching based on risk (CVSS scores, exploitability, exposure)

Incident response: - Documented incident response plan with defined roles (incident commander, scribe, subject matter experts) - Runbooks for common security incidents (compromised account, ransomware, DDoS) - Regular tabletop exercises to practice incident response - Post-incident reviews to identify improvements

graph TB
    subgraph DefenseInDepth["🛡️ Defense-in-Depth Security Model"]
        direction TB

        subgraph Layer1["Layer 1: Physical Security"]
            Physical["🏢 Physical Access Control<br/>Connected: Azure datacenters<br/>Disconnected: On-prem security"]
        end

        subgraph Layer2["Layer 2: Network Security"]
            direction LR
            Network_C["☁️ Connected:<br/>• NSGs & Firewalls<br/>• Private Endpoints<br/>• Azure Firewall"]
            Network_D["🔒 Disconnected:<br/>• pfSense/iptables<br/>• VLANs & ACLs<br/>• Air-gap boundary"]
        end

        subgraph Layer3["Layer 3: Identity & Access"]
            direction LR
            Identity_C["☁️ Connected:<br/>• Azure AD / Entra ID<br/>• Conditional Access<br/>• MFA"]
            Identity_D["🔒 Disconnected:<br/>• AD DS + ADFS<br/>• Smart Cards<br/>• Local MFA"]
        end

        subgraph Layer4["Layer 4: Application Security"]
            direction LR
            App_C["☁️ Connected:<br/>• Azure AD Auth<br/>• API Management<br/>• WAF"]
            App_D["🔒 Disconnected:<br/>• ADFS/OAuth<br/>• Local API Gateway<br/>• ModSecurity"]
        end

        subgraph Layer5["Layer 5: Data Security"]
            direction LR
            Data_C["☁️ Connected:<br/>• TDE (SQL)<br/>• Storage encryption<br/>• Azure Key Vault"]
            Data_D["🔒 Disconnected:<br/>• TDE (SQL)<br/>• LUKS/BitLocker<br/>• HashiCorp Vault"]
        end

        subgraph Layer6["Layer 6: Operations Security"]
            direction LR
            Ops_C["☁️ Connected:<br/>• Defender for Cloud<br/>• Sentinel SIEM<br/>• Azure Monitor"]
            Ops_D["🔒 Disconnected:<br/>• Wazuh<br/>• Elastic SIEM<br/>• Prometheus+Grafana"]
        end

        Physical --> Network_C
        Physical --> Network_D
        Network_C --> Identity_C
        Network_D --> Identity_D
        Identity_C --> App_C
        Identity_D --> App_D
        App_C --> Data_C
        App_D --> Data_D
        Data_C --> Ops_C
        Data_D --> Ops_D
    end

    subgraph Attacker["👾 Attacker Progression"]
        A1["1. Breach Perimeter"] --> A2["2. Compromise Network"]
        A2 --> A3["3. Steal Credentials"]
        A3 --> A4["4. Exploit Application"]
        A4 --> A5["5. Access Data"]
        A5 --> A6["6. Evade Detection"]
    end

    A1 -.->|Blocked by| Physical
    A2 -.->|Blocked by| Network_C
    A2 -.->|Blocked by| Network_D
    A3 -.->|Blocked by| Identity_C
    A3 -.->|Blocked by| Identity_D
    A4 -.->|Blocked by| App_C
    A4 -.->|Blocked by| App_D
    A5 -.->|Blocked by| Data_C
    A5 -.->|Blocked by| Data_D
    A6 -.->|Detected by| Ops_C
    A6 -.->|Detected by| Ops_D

    style DefenseInDepth fill:#e0f7ff,stroke:#0078d4,stroke-width:3px
    style Layer1 fill:#dc3545,stroke:#a71d2a,stroke-width:2px,color:#fff
    style Layer2 fill:#fd7e14,stroke:#cc6600,stroke-width:2px
    style Layer3 fill:#ffc107,stroke:#f57c00,stroke-width:2px
    style Layer4 fill:#20c997,stroke:#0f6848,stroke-width:2px
    style Layer5 fill:#0078d4,stroke:#005a9e,stroke-width:2px,color:#fff
    style Layer6 fill:#6f42c1,stroke:#4a2870,stroke-width:2px,color:#fff
    style Attacker fill:#000,stroke:#dc3545,stroke-width:3px,color:#fff

    style Network_C fill:#50e6ff,stroke:#0078d4,stroke-width:1px
    style Network_D fill:#107c10,stroke:#004b1c,stroke-width:1px,color:#fff
    style Identity_C fill:#50e6ff,stroke:#0078d4,stroke-width:1px
    style Identity_D fill:#107c10,stroke:#004b1c,stroke-width:1px,color:#fff
    style App_C fill:#50e6ff,stroke:#0078d4,stroke-width:1px
    style App_D fill:#107c10,stroke:#004b1c,stroke-width:1px,color:#fff
    style Data_C fill:#50e6ff,stroke:#0078d4,stroke-width:1px
    style Data_D fill:#107c10,stroke:#004b1c,stroke-width:1px,color:#fff
    style Ops_C fill:#50e6ff,stroke:#0078d4,stroke-width:1px
    style Ops_D fill:#107c10,stroke:#004b1c,stroke-width:1px,color:#fff

Secret Management Across Environments¶

Secrets (passwords, API keys, certificates, encryption keys) require special handling. Never hardcode secrets in application code or configuration files.

Secret Management in Connected Environments¶

Azure Key Vault: - Store secrets, certificates, and keys in Azure Key Vault - Use Azure Managed Identities to access Key Vault without storing credentials - Enable soft delete and purge protection to prevent accidental secret loss - Implement RBAC to control who can read, write, or manage secrets - Use Private Link to access Key Vault over private network

Integration with Kubernetes: - Use Azure Key Vault Provider for Secrets Store CSI Driver to mount secrets as volumes in pods - Secrets are synchronized from Key Vault to Kubernetes at pod start time

Secret Management in Disconnected Environments¶

HashiCorp Vault: - Deploy Vault as a self-hosted secret management solution - Configure Vault HA with multi-node clusters backed by Consul or integrated storage - Use dynamic secrets where possible (Vault generates short-lived database credentials) - Implement secret rotation on defined schedules - Integrate with Kubernetes via Vault Agent Injector or CSI driver

Certificate lifecycle management: - Use cert-manager in Kubernetes to automate certificate issuance and renewal - For internal CAs, deploy HashiCorp Vault PKI or EJBCA - Monitor certificate expiration and alert before expiry

Secret Rotation

Automate secret rotation wherever possible. Manual rotation is error-prone and often skipped. Dynamic secrets (generated on-demand with short TTLs) are more secure than long-lived static secrets.

Supply Chain Security¶

Supply chain attacks—compromises introduced through dependencies, build tools, or infrastructure—are a growing threat. This is especially critical in air-gapped environments where external validation is limited.

Container Image Security¶

Image signing and verification: - Use Cosign (part of Sigstore) to sign container images - Configure admission controllers (Kyverno, OPA Gatekeeper) to require signed images:

apiVersion: kyverno.io/v1
kind: ClusterPolicy
metadata:
  name: verify-images
spec:
  validationFailureAction: enforce
  rules:
  - name: verify-signature
    match:
      resources:
        kinds:
        - Pod
    verifyImages:
    - image: "myregistry.azurecr.io/*"
      key: |-
        -----BEGIN PUBLIC KEY-----
        ...
        -----END PUBLIC KEY-----

Image scanning: - Scan images for vulnerabilities with Trivy, Grype, or Clair - Integrate scanning into CI/CD pipelines (fail builds on critical vulnerabilities) - Run periodic scans on images in registries to detect newly disclosed vulnerabilities

Registry security: - Use Azure Container Registry (connected) or Harbor (disconnected) - Enable content trust (Docker Content Trust / Notary v2) - Implement RBAC on registries (separate read/write permissions) - Use immutable tags to prevent tag overwriting

Software Bill of Materials (SBOM)¶

Generate and maintain SBOMs for all applications: - Use Syft or SBOM-tool to generate SBOMs in SPDX or CycloneDX format - Store SBOMs alongside container images - Use SBOMs during vulnerability response to quickly identify affected components

Dependency Management¶

Dependency scanning: - Use Dependabot (GitHub), Renovate, or Snyk to track dependencies - Monitor for vulnerabilities in dependencies (CVE databases, GitHub Security Advisories) - Update dependencies regularly; test updates in non-production environments first

Vendoring for air-gapped: - For air-gapped environments, vendor all dependencies (container images, language libraries, OS packages) - Maintain an internal mirror of required packages - Scan vendored dependencies before importing into air-gapped networks

Security Monitoring and Incident Response¶

Monitoring in Connected Environments¶

Microsoft Defender for Cloud: - Provides Cloud Security Posture Management (CSPM): identifies misconfigurations - Offers Cloud Workload Protection (CWP): detects threats (e.g., crypto-mining, lateral movement) - Generates secure score to measure security posture

Microsoft Sentinel: - Aggregates logs from Azure, on-premises, and third-party sources - Uses analytics rules to detect security incidents - Provides Security Orchestration, Automation, and Response (SOAR) playbooks

Integration with Azure Local: - Connect Azure Local clusters to Defender for Cloud via Azure Arc - Forward logs to Sentinel via Azure Monitor Agent or syslog

Monitoring in Disconnected Environments¶

Local SIEM solutions: - Wazuh: Open-source SIEM with agents for log collection, FIM, vulnerability detection - Elastic Security: Elasticsearch-based security analytics - Splunk: Commercial SIEM (can be deployed on-premises)

Log aggregation: - Collect logs from all sources: OS, Kubernetes, applications, network devices - Use Fluentd or Logstash to forward logs to SIEM - Implement log retention policies based on compliance requirements

Alerting: - Configure alerts for security events: failed logins, privilege escalations, unusual network traffic - Integrate with on-premises notification systems (email via local SMTP, SMS gateways, PagerDuty self-hosted)

Incident Response Process¶

Detection: Security monitoring identifies potential incident
Triage: On-call engineer determines severity and escalates if needed
Containment: Isolate affected systems to prevent spread (network segmentation, disable accounts)
Eradication: Remove threat (delete malware, patch vulnerability, revoke compromised credentials)
Recovery: Restore systems to normal operation (restore from backups if needed)
Post-incident review: Document lessons learned, update runbooks, improve detection

Incident Response in Hybrid Environments

A ransomware incident affecting Azure Local requires: - Detection: Wazuh alerts on file encryption activity - Containment: Isolate affected nodes via network policies, disable compromised accounts - Eradication: Rebuild affected nodes from golden images - Recovery: Restore application data from backups (tested regularly via DR drills) - Review: Identify initial access vector (phishing email), improve email filtering, enhance user training

Compliance and Regulatory Requirements¶

Many hybrid deployments are driven by compliance requirements (GDPR, HIPAA, FedRAMP, CMMC). Security controls must align with applicable regulations.

Common compliance requirements: - Data residency: Data must remain in specific geographic locations (addressed by Azure Local in-country) - Encryption: Data must be encrypted at rest and in transit (addressed by TDE, TLS, disk encryption) - Access controls: Least-privilege access with MFA (addressed by RBAC, Entra ID) - Audit logging: All access to sensitive data must be logged (addressed by centralized logging) - Vulnerability management: Regular scanning and patching (addressed by continuous scanning)

Compliance tooling: - Connected: Azure Policy with built-in compliance initiatives (PCI-DSS, HIPAA, FedRAMP) - Disconnected: OpenSCAP for configuration compliance scanning, custom scripts for policy enforcement

Security Operations Best Practices¶

Security champions: Designate security champions within development teams to promote security awareness and best practices.

Secure by default configurations: Use CIS Benchmarks or DISA STIGs to harden OS and Kubernetes configurations.

Regular security assessments: Conduct penetration testing annually or after significant changes. Engage third-party security firms for unbiased assessments.

Security training: Provide regular training for developers (secure coding), operators (security monitoring), and users (phishing awareness).

Shift-left security: Integrate security into CI/CD pipelines. Fail builds on critical vulnerabilities or policy violations.

Conclusion¶

Security across the hybrid continuum requires adapting cloud security practices for environments with varying connectivity and control. By applying Zero Trust principles consistently, implementing defense-in-depth strategies, managing secrets securely, protecting the supply chain, and maintaining robust monitoring and incident response capabilities, teams can build secure workloads that meet compliance requirements across all deployment models.

The key insight: security is not a product or a checklist—it's a continuous process of risk assessment, control implementation, monitoring, and improvement.

Next: Operations →