infra(hetzner): per-region hcloud_network — DMZ-WG, no shared private net (#1507)

* docs(sovereign): pin multi-region DoD contract — never divert from D1-D14

Founder ruling 2026-05-15: every silent compromise from the multi-region target-state architecture is a quality violation. This file locks the convergence contract so future Claude sessions cannot drift.

Architecture invariants A1-A6:
- 3 regions minimum (never drop to 2 to dodge provider capacity)
- Inter-region link = DMZ WireGuard over PUBLIC IPs, ALWAYS (no hcloud_network cross-region, no VPC peering, no Huawei VPC)
- Cilium ClusterMesh apiserver = LoadBalancer (NEVER NodePort)
- vCluster topology: primary = MGMT+DMZ, secondary = DMZ+RTZ
- Zero public exposure of K8s control-plane endpoints
- Provider-mix is canonical (assume 1 Hetzner + 1 AWS + 1 Huawei)

DoD gates D1-D14 enforced via Playwright MCP + kubectl + cilium CLI on every fresh prov. No partial credit, no "deferred", no "matrix-drift".

Mirrored to auto-memory at ~/.claude/projects/-home-openova-repos-openova-private/memory/sovereign_multiregion_dod.md so it loads at every session start.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* infra(hetzner): per-region hcloud_network — DMZ-WG, no shared private net

Implements A1+A2+A6 from docs/SOVEREIGN-MULTI-REGION-DOD.md. Each region gets its own hcloud_network (10.0.0.0/16 INSIDE each, not shared across). Inter-region link is exclusively Cilium WireGuard over PUBLIC IPs through the DMZ; no provider's internal network ever spans regions.

- Replaces hcloud_network.main + hcloud_network_subnet.{main,secondary} with hcloud_network.region[*] + hcloud_network_subnet.region[*] (for_each over toset(local.all_region_keys); primary key = "primary", secondary keys = slice-G1 "{cloudRegion}-{index}" shape).
- Per-region cluster-cidr (10.42+i.0/16) + service-cidr (10.96+i.0/16) threaded through cloud-init so ClusterMesh peers don't collide on pod/service CIDRs (DoD gate D11).
- Firewall: open UDP 51871 from 0.0.0.0/0 (Cilium WG inter-region encryption); without this the WG mesh between regions cannot form.
- Each CP's local private IP is now uniformly 10.0.1.2 per region (every region has its own /24 inside its own /16; no cross-region IP collision class possible by construction).
- Hetzner resource names threaded to cluster-autoscaler now use hcloud_network.region["primary"|<k>].name so autoscaler-spawned workers land in the same isolated /16 as their region's CP.
- Pre-2026-05-15 state will plan a network-recreate on next apply; per DoD cycle protocol this is consciously accepted (no tofu state mv runbook, every wipe-and-create is a fresh provision).
- tofu tests cover: per-region network count + uniform 10.0.0.0/16 + uniform 10.0.1.0/24 subnet + per-region cluster/service CIDRs + Cilium WG firewall rule existence.
- README "Network" section adds the 3-region DMZ-WG ASCII topology.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* chore(tofu): apply tofu fmt — fixes CI fmt-check on PR #1507

Apply OpenTofu's canonical formatting to main.tf. No semantic changes; only whitespace alignment under template substitute blocks where my refactor added 2-char fields (`cluster_cidr` and `service_cidr`) that perturbed the prior column alignment.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

---------

Co-authored-by: hatiyildiz <hatice.yildiz@openova.io>
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Co-authored-by: claude <claude@anthropic.com>
2026-05-15 22:04:32 +04:00 · 2026-05-15 22:04:32 +04:00 · 93f699326a
commit 93f699326a
parent 918046874c
4 changed files with 441 additions and 134 deletions
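
The per-region CIDR rule described in the commit message (pods at `10.42+i.0/16`, services at `10.96+i.0/16`, and the same `10.0.0.0/16` node network reused inside every isolated Network) can be sanity-checked outside OpenTofu. The following is a minimal sketch only: it is written in Python rather than the module's HCL, the names `region_cidrs`, `assert_no_overlap`, and `MAX_REGIONS` are hypothetical and do not exist in the module, and it assumes a region's index is simply its position in the list it is given.

```python
import ipaddress

# Node network reused inside every region's isolated hcloud_network.
NODE_NET = ipaddress.ip_network("10.0.0.0/16")
# Upper bound stated in the main.tf comment (10.42..10.57 and 10.96..10.111).
MAX_REGIONS = 16


def region_cidrs(region_keys):
    """Map each region key to (cluster_cidr, service_cidr) by list position."""
    if len(region_keys) > MAX_REGIONS:
        raise ValueError(f"at most {MAX_REGIONS} regions are supported")
    return {
        key: (
            ipaddress.ip_network(f"10.{42 + i}.0.0/16"),  # pod CIDR
            ipaddress.ip_network(f"10.{96 + i}.0.0/16"),  # service CIDR
        )
        for i, key in enumerate(region_keys)
    }


def assert_no_overlap(cidrs):
    """Raise if any pod/service CIDR overlaps another one or the node network."""
    nets = [NODE_NET] + [net for pair in cidrs.values() for net in pair]
    for idx, left in enumerate(nets):
        for right in nets[idx + 1:]:
            if left.overlaps(right):
                raise ValueError(f"{left} overlaps {right}")


# Example keys taken from the README topology diagram.
keys = ["primary", "fsn1-1", "hel1-2"]
cidrs = region_cidrs(keys)
assert_no_overlap(cidrs)
for key, (pods, services) in cidrs.items():
    print(f"{key}: pods={pods} services={services}")
```

Under these assumptions, the check passes for any list of up to 16 keys, and a 17th region raises `ValueError` instead of silently allocating `10.58.0.0/16`. Whether the module itself enforces that bound is not shown in this diff.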
--- a/infra/hetzner/README.md
+++ b/infra/hetzner/README.md
@@ -10,28 +10,97 @@ This module is the implementation of [`docs/SOVEREIGN-PROVISIONING.md`](../../do

 | Resource | Purpose |
 |---|---|
-| `hcloud_network` + `hcloud_network_subnet` | Private 10.0.0.0/16 with 10.0.1.0/24 reserved for control-plane and workers. |
-| `hcloud_firewall` | Inbound rules for 80/443 (HTTPS), 6443 (k3s API), ICMP, and an opt-in SSH rule keyed to operator CIDRs. |
-| `hcloud_ssh_key` | The operator's existing SSH key (from their Hetzner project) — never auto-generated. |
-| `hcloud_server` (control plane) | 1 node by default (`ha_enabled=false`); 3 nodes when HA is on. Cloud-init installs k3s + Flux + the bootstrap kit pointer. |
-| `hcloud_server` (workers) | `worker_count` nodes (default **2** — issue #733 multi-node Sovereign). Set to 0 explicitly for solo dev/POC. |
-| `hcloud_load_balancer` (`lb11`) | Public IPv4; forwards 80→31080 and 443→31443 (Cilium Gateway NodePorts post-bootstrap). |
-| `null_resource.dns_pool` | Calls `/usr/local/bin/catalyst-dns` (a helper inside the catalyst-api container) when `domain_mode=pool` to write Dynadot A records for the new sovereign FQDN. |
+| `hcloud_network.region[*]` | **One private 10.0.0.0/16 PER REGION.** Each region gets its own isolated Network — never shared across regions. See [Network](#network) below. |
+| `hcloud_network_subnet.region[*]` | One 10.0.1.0/24 subnet inside each region's Network. Uniform layout: CP at .2, workers at .10+, LB anchor at .254. |
+| `hcloud_firewall.main` | Inbound rules for 80/443 (HTTPS), 6443 (k3s API), 53 (DNS), ICMP, **UDP 51871 (Cilium WireGuard inter-region encryption)**, and an opt-in SSH rule keyed to operator CIDRs. |
+| `hcloud_ssh_key.main` | The operator's existing SSH key (from their Hetzner project) — never auto-generated. |
+| `hcloud_server.control_plane[*]` (primary) | 1 node by default (`ha_enabled=false`); 3 nodes when HA is on. Cloud-init installs k3s + Flux + the bootstrap kit pointer. Attaches to `hcloud_network.region["primary"]`. |
+| `hcloud_server.worker[*]` (primary) | `worker_count` nodes (default **2** — issue #733 multi-node Sovereign). Set to 0 explicitly for solo dev/POC. |
+| `hcloud_server.secondary_control_plane[*]` | One per secondary region, single-CP. Attaches to its own region's `hcloud_network.region[<key>]`. |
+| `hcloud_server.secondary_worker[*]` | Per-region worker fleet, sized by `regions[i].workerCount`. |
+| `hcloud_load_balancer.main` (`lb11`) | Public IPv4 in the primary region; forwards 80→30080, 443→30443, 53→30053 (Cilium Gateway NodePorts). |
+| `hcloud_load_balancer.secondary[*]` | One `lb11` per secondary region. PowerDNS lua-records aggregate every LB IP behind the Sovereign FQDN with `ifurlup` health probes. |

 After Phase 0, the cluster's Flux pulls `clusters/<sovereign_fqdn>/` from the public OpenOva monorepo and installs the 11-component bootstrap kit (Cilium → cert-manager → Crossplane → ESO → SPIRE → NATS → OpenBao → Keycloak → Gitea → catalyst-platform). Hetzner adoption by Crossplane happens once `provider-hcloud` is up.

 ---

-## Multi-region wiring (slice G1, EPIC-0 #1095)
+## Network

-The module accepts a `var.regions[]` list-of-objects payload that captures the wizard's per-region sizing. Slice G1 wires every entry in that list end-to-end:
+**Per [`docs/SOVEREIGN-MULTI-REGION-DOD.md`](../../docs/SOVEREIGN-MULTI-REGION-DOD.md) A2 (founder ruling 2026-05-15): every region has its OWN `hcloud_network`. Provider private networks NEVER span regions — inter-region traffic flows exclusively over Cilium WireGuard (UDP 51871) on each region's public IP through the DMZ vCluster.**
+
+This is the same rule whether the secondary region is Hetzner, AWS, or Huawei (DoD A6). The Hetzner module is provider-mix-friendly: a sister-provider module owns its own regions, and the inter-region link sits ABOVE the provider layer.
+
+### Per-region addressing (uniform across regions)
+
+Every region's Network carries an identical `/16` and `/24`:
+
+| Address | Role |
+|---|---|
+| `10.0.0.0/16` | Region's private Network (one per region; ranges are identical inside isolated Networks). |
+| `10.0.1.0/24` | Region's subnet (only subnet inside that Network). |
+| `10.0.1.2` | Control plane (every region's CP — primary AND secondary). |
+| `10.0.1.10` .. `10.0.1.<10+worker_count-1>` | Workers in that region. |
+| `10.0.1.254` | LB anchor (pinned via `hcloud_load_balancer_network.ip`). |
+
+### Phase-2 topology (DMZ-WG over public IPs)
+
+```
+                ┌────────────── DMZ WireGuard mesh (UDP 51871, public IPs) ──────────────┐
+                │                                                                         │
+   ┌────────────┴────────────┐    ┌─────────────────────────┐    ┌──────────────────────┐
+   │ Region: primary (nbg1)  │    │ Region: fsn1-1 (fsn1)   │    │ Region: hel1-2 (hel1)│
+   │  ┌───────────────────┐  │    │  ┌───────────────────┐  │    │  ┌─────────────────┐ │
+   │  │ hcloud_network    │  │    │  │ hcloud_network    │  │    │  │ hcloud_network  │ │
+   │  │ 10.0.0.0/16       │  │    │  │ 10.0.0.0/16       │  │    │  │ 10.0.0.0/16     │ │
+   │  │  └ 10.0.1.0/24    │  │    │  │  └ 10.0.1.0/24    │  │    │  │  └ 10.0.1.0/24  │ │
+   │  └───────────────────┘  │    │  └───────────────────┘  │    │  └─────────────────┘ │
+   │   CP @ 10.0.1.2         │    │   CP @ 10.0.1.2         │    │   CP @ 10.0.1.2      │
+   │   Workers @ 10.0.1.10+  │    │   Workers @ 10.0.1.10+  │    │  Workers @ 10.0.1.10+│
+   │   LB  @ 10.0.1.254      │    │   LB  @ 10.0.1.254      │    │   LB  @ 10.0.1.254   │
+   │                         │    │                         │    │                      │
+   │  k3s cluster-cidr:      │    │  k3s cluster-cidr:      │    │  k3s cluster-cidr:   │
+   │   10.42.0.0/16          │    │   10.43.0.0/16          │    │   10.44.0.0/16       │
+   │  k3s service-cidr:      │    │  k3s service-cidr:      │    │  k3s service-cidr:   │
+   │   10.96.0.0/16          │    │   10.97.0.0/16          │    │   10.98.0.0/16       │
+   │                         │    │                         │    │                      │
+   │     ┌───────────────┐   │    │     ┌───────────────┐   │    │     ┌─────────────┐  │
+   │     │ DMZ vCluster  │◀──┼────┼────▶│ DMZ vCluster  │◀──┼────┼────▶│ DMZ vCluster│  │
+   │     └───────────────┘   │    │     └───────────────┘   │    │     └─────────────┘  │
+   └─────────────────────────┘    └─────────────────────────┘    └──────────────────────┘
+       │                              │                              │
+       └─ Cilium ClusterMesh apiserver (Service type = LoadBalancer) ─┘
+              public IPs only, never NodePort (DoD A3+A5)
+```
+
+The arrows between DMZ vClusters are WireGuard tunnels carried over each region's public IPv4 + UDP 51871. The provider's internal cross-region routing fabric is **never** in the path — even when all three regions are Hetzner.
+
+### Per-region k3s CIDRs (DoD gate D11)
+
+Cilium ClusterMesh demands non-overlapping pod and service CIDRs across peer clusters. The module allocates them deterministically from the regional index in `local.all_region_keys`:
+
+| Region index | Region key | cluster-cidr (pods) | service-cidr |
+|---|---|---|---|
+| 0 | `primary` | `10.42.0.0/16` | `10.96.0.0/16` |
+| 1 | `<region>-1` (first secondary) | `10.43.0.0/16` | `10.97.0.0/16` |
+| 2 | `<region>-2` (second secondary) | `10.44.0.0/16` | `10.98.0.0/16` |
+| 3 | `<region>-3` | `10.45.0.0/16` | `10.99.0.0/16` |
+| ... up to 16 regions | ... | `10.42+i.0/16` | `10.96+i.0/16` |
+
+The flags `--cluster-cidr` and `--service-cidr` are threaded into the k3s install line in `cloudinit-control-plane.tftpl` via the `${cluster_cidr}` and `${service_cidr}` template substitutes. The catalyst-api passes nothing region-related at runtime — the allocation is pure tofu.
+
+---
+
+## Multi-region wiring (slice G1, EPIC-0 #1095 → DMZ-WG refactor 2026-05-15)
+
+The module accepts a `var.regions[]` list-of-objects payload that captures the wizard's per-region sizing. Slice G1 wires every entry in that list end-to-end; the 2026-05-15 DMZ-WG refactor replaced the shared-network model with one Network per region.

 | `var.regions[i]` | Realised by | Notes |
 |---|---|---|
-| `regions[0]` | Legacy singular path (`hcloud_server.control_plane[0]`, `hcloud_load_balancer.main`, …) | Identity-preserving — no resource-address change for any Sovereign provisioned before slice G1. The catalyst-api provisioner mirrors `regions[0]` into `var.region` / `var.control_plane_size` / `var.worker_size` / `var.worker_count` before `tofu apply`. |
-| `regions[1+]` | Multi-region overlay (`hcloud_server.secondary_control_plane["fsn1-1"]`, `hcloud_load_balancer.secondary["hel1-2"]`, …) | New resources keyed by `for_each = local.secondary_regions`. Same `hcloud_network.main` + `hcloud_firewall.main` + `hcloud_ssh_key.main` (one tenant boundary per Sovereign). Each secondary region gets its own `/24` subnet inside the shared `/16` and its own `lb11`. |
+| `regions[0]` | Legacy singular path (`hcloud_server.control_plane[0]`, `hcloud_load_balancer.main`, attached to `hcloud_network.region["primary"]`, …) | The catalyst-api provisioner mirrors `regions[0]` into `var.region` / `var.control_plane_size` / `var.worker_size` / `var.worker_count` before `tofu apply`. |
+| `regions[1+]` | Multi-region overlay (`hcloud_server.secondary_control_plane["fsn1-1"]`, `hcloud_load_balancer.secondary["hel1-2"]`, …) | New resources keyed by `for_each = local.secondary_regions`. Each secondary region has its OWN `hcloud_network.region[<key>]`, its OWN `/24` subnet, and its OWN `lb11`. The shared resources across the Sovereign are `hcloud_firewall.main` + `hcloud_ssh_key.main` only. |

-The hybrid (singular path + secondary-region overlay) is purely **additive**: no existing Sovereign state has entries in `local.secondary_regions` (the iteration filter `if i > 0` excludes `regions[0]`), so legacy `tofu plan` outputs are unchanged for any Sovereign whose request body has `len(regions) ≤ 1`. **No `tofu state mv` is required for any pre-G1 state.**
+**Network address impact:** the legacy `hcloud_network.main` + `hcloud_network_subnet.main` singletons (and the slice-G1 `hcloud_network_subnet.secondary` overlay) were removed in this refactor. Every region — primary and secondary — is keyed under `hcloud_network.region[<key>]` / `hcloud_network_subnet.region[<key>]`. Per the DoD cycle protocol (every wipe-and-create is a fresh provision), legacy state migrates by destroy-and-recreate, **NOT** by `tofu state mv` — this is intentional.

 ### EPIC-6 (#1101) example: 3-region Continuum DR shape

@@ -92,10 +161,32 @@ The catalyst-api joins `secondary_region_keys` with `Request.Regions[1+]` to pro

 ### Resource address contract

-Every legacy resource keeps its existing address. New addresses introduced by slice G1, all keyed `for_each = local.secondary_regions` (key shape: `{cloudRegion}-{index}` where `index` is the position in `var.regions[]`):
+The 2026-05-15 DMZ-WG refactor introduced one Network per region (replacing the singleton `hcloud_network.main` and the slice-G1 secondary overlay) — pre-2026-05-15 Sovereigns will recreate the network on the next apply per the DoD cycle protocol.
+
+Per-region resources (keyed by `for_each` on `toset(local.all_region_keys)`; the primary key is the literal string `"primary"`, secondary keys follow the slice-G1 `{cloudRegion}-{index}` shape):
+
+```
+hcloud_network.region["primary"]
+hcloud_network.region["{cloudRegion}-{index}"]
+hcloud_network_subnet.region["primary"]
+hcloud_network_subnet.region["{cloudRegion}-{index}"]
+```
+
+Primary-region resources (legacy singular path, regions[0]):
+
+```
+hcloud_server.control_plane[0..2]              # count = 1 or 3 (HA)
+hcloud_server.worker[0..N-1]                   # N = var.worker_count
+hcloud_load_balancer.main
+hcloud_load_balancer_network.main
+hcloud_load_balancer_target.control_plane[0..2]
+hcloud_load_balancer_target.workers[0..N-1]
+hcloud_load_balancer_service.{http,https,dns}
+```
+
+Secondary-region resources (slice-G1 overlay, regions[1+], all keyed `for_each = local.secondary_regions`):

 ```
-hcloud_network_subnet.secondary["{key}"]
 hcloud_server.secondary_control_plane["{key}"]
 hcloud_server.secondary_worker["{key}-w{i}"]   # i = 1..workerCount
 hcloud_load_balancer.secondary["{key}"]
@@ -196,6 +287,8 @@ The Phase-0 firewall is intentionally minimal. All long-term policy is enforced
 | 80 | TCP | `0.0.0.0/0`, `::/0` | HTTP — for ACME HTTP-01 challenges and the cert-manager bootstrap. Cilium Gateway terminates. |
 | 443 | TCP | `0.0.0.0/0`, `::/0` | HTTPS — the only port end-users reach. All Catalyst surfaces (`console`, `gitea`, `harbor`, `admin`, `api`) are served behind 443 via Cilium Gateway and SNI routing. |
 | 6443 | TCP | `0.0.0.0/0`, `::/0` | k3s API server. Open to allow the wizard to fetch the kubeconfig and confirm the cluster is healthy. Crossplane Composition tightens this to operator-owned CIDRs in Phase 2. |
+| 53 | TCP+UDP | `0.0.0.0/0`, `::/0` | Sovereign's PowerDNS authoritative server — open for LE DNS-01 challenges and subdomain NS delegation. |
+| 51871 | UDP | `0.0.0.0/0`, `::/0` | **Cilium WireGuard inter-region encryption (DMZ-WG)**. Per DoD A2, inter-region pod-to-pod traffic flows exclusively over WireGuard on public IPs — never over the provider's internal network. Source is `0.0.0.0/0` because each region's CP/worker public IP rotates at provision time; Cilium's static-key crypto + node-discovery auth is the real security boundary, not the firewall source filter. |
 | ICMP | ICMP | `0.0.0.0/0`, `::/0` | Diagnostics (Path MTU Discovery, traceroute). Open by default; closing it is a foot-gun that breaks PMTU. |
 | 22 | TCP | `var.ssh_allowed_cidrs` (default: empty) | SSH break-glass. **Off by default** — the rule is omitted entirely when the list is empty. Operators add their own CIDRs at provisioning time or via a Crossplane Composition later. |

@@ -234,6 +327,8 @@ k3s is installed via `curl get.k3s.io | sh -` from cloud-init. The `INSTALL_K3S_
 | `--disable=servicelb` | k3s ships with klipper-lb; we use the Hetzner load balancer for ingress (`hcloud_load_balancer.main`) and PowerDNS lua-records (`ifurlup`) for cross-region failover. klipper-lb would steal the NodePort 80/443 binding. |
 | `--disable=local-storage` | k3s ships local-path-provisioner; we use **hcloud-csi** (provisioned by Crossplane after Phase 1) so PVCs survive node deletion and can be migrated across regions via Velero. |
 | `--disable-network-policy` | k3s ships kube-router NetworkPolicy; **Cilium** handles NetworkPolicy. Two NetworkPolicy controllers fight each other. |
+| `--cluster-cidr=10.42+i.0/16` | Pod CIDR — non-overlapping across ClusterMesh peers (DoD gate D11). Index `i` is the region's position in `local.all_region_keys` (0 = primary). Allocated by `local.region_cluster_cidr` in `main.tf`. |
+| `--service-cidr=10.96+i.0/16` | Service CIDR — non-overlapping across ClusterMesh peers. Same allocation rule as `--cluster-cidr`. |
 | `--tls-san=<sovereign_fqdn>` | API server TLS cert must be valid for the public sovereign FQDN, otherwise the wizard's kubeconfig fetch and any operator running `kubectl --server=https://<fqdn>:6443` get a SAN mismatch. |
 | `--node-label catalyst.openova.io/role=control-plane` | Used by NodeAffinity on Catalyst control-plane services (Console, projector, etc.) to pin them off worker nodes. |
 | `--write-kubeconfig-mode=0644` | Lets the catalyst-api fetch the kubeconfig over the wizard channel without sudo. The kubeconfig is rotated and replaced with a SPIFFE-issued identity in Phase 2. |
--- a/infra/hetzner/cloudinit-control-plane.tftpl
+++ b/infra/hetzner/cloudinit-control-plane.tftpl
@@ -1341,7 +1341,16 @@ runcmd:
      exit 1
    fi

-  - 'CP_PUBLIC_IPV4=$(curl -fsSL --retry 30 --retry-delay 2 http://169.254.169.254/hetzner/v1/metadata/public-ipv4) && curl -sfL https://get.k3s.io | INSTALL_K3S_VERSION=${k3s_version} K3S_TOKEN=${k3s_token} INSTALL_K3S_EXEC="server --cluster-init --flannel-backend=none --disable-network-policy --disable=traefik --disable=servicelb --node-ip=${cp_private_ip} --advertise-address=${cp_private_ip} --kubelet-arg=max-pods=220 --tls-san=${sovereign_fqdn} --tls-san=${cp_private_ip} --tls-san=$${CP_PUBLIC_IPV4} --kube-apiserver-arg=oidc-issuer-url=https://auth.${sovereign_fqdn}/realms/sovereign --kube-apiserver-arg=oidc-client-id=kubectl --kube-apiserver-arg=oidc-username-claim=preferred_username --kube-apiserver-arg=oidc-username-prefix=oidc: --kube-apiserver-arg=oidc-groups-claim=groups --kube-apiserver-arg=oidc-groups-prefix=oidc: --node-label catalyst.openova.io/role=control-plane --node-label openova.io/region=hz-fsn-rtz-prod ${worker_count > 0 ? "--node-taint node-role.kubernetes.io/control-plane=true:NoSchedule " : ""}--write-kubeconfig-mode=0644" sh -'
+  # k3s install — server mode, embedded etcd (--cluster-init), Cilium-ready
+  # (flannel/network-policy/traefik/servicelb all disabled). The
+  # --cluster-cidr and --service-cidr flags are per-region (10.42+i.0/16
+  # for pods, 10.96+i.0/16 for services) so ClusterMesh peers across
+  # regions don't collide on pod/service routing tables — DoD gate D11
+  # (docs/SOVEREIGN-MULTI-REGION-DOD.md) verifies inter-region pod-to-pod
+  # packet flow over Cilium WireGuard which requires non-overlapping
+  # CIDRs end-to-end. Values are interpolated by OpenTofu from
+  # local.region_cluster_cidr / local.region_service_cidr in main.tf.
+  - 'CP_PUBLIC_IPV4=$(curl -fsSL --retry 30 --retry-delay 2 http://169.254.169.254/hetzner/v1/metadata/public-ipv4) && curl -sfL https://get.k3s.io | INSTALL_K3S_VERSION=${k3s_version} K3S_TOKEN=${k3s_token} INSTALL_K3S_EXEC="server --cluster-init --flannel-backend=none --disable-network-policy --disable=traefik --disable=servicelb --cluster-cidr=${cluster_cidr} --service-cidr=${service_cidr} --node-ip=${cp_private_ip} --advertise-address=${cp_private_ip} --kubelet-arg=max-pods=220 --tls-san=${sovereign_fqdn} --tls-san=${cp_private_ip} --tls-san=$${CP_PUBLIC_IPV4} --kube-apiserver-arg=oidc-issuer-url=https://auth.${sovereign_fqdn}/realms/sovereign --kube-apiserver-arg=oidc-client-id=kubectl --kube-apiserver-arg=oidc-username-claim=preferred_username --kube-apiserver-arg=oidc-username-prefix=oidc: --kube-apiserver-arg=oidc-groups-claim=groups --kube-apiserver-arg=oidc-groups-prefix=oidc: --node-label catalyst.openova.io/role=control-plane --node-label openova.io/region=hz-fsn-rtz-prod ${worker_count > 0 ? "--node-taint node-role.kubernetes.io/control-plane=true:NoSchedule " : ""}--write-kubeconfig-mode=0644" sh -'

  # Wait for the API server to be reachable. Cilium needs to come up before
  # nodes Ready, so we wait specifically for the API endpoint.
--- a/infra/hetzner/main.tf
+++ b/infra/hetzner/main.tf
@@ -14,24 +14,113 @@
 #   - No bespoke API calls (we use the canonical hcloud terraform provider)
 #   - Phase 0 is OpenTofu, day-2 is Crossplane, GitOps is Flux, install unit is Blueprints

-# ── Network: private 10.0.0.0/16 with control-plane subnet ────────────────
+# ── Network: ONE hcloud_network PER REGION — no shared private net ────────
+#
+# Founder ruling 2026-05-15 (see docs/SOVEREIGN-MULTI-REGION-DOD.md A2):
+#
+#   > "fuck!!! You can never use private links between regions, why are you
+#   >  wasting my time, how many times I need to explain the architecture!!!
+#   >  irrespective from the same provider or different providers, your
+#   >  wireguard always happens through the DMZ wireguard, never internal
+#   >  shitty routing!!!!!!"
+#
+# Earlier slice-G1 design used ONE shared hcloud_network.main with a /24
+# subnet per region inside the same /16. That gave every CP across every
+# region addresses in 10.0.x.0/24 inside a Hetzner-routed Network — i.e.
+# Hetzner's internal cross-zone routing carrying inter-region pod and
+# control-plane traffic. The DoD contract forbids that explicitly.
+#
+# The replacement: ONE hcloud_network PER REGION, each carrying its OWN
+# 10.0.0.0/16 (the /16 ranges are identical inside each network — they
+# don't share a routing domain, so the collision is harmless and intended).
+# Each network gets a single /24 subnet 10.0.1.0/24, so every CP across
+# every region is at the SAME private IP 10.0.1.2 — uniform, intra-region-
+# only. Inter-region traffic flows EXCLUSIVELY over Cilium WireGuard
+# (UDP 51871) on the public IPs through each region's DMZ vCluster.
+#
+# Provider-agnostic by design (A6): whether the second/third region is
+# Hetzner, AWS, or Huawei, the inter-region link is the same DMZ-WG
+# overlay on public IPs. The Hetzner module never assumes a Hetzner peer
+# on the other side.
+#
+# Resource address impact: the legacy `hcloud_network.main` and
+# `hcloud_network_subnet.main` (singletons) are REPLACED by
+# `hcloud_network.region[<key>]` and `hcloud_network_subnet.region[<key>]`
+# (for_each maps keyed by region key — "primary" for regions[0], the
+# slice-G1 "{cloudRegion}-{index}" form for secondaries). The legacy
+# `hcloud_network_subnet.secondary` (for_each) is also replaced by the
+# unified `hcloud_network_subnet.region` map. Any pre-2026-05-15 Sovereign
+# state will plan a network-recreate on the next apply — by founder
+# directive every Sovereign re-provisions cleanly from the fresh DoD
+# contract, so the state-migration cost is consciously accepted.
+locals {
+  # Region key set: "primary" for regions[0] (driven by the singular
+  # path) plus every secondary region key. Used as the for_each map of
+  # hcloud_network.region / hcloud_network_subnet.region so EVERY region —
+  # primary and secondary — has its own isolated /16.
+  all_region_keys = concat(["primary"], [for k, _ in local.secondary_regions : k])

-resource "hcloud_network" "main" {
-  name     = "catalyst-${replace(var.sovereign_fqdn, ".", "-")}-net"
-  ip_range = "10.0.0.0/16"
-  labels = {
-    "catalyst.openova.io/sovereign" = var.sovereign_fqdn
+  # Per-region network zone. The primary region reads var.region; each
+  # secondary reads its cloudRegion from local.secondary_regions. The
+  # network zone is just a Hetzner placement hint for the subnet; the
+  # subnet CIDR (10.0.1.0/24) is identical across regions because they
+  # live in DIFFERENT networks.
+  region_network_zones = merge(
+    {
+      primary = lookup(local.hetzner_network_zones, var.region, "eu-central")
+    },
+    {
+      for k, r in local.secondary_regions :
+      k => lookup(local.hetzner_network_zones, r.cloudRegion, "eu-central")
+    }
+  )
+
+  # Per-region k3s cluster-cidr / service-cidr — must NOT collide across
+  # ClusterMesh peers, otherwise inter-region pod-to-pod packets (DoD
+  # gate D11) double-DNAT inside Cilium. Each region gets its own /16
+  # off two non-overlapping /12-sized ranges (16 x /16 each):
+  #   - cluster (pod) CIDR: 10.42+i.0/16 (10.42.0.0/16 .. 10.57.0.0/16 — 16 regions)
+  #   - service CIDR:        10.96+i.0/16 (10.96.0.0/16 .. 10.111.0.0/16)
+  # Index 0 = primary; secondaries follow in lexical key order (how OpenTofu
+  # iterates the local.secondary_regions map). Threaded into the k3s install
+  # line via --cluster-cidr= / --service-cidr= (cloudinit-control-plane.tftpl).
+  region_index = {
+    for i, k in local.all_region_keys : k => i
+  }
+  region_cluster_cidr = {
+    for k, _ in local.region_index :
+    k => format("10.%d.0.0/16", 42 + local.region_index[k])
+  }
+  region_service_cidr = {
+    for k, _ in local.region_index :
+    k => format("10.%d.0.0/16", 96 + local.region_index[k])
  }
 }

-resource "hcloud_network_subnet" "main" {
-  network_id   = hcloud_network.main.id
-  type         = "cloud"
-  network_zone = local.network_zone
-  ip_range     = "10.0.1.0/24"
+resource "hcloud_network" "region" {
+  for_each = toset(local.all_region_keys)
+
+  name     = "catalyst-${replace(var.sovereign_fqdn, ".", "-")}-${each.key}-net"
+  ip_range = "10.0.0.0/16"
+  labels = {
+    "catalyst.openova.io/sovereign"  = var.sovereign_fqdn
+    "catalyst.openova.io/region-key" = each.key
+  }
 }

-# ── Firewall: 80/443 + 6443 + ICMP open; 22 only when ssh_allowed_cidrs set ─
+resource "hcloud_network_subnet" "region" {
+  for_each = toset(local.all_region_keys)
+
+  network_id   = hcloud_network.region[each.key].id
+  type         = "cloud"
+  network_zone = local.region_network_zones[each.key]
+  # Same /24 inside every region's /16. Each subnet sits in its OWN
+  # hcloud_network, so addresses don't collide across regions. CP at .2,
+  # workers at .10+, LB pinned at .254 — uniform across regions.
+  ip_range = "10.0.1.0/24"
+}
+
+# ── Firewall: 80/443 + 6443 + ICMP + DMZ-WG 51871 open; 22 only when ssh_allowed_cidrs set ─

 resource "hcloud_firewall" "main" {
  name = "catalyst-${replace(var.sovereign_fqdn, ".", "-")}-fw"
@@ -80,6 +169,27 @@ resource "hcloud_firewall" "main" {
    source_ips = ["0.0.0.0/0", "::/0"]
  }

+  # Cilium WireGuard inter-region node encryption (DMZ-WG). Per DoD A2
+  # (docs/SOVEREIGN-MULTI-REGION-DOD.md), inter-region traffic flows
+  # EXCLUSIVELY over Cilium WireGuard on the DMZ vCluster's public IPs,
+  # NEVER over Hetzner's internal network. Cilium's default WG port is
+  # UDP 51871 (config: `encryption.wireguard.userspaceFallback=false`
+  # + `encryption.type=wireguard` in bp-cilium). Without this rule, the
+  # WG mesh between regions cannot form on a fresh provision and DoD
+  # gate D11 (inter-region pod-to-pod packet test) fails immediately.
+  # Open to the world because each region's CP/worker public IP rotates
+  # at provision time and the catalyst-api does not know the public IP
+  # of sister-region peers ahead of time — Cilium's node-discovery
+  # auth + WG static-key crypto is the actual security boundary, not
+  # the firewall source filter.
+  rule {
+    direction   = "in"
+    protocol    = "udp"
+    port        = "51871"
+    source_ips  = ["0.0.0.0/0", "::/0"]
+    description = "Cilium WireGuard inter-region node encryption (DMZ-WG)"
+  }
+
  # SSH (22) is intentionally NOT open to the world. When ssh_allowed_cidrs is
  # set, we add a narrow rule for those operators only; otherwise the rule is
  # omitted entirely and break-glass is via Hetzner Console (out-of-band).
@@ -227,21 +337,6 @@ locals {
    if i > 0 && r.provider == "hetzner"
  }

-  # Per-secondary-region subnet CIDR. The legacy singular subnet uses
-  # 10.0.1.0/24 (cp at .2, workers at .10+). Secondary regions allocate
-  # 10.0.<10+index>.0/24 so the slash-16 hcloud_network can hold up to
-  # 245 secondary subnets without collision. Subnet allocation is
-  # deterministic on the region's index in var.regions[] — re-running
-  # `tofu apply` after an intra-list reorder would shift subnets, so
-  # the catalyst-api MUST keep regions[] order stable across re-applies
-  # of the same deployment (it does — the wizard's StepProvider emits
-  # regions in the operator's selection order and provisioner.go never
-  # reorders).
-  secondary_region_subnets = {
-    for k, r in local.secondary_regions :
-    k => format("10.0.%d.0/24", 10 + index(keys(local.secondary_regions), k))
-  }
-
  # Per-secondary-region Cilium ClusterMesh peer anchors (#1101 EPIC-6).
  # Auto-derive cluster.name as `<sovereign-stem>-<region-code-no-digits>`
  # (e.g. omantel + hel1 -> omantel-hel) when the operator left
@@ -274,12 +369,14 @@ locals {
    )
  }

-  # Per-secondary-region first-IP for control plane. Mirrors the legacy
-  # singular path's "10.0.1.2" — first usable host of the subnet. Workers
-  # in the same region count up from .10 within their own subnet.
+  # Per-secondary-region first-IP for control plane. Every region now has
+  # its OWN hcloud_network with its OWN 10.0.1.0/24 subnet — so every
+  # secondary CP sits at 10.0.1.2, the same as the primary CP. This was
+  # cidrhost(local.secondary_region_subnets[k], 2) under the old shared-
+  # /16 design; with per-region networks the subnet is uniform.
  secondary_region_cp_ips = {
    for k, _ in local.secondary_regions :
-    k => cidrhost(local.secondary_region_subnets[k], 2)
+    k => "10.0.1.2"
  }

  # GHCR pull token + the dockerconfigjson `auth` field, computed once here
@@ -354,24 +451,32 @@ locals {
  # below — any future bloat that pushes user_data ≥ 30 KiB fails at plan-time.
  control_plane_cloud_init = replace(templatefile("${path.module}/cloudinit-control-plane.tftpl", {
    # Primary CP's stable private IP — first allocatable host in the
-    # primary subnet (10.0.1.2 for the canonical 10.0.1.0/24). Used by
+    # primary subnet (10.0.1.2 in the canonical 10.0.1.0/24). Used by
    # the bp-cilium HelmRelease's CILIUM_K8S_SERVICE_HOST substitute
    # so cilium-operator on the primary cluster reaches its OWN local
-    # CP (matching CA), not a different region's CP. Secondary CPs
-    # render `cidrhost(secondary_region_subnets[k], 2)` for the same
-    # var — main.tf:267 secondary_region_cp_ips.
-    cp_private_ip           = "10.0.1.2"
-    sovereign_fqdn          = var.sovereign_fqdn
-    sovereign_subdomain     = var.sovereign_subdomain
+    # CP (matching CA), not a different region's CP. Secondary CPs also
+    # render 10.0.1.2 since every region has its OWN /24 (see
+    # local.secondary_region_cp_ips above — the per-region network refactor
+    # made every CP uniform on 10.0.1.2 within its own subnet).
+    cp_private_ip = "10.0.1.2"
+    # Per-region k3s pod/service CIDRs (DoD gate D11 — no collision across
+    # ClusterMesh peers). Primary uses region_cluster_cidr["primary"]
+    # (= 10.42.0.0/16) and region_service_cidr["primary"] (= 10.96.0.0/16).
+    # Threaded into the k3s install line as --cluster-cidr= / --service-cidr=
+    # in cloudinit-control-plane.tftpl.
+    cluster_cidr        = local.region_cluster_cidr["primary"]
+    service_cidr        = local.region_service_cidr["primary"]
+    sovereign_fqdn      = var.sovereign_fqdn
+    sovereign_subdomain = var.sovereign_subdomain
    # OpenovaFlow integration (Agent #3, PR #1389/#1390 follow-up). The
    # bp-openova-flow-emitter (bootstrap-kit slot 57) reads SOVEREIGN_
    # DEPLOYMENT_ID + SOVEREIGN_REGION_KEY from the bootstrap-kit
    # Kustomization's postBuild.substitute env. Primary CP renders
    # var.region as the region key; secondary CPs render each.key from
    # the for_each loop in local.secondary_region_cloud_init.
-    sovereign_deployment_id = var.sovereign_deployment_id
-    sovereign_region_key    = var.region
-    marketplace_enabled     = var.marketplace_enabled
+    sovereign_deployment_id   = var.sovereign_deployment_id
+    sovereign_region_key      = var.region
+    marketplace_enabled       = var.marketplace_enabled
    qa_fixtures_enabled       = var.qa_fixtures_enabled
    qa_test_session_enabled   = var.qa_test_session_enabled
    qa_fixtures_namespace     = var.qa_fixtures_namespace
@@ -501,8 +606,11 @@ locals {
    # agent join silently fails, and the autoscaler times out the
    # scale-up after 15m → backoff. Names are the Phase-0 resource names
    # verbatim — the autoscaler resolves them via the Hetzner API at
-    # scale-up time.
-    hcloud_network_name  = hcloud_network.main.name
+    # scale-up time. Primary CP points at the primary region's per-region
+    # network so autoscaler-spawned workers join the primary region's k3s
+    # (which is reachable on the local 10.0.1.2). Secondary CPs render
+    # their own region's network name (see local.secondary_region_cloud_init).
+    hcloud_network_name  = hcloud_network.region["primary"].name
    hcloud_firewall_name = hcloud_firewall.main.name
    hcloud_ssh_key_name  = hcloud_ssh_key.main.name

@@ -532,7 +640,7 @@ resource "hcloud_server" "control_plane" {
  user_data    = local.control_plane_cloud_init

  network {
-    network_id = hcloud_network.main.id
+    network_id = hcloud_network.region["primary"].id
    ip         = "10.0.1.${count.index + 2}" # cp1=10.0.1.2, cp2=10.0.1.3, cp3=10.0.1.4
  }

@@ -563,7 +671,7 @@ resource "hcloud_server" "control_plane" {
    }
  }

-  depends_on = [hcloud_network_subnet.main]
+  depends_on = [hcloud_network_subnet.region]
 }

 # ── Workers: variable count ───────────────────────────────────────────────
@@ -582,7 +690,7 @@ resource "hcloud_server" "worker" {
  user_data    = local.worker_cloud_init

  network {
-    network_id = hcloud_network.main.id
+    network_id = hcloud_network.region["primary"].id
    ip         = "10.0.1.${count.index + 10}" # workers start at .10
  }

@@ -621,16 +729,18 @@ resource "hcloud_load_balancer" "main" {

 resource "hcloud_load_balancer_network" "main" {
  load_balancer_id = hcloud_load_balancer.main.id
-  network_id       = hcloud_network.main.id
+  network_id       = hcloud_network.region["primary"].id
  # Fix #182: pin LB private IP to top-of-subnet so it cannot race the
  # CP server's explicit `ip = "10.0.1.2"` during parallel apply. Without
  # this, Hetzner auto-allocates the first free IP in the matching-zone
-  # subnet — in multi-region prov #32 the secondary LB attached at t+16s
-  # took 10.0.1.2 from main subnet (only eu-central subnet existing then),
-  # causing CP creation to FATAL with `inlineAttachServerToNetwork: IP
-  # not available`. .254 is the last usable host in /24 and is reserved
-  # platform-wide for LB anchors.
+  # subnet. .254 is the last usable host in /24 and is reserved
+  # platform-wide for LB anchors. After the per-region network refactor
+  # each region has its OWN /24 inside its OWN hcloud_network, so the
+  # cross-region IP collision class (prov #32 root cause) is gone by
+  # construction, but the .254 pin still guards intra-region CP/LB races.
  ip = "10.0.1.254"
+
+  depends_on = [hcloud_network_subnet.region]
 }

 resource "hcloud_load_balancer_target" "control_plane" {
@@ -809,50 +919,39 @@ resource "aws_s3_bucket_acl" "main" {
  acl    = "private"
 }

-# ── Multi-region overlay (slice G1, EPIC-0 #1095) ─────────────────────────
+# ── Multi-region overlay (slice G1 → DMZ-WG refactor 2026-05-15) ──────────
 #
 # Realises every var.regions[1+] entry as a parallel set of Hetzner
-# resources keyed off local.secondary_regions. Slice G3 wires Cilium
-# ClusterMesh between these and the primary region; this slice only
-# provisions the cloud substrate so the network + compute exist for G3
-# to connect.
+# resources keyed off local.secondary_regions. Cilium ClusterMesh joins
+# the regions over the public DMZ WireGuard endpoint (UDP 51871) — this
+# module only provisions the cloud substrate.
 #
-# Architectural decision (slice G1): hybrid singular-path-plus-overlay,
-# NOT a wholesale refactor of every existing resource into a `for_each`.
-# The hybrid is purely additive — every new resource address below
-# (`hcloud_network_subnet.secondary`, `hcloud_server.secondary_control_plane`,
-# `hcloud_load_balancer.secondary`, …) is keyed on `for_each =
-# local.secondary_regions` and therefore shares NO address space with
-# the legacy singular resources above. Existing Sovereign state that
-# was provisioned with var.regions = [] or len(var.regions) == 1
-# carries no entries in `local.secondary_regions` (the iteration filter
-# `if i > 0` excludes regions[0]; regions[0] is owned by the singular
-# path) and thus produces a no-op plan diff for the entire overlay
-# block. No `tofu state mv` runbook is required for any pre-G1 state.
+# Architectural decision (2026-05-15, founder ruling, see
+# docs/SOVEREIGN-MULTI-REGION-DOD.md): no shared private network across
+# regions. Each region gets its OWN hcloud_network + its OWN /24 subnet
+# (declared at the TOP of this file under `hcloud_network.region` and
+# `hcloud_network_subnet.region`, both keyed `for_each = toset(
+# local.all_region_keys)` so they cover the "primary" key plus every
+# secondary key). Inter-region pod-to-pod traffic flows EXCLUSIVELY
+# over Cilium WireGuard on each region's public IP through the DMZ
+# vCluster — provider-agnostic (works the same way for an AWS or
+# Huawei secondary region, A6) and zero-trust against the provider's
+# internal network fabric.
 #
-# When a new Sovereign request body carries len(regions) ≥ 2 (the
-# EPIC-6 mgmt + fsn + hel shape per docs/EPICS-1-6-unified-design.md
-# §3.8 + §11), the for_each fires and creates one network subnet, one
-# CP server, N worker servers, and one LB per secondary region — all
-# wrapped under the same hcloud_network.main and hcloud_firewall.main
-# the primary region uses (Hetzner Networks span multiple network
-# zones; one Network + one Firewall per Sovereign is the canonical
-# tenant boundary).
-
-# Per-secondary-region subnet — separate /24 inside the shared /16
-# hcloud_network.main. Sub-zoning is allocated deterministically off
-# the region's index in var.regions[] (see local.secondary_region_subnets
-# above). Subnets in different network_zones inside the same Network
-# are supported by Hetzner; cross-zone routing is what Cilium ClusterMesh
-# (slice G3) consumes.
-resource "hcloud_network_subnet" "secondary" {
-  for_each = local.secondary_regions
-
-  network_id   = hcloud_network.main.id
-  type         = "cloud"
-  network_zone = lookup(local.hetzner_network_zones, each.value.cloudRegion, "eu-central")
-  ip_range     = local.secondary_region_subnets[each.key]
-}
+# This supersedes the slice-G1 design, which carved per-region `/24`s
+# out of one shared `/16` network and was explicitly rejected (DoD A2
+# trigger phrase: "Hetzner private net spans zones, let me use that
+# for cross-region" → STOP). It is replaced with one Network per
+# region, each carrying an identical 10.0.1.0/24 subnet. The addresses
+# cannot collide because the networks are isolated from each other and
+# no private route exists between them.
+#
+# Resource-address impact: every pre-2026-05-15 Sovereign state will
+# plan a network recreate on the next apply. By founder directive (DoD
+# cycle protocol: every wipe-and-create cycle is a fresh provision),
+# the state-migration cost is consciously accepted. NO `tofu state mv`
+# runbook ships with this PR; pre-2026-05-15 state is destroyed and
+# reprovisioned cleanly.

 # Per-secondary-region cloud-init — same template as the primary CP,
 # parameterised with the secondary region's LB IPv4 and CP private IP
@@ -872,22 +971,28 @@ locals {
  secondary_region_cloud_init = {
    for k, r in local.secondary_regions :
    k => replace(templatefile("${path.module}/cloudinit-control-plane.tftpl", {
-      # Per-region CP's stable private IP (first allocatable host in the
-      # secondary subnet — see main.tf local.secondary_region_cp_ips). The
-      # bp-cilium HelmRelease's CILIUM_K8S_SERVICE_HOST substitute uses this
-      # so cilium-operator on the secondary cluster reaches its OWN local CP
-      # (matching CA), not the primary region's CP across regions.
-      cp_private_ip           = local.secondary_region_cp_ips[k]
-      sovereign_fqdn          = var.sovereign_fqdn
-      sovereign_subdomain     = var.sovereign_subdomain
+      # Per-region CP's stable private IP. After the per-region network
+      # refactor (2026-05-15 DoD A2) every region has its OWN /24 inside
+      # its OWN hcloud_network, so every secondary CP also sits at
+      # 10.0.1.2, uniform with the primary. The bp-cilium HelmRelease's
+      # CILIUM_K8S_SERVICE_HOST substitute uses it so that cilium-operator
+      # on each cluster reaches its OWN local CP (matching CA).
+      cp_private_ip = local.secondary_region_cp_ips[k]
+      # Per-region k3s pod/service CIDRs (DoD gate D11). Each region gets
+      # its own /16 by region index i: 10.(42+i).0.0/16 for pods and
+      # 10.(96+i).0.0/16 for services, so ClusterMesh peers never overlap.
+      cluster_cidr        = local.region_cluster_cidr[k]
+      service_cidr        = local.region_service_cidr[k]
+      sovereign_fqdn      = var.sovereign_fqdn
+      sovereign_subdomain = var.sovereign_subdomain
      # OpenovaFlow integration (Agent #3). The secondary CP's region
      # key is each.key from the secondary_regions for_each (e.g. "hel1"
      # for a Helsinki secondary). Multi-region Sovereigns thus emit
      # distinct region tags on FlowNodes, which the canvas groups into
      # per-region super-bubbles via `contains` relationships.
-      sovereign_deployment_id = var.sovereign_deployment_id
-      sovereign_region_key    = k
-      marketplace_enabled     = var.marketplace_enabled
+      sovereign_deployment_id   = var.sovereign_deployment_id
+      sovereign_region_key      = k
+      marketplace_enabled       = var.marketplace_enabled
      qa_fixtures_enabled       = var.qa_fixtures_enabled
      qa_test_session_enabled   = var.qa_test_session_enabled
      qa_fixtures_namespace     = var.qa_fixtures_namespace
@@ -938,10 +1043,13 @@ locals {
      worker_cloud_init_b64      = base64encode(local.secondary_region_worker_cloud_init[k])

      # Issue #1778 (F7 multi-region completion) — same hcloud_*_name
-      # threading as the primary CP templatefile call (lines 483-485)
-      # so the secondary regions' cluster-autoscaler also has the
-      # private-network attachment names.
-      hcloud_network_name  = hcloud_network.main.name
+      # threading as the primary CP templatefile call so the secondary
+      # regions' cluster-autoscaler also has the private-network
+      # attachment names. Each secondary references its OWN region's
+      # network (per the 2026-05-15 DoD A2 per-region-network refactor)
+      # so autoscaler-spawned workers land in the same isolated /16 as
+      # the region's CP and reach k3s at 10.0.1.2 locally.
+      hcloud_network_name  = hcloud_network.region[k].name
      hcloud_firewall_name = hcloud_firewall.main.name
      hcloud_ssh_key_name  = hcloud_ssh_key.main.name

@@ -990,7 +1098,7 @@ resource "hcloud_server" "secondary_control_plane" {
  user_data    = local.secondary_region_cloud_init[each.key]

  network {
-    network_id = hcloud_network.main.id
+    network_id = hcloud_network.region[each.key].id
    ip         = local.secondary_region_cp_ips[each.key]
  }

@@ -1014,12 +1122,17 @@ resource "hcloud_server" "secondary_control_plane" {
    }
  }

-  depends_on = [hcloud_network_subnet.secondary]
+  depends_on = [hcloud_network_subnet.region]
 }

 # Per-secondary-region workers. Hetzner's `count` semantics inside a
 # `for_each` map require flattening — we expand the (region, worker-index)
 # product into a single map keyed on "{region-key}-w{index}".
+#
+# Worker private IPs are uniform across regions now that each region has
+# its own 10.0.1.0/24: workers count up from 10.0.1.10 in their region's
+# subnet, identical to the primary region's hcloud_server.worker layout
+# (`10.0.1.${count.index + 10}`).
 locals {
  secondary_workers = {
    for pair in flatten([
@@ -1029,7 +1142,7 @@ locals {
          region_key = k
          region     = r
          worker_idx = i
-          private_ip = cidrhost(local.secondary_region_subnets[k], 10 + i)
+          private_ip = "10.0.1.${10 + i}"
        }
      ]
    ]) :
@@ -1049,7 +1162,7 @@ resource "hcloud_server" "secondary_worker" {
  user_data    = local.secondary_region_worker_cloud_init[each.value.region_key]

  network {
-    network_id = hcloud_network.main.id
+    network_id = hcloud_network.region[each.value.region_key].id
    ip         = each.value.private_ip
  }

@@ -1095,14 +1208,15 @@ resource "hcloud_load_balancer_network" "secondary" {
  for_each = local.secondary_regions

  load_balancer_id = hcloud_load_balancer.secondary[each.key].id
-  network_id       = hcloud_network.main.id
-  # Fix #182: pin secondary LB to top-of-its-own-subnet (10.0.10.254,
-  # 10.0.11.254, ...) so multi-region apply cannot race for IPs across
-  # subnets sharing a network zone. See `hcloud_load_balancer_network.main`
-  # comment for full context. depends_on ensures the subnet exists before
-  # Hetzner is asked to allocate inside its CIDR.
-  ip         = cidrhost(local.secondary_region_subnets[each.key], 254)
-  depends_on = [hcloud_network_subnet.secondary]
+  network_id       = hcloud_network.region[each.key].id
+  # Fix #182: pin LB private IP to top-of-its-own-subnet (10.0.1.254) so
+  # an apply cannot race the CP's explicit `ip = "10.0.1.2"`. After the
+  # per-region network refactor (DoD A2) every region has its OWN /24
+  # inside its OWN hcloud_network, so the cross-region IP collision
+  # class (prov #32 root cause) is gone by construction — but the .254
+  # pin still guards intra-region CP/LB races at apply time.
+  ip         = "10.0.1.254"
+  depends_on = [hcloud_network_subnet.region]
 }

 resource "hcloud_load_balancer_target" "secondary_control_plane" {
--- a/infra/hetzner/tests/multi_region.tftest.hcl
+++ b/infra/hetzner/tests/multi_region.tftest.hcl
@@ -167,6 +167,37 @@ run "legacy_no_regions_payload" {
    condition     = length(output.load_balancer_ips_by_region) == 0
    error_message = "load_balancer_ips_by_region must be empty when var.regions=[]."
  }
+
+  # Per-region network refactor: even with NO secondary regions, the
+  # primary region's hcloud_network.region["primary"] must exist. The
+  # legacy `hcloud_network.main` and `hcloud_network_subnet.main`
+  # singletons have been deleted; their job is now done by the
+  # for_each map keyed on local.all_region_keys.
+  assert {
+    condition     = length(hcloud_network.region) == 1
+    error_message = "Single-region (var.regions=[]) must still produce exactly 1 hcloud_network keyed 'primary' (the legacy hcloud_network.main was retired)."
+  }
+
+  assert {
+    condition     = contains(keys(hcloud_network.region), "primary")
+    error_message = "hcloud_network.region must contain the key 'primary' (for_each iterates local.all_region_keys)."
+  }
+
+  assert {
+    condition     = hcloud_network_subnet.region["primary"].ip_range == "10.0.1.0/24"
+    error_message = "Primary subnet must be 10.0.1.0/24 — uniform layout across regions."
+  }
+
+  # Firewall must include the Cilium WG inter-region rule (UDP 51871).
+  # DoD A2 (docs/SOVEREIGN-MULTI-REGION-DOD.md) — without this, the
+  # WireGuard mesh between regions cannot form and gate D11 fails.
+  assert {
+    condition = length([
+      for r in hcloud_firewall.main.rule :
+      r if r.direction == "in" && r.protocol == "udp" && r.port == "51871"
+    ]) == 1
+    error_message = "hcloud_firewall.main must declare exactly 1 inbound rule for UDP 51871 (Cilium WireGuard inter-region encryption per DoD A2)."
+  }
 }

 # ── Scenario 2: single-entry regions[] ────────────────────────────────────
@@ -248,6 +279,64 @@ run "three_region_mgmt_fsn_hel" {
    condition     = contains(output.secondary_region_keys, "hel1-2")
    error_message = "secondary_region_keys must contain the hel1-2 key for regions[2] (Helsinki data plane)."
  }
+
+  # Per-region network refactor (2026-05-15 DoD A2) — one hcloud_network
+  # per region, NO shared private net across regions. Verify the
+  # for_each map declares one Network for "primary" + one for each
+  # secondary region, all on the same 10.0.0.0/16 (the ranges live in
+  # ISOLATED networks, so the overlap is intentional and harmless).
+  assert {
+    condition     = length(hcloud_network.region) == 3
+    error_message = "Three-region payload must produce exactly 3 hcloud_network entries (primary + 2 secondaries) — one isolated /16 per region per DoD A2."
+  }
+
+  assert {
+    condition     = hcloud_network.region["primary"].ip_range == "10.0.0.0/16"
+    error_message = "Each region's hcloud_network must be 10.0.0.0/16 (identical inside isolated networks)."
+  }
+
+  assert {
+    condition     = hcloud_network.region["fsn1-1"].ip_range == "10.0.0.0/16"
+    error_message = "Secondary region's hcloud_network must be 10.0.0.0/16 (same range as primary inside its OWN isolated network)."
+  }
+
+  assert {
+    condition     = length(hcloud_network_subnet.region) == 3
+    error_message = "Three-region payload must produce exactly 3 hcloud_network_subnet entries — one /24 per region's isolated /16."
+  }
+
+  assert {
+    condition     = hcloud_network_subnet.region["hel1-2"].ip_range == "10.0.1.0/24"
+    error_message = "Each region's subnet must be 10.0.1.0/24 (uniform CP=.2, workers=.10+, LB=.254 layout)."
+  }
+
+  # Per-region pod/service CIDRs (DoD gate D11 — no collision across
+  # ClusterMesh peers). Verify primary, fsn1-1, hel1-2 get distinct
+  # cluster-cidrs (10.{42,43,44}.0.0/16) + service-cidrs (10.{96,97,98}.0.0/16).
+  assert {
+    condition     = local.region_cluster_cidr["primary"] == "10.42.0.0/16"
+    error_message = "primary region's cluster-cidr must be 10.42.0.0/16 (index 0)."
+  }
+
+  assert {
+    condition     = local.region_cluster_cidr["fsn1-1"] == "10.43.0.0/16"
+    error_message = "fsn1-1 (secondary index 0 → region index 1) must get cluster-cidr 10.43.0.0/16."
+  }
+
+  assert {
+    condition     = local.region_cluster_cidr["hel1-2"] == "10.44.0.0/16"
+    error_message = "hel1-2 (secondary index 1 → region index 2) must get cluster-cidr 10.44.0.0/16."
+  }
+
+  assert {
+    condition     = local.region_service_cidr["primary"] == "10.96.0.0/16"
+    error_message = "primary region's service-cidr must be 10.96.0.0/16 (index 0)."
+  }
+
+  assert {
+    condition     = local.region_service_cidr["hel1-2"] == "10.98.0.0/16"
+    error_message = "hel1-2 (region index 2) must get service-cidr 10.98.0.0/16 — non-overlapping across peers."
+  }
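+
+  # Suggested extra check (an assumption, not part of the original PR):
+  # the service-cidr assertions above cover only "primary" and "hel1-2".
+  # If local.region_service_cidr follows the same per-index pattern as
+  # the cluster-cidrs, fsn1-1 (region index 1) would be expected to get
+  # 10.97.0.0/16. A sketch of the missing middle assertion:
+  assert {
+    condition     = local.region_service_cidr["fsn1-1"] == "10.97.0.0/16"
+    error_message = "fsn1-1 (region index 1) must get service-cidr 10.97.0.0/16."
+  }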
 }

 # ── Scenario 4: same-region duplicate ────────────────────────────────────