BSS/OSS Academy
πŸ”§
Section 11.6

Cross-Domain Composition: Worked Examples

Four worked examples that compose across the domains established earlier: Enterprise SD-WAN (MPLS L3VPN + CPE + IPAM + optional 5G backup), 5G enterprise URLLC slice (RAN + transport + core SLA composition), FTTH residential broadband (OLT + BNG + IPAM + CPE + workforce), and Enterprise Ethernet (L2VPN/EVPN + NID + activation testing TMF634). Each example shows the dependency graph, execution sequence, rollback scope, and assurance feedback path. Cross-cutting lessons: IPAM first, capacity-as-reservation, saga rollback (not transaction), activation testing as a gate, assurance armed at activation, workforce as a first-class actor.

The previous four sections established the layering (11.1), the universal integration pattern (11.2), and three domain integrations (11.3 IP/MPLS, 11.4 mobile core, 11.5 fixed access). This section shows the composition β€” how a real service order traverses multiple domains at once, with sequencing, rollback, and assurance feedback woven across them. Composition is where multi-domain orchestration earns its name; it is also where most orchestration architectures discover the gaps in their domain abstractions.

Each worked example below shows: (a) the dependency graph β€” which domain produces what for which other domain; (b) the execution sequence β€” what must happen in what order; (c) the rollback scope β€” what unwinds if a step fails; and (d) the assurance feedback β€” how runtime SLA observations close the loop. The examples are deliberately simplified for pedagogical clarity; production orchestration of these services involves more idempotency checks, more compensating actions, and more vendor-specific intermediaries than shown.

Worked Example 1: Enterprise SD-WAN

Service: Enterprise SD-WAN with MPLS L3VPN underlay, managed CPE at each site, public IP per site, and optional 5G FWA backup link for resilience. Domains touched: IP/MPLS (underlay VPN), fixed access or 5G FWA (last-mile), CPE management, IPAM, mobile core (for 5G backup).

Dependency graph β€” SD-WAN service

Producer domainOutputConsumer domain
IPAMPer-site WAN IPs (IPv4 + IPv6 prefix)IP/MPLS (VRF), CPE, mobile core (5G backup APN)
IP/MPLSL3VPN VRF on PE routers, route-targets, transport classCPE (SD-WAN overlay endpoints)
CPE managementCPE config, overlay tunnel keys, monitoringCustomer CPE (via TR-369 USP)
Mobile core (optional)5G FWA APN with backup-class QoS, SIM provisioningCPE secondary uplink
InventoryService instance with all sub-RFS items linkedAssurance, billing, customer portal

Execution sequence β€” SD-WAN order to active service

1
COM/SOM receives the order
COM β†’ SOM

Enterprise orders "SD-WAN Premium, 5 sites, 100 Mbps each, 5G backup". COM decomposes into per-site CFS instances. SOM decomposes each CFS into RFS items: L3VPN underlay, CPE config, public-IP allocation, optional 5G backup.

2
XDO requests IPAM allocations
XDO β†’ IPAM

Cross-domain orchestrator allocates per-site WAN IP and IPv6 prefix from IPAM as the first concrete step. Allocations are reserved (not yet committed to network). If any IPAM allocation fails, the service is rejected before any network change.

3
IP/MPLS domain provisions L3VPN
XDO β†’ IP/MPLS controller

XDO sends an L3NM-style request to NSO/NSP. The controller computes the PE template, configures VRFs, BGP-VPN sessions, route-targets, transport class. Returns service-instance state. Idempotent: replay produces the same VRF.

4
Mobile core (if 5G backup) provisions APN
XDO β†’ 5GC NFs

XDO calls 5GC SBI (Nudm/Nsmf/Npcf) to create the backup APN/DNN, bind QoS profile, register the FWA SIM. UPF capacity is checked; new pods scaled if needed via MANO.

5
CPE configured via TR-369 USP
XDO β†’ CPE management β†’ CPE

CPE management pushes overlay tunnel keys, primary-link IP, optional secondary-link APN credentials, monitoring profile. CPE comes online; first-byte test succeeds; TMF634 service-test result captured.

6
Service instance committed; assurance armed
XDO β†’ Inventory + Assurance

XDO commits the service instance to inventory. Assurance arms SLA monitors: latency budget per site, link availability, failover-to-5G test cadence. Closed-loop policies set: alert on link down, auto-failover within 30s.

Rollback scope
Failure at step 3 unwinds IPAM reservations. Failure at step 4 unwinds the L3VPN AND the IPAM reservations β€” this is a saga, not a transaction. Failure at step 5 unwinds 5GC config, L3VPN, and IPAM. Each compensating action is itself idempotent; each may itself fail (CPE unreachable, 5GC throttled), forcing manual ops escalation. Production designs assume at least one rollback per 100 orders fails midway.
Assurance feedback
Site-level link-down events from BNG / 5GC trigger automatic failover at the CPE. Persistent failures escalate via the assurance product to a service-level KPI breach, which can re-enter the orchestrator (auto-replan, scale 5G backup capacity, dispatch field tech). Without this loop, every degradation becomes a manual ticket.

Worked Example 2: 5G Enterprise Slice (URLLC)

Service: URLLC slice for an industrial customer β€” 10 ms end-to-end latency, 99.999% availability, isolated user plane, dedicated UPF capacity. Domains touched: 5GC, RAN (via O-RAN/SMO), transport (IP/MPLS with strict-class), assurance. The hard part is end-to-end SLA composition across three independent domains.

Dependency graph β€” 5G enterprise slice

Producer domainOutputConsumer domain
Slice catalog (CSMF)NEST-aligned slice template; S-NSSAI assigned5GC, RAN, transport
Transport (IP/MPLS)SR transport tunnel with latency budget commit5GC user plane (UPF anchoring)
5GC (NSMF + NSSMF-core)Slice-aware AMF/SMF/UPF, NSSF policy, dedicated UPF instanceRAN, subscribers
RAN (NSSMF-RAN)Slice-aware scheduler weights, PRB partitioning per cellAir interface
AssurancePer-domain KPIs aggregated into slice-SLA dashboardClosed-loop orchestration

Execution sequence β€” slice order to operational slice

1
Slice request received via CSMF
CSMF β†’ NSMF

Enterprise (or vertical platform) requests a NEST-template slice. CSMF translates the customer-visible NEST into network-side requirements and hands a slice-instantiation request to NSMF.

2
XDO computes per-domain slice budgets
NSMF (XDO)

NSMF (cross-domain orchestrator) decomposes the 10 ms end-to-end target into RAN budget (~3 ms), transport budget (~3 ms), core budget (~4 ms). Capacity check: do RAN cells, transport paths, and UPF capacity exist? If not, scale or reject.

3
Transport slice provisioned
XDO β†’ IP/MPLS controller

XDO sends an IETF slice-ng / SR-TE policy request to the IP/MPLS controller, which computes a strict-latency SR path between RAN sites and the UPF anchor. Reservation only β€” not yet committed.

4
Core slice realised β€” UPF scaled if needed
NSSMF-core β†’ MANO + 5GC

NSSMF-core requests dedicated UPF capacity. NFVO + VNFM scale a new UPF instance; new pods register with NRF; SMF policies are bound to the slice S-NSSAI. AMF and PCF are configured for slice-aware admission and policy.

5
RAN slice configured
NSSMF-RAN β†’ RAN

NSSMF-RAN configures slice-aware scheduler weights and PRB partitioning on the relevant gNBs. This is a multi-cell, often multi-vendor operation; vendor RAN orchestration (e.g. SMO in O-RAN) executes the per-cell config.

6
End-to-end test + slice live
XDO β†’ Inventory + Assurance

Slice-level service test (3GPP TS 28.554 KPIs) executed: round-trip latency, packet loss, availability under load. Slice instance committed to inventory; assurance arms slice KPIs with closed-loop scaling and replanning policies.

Why this is the hardest worked example
Every other worked example involves a single SLA target (bandwidth, availability) at a single domain boundary. The 5G slice has distributed SLA composition β€” the 10 ms end-to-end target is met by independent budgets across RAN, transport, and core, each managed by a different team or vendor, each evolving on a different cadence. The orchestrator owns the budget allocation and the renegotiation logic when one domain reports it cannot meet its share.
Rollback scope and assurance feedback
Failure at step 4 (core scaling) unwinds the transport slice reservation and slice template. The orchestrator must not commit transport before core capacity is confirmed, because rolling back SR paths under live traffic forces convergence events that violate the slice SLA β€” a classic "rollback caused the outage" failure mode. Assurance feeds slice KPI breaches back into the orchestrator: scale UPF, reroute transport, or refuse new attaches. Without this loop, slicing degrades silently.

Worked Example 3: FTTH Residential Broadband

Service: 1 Gbps FTTH residential broadband with VoIP, public IPv4 (CGNAT) + IPv6 prefix, and a managed Wi-Fi mesh CPE. Domains touched: fixed access (OLT/ONT/BNG), CPE management, IPAM, billing/CRM, and field workforce for the truck roll. This is the most volume-driven worked example β€” Tier-1 operators run hundreds of thousands of these per year.

Dependency graph β€” FTTH residential service

Producer domainOutputConsumer domain
Workforce / dispatchFibre installation, ONT placement, customer presentOLT (line ready), CPE (ready to be plugged)
Access controllerOLT line config, ONT registration, BNG subscriber profileCPE, AAA, IPAM
IPAMIPv4 (CGNAT) + IPv6 prefix leaseBNG, CPE
CPE managementCPE config push, Wi-Fi profile, mesh node provisioningCustomer CPE
Billing / CRMService active event β†’ billing start; customer portal updatedRevenue assurance

Execution sequence β€” FTTH order to active service

1
Order captured; feasibility checked
COM

Customer orders FTTH 1 Gbps via retail channel. COM checks feasibility against access network inventory: is there a serviceable OLT? capacity on the right line card? available ONT in stock? If yes, order proceeds.

2
Truck roll scheduled
COM β†’ Workforce

Workforce dispatches a field tech to install the fibre and ONT at the customer premise on a chosen appointment. Until media is in place, no network config can succeed.

3
IPAM and BNG provisioned in advance of customer present
XDO β†’ IPAM + AAA

XDO allocates the IPAM lease (CGNAT IPv4 + /56 IPv6 prefix) and creates the BNG subscriber profile in AAA. These can complete before the truck roll lands; they cost nothing until the CPE comes online.

4
On site: ONT registered + line activated
Access controller + Field tech

Field tech connects the fibre, plugs in the ONT. Access controller registers the ONT against the OLT line, applies the service profile, completes line activation. The CPE is plugged in; first DHCP/IPoE event hits the BNG.

5
CPE configured via TR-369 USP
CPE management β†’ CPE

CPE registers with the ACS; CPE management pushes Wi-Fi SSID/password from customer order, mesh-node configuration, IPv6 prefix delegation, voice (VoIP) line config. Customer's primary device gets internet within seconds.

6
Service active; billing starts
XDO β†’ Billing + Assurance

Service-active event published. Billing starts, customer portal updates, customer-care notes the activation. Assurance arms: link health, Wi-Fi quality KPIs, NPS-relevant metrics.

The classic FTTH failure mode
Activation logic that reverses steps 3 and 4 β€” registering the ONT before the BNG profile and IPAM lease exist β€” is the single most common Tier-1 production incident in fixed access. The customer's line is "lit" (link up) but the BNG drops the session because there is no profile, or the IPAM has nothing to lease. The customer sees no service; the field tech is dispatched to a working line; activation SLA is missed. Catalog dependency modelling prevents this; ad-hoc workflow scripting cannot.

Worked Example 4: Enterprise Ethernet (E-Line / E-LAN)

Service: Symmetric 10 Gbps Ethernet between two enterprise sites, with strict CIR/EIR, MEF-compliant SLA, and end-to-end activation testing. Domains touched: IP/MPLS (L2VPN/EVPN), fibre/transport, CPE (NID at each site), workforce, assurance.

Dependency graph β€” Enterprise Ethernet service

Producer domainOutputConsumer domain
Optical / transportWavelength or sub-Ξ» between PoPs (if not already in place)IP/MPLS (PE-PE link)
IP/MPLSL2VPN/EVPN or VPLS instance, MAC-VRF, MTU policyCPE NID, customer
CPE / NIDPer-site network interface device with policer, monitoringCustomer Ethernet handoff
WorkforceOn-site CPE/NID install, fibre patchActivation testing
Activation testing (TMF634)Y.1731 / RFC 2544 resultsService-active gate

Execution sequence β€” Enterprise Ethernet activation

1
Order with feasibility against PoP capacity
CPQ β†’ COM

CPQ + COM produces a feasibility-checked order: are both sites reachable from PoPs? does optical capacity exist? if not, plan optical augmentation as a dependency.

2
Optical reservation (if needed)
XDO β†’ Optical controller

XDO reserves wavelength/sub-Ξ» between the two PoPs via the optical controller. If no capacity, the workflow pauses for a separate transport build β€” composition does not bypass capacity reality.

3
L2VPN / EVPN provisioned on PEs
XDO β†’ IP/MPLS controller

IP/MPLS controller configures EVPN MAC-VRF or VPLS instance on both PEs, with route-targets, MTU, CIR/EIR policer. Idempotent service-model push via NETCONF/YANG (L2NM-aligned).

4
NID install + service test
Workforce + Activation testing

Workforce installs NID at each site, patches fibre. Activation test (RFC 2544 throughput, Y.1731 OAM, latency check) executes. TMF634 test report captured. If test fails, escalate to NOC before customer handoff.

5
Customer handoff and service active
XDO β†’ Billing + Assurance

Customer accepts handoff; service marked active in inventory. Billing starts. Assurance arms: SLA monitoring against MEF KPIs, monthly availability reporting, breach-to-credit logic.

Activation testing as a first-class step
Enterprise Ethernet differs from residential broadband in that the activation test is the gate for service-active, not an afterthought. TMF634 service test is part of the order workflow β€” a failed test pauses the activation, not the service. Operators that bolt activation testing on later, instead of integrating it into the orchestrator, end up with services declared "active" in BSS while customers experience packet loss in production.

Cross-Cutting Lessons from Composition

Patterns that recur across all four examples

PatternHow it appearsWhy it matters
IPAM allocation earlyReserve IP / prefix before any network changeCheapest step to fail; prevents partial activation if IP unavailable
Capacity check before commitOptical capacity, UPF capacity, OLT line β€” checked as reservation, committed as part of sagaAvoids partial commits that take live traffic down before failure is even known
Saga rollback, not transactionEach domain commit is rolled back via compensating action; some compensations may failProduction designs must plan for rollback failure, not assume rollback always succeeds
Activation testing as a gateTest result blocks "active" state; test failure escalates rather than progressingOtherwise the BSS sees services as live when the network is delivering nothing
Assurance loop registered at activation, not laterKPIs and closed-loop policies armed when the service is committedOtherwise service operates uninstrumented for weeks; SLA violations go undetected
Workforce as a first-class actorTruck rolls and field-tech handoff modelled in the workflow with explicit ready/blocked statesPhysical reality dominates timing; the orchestrator must wait, not assume

Section 11.6 Key Takeaways

  • Composition is where multi-domain orchestration earns its name. The four worked examples β€” SD-WAN, 5G slice, FTTH, Enterprise Ethernet β€” show how dependency graphs, sequencing, rollback scope, and assurance feedback compose across domains.
  • IPAM allocation is the universal first concrete step. It costs nothing to roll back, and it gates the rest of the saga.
  • Capacity is checked as a reservation, then committed as part of the saga. Bypassing the reservation step turns capacity exhaustion into a partial activation, which is harder to recover from than a clean rejection.
  • Cross-domain rollback is a saga of compensating actions, not a transaction. Some compensations fail. Production designs assume at least one rollback per N orders fails midway.
  • Activation testing (TMF634) is a gate for service-active state on enterprise services. Without it, BSS sees a live service while the network delivers nothing.
  • Assurance loops must be armed at activation. Services that go live without assurance instrumentation drift unnoticed until the customer escalates.
  • Workforce is a first-class actor. Truck rolls, fibre installs, CPE shipping all dominate timing and must be modelled with explicit ready/blocked states β€” not assumed away.
  • The 5G slice is the hardest worked example because it requires distributed SLA composition across RAN, transport, and core. Every other example has a simpler SLA shape; this one trains the architectural muscle for the next decade of telco services.