The CATS OAM hierarchical model leverages the principles of Maintenance Domain (MD) levels, as defined in IEEE 802.1ag and ITU-T Y.1731, to extend traditional connectivity and performance management into the computing domain. This framework enables CATS-Forwarders and underlying network nodes to perform integrated anomaly detection and performance monitoring.¶
Based on the scope of awareness and functional granularity, the CATS OAM mechanisms are organized into four distinct layers: Link OAM, Path OAM, Instance OAM, and Service OAM. The architecture is illustrated in Figure 1.¶
+------+ +--+--------+ +---+----+ +--------+--+ +--------+
|client+-+ CATS- +----+underlay+---+ CATS- +-+service |
| | |Forwarder 1| | node | |Forwarder 2| |instance|
+------+ +-----------+ +--------+ +-----------+ +----+---+
o-------------------- Service OAM -------------------o
o---- Instance OAM ----o
o------------- Path OAM -----------o
o----o o----o o----o Link OAM
Figure 1: CATS OAM Layering Model
¶
-
Link OAM: This layer is directed at the detection of physical links or single-hop IP interfaces, representing the foundational underlay infrastructure in CATS framework [I-D.ldbc-cats-framework]. It encompasses both the internal links within the operator's infrastructure and the external interfaces interconnecting the network and computing domains. The primary objective is to monitor the operational status between two adjacent devices to ensure fundamental IP reachability. The existing detection tools include IEEE 802.3ah, ICMP [RFC4443], and single-hop BFD [RFC5881].¶
-
Path OAM: This layer focuses on the monitoring of transport paths between an ingress CATS-Forwarder and an egress CATS-Forwarder in CATS framework [I-D.ldbc-cats-framework]. As an aggregate transport path typically carries multiple services, Path OAM is critical for ensuring that network-layer faults or resource contention from traffic multiplexing do not degrade specific service performance. Fault detection and performance monitoring are executed at the Label Switched Path (LSP) or Segment Routing (SR) path layer [RFC8402] to facilitate rapid service protection. The existing detection mechanisms include ITU-T Y.1711, MPLS Loss and Delay Measurement (LM-DM) [RFC6374], and BFD for LSP.¶
-
Instance OAM: This layer is dedicated to the monitoring of status of computing resource and the operational health of service instances in CATS framework [I-D.ldbc-cats-framework]. The metrics collected at this layer, such as CPU load and memory availability, are defined in [I-D.ietf-cats-metric-definition]. Monitoring mechanisms MUST support flexible implementation modes, including a "Push" mode, where computing nodes report dynamic load status in real-time, and a "Pull" mode, where the egress CATS-Forwarder proactively retrieves computing metrics by extending the existing OAM mechanisms.¶
-
Service OAM: This layer provides either end-to-end or segmented visibility into the connectivity and performance between the Ingress CATS-Forwarder and the target Service Instance (SI), as defined in the CATS framework [I-D.ldbc-cats-framework]. Beyond basic reachability verification, SVC-OAM is responsible for monitoring service-specific liveness, transaction success rates, and application-layer latency to ensure consistent end-to-end Quality of Experience (QoE). The supported detection mechanisms include, but are not limited to, the Two-Way Active Measurement Protocol (TWAMP) [RFC5357], application-aware probing, and HTTP-based health checks, with extensibility to accommodate emerging application-specific requirements.¶