| Internet-Draft | Export of RoCEv2 BTH Using IPFIX | February 2026 |
| Lin & Zhang | Expires 31 August 2026 | [Page] |
This document defines a new set of IP Flow Information Export (IPFIX) Information Elements (IEs) for exporting Base Transport Header (BTH) information for RDMA over Converged Ethernet version 2 (RoCEv2) traffic. These extensions enable network monitoring systems to collect and analyze the characteristics of RDMA traffic widely used in high-performance computing, storage, and artificial intelligence applications.¶
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.¶
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.¶
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."¶
This Internet-Draft will expire on 31 August 2026.¶
Copyright (c) 2026 IETF Trust and the persons identified as the document authors. All rights reserved.¶
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License.¶
Remote Direct Memory Access (RDMA) [RFC5040] is a network technology that allows a computer to read from or write to the memory of another computer directly, without involving the operating system. This zero-copy and kernel-bypass feature greatly reduces CPU overhead and communication latency. InfiniBand [IBTA-SPEC] and RDMA over Converged Ethernet (RoCE) are two mainstream RDMA implementations that bypass the operating system kernel and achieve zero-copy data transfer.¶
RoCE technology has become a key component of high-performance data center networks, especially in low-latency, high-throughput scenarios such as artificial intelligence training, distributed storage, and financial transactions. RDMA over Converged Ethernet version 2 (RoCEv2) runs on top of UDP (port 4791) and inherits the transport layer protocol of the InfiniBand Architecture (IBA).¶
The existing IPFIX [RFC7011] standard lacks the ability to monitor specific fields of RoCEv2, which limits the ability of network operators to perform in-depth analysis, troubleshooting, and performance optimization of RDMA traffic. To close this gap, this document defines a new set of Information Elements (IEs) to carry RoCEv2 BTH key fields.¶
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.¶
The following terms are used as defined in [RFC7011]:¶
The following terms are used in this document:¶
This section defines new IPFIX IEs for exporting RoCEv2 BTH key fields.¶
The exporter needs to parse RoCEv2 BTH information, which may affect the exporter's performance. Implementers SHOULD consider measures to mitigate this impact, such as sampling rate limiting or hardware acceleration.¶
The Security Considerations for IPFIX [RFC7011] apply to this document as well.¶
When exporting RDMA BTH information across security domains, to prevent information security risks caused by the leakage of sensitive network topology details such as QP numbers, it is recommended to implement comprehensive protection measures, such as using the encrypted transmission options of the IPFIX framework (such as DTLS [RFC9147]) to ensure the confidentiality and integrity of data during transmission.¶
To defend against Denial-of-Service (DOS) attacks that may be caused by maliciously crafted RoCEv2 packets and to prevent the exporter from being overloaded by processing a large amount of invalid traffic, it is recommended to adopt certain protection strategies, such as configuring reasonable rate limiting policies to prevent the monitoring system from being overwhelmed by massive data packets and to ensure its stable operation.¶
This document specifies new IPFIX IEs to enable export of RoCEv2 BTH key fields along with other flow information. This document requests IANA to add these IPFIX IEs to the "IPFIX Information Elements" registry available at [IANA-IPFIX].¶
Table 1 lists the new IPFIX IEs for RoCEv2 BTH:¶
+============+==========================+===============+ | Element ID | Name | Reference | +============+======================--==+===============+ | TBD1 | rdmaOpCode | This document | +------------+--------------------------+---------------+ | TBD2 | rdmaPartitionKey | This document | +------------+--------------------------+---------------+ | TBD3 | rdmaDestinationQP | This document | +------------+--------------------------+---------------+ | TBD4 | rdmaSourceQP | This document | +------------+--------------------------+---------------+ | TBD5 | rdmaPacketSequenceNumber | This document | +------------+--------------------------+---------------+ | TBD6 | rdmaBTHFlags1 | This document | +------------+--------------------------+---------------+ | TBD7 | rdmaBTHFlags2 | This document | +------------+--------------------------+---------------+ | TBD8 | rdmaBTHFlags3 | This document | +------------+--------------------------+---------------+ Table 1: New IEs in the "IPFIX Information Elements" Registry¶
The SE, M, Pad, and TVer fields of The RoCEv2 BTH.¶
The size of this Information Element is 1 octet.¶
0 1 2 3 4 5 6 7 +--+-+-+-+-+-+-+-+ |SE|M|Pad| TVer | +--+-+-+-+-+-+-+-+ Bits 0: Solicited Event (SE) field. Bits 1: Migration Request (M) field. Bits 2-3: Pad Count (Pad) field. Bits 4-7: Transport Header Version (TVer) field.¶
The F/R, and B/R fields of The RoCEv2 BTH.¶
The size of this Information Element is 1 octet.¶
0 1 2 3 4 5 6 7
+---+---+---+---+---+---+---+---+
|F/R|B/R| Resv |
+---+---+---+---+---+---+---+---+
Bits 0: Forward Explicit Congestion Notification (FECN)/Res1
(F/R) field.
Bits 1: Backward Explicit Congestion Notification (BECN)/Res1
(B/R) field.
Bits 2-7: Reserved field.
¶
The A field of The RoCEv2 BTH.¶
The size of this Information Element is 1 octet.¶
0 1 2 3 4 5 6 7 +-+-+-+-+-+-+-+-+ |A| Resv | +-+-+-+-+-+-+-+-+ Bits 0: Acknowledge Request (A) field. Bits 1-7: Reserved field.¶