IPsec and IKE anti-replay sequence number subspaces for multi-path tunnels and multi-core processing

Internet-Draft	IPsec anti-replay subspaces	October 2022
Ponchon, et al.	Expires 27 April 2023

Abstract

This document discusses the challenges of running IPsec with anti-replay in environments where packets may be re-ordered (e.g., when sent over multiple IP paths, traffic-engineered paths and/or using different QoS classes) as well as when processed on multiple cores. Different approaches to solving this problem are discussed, and a new solution based on splitting the anti-replay sequence number space into multiple sequencing subspaces is proposed. Since this solution requires support from both peers, an IKE extension is proposed in order to negotiate the use of the anti-replay sequence number subspaces.

2. Problem Statement

A common approach to leverage multiple paths between IPsec peers is to allocate one Child SA per path. However, given the quadratic number of paths that may exist between peers ('number of uplinks of peer 1' x 'number of uplinks of peer 2'), this approach raises scalability issues in both IKE and IPsec implementations:

Increased number of IKE negotiations and re-key operations.
Increased IKE memory usage.
Data-plane performance degradation due to the use of a larger number of keys.
Reduced number of peers that the data plane can serve, due to a hard limit on the number of supported Child SAs.

On the other hand, using a single Child SA per peer on multiple paths comes with challenges related to the anti-replay mechanism. Packets traveling on different paths may arrive out of order. A packet that travels on a slower network path than the others may arrive after the anti-replay window has moved past its sequence number. The receiver can then no longer check whether the packet is a replay, and drops it.

The same problem may also be observed when multiple QoS policies are used: packets may be re-ordered, and packets from a lower-priority queue could arrive too late, compared to the others, for the anti-replay mechanism to function properly.

Finally, anti-replay implementations also suffer from performance issues when multiple threads are involved in sending or receiving encrypted packets for the same Child SA. This is discussed in [I-D.pwouters-ipsecme-multi-sa-performance], which mainly focuses on high-throughput IPsec tunnels. The problem also arises with small tunnels, however, since multiple inner flows processed by multiple threads often need to be transmitted on the same tunnel, which forces those threads to access shared resources.

2.1. Solution Space

The solution detailed in [I-D.pwouters-ipsecme-multi-sa-performance] proposes to use one Child SA per core, and could be extended to provide a solution involving one SA per path as well. But allocating one SA per path, as well as per core, would further multiply the number of Child SAs. For example, with 6 paths between two peers (one peer has 2 uplinks, the other has 3), and 4 cores to process the IPsec packets, the number of Child SAs would be multiplied by 24. This would in turn divide by 24 the number of peers that an IPsec concentrator supports.

Alternatively, a single Child SA per peer could be used with a very large anti-replay window (e.g., 128k bits), in order to mitigate the risk of packets being dropped when packets are sent on multiple paths. But this solution has some serious drawbacks:

The window size has to be estimated from the expected throughput and the delay difference between the best and the worst path, but the best value depends on many factors, and hence cannot be reliably predicted.
The window size can get very large, even more so as the throughput increases.
Large window sizes cause performance degradation and scalability limitations by increasing the amount of memory accessed by the data-plane.
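As a rough illustration of the first two drawbacks, the window needed to absorb re-ordering can be approximated as the packet rate multiplied by the delay difference between the slowest and the fastest path. The sketch below is a back-of-the-envelope estimate under that assumption. The function name and the example figures (10 Gbps, 1500-byte packets, 20 ms of delay difference) are illustrative and are not taken from this document.

```python
import math


def estimate_window_packets(throughput_bps, packet_bytes, delay_diff_s):
    """Approximate how many packets can overtake one delayed packet.

    Assumption: the tunnel runs at a constant rate with fixed-size packets,
    so the required window is simply rate x delay difference.
    """
    packets_per_second = throughput_bps / (packet_bytes * 8)
    return math.ceil(packets_per_second * delay_diff_s)


# Illustrative figures only: 10 Gbps, 1500-byte packets, 20 ms skew.
window_10g = estimate_window_packets(10e9, 1500, 0.020)
# Doubling the throughput doubles the estimate.
window_20g = estimate_window_packets(20e9, 1500, 0.020)
print(window_10g, window_20g)
```

The estimate grows linearly with both the throughput and the delay difference, which is why a value that is safe on one deployment may be far too small, or needlessly large, on another.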

4. Multiple sequence number subspaces

Both using a single Child SA on multiple paths and processing the packets of a single Child SA on multiple cores suffer from limitations due to the anti-replay mechanism.

As a result, this section describes a solution which modifies the anti-replay mechanism by allowing the 32-bit or 64-bit (with ESN) anti-replay sequence number space to be split into multiple subspaces. Each path, or core, or combination of both, can then use its own unique anti-replay sequence number subspace. The changes needed to the ESP header and IPsec protocol are described in Section 4.1, Section 4.2 and Section 4.3.

Since this specification requires both IPsec peers to implement it, an IKE extension is presented in Section 4.5, allowing peers to coordinate on the use, or not, of this specification.

4.1. Sequence number subspace encoding in IPsec

The 32-bit sequence number field of the ESP header is split into two sub-fields:

The high-order 8 bits contain the new sequence number subspace ID.
The low-order 24 bits continue to be used as the explicit part of the sequence number.

  0                   1                   2                   3
   0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
  +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
  |               Security Parameters Index (SPI)                 |
  +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
  |  Subspace ID  |                Sequence Number                |
  +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
  |                    Rest of the ESP payload
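
The layout above can be sketched as follows. This is a minimal illustration that packs and parses only the first 8 bytes of the ESP header (SPI followed by the split sequence number field) in network byte order. The function and constant names are hypothetical and are not defined by this document.

```python
import struct

SUBSPACE_BITS = 8
SEQ_BITS = 24
SEQ_MASK = (1 << SEQ_BITS) - 1


def pack_esp_header(spi, subspace_id, seq_counter):
    """Build the SPI and the split sequence number field (8 bytes total)."""
    if not 0 <= subspace_id < (1 << SUBSPACE_BITS):
        raise ValueError("subspace ID must fit in 8 bits")
    # Only the low-order 24 bits of the counter are carried in the packet.
    field = (subspace_id << SEQ_BITS) | (seq_counter & SEQ_MASK)
    return struct.pack("!II", spi, field)


def unpack_esp_header(header):
    """Return (spi, subspace_id, explicit 24-bit sequence number)."""
    spi, field = struct.unpack("!II", header[:8])
    return spi, field >> SEQ_BITS, field & SEQ_MASK


header = pack_esp_header(0x11223344, subspace_id=3, seq_counter=0x01000005)
print(header.hex(), unpack_esp_header(header))
```

Note that a counter value above 24 bits is truncated on the wire: in the example, only the value 5 is transmitted next to subspace ID 3.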

4.2. Sender Behavior

The sender MAY set the sequence number subspace ID to any value. For example, the sender could use a different value per path or per processing core.

The sender MUST maintain one sequence number counter per sequence number subspace that it makes use of. The sender MAY, however, use only some (and as few as a single one) of the available 256 subspaces.

When transmitting a packet, the sender MUST use the sequence number counter associated with the sequence number subspace in use for that packet. The low-order 24 bits of the sequence number counter are placed in the sequence number field, as specified in [RFC4303].
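
A minimal sketch of this sender behavior is given below, assuming that counters are created lazily for the subspaces actually used and that, as in [RFC4303], each counter starts at 0 so that the first packet carries the value 1. The class and method names are hypothetical.

```python
class SubspaceSender:
    """Keeps one independent sequence number counter per subspace in use."""

    def __init__(self):
        # subspace ID -> last counter value used (created on first use)
        self._counters = {}

    def next_sequence(self, subspace_id):
        """Return (full counter, explicit 24-bit value) for the next packet."""
        if not 0 <= subspace_id <= 255:
            raise ValueError("subspace ID must fit in 8 bits")
        counter = self._counters.get(subspace_id, 0) + 1
        self._counters[subspace_id] = counter
        return counter, counter & 0xFFFFFF


sender = SubspaceSender()
first_on_path_0 = sender.next_sequence(0)
second_on_path_0 = sender.next_sequence(0)
first_on_path_1 = sender.next_sequence(1)
print(first_on_path_0, second_on_path_0, first_on_path_1)
```

Because the counters are independent, the same sequence number legitimately appears once in each subspace.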

4.3. Receiver Behavior

The receiver MUST maintain one anti-replay window and counter for each sequence number subspace being used.

When receiving a packet, the receiver MUST use the anti-replay window and counter associated with the sequence number subspace identified by the subspace ID field.

Note: Since the sender may decide to only use a subset of the available subspace values, receivers SHOULD NOT allocate 256 anti-replay windows per peer by default. Two mitigation mechanisms may be used to reduce the number of anti-replay windows:

The receiver SHOULD limit the number of allocated anti-replay windows to the number of subspaces negotiated during the IKE Child SA creation exchange, as specified in Section 4.5.
The receiver MAY reactively allocate an anti-replay window when receiving the first packet for a given subspace, since the sender may decide to not use all of the available values. When doing so, the receiver SHOULD first check the authenticity of the packet before allocating the new anti-replay window.
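
The receiver behavior and both mitigation mechanisms can be sketched as follows. This is an illustration under several assumptions: sequence numbers are handled as already-reconstructed integers, the window is a simple bitmap, and the authenticity check is passed in as a callable so that it only runs when needed. All class, method and parameter names are hypothetical.

```python
class ReplayWindow:
    """Bitmap anti-replay window; bit i set means (top - i) was received."""

    def __init__(self, size=64):
        self.size = size
        self.top = 0
        self.bitmap = 0

    def check(self, seq):
        """Return True if seq (assumed >= 1) is neither too old nor a replay."""
        if seq > self.top:
            return True
        offset = self.top - seq
        if offset >= self.size:
            return False
        return not (self.bitmap >> offset) & 1

    def update(self, seq):
        if seq > self.top:
            shift = seq - self.top
            self.bitmap = ((self.bitmap << shift) | 1) & ((1 << self.size) - 1)
            self.top = seq
        else:
            self.bitmap |= 1 << (self.top - seq)


class SubspaceReceiver:
    def __init__(self, max_subspaces, window_size=64):
        self.max_subspaces = max_subspaces  # assumed to come from negotiation
        self.window_size = window_size
        self.windows = {}  # subspace ID -> ReplayWindow, allocated lazily

    def receive(self, subspace_id, seq, is_authentic):
        """Return True if the packet is accepted."""
        if seq < 1:
            return False
        window = self.windows.get(subspace_id)
        if window is not None and not window.check(seq):
            return False
        if not is_authentic():
            return False
        if window is None:
            # Allocate only after authentication, and within the limit.
            if len(self.windows) >= self.max_subspaces:
                return False
            window = self.windows[subspace_id] = ReplayWindow(self.window_size)
        window.update(seq)
        return True
```

With this structure, an unauthenticated packet can neither advance an existing window nor cause a new one to be allocated.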

4.4. Extended Sequence Numbers (ESN) considerations

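When using Extended Sequence Numbers (ESNs), the sender and receiver MUST treat bits 24 to 31 of the 64-bit sequence number counter as implicit, in the same way as the high-order 32 bits: they are maintained locally by both peers but are not transmitted, since their position in the ESP header is occupied by the subspace ID.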
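
This document does not spell out how the receiver recovers the implicit bits. One plausible approach, sketched below, adapts the procedure of Appendix A of [RFC4303] to 24 explicit bits: the receiver picks the smallest counter value that is not below the bottom of the anti-replay window and whose low-order 24 bits match the received value. The function name and parameters are hypothetical, and the sketch covers a single subspace.

```python
EXPLICIT_BITS = 24
MODULUS = 1 << EXPLICIT_BITS


def infer_counter(low24, top, window_size):
    """Guess the full counter of a packet from its explicit 24 bits.

    top is the highest counter value authenticated so far in this subspace,
    window_size is the anti-replay window size in packets.
    """
    bottom = max(top - window_size + 1, 0)
    # Start from the 2^24-aligned block that contains the window bottom.
    candidate = (bottom - bottom % MODULUS) + low24
    if candidate < bottom:
        # The value lies before the window in this block, so it must
        # belong to the next block (the explicit part has wrapped).
        candidate += MODULUS
    return candidate


# The window straddles a wrap of the explicit part.
late_packet = infer_counter(0xFFFFF0, top=0x01000005, window_size=64)
new_packet = infer_counter(0x000006, top=0x01000005, window_size=64)
print(hex(late_packet), hex(new_packet))
```

As with regular ESN, a wrong guess is only harmful if it goes undetected, so an implementation would presumably validate the inferred value as part of the integrity check before updating the window.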

Note that the reduction of the sequence number space to 24 bits specified in this document affects how quickly the explicit part of the sequence number wraps around. At a 10 Gbps rate, with 1500-byte Ethernet frames, the 24-bit sequence number space wraps approximately every 20 seconds.

As a result:

The peers SHOULD use ESN, unless the tunnel is known to run at low rates.
Any outage lasting longer than about 20 seconds might cause a resync (as defined in Appendix A3 of [RFC4303]).
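
The 20-second figure can be checked with a short calculation, assuming a constant rate and fixed-size packets. The function name is illustrative.

```python
def wrap_time_seconds(seq_bits, throughput_bps, packet_bytes):
    """Time needed to send 2^seq_bits packets at the given rate."""
    return (2 ** seq_bits) * packet_bytes * 8 / throughput_bps


# 24 explicit bits, 10 Gbps, 1500-byte frames.
wrap_24 = wrap_time_seconds(24, 10e9, 1500)
# Same link with the original 32 explicit bits, for comparison.
wrap_32 = wrap_time_seconds(32, 10e9, 1500)
print(round(wrap_24, 1), round(wrap_32 / 60, 1))
```

Under the same assumptions, the unmodified 32-bit explicit sequence number would take roughly 86 minutes to wrap, which shows how much the 8-bit subspace ID shortens the period.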

Note: A peer may unilaterally enforce the use of ESN by including the IKE ESN transform with the value "1" (meaning that the peer proposes to use ESN) in its CHILD_SA proposal, while omitting the IKE ESN transform with the value "0" (meaning that disabling ESN is not accepted).

Authors' note: Is this too much of a problem? Should we consider adding a new 32-bit field to the ESP header to contain the subspace ID, as well as future ESP extensions?

4.5. Negotiating sequence-number subspaces using IKE

A mechanism will be needed to:

Make sure both parties agree to use sequence-number subspaces when creating a Child SA.
Let the sender express the number of subspaces it prefers to use, as well as the maximum number of subspaces it is capable of using.
Let the receiver decide on the number of subspaces to be used by the sender, between zero (meaning that subspaces are not used) and the maximum number of subspaces that the sender supports.

This document will specify such a negotiation.

The authors would like to ask the ipsecme working group for input on how best to implement the negotiation of this new functionality in IKEv2 [RFC5996].

4.6. Solution Analysis

As described in Section 2, anti-replay comes with implementation and scalability challenges when running in environments where IPsec peers may leverage multiple paths to send packets or multiple cores to process them.

Since the anti-replay mechanism seems to be the root cause of these challenges, this document provides a solution which relies on a small and optional change at the anti-replay level.

By using sequence number subspaces, IPsec peers may:

use different subspaces for different cores, which allows distributing a Child SA between cores to increase performance;
use different subspaces for different QoS classes or different paths, which avoids unwanted drops due to potential re-ordering of packets, either at egress or in flight;
combine the above per-path and per-core approaches without multiplying the number of required Child SAs.

6. Implementation Considerations

When a single sequence number space is used within a given Child SA, encryption and decryption operations must always happen on the same core (locking anti-replay structures or using contended atomic operations incurs a severe performance penalty).

On reception, this requires packets which are received (and load-balanced to cores) to often be handed off to a different thread for processing.
On transmission, multiple flows, processed by different cores, need to be transmitted using the same Child SA. This requires the packets to be handed off to the thread in charge of the given Child SA.

To avoid the performance degradation caused by packet handoffs, each thread may use its own sequence number subspace:

On transmission, the core will always select the subspace it is assigned when generating the ESP header.
On reception, the subspace ID could be used to load-balance the packets to their proper thread.
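
A minimal sketch of this per-thread scheme is given below. It assumes that each transmitting thread simply uses its own index as subspace ID, and that the receiver maps subspace IDs onto its threads with a modulo. Both choices are illustrative and local to each peer, and the function names are hypothetical.

```python
SEQ_BITS = 24


def tx_subspace_for_thread(thread_index):
    """Transmit side: each thread owns one subspace, so no handoff is needed."""
    if not 0 <= thread_index <= 255:
        raise ValueError("at most 256 subspaces are available")
    return thread_index


def rx_thread_for_packet(seq_field, num_rx_threads):
    """Receive side: steer on the subspace ID read from the ESP header."""
    subspace_id = (seq_field >> SEQ_BITS) & 0xFF
    return subspace_id % num_rx_threads


# A packet sent by transmit thread 5 with explicit sequence number 42.
field = (tx_subspace_for_thread(5) << SEQ_BITS) | 42
print(rx_thread_for_packet(field, num_rx_threads=4))
```

All packets of a given subspace land on the same receive thread, so each anti-replay window is only ever touched by one thread.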

Similarly, when multiple paths are used:

On transmission, a different sequence number subspace is used for each packet path, ensuring that out-of-order packets are not dropped by the anti-replay mechanism.
On reception, 5-tuple-based packet steering would provide a decent level of load-balancing between threads, since different IP paths would use different 5-tuples.

If a combination of both multi-path and multi-core load-balancing is needed, the subspace field could be used partly to encode a path ID and partly to encode a core ID. This is purely implementation-specific and does not require coordination between the peers.
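
One possible split of the subspace field is sketched below, using 3 bits for the path ID and 5 bits for the core ID. This particular 3/5 split and the function names are assumptions chosen for illustration; any other local convention would work equally well.

```python
PATH_BITS = 3  # up to 8 paths (illustrative choice)
CORE_BITS = 5  # up to 32 cores (illustrative choice)


def make_subspace_id(path_id, core_id):
    """Combine a path ID and a core ID into one 8-bit subspace ID."""
    if not 0 <= path_id < (1 << PATH_BITS):
        raise ValueError("path ID out of range")
    if not 0 <= core_id < (1 << CORE_BITS):
        raise ValueError("core ID out of range")
    return (path_id << CORE_BITS) | core_id


def split_subspace_id(subspace_id):
    """Inverse of make_subspace_id, useful to the sender only."""
    return subspace_id >> CORE_BITS, subspace_id & ((1 << CORE_BITS) - 1)


# The 6-path, 4-core example of Section 2.1 fits in a single Child SA.
ids = {make_subspace_id(p, c) for p in range(6) for c in range(4)}
print(len(ids), make_subspace_id(5, 3))
```

With this encoding, the 24 path and core combinations of the Section 2.1 example map to 24 distinct subspaces of one Child SA instead of 24 Child SAs.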

6.1. Initialization Vector (IV) Considerations

Depending on the cryptographic mode of operation, the Initialization Vector (IV) comes with specific requirements.

Some modes (e.g., CBC) make use of random IV values. When implementing this specification, each thread generates its own independent stream of random values, which preserves the IV randomness property. Care must be taken to limit the total number of packets transmitted using the same Child SA, across all threads, in order to avoid birthday attacks. A lockless counter, or a batched token bucket mechanism, may be used to implement this limit efficiently.
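
The batched approach can be sketched as follows: a global per-Child-SA budget is handed out to threads in batches, so that the shared state is touched once per batch rather than once per packet. In this illustration a lock stands in for the atomic fetch-and-subtract that a real data plane would presumably use. The class names, the batch size and the limit are hypothetical.

```python
import threading


class PacketBudget:
    """Global limit on packets for one Child SA, granted in batches."""

    def __init__(self, limit, batch=1024):
        self._remaining = limit
        self._batch = batch
        # Stand-in for an atomic operation on a shared counter.
        self._lock = threading.Lock()

    def take_batch(self):
        with self._lock:
            granted = min(self._batch, self._remaining)
            self._remaining -= granted
            return granted


class ThreadTokens:
    """Per-thread view: spends local tokens, refills from the global budget."""

    def __init__(self, budget):
        self._budget = budget
        self._tokens = 0

    def may_send(self):
        if self._tokens == 0:
            self._tokens = self._budget.take_batch()
            if self._tokens == 0:
                return False  # limit reached, the Child SA should be rekeyed
        self._tokens -= 1
        return True
```

A thread that stops sending may hold unused tokens, so the limit is enforced conservatively: the total number of packets sent never exceeds it, but may fall slightly short.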

Other cryptographic modes (e.g., GCM) do not have randomness requirements over the IV, but each IV value must be used only once. RFC4106 Section 3.1 states that "The most natural way to implement this is with a counter, but anything that guarantees uniqueness can be used, such as a linear feedback shift register (LFSR). Note that the encrypter can use any IV generation method that meets the uniqueness requirement, without coordinating with the decrypter." One simple way to implement this specification is to divide the IV into a subspace field, which reuses the ESP sequence number subspace value, and a variable IV part, which is simply incremented for each encrypted packet.
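
The IV split described above can be sketched as follows for the 8-byte IV used by AES-GCM in ESP. The sketch assumes a 1-byte subspace field followed by a 7-byte counter. This exact split, like the class name, is an illustrative choice and is not mandated by this document.

```python
IV_BYTES = 8
COUNTER_BYTES = IV_BYTES - 1
COUNTER_LIMIT = 1 << (8 * COUNTER_BYTES)


class SubspaceIvGenerator:
    """Per-subspace IV generator: subspace ID byte plus a local counter."""

    def __init__(self, subspace_id):
        if not 0 <= subspace_id <= 255:
            raise ValueError("subspace ID must fit in 8 bits")
        self._subspace_id = subspace_id
        self._next = 0

    def next_iv(self):
        if self._next >= COUNTER_LIMIT:
            # Never reuse an IV: the Child SA must be rekeyed first.
            raise OverflowError("IV counter exhausted")
        iv = bytes([self._subspace_id]) + self._next.to_bytes(COUNTER_BYTES, "big")
        self._next += 1
        return iv


gen_core_0 = SubspaceIvGenerator(0)
gen_core_1 = SubspaceIvGenerator(1)
ivs = [gen_core_0.next_iv(), gen_core_0.next_iv(), gen_core_1.next_iv()]
print([iv.hex() for iv in ivs])
```

Since the first byte differs between subspaces, two threads can never produce the same IV, even though each one increments its counter without any coordination.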

Authors' note: Are there other cryptographic modes with different requirements over the IV?
