Internet-Draft | URICrypt | September 2025 |
Denis | Expires 28 March 2026 | [Page] |
This document specifies URICrypt, a deterministic, prefix-preserving encryption scheme for Uniform Resource Identifiers (URIs). URICrypt encrypts URI paths while preserving their hierarchical structure, enabling systems that rely on URI prefix relationships to continue functioning with encrypted URIs. The scheme provides authenticated encryption for each URI path component, preventing tampering, reordering, or mixing of encrypted segments.¶
This note is to be removed before publishing as an RFC.¶
Source for this draft and an issue tracker can be found at https://github.com/jedisct1/draft-denis-uricrypt.¶
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.¶
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.¶
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."¶
This Internet-Draft will expire on 28 March 2026.¶
Copyright (c) 2025 IETF Trust and the persons identified as the document authors. All rights reserved.¶
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document.¶
This document specifies URICrypt, a method for encrypting Uniform Resource Identifiers (URIs) while preserving their hierarchical structure. The primary motivation is to enable systems that rely on URI prefix relationships for routing, filtering, or access control to continue functioning with encrypted URIs.¶
URICrypt achieves prefix preservation through a chained encryption model where the encryption of each URI component depends cryptographically on all preceding components. This ensures that URIs sharing common prefixes produce ciphertexts that also share common encrypted prefixes.¶
The scheme uses an extendable-output function (XOF) as its cryptographic primitive and provides authenticated encryption for each component, preventing tampering, reordering, or mixing of encrypted segments.¶
The main motivations include:¶
Access Control in CDNs: Content Delivery Networks often use URI prefixes for routing and access control. URICrypt allows encrypting resource paths while preserving the prefix structure needed for CDN operations.¶
Privacy-Preserving Logging: Systems can log encrypted URIs without exposing sensitive path information, while still enabling analysis based on URI structure.¶
Confidential Data Sharing: When sharing links to sensitive resources, URICrypt prevents the path structure itself from revealing confidential information.¶
Token-Based Access Systems: Systems that issue time-limited access tokens can use URICrypt to obfuscate the underlying resource location while maintaining routability.¶
Multi-Tenant Systems: In systems where multiple tenants share infrastructure, URICrypt can isolate tenant data while allowing shared components to be processed efficiently.¶
Privacy-Preserving Analytics: URICrypt can complement IPCrypt [I-D.draft-denis-ipcrypt]. Together, they enable systems to perform analytics on encrypted network flows and resource access patterns without exposing sensitive information about either the network endpoints or the specific resources being accessed.¶
The key words “MUST”, “MUST NOT”, “REQUIRED”, “SHALL”, “SHALL NOT”, “SHOULD”, “SHOULD NOT”, “RECOMMENDED”, “NOT RECOMMENDED”, “MAY”, and “OPTIONAL” in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.¶
Throughout this document, the following terms and conventions apply:¶
URI Component: A segment of a URI path, typically separated by ‘/’ characters. For encryption purposes, components include the trailing separator except for the final component.¶
Scheme: The URI scheme (e.g., “https://”) which is preserved in plaintext.¶
XOF: Extendable-Output Function, a hash function that can produce output of arbitrary length.¶
SIV: Synthetic Initialization Vector, a 16-byte value derived from the accumulated state of all previous components, used for authentication and as input to keystream generation.¶
Domain Separation: The practice of using distinct inputs to cryptographic functions to ensure outputs for different purposes are not compatible.¶
Prefix-Preserving Encryption: An encryption scheme where if two plaintexts share a common prefix, their corresponding ciphertexts also share a common (encrypted) prefix.¶
Chained Encryption: A mode where encryption of each component depends cryptographically on all preceding components.¶
This section describes how URIs are processed for encryption and decryption.¶
Before encryption, a URI must be split into its scheme and path components. The path is further divided into individual components for chained encryption.¶
For a full URI including a scheme:¶
Input: "https://example.com/a/b/c" Components: - Scheme: "https://" - Component 1: "example.com/" - Component 2: "a/" - Component 3: "b/" - Component 4: "c"¶
Note that all components except the last include the trailing ‘/’ character. This ensures proper reconstruction during decryption.¶
For URIs without a scheme:¶
Input: "/a/b/c" Components: - Scheme: "" (empty) - Component 1: "a/" - Component 2: "b/" - Component 3: "c"¶
The leading ‘/’ is not treated as a separate component but is implied during reconstruction.¶
During decryption, components are joined to reconstruct the original path:¶
Components: ["example.com/", "a/", "b/", "c"] Reconstructed Path: "example.com/a/b/c" When combined with the scheme: "https://example.com/a/b/c"¶
URICrypt uses three parallel TurboSHAKE128 [I-D.draft-irtf-cfrg-kangarootwelve] instances for different purposes, all initialized from the same base hasher.¶
The base hasher is initialized with the secret key and context parameter using length-prefixed encoding to prevent ambiguities.¶
Two hashers are derived from the base hasher:¶
Components Hasher: Updated with each component’s plaintext to generate SIVs¶
Base Keystream Hasher: Used as the starting point for generating keystream for each component¶
The initialization process:¶
base_hasher = TurboSHAKE128() base_hasher.update(len(secret_key)) base_hasher.update(secret_key) base_hasher.update(len(context)) base_hasher.update(context) components_hasher = base_hasher.clone() components_hasher.update("IV") base_keystream_hasher = base_hasher.clone() base_keystream_hasher.update("KS")¶
For each component, the encryption process is:¶
Update components_hasher
with the component plaintext¶
Generate SIV from components_hasher
(16 bytes)¶
Create keystream_hasher
by cloning base_keystream_hasher
and updating with SIV¶
Calculate padding needed for base64 encoding¶
Generate keystream of length (component_length + padding)
¶
XOR padded component with keystream¶
Output SIV concatenated with encrypted_component
¶
The padding length is calculated as:
padding_len = (3 - (16 + component_len) % 3) % 3
¶
For each encrypted component, the decryption process is:¶
Read SIV from input (16 bytes)¶
Create keystream_hasher
by cloning base_keystream_hasher
and updating with SIV¶
Read encrypted component data (length determined from encoding)¶
Generate keystream and decrypt component¶
Remove padding to recover plaintext¶
Update components_hasher
with plaintext¶
Generate expected SIV from components_hasher
¶
Compare expected SIV with received SIV (constant-time)¶
If mismatch, return error
¶
Any tampering with the encrypted data will cause the SIV comparison to fail.¶
Each encrypted component (SIV || ciphertext)
is padded to make its
length a multiple of 3 bytes, enabling clean base64 encoding without
padding characters.¶
The final output is encoded using URL-safe base64 [RFC4648] without padding.¶
This section provides the complete algorithms for encryption and decryption. The following functions and operations are used throughout the algorithms:¶
TurboSHAKE128()
: Creates a new TurboSHAKE128 hash instance with domain separation parameter 0x1F. This function produces an extensible output function (XOF) that can generate arbitrary-length outputs.¶
.update(data)
: Absorbs the provided data into the hash state. The data is processed sequentially and updates the internal state of the hash function.¶
.read(length)
: Squeezes the specified number of bytes from the hash function’s output. Each call continues from where the previous read left off, producing a continuous stream of pseudorandom bytes.¶
.clone()
: Creates a new hash instance with an identical internal state to the original. This enables multiple independent computation paths from the same initial state.¶
XOR operation: The bitwise exclusive OR operation between two byte sequences of equal length. This operation is used to combine plaintext with keystream for encryption, and ciphertext with keystream for decryption.¶
base64url_encode(data)
: Converts binary data to a base64 string using URL-safe encoding (replacing ‘+’ with ‘-‘ and ‘/’ with ‘_’) and omitting padding characters.¶
base64url_decode(string)
: Converts a URL-safe base64 string back to binary data, automatically handling the absence of padding characters.¶
Stream(data)
: Creates a sequential reader for binary data, enabling byte-by-byte or block-based access to the contents.¶
constant_time_compare(a, b)
: Compares two byte sequences in constant time, regardless of their contents. This prevents timing attacks by ensuring the comparison duration does not depend on which bytes differ.¶
len(data)
: Returns the length of the provided data in bytes.¶
concatenated with: The operation of joining two byte sequences end-to-end to form a single sequence.¶
zeros(count)
: Generates a sequence of zero-valued bytes of the specified length, used for padding.¶
remove_padding(data)
: Removes trailing zero bytes from a byte sequence to recover the original data length.¶
join(components)
: Combines multiple path components into a single path string using ‘/’ as the separator between components.¶
Input: secret_key, context, uri_string¶
Output: encrypted_uri¶
Steps:¶
Split URI into scheme and components¶
Initialize hashers as described in Section 4.1¶
encrypted_output = empty byte array
¶
For each component:¶
Update components_hasher with component
¶
SIV = components_hasher.read(16)
¶
keystream_hasher = base_keystream_hasher.clone()
¶
keystream_hasher.update(SIV)
¶
padding_len = (3 - (16 + len(component)) % 3) % 3
¶
keystream = keystream_hasher.read(len(component) + padding_len)
¶
padded_component = component concatenated with zeros(padding_len)
¶
encrypted_part = padded_component XOR keystream
¶
encrypted_output = encrypted_output concatenated with SIV concatenated with encrypted_part
¶
base64_output = base64url_encode(encrypted_output)
¶
Return scheme + base64_output
¶
Input: secret_key, context, encrypted_uri¶
Output: decrypted_uri or error¶
Steps:¶
Split encrypted URI into scheme and base64 part¶
decoded = base64url_decode(base64_part)
. If decoding fails, return error
¶
Initialize hashers as described in Section 4.1¶
decrypted_components = empty list
¶
input_stream = Stream(decoded)
¶
While input_stream is not empty:¶
SIV = input_stream.read(16)
. If not enough bytes, return error
¶
keystream_hasher = base_keystream_hasher.clone()
¶
keystream_hasher.update(SIV)
¶
Determine component length from remaining data and padding¶
encrypted_part = input_stream.read(component_length)
¶
keystream = keystream_hasher.read(len(encrypted_part))
¶
padded_plaintext = encrypted_part XOR keystream
¶
component = remove_padding(padded_plaintext)
¶
Update components_hasher
with component¶
expected_SIV = components_hasher.read(16)
¶
If constant_time_compare(SIV, expected_SIV) == false
: return error
¶
decrypted_components.append(component)
¶
decrypted_path = join(decrypted_components)
¶
Return scheme + decrypted_path
¶
Implementations MUST use TurboSHAKE128 with a domain separation
parameter of 0x1F
for all operations. The TurboSHAKE128 XOF is used
for:¶
Generating SIVs from the components hasher¶
Generating keystream for encryption/decryption¶
All hash operations in the initialization¶
TurboSHAKE128 is specified in [I-D.draft-irtf-cfrg-kangarootwelve] and provides the security properties needed for this construction.¶
The secret key MUST be at least 16 bytes long. Keys shorter than 16 bytes MUST be rejected. Implementations SHOULD validate that the key does not consist of repeated patterns (e.g., identical first and second halves) as a best practice.¶
The context parameter is a string that provides domain separation. Different applications SHOULD use different context strings to prevent cross-application attacks. The context string MAY be empty.¶
Both key and context are length-prefixed when absorbed into the base hasher:¶
base_hasher.update(len(secret_key) as uint8) base_hasher.update(secret_key) base_hasher.update(len(context) as uint8) base_hasher.update(context)¶
The length is encoded as a single byte, limiting keys and contexts to 255 bytes. This is sufficient for all practical use cases.¶
Implementations MUST NOT reveal the cause of decryption failures. All error conditions (invalid base64, incorrect padding, SIV mismatch, insufficient data) MUST result in identical, generic error messages.¶
SIV comparison MUST be performed in constant-time to prevent timing attacks.¶
URICrypt provides the following cryptographic security guarantees:¶
URICrypt achieves semantic security for URI path components through its use of TurboSHAKE128 as a pseudorandom function. Each component is encrypted using a unique keystream derived from:¶
The secret key¶
The application context¶
A synthetic initialization vector (SIV) that depends on all preceding components¶
This construction ensures that:¶
An attacker without the secret key cannot recover plaintext components from ciphertexts¶
The keystream generation is computationally indistinguishable from random for each unique (key, context, path-prefix) tuple¶
Components are protected by at least 128 bits of security against brute-force attacks¶
Each URI component is authenticated through the SIV mechanism:¶
The SIV acts as a Message Authentication Code (MAC) computed over the component and all preceding components¶
Any modification to a component will cause the SIV verification to fail during decryption¶
The chained construction ensures that reordering, insertion, or deletion of components is detected¶
Authentication provides 128-bit security against forgery attempts¶
URICrypt maintains a controlled information leakage pattern: - URIs sharing a common prefix will produce ciphertexts with the same encrypted prefix - This property is deterministic and intentional, enabling systems to perform prefix-based operations - The leakage is limited to prefix structure only - no information about non-matching suffixes is revealed¶
The context parameter provides cryptographic domain separation: - Different contexts with the same key produce completely independent ciphertexts - This prevents cross-context attacks where ciphertexts from one application could be used in another - Context binding is cryptographically enforced through the hasher initialization¶
URICrypt resists several categories of attacks:¶
Chosen-Plaintext Attacks (CPA): While deterministic, URICrypt is CPA-secure for unique inputs. The determinism is a design requirement for prefix preservation.¶
Tampering Detection: Any bit flip, truncation, or modification in the ciphertext will be detected with overwhelming probability (1 - 2-128).¶
Length Extension: The use of length-prefixed encoding and domain separation prevents length extension attacks.¶
Replay Attacks: Within a single (key, context) pair, replay is possible due to determinism. Applications requiring replay protection should incorporate timestamps or nonces in the context.¶
Key Recovery: TurboSHAKE128’s security properties ensure that observing ciphertexts does not leak information about the secret key.¶
The security of URICrypt is bounded by:¶
URICrypt makes specific security trade-offs for functionality:¶
Deterministic encryption: Same inputs produce same outputs, enabling certain traffic analysis¶
Length preservation: Component lengths are not hidden, potentially revealing information patterns¶
Prefix structure leakage: The hierarchical structure of URIs is preserved by design¶
These trade-offs are intentional and necessary for the prefix-preserving functionality. Applications requiring stronger privacy guarantees should evaluate whether URICrypt’s properties align with their threat model.¶
URICrypt provides confidentiality and integrity for URI paths while preserving prefix relationships. The security properties depend on:¶
Key Secrecy: The security of URICrypt depends entirely on the secrecy of the secret key. Keys MUST be generated using a cryptographically secure random number generator [RFC4086] and stored securely.¶
Deterministic Encryption: URICrypt is deterministic - identical inputs produce identical outputs. This allows observers to detect when the same URI is encrypted multiple times. Applications requiring unlinkability SHOULD incorporate additional entropy (e.g., via the context parameter).¶
Prefix Preservation: While essential for functionality, prefix preservation leaks information about URI structure. Systems where this information is sensitive SHOULD consider alternative approaches.¶
Context Separation: The context parameter prevents cross-context attacks. Applications MUST use distinct contexts for different purposes, even when sharing keys.¶
Component Authentication: Each component is authenticated via the SIV mechanism. Any modification, reordering, or truncation of components will be detected during decryption.¶
Length Leakage: The length of each component is preserved in the encrypted output. Applications sensitive to length information SHOULD consider padding components to fixed lengths.¶
Key Reuse: Using the same key with different contexts is safe, but using the same (key, context) pair for different applications is NOT RECOMMENDED.¶
This document has no actions for IANA.¶
function extract_components(uri_string): if uri_string contains "://": scheme = substring up to and including "://" path = substring after "://" else: scheme = "" path = uri_string if path starts with "/": path = substring after first "/" components = [] while path is not empty: slash_pos = find("/", path) if slash_pos found: component = substring(0, slash_pos + 1) path = substring(slash_pos + 1) components.append(component) else: components.append(path) path = "" return (scheme, components)¶
function initialize_hashers(secret_key, context): // Initialize base hasher base_hasher = TurboSHAKE128(0x1F) // Absorb key and context with length prefixes base_hasher.update(uint8(len(secret_key))) base_hasher.update(secret_key) base_hasher.update(uint8(len(context))) base_hasher.update(context) // Create components hasher components_hasher = base_hasher.clone() components_hasher.update("IV") // Create base keystream hasher base_keystream_hasher = base_hasher.clone() base_keystream_hasher.update("KS") return (components_hasher, base_keystream_hasher)¶
function uricrypt_encrypt(secret_key, context, uri_string): // Extract components (scheme, components) = extract_components(uri_string) // Initialize hashers with secret key and context (components_hasher, base_keystream_hasher) = initialize_hashers(secret_key, context) if error: return error encrypted_output = byte_array() // Process each component for component in components: // Update components hasher for SIV computation components_hasher.update(component) // Generate 16-byte Synthetic Initialization Vector (SIV) siv = components_hasher.squeeze(16) // Create keystream hasher for this component keystream_hasher = base_keystream_hasher.clone() keystream_hasher.update(siv) // Calculate padding for base64 encoding alignment component_len = len(component) padding_len = (3 - (16 + component_len) % 3) % 3 // Generate keystream keystream = keystream_hasher.squeeze(component_len + padding_len) // Pad component to align with base64 encoding requirements padded_component = component + byte_array(padding_len) // Encrypt using XOR with keystream encrypted_part = xor_bytes(padded_component, keystream) // Append to output encrypted_output.extend(siv) encrypted_output.extend(encrypted_part) // Base64 encode with URL-safe characters and no padding base64_output = base64_urlsafe_no_pad_encode(encrypted_output) // Return with scheme return scheme + base64_output¶
function uricrypt_decrypt(secret_key, context, encrypted_uri): // Split scheme and base64 if encrypted_uri contains "://": scheme = substring up to and including "://" base64_part = substring after "://" else: scheme = "" base64_part = encrypted_uri // Decode base64 try: decoded = base64_urlsafe_no_pad_decode(base64_part) catch: return error("Decryption failed") // Initialize hashers with secret key and context (components_hasher, base_keystream_hasher) = initialize_hashers(secret_key, context) if error: return error decrypted_components = [] input_stream = ByteStream(decoded) // Process each component while not input_stream.empty(): // Read SIV siv = input_stream.read(16) if len(siv) != 16: return error("Decryption failed") // Create keystream hasher keystream_hasher = base_keystream_hasher.clone() keystream_hasher.update(siv) // Determine component length by checking padding constraints remaining = input_stream.remaining() if remaining == 0: return error("Decryption failed") // Find valid component length by checking padding alignment component_data = None for possible_len in range(1, remaining + 1): total_len = 16 + possible_len padding_len = (3 - total_len % 3) % 3 if possible_len >= padding_len: component_data = input_stream.peek(possible_len) break if component_data is None: return error("Decryption failed") // Read encrypted data encrypted_part = input_stream.read(len(component_data)) // Generate keystream and decrypt keystream = keystream_hasher.squeeze(len(encrypted_part)) padded_plaintext = xor_bytes(encrypted_part, keystream) // Remove padding bytes added for base64 alignment padding_len = (3 - (16 + len(encrypted_part)) % 3) % 3 component = padded_plaintext[:-padding_len] if padding_len > 0 else padded_plaintext // Update hasher with plaintext components_hasher.update(component) // Generate expected SIV expected_siv = components_hasher.squeeze(16) // Authenticate using constant-time comparison to prevent timing attacks if not constant_time_equal(siv, expected_siv): return error("Decryption failed") decrypted_components.append(component) // Reconstruct URI if scheme and decrypted_components: path = "".join(decrypted_components) return scheme + path elif decrypted_components: return "/" + "".join(decrypted_components) else: return ""¶
function calculate_padding(component_len): // Calculate padding needed for base64 encoding alignment return (3 - (16 + component_len) % 3) % 3 function base64_urlsafe_no_pad_encode(data): // Use standard base64 encoding encoded = standard_base64_encode(data) // Make URL-safe and remove padding for URI compatibility encoded = encoded.replace('+', '-') .replace('/', '_') .rstrip('=') return encoded function base64_urlsafe_no_pad_decode(encoded): // Add padding if needed for standard decoder padding = (4 - len(encoded) % 4) % 4 if padding > 0: encoded = encoded + ('=' * padding) // Make standard base64 encoded = encoded.replace('-', '+') .replace('_', '/') // Decode return standard_base64_decode(encoded)¶
These test vectors were generated using the reference Rust implementation of URICrypt with TurboSHAKE128.¶
Test Configuration: secret_key (hex): 0102030405060708090a0b0c0d0e0f10 context: "test-context"¶
Input: "https://example.com/a/b/c" Output: "https://HOGo9vauZ3b3xsPNPQng5apSzL5V7QW94C7USgN8mHZJ337AKSWOucUwMuD-uUfF95SsSHCNgBkXUnH1uGll_YtBltXSqKEHNcYJJwbdFdhfWz19"¶
Input: "/a/b/c" Output: "/b9bCOhqZsvU9XxGOMk6d8QFQhTIdI_xYKpds2lWXpZCms5-az9wtfUft3rec3d9YkUo0N7VcxO5MXfxE5UobvgTJX8UpRdNN"¶
Input: "https://cdn.example.com/videos/2025/03/file.mp4" Output: "https://hxUM2N3txwYjGxjvCpWn30SznxR0v0fDbkSQgCTXCUu7Rq8iSbWP40OvYxKs9zC3kw1JNzAc4Wuj7RZvRd0VUprJWLs5KJPnWsA9Kguxa_J7XviTS3GTqf-XZdPxYyq1Y1MXVE9_4ojHwm6jBDUkVthAkuNe5Cqk_h6d"¶
Input: "https://example.com/" Output: "https://HOGo9vauZ3b3xsPNPQng5apSzL5V7QW94C7USgN8"¶
Input: "/path/to/resource" Output: "/b9bCOhqZsvU9XxGOMk6d8QFQPTuMlsQKDBhAbc77JvsdRj0kxiFipunATQmmCkNhAe0BPP2EqQoxORElY_ukfUYSrr9mIMfiO9joa3Kn5RS7eSKr"¶