Grant Negotiation and Authorization Protocol

Grant Negotiation and Authorization Protocol Bespoke Engineering

ietf@justin.richer.org https://bspk.io/

Okta

aaron@parecki.com https://aaronparecki.com

acert.io

fabien.imbault@acert.io https://acert.io/

Security GNAP Internet-Draft GNAP defines a mechanism for delegating authorization to a piece of software, and conveying that delegation to the software. This delegation can include access to a set of APIs as well as information passed directly to the software.

Introduction This protocol allows a piece of software, the client instance, to request delegated authorization to resource servers and to request direct information. This delegation is facilitated by an authorization server usually on behalf of a resource owner. The end user operating the software may interact with the authorization server to authenticate, provide consent, and authorize the request. The process by which the delegation happens is known as a grant, and GNAP allows for the negotiation of the grant process over time by multiple parties acting in distinct roles. This specification focuses on the portions of the delegation process facing the client instance. In particular, this specification defines interoperable methods for a client instance to request, negotiate, and receive access to information facilitated by the authorization server. This specification also discusses discovery mechanisms for the client instance to configure itself dynamically. The means for an authorization server and resource server to interoperate are discussed in the companion document, . The focus of this protocol is to provide interoperability between the different parties acting in each role, and is not to specify implementation details of each. Where appropriate, GNAP may make recommendations about internal implementation details, but these recommendations are to ensure the security of the overall deployment rather than to be prescriptive in the implementation. This protocol solves many of the same use cases as OAuth 2.0 , OpenID Connect , and the family of protocols that have grown up around that ecosystem. However, GNAP is not an extension of OAuth 2.0 and is not intended to be directly compatible with OAuth 2.0. GNAP seeks to provide functionality and solve use cases that OAuth 2.0 cannot easily or cleanly address. further details the protocol rationale compared to OAuth 2.0. GNAP and OAuth 2.0 will likely exist in parallel for many deployments, and considerations have been taken to facilitate the mapping and transition from legacy systems to GNAP. Some examples of these can be found in .

Terminology The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 when, and only when, they appear in all capitals, as shown here. This document contains non-normative examples of partial and complete HTTP messages, JSON structures, URIs, query components, keys, and other elements. Whenever possible, the document uses URI as a generic term, since it aligns with recommendations and matches better with the intent that the identifier may be reachable through various/generic means (compared to URLs). Some examples use a single trailing backslash \ to indicate line wrapping for long values, as per . The \ character and leading spaces on wrapped lines are not part of the value.

Roles The parties in GNAP perform actions under different roles. Roles are defined by the actions taken and the expectations leveraged on the role by the overall protocol.

| | +-------------+ | | +------------+ + | | + | | + | | + | | + | | + +----------+ + | Client | + | Instance | + +----------+ + + + + + + +-----------+ + +------------+ | | + + + | | | Resource | | End | | Owner | ~ ~ ~ ~ ~ ~ | User | | | | | +-----------+ +------------+ Legend + + + indicates interaction between a human and computer ----- indicates interaction between two pieces of software ~ ~ ~ indicates a potential equivalence or out-of-band communication between roles ]]>

Authorization Server (AS): server that grants delegated privileges to a particular instance of client software in the form of access tokens or other information (such as subject information).
Client: application that consumes resources from one or several RSs, possibly requiring access privileges from one or several ASs. The client is operated by the end user or it runs autonomously on behalf of a resource owner. Example: a client can be a mobile application, a web application, etc. Note: this specification differentiates between a specific instance (the client instance, identified by its unique key) and the software running the instance (the client software). For some kinds of client software, there could be many instances of that software, each instance with a different key.
Resource Server (RS): server that provides operations on protected resources, where operations require a valid access token issued by an AS.
Resource Owner (RO): subject entity that may grant or deny operations on resources it has authority upon. Note: the act of granting or denying an operation may be manual (i.e. through an interaction with a physical person) or automatic (i.e. through predefined organizational rules).
End user: natural person that operates a client instance. Note: that natural person may or may not be the same entity as the RO.

The design of GNAP does not assume any one deployment architecture, but instead attempts to define roles that can be fulfilled in a number of different ways for different use cases. As long as a given role fulfills all of its obligations and behaviors as defined by the protocol, GNAP does not make additional requirements on its structure or setup. Multiple roles can be fulfilled by the same party, and a given party can switch roles in different instances of the protocol. For example, the RO and end user in many instances are the same person, where a user is authorizing the client instance to act on their own behalf at the RS. In this case, one party fulfills both of the RO and end-user roles, but the roles themselves are still defined separately from each other to allow for other use cases where they are fulfilled by different parties. For another example, in some complex scenarios, an RS receiving requests from one client instance can act as a client instance for a downstream secondary RS in order to fulfill the original request. In this case, one piece of software is both an RS and a client instance from different perspectives, and it fulfills these roles separately as far as the overall protocol is concerned. A single role need not be deployed as a monolithic service. For example, a client instance could have components that are installed on the end user's device as well as a back-end system that it communicates with. If both of these components participate in the delegation protocol, they are both considered part of the client instance. If there are several copies of the client software that run separately but all share the same key material, such as a deployed cluster, then this cluster is considered a single client instance. In these cases, the distinct components of what is considered a GNAP client instance may use any number of different communication mechanisms between them, all of which would be considered an implementation detail of the client instances and out of scope of GNAP. For another example, an AS could likewise be built out of many constituent components in a distributed architecture. The component that the client instance calls directly could be different from the component that the RO interacts with to drive consent, since API calls and user interaction have different security considerations in many environments. Furthermore, the AS could need to collect identity claims about the RO from one system that deals with user attributes while generating access tokens at another system that deals with security rights. From the perspective of GNAP, all of these are pieces of the AS and together fulfill the role of the AS as defined by the protocol. These pieces may have their own internal communications mechanisms which are considered out of scope of GNAP.

Elements In addition to the roles above, the protocol also involves several elements that are acted upon by the roles throughout the process.

Attribute: characteristics related to a subject.
Access Token: a data artifact representing a set of rights and/or attributes. Note: an access token can be first issued to an client instance (requiring authorization by the RO) and subsequently rotated.
Grant: (verb): to permit an instance of client software to receive some attributes at a specific time and valid for a specific duration and/or to exercise some set of delegated rights to access a protected resource (noun): the act of granting.
Privilege: right or attribute associated with a subject. Note: the RO defines and maintains the rights and attributes associated to the protected resource, and might temporarily delegate some set of those privileges to an end user. This process is refered to as privilege delegation.
Protected Resource: protected API (Application Programming Interface) served by an RS and that can be accessed by a client, if and only if a valid access token is provided. Note: to avoid complex sentences, the specification document may simply refer to "resource" instead of "protected resource".
Right: ability given to a subject to perform a given operation on a resource under the control of an RS.
Subject: person, organization or device. It decides whether and under which conditions its attributes can be disclosed to other parties.
Subject Information: statement asserted by an AS about a subject.

Trust relationships GNAP defines its trust objective as: "the RO trusts the AS to ensure access validation and delegation of protected resources to end users, through third party clients." This trust objective can be decomposed into trust relationships between software elements and roles, especially the pairs end user/RO, end user/client, client/AS, RS/RO, AS/RO, AS/RS. Trust of an agent by its pair can exist if the pair is informed that the agent has made a promise to follow the protocol in the past (e.g. pre-registration, uncompromised cryptographic components) or if the pair is able to infer by indirect means that the agent has made such a promise (e.g. a compliant client request). Each agent defines its own valuation function of promises given or received. Examples of such valuations can be the benefits from interacting with other agents (e.g. safety in client access, interoperability with identity standards), the cost of following the protocol (including its security and privacy requirements and recommendations), a ranking of promise importance (e.g. a policy decision made by the AS), the assessment of one's vulnerability or risk of not being able to defend against threats, etc. Those valuations may depend on the context of the request. For instance, the AS may decide to either take into account or discard hints provided by the client, the RS may refuse bearer tokens, etc. depending on the specific case in which GNAP is used. Some promises can be conditional of some previous interactions (e.g. repeated requests). Looking back on each trust relationship: end user/RO: this relationship exists only when the end user and the RO are different, in which case the end user needs some out of band mechanism of getting the RO consent (see ). GNAP generally assumes that humans can be authenticated thanks to identity protocols (for instance, through an id_token assertion in ). end user/client: the client acts as a user agent. Depending on the technology used (browser, SPA, mobile application, IoT device, etc.), some interactions may or may not be possible (as described in ). Client developers promise to implement requirements and generally some recommendations or best practices, so that the end users may confidently use their software. However, end users might also be facing some attacker's client software, without even realizing it. end user/AS: when the client supports it (see ), the end user gets to interact with front-channel URIs provided by the AS. See for some considerations in trusting these interactions. client/AS: An honest AS may be facing an attacker's client (as discussed just above), or the reverse, and GNAP aims at making common attacks impractical. The core specification makes access tokens opaque to the client and defines the request/response scheme in detail, therefore avoiding extra trust hypotheses from this critical piece of software. Yet the AS may further define cryptographic attestations or optional rules to simplify the access of clients it already trusts, due to past behavior or organizational policies (see ). RS/RO: the RS promises it protects its resources from unauthorized access, and only accepts valid access tokens issued by a trusted AS. In case tokens are key bound, proper validation is expected from the RS. AS/RO: the AS is expected to follow the decisions made by the RO, either through interactive consent requests, repeated interactions or automated rules (as described in ). Privacy considerations aim to reduce the risk of an honest but too curious AS, or the consequences of an unexpected user data exposure. AS/RS: the AS promises to issue valid access tokens to legitimate client requests (i.e. after carrying out appropriate due diligence, as defined in the GNAP protocol). Some optional configurations are covered by . A global assumption made by GNAP is that authorization requests are security and privacy sensitive, and appropriate measures are respectively detailed in and . A formal trust model is out of scope of this specification, but might be carried out thanks to .

Sequences GNAP can be used in a variety of ways to allow the core delegation process to take place. Many portions of this process are conditionally present depending on the context of the deployments, and not every step in this overview will happen in all circumstances. Note that a connection between roles in this process does not necessarily indicate that a specific protocol message is sent across the wire between the components fulfilling the roles in question, or that a particular step is required every time. For example, for a client instance interested in only getting subject information directly, and not calling an RS, all steps involving the RS below do not apply. In some circumstances, the information needed at a given stage is communicated out of band or is preconfigured between the components or entities performing the roles. For example, one entity can fulfill multiple roles, and so explicit communication between the roles is not necessary within the protocol flow. Additionally some components may not be involved in all use cases. For example, a client instance could be calling the AS just to get direct user information and have no need to get an access token to call an RS.

Overall Protocol Sequence The following diagram provides a general overview of GNAP, including many different optional phases and connections. The diagrams in the following sections provide views of GNAP under more specific circumstances.

| Authorization | | | | |<-(3)--| Server | | | | | | (AS) | | | | |--(4)->| | | | | |<-(5)--| | | | | |--------------(6)------------->| | | | | | (7) | | | |<-------------(8)------------->| | | |--(9)->| | | | | |<-(10)-| | | | | |--------------(11)------------>| | | | | | (12) | | | |-(13)->| | | | | | | | | | +--------+ +---------------+ +------------+ Legend + + + indicates a possible interaction with a human ----- indicates an interaction between protocol roles ~ ~ ~ indicates a potential equivalence or out-of-band communication between roles ]]> (A) The end user interacts with the client instance to indicate a need for resources on behalf of the RO. This could identify the RS the client instance needs to call, the resources needed, or the RO that is needed to approve the request. Note that the RO and end user are often the same entity in practice, but GNAP makes no general assumption that they are. (1) The client instance determines what access is needed and which AS to approach for access. Note that for most situations, the client instance is pre-configured with which AS to talk to and which kinds of access it needs, but some more dynamic processes are discussed in . (2) The client instance requests access at the AS. (3) The AS processes the request and determines what is needed to fulfill the request. (See .) The AS sends its response to the client instance. (B) If interaction is required, the AS interacts with the RO to gather authorization. The interactive component of the AS can function using a variety of possible mechanisms including web page redirects, applications, challenge/response protocols, or other methods. The RO approves the request for the client instance being operated by the end user. Note that the RO and end user are often the same entity in practice, and many of GNAP's interaction methods allow the client instance to facilitate the end user interacting with the AS in order to fulfill the role of the RO. (4) The client instance continues the grant at the AS. (5) If the AS determines that access can be granted, it returns a response to the client instance including an access token for calling the RS and any directly returned information about the RO. (6) The client instance uses the access token to call the RS. (7) The RS determines if the token is sufficient for the request by examining the token. The means of the RS determining this access are out of scope of this specification, but some options are discussed in . (8) The client instance calls the RS using the access token until the RS or client instance determine that the token is no longer valid. (9) When the token no longer works, the client instance fetches an updated access token based on the rights granted in (5). (10) The AS issues a new access token to the client instance. (11) The client instance uses the new access token to call the RS. (12) The RS determines if the new token is sufficient for the request. The means of the RS determining this access are out of scope of this specification, but some options are discussed in . (13) The client instance disposes of the token once the client instance has completed its access of the RS and no longer needs the token. The following sections and contain specific guidance on how to use GNAP in different situations and deployments. For example, it is possible for the client instance to never request an access token and never call an RS, just as it is possible for there not to be a user involved in the delegation process.

Redirect-based Interaction In this example flow, the client instance is a web application that wants access to resources on behalf of the current user, who acts as both the end user and the resource owner (RO). Since the client instance is capable of directing the user to an arbitrary URI and receiving responses from the user's browser, interaction here is handled through front-channel redirects using the user's browser. The redirection URI used for interaction is a service hosted by the AS in this example. The client instance uses a persistent session with the user to ensure the same user that is starting the interaction is the user that returns from the interaction.

| | | | | | | | | | | |<-(3)-- Interaction Needed -------| | | | | | | | | | | |+ (4) + Redirect for Interaction + + + + + + + + + > | | | | | | | | | | | |<+ (5) +>| | | | | | AuthN | | | | | | | | | | | |<+ (6) +>| | | | | | AuthZ | | | | | | | | | |< (7) + Redirect for Continuation + + + + + + + + + +| | | | | | +------+ | |--(8)--- Continue Request ------->| | | | | | | |<-(9)----- Grant Access ----------| | | | | | | | | | +--------+ | |--(10)-- Access API ---------------------------->| RS | | | | | | | | |<-(11)-- API Response ---------------------------| | | | | | +--------+ +--------+ +--------+ ]]> The client instance establishes a verifiable session to the user, in the role of the end user. The client instance requests access to the resource. The client instance indicates that it can redirect to an arbitrary URI and receive a redirect from the browser. The client instance stores verification information for its redirect in the session created in (1). The AS determines that interaction is needed and responds with a URI to send the user to and information needed to verify the redirect in (7). The AS also includes information the client instance will need to continue the request in (8). The AS associates this continuation information with an ongoing request that will be referenced in (4), (6), and (8). The client instance stores the verification and continuation information from (3) in the session from (1). The client instance then redirects the user to the URI given by the AS in (3). The user's browser loads the interaction redirect URI. The AS loads the pending request based on the incoming URI generated in (3). The user authenticates at the AS, taking on the role of the RO. As the RO, the user authorizes the pending request from the client instance. When the AS is done interacting with the user, the AS redirects the user back to the client instance using the redirect URI provided in (2). The redirect URI is augmented with an interaction reference that the AS associates with the ongoing request created in (2) and referenced in (4). The redirect URI is also augmented with a hash of the security information provided in (2) and (3). The client instance loads the verification information from (2) and (3) from the session created in (1). The client instance calculates a hash based on this information and continues only if the hash validates. Note that the client instance needs to ensure that the parameters for the incoming request match those that it is expecting from the session created in (1). The client instance also needs to be prepared for the end user never being returned to the client instance and handle timeouts appropriately. The client instance loads the continuation information from (3) and sends the interaction reference from (7) in a request to continue the request. The AS validates the interaction reference ensuring that the reference is associated with the request being continued. If the request has been authorized, the AS grants access to the information in the form of access tokens and direct subject information to the client instance. The client instance uses the access token to call the RS. The RS validates the access token and returns an appropriate response for the API. An example set of protocol messages for this method can be found in .

User-code Interaction In this example flow, the client instance is a device that is capable of presenting a short, human-readable code to the user and directing the user to enter that code at a known URI. The URI the user enters the code at is an interactive service hosted by the AS in this example. The client instance is not capable of presenting an arbitrary URI to the user, nor is it capable of accepting incoming HTTP requests from the user's browser. The client instance polls the AS while it is waiting for the RO to authorize the request. The user's interaction is assumed to occur on a secondary device. In this example it is assumed that the user is both the end user and RO, though the user is not assumed to be interacting with the client instance through the same web browser used for interaction at the AS.

| | | | | | | | | | | |<-(2)-- Interaction Needed -------| | | | | | | | | | | |+ (3) + + Display User Code + + + + + + + + + + + + >| | | | | | | | | | | |<+ (4) + | | | | | |Open URI | | | | | | | | | | | |<+ (5) +>| | | | | | AuthN | | | |--(9)--- Continue Request (A) --->| | | | | | | |<+ (6) +>| | | |<-(10)- Not Yet Granted (Wait) ---| | Code | | | | | | | | | | | |<+ (7) +>| | | | | | AuthZ | | | | | | | | | | | |<+ (8) +>| | | | | |Completed| | | | | | | | | |--(11)-- Continue Request (B) --->| | +------+ | | | | | |<-(12)----- Grant Access ---------| | | | | | | | | | +--------+ | |--(13)-- Access API ---------------------------->| RS | | | | | | | | |<-(14)-- API Response ---------------------------| | | | | | +--------+ +--------+ +--------+ ]]> The client instance requests access to the resource. The client instance indicates that it can display a user code. The AS determines that interaction is needed and responds with a user code to communicate to the user. This could optionally include a URI to direct the user to, but this URI should be static and so could be configured in the client instance's documentation. The AS also includes information the client instance will need to continue the request in (8) and (10). The AS associates this continuation information with an ongoing request that will be referenced in (4), (6), (8), and (10). The client instance stores the continuation information from (2) for use in (8) and (10). The client instance then communicates the code to the user given by the AS in (2). The users directs their browser to the user code URI. This URI is stable and can be communicated via the client software's documentation, the AS documentation, or the client software itself. Since it is assumed that the RO will interact with the AS through a secondary device, the client instance does not provide a mechanism to launch the RO's browser at this URI. The end user authenticates at the AS, taking on the role of the RO. The RO enters the code communicated in (3) to the AS. The AS validates this code against a current request in process. As the RO, the user authorizes the pending request from the client instance. When the AS is done interacting with the user, the AS indicates to the RO that the request has been completed. Meanwhile, the client instance loads the continuation information stored at (3) and continues the request. The AS determines which ongoing access request is referenced here and checks its state. If the access request has not yet been authorized by the RO in (6), the AS responds to the client instance to continue the request at a future time through additional polled continuation requests. This response can include updated continuation information as well as information regarding how long the client instance should wait before calling again. The client instance replaces its stored continuation information from the previous response (2). Note that the AS may need to determine that the RO has not approved the request in a sufficient amount of time and return an appropriate error to the client instance. The client instance continues to poll the AS with the new continuation information in (9). If the request has been authorized, the AS grants access to the information in the form of access tokens and direct subject information to the client instance. The client instance uses the access token to call the RS. The RS validates the access token and returns an appropriate response for the API. An example set of protocol messages for this method can be found in .

Asynchronous Authorization In this example flow, the end user and RO roles are fulfilled by different parties, and the RO does not interact with the client instance. The AS reaches out asynchronously to the RO during the request process to gather the RO's authorization for the client instance's request. The client instance polls the AS while it is waiting for the RO to authorize the request.

| | | | | | | | | | | |<-(2)-- Not Yet Granted (Wait) ---| | | | | | | |<+ (3) +>| | | | | | AuthN | | | |--(6)--- Continue Request (A) --->| | | | | | | |<+ (4) +>| | | |<-(7)-- Not Yet Granted (Wait) ---| | AuthZ | | | | | | | | | | | |<+ (5) +>| | | | | |Completed| | | | | | | | | |--(8)--- Continue Request (B) --->| | +------+ | | | | | |<-(9)------ Grant Access ---------| | | | | | | | | | +--------+ | |--(10)-- Access API ---------------------------->| RS | | | | | | | | |<-(11)-- API Response ---------------------------| | | | | | +--------+ +--------+ +--------+ ]]> The client instance requests access to the resource. The client instance does not send any interaction modes to the server, indicating that it does not expect to interact with the RO. The client instance can also signal which RO it requires authorization from, if known, by using the user request section. The AS determines that interaction is needed, but the client instance cannot interact with the RO. The AS responds with the information the client instance will need to continue the request in (6) and (8), including a signal that the client instance should wait before checking the status of the request again. The AS associates this continuation information with an ongoing request that will be referenced in (3), (4), (5), (6), and (8). The AS determines which RO to contact based on the request in (1), through a combination of the user request, the resources request, and other policy information. The AS contacts the RO and authenticates them. The RO authorizes the pending request from the client instance. When the AS is done interacting with the RO, the AS indicates to the RO that the request has been completed. Meanwhile, the client instance loads the continuation information stored at (2) and continues the request. The AS determines which ongoing access request is referenced here and checks its state. If the access request has not yet been authorized by the RO in (6), the AS responds to the client instance to continue the request at a future time through additional polling. This response can include refreshed credentials as well as information regarding how long the client instance should wait before calling again. The client instance replaces its stored continuation information from the previous response (2). Note that the AS may need to determine that the RO has not approved the request in a sufficient amount of time and return an appropriate error to the client instance. The client instance continues to poll the AS with the new continuation information from (7). If the request has been authorized, the AS grants access to the information in the form of access tokens and direct subject information to the client instance. The client instance uses the access token to call the RS. The RS validates the access token and returns an appropriate response for the API. An example set of protocol messages for this method can be found in .

Software-only Authorization In this example flow, the AS policy allows the client instance to make a call on its own behalf, without the need for an RO to be involved at runtime to approve the decision. Since there is no explicit RO, the client instance does not interact with an RO.

| | | | | | | |<-(2)---- Grant Access -----| | | | | | +--------+ | |--(3)--- Access API ------------------->| RS | | | | | | | | |<-(4)--- API Response ------------------| | | | | | +--------+ +--------+ +--------+ ]]> The client instance requests access to the resource. The client instance does not send any interaction modes to the server. The AS determines that the request has been authorized, the AS grants access to the resource in the form of access tokens to the client instance. Note that direct subject information is not generally applicable in this use case, as there is no user involved. The client instance uses the access token to call the RS. The RS validates the access token and returns an appropriate response for the API. An example set of protocol messages for this method can be found in .

Refreshing an Expired Access Token In this example flow, the client instance receives an access token to access a resource server through some valid GNAP process. The client instance uses that token at the RS for some time, but eventually the access token expires. The client instance then gets a new access token by rotating the expired access token at the AS using the token's management URI.

| | | | | | | |<-(2)--- Grant Access --------------------| | | | | | | | +--------+ | | | |--(3)--- Access Resource --->| RS | | | | | | | | | | |<-(4)--- Success Response ---| | | | | | | | | | | | | | | | | | | | | | | |--(5)--- Access Resource --->| | | | | | | | | | | |<-(6)--- Error Response -----| | | | | | +--------+ | | | | | | | |--(7)--- Rotate Token ------------------->| | | | | | | |<-(8)--- Rotated Token -------------------| | | | | | +--------+ +--------+ ]]> The client instance requests access to the resource. The AS grants access to the resource with an access token usable at the RS. The access token response includes a token management URI. The client instance uses the access token to call the RS. The RS validates the access token and returns an appropriate response for the API. Time passes and the client instance uses the access token to call the RS again. The RS validates the access token and determines that the access token is expired. The RS responds to the client instance with an error. The client instance calls the token management URI returned in (2) to rotate the access token. The client instance uses the access token in this call as well as the appropriate key, see the token rotation section for details. The AS validates the rotation request including the signature and keys presented in (5) and returns a new access token. The response includes a new access token and can also include updated token management information, which the client instance will store in place of the values returned in (2).

Requesting User Information In this scenario, the client instance does not call an RS and does not request an access token. Instead, the client instance only requests and is returned direct subject information. Many different interaction modes can be used in this scenario, so these are shown only in the abstract as functions of the AS here.

| | | | | | | | | | | |<-(2)-- Interaction Needed -------| | | | | | | | | | | |+ (3) + Facilitate Interaction + + + + + + + + + + > | | | | | | | | | | | |<+ (4) +>| | | | | | AuthN | | | | | | | | | | | |<+ (5) +>| | | | | | AuthZ | | | | | | | | | |< (6) + Signal Continuation + + + + + + + + + + + + +| | | | | | +------+ | |--(7)--- Continue Request ------->| | | | | | | |<-(8)----- Grant Access ----------| | | | | | +--------+ +--------+ ]]> The client instance requests access to subject information. The AS determines that interaction is needed and responds with appropriate information for facilitating user interaction. The client instance facilitates the user interacting with the AS as directed in (2). The user authenticates at the AS, taking on the role of the RO. As the RO, the user authorizes the pending request from the client instance. When the AS is done interacting with the user, the AS returns the user to the client instance and signals continuation. The client instance loads the continuation information from (2) and calls the AS to continue the request. If the request has been authorized, the AS grants access to the requested direct subject information to the client instance. At this stage, the user is generally considered "logged in" to the client instance based on the identifiers and assertions provided by the AS. Note that the AS can restrict the subject information returned and it might not match what the client instance requested, see the section on subject information for details.

Requesting Access To start a request, the client instance sends a JSON document with an object as its root. Each member of the request object represents a different aspect of the client instance's request. Each field is described in detail in a section below.

access_token (object / array of objects):: Describes the rights and properties associated with the requested access token. REQUIRED if requesting an access token. See .
subject (object):: Describes the information about the RO that the client instance is requesting to be returned directly in the response from the AS. REQUIRED if requesting subject information. See .
client (object / string):: Describes the client instance that is making this request, including the key that the client instance will use to protect this request and any continuation requests at the AS and any user-facing information about the client instance used in interactions. REQUIRED. See .
user (object / string):: Identifies the end user to the AS in a manner that the AS can verify, either directly or by interacting with the end user to determine their status as the RO. OPTIONAL. See .
interact (object):: Describes the modes that the client instance supports for allowing the RO to interact with the AS and modes for the client instance to receive updates when interaction is complete. REQUIRED if interaction is supported. See .

Additional members of this request object can be defined by extensions to this protocol as described in . A non-normative example of a grant request is below:

The request and response MUST be sent as a JSON object in the body of the HTTP POST request with Content-Type application/json, unless otherwise specified by the signature mechanism. The authorization server MUST include the HTTP "Cache-Control" response header field with a value set to "no-store".

Requesting Access to Resources If the client instance is requesting one or more access tokens for the purpose of accessing an API, the client instance MUST include an access_token field. This field MUST be an object (for a single access token) or an array of these objects (for multiple access tokens), as described in the following sections.

Requesting a Single Access Token To request a single access token, the client instance sends an acccess_token object composed of the following fields.

access (array of objects/strings):: Describes the rights that the client instance is requesting for one or more access tokens to be used at RS's. REQUIRED. See .
label (string):: A unique name chosen by the client instance to refer to the resulting access token. The value of this field is opaque to the AS. If this field is included in the request, the AS MUST include the same label in the token response. REQUIRED if used as part of a multiple access token request, OPTIONAL otherwise.
flags (array of strings):: A set of flags that indicate desired attributes or behavior to be attached to the access token by the AS. OPTIONAL.

The values of the flags field defined by this specification are as follows:

"bearer":: If this flag is included, the access token being requested is a bearer token. If this flag is omitted, the access token is bound to the key used by the client instance in this request (or that key's most recent rotation) and the access token MUST be presented using the same key and proofing method. Methods for presenting bound and bearer access tokens are described in . See for additional considerations on the use of bearer tokens.
"split":: If this flag is included, the client instance is capable of receiving a different number of tokens than specified in the token request, including receiving multiple access tokens in response to any single token request or a different number of access tokens than requested in a multiple access token request. The label fields of the returned additional tokens are chosen by the AS. The client instance MUST be able to tell from the token response where and how it can use each of the access tokens. [[ See issue #37 ]]

Flag values MUST NOT be included more than once. Additional flags can be defined by extensions using a registry TBD. In the following example, the client instance is requesting access to a complex resource described by a pair of access request object.

If access is approved, the resulting access token is valid for the described resource and is bound to the client instance's key (or its most recent rotation). The token is labeled "token1-23" and could be split into multiple access tokens by the AS, if the AS chooses. The token response structure is described in .

Requesting Multiple Access Tokens To request multiple access tokens to be returned in a single response, the client instance sends an array of objects as the value of the access_token parameter. Each object MUST conform to the request format for a single access token request, as specified in requesting a single access token. Additionally, each object in the array MUST include the label field, and all values of these fields MUST be unique within the request. If the client instance does not include a label value for any entry in the array, or the values of the label field are not unique within the array, the AS MUST return an error. The following non-normative example shows a request for two separate access tokens, token1 and token2.

All approved access requests are returned in the multiple access token response structure using the values of the label fields in the request.

Requesting Subject Information If the client instance is requesting information about the RO from the AS, it sends a subject field as a JSON object. This object MAY contain the following fields (or additional fields defined in a registry TBD).

sub_id_formats (array of strings):: An array of subject identifier subject formats requested for the RO, as defined by . REQUIRED if subject identifiers are requested.
assertion_formats (array of strings):: An array of requested assertion formats. Possible values include id_token for an ID Token and saml2 for a SAML 2 assertion. Additional assertion formats are defined by a registry TBD. REQUIRED if assertions are requested.

The AS can determine the RO's identity and permission for releasing this information through interaction with the RO, AS policies, or assertions presented by the client instance. If this is determined positively, the AS MAY return the RO's information in its response as requested. Subject identifier types requested by the client instance serve only to identify the RO in the context of the AS and can't be used as communication channels by the client instance, as discussed in . The AS SHOULD NOT re-use subject identifiers for multiple different ROs. The "formats" and "assertions" request fields are independent of each other, and a returned assertion MAY use a different subject identifier than other assertions and subject identifiers in the response. All subject identifiers and assertions returned MUST refer to the same person.

Identifying the Client Instance When sending a non-continuation request to the AS, the client instance MUST identify itself by including the client field of the request and by signing the request as described in . Note that for a continuation request, the client instance is identified by its association with the request being continued and so this field is not sent under those circumstances. When client instance information is sent by value, the client field of the request consists of a JSON object with the following fields.

key (object / string):: The public key of the client instance to be used in this request as described in or a reference to a key as described in . REQUIRED.
class_id (string):: An identifier string that the AS can use to identify the client software comprising this client instance. The contents and format of this field are up to the AS. OPTIONAL.
display (object):: An object containing additional information that the AS MAY display to the RO during interaction, authorization, and management. OPTIONAL.

Additional fields are defined in a registry TBD. The client instance MUST prove possession of any presented key by the proof mechanism associated with the key in the request. Proof types are defined in a registry TBD and an initial set of methods is described in . If the same public key is sent by value on different access requests, the AS MUST treat these requests as coming from the same client instance for purposes of identification, authentication, and policy application. If the AS does not know the client instance's public key ahead of time, the AS MAY accept or reject the request based on AS policy, attestations within the client request, and other mechanisms. [[ See issue #44 ]] The client instance MUST NOT send a symmetric key by value in the request, as doing so would expose the key directly instead of simply proving possession of it. See considerations on symmetric keys in . The client instance's key MAY be pre-registered with the AS ahead of time and associated with a set of policies and allowable actions pertaining to that client. If this pre-registration includes other fields that can occur in the client request object described in this section, such as class_id or display, the pre-registered values MUST take precedence over any values given at runtime. Additional fields sent during a request but not present in a pre-registered client instance record at the AS SHOULD NOT be added to the client's pre-registered record. See additional considerations regarding client instance impersonation in . A client instance that is capable of talking to multiple AS's SHOULD use a different key for each AS to prevent a class of mix-up attacks as described in .

Identifying the Client Instance by Reference If the client instance has an instance identifier that the AS can use to determine appropriate key information, the client instance can send this instance identifier as a direct reference value in lieu of the client object. The instance identifier MAY be assigned to a client instance at runtime through a grant response () or MAY be obtained in another fashion, such as a static registration process at the AS.

When the AS receives a request with an instance identifier, the AS MUST ensure that the key used to sign the request is associated with the instance identifier. If the AS does not recognize the instance identifier, the request MUST be rejected with an error. If the client instance is identified in this manner, the registered key for the client instance MAY be a symmetric key known to the AS. See considerations on symmetric keys in .

Providing Displayable Client Instance Information If the client instance has additional information to display to the RO during any interactions at the AS, it MAY send that information in the "display" field. This field is a JSON object that declares information to present to the RO during any interactive sequences.

name (string):: Display name of the client software. RECOMMENDED.
uri (string):: User-facing web page of the client software. OPTIONAL.
logo_uri (string): Display image to represent the client software. The logo MAY be passed by value by using a data: URI referencing an image mediatype. OPTIONAL.

Additional display fields are defined by a registry TBD. The AS SHOULD use these values during interaction with the RO. The values are for informational purposes only and MUST NOT be taken as authentic proof of the client instance's identity or source. The AS MAY restrict display values to specific client instances, as identified by their keys in . See additional considerations for displayed client information in .

Authenticating the Client Instance If the presented key is known to the AS and is associated with a single instance of the client software, the process of presenting a key and proving possession of that key is sufficient to authenticate the client instance to the AS. The AS MAY associate policies with the client instance identified by this key, such as limiting which resources can be requested and which interaction methods can be used. For example, only specific client instances with certain known keys might be trusted with access tokens without the AS interacting directly with the RO as in . The presentation of a key allows the AS to strongly associate multiple successive requests from the same client instance with each other. This is true when the AS knows the key ahead of time and can use the key to authenticate the client instance, but also if the key is ephemeral and created just for this series of requests. As such the AS MAY allow for client instances to make requests with unknown keys. This pattern allows for ephemeral client instances, such as single-page applications, and client software with many individual long-lived instances, such as mobile applications, to generate key pairs per instance and use the keys within the protocol without having to go through a separate registration step. The AS MAY limit which capabilities are made available to client instances with unknown keys. For example, the AS could have a policy saying that only previously-registered client instances can request particular resources, or that all client instances with unknown keys have to be interactively approved by an RO.

Identifying the User If the client instance knows the identity of the end user through one or more identifiers or assertions, the client instance MAY send that information to the AS in the "user" field. The client instance MAY pass this information by value or by reference.

sub_ids (array of objects):: An array of subject identifiers for the end user, as defined by . OPTIONAL.
assertions (array of objects): An array containing assertions as objects each containing the assertion format and the assertion value as the JSON string serialization of the assertion. OPTIONAL.

Subject identifiers are hints to the AS in determining the RO and MUST NOT be taken as declarative statements that a particular RO is present at the client instance and acting as the end user. Assertions SHOULD be validated by the AS. [[ See issue #49 ]] If the identified end user does not match the RO present at the AS during an interaction step, the AS SHOULD reject the request with an error. [[ See issue #50 ]] If the AS trusts the client instance to present verifiable assertions, the AS MAY decide, based on its policy, to skip interaction with the RO, even if the client instance provides one or more interaction modes in its request. See for considerations that the AS has to make when accepting and processing assertions from the client instance.

Identifying the User by Reference The AS can identify the current end user to the client instance with a reference which can be used by the client instance to refer to the end user across multiple requests. If the client instance has a reference for the end user at this AS, the client instance MAY pass that reference as a string. The format of this string is opaque to the client instance.

One means of dynamically obtaining such a user reference is from the AS returning an opaque subject identifier as described in . Other means of configuring a client instance with a user identifier are out of scope of this specification. User reference identifiers are not intended to be human-readable user identifiers or structured assertions. For the client instance to send either of these, use the full user request object instead. If the AS does not recognize the user reference, it MUST return an error.

Interacting with the User Often, the AS will require interaction with the RO in order to approve a requested delegation to the client instance for both access to resources and direct subject information. Many times the end user using the client instance is the same person as the RO, and the client instance can directly drive interaction with the end user by facilitating the process through means such as redirection to a URI or launching an application. Other times, the client instance can provide information to start the RO's interaction on a secondary device, or the client instance will wait for the RO to approve the request asynchronously. The client instance could also be signaled that interaction has concluded through a callback mechanism. The client instance declares the parameters for interaction methods that it can support using the interact field. The interact field is a JSON object with three keys whose values declare how the client can initiate and complete the request, as well as provide hints to the AS about user preferences such as locale. A client instance MUST NOT declare an interaction mode it does not support. The client instance MAY send multiple modes in the same request. There is no preference order specified in this request. An AS MAY respond to any, all, or none of the presented interaction modes in a request, depending on its capabilities and what is allowed to fulfill the request.

start (array of strings/objects):: Indicates how the client instance can start an interaction. REQUIRED.
finish (object):: Indicates how the client instance can receive an indication that interaction has finished at the AS. OPTIONAL.
hints (object):: Provides additional information to inform the interaction process at the AS. OPTIONAL.

In this non-normative example, the client instance is indicating that it can redirect the end user to an arbitrary URI and can receive a redirect through a browser request.

In this non-normative example, the client instance is indicating that it can display a user code and direct the end user to an arbitrary URI on a secondary device, but it cannot accept a redirect or push callback.

If the client instance does not provide a suitable interaction mechanism, the AS cannot contact the RO asynchronously, and the AS determines that interaction is required, then the AS SHOULD return an error since the client instance will be unable to complete the request without authorization. The AS SHOULD handle any interact request as a one-time-use mechanism and SHOULD apply suitable timeouts to any interaction mechanisms provided, including user codes and redirection URIs. The client instance SHOULD apply suitable timeouts to any callback URIs.

Start Mode Definitions This specification defines the following interaction start modes as an array of string values under the start key:

"redirect":: Indicates that the client instance can direct the end user to an arbitrary URI for interaction.
"app":: Indicates that the client instance can launch an application on the end user's device for interaction.
"user_code":: Indicates that the client instance can communicate a human-readable short code to the end user for use with a stable URI.
"user_code_uri":: Indicates that the client instance can communicate a human-readable short code to the end user for use with a short, dynamic URI.

Redirect to an Arbitrary URI If the client instance is capable of directing the end user to a URI defined by the AS at runtime, the client instance indicates this by including redirect in the array under the start key. The means by which the client instance will activate this URI is out of scope of this specification, but common methods include an HTTP redirect, launching a browser on the end user's device, providing a scannable image encoding, and printing out a URI to an interactive console. While this URI is generally hosted at the AS, the client instance can make no assumptions about its contents, composition, or relationship to the AS grant URI.

If this interaction mode is supported for this client instance and request, the AS returns a redirect interaction response . The client instance manages this interaction method as described in . See for more considerations regarding the use of front-channel communication techniques such as this.

Open an Application-specific URI If the client instance can open a URI associated with an application on the end user's device, the client instance indicates this by including app in the array under the start key. The means by which the client instance determines the application to open with this URI are out of scope of this specification.

If this interaction mode is supported for this client instance and request, the AS returns an app interaction response with an app URI payload . The client instance manages this interaction method as described in . [[ See issue #54 ]]

Display a Short User Code If the client instance is capable of displaying or otherwise communicating a short, human-entered code to the RO, the client instance indicates this by including user_code in the array under the start key. This code is to be entered at a static URI that does not change at runtime. The client instance has no reasonable means to communicate a dynamic URI to the RO, and so this URI is usually communicated out of band to the RO through documentation or other messaging outside of GNAP. While this URI is generally hosted at the AS, the client instance can make no assumptions about its contents, composition, or relationship to the AS grant URI.

If this interaction mode is supported for this client instance and request, the AS returns a user code as specified in . The client instance manages this interaction method as described in .

Display a Short User Code and URI If the client instance is capable of displaying or otherwise communicating a short, human-entered code along with a short, human-entered URI to the RO, the client instance indicates this by including user_code_uri in the array under the start key. This code is to be entered at the dynamic URL given in the response. While this URL is generally hosted at the AS, the client instance can make no assumptions about its contents, composition, or relationship to the AS grant URL.

If this interaction mode is supported for this client instance and request, the AS returns a user code and interaction URL as specified in . The client instance manages this interaction method as described in .

Finish Interaction Methods If the client instance is capable of receiving a message from the AS indicating that the RO has completed their interaction, the client instance indicates this by sending the following members of an object under the finish key.

method (string):: The callback method that the AS will use to contact the client instance. REQUIRED.
uri (string):: Indicates the URI that the AS will either send the RO to after interaction or send an HTTP POST request. This URI MAY be unique per request and MUST be hosted by or accessible by the client instance. This URI MUST NOT contain any fragment component. This URI MUST be protected by HTTPS, be hosted on a server local to the RO's browser ("localhost"), or use an application-specific URI scheme. If the client instance needs any state information to tie to the front channel interaction response, it MUST use a unique callback URI to link to that ongoing state. The allowable URIs and URI patterns MAY be restricted by the AS based on the client instance's presented key information. The callback URI SHOULD be presented to the RO during the interaction phase before redirect. REQUIRED for redirect and push methods.
nonce (string):: Unique value to be used in the calculation of the "hash" query parameter sent to the callback URI, must be sufficiently random to be unguessable by an attacker. MUST be generated by the client instance as a unique value for this request. REQUIRED.
hash_method (string):: The hash calculation mechanism to be used for the callback hash in . Can be one of sha3 or sha2. If absent, the default value is sha3. OPTIONAL. [[ See issue #56 ]]

This specification defines the following values for the method parameter, with other values defined by a registry TBD:

"redirect":: Indicates that the client instance can receive a redirect from the end user's device after interaction with the RO has concluded.
"push":: Indicates that the client instance can receive an HTTP POST request from the AS after interaction with the RO has concluded.

If this interaction mode is supported for this client instance and request, the AS returns a nonce for use in validating the callback response. Requests to the callback URI MUST be processed as described in , and the AS MUST require presentation of an interaction callback reference as described in . [[ See issue #58 ]]

Receive an HTTP Callback Through the Browser A finish method value of redirect indicates that the client instance will expect a request from the RO's browser using the HTTP method GET as described in .

Requests to the callback URI MUST be processed by the client instance as described in . Since the incoming request to the callback URI is from the RO's browser, this method is usually used when the RO and end user are the same entity. See for considerations on ensuring the incoming HTTP message matches the expected context of the request. See for more considerations regarding the use of front-channel communication techniques such as this.

Receive an HTTP Direct Callback A finish method value of push indicates that the client instance will expect a request from the AS directly using the HTTP method POST as described in .

Requests to the callback URI MUST be processed by the client instance as described in . Since the incoming request to the callback URI is from the AS and not from the RO's browser, this request is not expected to have any shared session information from the start method. See and for more considerations regarding the use of back-channel and polling mechanisms like this.

Hints The hints key is an object describing one or more suggestions from the client instance that the AS can use to help drive user interaction. This specification defines the following properties under the hints key:

ui_locales (array of strings):: Indicates the end user's preferred locales that the AS can use during interaction, particularly before the RO has authenticated. OPTIONAL.

The following sections detail requests for interaction hints. Additional interaction hints are defined in a registry TBD.

Indicate Desired Interaction Locales If the client instance knows the end user's locale and language preferences, the client instance can send this information to the AS using the ui_locales field with an array of locale strings as defined by .

If possible, the AS SHOULD use one of the locales in the array, with preference to the first item in the array supported by the AS. If none of the given locales are supported, the AS MAY use a default locale.

Extending Interaction Modes Additional interaction start modes, finish modes, and hints are defined in a registry TBD.

Extending The Grant Request The request object MAY be extended by registering new items in a registry TBD. Extensions SHOULD be orthogonal to other parameters. Extensions MUST document any aspects where the extension item affects or influences the values or behavior of other request and response objects.

Grant Response In response to a client instance's request, the AS responds with a JSON object as the HTTP entity body. Each possible field is detailed in the sections below.

continue (object):: Indicates that the client instance can continue the request by making one or more continuation requests. REQUIRED if continuation calls are allowed for this client instance on this grant request. See .
access_token (object / array of objects):: A single access token or set of access tokens that the client instance can use to call the RS on behalf of the RO. REQUIRED if an access token is included. See .
interact (object):: Indicates that interaction through some set of defined mechanisms needs to take place. REQUIRED if interaction is needed or allowed. See .
subject (object):: Claims about the RO as known and declared by the AS. REQUIRED if subject information is included. See .
instance_id (string):: An identifier this client instance can use to identify itself when making future requests. OPTIONAL. See .
error (object):: An error code indicating that something has gone wrong. REQUIRED for an error condition. If included, other fields MUST NOT be included. See .

In this example, the AS is returning an interaction URI, a callback nonce, and a continuation response.

In this example, the AS is returning a bearer access token with a management URI and a subject identifier in the form of an opaque identifier.

In this example, the AS is returning set of subject identifiers, simultaneously as an opaque identifier, an email address, and a decentralized identifier URL (DID).

Request Continuation If the AS determines that the request can be continued with additional requests, it responds with the continue field. This field contains a JSON object with the following properties.

uri (string):: The URI at which the client instance can make continuation requests. This URI MAY vary per request, or MAY be stable at the AS. The client instance MUST use this value exactly as given when making a continuation request. REQUIRED.
wait (integer):: The amount of time in integer seconds the client instance MUST wait after receiving this request continuation response and calling the continuation URI. The value SHOULD NOT be less than five seconds, and omission of the value MUST NOT be interpreted as zero (i.e., no delay between requests). RECOMMENDED.
access_token (object):: A unique access token for continuing the request, called the "continuation access token". The value of this property MUST be in the format specified in . This access token MUST be bound to the client instance's key used in the request and MUST NOT be a bearer token. As a consequence, the flags array of this access token MUST NOT contain the string bearer and the key field MUST be omitted. The client instance MUST present the continuation access token in all requests to the continuation URI as described in . REQUIRED.

The client instance can use the values of this field to continue the request as described in . Note that the client instance MUST sign all continuation requests with its key as described in and MUST present the access token in its continuation request. This field SHOULD be returned when interaction is expected, to allow the client instance to follow up after interaction has been concluded.

Access Tokens If the AS has successfully granted one or more access tokens to the client instance, the AS responds with the access_token field. This field contains either a single access token as described in or an array of access tokens as described in . The client instance uses any access tokens in this response to call the RS as described in .

Single Access Token If the client instance has requested a single access token and the AS has granted that access token, the AS responds with the "access_token" field. The value of this field is an object with the following properties.

value (string):: The value of the access token as a string. The value is opaque to the client instance. The value SHOULD be limited to ASCII characters to facilitate transmission over HTTP headers within other protocols without requiring additional encoding. REQUIRED.
label (string):: The value of the label the client instance provided in the associated token request, if present. If the token has been split by the AS, the value of the label field is chosen by the AS and the split flag is used. REQUIRED for multiple access tokens, OPTIONAL for single access token.
manage (string):: The management URI for this access token. If provided, the client instance MAY manage its access token as described in . This management URI is a function of the AS and is separate from the RS the client instance is requesting access to. This URI MUST NOT include the access token value and SHOULD be different for each access token issued in a request. OPTIONAL.
access (array of objects/strings):: A description of the rights associated with this access token, as defined in . If included, this MUST reflect the rights associated with the issued access token. These rights MAY vary from what was requested by the client instance. REQUIRED.
expires_in (integer):: The number of seconds in which the access will expire. The client instance MUST NOT use the access token past this time. An RS MUST NOT accept an access token past this time. Note that the access token MAY be revoked by the AS or RS at any point prior to its expiration. OPTIONAL.
key (object / string):: The key that the token is bound to, if different from the client instance's presented key. The key MUST be an object or string in a format described in . The client instance MUST be able to dereference or process the key information in order to be able to sign the request. OPTIONAL.
flags (array of strings):: A set of flags that represent attributes or behaviors of the access token issued by the AS. OPTIONAL.

The values of the flags field defined by this specification are as follows:

"bearer":: This flag indicates whether the token is a bearer token, not bound to a key and proofing mechanism. If the bearer flag is present, the access token is a bearer token, and the key field in this response MUST be omitted. If the bearer flag is omitted and the key field in this response is omitted, the token is bound the key used by the client instance in its request for access. If the bearer flag is omitted, and the key field is present, the token is bound to the key and proofing mechanism indicated in the key field. See for additional considerations on the use of bearer tokens.
"durable":: Flag indicating a hint of AS behavior on token rotation. If this flag is present, then the client instance can expect a previously-issued access token to continue to work after it has been rotated or the underlying grant request has been modified, resulting in the issuance of new access tokens. If this flag is omitted, the client instance can anticipate a given access token could stop working after token rotation or grant request modification. Note that a token flagged as durable can still expire or be revoked through any normal means.
"split":: Flag indicating that this token was generated by issuing multiple access tokens in response to one of the client instance's token request objects. This behavior MUST NOT be used unless the client instance has specifically requested it by use of the split flag.

Flag values MUST NOT be included more than once. Additional flags can be defined by extensions using a registry TBD. The following non-normative example shows a single access token bound to the client instance's key used in the initial request, with a management URI, and that has access to three described resources (one using an object and two described by reference strings).

The following non-normative example shows a single bearer access token with access to two described resources.

If the client instance requested a single access token, the AS MUST NOT respond with the multiple access token structure unless the client instance sends the split flag as described in . If the AS has split the access token response, the response MUST include the split flag. [[ See issue #69 ]]

Multiple Access Tokens If the client instance has requested multiple access tokens and the AS has granted at least one of them, the AS responds with the "access_token" field. The value of this field is a JSON array, the members of which are distinct access tokens as described in . Each object MUST have a unique label field, corresponding to the token labels chosen by the client instance in the multiple access token request. In this non-normative example, two tokens are issued under the names token1 and token2, and only the first token has a management URI associated with it.

Each access token corresponds to one of the objects in the access_token array of the client instance's request. The multiple access token response MUST be used when multiple access tokens are requested, even if only one access token is issued as a result of the request. The AS MAY refuse to issue one or more of the requested access tokens, for any reason. In such cases the refused token is omitted from the response and all of the other issued access tokens are included in the response the requested names appropriate names. If the client instance requested multiple access tokens, the AS MUST NOT respond with a single access token structure, even if only a single access token is granted. In such cases, the AS responds with a multiple access token structure containing one access token. If the AS has split the access token response, the response MUST include the split flag in the flags array.

Each access token MAY be bound to different keys with different proofing mechanisms. The manage URI MUST NOT contain the access token value.

Interaction Modes If the client instance has indicated a capability to interact with the RO in its request, and the AS has determined that interaction is both supported and necessary, the AS responds to the client instance with any of the following values in the interact field of the response. There is no preference order for interaction modes in the response, and it is up to the client instance to determine which ones to use. All supported interaction methods are included in the same interact object.

redirect (string):: Redirect to an arbitrary URI. REQUIRED if the redirect interaction start mode is possible for this request. See .
app (string):: Launch of an application URI. REQUIRED if the app interaction start mode is possible for this request. See .
user_code (object):: Display a short user code. REQUIRED if the user_code interaction start mode is possible for this request. See .
user_code_uri (object):: Display a short user code and URL. REQUIRED if the user_code_uri interaction start mode is possible for this request.
finish (string):: A nonce used by the client instance to verify the callback after interaction is completed. REQUIRED if the interaction finish method requested by the client instance is possible for this request. See .

Additional interaction mode responses can be defined in a registry TBD. The AS MUST NOT respond with any interaction mode that the client instance did not indicate in its request. The AS MUST NOT respond with any interaction mode that the AS does not support. Since interaction responses include secret or unique information, the AS SHOULD respond to each interaction mode only once in an ongoing request, particularly if the client instance modifies its request.

Redirection to an arbitrary URI If the client instance indicates that it can redirect to an arbitrary URI and the AS supports this mode for the client instance's request, the AS responds with the "redirect" field, which is a string containing the URI to direct the end user to. This URI MUST be unique for the request and MUST NOT contain any security-sensitive information such as user identifiers or access tokens.

The URI returned is a function of the AS, but the URI itself MAY be completely distinct from the URI the client instance uses to request access, allowing an AS to separate its user-interactive functionality from its back-end security functionality. If the AS does not directly host the functionality accessed through the given URI, then the means for the interaction functionality to communicate with the rest of the AS are out of scope for this specification. [[ See issue #72 ]] The client instance sends the end user to the URI to interact with the AS. The client instance MUST NOT alter the URI in any way. The means for the client instance to send the end user to this URI is out of scope of this specification, but common methods include an HTTP redirect, launching the system browser, displaying a scannable code, or printing out the URI in an interactive console. See details of the interaction in .

Launch of an application URI If the client instance indicates that it can launch an application URI and the AS supports this mode for the client instance's request, the AS responds with the "app" field, which is a string containing the URI for the client instance to launch. This URI MUST be unique for the request and MUST NOT contain any security-sensitive information such as user identifiers or access tokens.

The means for the launched application to communicate with the AS are out of scope for this specification. The client instance launches the URI as appropriate on its platform, and the means for the client instance to launch this URI is out of scope of this specification. The client instance MUST NOT alter the URI in any way. The client instance MAY attempt to detect if an installed application will service the URI being sent before attempting to launch the application URI. See details of the interaction in . [[ See issue #71 ]]

Display of a Short User Code If the client instance indicates that it can display a short user-typeable code and the AS supports this mode for the client instance's request, the AS responds with a "user_code" field. This field is an object that contains the following members.

code (string):: A unique short code that the user can type into a web page. This string MUST be case-insensitive, MUST consist of only easily typeable characters (such as letters or numbers). The time in which this code will be accepted SHOULD be short lived, such as several minutes. It is RECOMMENDED that this code be no more than eight characters in length. REQUIRED.

The client instance MUST communicate the "code" to the end user in some fashion, such as displaying it on a screen or reading it out audibly. This code is used by the interaction component of the AS as a means of identifying the pending grant request and does not function as an authentication factor for the RO. The URI that the end user is intended to enter the code into MUST be stable, since the client instance is expected to have no means of communicating a dynamic URI to the end user at runtime. As this interaction mode is designed to facilitate interaction via a secondary device, it is not expected that the client instance redirect the end user to the URL given here at runtime. If the client instance is capable of communicating an short arbitrary URI to the end user for use with the user code, the client instance can instead use the "user_code_uri" method instead. If the client instance is capable of communicating a long arbitrary URI to the end user, such as through a scannable code, the client instance can use the "redirect" mode for this purpose instead of or in addition to the user code mode. See details of the interaction in .

Display of a Short User Code and URI If the client instance indicates that it can display a short user-typeable code and the AS supports this mode for the client instance's request, the AS responds with a "user_code_uri" object that contains the following members.

code (string):: A unique short code that the end user can type into a provided URI. This string MUST be case-insensitive, MUST consist of only easily typeable characters (such as letters or numbers). The time in which this code will be accepted SHOULD be short lived, such as several minutes. It is RECOMMENDED that this code be no more than eight characters in length. REQUIRED.
uri (string):: The interaction URI that the client instance will direct the RO to. This URI MUST be short enough to be communicated to the end user. It is RECOMMENDED that this URI be short enough for an end user to type in manually. The URI MUST NOT contain the code value. REQUIRED.

The client instance MUST communicate the "code" to the end user in some fashion, such as displaying it on a screen or reading it out audibly. This code is used by the interaction component of the AS as a means of identifying the pending grant request and does not function as an authentication factor for the RO. The client instance MUST also communicate the URI to the end user. Since it is expected that the end user will continue interaction on a secondary device, the URI needs to be short enough to allow the end user to type or copy it to a secondary device without mistakes. The URI returned is a function of the AS, but the URI itself MAY be completely distinct from the URI the client instance uses to request access, allowing an AS to separate its user-interactive functionality from its back-end security functionality. If the AS does not directly host the functionality accessed through the given URI, then the means for the interaction functionality to communicate with the rest of the AS are out of scope for this specification. See details of the interaction in .

Interaction Finish If the client instance indicates that it can receive a post-interaction redirect or push at a URI and the AS supports this mode for the client instance's request, the AS responds with a finish field containing a nonce that the client instance will use in validating the callback as defined in .

When the interaction is completed, the interaction component MUST contact the client instance using either a redirect or launch of the RO's browser or through an HTTP POST to the client instance's callback URI using the method indicated in the interaction request as described in . If the AS returns a nonce, the client instance MUST NOT continue a grant request before it receives the associated interaction reference on the callback URI. See details in .

Extending Interaction Mode Responses Extensions to this specification can define new interaction mode responses in a registry TBD. Extensions MUST document the corresponding interaction request.

Returning Subject Information If information about the RO is requested and the AS grants the client instance access to that data, the AS returns the approved information in the "subject" response field. The AS MUST return the subject field only in cases where the AS is sure that the RO and the end user are the same party. This can be accomplished through some forms of interaction with the RO. This field is an object with the following OPTIONAL properties.

sub_ids (array of objects):: An array of subject identifiers for the RO, as defined by . REQUIRED if returning subject identifiers.
assertions (array of objects):: An array containing assertions as objects each containing the assertion format and the assertion value as the JSON string serialization of the assertion. Possible formats include id_token for an ID Token and saml2 for a SAML 2 assertion. Additional assertion formats are defined by a registry TBD. REQUIRED if returning assertions.
updated_at (string):: Timestamp as an ISO8610 date string, indicating when the identified account was last updated. The client instance MAY use this value to determine if it needs to request updated profile information through an identity API. The definition of such an identity API is out of scope for this specification. RECOMMENDED.

Subject identifiers returned by the AS SHOULD uniquely identify the RO at the AS. Some forms of subject identifier are opaque to the client instance (such as the subject of an issuer and subject pair), while others forms (such as email address and phone number) are intended to allow the client instance to correlate the identifier with other account information at the client instance. The AS MUST ensure that the returned subject identifiers only apply to the authenticated end user. The client instance MUST NOT request or use any returned subject identifiers for communication purposes (see ). That is, a subject identifier returned in the format of an email address or a phone number only identifies the RO to the AS and does not indicate that the AS has validated that the represented email address or phone number in the identifier is suitable for communication with the current user. To get such information, the client instance MUST use an identity protocol to request and receive additional identity claims. The details of an identity protocol and associated schema are outside the scope of this specification. Extensions to this specification MAY define additional response properties in a registry TBD. See for considerations that the client instance has to make when accepting and processing assertions from the AS.

Returning a Dynamically-bound Client Instance Identifier Many parts of the client instance's request can be passed as either a value or a reference. The use of a reference in place of a value allows for a client instance to optimize requests to the AS. Some references, such as for the client instance's identity or the requested resources, can be managed statically through an admin console or developer portal provided by the AS or RS. The developer of the client software can include these values in their code for a more efficient and compact request. If desired, the AS MAY also generate and return an instance identifier dynamically to the client instance in the response to facilitate multiple interactions with the same client instance over time. The client instance SHOULD use this instance identifier in future requests in lieu of sending the associated data values in the client field. Dynamically generated client instance identifiers are string values that MUST be protected by the client instance as secrets. Instance identifier values MUST be unguessable and MUST NOT contain any information that would compromise any party if revealed. Instance identifier values are opaque to the client instance.

instance_id (string):: A string value used to represent the information in the client object that the client instance can use in a future request, as described in . OPTIONAL.

This non-normative example shows an instance identifier along side an issued access token.

[[ See issue #77 ]] [[ See issue #78 ]]

Error Response If the AS determines that the request cannot be issued for any reason, it responds to the client instance with an error message.

error (string):

A single ASCII error code from the following, with additional values available in a registry TBD. REQUIRED.

"invalid_request":: The request is missing a required parameter, includes an invalid parameter value or is otherwise malformed.
"invalid_client":: The request was made from a client that was not recognized or allowed by the AS, or the client's signature validation failed.
"user_denied":: The RO denied the request.
"too_fast":: The client instance did not respect the timeout in the wait response.
"unknown_request":: The request referenced an unknown ongoing access request.
"request_denied":: The request was denied for an unspecified reason.

error_description (string):

A human-readable string description of the error intended for the developer of the client. OPTIONAL.

For example, if the RO denied the request while interacting with the AS, the AS would return the following error when the client instance tries to continue the grant request:

Extending the Response Extensions to this specification MAY define additional fields for the grant response in a registry TBD.

Determining Authorization and Consent When the client instance makes its initial request to the AS for delegated access, it is capable of asking for several different kinds of information in response: the access being requested in the access_token request parameter the subject information being requested in the subject request parameter any additional requested information defined by extensions of this protocol The AS determines what authorizations and consents are required to fulfill this requested delegation. The details of how the AS makes this determination are out of scope for this document. However, there are several common patterns defined and supported by GNAP for fulfilling these requirements, including information sent by the client instance, information gathered through the interaction process, and information supplied by external parties. An individual AS can define its own policies and processes for deciding when and how to gather the necessary authorizations and consent. The client instance can supply information directly to the AS in its request. From this information, the AS can determine if the requested delegation can be granted immediately. The client instance can send several kinds of things, including: the identity of the client instance, known from the presented keys or associated identifiers the identity of the end user presented in the user request parameter any additional information presented by the client instance in the request, including any extensions The AS will verify this presented information in the context of the client instance's request and can only trust the information as much as it trusts the presentation and context of the information. If the AS determines that the information presented in the initial request is sufficient for granting the requested access, the AS MAY return the positive results immediately in its response with access tokens and subject information. If the AS determines that additional runtime authorization is required, the AS can either deny the request outright or use a number of means at its disposal to gather that authorization from the appropriate ROs, including for example: starting interaction with the end user facilitated by the client software, such as a redirection or user code challenging the client instance through a challenge-response mechanism requesting that the client instance present specific additional information, such as a user's credential or an assertion contacting an RO through an out-of-band mechanism, such as a push notification contacting an auxiliary software process through an out-of-band mechanism, such as querying a digital wallet The authorization and consent gathering process in GNAP is left deliberately flexible to allow for a wide variety of different deployments, interactions, and methodologies. In this process, the AS can gather consent from the RO as necessitated by the access that has been requested. The AS can sometimes determine which RO needs to consent based on what has been requested by the client instance, such as a specific RS record, an identified user, or a request requiring specific access such as approval by an administrator. If the AS has a means of contacting the RO directly, it could do so without involving the client instance in its consent gathering process. For example, the AS could push a notification to a known RO and have the RO approve the pending request asynchronously. These interactions can be through an interface of the AS itself (such as a hosted web page), through another application (such as something installed on the RO's device), through a messaging fabric, or any other means. When interacting with an RO, the AS can do anything it needs to determine the authorization of the requested grant, including: authenticate the RO, through a local account or some other means such as federated login validate the RO through presentation of claims, attributes, or other information prompt the RO for consent for the requested delegation describe to the RO what information is being released, to whom, and for what purpose provide warnings to the RO about potential attacks or negative effects of allowing the information allow the RO to modify the client instance's requested access, including limiting or expanding that access provide the RO with artifacts such as receipts to facilitate an audit trail of authorizations allow the RO to deny the requested delegation The AS is also allowed to request authorization from more than one RO, if the AS deems fit. For example, a medical record might need to be released by both an attending nurse and a physician, or both owners of a bank account need to sign off on a transfer request. Alternatively, the AS could require N of M possible RO's to approve a given request in order. The AS could also determine that the end user is not the appropriate RO for a given request and reach out to the appropriate RO asynchronously. The details of determining which RO's are required for a given request are out of scope for this specification. The client instance can also indicate that it is capable of facilitating interaction with the end user, another party, or another piece of software through its interaction start request. In many cases, the end user is delegating their own access as RO to the client instance. Here, the AS needs to determine the identity of the end user and will often need to interact directly with the end user to determine their status as an RO and collect their consent. If the AS has determined that authorization is required and the AS can support one or more of the requested interaction start methods, the AS returns the associated interaction start responses. The client instance SHOULD initiate one or more of these interaction methods in order to facilitate the granting of the request. If more than one interaction start method is available, the means by which the client chooses which methods to follow is out of scope of this specification. The client instance MUST use each interaction method once at most. After starting interaction, the client instance can then make a continuation request either in response to a signal indicating the finish of the interaction, through polling, or through some other method defined by an extension of this specification. If the AS and client instance have not reached a state where the delegation can be granted, the AS and client instance can repeat the interaction process as long as the AS supplies the client instance with continuation information to facilitate the ongoing requests.

Interaction Start Methods To initiate an interaction start method indicated by the interaction start responses from the AS, the client instance follows the steps defined by that interaction method. The actions of the client instance required for the interaction start modes defined in this specification are described in the following sections.

Interaction at a Redirected URI When the end user is directed to an arbitrary URI through the "redirect" mode, the client instance facilitates opening the URI through the end user's web browser. The client instance could launch the URI through the system browser, provide a clickable link, redirect the user through HTTP response codes, or display the URI in a form the end user can use to launch such as a multidimensional barcode. With this method, it is common (though not required) for the RO to be the same party as the end user, since the client instance has to communicate the redirection URI to the end user. In many cases, the URI indicates a web page hosted at the AS, allowing the AS to authenticate the end user as the RO and interactively provide consent. The URI value is used to identify the grant request being authorized. If the URI cannot be associated with a currently active request, the AS MUST display an error to the RO and MUST NOT attempt to redirect the RO back to any client instance even if a redirect finish method is supplied. If the URI is not hosted by the AS directly, the means of communication between the AS and this URI are out of scope for this specification. The client instance MUST NOT modify the URI when launching it, in particular the client instance MUST NOT add any parameters to the URI. The URI MUST be reachable from the end user's browser, though the URI MAY be opened on a separate device from the client instance itself. The URI MUST be accessible from an HTTP GET request and MUST be protected by HTTPS or equivalent means.

Interaction at the Static User Code URI When the end user is directed to enter a short code through the "user_code" mode, the client instance communicates the user code to the end user and directs the end user to enter that code at an associated URI. This mode is used when the client instance is not able to communicate or facilitate launching an arbitrary URI. The associated URI could be statically configured with the client instance or in the client software's documentation. As a consequence, these URIs SHOULD be short. The user code URI MUST be reachable from the end user's browser, though the URI is usually be opened on a separate device from the client instance itself. Since it is designed to be typed in, the URI SHOULD be accessible from an HTTP GET request and MUST be protected by HTTPS or equivalent means. In many cases, the URI indicates a web page hosted at the AS, allowing the AS to authenticate the end user as the RO and interactively provide consent. The value of the user code is used to identify the grant request being authorized. If the user code cannot be associated with a currently active request, the AS MUST display an error to the RO and MUST NOT attempt to redirect the RO back to any client instance even if a redirect finish method is supplied. If the interaction component at the user code URI is not hosted by the AS directly, the means of communication between the AS and this URI, including communication of the user code itself, are out of scope for this specification. When the RO enters this code at the user code URI, the AS MUST uniquely identify the pending request that the code was associated with. If the AS does not recognize the entered code, the interaction component MUST display an error to the user. If the AS detects too many unrecognized code enter attempts, the interaction component SHOULD display an error to the user and MAY take additional actions such as slowing down the input interactions. The user should be warned as such an error state is approached, if possible.

Interaction at a Dynamic User Code URI When the end user is directed to enter a short code through the "user_code_uri" mode, the client instance communicates the user code and associated URI to the end user and directs the end user to enter that code at the URI. This mode is used when the client instance is not able to facilitate launching an arbitrary URI but can communicate arbitrary values like URIs. As a consequence, these URIs SHOULD be short. The client instance MUST NOT modify the URI when communicating it to the end user; in particular the client instance MUST NOT add any parameters to the URI. The user code URI MUST be reachable from the end user's browser, though the URI is usually be opened on a separate device from the client instance itself. Since it is designed to be typed in, the URI SHOULD be accessible from an HTTP GET request and MUST be protected by HTTPS or equivalent means. In many cases, the URI indicates a web page hosted at the AS, allowing the AS to authenticate the end user as the RO and interactively provide consent. The value of the user code is used to identify the grant request being authorized. If the user code cannot be associated with a currently active request, the AS MUST display an error to the RO and MUST NOT attempt to redirect the RO back to any client instance even if a redirect finish method is supplied. If the interaction component at the user code URI is not hosted by the AS directly, the means of communication between the AS and this URI, including communication of the user code itself, are out of scope for this specification. When the RO enters this code at the given URI, the AS MUST uniquely identify the pending request that the code was associated with. If the AS does not recognize the entered code, the interaction component MUST display an error to the user. If the AS detects too many unrecognized code enter attempts, the interaction component SHOULD display an error to the user and MAY take additional actions such as slowing down the input interactions. The user should be warned as such an error state is approached, if possible.

Interaction through an Application URI When the client instance is directed to launch an application through the "app" mode, the client launches the URI as appropriate to the system, such as through a deep link or custom URI scheme registered to a mobile application. The means by which the AS and the launched application communicate with each other and perform any of the required actions are out of scope for this specification.

Post-Interaction Completion If an interaction "finish" method is associated with the current request, the AS MUST follow the appropriate method at upon completion of interaction in order to signal the client instance to continue, except for some limited error cases discussed below. If a finish method is not available, the AS SHOULD instruct the RO to return to the client instance upon completion. The AS MUST create an interaction reference and associate that reference with the current interaction and the underlying pending request. This interaction reference value MUST be sufficiently random so as not to be guessable by an attacker. The interaction reference MUST be one-time-use to prevent interception and replay attacks. The AS MUST calculate a hash value based on the client instance and AS nonces and the interaction reference, as described in . The client instance will use this value to validate the "finish" call. The AS MUST send the hash and interaction reference based on the interaction finish mode as described in the following sections. Note that the "finish" method still occurs in many error cases, such as when the RO has denied access. This pattern allows the client instance to potentially recover from the error state by modifying its request or providing additional information directly to the AS in a continuation request. The AS MUST NOT follow the "finish" method in the following circumstances: The AS has determined that any URIs involved with the finish method are dangerous or blocked. The AS cannot determine which ongoing grant request is being referenced. The ongoing grant request has been cancelled or otherwise blocked.

Completing Interaction with a Browser Redirect to the Callback URI When using the redirect interaction finish method, the AS signals to the client instance that interaction is complete and the request can be continued by directing the RO (in their browser) back to the client instance's redirect URI sent in the callback request. The AS secures this redirect by adding the hash and interaction reference as query parameters to the client instance's redirect URI.

hash:: The interaction hash value as described in . REQUIRED.
interact_ref:: The interaction reference generated for this interaction. REQUIRED.

The means of directing the RO to this URI are outside the scope of this specification, but common options include redirecting the RO from a web page and launching the system browser with the target URI. See for considerations on which HTTP status code to use when redirecting a request that potentially contains credentials.

When receiving the request, the client instance MUST parse the query parameters to calculate and validate the hash value as described in . If the hash validates, the client instance sends a continuation request to the AS as described in using the interaction reference value received here.

Completing Interaction with a Direct HTTP Request Callback When using the push interaction finish method, the AS signals to the client instance that interaction is complete and the request can be continued by sending an HTTP POST request to the client instance's callback URI sent in the callback request. The entity message body is a JSON object consisting of the following two fields:

hash (string):: The interaction hash value as described in . REQUIRED.
interact_ref (string): The interaction reference generated for this interaction. REQUIRED.

When processing such a call, the AS MUST protect itself against SSRF attacks as discussed in . When receiving the request, the client instance MUST parse the JSON object and validate the hash value as described in . If the hash validates, the client instance sends a continuation request to the AS as described in using the interaction reference value received here.

Calculating the interaction hash The "hash" parameter in the request to the client instance's callback URI ties the front channel response to an ongoing request by using values known only to the parties involved. This security mechanism allows the client instance to protect itself against several kinds of session fixation and injection attacks. The AS MUST always provide this hash, and the client instance MUST validate the hash when received. To calculate the "hash" value, the party doing the calculation creates a hash string by concatenating the following values in the following order using a single newline (\n) character to separate them: the "nonce" value sent by the client instance in the interaction "finish" section of the initial request the AS's nonce value from the interaction finish response the "interact_ref" returned from the AS as part of the interaction finish method the grant endpoint URI the client instance used to make its initial request There is no padding or whitespace before or after any of the lines, and no trailing newline character.

The party then hashes this string with the appropriate algorithm based on the "hash_method" parameter under the "finish" key. If the "hash_method" value is not present in the client instance's request, the algorithm defaults to "sha3". [[ See issue #56 ]]

SHA3-512 The "sha3" hash method consists of hashing the input string with the 512-bit SHA3 algorithm. The byte array is then encoded using URL Safe Base64 with no padding . The resulting string is the hash value.

SHA2-512 The "sha2" hash method consists of hashing the input string with the 512-bit SHA2 algorithm. The byte array is then encoded using URL Safe Base64 with no padding . The resulting string is the hash value.

Continuing a Grant Request While it is possible for the AS to return a grant response with all the client instance's requested information (including access tokens and direct user information), it's more common that the AS and the client instance will need to communicate several times over the lifetime of an access grant. This is often part of facilitating interaction, but it could also be used to allow the AS and client instance to continue negotiating the parameters of the original grant request. To enable this ongoing negotiation, the AS provides a continuation API to the client software. The AS returns a continue field in the response that contains information the client instance needs to access this API, including a URI to access as well as a continuation access token to use during the requests. The continuation access token is initially bound to the same key and method the client instance used to make the initial request. As a consequence, when the client instance makes any calls to the continuation URI, the client instance MUST present the continuation access token as described in and present proof of the client instance's key (or its most recent rotation) by signing the request as described in . The AS MUST validate all keys presented by the client instance or referenced in an ongoing request for each call within that request. Access tokens other than the continuation access tokens MUST NOT be usable for continuation requests. [[ See issue #85 ]] For example, here the client instance makes a POST request to a unique URI and signs the request with HTTP Message Signatures:

The AS MUST be able to tell from the client instance's request which specific ongoing request is being accessed, using a combination of the continuation URI, the provided continuation access token, and the client instance identified by the key signature. If the AS cannot determine a single active grant request to map the continuation request to, the AS MUST return an error. The ability to continue an already-started request allows the client instance to perform several important functions, including presenting additional information from interaction, modifying the initial request, and getting the current state of the request. All requests to the continuation API are protected by this bound continuation access token. For example, here the client instance makes a POST request to a stable continuation endpoint URI with the interaction reference, includes the access token, and signs with HTTP Message Signatures:

If a wait parameter was included in the continuation response, the client instance MUST NOT call the continuation URI prior to waiting the number of seconds indicated. If no wait period is indicated, the client instance MUST NOT poll immediately and SHOULD wait at least 5 seconds. If the client instance does not respect the given wait period, the AS MUST return the error too_fast defined in . The response from the AS is a JSON object and MAY contain any of the fields described in , as described in more detail in the sections below. If the AS determines that the client instance can make a further continuation request, the AS MUST include a new "continue" response. The new continue response MUST include a continuation access token as well, and this token SHOULD be a new access token, invalidating the previous access token. If the AS does not return a new continue response, the client instance MUST NOT make an additional continuation request. If a client instance does so, the AS MUST return an error. [[ See issue #87 ]] For continuation functions that require the client instance to send a message body, the body MUST be a JSON object.

Continuing After a Completed Interaction When the AS responds to the client instance's finish method as in , this response includes an interaction reference. The client instance MUST include that value as the field interact_ref in a POST request to the continuation URI.

Since the interaction reference is a one-time-use value as described in , if the client instance needs to make additional continuation calls after this request, the client instance MUST NOT include the interaction reference. If the AS detects a client instance submitting the same interaction reference multiple times, the AS MUST return an error and SHOULD invalidate the ongoing request. The grant response MAY contain any newly-created access tokens or newly-released subject claims. The response MAY contain a new "continue" response as described above. The response SHOULD NOT contain any interaction responses. [[ See issue #89 ]] For example, if the request is successful in causing the AS to issue access tokens and release opaque subject claims, the response could look like this:

With this example, the client instance can not make an additional continuation request because a continue field is not included. [[ See issue #88 ]]

Continuing During Pending Interaction When the client instance does not include a finish parameter, the client instance will often need to poll the AS until the RO has authorized the request. To do so, the client instance makes a POST request to the continuation URI as in , but does not include a message body.

The grant response MAY contain any newly-created access tokens or newly-released subject claims. The response MAY contain a new "continue" response as described above. If a continue field is included, it SHOULD include a wait field to facilitate a reasonable polling rate by the client instance. The response SHOULD NOT contain interaction responses. For example, if the request has not yet been authorized by the RO, the AS could respond by telling the client instance to make another continuation request in the future. In this example, a new, unique access token has been issued for the call, which the client instance will use in its next continuation request.

[[ See issue #90 ]] [[ See issue #91 ]] If the request is successful in causing the AS to issue access tokens and release subject claims, the response could look like this example:

See for considerations on polling for continuation without an interaction finish method.

Modifying an Existing Request The client instance might need to modify an ongoing request, whether or not tokens have already been issued or claims have already been released. In such cases, the client instance makes an HTTP PATCH request to the continuation URI and includes any fields it needs to modify. Fields that aren't included in the request are considered unchanged from the original request. The client instance MAY include the access_token and subject fields as described in and . Inclusion of these fields override any values in the initial request, which MAY trigger additional requirements and policies by the AS. For example, if the client instance is asking for more access, the AS could require additional interaction with the RO to gather additional consent. If the client instance is asking for more limited access, the AS could determine that sufficient authorization has been granted to the client instance and return the more limited access rights immediately. [[ See issue #92 ]] The client instance MAY include the interact field as described in . Inclusion of this field indicates that the client instance is capable of driving interaction with the RO, and this field replaces any values from a previous request. The AS MAY respond to any of the interaction responses as described in , just like it would to a new request. The client instance MAY include the user field as described in to present new assertions or information about the end user. [[ See issue #93 ]] The client instance MUST NOT include the client section of the request. [[ See issue #94 ]] The client instance MAY include post-interaction responses such as described in . [[ See issue #95 ]] Modification requests MUST NOT alter previously-issued access tokens. Instead, any access tokens issued from a continuation are considered new, separate access tokens. The AS MAY revoke existing access tokens after a modification has occurred. [[ See issue #96 ]] If the modified request can be granted immediately by the AS, the grant response MAY contain any newly-created access tokens or newly-released subject claims. The response MAY contain a new "continue" response as described above. If interaction can occur, the response SHOULD contain interaction responses as well. For example, a client instance initially requests a set of resources using references:

Access is granted by the RO, and a token is issued by the AS. In its final response, the AS includes a continue field, which includes a separate access token for accessing the continuation API:

This continue field allows the client instance to make an eventual continuation call. In the future, the client instance realizes that it no longer needs "write" access and therefore modifies its ongoing request, here asking for just "read" access instead of both "read" and "write" as before.

The AS replaces the previous access from the first request, allowing the AS to determine if any previously-granted consent already applies. In this case, the AS would likely determine that reducing the breadth of the requested access means that new access tokens can be issued to the client instance. The AS would likely revoke previously-issued access tokens that had the greater access rights associated with them, unless they had been issued with the durable flag.

For another example, the client instance initially requests read-only access but later needs to step up its access. The initial request could look like this example.

Access is granted by the RO, and a token is issued by the AS. In its final response, the AS includes a continue field:

This allows the client instance to make an eventual continuation call. The client instance later realizes that it now needs "write" access in addition to the "read" access. Since this is an expansion of what it asked for previously, the client instance also includes a new interaction section in case the AS needs to interact with the RO again to gather additional authorization. Note that the client instance's nonce and callback are different from the initial request. Since the original callback was already used in the initial exchange, and the callback is intended for one-time-use, a new one needs to be included in order to use the callback again.

From here, the AS can determine that the client instance is asking for more than it was previously granted, but since the client instance has also provided a mechanism to interact with the RO, the AS can use that to gather the additional consent. The protocol continues as it would with a new request. Since the old access tokens are good for a subset of the rights requested here, the AS might decide to not revoke them. However, any access tokens granted after this update process are new access tokens and do not modify the rights of existing access tokens.

Canceling a Grant Request If the client instance wishes to cancel an ongoing grant request, it makes an HTTP DELETE request to the continuation URI.

If the request is successfully cancelled, the AS responds with an HTTP 202. The AS SHOULD revoke all associated access tokens.

Token Management If an access token response includes the manage parameter as described in , the client instance MAY call this URI to manage the access token with any of the actions defined in the following sections: rotate and revoke. Other actions are undefined by this specification. The access token being managed acts as the access element for its own management API. The client instance MUST present proof of an appropriate key along with the access token. If the token is sender-constrained (i.e., not a bearer token), it MUST be sent with the appropriate binding for the access token. If the token is a bearer token, the client instance MUST present proof of the same key identified in the initial request as described in . The AS MUST validate the proof and assure that it is associated with either the token itself or the client instance the token was issued to, as appropriate for the token's presentation type.

Rotating the Access Token If the client instance has an access token and that access token expires, the client instance might want to rotate the access token. Rotating an access token consists of issuing a new access token in place of an existing access token, with the same rights and properties as the original token, apart from an updated expiration time. To rotate an access token, the client instance makes an HTTP POST to the token management URI, sending the access token in the appropriate header and signing the request with the appropriate key.

The AS validates that the token presented is associated with the management URI, that the AS issued the token to the given client instance, and that the presented key is appropriate to the token. Note that in many cases, the access token will have expired for regular use. To facilitate token rotation, the AS SHOULD honor the rotation request of the expired access token since it is likely that the client instance is attempting to refresh the expired token. To support this, the AS MAY allow a longer lifetime for token management compared to its use at an RS. An AS MUST NOT honor a rotation request for an access token that has been revoked or otherwise disabled. If the token is validated and the key is appropriate for the request, the AS MUST invalidate the current access token associated with this URI, if possible. Note that stateless access tokens can make proactive revocation difficult within a system, see . The AS responds with an HTTP 200 with a JSON body consisting of the rotated access token in the access_token field described in . The value of the access token MUST NOT be the same as the current value of the access token used to access the management API. The response MUST include an access token management URI, and the value of this URI MAY be different from the URI used by the client instance to make the rotation call. The client instance MUST use this new URI to manage the rotated access token. The access rights in the access array for the rotated access token MUST be included in the response and MUST be the same as the token before rotation. If the client instance requires different access rights, the client instance can request a new access token by creating a new request or by updating an existing grant request.

[[ See issue #103 ]]

Revoking the Access Token If the client instance wishes to revoke the access token proactively, such as when a user indicates to the client instance that they no longer wish for it to have access or the client instance application detects that it is being uninstalled, the client instance can use the token management URI to indicate to the AS that the AS should invalidate the access token for all purposes. The client instance makes an HTTP DELETE request to the token management URI, presenting the access token and signing the request with the appropriate key.

If the key presented is associated with the token (or the client instance, in the case of a bearer token), the AS MUST invalidate the access token, if possible, and return an HTTP 204 response code.

Though the AS MAY revoke an access token at any time for any reason, the token management function is specifically for the client instance's use. If the access token has already expired or has been revoked through other means, the AS SHOULD honor the revocation request to the token management URI as valid, since the end result is still the token not being usable.

Securing Requests from the Client Instance In GNAP, the client instance secures its requests to the AS and RS by presenting an access token, presenting proof of a key that it possesses (aka, a "key proof"), or both an access token and key proof together. When an access token is used with a key proof, this is a bound token request. This type of request is used for calls to the RS as well as the AS during negotiation. When a key proof is used with no access token, this is a non-authorized signed request. This type of request is used for calls to the AS to initiate a negotiation. When an access token is used with no key proof, this is a bearer token request. This type of request is used only for calls to the RS, and only with access tokens that are not bound to any key as described in . When neither an access token nor key proof are used, this is an unsecured request. This type of request is used optionally for calls to the RS as part of an RS-first discovery process as described in .

Key Formats Several different places in GNAP require the presentation of key material by value. Proof of this key material MUST be bound to a request, the nature of which varies with the location in the protocol the key is used. For a key used as part of a client instance's initial request in , the key value is the client instance's public key, and proof of that key MUST be presented in that request. For a key used as part of an access token response in , the proof of that key MUST be used when presenting the access token. A key presented by value MUST be a public key in at least one supported format. If a key is sent in multiple formats, all the key format values MUST be equivalent. Note that while most formats present the full value of the public key, some formats present a value cryptographically derived from the public key.

proof (string):: The form of proof that the client instance will use when presenting the key. The valid values of this field and the processing requirements for each are detailed in . REQUIRED.
jwk (object):: The public key and its properties represented as a JSON Web Key . A JWK MUST contain the alg (Algorithm) and kid (Key ID) parameters. The alg parameter MUST NOT be "none". The x5c (X.509 Certificate Chain) parameter MAY be used to provide the X.509 representation of the provided public key. OPTIONAL.
cert (string):: PEM serialized value of the certificate used to sign the request, with optional internal whitespace per . The PEM header and footer are optionally removed. OPTIONAL.
cert#S256 (string):: The certificate thumbprint calculated as per OAuth-MTLS in base64 URL encoding. Note that this format does not include the full public key. OPTIONAL.

Additional key formats are defined in a registry TBD. This non-normative example shows a single key presented in multiple formats. This example key is intended to be used with the HTTP Message Signatures proofing mechanism, as indicated by the httpsig value of the proof field.

Key References Keys in GNAP can also be passed by reference such that the party receiving the reference will be able to determine the appropriate keying material for use in that part of the protocol.

Keys referenced in this manner MAY be shared symmetric keys. The key reference MUST NOT contain any unencrypted private or shared symmetric key information. Keys referenced in this manner MUST be bound to a single proofing mechanism. The means of dereferencing this value are out of scope for this specification. Commonly, key references are created by the AS and are not necessarily needed to be dereferencable by the client. These types of key references are an internal reference to the AS, such as an identifier of a record in a database. In other applications, it can be useful to use key references that are resolvable by both clients and ASs, which could be accomplished by e.g. a client publishing a public key at a URI. For interoperability, this method could later be described as an extension.

Key Protection The security of GNAP relies on the cryptographic security of the keys themselves. When symmetric keys are used in GNAP, a key management system or secure key derivation mechanism MUST be used to supply the keys. Symmetric keys MUST NOT be a human memorable password or a value derived from one. Symmetric keys MUST NOT be passed by value from the client instance to the AS.

Presenting Access Tokens The method the client instance uses to send an access token depends on whether the token is bound to a key, and if so which proofing method is associated with the key. This information is conveyed by the key parameter and the bearer flag in the single and multiple access tokens responses. If the flags field does not contain the bearer flag and the key is absent, the access token MUST be sent using the same key and proofing mechanism that the client instance used in its initial request (or its most recent rotation). If the flags field does not contain the bearer flag and the key value is an object as described in , the access token MUST be sent using the key and proofing mechanism defined by the value of the proof field within the key object. The access token MUST be sent using the HTTP "Authorization" request header field and the "GNAP" authorization scheme along with a key proof as described in for the key bound to the access token. For example, an "httpsig"-bound access token is sent as follows:

If the flags field contains the bearer flag, the access token is a bearer token that MUST be sent using the Authorization Request Header Field method defined in .

The Form-Encoded Body Parameter and URI Query Parameter methods of MUST NOT be used. [[ See issue #104 ]] The client software MUST reject as an error a situation where the flags field contains the bearer flag and the key field is present with any value.

Proving Possession of a Key with a Request Any keys presented by the client instance to the AS or RS MUST be validated as part of the request in which they are presented. The type of binding used is indicated by the proof parameter of the key object in . Values defined by this specification are as follows:

"httpsig":: HTTP Signing signature headers. See .
"mtls":: Mutual TLS certificate verification. See .
"jwsd":: A detached JWS signature header. See .
"jws":: Attached JWS payload. See .

Additional proofing methods are defined by a registry TBD. All key binding methods used by this specification MUST cover all relevant portions of the request, including anything that would change the nature of the request, to allow for secure validation of the request. Relevant aspects include the URI being called, the HTTP method being used, any relevant HTTP headers and values, and the HTTP message body itself. The verifier of the signed message MUST validate all components of the signed message to ensure that nothing has been tampered with or substituted in a way that would change the nature of the request. Key binding method definitions SHOULD enumerate how these requirements are fulfilled. When a key proofing mechanism is bound to an access token, the key being presented MUST be the key associated with the access token and the access token MUST be covered by the signature method of the proofing mechanism. The key binding methods in this section MAY be used by other components making calls as part of GNAP, such as the extensions allowing the RS to make calls to the AS defined in . To facilitate this extended use, the sections below are defined in generic terms of the "signer" and "verifier" of the HTTP message. In the core functions of GNAP, the "signer" is the client instance and the "verifier" is the AS or RS, as appropriate. When used for delegation in GNAP, these key binding mechanisms allow the AS to ensure that the keys presented by the client instance in the initial request are in control of the party calling any follow-up or continuation requests. To facilitate this requirement, the continuation response includes an access token bound to the client instance's key, and that key (or its most recent rotation) MUST be proved in all continuation requests . Token management requests are similarly bound to either the access token's own key or, in the case of bearer tokens, the client instance's key. [[ See issue #105 ]] In the following sections, unless otherwise noted, the RS256 JOSE Signature Algorithm is applied using the following RSA key (presented here in JWK format):

HTTP Message Signing This method is indicated by httpsig in the proof field. The signer creates an HTTP Message Signature as described in . The covered components of the signature MUST include the following:

"@method":: The method used in the HTTP request.
"@target-uri":: The full request URI of the HTTP request.

When the message contains a request body, the covered components MUST also include the following:

"content-digest":: The Content-Digest header as defined in . When the request message has a body, the signer MUST calculate this header value and the verifier MUST validate this field value. Use of content-encoding agnostic digest methods (such as sha-256) is RECOMMENDED.

When the request is bound to an access token, the covered components MUST also include the following:

"authorization":: The Authorization header used to present the access token as discussed in .

Other message components MAY also be included. If the signer's key presented is a JWK, the keyid parameter of the signature MUST be set to the kid value of the JWK, the signing algorithm used MUST be the JWS algorithm denoted by the key's alg field, and the explicit alg signature parameter MUST NOT be included. In this example, the message body is the following JSON object:

This body is hashed for the Content-Digest header using sha-256 into the following encoded value:

The HTTP message signature input string is calculated to be the following:

This leads to the following full HTTP message request:

If the HTTP Message includes a message body, the verifier MUST calculate and verify the value of the Digest or Content-Digest header. The verifier MUST ensure that the signature covers all required message components. The verifier MUST validate the signature against the expected key of the signer.

Mutual TLS This method is indicated by mtls in the proof field. The signer presents its TLS client certificate during TLS negotiation with the verifier. In this example, the certificate is communicated to the application through the Client-Cert header from a TLS reverse proxy, leading to the following full HTTP request message:

The verifier compares the TLS client certificate presented during mutual TLS negotiation to the expected key of the signer. Since the TLS connection covers the entire message, there are no additional requirements to check. Note that in many instances, the verifier will not do a full certificate chain validation of the presented TLS client certificate, as the means of trust for this certificate could be in something other than a PKI system, such as a static registration or trust-on-first-use. See and for some additional considerations for this key proofing method.

Detached JWS This method is indicated by jwsd in the proof field. A JWS object is created as follows: To protect the request, the JOSE header of the signature contains the following claims:

kid (string):: The key identifier. REQUIRED if the key is presented in JWK format, this MUST be the value of the kid field of the key.
alg (string):: The algorithm used to sign the request. MUST be appropriate to the key presented. If the key is presented as a JWK, this MUST be equal to the alg parameter of the key. MUST NOT be none. REQUIRED.
typ (string):: The type header, value ”gnap-binding+jwsd”. REQUIRED.
htm (string):: The HTTP Method used to make this request, as a case-sensitive ASCII string. Note that most public HTTP methods are in uppercase ASCII by convention. REQUIRED.
uri (string):: The HTTP URI used for this request, including all path and query components and no fragment component. REQUIRED.
created (integer):: A timestamp of when the signature was created, in integer seconds since UNIX Epoch. REQUIRED.

When the request is bound to an access token, the JOSE header MUST also include the following:

ath (string):: The hash of the access token. The value MUST be the result of Base64url encoding (with no padding) the SHA-256 digest of the ASCII encoding of the associated access token's value. REQUIRED.

If the HTTP request has a message body, such as an HTTP POST or PUT method, the payload of the JWS object is the Base64url encoding (without padding) of the SHA256 digest of the bytes of the body. If the request being made does not have a message body, such as an HTTP GET, OPTIONS, or DELETE method, the JWS signature is calculated over an empty payload. The signer presents the signed object in compact form in the Detached-JWS HTTP Header field. In this example, the JOSE Header contains the following parameters:

The request body is the following JSON object:

This is hashed to the following Base64 encoded value:

This leads to the following full HTTP request message:

When the verifier receives the Detached-JWS header, it MUST parse and validate the JWS object. The signature MUST be validated against the expected key of the signer. All required fields MUST be present and their values MUST be valid. If the HTTP message request contains a body, the verifier MUST calculate the hash of body just as the signer does, with no normalization or transformation of the request.

Attached JWS This method is indicated by jws in the proof field. A JWS object is created as follows: To protect the request, the JWS header contains the following claims.

kid (string):: The key identifier. REQUIRED if the key is presented in JWK format, this MUST be the value of the kid field of the key.
alg (string):: The algorithm used to sign the request. MUST be appropriate to the key presented. If the key is presented as a JWK, this MUST be equal to the alg parameter of the key. MUST NOT be none. REQUIRED.
typ (string):: The type header, value ”gnap-binding+jwsd”. REQUIRED.
htm (string):: The HTTP Method used to make this request, as a case-sensitive ASCII string. (Note that most public HTTP methods are in uppercase.) REQUIRED.
uri (string):: The HTTP URI used for this request, including all path and query components and no fragment component. REQUIRED.
created (integer):: A timestamp of when the signature was created, in integer seconds since UNIX Epoch. REQUIRED.

When the request is bound to an access token, the JOSE header MUST also include the following:

ath (string):: The hash of the access token. The value MUST be the result of Base64url encoding (with no padding) the SHA-256 digest of the ASCII encoding of the associated access token's value. REQUIRED.

If the HTTP request has a message body, such as an HTTP POST or PUT method, the payload of the JWS object is the JSON serialized body of the request, and the object is signed according to JWS and serialized into compact form . The signer presents the JWS as the body of the request along with a content type of application/jose. The verifier MUST extract the payload of the JWS and treat it as the request body for further processing. If the request being made does not have a message body, such as an HTTP GET, OPTIONS, or DELETE method, the JWS signature is calculated over an empty payload and passed in the Detached-JWS header as described in . In this example, the JOSE header contains the following parameters:

The request body, used as the JWS Payload, is the following JSON object:

This leads to the following full HTTP request message:

When the verifier receives an attached JWS request, it MUST parse and validate the JWS object. The signature MUST be validated against the expected key of the signer. All required fields MUST be present and their values MUST be valid. If the HTTP message request contains a body, the verifier MUST decode the payload of the JWS object and treat this as the HTTP message body.

Resource Access Rights GNAP provides a rich structure for describing the protected resources hosted by RSs and accessed by client software. This structure is used when the client instance requests an access token and when an access token is returned. The root of this structure is a JSON array. The elements of the JSON array represent rights of access that are associated with the the access token. The resulting access is the union of all elements within the array. The access associated with the access token is described using objects that each contain multiple dimensions of access. Each object contains a REQUIRED type property that determines the type of API that the token is used for.

type (string):: The type of resource request as a string. This field MAY define which other fields are allowed in the request object. REQUIRED.

The value of the type field is under the control of the AS. This field MUST be compared using an exact byte match of the string value against known types by the AS. The AS MUST ensure that there is no collision between different authorization data types that it supports. The AS MUST NOT do any collation or normalization of data types during comparison. It is RECOMMENDED that designers of general-purpose APIs use a URI for this field to avoid collisions between multiple API types protected by a single AS. While it is expected that many APIs will have their own properties, a set of common properties are defined here. Specific API implementations SHOULD NOT re-use these fields with different semantics or syntax. The available values for these properties are determined by the API being protected at the RS. All values are OPTIONAL at the discretion of the API definition.

actions (array of strings):: The types of actions the client instance will take at the RS as an array of strings. For example, a client instance asking for a combination of "read" and "write" access.
locations (array of strings):: The location of the RS as an array of strings. These strings are typically URIs identifying the location of the RS.
datatypes (array of strings):: The kinds of data available to the client instance at the RS's API as an array of strings. For example, a client instance asking for access to raw "image" data and "metadata" at a photograph API.
identifier (string):: A string identifier indicating a specific resource at the RS. For example, a patient identifier for a medical API or a bank account number for a financial API.
privileges (array of strings):: The types or levels of privilege being requested at the resource. For example, a client instance asking for administrative level access, or access when the resource owner is no longer online.

The following non-normative example is describing three kinds of access (read, write, delete) to each of two different locations and two different data types (metadata, images) for a single access token using the fictitious photo-api type definition.

The access requested for a given object when using these fields is the cross-product of all fields of the object. That is to say, the object represents a request for all actions listed to be used at all locations listed for all possible datatypes listed within the object. Assuming the request above was granted, the client instance could assume that it would be able to do a read action against the images on the first server as well as a delete action on the metadata of the second server, or any other combination of these fields, using the same access token. To request a different combination of access, such as requesting one of the possible actions against one of the possible locations and a different choice of possible actions against a different one of the possible locations, the client instance can include multiple separate objects in the resources array. The following non-normative example uses the same fictitious photo-api type definition to request a single access token with more specifically targeted access rights by using two discrete objects within the request.

The access requested here is for read access to images on one server while simultaneously requesting write and delete access for metadata on a different server, but importantly without requesting write or delete access to images on the first server. It is anticipated that API designers will use a combination of common fields defined in this specification as well as fields specific to the API itself. The following non-normative example shows the use of both common and API-specific fields as part of two different fictitious API type values. The first access request includes the actions, locations, and datatypes fields specified here as well as the API-specific geolocation field. The second access request includes the actions and identifier fields specified here as well as the API-specific currency field.

If this request is approved, the resulting access token's access rights will be the union of the requested types of access for each of the two APIs, just as above.

Requesting Resources By Reference Instead of sending an object describing the requested resource, access rights MAY be communicated as a string known to the AS or RS representing the access being requested. Each string SHOULD correspond to a specific expanded object representation at the AS.

This value is opaque to the client instance and MAY be any valid JSON string, and therefore could include spaces, unicode characters, and properly escaped string sequences. However, in some situations the value is intended to be seen and understood by the client software's developer. In such cases, the API designer choosing any such human-readable strings SHOULD take steps to ensure the string values are not easily confused by a developer, such as by limiting the strings to easily disambiguated characters. This functionality is similar in practice to OAuth 2.0's scope parameter , where a single string represents the set of access rights requested by the client instance. As such, the reference string could contain any valid OAuth 2.0 scope value as in . Note that the reference string here is not bound to the same character restrictions as in OAuth 2.0's scope definition. A single access array MAY include both object-type and string-type resource items. In this non-normative example, the client instance is requesting access to a photo-api and financial-transaction API type as well as the reference values of read, dolphin-metadata, and some other thing.

The requested access is the union of all elements of the array, including both objects and reference strings.

Discovery By design, the protocol minimizes the need for any pre-flight discovery. To begin a request, the client instance only needs to know the endpoint of the AS and which keys it will use to sign the request. Everything else can be negotiated dynamically in the course of the protocol. However, the AS can have limits on its allowed functionality. If the client instance wants to optimize its calls to the AS before making a request, it MAY send an HTTP OPTIONS request to the grant request endpoint to retrieve the server's discovery information. The AS MUST respond with a JSON document with Content-Type application/json containing a single object with the following information:

grant_request_endpoint (string):: The location of the AS's grant request endpoint. The location MUST be a URL with a scheme component that MUST be https, a host component, and optionally, port, path and query components and no fragment components. This URL MUST match the URL the client instance used to make the discovery request. REQUIRED.
interaction_start_modes_supported (array of strings):: A list of the AS's interaction start methods. The values of this list correspond to the possible values for the interaction start section of the request. OPTIONAL.
interaction_finish_methods_supported (array of strings):: A list of the AS's interaction finish methods. The values of this list correspond to the possible values for the method element of the interaction finish section of the request. OPTIONAL.
key_proofs_supported (array of strings):: A list of the AS's supported key proofing mechanisms. The values of this list correspond to possible values of the proof field of the key section of the request. OPTIONAL.
sub_id_formats_supported (array of strings):: A list of the AS's supported subject identifier formats. The values of this list correspond to possible values of the subject identifier section of the request. OPTIONAL.
assertion_formats_supported (array of strings):: A list of the AS's supported assertion formats. The values of this list correspond to possible values of the subject assertion section of the request. OPTIONAL.

The information returned from this method is for optimization purposes only. The AS MAY deny any request, or any portion of a request, even if it lists a capability as supported. For example, a given client instance can be registered with the mtls key proofing mechanism, but the AS also returns other proofing methods from the discovery document, then the AS will still deny a request from that client instance using a different proofing mechanism.

RS-first Method of AS Discovery If the client instance calls an RS without an access token, or with an invalid access token, the RS MAY respond to the client instance with an authentication header indicating that GNAP needs to be used to access the resource. The address of the GNAP endpoint MUST be sent in the "as_uri" parameter. The RS MAY additionally return a resource reference that the client instance MAY use in its access token request. This resource reference MUST be sufficient for at least the action the client instance was attempting to take at the RS and MAY be more powerful. The means for the RS to determine the resource reference are out of scope of this specification, but some dynamic methods are discussed in . The content of the resource reference is opaque to the client instance.

The client instance then makes a request to the "as_uri" as described in , with the value of "access" as one of the members of the access array in the access_token portion of the request. The client instance MAY request additional resources and other information. The client instance MAY request multiple access tokens. In this non-normative example, the client instance is requesting a single access token using the resource reference FWWIKYBQ6U56NL1 received from the RS in addition to the dolphin-metadata resource reference that the client instance has been configured with out of band.

If issued, the resulting access token would contain sufficient access to be used at both referenced resources.

Acknowledgements The editors would like to thank the feedback of the following individuals for their reviews, implementations, and contributions: , Aaron Parecki, Adam Omar Oueidat, Andrii Deinega, Annabelle Backman, Dick Hardt, Dmitri Zagidulin, Dmitry Barinov, Fabien Imbault, Florian Helmschmidt, Francis Pouatcha, George Fletcher, Haardik Haardik, Hamid Massaoud, Jacky Yuan, Joseph Heenan, Justin Richer, Kathleen Moriarty, Mike Jones, Mike Varley, Nat Sakimura, Takahiko Kawasaki, Takahiro Tsuchiya. The editors would also like to thank the GNAP working group design team of Kathleen Moriarty, Fabien Imbault, Dick Hardt, Mike Jones, and Justin Richer, who incorporated elements from the XAuth and XYZ proposals to create the first version of this document. In addition, the editors would like to thank Aaron Parecki and Mike Jones for insights into how to integrate identity and authentication systems into the core protocol, and Justin Richer and Dick Hardt for the use cases, diagrams, and insights provided in the XYZ and XAuth proposals that have been incorporated here. The editors would like to especially thank Mike Varley and the team at SecureKey for feedback and development of early versions of the XYZ protocol that fed into this standards work.

IANA Considerations [[ TBD: There are a lot of items in the document that are expandable through the use of value registries. ]]

Security Considerations

TLS Protection in Transit All requests in GNAP have to be made over TLS or equivalent as outlined in to protect the contents of the request and response from manipulation and interception by an attacker. This includes all requests from a client instance to the AS, all requests from the client instance to an RS, any requests back to a client instance such as the push-based interaction finish method, and any back-end communications such as from an RS to an AS as described in . Additionally, all requests between a browser and other components, such as during redirect-based interaction, need to be made over TLS or use equivalent protection. Even though requests from the client instance to the AS are signed, the signature method alone does not protect the request from interception by an attacker. TLS protects the response as well as the request, preventing an attacker from intercepting requested information as it is returned. This is particularly important in the core protocol for security artifacts such as nonces and for personal information such as subject information. The use of key-bound access tokens does not negate the requirement for protecting calls to the RS with TLS. While the keys and signatures associated a bound access token will prevent an attacker from using a stolen token, without TLS an attacker would be able to watch the data being sent to the RS and returned from the RS during legitimate use of the client instance under attack. Additionally, without TLS an attacker would be able to profile the calls made between the client instance and RS, possibly gaining information about the functioning of the API between the client software and RS software that would be otherwise unknown to the attacker. TLS or equivalent protection also needs to be used between the browser and any other components. This applies during initial redirects to an AS's components during interaction, during any interaction with the resource owner, and during any redirect back to the client instance. Without TLS protection on these portions of the process, an attacker could wait for a valid request to start and then take over the resource owner's interaction session.

Signing Requests from the Client Software Even though all requests in GNAP need to be transmitted over TLS or its equivalent, the use of TLS alone is not sufficient to protect all parts of a multi-party and multi-stage protocol like GNAP, and TLS is not targeted at tying multiple requests to each other over time. To account for this, GNAP makes use of message-level protection and key presentation mechanisms that strongly associate a request with a key held by the client instance (see ). During the initial request from a client instance to the AS, the client instance has to identify and prove possession of a cryptographic key. If the key is known to the AS, such as if it is previously registered or dereferenceable to a trusted source, the AS can associate a set of policies to the client instance identified by the key. Without the requirement that the client instance prove that it holds that key, the AS could not trust that the connection came from any particular client and could not apply any associated policies. Even more importantly, the client instance proving possession of a key on the first request allows the AS to associate future requests with each other. The access token used for grant continuation is bound to the same key and proofing mechanism used by the client instance in its initial request, which means that the client instance needs to prove possession of that same key in future requests allowing the AS to be sure that the same client instance is executing the follow-ups for a given ongoing grant request. Therefore, the AS has to ensure that all subsequent requests for a grant are associated with the same key that started the grant, or the most recent rotation of that key. This need holds true even if the initial key is previously unknown to the AS, such as would be the case when a client instance creates an ephemeral key for its request. Without this ongoing association, an attacker would be able to impersonate a client instance in the midst of a grant request, potentially stealing access tokens and subject information with impunity. Additionally, all access tokens in GNAP default to be associated with the key that was presented during the grant request that created the access token. This association allows an RS to know that the presenter of the access token is the same party that the token was issued to, as identified by their keys. While non-bound bearer tokens are an option in GNAP, these types of tokens have their own tradeoffs discussed elsewhere in this section. TLS functions at the socket layer, ensuring that only the parties on either end of that socket connection can read the information passed along that connection. Each time a new socket connection is made, such as for a new HTTP request, a new trust is re-established that is unrelated to previous connections. As such, it is not possible with TLS alone to know that the same party is making a set of calls, and therefore TLS alone cannot provide the continuity of security needed for GNAP. However, mutual TLS (MTLS) does provide such security characteristics through the use of the TLS client certificate, and thus MTLS is acceptable as a key-presentation mechanism when applied as described in .

Protection of Client Instance Key Material Client instances are identified by their unique keys, and anyone with access to a client instance's key material will be able to impersonate that client instance to all parties. This is true for both calls to the AS as well as calls to an RS using a key-bound access token. Different types of client software have different methods available for creating, managing, and registering keys. GNAP explicitly allows for ephemeral clients, such as SPAs, and single-user clients, such as mobile applications, to create and present their own keys during the initial grant request. The client software can securely generate a keypair on-device and present the public key, along with proof of holding that public key, to the AS as part of the initial request. To facilitate trust in these ephemeral keys, GNAP further allows for an extensible set of client information to be passed with the request. This information can include device posture and third-party attestations of the client software's provenance and authenticity, depending on the needs and capabilities of the client software and its deployment. From GNAP's perspective, each distinct key is a different client instance. However, multiple client instances can be grouped together by an AS policy and treated similarly to each other. For instance, if an AS knows of several different keys for different servers within a cluster, the AS can decide that authorization of one of these servers applies to all other servers within the cluster. An AS that chooses to do this needs to be careful with how it groups different client keys together in its policy, since the breach of one instance would have direct effects on the others in the cluster. Additionally, if an end user controls multiple instances of a single type of client software, such as having an application installed on multiple devices, each of these instances is expected to have a separate key and be issued separate access tokens. However, if the AS is able to group these separate instances together as described above, it can streamline the authorization process for new instances of the same client software. For example, if two client instances can present proof of a valid installation of a piece of client software, the AS would be able to associate the approval of the first instance of this software to all related instances. The AS could then choose to bypass an explicit prompt of the resource owner for approval during authorization, since such approval has already been given. An AS doing such a process would need to take assurance measures that the different instances are in fact correlated and authentic, as well as ensuring the expected resource owner is in control of the client instance. Finally, if multiple instances of client software each have the same key, then from GNAP's perspective, these are functionally the same client instance as GNAP has no reasonable way to differentiate between them. This situation could happen if multiple instances within a cluster can securely share secret information among themselves. Even though there are multiple copies of the software, the shared key makes these copies all present as a single instance. It is considered bad practice to share keys between copies of software unless they are very tightly integrated with each other and can be closely managed. It is particularly bad practice to allow an end user to copy keys between client instances and to willingly use the same key in multiple instances.

Protection of Authorization Server The AS performs critical functions in GNAP, including authenticating client software, managing interactions with end users to gather consent and provide notice, and issuing access tokens for client instances to present to resource servers. As such, protecting the AS is central to any GNAP deployment. If an attacker is able to gain control over an AS, they would be able to create fraudulent tokens and manipulate registration information to allow for malicious clients. These tokens and clients would be trusted by other components in the ecosystem under the protection of the AS. If the AS is using signed access tokens, an attacker in control of the AS's signing keys would be able to manufacture fraudulent tokens for use at RS's under the protection of the AS. If an attacker is able to impersonate an AS, they would be able to trick legitimate client instances into making signed requests for information which could potentially be proxied to a real AS. To combat this, all communications to the AS need to be made over TLS or its equivalent, and the software making the connection has to validate the certificate chain of the host it is connecting to. Consequently, protecting, monitoring, and auditing the AS is paramount to preserving the security of a GNAP-protected ecosystem.

Symmetric and Asymmetric Client Instance Keys The cryptographic methods used by GNAP for key-proofing can support both asymmetric and symmetric cryptography, and can be extended to use a wide variety of mechanisms. While symmetric cryptographic systems have some benefits in speed and simplicity, they have a distinct drawback that both parties need access to the same key in order to do both signing and verification of the message. This means that when the client instance calls the AS to request a token, the AS needs to know the exact value of the client instance's key (or be able to derive it) in order to validate the key proof signature. With asymmetric keys, the client needs only to send its public key to the AS to allow for verification that the client holds the associated private key, regardless of whether that key was pre-registered or not with the AS. When used to bind to an access token, a key value must be known by the RS in order to validate the proof signature on the request. Common methods for communicating these proofing keys include putting information in a structured access token and allowing the RS to look up the associated key material against the value of the access token. With symmetric cryptography, both of these methods would expose the signing key to the RS, and in the case of an structured access token, potentially to any party that can see the access token itself unless the token's payload has been encrypted. Any of these parties would then be able to make calls using the access token by creating a valid signature. With asymmetric cryptography, the RS only needs to know the public key associated with the token in order to validate, and therefore cannot create any new calls. Symmetric keys also have the expected advantage of providing better protection against quantum threats in the future. Also, these types of keys (and their secure derivations) are widely supported among many cloud-based key management systems. While both signing approaches are allowed, GNAP treats these two classes of keys somewhat differently. Only the public portion of asymmetric keys are allowed to be sent by value in requests to the AS when establishing a connection. Since sending a symmetric key (or the private portion of an asymmetric key) would expose the signing material to any parties on the request path, including any attackers, sending these kinds of keys is prohibited. Symmetric keys can still be used by client instances, but only a reference to the key and not its value can be sent. This allows the AS to use pre-registered symmetric keys as well as key derivation schemes to take advantage of symmetric cryptography but without requiring key distribution at runtime, which would expose the keys in transit. Both the AS and client software can use systems such as hardware security modules to strengthen their key security storage and generation for both asymmetric and symmetric keys (see also ).

Generation of Access Tokens The content of access tokens need to be such that only the generating AS would be able to create them, and the contents cannot be manipulated by an attacker to gain different or additional access rights. One method for accomplishing this is to use a cryptographically random value for the access token, generated by the AS using a secure randomization function with sufficiently high entropy. The odds of an attacker guessing the output of the randomization function to collide with a valid access token are exceedingly small, and even then the attacker would not have any control over what the access token would represent since that information would be held close by the AS. Another method for accomplishing this is to use a structured token that is cryptographically signed. In this case, the payload of the access token declares to the RS what the token is good for, but the signature applied by the AS during token generation covers this payload. Only the AS can create such a signature and therefore only the AS can create such a signed token. The odds of an attacker being able to guess a signature value with a useful payload are exceedingly small. This technique only works if all targeted RS's check the signature of the access token. Any RS that does not validate the signature of all presented tokens would be susceptible to injection of a modified or falsified token. Furthermore, an AS has to carefully protect the keys used to sign access tokens, since anyone with access to these signing keys would be able to create seemingly-valid access tokens using them.

Bearer Access Tokens Bearer access tokens can be used by any party that has access to the token itself, without any additional information. As a natural consequence, any RS that a bearer token is presented to has the technical capability of presenting that bearer token to another RS, as long as the token is valid. It also means that any party that is able capture of the token value in storage or in transit is able to use the access token. While bearer tokens are inherently simpler, this simplicity has been misapplied and abused in making needlessly insecure systems. In GNAP, key-bound access tokens are the default due to their higher security properties. While bearer tokens can be used in GNAP, their use should be limited to cases where the simplicity benefits outweigh the significant security downsides.

Key-Bound Access Tokens Key-bound access tokens, as the name suggests, are bound to a specific key and must be presented along with proof of that key during use. The key itself is not presented at the same time as the token, so even if a token value is captured, it cannot be used to make a new request. This is particularly true for an RS, which will see the token value but will not see the keys used to make the request. Key-bound access tokens provide this additional layer of protection only when the RS checks the signature of the message presented with the token. Acceptance of an invalid presentation signature, or failure to check the signature entirely, would allow an attacker to make calls with a captured access token without having access to the related signing key material. In addition to validating the signature of the presentation message itself, the RS also needs to ensure that the signing key used is appropriate for the presented token. If an RS does not ensure that the right keys were used to sign a message with a specific token, an attacker would be able to capture an access token and sign the request with their own keys, thereby negating the benefits of using key-bound access tokens. The RS also needs to ensure that sufficient portions of the message are covered by the signature. Any items outside the signature could still affect the API's processing decisions, but these items would not be strongly bound to the token presentation. As such, an attacker could capture a valid request, then manipulate portions of the request outside of the signature envelope in order to cause unwanted actions at the protected API. Some key-bound tokens are susceptible to replay attacks, depending on the details of the signing method used. If a signature method covers only portions of a given request, that same signature proof can be used by an attacker to make a similar call, potentially even varying elements that are outside of the protection of the signature. Key proofing mechanisms used with access tokens therefore need to use replay protection mechanisms covered under the signature such as a per-message nonce, a reasonably short time validity window, or other uniqueness constraints. The details of using these will vary depending on the key proofing mechanism in use, but for example, HTTP Message Signatures has both a created and nonce signature parameter as well as the ability to cover significant portions of the HTTP message.

Exposure of End-user Credentials to Client Instance As a delegation protocol, one of the main goals of GNAP is to prevent the client software from being exposed to any credentials or information about the end user or resource owner as a requirement of the delegation process. By using the variety of interaction mechanisms, the resource owner can interact with the AS without ever authenticating to the client software, and without the client software having to impersonate the resource owner through replay of their credentials. Consequently, no interaction methods defined in the GNAP core require the end user to enter their credentials, but it is technologically possible for an extension to be defined to carry such values. Such an extension would be dangerous as it would allow rogue client software to directly collect, store, and replay the end user's credentials outside of any legitimate use within a GNAP request. The concerns of such an extension could be mitigated through use of a challenge and response unlocked by the end user's credentials. For example, the AS presents a challenge as part of an interaction start method, and the client instance signs that challenge using a key derived from a password presented by the end user. It would be possible for the client software to collect this password in a secure software enclave without exposing the password to the rest of the client software or putting it across the wire to the AS. The AS can validate this challenge response against a known password for the identified end user. While an approach such as this does not remove all of the concerns surrounding such a password-based scheme, it is at least possible to implement in a more secure fashion than simply collecting and replaying the password. Even so, such schemes should only ever be used by trusted clients due to the ease of abusing them.

Mixing Up Authorization Servers If a client instance is able to work with multiple AS's simultaneously, it is more possible for an attacker to add a compromised AS to the client instance's configuration and cause the client software to start a request at the compromised AS. This AS could then proxy the client's request to a valid AS in order to attempt to get the resource owner to approve access for the legitimate client instance. A client instance needs to always be aware of which AS it is talking to throughout a grant process, and ensure that any callback for one AS does not get conflated with the callback to different AS. The interaction finish hash calculate allows a client instance to protect against this kind of substitution, but only if the client instance validates the hash. If the client instance does not use an interaction finish method or does not check the interaction finish hash value, the compromised AS can be granted a valid access token on behalf of the resource owner. See for details of one such attack, which has been since addressed in this document by including the grant endpoint in the interaction hash calculation. The client instance still needs to validate the hash for the attack to be prevented.

Processing of Client-Presented User Information GNAP allows the client instance to present assertions and identifiers of the current user to the AS as part of the initial request. This information should only ever be taken by the AS as a hint, since the AS has no way to tell if the represented person is present at the client software, without using an interaction mechanism. This information does not guarantee the given user is there, but it does constitute a statement by the client software that the AS can take into account. For example, if a specific user is claimed to be present prior to interaction, but a different user is shown to be present during interaction, the AS can either determine this to be an error or signal to the client instance through returned subject information that the current user has changed from what the client instance thought. This user information can also be used by the AS to streamline the interaction process when the user is present. For example, instead of having the user type in their account identifier during interaction at a redirected URI, the AS can immediately challenge the user for their account credentials. Alternatively, if an existing session is detected, the AS can determine that it matches the identifier provided by the client and subsequently skip an explicit authentication event by the resource owner. In cases where the AS trusts the client software more completely, due to policy or by previous approval of a given client instance, the AS can take this user information as a statement that the user is present and could issue access tokens and release subject information without interaction. The AS should only take such action in very limited circumstances, as a client instance could assert whatever it likes for the user's identifiers in its request. When a client instance presents an assertion to the AS, the AS needs to evaluate that assertion. Since the AS is unlikely to be the intended audience of an assertion held by the client software, the AS will need to evaluate the assertion in a different context. Even in this case, the AS can still evaluate that the assertion was generated by a trusted party, was appropriately signed, and is within any time validity windows stated by the assertion. If the client instance's audience identifier is known to the AS and can be associated with the client instance's presented key, the AS can also evaluate that the appropriate client instance is presenting the claimed assertion. All of this will prevent an attacker from presenting a manufactured assertion, or one captured from an untrusted system. However, without validating the audience of the assertion, a captured assertion could be presented by the client instance to impersonate a given end user. In such cases, the assertion offers little more protection than a simple identifier would. A special case exists where the AS is the generator of the assertion being presented by the client instance. In these cases, the AS can validate that it did issue the assertion and it is associated with the client instance presenting the assertion.

Client Instance Pre-registration Each client instance is identified by its own unique key, and for some kinds of client software such as a web server or backend system, this identification can be facilitated by registering a single key for a piece of client software ahead of time. This registration can be associated with a set of display attributes to be used during the authorization process, identifying the client software to the user. In these cases, it can be assumed that only one instance of client software will exist, likely to serve many different users. A client's registration record needs to include its identifying key. Furthermore, it is the case that any clients using symmetric cryptography for key proofing mechanisms need to have their keys pre-registered. The registration should also include any information that would aid in the authorization process, such as a display name and logo. The registration record can also limit a given client to ask for certain kinds of information and access, or be limited to specific interaction mechanisms at runtime. It also is sensible to pre-register client instances when the software is acting autonomously, without the need for a runtime approval by a resource owner or any interaction with an end user. In these cases, an AS needs to rest on the trust decisions that have been determined prior to runtime in determining what rights and tokens to grant to a given client instance. However, it does not make sense to pre-register many types of clients. Single-page applications (SPAs) and mobile/desktop applications in particular present problems with pre-registration. For SPAs, the instances are ephemeral in nature and long-term registration of a single instance leads to significant storage and management overhead at the AS. For mobile applications, each installation of the client software is a separate instance, and sharing a key among all instances would be detrimental to security as the compromise of any single installation would compromise all copies for all users. An AS can treat these classes of client software differently from each other, perhaps by allowing access to certain high-value APIs only to pre-registered known clients, or by requiring an active end user delegation of authority to any client software not pre-registered. An AS can also provide warnings and caveats to resource owners during the authorization process, allowing the user to make an informed decision regarding the software they are authorizing. For example, if the AS has done vetting of the client software and this specific instance, it can present a different authorization screen compared to a client instance that is presenting all of its information at runtime.

Client Instance Impersonation If client instances are allowed to set their own user-facing display information, such as a display name and website URL, a malicious client instance could impersonate legitimate client software for the purposes of tricking users into authorizing the malicious client. Requiring clients to pre-register does not fully mitigate this problem since many pre-registration systems have self-service portals for management of client registration, allowing authenticated developers to enter self-asserted information into the management portal. An AS can mitigate this by actively filtering all self-asserted values presented by client software, both dynamically as part of GNAP and through a registration portal, to limit the kinds of impersonation that would be done. An AS can also warn the resource owner about the provenance of the information it is displaying, allowing the resource owner to make a more informed delegation decision. For example, an AS can visually differentiate between a client instance that can be traced back to a specific developer's registration and an instance that has self-asserted its own key and display information.

Interception of Information in the Browser Most information passed through the web-browser is susceptible to interception and possible manipulation by elements within the browser such as scripts loaded within pages. Information in the URI is exposed through browser and server logs, and can also leak to other parties through HTTP Referer headers. GNAP's design limits the information passed directly through the browser, allowing for opaque URIs in most circumstances. For the redirect-based interaction finish mechanism, named query parameters are used to carry unguessable opaque values. For these, GNAP requires creation and validation of a cryptographic hash to protect the query parameters added to the URI and associate them with an ongoing grant process. The client instance has to properly validate this hash to prevent an attacker from injecting an interaction reference intended for a different AS or client instance. Several interaction start mechanisms use URIs created by the AS and passed to the client instance. While these URIs are opaque to the client instance, it's possible for the AS to include parameters, paths, and other pieces of information that could leak security data or be manipulated by a party in the middle of the transaction.

Callback URI Manipulation The callback URI used in interaction finish mechanisms is defined by the client instance. This URI is opaque to the AS, but can contain information relevant to the client instance's operations. In particular, the client instance can include state information to allow the callback request to be associated with an ongoing grant request. Since this URI is exposed to the end user's browser, it is susceptible to both logging and manipulation in transit before the request is made to the client software. As such, a client instance should never put security-critical or private information into the callback URI in a cleartext form. For example, if the client software includes a post-redirect target URI in its callback URI to the AS, this target URI could be manipulated by an attacker, creating an open redirector at the client. Instead, a client instance can use an unguessable identifier into the URI that can then be used by the client software to look up the details of the pending request. Since this approach requires some form of statefulness by the client software during the redirection process, clients that are not capable of holding state through a redirect should not use redirect-based interaction mechanisms.

Redirection Status Codes As already described in , a server should never use the HTTP 307 status code to redirect a request that potentially contains user credentials. If an HTTP redirect is used for such a request, the HTTP status code 303 "See Other" should be used instead. The status code 307, as defined in the HTTP standard , requires the user agent to preserve the method and body of a request, thus submitting the body of the POST request to the redirect target. In the HTTP standard , only the status code 303 unambiguously enforces rewriting the HTTP POST request to an HTTP GET request, which eliminates the POST body from the redirected request. For all other status codes, including status code 302, user agents are allowed not to rewrite a POST request into a GET request and thus to resubmit the body. The use of status code 307 results in a vulnerability when using the redirect interaction finish method. With this method, the AS potentially prompts the RO to enter their credentials in a form that is then submitted back to the AS (using an HTTP POST request). The AS checks the credentials and, if successful, may directly redirect the RO to the client instance's redirect URI. Due to the use of status code 307, the RO's user agent now transmits the RO's credentials to the client instance. A malicious client instance can then use the obtained credentials to impersonate the RO at the AS. Redirection away from the initial URI in an interaction session could also leak information found in that initial URI through the HTTP Referer header field, which would be sent by the user agent to the redirect target. To avoid such leakage, a server can first redirect to an internal interstitial page without any identifying or sensitive information on the URI before processing the request. When the user agent is ultimately redirected from this page, no part of the original interaction URI will be found in the Referrer header.

MTLS Message Integrity The MTLS key proofing mechanism provides a means for a client instance to present a key using a certificate at the TLS layer. Since TLS protects the entire HTTP message in transit, verification of the TLS client certificate presented with the message provides a sufficient binding between the two. However, since TLS is functioning at a separate layer from HTTP, there is no direct connection between the TLS key presentation and the message itself, other than the fact that the message was presented over the TLS channel. That is to say, any HTTP message can be presented over the TLS channel in question with the same level of trust. The verifier is responsible for ensuring the key in the TLS client certificate is the one expected for a particular request. For example, if the request is a grant request, the AS needs to compare the TLS client certificate presented at the TLS layer to the key identified in the request body itself (either by value or through a referenced identifier). Furthermore, the prevalence of the TLS-terminating reverse proxy (TTRP) pattern in deployments adds a wrinkle to the situation. In this common pattern, the TTRP validates the TLS connection and then forwards the HTTP message contents onward to an internal system for processing. The system processing the HTTP message no longer has access to the original TLS connection's information and context. To compensate for this, the TTRP could inject the TLS client certificate into the forwarded request as a header parameter using , giving the downstream system access to the certificate information. The TTRP has to be trusted to provide accurate certificate information, and the connection between the TTRP and the downstream system also has to be protected. The TTRP could provide some additional assurance, for example, by adding its own signature to the Client-Cert header field using . This signature would be effectively ignored by GNAP but understood by the downstream service as part of its deployment. Additional considerations for different types of deployment patterns and key distribution mechanisms for MTLS are found in .

MTLS Deployment Patterns GNAP does not specify how a client instance's keys could be made known to the AS ahead of time. Public Key Infrastructure (PKI) can be used to manage the keys used by client instances when calling the AS, allowing the AS to trust a root key from a trusted authority. This method is particularly relevant to the MTLS key proofing method, where the client instance presents its certificate to the AS as part of the TLS connection. An AS using PKI to validate the MTLS connection would need to ensure that the presented certificate was issued by a trusted certificate authority before allowing the connection to continue. PKI-based certificates would allow a key to be revoked and rotated through management at the certificate authority without requiring additional registration or management at the AS. PKI has historically been difficult to deploy, especially at scale, but it remains an appropriate solution for systems where the required overhead is not an impediment. MTLS in GNAP need not use a PKI backing, as self-signed certificates and certificates from untrusted authorities can still be presented as part of a TLS connection. In this case, the verifier would validate the connection but accept whatever certificate was presented by the client software. This specific certificate would then be bound to all future connections from that client software by being bound to the resulting access tokens. See for more considerations on MTLS as a key proofing mechanism.

Interception of Responses from the AS Responses from the AS contain information vital to both the security and privacy operations of GNAP. This information includes nonces used in cryptographic calculations, subject identifiers, assertions, public keys, and information about what client software is requesting and was granted. In addition, if bearer tokens are used or keys are issued alongside a bound access token, the response from the AS contains all information necessary for use of the contained access token. Any party that is capable of viewing such a response, such as an intermediary proxy, would be able to exfiltrate and use this token. If the access token is instead bound to the client instance's presented key, intermediaries no longer have sufficient information to use the token. They can still, however, gain information about the end user as well as the actions of the client software.

Key Distribution The keys for client instances could be distributed as part of the deployment process of instances of the client software. For example, an application installation framework could generate a keypair for each copy of client software, then both install it into the client software upon installation and registering that instance with the AS. Additionally, it's possible for the AS to generate keys to be used with access tokens that are separate from the keys used by the client instance to request tokens. In this method, the AS would generate the asymmetric keypair or symmetric key and return the entire key, including all private signing information, to the client instance alongside the access token itself. This approach would make interception of the return from the token endpoint equivalent to that of a bearer token, since all information required to use the access token would be present in the request.

Interaction Finish Modes and Polling During the interaction process, the client instance usually hands control of the user experience over to another component, beit the system browser, another application, or some action the resource owner is instructed to take on another device. By using an interaction finish method, the client instance can be securely notified by the AS when the interaction is completed and the next phase of the protocol should occur. This process includes information that the client instance can use to validate the finish call from the AS and prevent some injection, session hijacking, and phishing attacks. Some types of client deployment are unable to receive an interaction finish message. Without an interaction finish method to notify it, the client instance will need to poll the grant continuation API while waiting for the resource owner to approve or deny the request. An attacker could take advantage of this situation by capturing the interaction start parameters and phishing a legitimate user into authorizing the attacker's waiting client instance, which would in turn have no way of associating the completed interaction with the start of the request. However, it is important to note that this pattern is practically indistinguishable from some legitimate use cases. For example, a smart device emits a code for the resource owner to enter on a separate device. The smart device has to poll because the expected behavior is that the interaction will take place on the separate device, without a way to return information to the original device's context. As such, developers need to weigh the risks of forgoing an interaction finish method against the deployment capabilities of the client software and its environment. Due to the increased security, an interaction finish method should be employed whenever possible.

Session Management for Interaction Finish Methods When using an interaction finish method such as redirect or push, the client instance receives an unsolicited HTTP request from an unknown party. The client instance needs to be able to successfully associate this incoming request with a specific pending grant request being managed by the client instance. If the client instance is not careful and precise about this, an attacker could associate their own session at the client instance with a stolen interaction response. The means of preventing this varies by the type of client software and interaction methods in use. Some common patterns are enumerated here. If the end user interacts with the client instance through a web browser and the redirect interaction finish method is used, the client instance can ensure that the incoming HTTP request from the finish method is presented in the same browser session that the grant request was started in. This technique is particularly useful when the redirect interaction start mode is used as well, since in many cases the end user will follow the redirection with the same browser that they are using to interact with the client instance. The client instance can then store the relevant pending grant information in the session, either in the browser storage directly (such as with a single-page application) or in an associated session store on a back-end server. In both cases, when the incoming request reaches the client instance, the session information can be used to ensure that the same party that started the request is present as the request finishes. Ensuring that the same party that started a request is present when that request finishes can prevent phishing attacks, where an attacker starts a request at an honest client instance and tricks an honest RO into authorizing it. For example, if an honest end user (that also acts as the RO) wants to start a request through a client instance controlled by the attacker, the attacker can start a request at an honest client instance and then redirect the honest end user to the interaction URI from the attackers session with the honest client instance. If the honest end user then fails to realize that it is not authorizing the attacker-controlled client instance (with which it started its request) but the honest client instance when interacting with the AS, the attacker's session with the honest client instance would be authorized. This would give the attacker access to the honest end user's resources that the honest client instance is authorized to access. However, if after the interaction the AS redirects the honest end user back to the client instance whose grant request the end user just authorized, the honest end user is redirected to the honest client instance. The honest client instance can then detect that it is not the party that started the request that is present, since the request at the honest client instance was started by the attacker, which can prevent the attack. This is related to , because again the attack can be prevented by the AS informing the user as much as possible about the client instance that is to be authorized. If the end user does not interact with the client instance through a web browser or the interaction start method does not use the same browser or device that the end user is interacting through (such as the launch of a second device through a scannable code or presentation of a user code) the client instance will not be able to strongly associate an incoming HTTP request with an established session with the end user. This is also true when the push interaction finish method is used, since the HTTP request comes directly from the interaction component of the AS. In these circumstances, the client instance can at least ensure that the incoming HTTP request can be uniquely associated with an ongoing grant request by making the interaction finish callback URI unique for the grant when making the interaction request. Mobile applications and other client instances that generally serve only a single end user at a time can use this unique incoming URL to differentiate between a legitimate incoming request and an attacker's stolen request. If the client instance does not have the ability to use an interaction finish method, it can use polling to continue the request. The tradeoffs of this approach are discussed in , and if possible, an explicit interaction finish method should be used instead.

Storage of Information During Interaction and Continuation When starting an interactive grant request, a client application has a number of protocol elements that it needs to manage, including nonces, references, keys, access tokens, and other elements. During the interaction process, the client instance usually hands control of the user experience over to another component, beit the system browser, another application, or some action the resource owner is instructed to take on another device. In order for the client instance to make its continuation call, it will need to recall all of these protocol elements. Usually this means the client instance will need to store these protocol elements in some retrievable fashion. If the security protocol elements are stored on the end user's device, such as in browser storage or in local application data stores, capture and exfiltration of this information could allow an attacker to continue a pending transaction instead of the client instance. Client software can make use of secure storage mechanisms, including hardware-based key and data storage, to prevent such exfiltration. Note that in GNAP, the client instance has to choose its interaction finish URI prior to making the first call to the AS. As such, the interaction finish URI will often have a unique identifier for the ongoing request, allowing the client instance to access the correct portion of its storage. Since this URI is passed to other parties and often used through a browser, this URI should not contain any security-sensitive information that would be valuable to an attacker, such as any token identifier, nonce, or user information. Instead, a cryptographically random value is suggested.

Denial of Service (DoS) through Grant Continuation When a client instance starts off an interactive process, it will eventually need to continue the grant request in a subsequent message to the AS. It's possible for a naive client implementation to continuously send continuation requests to the AS while waiting for approval, especially if no interaction finish method is used. Such constant requests could overwhelm the AS's ability to respond to both these and other requests. To mitigate this for well-behaved client software, the continuation response contains a wait parameter that is intended to tell the client instance how long it should wait until making its next request. This value can be used to back off client software that is checking too quickly by returning increasing wait times for a single client instance. If client software ignores the wait value and makes its continuation calls too quickly, or if the client software assumes the absence of the wait values means it should poll immediately, the AS can choose to return errors to the offending client instance, including possibly canceling the ongoing grant request. With well-meaning client software these errors can indicate a need to change the client software's programmed behavior.

Exhaustion of Random Value Space Several parts of the GNAP process make use of unguessable randomized values, such as nonces, tokens, and randomized URIs. Since these values are intended to be unique, a sufficiently powerful attacker could make a large number of requests to trigger generation of randomized values in an attempt to exhaust the random number generation space. While this attack is particularly applicable to the AS, client software could likewise be targeted by an attacker triggering new grant requests against an AS. To mitigate this, software can ensure that its random values are chosen from a significantly large pool that exhaustion of that pool is prohibitive for an attacker. Additionally, the random values can be time-boxed in such a way as their validity windows are reasonably short. Since many of the random values used within GNAP are used within limited portions of the protocol, it is reasonable for a particular random value to be valid for only a small amount of time. For example, the nonces used for interaction finish hash calculation need only to be valid while the client instance is waiting for the finish callback and can be functionally expired when the interaction has completed. Similarly, artifacts like access tokens and the interaction reference can be limited to have lifetimes tied to their functional utility. Finally, each different category of artifact (nonce, token, reference, identifier, etc.) can be generated from a separate random pool of values instead of a single global value space.

Front-channel URIs Some interaction methods in GNAP make use of URIs accessed through the end user's browser, known collectively as front-channel communication. These URIs are most notably present in the redirect interaction start method and the redirect interaction finish mode. Since these URIs are intended to be given to the end user, the end user and their browser will be subjected to anything hosted at that URI including viruses, malware, and phishing scams. This kind of risk is inherent to all redirection-based protocols, including GNAP when used in this way. When talking to a new or unknown AS, a client instance might want to check the URI from the interaction start against a blocklist and warn the end user before redirecting them. Many client instances will provide an interstitial message prior to redirection in order to prepare the user for control of the user experience being handed to the domain of the AS, and such a method could be used to warn the user of potential threats. For instance, a rogue AS impersonating a well-known service provider. Client software can also prevent this by managing an allowlist of known and trusted AS's. Alternatively, an attacker could start a GNAP request with a known and trusted AS but include their own attack site URI as the callback for the redirect finish method. The attacker would then send the interaction start URI to the victim and get them to click on it. Since the URI is at the known AS, the victim is inclined to do so. The victim will then be prompted to approve the attacker's application, and in most circumstances the victim will then be redirected to the attacker's site whether or not the user approved the request. The AS could mitigate this partially by using a blocklist and allowlist of interaction finish URIs during the client instance's initial request, but this approach can be especially difficult if the URI has any dynamic portion chosen by the client software. The AS can couple these checks with policies associated with the client instance that has been authenticated in the request. If the AS has any doubt about the interaction finish URI, the AS can provide an interstitial warning to the end user before processing the redirect. Ultimately, all protocols that use redirect-based communication through the user's browser are susceptible to having an attacker try to co-opt one or more of those URIs in order to harm the user. It is the responsibility of the AS and the client software to provide appropriate warnings, education, and mitigation to protect end users.

Processing Assertions Identity assertions can be used in GNAP to convey subject information, both from the AS to the client instance in a response and from the client instance to the AS in a request. In both of these circumstances, when an assertion is passed in GNAP, the receiver of the assertion needs to parse and process the assertion. As assertions are complex artifacts with their own syntax and security, special care needs to be taken to prevent the assertion values from being used as an attack vector. All assertion processing needs to account for the security aspects of the assertion format in use. In particular, the processor needs to parse the assertion from a JSON string object, and apply the appropriate cryptographic processes to ensure the integrity of the assertion. For example, when SAML 2 assertions are used, the receiver hast to parse an XML document. There are many well-known security vulnerabilities in XML parsers, and the XML standard itself can be attacked through the use of processing instructions and entity expansions to cause problems with the processor. Therefore, any system capable of processing SAML 2 assertions also needs to have a secure and correct XML parser. In addition to this, the SAML 2 specification uses XML Signatures, which have their own implementation problems that need to be accounted for. Similar requirements exist for OpenID Connect's ID token, which is based on the JSON Web Token (JWT) format and the related JSON Object Signing And Encryption (JOSE) cryptography suite.

Stolen Token Replay If a client instance can request tokens at multiple AS's, and the client instance uses the same keys to make its requests across those different AS's, then it is possible for an attacker to replay a stolen token issued by an honest AS from a compromised AS, thereby binding the stolen token to the client instance's key in a different context. The attacker can manipulate the client instance into using the stolen token at an RS, particularly at an RS that is expecting a token from the honest AS. Since the honest AS issued the token and the client instance presents the token with its expected bound key, the attack succeeds. This attack has several preconditions. In this attack, the attacker does not need access to the client instance's key and cannot use the stolen token directly at the RS, but the attacker is able to get the access token value in some fashion. The client instance also needs to be configured to talk to multiple AS's, including the attacker's controlled AS. Finally, the client instance needs to be able to be manipulated by the attacker to call the RS while using a token issued from the stolen AS. The RS does not need to be compromised or made to trust the attacker's AS. To protect against this attack, the client instance can use a different key for each AS that it talks to. Since the replayed token will be bound to the key used at the honest AS, the uncompromised RS will reject the call since the client instance will be using the key used at the attacker's AS instead with the same token. When the MTLS key proofing method is used, a client instance can use self-signed certificates to use a different key for each AS that it talks to, as discussed in . Additionally, the client instance can keep a strong association between the RS and a specific AS that it trusts to issue tokens for that RS. This strong binding also helps against some forms of AS mix-up attacks. Managing this binding is outside the scope of GNAP core, but it can be managed either as a configuration element for the client instance or dynamically through discovering the AS from the RS. The details of this attack are available in with additional discussion and considerations.

Self-contained Stateless Access Tokens The contents and format of the access token are at the discretion of the AS, and are opaque to the client instance within GNAP. As discussed in the companion document, , the AS and RS can make use of stateless access tokens with an internal structure and format. These access tokens allow an RS to validate the token without having to make any external calls at runtime, allowing for benefits in some deployments, the discussion of which are outside the scope of this specification. However, the use of such self-contained access tokens has an effect on the ability of the AS to provide certain functionality defined within this specification. Specifically, since the access token is self-contained, it is difficult or impossible for an AS to signal to all RS's within an ecosystem when a specific access token has been revoked. Therefore, an AS in such an ecosystem should probably not offer token revocation functionality to client instances, since the client instance's calls to such an endpoint is effectively meaningless. However, a client instance calling the token revocation function will also throw out its copy of the token, so such a placebo endpoint might not be completely meaningless. Token rotation similarly difficult because the AS has to revoke the old access token after a rotation call has been made. If the access tokens are completely self-contained and non-revocable, this means that there will be a period of time during which both the old and new access tokens are valid and usable, which is an increased security risk for the environment. These problems can be mitigated by keeping the validity time windows of self-contained access tokens reasonably short, limiting the time after a revocation event that a revoked token could be used. Additionally, the AS could proactively signal to RS's under its control identifiers for revoked tokens that have yet to expire. This type of information push would be expected to be relatively small and infrequent, and its implementation is outside the scope of this specification.

Network Problems and Token and Grant Management If a client instance makes a call to rotate an access token but the network connection is dropped before the client instance receives the response with the new access token, the system as a whole can end up in an inconsistent state, where the AS has already rotated the old access token and invalidated it, but the client instance only has access to the invalidated access token and not the newly rotated token value. If the client instance retries the rotation request, it would fail because the client is no longer presenting a valid and current access token. A similar situation can occur during grant continuation, where the same client instance calls to continue or update a grant request without successfully receiving the results of the update. To combat this, both grant Management and token management are designed to be idempotent, where subsequent calls to the same function with the same credentials are meant to produce the same results. For example, multiple calls to rotate the same access token need to result in the same rotated token value. In practice, an AS can hold on to an old token value for such limited purposes. For example, to support rotating access tokens over unreliable networks, the AS receives the initial request to rotate an access token and creates a new token value and returns it. The AS also marks the old token value as having been used to create the newly-rotated token value. If the AS sees the old token value within a small enough time window, such as a few seconds since the first rotation attempt, the AS can return the same rotated access token. Furthermore, once the system has seen the newly-rotated token in use, the original token can be discarded because the client instance has proved that it did receive the token. The result of this is a system that is eventually self-consistent without placing an undue complexity burden on the client instance.

Server-side Request Forgery (SSRF) There are several places within GNAP where a URI can be given to a party causing it to fetch that URI during normal operation of the protocol. If an attacker is able to control the value of one of these URIs within the protocol, the attacker could cause the target system to execute a request on a URI that is within reach of the target system but normally unavailable to the attacker. For example, an attacker sending a URL of http://localhost/admin to cause the server to access an internal function on itself, or https://192.168.0.14/ to call a service behind a firewall. Even if the attacker does not gain access to the results of the call, the side effects of such requests coming from a trusted host can be problematic to the security and sanctity of such otherwise unexposed endpoints. In GNAP, the most vulnerable place in the core protocol is the push-based post-interaction finish method, as the client instance is less trusted than the AS and can use this method to make the AS call an arbitrary URI. While it is not required by the protocol, the AS can fetch other client-instance provided URIs such as the logo image or home page, for verification or privacy-preserving purposes before displaying them to the resource owner as part of a consent screen. Furthermore, extensions to GNAP that allow or require URI fetch could also be similarly susceptible, such as a system for having the AS fetch a client instance's keys from a presented URI instead of the client instance presenting the key by value. Such extensions are outside the scope of this specification, but any system deploying such an extension would need to be aware of this issue. To help mitigate this problem, similar approaches to protecting parties against malicious redirects can be used. For example, all URIs that can result in a direct request being made by a party in the protocol can be filtered through an allowlist or blocklist. For example, an AS that supports the push based interaction finish can compare the callback URI in the interaction request to a known URI for a pre-registered client instance, or it can ensure that the URI is not on a blocklist of sensitive URLs such as internal network addresses. However, note that because these types of calls happen outside of the view of human interaction, it is not usually feasible to provide notification and warning to someone before the request needs to be executed, as is the case with redirection URLs. As such, SSRF is somewhat more difficult to manage at runtime, and systems should generally refuse to fetch a URI if unsure.

Privacy Considerations The privacy considerations in this section are modeled after the list of privacy threats in , "Privacy Considerations for Internet Protocols", and either explain how these threats are mitigated or advise how the threats relate to GNAP.

Surveillance Surveillance is the observation or monitoring of an individual's communications or activities. Surveillance can be conducted by observers or eavesdroppers at any point along the communications path. GNAP assumes the TLS protection used throughout the spec is intact. Without the protection of TLS, there are many points throughout the use of GNAP that would lead to possible surveillance.

Surveillance by the Client The purpose of GNAP is to authorize clients to be able to access information on behalf of a user. So while it is expected that the client may be aware of the user's identity as well as data being fetched for that user, in some cases the extent of the client may be beyond what the user is aware of. For example, a client may be implemented as multiple distinct pieces of software, such as a logging service or a mobile app that reports usage data to an external backend service.

Surveillance by the Authorization Server The role of the authorization server is to manage the authorization of client instances to protect access to the user's data. In this role, the authorization server is by definition aware of each authorization of a client instance by a user. When the authorization server shares user information with the client instance, it needs to make sure that it has the permission from that user to do so. Additionally, as part of the authorization grant process, the authorization server may be aware of which resource servers the client intends to use an access token at. However, it is possible to design a system using GNAP in which this knowledge is not made available to the authorization server, such as by avoiding the use of the locations object in the authorization request. If the authorization server's implementation of access tokens is such that it requires a resource server call back to the authorization server to validate them, then the authorization server will be aware of which resource servers are actively in use and by which users and which clients. To avoid this possibility, the authorization server would need to structure access tokens in such a way that they can be validated by the resource server without notifying the authorization server that the token is being validated.

Stored Data Several parties in the GNAP process are expected to persist data at least temporarily, if not semi-permanently, for the normal functioning of the system. If compromised, this could lead to exposure of sensitive information. This section documents the potentially sensitive information each party in GNAP is expected to store for normal operation. Naturally it is possible that any party is storing information for longer than technically necessary of the protocol mechanics (such as audit logs, etc). The authorization server is expected to store subject identifiers for users indefinitely, in order to be able to include them in the responses to clients. The authorization server is also expected to store client key identifiers associated with display information about the client such as its name and logo. The client is expected to store its client instance key indefinitely, in order to authenticate to the authorization server for the normal functioning of the GNAP flows. Additionally, the client will be temporarily storing artifacts issued by the authorization server during a flow, and these artifacts SHOULD be discarded by the client when the transaction is complete. The resource server is not required to store any state for its normal operation. Depending on the implementation of access tokens, the resource server may need to cache public keys from the authorization server in order to validate access tokens.

Intrusion Intrusion refers to the ability of various parties to send unsolicited messages or cause denial of service for unrelated parties. If the resource owner is different from the end user, there is an opportunity for the end user to cause unsolicited messages to be sent to the resource owner if the system prompts the resource owner for consent when an end user attempts to access their data. The format and contents of subject identifiers are intentionally not defined by GNAP. If the authorization server uses values for subject identifiers that are also identifiers for communication channels, (e.g. an email address or phone number), this opens up the possibility for a client to learn this information when it was not otherwise authorized to access this kind of data about the user.

Correlation The threat of correlation is the combination of various pieces of information related to an individual in a way that defies their expectations of what others know about them.

Correlation by Clients The biggest risk of correlation in GNAP is when an authorization server returns stable consistent user identifiers to multiple different applications. In this case, applications created by different parties would be able to correlate these user identifiers out of band in order to know which users they have in common. The most common example of this in practice is tracking for advertising purposes, such that client A shares their list of user IDs with an ad platform that is then able to retarget ads to applications created by other parties. In contrast, a positive example of correlation is a corporate acquisition where two previously unrelated clients now do need to be able to identify the same user between the two clients.

Correlation by Resource Servers Unrelated resource servers also have an opportunity to correlate users if the authorization server includes stable user identifiers in access tokens or in access token introspection responses. In some cases a resource server may not actually need to be able to identify users, (such as a resource server providing access to a company cafeteria menu which only needs to validate whether the user is a current employee), so authorization servers should be thoughtful of when user identifiers are actually necessary to communicate to resource servers for the functioning of the system. However, note that the lack of inclusion of a user identifier in an access token may be a risk if there is a concern that two users may voluntarily share access tokens between them in order to access protected resources. For example, if a website wants to limit access to only people over 18, and such does not need to know any user identifiers, an access token may be issued by an AS contains only the claim "over 18". If the user is aware that this access token doesn't reference them individually, they may be willing to share the access token with a user who is under 18 in order to let them get access to the website. (Note that the binding of an access token to a non-extractable client instance key also prevents the access token from being voluntarily shared.)

Correlation by Authorization Servers Clients are expected to be identified by their client instance key. If a particular client instance key is used at more than one authorization server, this could open up the possibility for multiple unrelated authorization servers to correlate client instances. This is especially a problem in the common case where a client instance is used by a single individual, as it would allow the authorization servers to correlate that individual between them. If this is a concern of a client, the client should use distinct keys with each authorization server.

Disclosure in Shared References Throughout many parts of GNAP, the parties pass shared references between each other, sometimes in place of the values themselves. For example the interact_ref value used throughout the flow. These references are intended to be random strings and should not contain any private or sensitive data that would potentially leak information between parties.

Recommendations for Secure Use of Transport Layer Security (TLS) and Datagram Transport Layer Security (DTLS) Key words for use in RFCs to Indicate Requirement Levels In many standards track documents several words are used to signify the requirements in the specification. These words are often capitalized. This document defines these words as they should be interpreted in IETF documents. This document specifies an Internet Best Current Practices for the Internet Community, and requests discussion and suggestions for improvements. The "data" URL scheme A new URL scheme, "data", is defined. It allows inclusion of small data items as "immediate" data, as if it had been included externally. [STANDARDS-TRACK] Uniform Resource Identifier (URI): Generic Syntax A Uniform Resource Identifier (URI) is a compact sequence of characters that identifies an abstract or physical resource. This specification defines the generic URI syntax and a process for resolving URI references that might be in relative form, along with guidelines and security considerations for the use of URIs on the Internet. The URI syntax defines a grammar that is a superset of all valid URIs, allowing an implementation to parse the common components of a URI reference without knowing the scheme-specific requirements of every possible identifier. This specification does not define a generative grammar for URIs; that task is performed by the individual specifications of each URI scheme. [STANDARDS-TRACK] The Base16, Base32, and Base64 Data Encodings This document describes the commonly used base 64, base 32, and base 16 encoding schemes. It also discusses the use of line-feeds in encoded data, use of padding in encoded data, use of non-alphabet characters in encoded data, use of different encoding alphabets, and canonical encodings. [STANDARDS-TRACK] Tags for Identifying Languages This document describes the structure, content, construction, and semantics of language tags for use in cases where it is desirable to indicate the language used in an information object. It also describes how to register values for use in language tags and the creation of user-defined extensions for private interchange. This document specifies an Internet Best Current Practices for the Internet Community, and requests discussion and suggestions for improvements. Hypertext Transfer Protocol (HTTP/1.1): Semantics and Content The Hypertext Transfer Protocol (HTTP) is a stateless \%application- level protocol for distributed, collaborative, hypertext information systems. This document defines the semantics of HTTP/1.1 messages, as expressed by request methods, request header fields, response status codes, and response header fields, along with the payload of messages (metadata and body content) and mechanisms for content negotiation. Hypertext Transfer Protocol (HTTP/1.1): Caching The Hypertext Transfer Protocol (HTTP) is a stateless \%application- level protocol for distributed, collaborative, hypertext information systems. This document defines HTTP caches and the associated header fields that control cache behavior or indicate cacheable response messages. Textual Encodings of PKIX, PKCS, and CMS Structures This document describes and discusses the textual encodings of the Public-Key Infrastructure X.509 (PKIX), Public-Key Cryptography Standards (PKCS), and Cryptographic Message Syntax (CMS). The textual encodings are well-known, are implemented by several applications and libraries, and are widely deployed. This document articulates the de facto rules by which existing implementations operate and defines them so that future implementations can interoperate. JSON Web Signature (JWS) JSON Web Signature (JWS) represents content secured with digital signatures or Message Authentication Codes (MACs) using JSON-based data structures. Cryptographic algorithms and identifiers for use with this specification are described in the separate JSON Web Algorithms (JWA) specification and an IANA registry defined by that specification. Related encryption capabilities are described in the separate JSON Web Encryption (JWE) specification. JSON Web Key (JWK) A JSON Web Key (JWK) is a JavaScript Object Notation (JSON) data structure that represents a cryptographic key. This specification also defines a JWK Set JSON data structure that represents a set of JWKs. Cryptographic algorithms and identifiers for use with this specification are described in the separate JSON Web Algorithms (JWA) specification and IANA registries established by that specification. The OAuth 2.0 Authorization Framework The OAuth 2.0 authorization framework enables a third-party application to obtain limited access to an HTTP service, either on behalf of a resource owner by orchestrating an approval interaction between the resource owner and the HTTP service, or by allowing the third-party application to obtain access on its own behalf. This specification replaces and obsoletes the OAuth 1.0 protocol described in RFC 5849. [STANDARDS-TRACK] The OAuth 2.0 Authorization Framework: Bearer Token Usage This specification describes how to use bearer tokens in HTTP requests to access OAuth 2.0 protected resources. Any party in possession of a bearer token (a "bearer") can use it to get access to the associated resources (without demonstrating possession of a cryptographic key). To prevent misuse, bearer tokens need to be protected from disclosure in storage and in transport. [STANDARDS-TRACK] Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words RFC 2119 specifies common key words that may be used in protocol specifications. This document aims to reduce the ambiguity by clarifying that only UPPERCASE usage of the key words have the defined special meanings. The JavaScript Object Notation (JSON) Data Interchange Format JavaScript Object Notation (JSON) is a lightweight, text-based, language-independent data interchange format. It was derived from the ECMAScript Programming Language Standard. JSON defines a small set of formatting rules for the portable representation of structured data.This document removes inconsistencies with other specifications of JSON, repairs specification errors, and offers experience-based interoperability guidance. OAuth 2.0 Mutual-TLS Client Authentication and Certificate-Bound Access Tokens This document describes OAuth client authentication and certificate-bound access and refresh tokens using mutual Transport Layer Security (TLS) authentication with X.509 certificates. OAuth clients are provided a mechanism for authentication to the authorization server using mutual TLS, based on either self-signed certificates or public key infrastructure (PKI). OAuth authorization servers are provided a mechanism for binding access tokens to a client's mutual-TLS certificate, and OAuth protected resources are provided a method for ensuring that such an access token presented to it was issued to the client presenting the token. Grant Negotiation and Authorization Protocol Resource Server Connections Bespoke Engineering Okta acert.io GNAP defines a mechanism for delegating authorization to a piece of software, and conveying that delegation to the software. This extension defines methods for resource servers (RS) to communicate with authorization servers (AS) in an interoperable fashion. HTTP Message Signatures Amazon Bespoke Engineering Digital Bazaar This document describes a mechanism for creating, encoding, and verifying digital signatures or message authentication codes over components of an HTTP message. This mechanism supports use cases where the full HTTP message may not be known to the signer, and where the message may be transformed (e.g., by intermediaries) before reaching the verifier. This document also describes a means for requesting that a signature be applied to a subsequent HTTP message in an ongoing HTTP exchange. Digest Fields Team Digitale, Italian Government Cloudflare This document defines HTTP fields that support integrity checksums. The Digest field can be used for the integrity of HTTP representations. The Content-Digest field can be used for the integrity of HTTP message content. Want-Digest and Want-Content- Digest can be used to indicate a sender's desire to receive integrity fields respectively. This document obsoletes RFC 3230. Subject Identifiers for Security Event Tokens Amazon Coinbase Security events communicated within Security Event Tokens may support a variety of identifiers to identify subjects related to the event. This specification formalizes the notion of subject identifiers as structured information that describe a subject, and named formats that define the syntax and semantics for encoding subject identifiers as JSON objects. It also defines a registry for defining and allocating names for such formats, as well as the sub_id JSON Web Token (JWT) claim. OAuth 2.0 Rich Authorization Requests yes.com Bespoke Engineering Ping Identity This document specifies a new parameter authorization_details that is used to carry fine-grained authorization data in OAuth messages. OpenID Connect Core 1.0 incorporating errata set 1 Handling Long Lines in Content of Internet-Drafts and RFCs This document defines two strategies for handling long lines in width-bounded text content. One strategy, called the "single backslash" strategy, is based on the historical use of a single backslash ('\') character to indicate where line-folding has occurred, with the continuation occurring with the first character that is not a space character (' ') on the next line. The second strategy, called the "double backslash" strategy, extends the first strategy by adding a second backslash character to identify where the continuation begins and is thereby able to handle cases not supported by the first strategy. Both strategies use a self-describing header enabling automated reconstitution of the original content. Privacy Considerations for Internet Protocols This document offers guidance for developing privacy considerations for inclusion in protocol specifications. It aims to make designers, implementers, and users of Internet protocols aware of privacy-related design choices. It suggests that whether any individual RFC warrants a specific privacy considerations section will depend on the document's content. Client-Cert HTTP Header Field Ping Identity Akamai This document defines HTTP extension header fields that allow a TLS terminating reverse proxy to convey the client certificate information of a mutually-authenticated TLS connection to the origin server in a common and predictable manner. OAuth 2.0 Security Best Current Practice yes.com Yubico Independent Researcher yes.com This document describes best current security practice for OAuth 2.0. It updates and extends the OAuth 2.0 Security Threat Model to incorporate practical experiences gathered since OAuth 2.0 was published and covers new threats relevant due to the broader application of OAuth 2.0. Promise theory Security Analysis of Attack Surfaces on the Grant Negotiation and Authorization Protocol tbd

Document History -09 Added security considerations on redirection status codes. Added security considerations on cuckoo token attack. Made token management URL required on token rotation. Added considerations on token rotation and self-contained tokens. Added security considerations for SSRF. Moved normative requirements about end user presence to security considerations. Clarified default wait times for continuation requests (including polling). Clarified URI vs. URL. Added "user_code_uri" mode, removed "uri" from "user_code" mode. Consistently formatted all parameter lists. Updated examples for HTTP Signatures. -08 Update definition for "Client" to account for the case of no end user. Change definition for "Subject". Expanded security and privacy considerations for more situations. Added cross-links from security and privacy considerations. Editorial updates. -07 Replace user handle by opaque identifier Added trust relationships Added privacy considerations section Added security considerations. -06 Removed "capabilities" and "existing_grant" protocol fields. Removed separate "instance_id" field. Split "interaction_methods_supported" into "interaction_start_modes_supported" and "interaction_finish_methods_supported". Added AS endpoint to hash calculation to fix mix-up attack. Added "privileges" field to resource access request object. Moved client-facing RS response back from GNAP-RS document. Removed oauthpop key binding. Removed dpop key binding. Added example DID identifier. Changed token response booleans to flag structure to match request. Updated signature examples to use HTTP Message Signatures. -05 Changed "interaction_methods" to "interaction_methods_supported". Changed "key_proofs" to "key_proofs_supported". Changed "assertions" to "assertions_supported". Updated discovery and field names for subject formats. Add an appendix to provide protocol rationale, compared to OAuth2. Updated subject information definition. Refactored the RS-centric components into a new document. Updated cryptographic proof of possession methods to match current reference syntax. Updated proofing language to use "signer" and "verifier" generically. Updated cryptographic proof of possession examples. Editorial cleanup and fixes. Diagram cleanup and fixes. -04 Updated terminology. Refactored key presentation and binding. Refactored "interact" request to group start and end modes. Changed access token request and response syntax. Changed DPoP digest field to 'htd' to match proposed FAPI profile. Include the access token hash in the DPoP message. Removed closed issue links. Removed function to read state of grant request by client. Closed issues related to reading and updating access tokens. -03 Changed "resource client" terminology to separate "client instance" and "client software". Removed OpenID Connect "claims" parameter. Dropped "short URI" redirect. Access token is mandatory for continuation. Removed closed issue links. Editorial fixes. -02 Moved all "editor's note" items to GitHub Issues. Added JSON types to fields. Changed "GNAP Protocol" to "GNAP". Editorial fixes. -01 "updated_at" subject info timestamp now in ISO 8601 string format. Editorial fixes. Added Aaron and Fabien as document authors. -00 Initial working group draft.

Compared to OAuth 2.0 GNAP's protocol design differs from OAuth 2.0's in several fundamental ways: Consent and authorization flexibility: OAuth 2.0 generally assumes the user has access to the a web browser. The type of interaction available is fixed by the grant type, and the most common interactive grant types start in the browser. OAuth 2.0 assumes that the user using the client software is the same user that will interact with the AS to approve access. GNAP allows various patterns to manage authorizations and consents required to fulfill this requested delegation, including information sent by the client instance, information supplied by external parties, and information gathered through the interaction process. GNAP allows a client instance to list different ways that it can start and finish an interaction, and these can be mixed together as needed for different use cases. GNAP interactions can use a browser, but don’t have to. Methods can use inter-application messaging protocols, out-of-band data transfer, or anything else. GNAP allows extensions to define new ways to start and finish an interaction, as new methods and platforms are expected to become available over time. GNAP is designed to allow the end user and the resource owner to be two different people, but still works in the optimized case of them being the same party. Intent registration and inline negotiation: OAuth 2.0 uses different “grant types” that start at different endpoints for different purposes. Many of these require discovery of several interrelated parameters. GNAP requests all start with the same type of request to the same endpoint at the AS. Next steps are negotiated between the client instance and AS based on software capabilities, policies surrounding requested access, and the overall context of the ongoing request. GNAP defines a continuation API that allows the client instance and AS to request and send additional information from each other over multiple steps. This continuation API uses the same access token protection that other GNAP-protected APIs use. GNAP allows discovery to optimize the requests but it isn’t required thanks to the negotiation capabilities. Client instances: OAuth 2.0 requires all clients to be registered at the AS and to use a client_id known to the AS as part of the protocol. This client_id is generally assumed to be assigned by a trusted authority during a registration process, and OAuth places a lot of trust on the client_id as a result. Dynamic registration allows different classes of clients to get a client_id at runtime, even if they only ever use it for one request. GNAP allows the client instance to present an unknown key to the AS and use that key to protect the ongoing request. GNAP’s client instance identifier mechanism allows for pre-registered clients and dynamically registered clients to exist as an optimized case without requiring the identifier as part of the protocol at all times. Expanded delegation: OAuth 2.0 defines the “scope” parameter for controlling access to APIs. This parameter has been coopted to mean a number of different things in different protocols, including flags for turning special behavior on and off, including the return of data apart from the access token. The “resource” parameter and RAR extensions (as defined in ) expand on the “scope” concept in similar but different ways. GNAP defines a rich structure for requesting access, with string references as an optimization. GNAP defines methods for requesting directly-returned user information, separate from API access. This information includes identifiers for the current user and structured assertions. The core GNAP protocol makes no assumptions or demands on the format or contents of the access token, but the RS extension allows a negotiation of token formats between the AS and RS. Cryptography-based security: OAuth 2.0 uses shared bearer secrets, including the client_secret and access token, and advanced authentication and sender constraint have been built on after the fact in inconsistent ways. In GNAP, all communication between the client instance and AS is bound to a key held by the client instance. GNAP uses the same cryptographic mechanisms for both authenticating the client (to the AS) and binding the access token (to the RS and the AS). GNAP allows extensions to define new cryptographic protection mechanisms, as new methods are expected to become available over time. GNAP does not have a notion of “public clients” because key information can always be sent and used dynamically. Privacy and usable security: OAuth 2.0's deployment model assumes a strong binding between the AS and the RS. GNAP is designed to be interoperable with decentralized identity standards and to provide a human-centric authorization layer. In addition to the core protocol, GNAP supports various patterns of communication between RSs and ASs through extensions. GNAP tries to limit the odds of a consolidation to just a handful of super-popular AS services.

Component Data Models While different implementations of this protocol will have different realizations of all the components and artifacts enumerated here, the nature of the protocol implies some common structures and elements for certain components. This appendix seeks to enumerate those common elements. TBD: Client has keys, allowed requested resources, identifier(s), allowed requested subjects, allowed TBD: AS has "grant endpoint", interaction endpoints, store of trusted client keys, policies TBD: Token has RO, user, client, resource list, RS list,

Example Protocol Flows The protocol defined in this specification provides a number of features that can be combined to solve many different kinds of authentication scenarios. This section seeks to show examples of how the protocol would be applied for different situations. Some longer fields, particularly cryptographic information, have been truncated for display purposes in these examples.

Redirect-Based User Interaction In this scenario, the user is the RO and has access to a web browser, and the client instance can take front-channel callbacks on the same device as the user. This combination is analogous to the OAuth 2.0 Authorization Code grant type. The client instance initiates the request to the AS. Here the client instance identifies itself using its public key.

The AS processes the request and determines that the RO needs to interact. The AS returns the following response giving the client instance the information it needs to connect. The AS has also indicated to the client instance that it can use the given instance identifier to identify itself in future requests.

The client instance saves the response and redirects the user to the interaction_url by sending the following HTTP message to the user's browser.

The user's browser fetches the AS's interaction URI. The user logs in, is identified as the RO for the resource being requested, and approves the request. Since the AS has a callback parameter, the AS generates the interaction reference, calculates the hash, and redirects the user back to the client instance with these additional values added as query parameters.

The client instance receives this request from the user's browser. The client instance ensures that this is the same user that was sent out by validating session information and retrieves the stored pending request. The client instance uses the values in this to validate the hash parameter. The client instance then calls the continuation URI and presents the handle and interaction reference in the request body. The client instance signs the request as above.

The AS retrieves the pending request based on the handle and issues an access token and returns this to the client instance.

Secondary Device Interaction In this scenario, the user does not have access to a web browser on the device and must use a secondary device to interact with the AS. The client instance can display a user code or a printable QR code. The client instance is not able to accept callbacks from the AS and needs to poll for updates while waiting for the user to authorize the request. The client instance initiates the request to the AS.

The AS processes this and determines that the RO needs to interact. The AS supports both redirect URIs and user codes for interaction, so it includes both. Since there is no interaction finish mode, the AS does not include a nonce, but does include a "wait" parameter on the continuation section because it expects the client instance to poll for results.

The client instance saves the response and displays the user code visually on its screen along with the static device URI. The client instance also displays the short interaction URI as a QR code to be scanned. If the user scans the code, they are taken to the interaction endpoint and the AS looks up the current pending request based on the incoming URI. If the user instead goes to the static page and enters the code manually, the AS looks up the current pending request based on the value of the user code. In both cases, the user logs in, is identified as the RO for the resource being requested, and approves the request. Once the request has been approved, the AS displays to the user a message to return to their device. Meanwhile, the client instance periodically polls the AS every 60 seconds at the continuation URI. The client instance signs the request using the same key and method that it did in the first request.

The AS retrieves the pending request based on the handle and determines that it has not yet been authorized. The AS indicates to the client instance that no access token has yet been issued but it can continue to call after another 60 second timeout.

Note that the continuation URI and access token have been rotated since they were used by the client instance to make this call. The client instance polls the continuation URI after a 60 second timeout using this new information.

The AS retrieves the pending request based on the URI and access token, determines that it has been approved, and issues an access token for the client to use at the RS.

No User Involvement In this scenario, the client instance is requesting access on its own behalf, with no user to interact with. The client instance creates a request to the AS, identifying itself with its public key and using MTLS to make the request.

The AS processes this and determines that the client instance can ask for the requested resources and issues an access token.

Asynchronous Authorization In this scenario, the client instance is requesting on behalf of a specific RO, but has no way to interact with the user. The AS can asynchronously reach out to the RO for approval in this scenario. The client instance starts the request at the AS by requesting a set of resources. The client instance also identifies a particular user.

The AS processes this and determines that the RO needs to interact. The AS determines that it can reach the identified user asynchronously and that the identified user does have the ability to approve this request. The AS indicates to the client instance that it can poll for continuation.

The AS reaches out to the RO and prompts them for consent. In this example, the AS has an application that it can push notifications in to for the specified account. Meanwhile, the client instance periodically polls the AS every 60 seconds at the continuation URI.

Note that the continuation handle has been rotated since it was used by the client instance to make this call. The client instance polls the continuation URI after a 60 second timeout using the new handle.

The AS retrieves the pending request based on the handle and determines that it has been approved and it issues an access token.

Applying OAuth 2.0 Scopes and Client IDs While GNAP is not designed to be directly compatible with OAuth 2.0 , considerations have been made to enable the use of OAuth 2.0 concepts and constructs more smoothly within GNAP. In this scenario, the client developer has a client_id and set of scope values from their OAuth 2.0 system and wants to apply them to the new protocol. Traditionally, the OAuth 2.0 client developer would put their client_id and scope values as parameters into a redirect request to the authorization endpoint.

Now the developer wants to make an analogous request to the AS using GNAP. To do so, the client instance makes an HTTP POST and places the OAuth 2.0 values in the appropriate places.

The client_id can be used to identify the client instance's keys that it uses for authentication, the scopes represent resources that the client instance is requesting, and the redirect_uri and state value are pre-combined into a finish URI that can be unique per request. The client instance additionally creates a nonce to protect the callback, separate from the state parameter that it has added to its return URI. From here, the protocol continues as above.

JSON Structures and Polymorphism GNAP makes use of polymorphism within the JSON structures used for the protocol. Each portion of this protocol is defined in terms of the JSON data type that its values can take, whether it's a string, object, array, boolean, or number. For some fields, different data types offer different descriptive capabilities and are used in different situations for the same field. Each data type provides a different syntax to express the same underlying semantic protocol element, which allows for optimization and simplification in many common cases. Even though JSON is often used to describe strongly typed structures, JSON on its own is naturally polymorphic. In JSON, the named members of an object have no type associated with them, and any data type can be used as the value for any member. In practice, each member has a semantic type that needs to make sense to the parties creating and consuming the object. Within this protocol, each object member is defined in terms of its semantic content, and this semantic content might have expressions in different concrete data types for different specific purposes. Since each object member has exactly one value in JSON, each data type for an object member field is naturally mutually exclusive with other data types within a single JSON object. For example, a resource request for a single access token is composed of an array of resource request descriptions while a request for multiple access tokens is composed of an object whose member values are all arrays. Both of these represent requests for access, but the difference in syntax allows the client instance and AS to differentiate between the two request types in the same request. Another form of polymorphism in JSON comes from the fact that the values within JSON arrays need not all be of the same JSON data type. However, within this protocol, each element within the array needs to be of the same kind of semantic element for the collection to make sense, even when the data types are different from each other. For example, each aspect of a resource request can be described using an object with multiple dimensional components, or the aspect can be requested using a string. In both cases, the resource request is being described in a way that the AS needs to interpret, but with different levels of specificity and complexity for the client instance to deal with. An API designer can provide a set of common access scopes as simple strings but still allow client software developers to specify custom access when needed for more complex APIs. Extensions to this specification can use different data types for defined fields, but each extension needs to not only declare what the data type means, but also provide justification for the data type representing the same basic kind of thing it extends. For example, an extension declaring an "array" representation for a field would need to explain how the array represents something akin to the non-array element that it is replacing.