HTTPAPI R. Polli Internet-Draft Digital Transformation Department, Italian Government Intended status: Informational E. Wilde Expires: 28 November 2022 Axway E. Aro Mozilla 27 May 2022 YAML Media Type draft-ietf-httpapi-yaml-mediatypes-01 Abstract This document registers the application/yaml media type and the +yaml structured syntax suffix on the IANA Media Types registry. Note to Readers _RFC EDITOR: please remove this section before publication_ Discussion of this draft takes place on the HTTP APIs working group mailing list (httpapi@ietf.org), which is archived at https://mailarchive.ietf.org/arch/browse/httpapi/ (https://mailarchive.ietf.org/arch/browse/httpapi/). The source code and issues list for this draft can be found at https://github.com/ietf-wg-httpapi/mediatypes (https://github.com/ ietf-wg-httpapi/mediatypes). Status of This Memo This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet- Drafts is at https://datatracker.ietf.org/drafts/current/. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." This Internet-Draft will expire on 28 November 2022. Polli, et al. Expires 28 November 2022 [Page 1] Internet-Draft YAML Media Type May 2022 Copyright Notice Copyright (c) 2022 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/ license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License. Table of Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 2 1.1. Notational Conventions . . . . . . . . . . . . . . . . . 3 1.2. Fragment identification . . . . . . . . . . . . . . . . . 3 2. Media Type and Structured Syntax Suffix registrations . . . . 4 2.1. Media Type application/yaml . . . . . . . . . . . . . . . 4 2.2. The +yaml Structured Syntax Suffix . . . . . . . . . . . 5 3. Interoperability Considerations . . . . . . . . . . . . . . . 6 3.1. YAML is an Evolving Language . . . . . . . . . . . . . . 6 3.2. YAML and JSON . . . . . . . . . . . . . . . . . . . . . . 6 4. Security Considerations . . . . . . . . . . . . . . . . . . . 7 4.1. Arbitrary Code Execution . . . . . . . . . . . . . . . . 7 4.2. Resource Exhaustion . . . . . . . . . . . . . . . . . . . 8 5. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 8 6. Normative References . . . . . . . . . . . . . . . . . . . . 9 Appendix A. Acknowledgements . . . . . . . . . . . . . . . . . . 10 FAQ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 Change Log . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 10 1. Introduction YAML [YAML] is a data serialization format that is widely used on the Internet, including in the API sector (e.g. see [oas]), but the relevant media type and structured syntax suffix previously had not been registered by IANA. To increase interoperability when exchanging YAML data and leverage content negotiation mechanisms when exchanging YAML resources, this specification registers the application/yaml media type and the +yaml structured syntax suffix. Polli, et al. Expires 28 November 2022 [Page 2] Internet-Draft YAML Media Type May 2022 Moreover, it provides security considerations and interoperability considerations related to [YAML], including its relation with [JSON]. 1.1. Notational Conventions The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here. These words may also appear in this document in lower case as plain English words, absent their normative meanings. This document uses the Augmented BNF defined in [RFC5234] and updated by [RFC7405]. The terms "content", "content negotiation", "resource", and "user agent" in this document are to be interpreted as in [SEMANTICS]. The terms "fragment" and "fragment identifier" in this document are to be interpreted as in [URI]. The terms "node", "anchor" and "named anchor" in this document are to be intepreded as in [YAML]. 1.2. Fragment identification This section describes how to use named anchors (see Section 3.2.2.2 of [YAML]) as fragment identifiers to designate nodes. A YAML named anchor can be represented in a URI fragment identifier by encoding it into octects using UTF-8 [UTF-8], while percent- encoding those characters not allowed by the fragment rule in Section 3.5 of [URI]. If multiple nodes would match a fragment identifier, the first such match is selected. Users concerned with interoperability of fragment identifiers: * SHOULD limit named anchors to a set of characters that do not require encoding to be expressed as URI fragment identifiers: this is always possible since named anchors are a serialization detail; * SHOULD NOT use a named anchor that matches multiple nodes. Polli, et al. Expires 28 November 2022 [Page 3] Internet-Draft YAML Media Type May 2022 In the example resource below, the URL file.yaml#foo references the anchor foo pointing to the node with value scalar; whereas the URL file.yaml#bar references the anchor bar pointing to the node with value [ some, sequence, items ]. %YAML 1.2 --- one: &foo scalar two: &bar - some - sequence - items 2. Media Type and Structured Syntax Suffix registrations This section describes the information required to register the above media type according to [MEDIATYPE] 2.1. Media Type application/yaml The media type for YAML text is application/yaml; the following information serves as the registration form for this media type. Type name: application Subtype name: yaml Required parameters: None Optional parameters: None; unrecognized parameters should be ignored Encoding considerations: binary Security considerations: see Section 4 of this document Interoperability considerations: see Section 3 of this document Published specification: [YAML] Applications that use this media type: HTTP Fragment identifier considerations: see Section 1.2 Additional information: * Deprecated alias names for this type: application/x-yaml, text/ yaml, text/x-yaml Polli, et al. Expires 28 November 2022 [Page 4] Internet-Draft YAML Media Type May 2022 * Magic number(s) n/a * File extension(s): yaml, yml * Macintosh file type code(s): n/a Person and email address to contact for further information: See Aut hors' Addresses section. Intended usage: COMMON Restrictions on usage: None. Author: See Authors' Addresses section. Change controller: n/a 2.2. The +yaml Structured Syntax Suffix The suffix +yaml MAY be used with any media type whose representation follows that established for application/yaml. The media type structured syntax suffix registration form follows. See [MEDIATYPE] for definitions of each of the registration form headings. Name: YAML Ain't Markup Language (YAML) +suffix: +yaml References: [YAML] Encoding considerations: see Section 2.1 Fragment identifier considerations: Differently from application/ yaml, there is no fragment identification syntax defined for +yaml. A specific xxx/yyy+yaml media type needs to define the syntax and semantics for fragment identifiers because the ones in Section 2.1 do not apply unless explicitly expressed. Interoperability considerations: See Section 2.1 Security considerations: See Section 2.1 Contact: See Authors' Addresses section. Author: See Authors' Addresses section Polli, et al. Expires 28 November 2022 [Page 5] Internet-Draft YAML Media Type May 2022 Change controller: n/a 3. Interoperability Considerations 3.1. YAML is an Evolving Language YAML is an evolving language and, over time, some features have been added and others removed. While this document is based on a given YAML version [YAML], the media type registration does not imply a specific version. This allows content negotiation of version-independent YAML resources. Implementers concerned about features related to a specific YAML version can specify it in documents using the %YAML directive (see Section 6.8.1 of [YAML]). 3.2. YAML and JSON When using flow collection styles (see Section 7.4 of [YAML]) a YAML document could look like JSON [JSON], thus similar interoperability considerations apply. When using YAML as a more efficient format to serialize information intented to be consumed as JSON, information can be discarded: this includes comments (see Section 3.2.3.3 of [YAML]) and alias nodes (see Section 7.1 of [YAML]), that do not have a JSON counterpart. # This comment will be lost # when serializing in JSON. Title: type: string maxLength: &text_limit 64 Name: type: string maxLength: *text_limit # Replaced by the value 64. Figure 1: JSON replaces alias nodes with static values. Implementers need to ensure that relevant information will not be lost during the processing. For example, they might consider acceptable that alias nodes are replaced by static values. In some cases an implementer may want to define a list of allowed YAML features, taking into account that the following ones might have interoperability issues with JSON: Polli, et al. Expires 28 November 2022 [Page 6] Internet-Draft YAML Media Type May 2022 * non UTF-8 encoding, since YAML supports UTF-16 and UTF-32 in addition to UTF-8; * mapping keys that are not strings; * circular references represented using anchor (see Section 4.2 and Figure 3); * .inf and .nan float values, since JSON does not support them; * non-JSON types, including the ones associated with tags like !!timestamp that were included in the default schema of older YAML versions; * tags in general, and specifically the ones that do not map to JSON types like custom and local tags such as !!python/object and !mytag (see Section 2.4 of [YAML]); non-json-keys: 2020-01-01: a timestamp [0, 1]: a sequence ? {k: v} : a map non-json-value: 2020-01-01 Figure 2: Example of mapping keys not supported in JSON 4. Security Considerations Security requirements for both media type and media type suffix registrations are discussed in Section 4.6 of [MEDIATYPE]. 4.1. Arbitrary Code Execution Care should be used when using YAML tags, because their resolution might trigger unexpected code execution. Code execution in deserializers should be disabled by default, and only be enabled explicitly. In those cases, the implementation should ensure - for example, via specific functions - that the code execution results in strictly bounded time/memory limits. Many implementations provide safe deserializers addressing these issues. Polli, et al. Expires 28 November 2022 [Page 7] Internet-Draft YAML Media Type May 2022 4.2. Resource Exhaustion YAML documents are rooted, connected, directed graphs and can contain reference cycles, so they can't be treated as simple trees (see Section 3.2.1 of [YAML]). An implementation that attempts to do that can infinite-loop at some point (e.g. when trying to serialize a YAML document in JSON). x: &x y: *x Figure 3: A cyclic document Even if a document is not cyclic, treating it as a simple tree could lead to improper behaviors (such as the "billion laughs" problem). x1: &a1 ["a", "a"] x2: &a2 [*a1, *a1] x3: &a3 [*a2, *a2] Figure 4: A billion laughs document This can be addressed using processors limiting the anchor recursion depth and validating the input before processing it; even in these cases it is important to carefully test the implementation you are going to use. The same considerations apply when serializing a YAML representation graph in a format that does not support reference cycles (see Section 3.2). 5. IANA Considerations This specification defines the following new Internet media type [MEDIATYPE]. IANA has updated the "Media Types" registry at https://www.iana.org/assignments/media-types (https://www.iana.org/assignments/media-types) with the registration information provided below. +==================+==============================+ | Media Type | Section | +==================+==============================+ | application/yaml | Section 2.1 of this document | +------------------+------------------------------+ Table 1 Polli, et al. Expires 28 November 2022 [Page 8] Internet-Draft YAML Media Type May 2022 IANA has updated the "Structured Syntax Suffixes" registry at https://www.iana.org/assignments/media-type-structured-suffix (https://www.iana.org/assignments/media-type-structured-suffix) with the registration information provided below. +========+==============================+ | Suffix | Section | +========+==============================+ | +yaml | Section 2.2 of this document | +--------+------------------------------+ Table 2 6. Normative References [JSON] Bray, T., Ed., "The JavaScript Object Notation (JSON) Data Interchange Format", STD 90, RFC 8259, DOI 10.17487/RFC8259, December 2017, . [MEDIATYPE] Freed, N., Klensin, J., and T. Hansen, "Media Type Specifications and Registration Procedures", BCP 13, RFC 6838, DOI 10.17487/RFC6838, January 2013, . [oas] Darrel Miller, Jeremy Whitlock, Marsh Gardiner, Mike Ralphson, Ron Ratovsky, and Uri Sarid, "OpenAPI Specification 3.0.0", 26 July 2017. [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, DOI 10.17487/RFC2119, March 1997, . [RFC5234] Crocker, D., Ed. and P. Overell, "Augmented BNF for Syntax Specifications: ABNF", STD 68, RFC 5234, DOI 10.17487/RFC5234, January 2008, . [RFC7405] Kyzivat, P., "Case-Sensitive String Support in ABNF", RFC 7405, DOI 10.17487/RFC7405, December 2014, . [RFC8174] Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174, May 2017, . Polli, et al. Expires 28 November 2022 [Page 9] Internet-Draft YAML Media Type May 2022 [SEMANTICS] Fielding, R. T., Nottingham, M., and J. Reschke, "HTTP Semantics", Work in Progress, Internet-Draft, draft-ietf- httpbis-semantics-19, 12 September 2021, . [URI] Berners-Lee, T., Fielding, R., and L. Masinter, "Uniform Resource Identifier (URI): Generic Syntax", STD 66, RFC 3986, DOI 10.17487/RFC3986, January 2005, . [UTF-8] Yergeau, F., "UTF-8, a transformation format of ISO 10646", STD 63, RFC 3629, DOI 10.17487/RFC3629, November 2003, . [YAML] Oren Ben-Kiki, Clark Evans, Ingy dot Net, Tina Müller, Pantelis Antoniou, Eemeli Aro, and Thomas Smith, "YAML Ain't Markup Language Version 1.2", 1 October 2021, . Appendix A. Acknowledgements Thanks to Erik Wilde and David Biesack for being the initial contributors of this specification, and to Darrel Miller and Rich Salz for their support during the adoption phase. In addition to the people above, this document owes a lot to the extensive discussion inside and outside the HTTPAPI workgroup. The following contributors have helped improve this specification by opening pull requests, reporting bugs, asking smart questions, drafting or reviewing text, and evaluating open issues: Tina (tinita) Mueller, Ben Hutton, Manu Sporny and Jason Desrosiers. FAQ Q: Why this document? After all these years, we still lack a proper media-type for YAML. This has some security implications too (eg. wrt on identifying parsers or treat downloads) Change Log RFC EDITOR PLEASE DELETE THIS SECTION. Authors' Addresses Polli, et al. Expires 28 November 2022 [Page 10] Internet-Draft YAML Media Type May 2022 Roberto Polli Digital Transformation Department, Italian Government Italy Email: robipolli@gmail.com Erik Wilde Axway Switzerland Email: erik.wilde@dret.net Eemeli Aro Mozilla Finland Email: eemeli@gmail.com Polli, et al. Expires 28 November 2022 [Page 11]