Network Working Group B. Hoeneisen Internet-Draft H. Marques Intended status: Standards Track pEp Foundation Expires: 19 June 2023 16 December 2022 IANA Registration of Trustword Lists: Guide, Template and IANA Considerations draft-pep-trustwords-00 Abstract This document specifies the IANA Registration Guidelines for Trustwords, describes corresponding registration procedures, and provides a guideline for creating Trustword list specifications. Trustwords are common words in a natural language (e.g., English), which hexadecimal strings are mapped to. Such a mapping makes verification processes like fingerprint comparisons more practical, and less prone to misunderstandings. Status of This Memo This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet- Drafts is at https://datatracker.ietf.org/drafts/current/. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." This Internet-Draft will expire on 19 June 2023. Copyright Notice Copyright (c) 2022 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/ license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components Hoeneisen & Marques Expires 19 June 2023 [Page 1] Internet-Draft IANA Registration of Trustword Lists December 2022 extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License. Table of Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 2 1.1. Requirements Language . . . . . . . . . . . . . . . . . . 3 1.2. Terms . . . . . . . . . . . . . . . . . . . . . . . . . . 3 2. The Concept of Trustword Mapping . . . . . . . . . . . . . . 4 2.1. Example . . . . . . . . . . . . . . . . . . . . . . . . . 4 2.2. Previous work . . . . . . . . . . . . . . . . . . . . . . 4 2.3. Number of Trustwords for a language . . . . . . . . . . . 5 2.4. Language . . . . . . . . . . . . . . . . . . . . . . . . 6 2.5. The nature of the words . . . . . . . . . . . . . . . . . 6 3. Security Considerations . . . . . . . . . . . . . . . . . . . 6 4. Privacy Considerations . . . . . . . . . . . . . . . . . . . 6 5. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 6 5.1. Registration Template (XML chunk) . . . . . . . . . . . . 6 5.2. IANA Registration . . . . . . . . . . . . . . . . . . . . 8 5.2.1. Language Code () . . . . . . . . . . . 8 5.2.2. Bit Size () . . . . . . . . . . . . . . . . 8 5.2.3. Number Of Unique Words () . . . 8 5.2.4. Bijectivity () . . . . . . . . . . . . . . 8 5.2.5. Version () . . . . . . . . . . . . . . . . . 9 5.2.6. Registration Document(s) () . . . . 9 5.2.7. Requesters () . . . . . . . . . . . . . . 9 5.2.8. Further Information () . . . . . . . 10 5.2.9. Wordlist () . . . . . . . . . . . . . . . . 10 6. Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . 11 7. References . . . . . . . . . . . . . . . . . . . . . . . . . 11 7.1. Normative References . . . . . . . . . . . . . . . . . . 11 7.2. Informative References . . . . . . . . . . . . . . . . . 11 Appendix A. IANA XML Template Example . . . . . . . . . . . . . 13 Appendix B. Document Changelog . . . . . . . . . . . . . . . . . 14 Appendix C. Open Issues . . . . . . . . . . . . . . . . . . . . 15 Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 16 1. Introduction In public-key cryptography, comparing the respective public key fingerprints for each of the communication partners involved is vital to ensure that there is no Man-in-the-Middle (MITM) attack on the communication channel. These fingerprints normally consist of a chain of hexadecimal characters, which are often impractical, cumbersome, and prone to misunderstandings for end-users. Hoeneisen & Marques Expires 19 June 2023 [Page 2] Internet-Draft IANA Registration of Trustword Lists December 2022 To mitigate these challenges, several systems offer Trustword comparison as an alternative to these hexadecimal strings. Trustwords are common words in a natural language (e.g., English), which these hexadecimal strings are mapped to. Using Trustwords makes verification processes like fingerprint comparisons more natural for users. For example, in pEp's Privacy by Default proposition [I-D.pep-general] Trustwords are used to facilitate easy contact verification for end-to-end encryption. Trustword comparison is offered after the peers have opportunistically exchanged public keys. Examples of Trustword lists used by current pEp implementations can be found here in CSV format: https://gitea.pep.foundation/pEp.foundation/pEpEngine/src/branch/ master/db In addition to contact verification, Trustwords are also used for other purposes, such as Human-Readable 128-bit Keys [RFC1751], One Time Passwords (OTP) [RFC1760] [RFC2289], SSH host-key verification, VPN server certificate verification, deriving private keys in blockchain applications for cryptocurrencies, and to import or synchronize secret keys across multiple devices owned by a single user [I-D.pep-keysync]. Further ideas include the use of Trustwords for private key recovery in case of loss, contact verification in Extensible Messaging and Presence Protocol (XMPP) [RFC6120], or for X.509 certificate verification in browsers [RFC3647]. 1.1. Requirements Language The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here. 1.2. Terms The following terms are defined for the scope of this document: * pEp Handshake: The process of one user contacting another over an independent channel in order to verify Trustwords (or fingerprints as a fallback). This can be done in-person or through established verbal communication channels, like a phone call. [I-D.pep-handshake] Hoeneisen & Marques Expires 19 June 2023 [Page 3] Internet-Draft IANA Registration of Trustword Lists December 2022 * Man-in-the-middle (MITM) attack: cf. [RFC4949], which states: "A form of active wiretapping attack in which the attacker intercepts and selectively modifies communicated data to masquerade as one or more of the entities involved in a communication association." Note: Historically, MITM has stood for '_Man_-in-the-middle'. However, to indicate that the entity in the middle is not always a human attacker, MITM can also stand for 'Machine-in-the-middle' or 'Meddler-in-the-middle'. 2. The Concept of Trustword Mapping 2.1. Example As already discussed, fingerprints normally consist of a string of hexadecimal characters. A typical fingerprint looks like this: F482 E952 2F48 618B 01BC 31DC 5428 D7FA ACDC 3F13 Instead of the hexadecimal string, Trustwords allow users to compare ten common words of a language of their choosing. For example, the above fingerprint, mapped to English Trustwords, might appear as: dog house brother town fat bath school banana kite task The same fingerprint might appear in German Trustwords as: klima gelb lappen weg trinken alles kaputt rasen rucksack durch Note: These examples are for illustration purposes only, and are not derived from any published Trustword list. 2.2. Previous work The basic concept of Trustword mapping - also known as a biometric word list - for fingerprint comparison is well-documented. Examples of this concept are used with One-Time Passwords (OTP) [RFC1751] [RFC1760] [RFC2289], as well as the PGP Word List ("Pretty Good Privacy word list" [PGP.wl]. Furthermore, cryptocurrencies use a similar concept for deriving private keys [bitcoin.wl]. Regarding today's needs, previous proposals have the following shortcomings: * Small/limited word lists, which generally result in more words to compare Hoeneisen & Marques Expires 19 June 2023 [Page 4] Internet-Draft IANA Registration of Trustword Lists December 2022 * Existing word lists are usually only available in English, which limits their usefulness for non-English speakers Furthermore, there are differences in the basic concept: * The Trustword concept suggested herein intends to improve usability and security for all users, instead of only the technically-savvy. * In many use cases, Trustwords are only read (aloud) during the comparison process, rather than being written or typed. For example, two users might compare their respective Trustwords during a phone call. Verbal comparison reduces the need to keep the actual Trustwords short. The use of longer Trustwords increases the entropy within the system, as it allows for a larger dictionary, and thus reduces the likelihood of phonetic collisions. 2.3. Number of Trustwords for a language If the number of Trustwords in a dictionary is low, shorter parts of the original string (e.g., fingerprint) can be mapped to a single Trustword. Thus, many Trustwords will need to be compared, which results in a potentially cumbersome process for users, and lead to reduced usability. To reduce the number of Trustwords that need to be compared, pEp's Privacy by Default proposition [I-D.pep-general] calls for 16-bit scalars to be mapped to natural language words. Therefore, the size (by number of key-value pairs) of any key-value pair structure is 65536. However, the number of unique values to be used in a language may be smaller than this number. This discrepancy can be addressed by using the same value, or Trustword, for more than one key. In such cases, the entropy of the representation is slightly reduced. For example, a Trustword list of 42000 words still allows for an entropy of log_2(42000), which is roughly 15.36 bits in 16-bit mappings. As a consequence such Trustword lists are not bijective. On the other hand, small Trustword lists allow for Trustwords consisting of words with shorter strings (number of short words per natural language is normally limited), which are easier to use in implementations where Trustwords have to be typed or written, such as in OTP applications. Note: This specification allows for registration of variable numbers of Trustwords per dictionary. Hoeneisen & Marques Expires 19 June 2023 [Page 5] Internet-Draft IANA Registration of Trustword Lists December 2022 2.4. Language Although English is used around the world, the vast majority of the global population is not English-speaking. For an application to be useful to as wide of a user base as possible, localization is essential. Therefore, this specification allows for registration of Trustword lists in different languages. In applications where two humans are attempting to establish secure communications, it is likely that they share a common language. At this time, no real-world use cases for Trustword list translation capability have been identified. Because the translation process inherently - and drastically - increases complexity from an IANA registration standpoint, the topic of Trustword translation is beyond the scope of this document. 2.5. The nature of the words Every Trustword list SHOULD be clear of offensive language (i.e., swear/curse words, slurs, derogatory language, etc.). This process SHOULD be performed by native speakers of each respective language. 3. Security Considerations There are no specific security considerations to this document. 4. Privacy Considerations There are no specific privacy considerations to this document. 5. IANA Considerations Each natural language requires a different set of Trustwords. To allow implementers for identical Trustword lists, a IANA registry is to be established. The IANA registration policy according to [RFC8126] is "Expert Review" and "Specification Required". [[ Note: Further details of the IANA registry and requirements for the expert to assess the specification are for further study. A similar approach as used in [RFC6117] is likely followed. ]] 5.1. Registration Template (XML chunk) Hoeneisen & Marques Expires 19 June 2023 [Page 6] Internet-Draft IANA Registration of Trustword Lists December 2022 first second [...] last Hoeneisen & Marques Expires 19 June 2023 [Page 7] Internet-Draft IANA Registration of Trustword Lists December 2022 Authors of a Wordlist are encouraged to use these XML chunks as a template to create the IANA Registration Template. 5.2. IANA Registration An IANA registration will contain the fallowing elements: 5.2.1. Language Code () The language code follows the ISO 639-3 specification [ISO639], e.g., eng, deu. [[ Note: It is for further study, which of the ISO 639 Specifications is most suitable to address the Trustwords' challenge. ]] Example usage for German: e.g. deu 5.2.2. Bit Size () The bit size is the number of bits that can be mapped with the Wordlist. The number of registered words in a word list MUST be 2 ^ (). Example usage for 16-bit Wordlist: e.g. 16 5.2.3. Number Of Unique Words () The number of unique words that are registered. e.g. 65536 5.2.4. Bijectivity () Whether the registered Wordlist has a one-to-one mapping, meaning the number of unique words registered equals 2 ^ (). Hoeneisen & Marques Expires 19 June 2023 [Page 8] Internet-Draft IANA Registration of Trustword Lists December 2022 Valid content: ( yes | no ) e.g. yes 5.2.5. Version () The version of the Wordlist MUST be unique within a language code. [[ Note: Requirements to a "smart" composition of the version number are for further study ]] e.g. b.1.2 5.2.6. Registration Document(s) () Reference(s) to the Document(s) containing the Wordlist e.g. e.g. (obsoleted by RFC 9999) e.g. [International Telecommunications Union, "Wordlist for Foobar application", ITU-F Recommendation B.193, Release 73, Mar 2009.] 5.2.7. Requesters () The persons requesting the registration of the Wordlist. Usually these are the authors of the Wordlist. Hoeneisen & Marques Expires 19 June 2023 [Page 9] Internet-Draft IANA Registration of Trustword Lists December 2022 e.g. John Doe Example Inc. mailto:john.doe@example.com 2018-06-20 Note: If there is more than one requester, there must be one element per requester in the element, and one chunk per requester in the element. 5.2.8. Further Information () Any other information the authors deem interesting. e.g. more info goes here Note: If there is no such additional information, then the element is omitted. 5.2.9. Wordlist () The full Wordlist to be registered. The number of words MUST be a power of 2 as specified above. The element names serve as key used for enumeration of the Trustwords (starting at 0) and the elements contains the values being individual natural language words in the respective language. e.g. first second [...] last ] ]> [[ Note: The exact representation of the Wordlist is for further study. ]] Hoeneisen & Marques Expires 19 June 2023 [Page 10] Internet-Draft IANA Registration of Trustword Lists December 2022 6. Acknowledgments The authors would like to thank the following people who have provided feedback or significant contributions to the development of this document: Andrew Sullivan, Claudio Luck, Daniel Kahn Gilmore, Kelly Bristol, Michael Richardson, Rich Salz, Volker Birk, and Yoav Nir. This work was initially created by pEp Foundation, and then reviewed and extended with funding by the Internet Society's Beyond the Net Programme on standardizing pEp. [ISOC.bnet] 7. References 7.1. Normative References [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, DOI 10.17487/RFC2119, March 1997, . [RFC4949] Shirey, R., "Internet Security Glossary, Version 2", FYI 36, RFC 4949, DOI 10.17487/RFC4949, August 2007, . [RFC8126] Cotton, M., Leiba, B., and T. Narten, "Guidelines for Writing an IANA Considerations Section in RFCs", BCP 26, RFC 8126, DOI 10.17487/RFC8126, June 2017, . [RFC8174] Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174, May 2017, . 7.2. Informative References [bitcoin.wl] "Seed Phrase", June 2019, . [I-D.pep-general] Birk, V., Marques, H., and B. Hoeneisen, "pretty Easy privacy (pEp): Privacy by Default", Work in Progress, Internet-Draft, draft-pep-general-01, 21 October 2022, . Hoeneisen & Marques Expires 19 June 2023 [Page 11] Internet-Draft IANA Registration of Trustword Lists December 2022 [I-D.pep-handshake] Marques, H. and B. Hoeneisen, "pretty Easy privacy (pEp): Contact and Channel Authentication through Handshake", Work in Progress, Internet-Draft, draft-marques-pep- handshake-05, 8 July 2020, . [I-D.pep-keysync] Birk, V., Hoeneisen, B., and K. Bristol, "pretty Easy privacy (pEp): Key Synchronization Protocol (KeySync)", Work in Progress, Internet-Draft, draft-pep-keysync-02, 13 July 2020, . [ISO639] "Language codes - ISO 639", n.d., . [ISOC.bnet] Simao, I., "Beyond the Net. 12 Innovative Projects Selected for Beyond the Net Funding. Implementing Privacy via Mass Encryption: Standardizing pretty Easy privacy's protocols", June 2017, . [PGP.wl] "PGP word list", November 2017, . [RFC1751] McDonald, D., "A Convention for Human-Readable 128-bit Keys", RFC 1751, DOI 10.17487/RFC1751, December 1994, . [RFC1760] Haller, N., "The S/KEY One-Time Password System", RFC 1760, DOI 10.17487/RFC1760, February 1995, . [RFC2289] Haller, N., Metz, C., Nesser, P., and M. Straw, "A One- Time Password System", STD 61, RFC 2289, DOI 10.17487/RFC2289, February 1998, . [RFC3647] Chokhani, S., Ford, W., Sabett, R., Merrill, C., and S. Wu, "Internet X.509 Public Key Infrastructure Certificate Policy and Certification Practices Framework", RFC 3647, DOI 10.17487/RFC3647, November 2003, . Hoeneisen & Marques Expires 19 June 2023 [Page 12] Internet-Draft IANA Registration of Trustword Lists December 2022 [RFC6117] Hoeneisen, B., Mayrhofer, A., and J. Livingood, "IANA Registration of Enumservices: Guide, Template, and IANA Considerations", RFC 6117, DOI 10.17487/RFC6117, March 2011, . [RFC6120] Saint-Andre, P., "Extensible Messaging and Presence Protocol (XMPP): Core", RFC 6120, DOI 10.17487/RFC6120, March 2011, . Appendix A. IANA XML Template Example This section contains a non-normative example of the IANA Registration Template XML chunk. lat 16 57337 no n.0.1 This Wordlist has been optimized for the Roman Standards Process. errare humanum [...] est Julius Caesar Curia Romana mailto:julius.cesar@example.com 1999-12-31 Hoeneisen & Marques Expires 19 June 2023 [Page 13] Internet-Draft IANA Registration of Trustword Lists December 2022 Appendix B. Document Changelog [[ RFC Editor: This section is to be removed before publication ]] * draft-pep-trustwords-00: - Updated dangling link - Updated Security/Privacy Considerations - Updated terms and references - Minor editorial changes * draft-birk-pep-trustwords-05: - Update terms and references * draft-birk-pep-trustwords-04: - Add Privacy Considerations section - Swapped Security and IANA Consideration Sections - Corrected typo in ISO references - Updated Introduction, Terms and concept Sections * draft-birk-pep-trustwords-03: - Update references - Minor edits * draft-birk-pep-trustwords-02: - Minor editorial changes and bug fixes - Added more items to Open Issues - Add usage example * draft-birk-pep-trustwords-01: - Included feedback from mailing list and IETF-101 SECDISPATCH WG, e.g. Hoeneisen & Marques Expires 19 June 2023 [Page 14] Internet-Draft IANA Registration of Trustword Lists December 2022 o Added more explanatory text / less focused on the main use case o Bit size as parameter - Explicitly stated translations are out-of-scope for this document - Added draft IANA XML Registration template, considerations, explanation and examples - Added Changelog to Appendix - Added Open Issue section to Appendix Appendix C. Open Issues [[ RFC Editor: This section should be empty and is to be removed before publication. ]] * Better explain previous work on Trustwords * More explanatory text for Trustword use cases, properties and requirements * Further details of the IANA registry and requirements for the expert to assess the specification * Decide which ISO language code either 639-1 or 639-3 to use, i.e., ISO-639-1 (e.g., ca, de, en, ...) as currently used in pEp implementations (running code) or ISO-639-3 (eng, deu, ita, ...) * Adjust exact representation of wordlists - e.g. XML, CSV, ... - Syntax for non-ASCII letters or language symbols (UTF-8) in Wordlists * Need for optional entropy value assigned to words, to account for similar phonetics among words in the same wordlist? * Need for an additional field, to define what a wordlist is optimized for, e.g., "entropy", "minimize word lengths", ...? * Work out (requirements for) "smart" composition of the version number Hoeneisen & Marques Expires 19 June 2023 [Page 15] Internet-Draft IANA Registration of Trustword Lists December 2022 * Decide whether in non-bijective Wordlists the redundant words need to be repeated in the IANA Registration * Register only a hash over the wordlist with IANA? * Does it make sense to open registrations for other patterns than just words, e.g., images? Authors' Addresses Bernie Hoeneisen pEp Foundation Oberer Graben 4 CH- 8400 Winterthur Switzerland Email: bernie.hoeneisen@pep.foundation URI: https://pep.foundation/ Hernani Marques pEp Foundation Oberer Graben 4 CH- 8400 Winterthur Switzerland Email: hernani.marques@pep.foundation URI: https://pep.foundation/ Hoeneisen & Marques Expires 19 June 2023 [Page 16]