<?xml version="1.0" encoding="UTF-8"?>
  <?xml-stylesheet type="text/xsl" href="rfc2629.xslt" ?>
  <!-- generated by https://github.com/cabo/kramdown-rfc version 1.7.36 (Ruby 4.0.3) -->


<!DOCTYPE rfc  [
  <!ENTITY nbsp    "&#160;">
  <!ENTITY zwsp   "&#8203;">
  <!ENTITY nbhy   "&#8209;">
  <!ENTITY wj     "&#8288;">

]>


<rfc ipr="trust200902" docName="draft-rswg-rfc7997bis-10" category="info" submissionType="editorial" obsoletes="7997" tocInclude="true" sortRefs="true" symRefs="true">
  <front>
    <title abbrev="Text in RFCs">Text in RFCs</title>

    <author initials="P." surname="Hoffman" fullname="Paul Hoffman">
      <organization></organization>
      <address>
        <email>paul.hoffman@icann.org</email>
      </address>
    </author>

    <date year="2026" month="May" day="20"/>

    
    
    <keyword>Internet-Draft</keyword>

    <abstract>


<?line 41?>

<t>This document sets policy for the inclusion of characters in the definitive versions and publication formats of RFCs.
The policy for the RFC Series is that all displayable text is allowed as long as there is a high expectation that readers of an RFC will be able to interpret its text as intended.
This document obsoletes RFC 7997.</t>



    </abstract>



  </front>

  <middle>


<?line 47?>

<section anchor="introduction"><name>Introduction</name>

<t>The early policy for the RFC Series was that RFCs could only contain characters from the ASCII character set.
Later policies, from <xref target="RFC7997"/>, allowed more characters, set the language of the RFC Series to be English, and set the encoding for RFCs of UTF-8.
In the time since <xref target="RFC7997"/> was published, the IETF community has had much more experience of using non-ASCII characters in RFCs.</t>

<t>This document obsoletes <xref target="RFC7997"/>.
This document makes substantial changes to the policies in <xref target="RFC7997"/> based on the positive experience since its publication.</t>

<t>The RFC Publication Center (RPC) is responsible for implementing the policies in this document, as described in <xref target="RFC9720"/>.
The RPC style guides may define which characters authors may use and how they are used.</t>

<section anchor="terminology"><name>Terminology</name>

<t>The term "non-ASCII characters" means characters outside the set that was defined in ASCII.
ASCII is described in <xref target="RFC20"/>.</t>

<t>The term "Unicode characters" means characters defined in <xref target="UnicodeLatest"/>.</t>

<t>"U+ notation" means using the characters "U+" and a hexadecimal number to represent a Unicode code point.
See <xref target="BCP137"/> for more on U+ notation.</t>

<t>More terminology about characters and encoding formats can be found in <xref target="RFC6365"/>.</t>

</section>
</section>
<section anchor="basic-requirements-for-text-in-rfcs"><name>Basic Requirements for Text in RFCs</name>

<t>RFCs should only contain text that can be displayed correctly across a wide range of readers and browsers.
People whose systems do not have the fonts needed to display part of a particular RFC still need to be able to read the definitive versions and publication formats correctly in order to understand and implement the information described in the document.</t>

<t>The ability to use non-ASCII characters in RFCs in a clear and consistent manner will allow the correct display of proper names and improve the ability to describe internationalized protocols.
Apart from their role in proper names, non-ASCII characters should be used only when they enhance the technical content and accuracy of the document.</t>

</section>
<section anchor="policy-for-text-in-rfcs"><name>Policy for Text in RFCs</name>

<t>English is the required language of RFCs.
However, because non-ASCII characters are often required for instances including proper names and examples, the policy for the RFC Series is that all displayable text is allowed as long as there is a high expectation that readers of an RFC will be able to interpret its text as intended.
Apart from their role in proper names, non-ASCII characters should be used only when they enhance the technical content and accuracy of the document.</t>

<t>There are many Unicode characters that obviously cannot be displayed (such as control characters), and many whose ability to be displayed is debatable.
If an RFC includes such characters in normative or descriptive text, the RFC needs to also clearly describe the character.</t>

<t>The preferred method for describing such characters is using the U+ notation from <xref target="BCP137"/> and/or using the character's official name from the Unicode Standard <xref target="UnicodeLatest"/>.
<xref target="BCP137"/> describes the pros and cons of different options for identifying Unicode characters and may help authors decide how to represent the non-ASCII characters in their documents.</t>

<t>Note that this policy only applies to normative or descriptive text; text such as names does not need character description.
Further, some RFC authors might choose to use something other than the U+ notation to describe characters, such as if the RFC already covers a different syntax that the reader will understand from the rest of the RFC.</t>

<t>Characters in an RFC will generally appear in Normalization Form C (NFC) as defined in <xref target="UnicodeLatest"/>.
If the RFC would be more correct and more understandable with particular characters not in NFC, the RPC can use unnormalized text.
In such a case, a note should be included to describe why unnormalized text was used.</t>

<section anchor="names"><name>Names</name>

<t>Authors of RFCs whose names include non-ASCII characters will likely have preferences for how their names are displayed.
If the author's preferred name is in a non-Latin script, they must also provide an alternate name in Latin script.
Authors' preferences for display of their names should be honored.</t>

<t>Company names and geographic names generally do not need Latin alternates, but they can be included at the discretion of the author and the RPC.</t>

</section>
<section anchor="examples"><name>Examples</name>

<t>Where the use of non-ASCII characters is purely part of an example or not otherwise required for correct protocol operation, giving the Unicode code points and Unicode names of the non-ASCII characters is not required, but it can improve the readability of the RFC.
An RFC can use either or both forms, whichever is sensible in the circumstance.
For example, for text that might just say "The value can be followed by a monetary symbol such as ¥ or €", text that says one of the following is likely more beneficial to the reader:</t>

<t><list style="symbols">
  <t>The value can be followed by a monetary symbol such as ¥ (U+00A5) or € (U+20AC)</t>
  <t>The value can be followed by a monetary symbol such as ¥ (YEN SIGN) or € (EURO SIGN)</t>
  <t>The value can be followed by a monetary symbol such as ¥ (U+00A5, YEN SIGN) or € (U+20AC, EURO SIGN)</t>
</list></t>

<t>RFCs may be viewed using only black and white or grayscale, particularly when printed.
Because of this, examples should generally use characters that do not specify a color.
However, some examples might require text with color due to the nature of the examples.
If so, those examples need to also include U+ notation.
For example, "A color display should be able to differentiate 🔴 (U+1F534, LARGE RED CIRCLE), 🟢 (U+1F7E2, LARGE GREEN CIRCLE), and 🔵 (U+1F535, LARGE BLUE CIRCLE)."</t>

</section>
</section>
<section anchor="rfc-publication-language"><name>RFC Publication Language</name>

<t>The RFC publication language is English.</t>

</section>
<section anchor="rfc-publication-encoding"><name>RFC Publication Encoding</name>

<t>The encoding format for the RFC Series is UTF-8 <xref target="STD63"/>.</t>

</section>
<section anchor="iana-considerations"><name>IANA Considerations</name>

<t>This document contains no IANA considerations.</t>

</section>
<section anchor="security-considerations"><name>Security Considerations</name>

<t>Authors and the RPC should cross-check that the used characters match their code point numbers or Unicode character names.</t>

</section>


  </middle>

  <back>


<references title='References' anchor="sec-combined-references">

    <references title='Normative References' anchor="sec-normative-references">



<referencegroup anchor="BCP137" target="https://www.rfc-editor.org/info/bcp137">
  <reference anchor="RFC5137" target="https://www.rfc-editor.org/info/rfc5137">
    <front>
      <title>ASCII Escaping of Unicode Characters</title>
      <author fullname="J. Klensin" initials="J." surname="Klensin"/>
      <date month="February" year="2008"/>
      <abstract>
        <t>There are a number of circumstances in which an escape mechanism is needed in conjunction with a protocol to encode characters that cannot be represented or transmitted directly. With ASCII coding, the traditional escape has been either the decimal or hexadecimal numeric value of the character, written in a variety of different ways. The move to Unicode, where characters occupy two or more octets and may be coded in several different forms, has further complicated the question of escapes. This document discusses some options now in use and discusses considerations for selecting one for use in new IETF protocols, and protocols that are now being internationalized. This document specifies an Internet Best Current Practices for the Internet Community, and requests discussion and suggestions for improvements.</t>
      </abstract>
    </front>
    <seriesInfo name="BCP" value="137"/>
    <seriesInfo name="RFC" value="5137"/>
    <seriesInfo name="DOI" value="10.17487/RFC5137"/>
  </reference>
</referencegroup>
<referencegroup anchor="STD63" target="https://www.rfc-editor.org/info/std63">
  <reference anchor="RFC3629" target="https://www.rfc-editor.org/info/rfc3629">
    <front>
      <title>UTF-8, a transformation format of ISO 10646</title>
      <author fullname="F. Yergeau" initials="F." surname="Yergeau"/>
      <date month="November" year="2003"/>
      <abstract>
        <t>ISO/IEC 10646-1 defines a large character set called the Universal Character Set (UCS) which encompasses most of the world's writing systems. The originally proposed encodings of the UCS, however, were not compatible with many current applications and protocols, and this has led to the development of UTF-8, the object of this memo. UTF-8 has the characteristic of preserving the full US-ASCII range, providing compatibility with file systems, parsers and other software that rely on US-ASCII values but are transparent to other values. This memo obsoletes and replaces RFC 2279.</t>
      </abstract>
    </front>
    <seriesInfo name="STD" value="63"/>
    <seriesInfo name="RFC" value="3629"/>
    <seriesInfo name="DOI" value="10.17487/RFC3629"/>
  </reference>
</referencegroup>
<reference anchor="RFC7997">
  <front>
    <title>The Use of Non-ASCII Characters in RFCs</title>
    <author fullname="H. Flanagan" initials="H." role="editor" surname="Flanagan"/>
    <date month="December" year="2016"/>
    <abstract>
      <t>In order to support the internationalization of protocols and a more diverse Internet community, the RFC Series must evolve to allow for the use of non-ASCII characters in RFCs. While English remains the required language of the Series, the encoding of future RFCs will be in UTF-8, allowing for a broader range of characters than typically used in the English language. This document describes the RFC Editor requirements and gives guidance regarding the use of non-ASCII characters in RFCs.</t>
      <t>This document updates RFC 7322. Please view this document in PDF form to see the full text.</t>
    </abstract>
  </front>
  <seriesInfo name="RFC" value="7997"/>
  <seriesInfo name="DOI" value="10.17487/RFC7997"/>
</reference>
<reference anchor="RFC9720">
  <front>
    <title>RFC Formats and Versions</title>
    <author fullname="P. Hoffman" initials="P." surname="Hoffman"/>
    <author fullname="H. Flanagan" initials="H." surname="Flanagan"/>
    <date month="January" year="2025"/>
    <abstract>
      <t>In order to improve the readability of RFCs while supporting their archivability, the definitive version of the RFC Series transitioned from plain-text ASCII to XML using the RFCXML vocabulary; different publication versions are rendered from that base document. This document describes how RFCs are published.</t>
      <t>This document obsoletes RFC 7990. This document also updates the stability policy in RFC 9280.</t>
    </abstract>
  </front>
  <seriesInfo name="RFC" value="9720"/>
  <seriesInfo name="DOI" value="10.17487/RFC9720"/>
</reference>

<reference anchor="UnicodeLatest" target="http://www.unicode.org/versions/latest/">
  <front>
    <title>The Unicode Standard</title>
    <author >
      <organization>The Unicode Consortium</organization>
    </author>
    <date year="n.d."/>
  </front>
</reference>


    </references>

    <references title='Informative References' anchor="sec-informative-references">



<reference anchor="RFC20">
  <front>
    <title>ASCII format for network interchange</title>
    <author fullname="V.G. Cerf" initials="V.G." surname="Cerf"/>
    <date month="October" year="1969"/>
  </front>
  <seriesInfo name="STD" value="80"/>
  <seriesInfo name="RFC" value="20"/>
  <seriesInfo name="DOI" value="10.17487/RFC0020"/>
</reference>
<reference anchor="RFC6365">
  <front>
    <title>Terminology Used in Internationalization in the IETF</title>
    <author fullname="P. Hoffman" initials="P." surname="Hoffman"/>
    <author fullname="J. Klensin" initials="J." surname="Klensin"/>
    <date month="September" year="2011"/>
    <abstract>
      <t>This document provides a list of terms used in the IETF when discussing internationalization. The purpose is to help frame discussions of internationalization in the various areas of the IETF and to help introduce the main concepts to IETF participants. This memo documents an Internet Best Current Practice.</t>
    </abstract>
  </front>
  <seriesInfo name="BCP" value="166"/>
  <seriesInfo name="RFC" value="6365"/>
  <seriesInfo name="DOI" value="10.17487/RFC6365"/>
</reference>



    </references>

</references>


<?line 140?>

<section anchor="acknowledgements"><name>Acknowledgements</name>

<t>This document is based on <xref target="RFC7997"/> which was authored by Heather Flanagan.</t>

<t>The acknowledgements from <xref target="RFC7997"/> are
to the members of the IAB i18n program,
to the RFC Format Design Team:
Nevil Brownlee, Tony Hansen, Joe
Hildebrand, Paul Hoffman, Ted Lemon, Julian Reschke, Adam Roach,
Alice Russo, Robert Sparks, and Dave Thaler.</t>

<t>Writing this document was greatly helped by contributions from the RFC Series Working Group (RSWG), including:
Brian Carpenter,
Carsten Bormann,
Eliot Lear,
John Klensin,
John Levine,
Martin Dürst,
Martin Thomson,
Pete Resnick,
Rob Sayre, and
Russ Housley.</t>

</section>


  </back>

<!-- ##markdown-source:
H4sIAAAAAAAAA9VZzXIbuRG+z1Og5MPatUNatuP1WrmEoilbjlZRSXK5ckph
ZkAOohmAC8yIZlx7yTPk7qpU5ZZjcsspeZM8wT5Cvm5gfkjJTuXnkPhgk0NM
o/vrr79uwJPJJGl0U6kjca0+NEIbcXky94nMMqdu9x4WNjeyxtLCyWUzcX6z
mrhl/uLlyxeZ9pMnh0niG2mKX8nKGixrXKsSGHmW2MzbSjXKHwlanSR67fh3
3zw9PHx5+DS52RyJU9MoZ1QzeUX2k1w2R9h7aRPfZrX2XltzvV3DsCp0Y52W
VZKs9VEiRGPzI7FVHh+9dY1TS99/39bD10S2TWkdXpngJxjH84upeGOXy1oa
ehQCvJBtNX6qaqmrI7HG42kZHv9M59KYqXWrJDHW1bLRt4p8OZ5fPHn2gj5d
Xb/65hl9AHoUdvz48sXTQ/r4zujcFupMApeGHiCOmIpSdb+KK0JUuoJ/793H
n4nA3rtr59ZQ+LqtgzXpVgoYlk2zPnr8eLPZTNuwktx+fKscQeofV+zBY2QF
YI8iga/BU3z45tk3z4+SZDKZCJn5xsm8SZLrUnsBVrS1Mo3wqvFibSudbwXs
iAaeaZNXLe0i7FLkpaT3sC1Rin4u1FIbTfuJzhuBcMW6zWAGjuDF4JInA8TC
aUIR722DH8SVclrBsscT2QhZVaLQfl3JrcwqJRpmsqfndqMKIb0AS1f0Lyw4
xb+JUq9KoT6sVd6E3dmWU7Igr+GC5FoQGw3zmRLBtEU8CGvtFLaAq7yX9PzU
FKqY7iHVlwPbImpMA7S1LopKJckDqgVnizYnJxIOWUlXbb8Q+EbGyAklkdu2
KoQ1eCW3ppEAfAT/0tma359dzU9Ph18oh9OEGOnCTjCchtUfP0YW//BD2oNY
WwA32E3pfbZbSbNq5UoRZHt+Ai0AtzCrSvsy5XR3bykDcmokhcLjMPD6u+uT
ybfT5DQwptG1Eh60UmOPOHgmjS9VkfLK08X1CWKva3C+2YoSK0oJl9u8DH5T
muERmcI2YCk2NtZM9jDxnf5N9wk/pHHkyn6ua3mDBVAwksYGmkWWzSoA0XRU
ZuaanZAy6RVlMC7yoUxGTgcUiG+jcpkGshDeF6MimisiqHh4eTF/REx3yq9R
bJroS2Drel0p8pdA2PeqGQeUErEL5XOnM/jXOU2iFoLH5hdz4ZstTK9ajaXA
YBtKXYlNqYH/CNwgaWFN6xXzobQbcmIrJNKEhyig5MEDNCNXa2Mru9qGKGGg
Fgf35exA1EpCTEYb2bbx8IajC4RDrWw4GPKMQ2Ez0yRY0/fFGaIc7d5p7xf3
Hm3x8eOO7rO1g3dfg3lBcbq3Ax/J2ZEdLDxghKBU6gNEKdc1GGXaOkN2QSin
IEKeeCf7rsB/rS3EaJpcKaqb0KLAMUo91wIoMvIBLn1HT5sBcCgdANxJHNwY
FyyLNHoiVffStmbAjHoHx/lAHEuvc3Gpvm+1Y7559mFnzEi48n15V8FYVzlv
cZ8o8EA2t85BtLFY5s560vINZdtRrVF9dxpObmfObjy+TJMLZUF8sNKCen7r
G1UT1QkJqMVtYMvSkp9GKUg5gRx3xTjgGu4J/EnnbSVZtkB+ag/0QlS7rk2Q
E/9y4xtCAwTWFSHTABgvNkwGwror4Nh2YyOHmR0O896xkiONZaYrEkiyCRC+
pID0rxR5hVbEm+YkIcCMZc4YOMZ9kbtDoG5wvUcMYK2dhYLxnOU7z52NSI98
6dwOrdVwLLLSv0EcWI+Bz1bI34xz0LUz7YSDIpOb423S+4OKDMuCxASmbUpl
gvQoA5nOg1uNyksUE2m3NRwuw57nLWxtux43wvWBuBj69C65Y+MLcwr4GUqh
2OmYodu8QYsFOVJ4mMvPpoYk0i7h1GCKBd0QN3IWcIxgXKR3oIeEEG18Okj+
/9FM9T+a+2uOnNKCmtiKu/0hxG+zW21bT+qG0oHc7MjZQ09jivS8KeIavf4o
zExsPOjWqGp2jHADy2RDgGKA6oEOjOCpZLcXA7v+KAOliTW45q+Ef9oTg7SN
JxhZeRsUodoONbvTtqLOIJNL5YiftULLDzSNbxA77zgz7oGj5tSNo30TAxqP
YeqehvkV0WuJOYZ6JNgwzL37p6v72vJoiy6wULPgl+/1jxhQ6CVC45Fw3bCW
cwkWNE8tt+TWPSwIScRgqqp1PwVRR8cyHoDG3Zy2/ZwyB/J3DKQx9dw2KrCM
R7dY2ExyuV5XcQr/Yqp/Ggquo2HQjMLiL+Iqt7bh1NC/TKPDSeuo+HEWsHUg
Sz/iQQhohLBE2thuaBGcBESW3iKvzZ2Mj9vBzmkjeqeHQ4asSFhoZLhlkEe5
8VtMER86XFRUoCA7o27acwTQN6PzC3Cd7+A+lq2VQveDDDLA1B7x+znhi44V
YjjBNzEXD89PMIHLfzYSng4RbTqlCoet2FGZPfRg8Jx1c6ObcjyOjKhCiSO3
TuaxjjGm0xxFaWiNid7SzILM84Er4ItFXkF0yIAaKWeUkWInPZtye9cYj9nD
HH9OZEqSWaRFbHhRzALTou37Oc+IV/pGVdswpAVpUdzuqPLi+UH3vc6NZLEH
N9ASGjEoE4uEjnMO7Y2M4HNgdxqaQ936JsgezS1UrcBQVmFIUdGEEeM3p12s
X91xdTQZjT0eUC4twGTk5rZek+gP/Xul7MrJNY5U8eFAwzjDcqEGV3oXUThZ
24Rg4hzdpzKWBrzK0Xjjvc0AFu8auRNyuYgjRJK8575HPxKh8Nr9ekWnVUeZ
66dn080hpEPkNCvBRnu1O9J01O/GP0FdnosrFSt927eKO8eeAFb3PCAVw/qc
j+RGt3mAS4dDx3haJQHpeu9YJ2ZBGLraUpqVDRFkiIzHeqSAD8I039F20Phw
Fo8jeq4d1DxMcFBUvBohSsN41p+DgqT+mijpwaIDarS3smrVcBKLs1kGaYJi
GNVIt6Vb0QwIdgL61z+Qe3//7R8P0pFxWARQpr/ECbYIZ7gc6481KAPtYpuN
lxpBW+nCUPz7Lj189/Xh4ez5o+gbfX96OJs/+s+M/nJxLq5OX58PZhfvLn8R
Hv033E3F3R2C46kY7RROuTQAYI9brWiHMMJwn84qmd8wb0GUhksDpb71GERB
gkHgu7l17Wgwhkwcx8MC50yDaN2Q32nKoBG0bH8sjbrhMa1jdCHxx/HfjQ4j
3NV7m4F/sVKi1lMH4rdE0aqOEBCe1vVM6t5nLfaWlJW0vzfbHZ1ZZrtesHNB
sVMTB7Nuw6img3x2h4l+DNAk0j9++t2fKCtPTp4/+0kqzmaXrxficvFKzE8v
52cLjNg/fvr0+7DixeJpt+L15QKp7ddQdmDpz52l592647N3i27Z9ICOg/v3
cWfxtDfc1o1P/v1ZEHUWz4vT+6ws4vVLvCDevYz5zEGOr1MxcvB/TsRbmdPZ
+Yz/8wDdLCiq37/rjFcwpIxheb6znM1cKRyKSAz3TXWdftQ9uhTxZc0ESgi2
95MZn8lGzEQ0qLLQHwdZj9denmrjznwdZD5eq2eoJXJwlt8Yu6lUsQp3T/tB
4nN/67pzu8yXljTEhDYYxOCNkizsJ0iXXMnu6lXubXLn+pzGkSSWRa1iCKEu
TmfHQj/5lo+wKPc67dZRDk9CWl8pr1dGXCtZHyXn6lZX4tjZjakUKuHaYkB4
Iw0aSireWpW80RXOgA7Ipzv/q4WlNBmomrrn27bSNM1iiCtvYGVWyFpcWpmX
aTID27B/66lMLy28bcQV9OfGhwJ4RQPYdQlZooPee+Q/NOIxroTcCi2B7rDo
uBPw46OtRm8NJ6Zu7B7R9b11N2TttbPtWjy8vHr/GmXXX2gcJceO/J5Lt+b7
7TTBR7qPEseElTFpsqg05OwMI3mavLWlET+vqNea+O0M+BmVJt+RoBrx6m9/
wfv91+vS1h4AJRcKogF4wLKbNAEK4kpunWIEEsIGsOIsr7bT5B8bfLCnUh0A
AA==

-->

</rfc>

