Internationalized Email Addresses in X.509 certificates
Isode Ltd14 Castle MewsHamptonMiddlesexTW12 2NPUKAlexey.Melnikov@isode.comGoogle, Inc.1600 Amphitheatre ParkwayMountain ViewCA94043USweihaw@google.com
Security
LAMPSEAIPKIXemail address
This document defines a new name form for inclusion in the otherName
field of an X.509 Subject Alternative Name and Issuer Alternate Name
extension that allows a certificate subject to be associated with an
Internationalized Email Address.
defines rfc822Name subjectAltName choice for representing
email addresses. This form is restricted to a subset of US-ASCII
characters and thus can't be used to represent Internationalized Email addresses
. To facilitate use of these
Internationalized Email addresses with X.509 certificates, this document
specifies a new name form in otherName so that subjectAltName and issuerAltName
can carry them. In addition this document calls for all email address domain in X.509 certificates
to conform to IDNA2008 .
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in
this document are to be interpreted as described in
.The formal syntax use the Augmented
Backus-Naur Form (ABNF) notation.
The GeneralName structure is defined in , and supports
many different names forms including otherName for extensibility. This section
specifies the SmtpUtf8Name name form of otherName, so that Internationalized
Email addresses can appear in the subjectAltName of a certificate, the
issuerAltName of a certificate, or anywhere else that GeneralName is used.
id-on-smtpUtf8Name OBJECT IDENTIFIER ::= { id-on 9 }SmtpUtf8Name ::= UTF8String (SIZE (1..MAX))
When the subjectAltName (or issuerAltName) extension contains an Internationalized Email address,
the address MUST be stored in the SmtpUtf8Name name form of otherName. The
format of SmtpUtf8Name is defined as the ABNF rule SmtpUtf8Mailbox.
SmtpUtf8Mailbox is a modified version of the Internationalized
Mailbox which is defined in Section 3.3 of which is
itself derived from SMTP Mailbox from Section 4.1.2 of .
defines the following ABNF rules for Mailbox whose
parts are modified for internationalization: <Local-part>,
<Dot-string>, <Quoted-string>, <QcontentSMTP>, <Domain>,
and <Atom>. In particular, <Local-part> was updated to also support
UTF8-non-ascii. UTF8-non-ascii is described by Section 3.1 of . Also, sub-domain is extended to support U-label, as
defined in .
This document further refines Internationalized Mailbox ABNF rules
and calls this SmtpUtf8Mailbox. In SmtpUtf8Mailbox, sub-domain that encode non-ASCII characters SHALL
use U-label Unicode native character labels and MUST NOT use A-label .
This restriction prevents having to determine which label
encoding A- or U-label is present in the Domain. As per
Section 2.3.2.1 of , U-label use
UTF-8 with Normalization Form C and other properties
specified there. In SmtpUtf8Mailbox, sub-domain that encode solely
ASCII character labels SHALL use NR-LDH restrictions as specified by
section 2.3.1 of and restricted to lower case letters.
Note that a SmtpUtf8Mailbox has no
phrase (such as a common name) before it, has no comment (text surrounded in
parentheses) after it, and is not surrounded by "<" and ">".
In the context of building name constraint as needed by ,
the SmtpUtf8Mailbox rules are modified to allow partial productions to allow
for additional forms required by . Name constraints
may specify a complete email address, host name, or domain. This means
that the local-part may be missing, and domain partially specified.
SmtpUtf8Name is encoded as UTF8String. The UTF8String encoding MUST NOT
contain a Byte-Order-Mark (BOM) to aid consistency
across implementations particularly for comparison.
To facilitate comparison between email addresses, all email address domain in X.509 certificates
MUST conform to IDNA2008 . Otherwise non-conforming email address
domains introduces the possibility of conversion errors between alternate forms. This applies to
SmtpUtf8Mailbox and rfc822Name in subjectAltName, issuerAltName and anywhere else that GeneralName
is used.
In equivalence comparison with SmtpUtf8Name, there may be some setup work to enable
the comparison i.e. processing of the SmtpUtf8Name content or the email
address that is being compared against.
The process for setup for comparing with SmtpUtf8Name is split into domain
steps and local-part steps. The comparison form for local-part always is UTF-8.
The comparison form for domain depends on context.
While some contexts such as certificate path validation in
specify transforming domain to A-label, this document RECOMMENDS transforming to UTF-8
U-label instead. This reduces the likelihood of errors by reducing conversions as more
implementations natively support U-label domains.
Comparison of two SmtpUtf8Name can be straightforward. No setup work is needed and it
can be an octet for octet comparison. For other email address forms such as
Internationalized email address or rfc822Name, the comparison requires additional
setup to convert the format for comparison. Domain setup is particularly important
for forms that may contain A- or U-label such as International email address, or
A-label only forms such as rfc822Name. This document specifies the process to
transform the domain to U-label. (To convert the domain to A-label, follow
the process specified in section 7.5 and 7.2 in )
The first step is to detect A-label by using section 5.1 of .
Next if necessary, transform the A-label to U-label Unicode as specified in
section 5.2 of . Finally if necessary convert the
Unicode to UTF-8 as specified in section 3 of . For
ASCII NR-LDH labels, upper case letters are converted to lower case letters. In
setup for SmtpUtf8Mailbox, the email address local-part MUST conform to the
requirements of and , including
already being a string in UTF-8 form. In particular, the local-part
MUST NOT be transformed in any way, such as by doing case folding or normalization
of any kind. The <Local-part> part of an Internationalized
email address is already in UTF-8. For rfc822Name the local-part, which is IA5String
(ASCII), trivially maps to UTF-8 without change. Once setup is completed,
comparison is again checking for octet for octet equivalence.
To summarize non-normatively the domain setup steps are:
if the domain contains A-labels, transform them to U-labelif the domain contains ASCII NR-LDH labels, lowercase them
This enables an octet for octet comparison.
This specification expressly does not define any wildcards characters and
SmtpUtf8Name comparison implementations MUST NOT interpret any character as
wildcards. Instead, to specify multiple specifying multiple email addresses
through SmtpUtf8Name, the certificate should use multiple subjectAltNames or
issuerAltNames to explicitly carry those email addresses.
This section defines use of SmtpUtf8Name name for name constraints. The format
for SmtpUtf8Name in name constraints is identical to the use in subjectAltName as
specified in with the extension as noted there
for partial productions.
Constraint comparison on complete email address with
SmtpUtf8Name name uses the matching procedure defined by .
As with rfc822Name name constraints as specified in Section 4.2.1.10 of , SmtpUtf8Name name can specify a particular mailbox, all addresses
at a host, or all mailboxes in a domain by specifying the complete email
address, a host name, or a domain.
Name constraint comparisons in the context is
specified with SmtpUtf8Name name are only done on the subjectAltName
(and issuerAltName) SmtpUtf8Name name, and says nothing more about constraints on other
email address forms such as rfc822Name. Consequently it may be necessary to include
other name constraints such as rfc822Name in addition to SmtpUtf8Name to constrain
all potential email addresses. For example a domain with both
ASCII and non-ASCII local-part email addresses may require both rfc822Name and
SmtpUtf8Name name constraints. This can be illustrated in the following
non-normative diagram
which shows a name constraint set in the intermediate CA certificate, which
then applies to the children entity certificates. Note that a constraint
on rfc822Name does not apply to SmtpUtf8Name and vice versa as is
shown in non-normative diagram .
For email addresses whose local-part is ASCII it may be more reasonable to
continue using rfc822Name instead of SmtpUtf8Name. The use of rfc822Name rather than
SmtpUtf8Name is currently more likely to be supported. Also use of SmtpUtf8Name incurs higher
byte representation overhead due to encoding with otherName and the additional OID
needed. This may be offset if domain requires non-ASCII characters as smptUtf8Name
supports U-label whereas rfc822Name supports A-label. This document RECOMMENDS using SmtpUtf8Name
when local-part contains non-ASCII characters, and otherwise rfc822Name.
Use for SmtpUtf8Name for certificate subjectAltName (and issuerAltName) will incur many of the
same security considerations of Section 8 in but is further
complicated by permitting non-ASCII characters in the email address local-part.
This complication, as mentioned in Section 4.4 of and in Section 4 of
, is that use of Unicode introduces the risk of visually
similar and identical characters which can be exploited to deceive the recipient.
The former document references some means to mitigate against these attacks.
This document makes use of object identifiers for the SmtpUtf8Name defined
in Section and the ASN.1 module identifier
defined in Section . IANA is kindly
requested to make the following assignments for:
The LAMPS-EaiAddresses-2016 ASN.1 module in the "SMI
Security for PKIX Module Identifier" registry (1.3.6.1.5.5.7.0).
The SmtpUtf8Name otherName in the "PKIX Other Name Forms" registry
(1.3.6.1.5.5.7.8).The following ASN.1 module normatively specifies the SmtpUtf8Name structure.
This specification uses the ASN.1 definitions from
with the 2002 ASN.1 notation used in that document.
updates normative documents using older ASN.1 notation.This non-normative example demonstrates using SmtpUtf8Name as an otherName in
GeneralName to encode the email address "u+8001u+5E2B@example.com".The example was encoded on the OSS Nokalva ASN.1 Playground and the above text decoding
is an output of Peter Gutmann's "dumpasn1" program.Thank you to Magnus Nystrom for motivating this document. Thanks to
Russ Housley, Nicolas Lidzborski, Laetitia Baudoin, Ryan Sleevi, Sean Leonard, Sean Turner,
John Levine, Viktor Dukhovni and Patrik Falstrom for their feedback.
Also special thanks to John Klensin for his valuable input on internationalization,
Unicode and ABNF formatting, and to Jim Schaad for his help with the ASN.1 example
and his helpful feedback.