D. Connolly
Internet-Draft World Wide Web Consortium (W3C)
L. Masinter
Xerox Corporation
September 21, 1999
draft-connolly-text-html-00.txt
Obsoletes: RFC 1866, RFC 2070, RFC 1980, RFC 1867, RFC 1942
The 'text/html' Media Type
Status of this Memo
This document is an Internet-Draft and is in full conformance with
all provisions of Section 10 of RFC 2026.
This document is an Internet-Draft. Internet-Drafts are working
documents of the Internet Engineering Task Force (IETF), its areas,
and its working groups. Note that other groups may also distribute
working documents as Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six
months and may be updated, replaced, or obsoleted by other
documents at any time. It is inappropriate to use Internet-Drafts
as reference material or to cite them other than as ``work in
progress''.
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.
Copyright Notice
Copyright (C) The Internet Society (1999). All Rights Reserved.
Abstract
This document summarizes the history of HTML development, and
defines the "text/html" MIME type by pointing to the relevant W3C
recommendations, It is intended to obsolete the previous IETF
documents defining HTML, including RFC 1866, RFC 1867, RFC 1980,
RFC 1942 and RFC 2070.
This document was prepared at the request of the W3C HTML working
group. Please send comments to www-html@w3.org, a public mailing
list with archive at
.
1. Introduction and background
The text/html media type was originally defined in [HTML20] in Nov
1995. Extensions to HTML were proposed in [HTML30], [UPLOAD],
[TABLES], [CLIMAPS], and [I18N].
The IETF HTML working group closed Sep 1996, and work on HTML moved
to the World Wide Web Consortium (W3C). The proposed extensions
were incorporated, to some extent, in [HTML32], and to a larger
extent, in [HTML40]. The definition of multipart/form-data from
[UPLOAD] was described in [FORMDATA]. In addition, a reformulation
of HTML 4.0 in XML 1.0 is being developed [XHTML1].
[HTML32] notes "This specification defines HTML version 3.2. HTML
3.2 aims to capture recommended practice as of early '96 and as
such to be used as a replacement for HTML 2.0 (RFC 1866)."
Subsequent specifications for HTML describe the differences
in each version.
2. Definition for 'text/html'
The 'text/html' media type is now defined by W3C recommendations;
the latest published version is [HTML40]. As of this writing, a
revision, HTML 4.01 [HTML401], is being developed. In both [HTML40]
and [HTML401].
The "text/html" media type may be used to refer to any W3C
published version of HTML; the versions are distinguishable by the
"DOCTYPE" declaration contained within them. In addition, [XHTML1]
defines an profile of use of XHTML which is compatible with HTML
4.0, and which may also be served as 'text/html'.
MIME media type name: text
MIME subtype name: html
Required parameters: none
Optional parameters: charset
The optional parameter "charset" refers to the character encoding
used to represent the HTML document as a sequence of bytes. Any
registered IANA charset may be used, but UTF-8 is preferred.
Although this parameter is optional, it is recommended that it
always be present.
Encoding considerations:
Because of the availability within HTML itself for using
character entity references for non-ASCII characters, it is
possible that text/html documents with a wide repertoire may be
transported without encoding. Otherwise, transport of text/html
using charsets other than US-ASCII may require base64 or
quoted-printable encoding for 7-bit channels.
Security considerations:
See section 3 of this document.
Additional information:
Magic number:
There is no single initial string that is always present for
HTML files.
Almost all HTML files have the string ".
[HTML20] "Hypertext Markup Language - 2.0." T. Berners-Lee &
D. Connolly. RFC 1866. November 1995. Additional information
available at .
[UPLOAD] "Form-based File Upload in HTML." E. Nebel & L. Masinter. RFC
1867. November 1995.
[TABLES] "HTML Tables." D. Raggett. RFC 1942. May 1996.
[CLIMAPS] "A Proposed Extension to HTML : Client-Side Image Maps."
J. Seidman. RFC 1980. August 1996.
[HTML32] "HTML 3.2 Reference Specification." Dave Raggett. W3C
Recomendation. 14 January 1997. Available at
.
[I18N] "Internationalization of the Hypertext Markup Language." RFC
2070. F. Yergeau, G. Nicol, G. Adams, M. Duerst. January
1997.
[FORMDATA] "Returning Values from Forms: multipart/form-data". RFC
2388. L. Masinter. August 1998.
[HTML40] "HTML 4.0 Specification." Raggett, Le Hors, Jacobs. W3C
Recommendation. 18 Dec 1997. Available at
.
[HTML401] "HTML 4.01 Specification." D. Raggett, A. Le Hors,
I. Jacobs. W3C Proposed Recommendation (work in progress),
August 1999. Available at
.
[XHTML1] "XHTML 1.0: The Extensible HyperText Markup Language: A
Reformulation of HTML 4.0 in XML 1.0." W3C HTML Working
Group. W3C Proposed Recommendation (work in progress). August
1999. Available at .