The Harmful Consequences of the Robustness Principle

The Harmful Consequences of the Robustness Principle Mozilla

martin.thomson@gmail.com

Jon Postel’s famous statement in RFC 1122 of “Be liberal in what you accept, and conservative in what you send” is a principle that has long guided the design and implementation of Internet protocols. The posture this statement advocates promotes interoperability, but can produce negative effects in the protocol ecosystem in the long term. Those effects can be avoided by properly maintaining protocols.

Of the great many contributions Jon Postel made to the Internet, his remarkable technical achievements are often ignored in favor of the design and implementation philosophy of what is known as the robustness principle: Be strict when sending and tolerant when receiving. Implementations must follow specifications precisely when sending to the network, and tolerate faulty input from the network. When in doubt, discard faulty input silently, without returning an error message unless this is required by the specification. This being the version of the text that appears in IAB RFC 1958 . In comparison, his contributions to the underpinnings of the Internet, which are in many respects more significant, enjoy less conscious recognition. Postel’s robustness principle has been hugely influential in shaping the Internet and the systems that use Internet protocols. Many consider this principle to be instrumental in the success of the Internet as well as the design of interoperable protocols in general. Over time, considerable changes have occurred in both the scale of the Internet and the level of skill and experience available to protocol and software designers. Much of that experience is with protocols that were designed, informed by Postel’s maxim, in the early phases of the Internet. That experience shows that there are negative long-term consequences to interoperability if an implementation applies Postel’s advice. Correcting the problems caused by divergent behavior in implementations can be difficult. This document describes the negative consequences of the application of Postel’s principle to the modern Internet. It recommends against following Postel’s advice, instead recommending that protocols receive continuing maintenance after their initial design and deployment. Active maintenance of protocols reduces or eliminates the opportunities to apply Postel’s guidance. There is good evidence to suggest that protocols are routinely maintained beyond their inception. This document serves primarily as a record of the shortcomings of the robustness principle.

What is often missed in discussions of the robustness principle is the context in which it appears. The earliest form of the principle in the RFC series (in RFC 760 ) is preceded by a sentence that reveals the motivation for the principle: While the goal of this specification is to be explicit about the protocol there is the possibility of differing interpretations. In general, an implementation should be conservative in its sending behavior, and liberal in its receiving behavior. This motivating statement is a frank admission of fallibility and remarkable for it. Here Postel recognizes the possibility that the specification could be imperfect. This is an important statement, but inexplicably absent from the later versions in and . Indeed, an imperfect specification is natural, largely because it was - and remains thus - more important to proceed to implementation and deployment than it is to perfect the protocol specification. A protocol, like software, benefits greatly from experience in deployment, and a deployed protocol is immeasurably more useful than a perfect protocol. As shown demonstrates, success or failure of a protocol depends far more on factors like usefulness than on on technical excellence. Postel’s timely publication of protocol specifications, even with the potential for flaws, likely had a significant effect in the eventual success of the Internet. The problem is therefore not with the premise, but with its conclusion: the robustness principle itself.

Divergent implementations of a specification emerge over time. When variations occur in the interpretation or expression of semantic components, implementations cease to be perfectly interoperable. Implementation bugs are often identified as the cause of variation, though it is often a combination of factors. Application of a protocol to new and unanticipated uses, and ambiguities or errors in the specification are often confounding factors. Situations where two peers disagree on interpretation should be expected over the lifetime of a protocol. Even with the best intentions, the pressure to interoperate can be significant. No implementation can hope to avoid having to trade correctness for interoperability indefinitely. An implementation that reacts to variations in the manner advised by Postel sets up a feedback cycle: Over time, implementations progressively add new code to constrain how data is transmitted, or to permit variations in what is received. Errors in implementations, or confusion about semantics can thereby be masked. These errors can become entrenched, forcing other implementations to be tolerant of those errors. In this way an flaw can become entrenched as a de facto standard. Any implementation of the protocol is required to replicate the aberrant behavior, or it is not interoperable. This is both a consequence of applying Postel’s advice, and a product of a natural reluctance to avoid fatal error conditions. Ensuring interoperability in this environment is often colloquially referred to as aiming to be “bug for bug compatible”. For example, in TLS messages use an extension format with a tag-length-value format. TLS extensions can be added to handshake messages in any order. However, some server implementations terminate connections if they encounter a TLS ClientHello message that ends with an empty extension. Thus, client implementations are required to be aware of this incompatibility and ensure that a ClientHello message ends in a non-empty extension. While TLS highlights the potential for implementation error to cause problems, the original JSON specification demonstrates the effect of specification shortcomings. RFC 4627 omitted critical details on a range of key details including Unicode handling, ordering and duplication of object members, and number encoding. Consequently, a range of interpretations were used by implementations. An updated JSON specification was unable to correct these errors, instead concentrating on defining the interoperable subset of JSON. I-JSON defines a new format that is substantially similar to JSON while prohibiting the problematic variations. However, that prohibition means that I-JSON is not fully interoperable: an I-JSON implementation will fail to accept some valid JSON texts. Consequently, I-JSON is not widely implemented in parsers. Many JSON parsers instead implement the more precise algorithm specified in . It is debatable as to whether decay can be completely avoided, but the robustness principle encourages a reaction that compounds problems.

Once deviations become entrenched, it can be extremely difficult - if not impossible - to rectify the situation. For widely used protocols, the massive scale of the Internet makes large-scale interoperability testing infeasible for all but a privileged few. As the set of changes needed to maintain interoperability grows in size, the cost of building a new implementation increases. This is particularly relevant as the set of tweaks necessary for interoperability can be difficult to learn. Consequently, new implementations can be restricted to niche uses, where the problems arising from interoperability issues can be more closely managed. Restricting new implementations to narrow contexts also risks causing forks in the protocol. If implementations cannot interoperate, the chances of the addition of further incompatible changes is significant. This has a negative impact on the ecosystem of a protocol. New implementations are important in ensuring the continued viability of a protocol. New protocol implementations are also more likely to be developed for new and diverse use cases and often are the origin of features and capabilities that can be of benefit to existing users. The need to work around interoperability problems also reduce the ability of established implementations to change. For instance, an accumulation of mitigations for interoperability issues makes implementations more difficult to maintain.

The robustness principle is best suited to safeguarding against flaws in a specification that is intended to remain unchanged for an extended period of time. Indeed, in the face of divergent interpretations of an immutable specification, the only hope for an implementation to remain interoperable is to be tolerant of differences in interpretation and occasional outright implementation errors. From this perspective, application of Postel’s advice to the implementation of a protocol specification that does not change is logical, even necessary. But that suggests that the problem is with the presumption of immutability of specifications. Active maintenance of a protocol can ensure that specifications remain accurate and that new implementations are possible. A maintained protocol is not a static construct, it improves and changes as it is used. Protocol designers are strongly encouraged to continue to maintain and evolve protocols beyond their initial inception and definition. Maintenance is needed in response to the discovery of errors in specification that might cause interoperability issues. Maintenance is also critical for ensuring that the protocol is viable for application to use cases that might not have been envisaged during its original design. New use cases are an indicator that the protocol could be successful . Maintenance does not demand the development of new versions of protocols or protocol specifications. For instance, RFC 793 remains the canonical TCP reference, but that document alone is no longer sufficient to implement the protocol. TCP is the subject of a very large number of update and extension RFCs that together document the deployed protocol. Good extensibility can make it easier to respond to new use cases or changes in the environment in which the protocol is deployed. The process of maintenance ideally begins even before the specification for a protocol is complete. Neglect can quickly produce the negative consequences this document describes. Restoring the protocol to a state where it can be maintained involves first discovering the properties of the protocol as it is deployed, rather than the protocol as it was originally documented. This can be difficult and time-consuming, particularly if the protocol has a diverse set of implementations. Such a process was undertaken for HTTP after a period of minimal maintenance. Restoring the specification to currency took significant effort over more than 6 years.

Protocol maintenance is only possible if there is sufficient information about the deployment of the protocol. Feedback from deployment is critical to effective protocol maintenance. For a protocol specification, the primary and most effective form of feedback comes from people who implement or deploy the protocol. An active community of protocol implementers and users is the most valuable source of feedback. It is this community that implements and deploys changes, in addition to contributing to the ongoing maintenance of protocol specifications.

Protocol implementations should include automated error reporting mechanisms. Exposing faults through operations and management systems is highly valuable, but it might be necessary to ensure that the information is propagated further. Building telemetry and error logging systems that report faults to the developers of the implementation is superior in many respects. However, this is only possible in deployments that are conducive to the collection of this type of information. Giving consideration to protection of the privacy of protocol participants is critical prior to deploying any such system.

Favoring strict error handling over attempting error recovery is an effective technique for ensuring that faults receive attention. A fatal errors provide excellent motivation to address problems. However, generating fatal errors is only feasible if such errors are sufficiently rare. Frequent problems can result in users to ignoring them or finding workarounds, whereas rare events demand greater attention. Fatal errors or crashes are also preferable if there is any risk that the error might represent an implementation or security issue. Exposing errors is particularly important for early implementations of a protocol. Enabling stricter error handling helps validate the design of both protocol and implementations. If strict checks are retained when implementations are more widely deployed, those checks better enable the detection and correction of errors in new implementations. This doesn’t mean that protocol implementations need to treat all inputs equally strictly. A protocol could be designed with the explicit goal of accepting a wide range of inputs (see for example ). As long as proper reactions to inputs are clearly and unambiguously specified, these considerations don’t apply.

Implementations and deployments are critical to the ongoing maintenance of a protocol. Implementations have ample motivation to prefer stability and interoperability over maintenance or correctness. It is likely that - over time - an implementation will accumulate allowances for errors of specification or implementation. Without care, this can lead to suppression of feedback about problems. Even when an implementation chooses to attempt to adapt to an abnormal condition, communicating that decision to the community is valuable. At a minimum, it allows others to benefit from knowledge of the problem. Discussion about problems might also lead to alternative strategies or protocol enhancements that can avoid further problems. Over time, mitigations for interoperability issues could become redundant. Occasional review of any special interoperability measures might reveal opportunities for an implementation to switch to stricter handling of exceptional conditions. Removing special allowances in favor of stricter error reporting restores feedback measures that new implementations can then benefit from. Managing and deploying changes can be expensive. However, it is widely recognized that maintenance is a critical part of the deployment of computer systems for security reasons . Managing updates for interoperability problems represents a small additional cost in exchange for ensuring the ability to interoperate with other users of the network.

Sloppy implementations, lax interpretations of specifications, and uncoordinated extrapolation of requirements to cover gaps in specification can result in security problems. Hiding the consequences of protocol variations encourages the hiding of issues, which can conceal bugs and make them difficult to discover. Designers and implementers of security protocols generally understand these concerns. However, general-purpose protocols are not exempt from careful consideration of security issues. Furthermore, because general-purpose protocols tend to deal with flaws or obsolescence in a less urgent fashion than security protocols, there can be fewer opportunities to correct problems in protocols that develop interoperability problems.

This document has no IANA actions.

ECMAScript(R) 2017 Language Specification HTML Architectural Principles of the Internet The Internet and its architecture have grown in evolutionary fashion from modest beginnings, rather than from a Grand Plan. While this process of evolution is one of the main reasons for the technology's success, it nevertheless seems useful to record a snapshot of the current principles of the Internet architecture. This is intended for general guidance and general interest, and is in no way intended to be a formal or invariant reference model. This memo provides information for the Internet community. This memo does not specify an Internet standard of any kind. DoD standard Internet Protocol Requirements for Internet Hosts - Communication Layers This RFC is an official specification for the Internet community. It incorporates by reference, amends, corrects, and supplements the primary protocol standards documents relating to hosts. [STANDARDS-TRACK] What Makes for a Successful Protocol? The Internet community has specified a large number of protocols to date, and these protocols have achieved varying degrees of success. Based on case studies, this document attempts to ascertain factors that contribute to or hinder a protocol's success. It is hoped that these observations can serve as guidance for future protocol work. This memo provides information for the Internet community. The Transport Layer Security (TLS) Protocol Version 1.2 This document specifies Version 1.2 of the Transport Layer Security (TLS) protocol. The TLS protocol provides communications security over the Internet. The protocol allows client/server applications to communicate in a way that is designed to prevent eavesdropping, tampering, or message forgery. [STANDARDS-TRACK] The application/json Media Type for JavaScript Object Notation (JSON) JavaScript Object Notation (JSON) is a lightweight, text-based, language-independent data interchange format. It was derived from the ECMAScript Programming Language Standard. JSON defines a small set of formatting rules for the portable representation of structured data. This memo provides information for the Internet community. The JavaScript Object Notation (JSON) Data Interchange Format JavaScript Object Notation (JSON) is a lightweight, text-based, language-independent data interchange format. It was derived from the ECMAScript Programming Language Standard. JSON defines a small set of formatting rules for the portable representation of structured data.This document removes inconsistencies with other specifications of JSON, repairs specification errors, and offers experience-based interoperability guidance. The I-JSON Message Format I-JSON (short for "Internet JSON") is a restricted profile of JSON designed to maximize interoperability and increase confidence that software can process it successfully with predictable results. Transmission Control Protocol Design Considerations for Protocol Extensions This document discusses architectural issues related to the extensibility of Internet protocols, with a focus on design considerations. It is intended to assist designers of both base protocols and extensions. Case studies are included. A companion document, RFC 4775 (BCP 125), discusses procedures relating to the extensibility of IETF protocols. This document is not an Internet Standards Track specification; it is published for informational purposes. Hypertext Transfer Protocol (HTTP/1.1): Message Syntax and Routing The Hypertext Transfer Protocol (HTTP) is a stateless application-level protocol for distributed, collaborative, hypertext information systems. This document provides an overview of HTTP architecture and its associated terminology, defines the "http" and "https" Uniform Resource Identifier (URI) schemes, defines the HTTP/1.1 message syntax and parsing requirements, and describes related security concerns for implementations. Report from the Internet of Things Software Update (IoTSU) Workshop 2016 This document provides a summary of the Internet of Things Software Update (IoTSU) Workshop that took place at Trinity College Dublin, Ireland on the 13th and 14th of June, 2016. The main goal of the workshop was to foster a discussion on requirements, challenges, and solutions for bringing software and firmware updates to IoT devices. This report summarizes the discussions and lists recommendations to the standards community.Note that this document is a report on the proceedings of the workshop. The views and positions documented in this report are those of the workshop participants and do not necessarily reflect IAB views and positions.

Constructive feedback on this document has been provided by a surprising number of people including Mark Nottingham, Brian Trammell, and Anne Van Kesteren. Please excuse any omission.