| Network Working Group | J. Reschke | 
| Internet-Draft | greenbytes | 
| Intended status: Standards Track | July 2, 2014 | 
| Expires: January 3, 2015 | 
This document establishes a convention for use of JSON-encoded field values in HTTP header fields.¶
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.¶
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at http://datatracker.ietf.org/drafts/current/.¶
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as “work in progress”.¶
This Internet-Draft will expire on January 3, 2015.¶
Copyright (c) 2014 IETF Trust and the persons identified as the document authors. All rights reserved.¶
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.¶
Distribution of this document is unlimited. Although this is not a work item of the HTTPbis Working Group, comments should be sent to the Hypertext Transfer Protocol (HTTP) mailing list at ietf-http-wg@w3.org, which may be joined by sending a message with subject "subscribe" to ietf-http-wg-request@w3.org.¶
Discussions of the HTTPbis Working Group are archived at <http://lists.w3.org/Archives/Public/ietf-http-wg/>.¶
XML versions and latest edits for this document are available from <http://greenbytes.de/tech/webdav/#draft-reschke-http-jfv>.¶
Defining syntax for new HTTP header fields ([RFC7230], Section 3.2) is non-trivial. Among the commonly encountered problems are: ¶
(See Section 8.3.1 of [RFC7231] for a summary of considerations for new header fields.)¶
In HTTP, header fields with the same field name can occur multiple times within a single message (Section 3.2.2 of [RFC7230]). When this happens, recipients are allowed to combine the field values using commas as delimiter. This rule matches nicely JSON's array format (Section 5 of [RFC7159]). Thus, the basic data model used here is the JSON array.¶
Header field definitions that need only a single value can restrict themselves to arrays of lenght 1, and are encouraged to define error handling in case more values are received (such as "first wins", "last wins", or "abort with fatal error message").¶
JSON arrays are mapped to field values by creating a sequence of serialized member elements, separated by commas and optionally whitespace. This is equivalent to using the full JSON array format, while leaving out the "begin-array" ('[') and "end-array" (']') delimiters.¶
The ABNF character names and classes below are used (copied from [RFC5234], Appendix B.1):
CR = %x0D ; carriage return HTAB = %x09 ; horizontal tab LF = %x0A ; line feed SP = %x20 ; space VCHAR = %x21-7E ; visible (printing) characters
Characters in JSON strings that are not allowed or discouraged in HTTP header field values — that is, not in the "VCHAR" definition — need to be represented using JSON's "backslash" escaping mechanism ([RFC7159], Section 7).¶
The control characters CR, LF, and HTAB do not appear inside JSON strings, but can be used outside (line breaks, indentation etc). These characters can be either stripped or replaced by space characters (ABNF "SP").¶
[rfc.comment.1: The text below assumes we're starting with a JSON-formatted sequence of characters, not octets; need to clarify.] To map a JSON array to an HTTP header field value, process each array element separately by: ¶
The resulting list of strings is transformed into an HTTP field value by combining them using comma (%x2C) plus optional SP as delimiter, and encoding the resulting string into an octet sequence using the US-ASCII character encoding scheme.¶
To map a set of HTTP header field instances to a JSON array: ¶
The result of the parsing operation is either an error (in which case the header field values needs to be considered invalid), or a JSON array.¶
[rfc.comment.3: Explain what a definition of a new header field needs to do precisely to use this format] ¶
This section shows how some of the existing HTTP header fields would look like if they would use the format defined by this specification.¶
"Content-Length" is defined in Section 3.3.2 of [RFC7230], with the field value's ABNF being:¶
Content-Length = 1*DIGIT
Content-Length is restricted to a single field instance, as it doesn't use the list production (as per Section 3.2.2 of [RFC7230]). However, in practice multiple instances do occur, and the definition of the header field does indeed discuss how to handle these cases.¶
If Content-Length was defined using the JSON format discussed here, the ABNF would be something like:¶
...and the prose definition would: ¶
Content-Disposition field values, defined in [RFC6266], consist of a "disposition type" (a string), plus multiple parameters, of which at least one ("filename") sometime needs to carry non-ASCII characters.¶
Attachment; filename=example.html
has a disposition type of "Attachment", with filename parameter value "example.html". A JSON representation of this information might be:¶
  {
    "Attachment": {
      "filename" : "example.html"
    }
  }
which would translate to a header field value of:¶
  { "Attachment": { "filename" : "example.html" } }
The third example in Section 5 of [RFC6266] uses a filename parameter containing non-US-ASCII characters:¶
attachment; filename*=UTF-8''%e2%82%ac%20rates
Note that in this case, the "filename*" parameter uses the encoding defined in [RFC5987], representing a filename starting with the Unicode character U+20AC (EURO SIGN), followed by " rates". If the definition of Content-Disposition would have used the format proposed here, the workaround involving the "parameter*" syntax would not have been needed at all.¶
The JSON representation of this value could then be:¶
  { "attachment": { "filename" : "\u20AC rates" } }
The WWW-Authenticate is defined in Section 4.1 of [RFC7235] as a list of "challenges":¶
WWW-Authenticate = 1#challenge
...where a challenge consists of a scheme with optional parameters:¶
challenge = auth-scheme [ 1*SP ( token68 / #auth-param ) ]
An example for a complex header field value given in the definition of the header field is:¶
Newauth realm="apps", type=1, title="Login to \"apps\"", Basic realm="simple"
(line break added for readability)
A possible JSON representation of this field value would be the array below:¶
  [
    {
      "Newauth" : {
        "realm": "apps",
        "type" : 1,
        "title" : "Login to \"apps\""
      }
    },
    {
      "Basic" : {
        "realm": "simple"
      }
    } 
  ]
...which would translate to a header field value of:¶
  { "Newauth" : { "realm": "apps", "type" : 1,
                  "title": "Login to \"apps\"" }},
  { "Basic" : { "realm": "simple"}}
This approach uses a default of "JSON array", using implicit array markers. An alternative would be a default of "JSON object". This would simplify the syntax for non-list-typed haeders, but all the benefits of having the same data model for both types of header fields would be gone. A hybrid approach might make sense, as long as it doesn't require any heuristics on the recipient's side.¶
[rfc.comment.4: Use of generic libs vs compactness of field values..] ¶
[rfc.comment.5: Mention that some code might be refused by double quotes not being used for quoted-string.] ¶
[rfc.comment.6: TBD, mention migration path to message format that is robust wrt UTF-8, or other binary encodings of JSON] ¶
[rfc.comment.7: TBD] ¶
[rfc.comment.8: TBD] ¶