httpbis: Ticket #101: Definition of validator weakness

Link: http://trac.tools.ietf.org/wg/httpbis/trac/ticket/101

Origin: http://www.w3.org/mid/47767C72.6030507@onlinehome.de

Component: p4-conditional

From 13.3.3 Weak and Strong Validators:

Entity tags are normally "strong validators," but the protocol provides a mechanism to tag an entity tag as "weak." One can think of a strong validator as one that changes whenever the bits of an entity changes, while a weak value changes whenever the meaning of an entity changes. Alternatively, one can think of a strong validator as part of an identifier for a specific entity, while a weak validator is part of an identifier for a set of semantically equivalent entities.

Note: One example of a strong validator is an integer that is incremented in stable storage every time an entity is changed.

An entity's modification time, if represented with one-second resolution, could be a weak validator, since it is possible that the resource might be modified twice during a single second.

While in paragraph 1 "weak validator" is defined in terms of semantic equivalence, paragraph 3 qualifies modification time as "weak validator". But the second modification of a file within the same second may change the file into anything. There is no means to guarantee semantic equivalence in this case. Both this paragraphs are mutual exclusive.

The reason for this is the abstraction "weak validator" itself. While "validator" is a good abstraction from the details of Last-Modified and Etag, and also "strong validator" is quite clear, this can't work for "weak".

"weak validator" tries do build a common abstraction from two different, completely unrelated kinds of "weakness".

Weak etags: the weakness is not to guarantee byte-equivalence, but they guarantee semantic equivalence. Of course, the server needs some concept of semantic equivalence build in, to use weak etags. (Oh, and it would be fine, if the client would have the same idea about semantics.)

Last-Modified date: the weakness is the limited time resolution. It is *unreliable* (or not a validator at all), unless it meets some extra conditions. There is no concept of semantic equivalence whatsoever.

On consequence are the strange restrictions on "weak validators". Clients must only use them in conditional (full body) GET requests. This is reasonable for Last-Modified (if it does not meet the additional restrictions), but not at all justified for weak etags.

The only reasonable restriction on weak etags is not to use them in range requests. But a PUT with If-Match: W/"xxx" is perfectly ok.

I suggest to remove the term "weak validator" from the spec. Validator is either a Last-Modified Date or an Etag. Etags can be strong or weak. I should be made clear, that weak etags ore only meant to validate semantic equivalence and it should be clear, that everything said about semantic equivalence is related to weak etags.

Practical issue: Apache misuses weak etags when it can not create a strong one, due to the limited time resolution (and mtime is the main component of Apache's etags). This etags will *never* match. (IIS seems to do something similar.) Although I'm sure, this is not what weak etags are intended for, one could use the inconsistent definition in the spec to justify this (one has to be either a lawyer or a programmer to do so).

I don't know, if there is any application, that uses weak etags as they are intended (for validating semantic equivalence). But if there is, or will be, the above misuse will most likely create interoperability problems. WebDAV-clients (e.g. davfs2) already have problems to work around this wrong "weak etags".

Mails

Mails by Sender (Top 10) Mails by Month

NEW ISSUE: weak validator: definition inconsistent Werner Baumann (werner.baumann@onlinehome.de) (2007-12-29)
RE: NEW ISSUE: weak validator: definition inconsistent Larry Masinter (LMM@acm.org) (2008-01-02)
- Re: NEW ISSUE: weak validator: definition inconsistent Werner Baumann (werner.baumann@onlinehome.de) (2008-01-03)
- Re: NEW ISSUE: weak validator: definition inconsistent Lisa Dusseault (lisa@osafoundation.org) (2008-01-07)
  - Re: NEW ISSUE: weak validator: definition inconsistent Werner Baumann (werner.baumann@onlinehome.de) (2008-01-11)
  - Re: NEW ISSUE: weak validator: definition inconsistent Robert Siemer (Robert.Siemer-httpwg@backsla.sh) (2008-01-12)
Re: NEW ISSUE: weak validator: definition inconsistent Larry Masinter (LMM@acm.org) (2008-01-13)
- Re: NEW ISSUE: weak validator: definition inconsistent Robert Siemer (Robert.Siemer-httpwg@backsla.sh) (2008-01-14)
Re: NEW ISSUE: weak validator: definition inconsistent #101 Mark Nottingham (mnot@mnot.net) (2008-02-28)
PROPOSAL: Weak Validator definition [i101] Mark Nottingham (mnot@mnot.net) (2008-03-11)
- Re: PROPOSAL: Weak Validator definition [i101] Henrik Nordstrom (henrik@henriknordstrom.net) (2008-03-13)
  - Re: PROPOSAL: Weak Validator definition [i101] Werner Baumann (werner.baumann@onlinehome.de) (2008-03-14)
    - Re: PROPOSAL: Weak Validator definition [i101] Henrik Nordstrom (henrik@henriknordstrom.net) (2008-03-14)
      - Re: PROPOSAL: Weak Validator definition [i101] Werner Baumann (werner.baumann@onlinehome.de) (2008-03-15)
        
        Re: PROPOSAL: Weak Validator definition [i101] Henrik Nordstrom (henrik@henriknordstrom.net) (2008-03-15)
        
        Re: PROPOSAL: Weak Validator definition [i101] Werner Baumann (werner.baumann@onlinehome.de) (2008-03-15)
        
        Re: PROPOSAL: Weak Validator definition [i101] Henrik Nordstrom (henrik@henriknordstrom.net) (2008-03-18)
        
        Re: PROPOSAL: Weak Validator definition [i101] Robert Siemer (Robert.Siemer-httpwg@backsla.sh) (2008-03-16)
        
        Re: PROPOSAL: Weak Validator definition [i101] Werner Baumann (werner.baumann@onlinehome.de) (2008-03-16)
        
        Re: PROPOSAL: Weak Validator definition [i101] Robert Siemer (Robert.Siemer-httpwg@backsla.sh) (2008-03-17)
        
        Re: PROPOSAL: Weak Validator definition [i101] Mark Nottingham (mnot@mnot.net) (2008-03-17)
        
        Re: PROPOSAL: Weak Validator definition [i101] Lisa Dusseault (lisa@osafoundation.org) (2008-03-17)
        
        Re: PROPOSAL: Weak Validator definition [i101] Mark Nottingham (mnot@mnot.net) (2008-03-17)
        
        Re: PROPOSAL: Weak Validator definition [i101] Robert Siemer (Robert.Siemer-httpwg@backsla.sh) (2008-03-17)
        
        Re: PROPOSAL: Weak Validator definition [i101] Mark Nottingham (mnot@mnot.net) (2008-03-18)
        
        Re: PROPOSAL: Weak Validator definition [i101] Robert Siemer (Robert.Siemer-httpwg@backsla.sh) (2008-03-18)
        
        Re: PROPOSAL: Weak Validator definition [i101] Henrik Nordstrom (henrik@henriknordstrom.net) (2008-03-18)
        
        Re: PROPOSAL: Weak Validator definition [i101] Henrik Nordstrom (henrik@henriknordstrom.net) (2008-03-18)
        
        Re: PROPOSAL: Weak Validator definition [i101] Roy T. Fielding (fielding@gbiv.com) (2008-03-18)
        
        Re: PROPOSAL: Weak Validator definition [i101] Henrik Nordstrom (henrik@henriknordstrom.net) (2008-03-18)
        
        Re: PROPOSAL: Weak Validator definition [i101] Henrik Nordstrom (henrik@henriknordstrom.net) (2008-03-18)
        
        Re: PROPOSAL: Weak Validator definition [i101] Robert Siemer (Robert.Siemer-httpwg@backsla.sh) (2008-03-18)
        
        Re: PROPOSAL: Weak Validator definition [i101] Henrik Nordstrom (henrik@henriknordstrom.net) (2008-03-18)
- Re: PROPOSAL: Weak Validator definition [i101] Mark Nottingham (mnot@mnot.net) (2008-03-25)
RE: ETags and concurrency control Brian Smith (brian@briansmith.org) (2008-04-28)
- Re: ETags and concurrency control #101 Robert Siemer (Robert.Siemer-httpwg@backsla.sh) (2008-04-28)
  - Re: ETags and concurrency control Henrik Nordstrom (henrik@henriknordstrom.net) (2008-04-28)
    - RE: ETags and concurrency control Brian Smith (brian@briansmith.org) (2008-05-01)
      - Re: ETags and concurrency control Adrien de Croy (adrien@qbik.com) (2008-05-01)
      - RE: ETags and concurrency control Henrik Nordstrom (henrik@henriknordstrom.net) (2008-05-02)
    - Re: ETags and concurrency control Mark Baker (distobj@acm.org) (2008-05-02)
      - Re: ETags and concurrency control Julian Reschke (julian.reschke@gmx.de) (2008-05-02)
        
        RE: ETags and concurrency control Pablo Castro (Pablo.Castro@microsoft.com) (2008-05-02)
        
        Re: ETags and concurrency control #101 Julian Reschke (julian.reschke@gmx.de) (2008-05-02)
        
        [#116] ETags and concurrency control #116 , #101 Mark Nottingham (mnot@mnot.net) (2008-05-09)
        
        Re: [#116] ETags and concurrency control #116 Julian Reschke (julian.reschke@gmx.de) (2008-05-09)
        
        Re: [#116] ETags and concurrency control Julian Reschke (julian.reschke@gmx.de) (2008-05-09)
        
        Re: [#116] ETags and concurrency control Henrik Nordstrom (henrik@henriknordstrom.net) (2008-05-11)
        
        Re: [#116] ETags and concurrency control Julian Reschke (julian.reschke@gmx.de) (2008-05-14)
        
        Re: [#116] ETags and concurrency control [245] Julian Reschke (julian.reschke@gmx.de) (2008-05-15)
        
        RE: [#116] ETags and concurrency control [245] Pablo Castro (Pablo.Castro@microsoft.com) (2008-05-18)
RE: ETags and concurrency control Brian Smith (brian@briansmith.org) (2008-05-01)
Re: ETags and concurrency control Werner Baumann (werner.baumann@onlinehome.de) (2008-05-02)
- Re: ETags and concurrency control Henrik Nordstrom (henrik@henriknordstrom.net) (2008-05-02)
  - Re: ETags and concurrency control Jamie Lokier (jamie@shareable.org) (2008-05-02)
- Re: ETags and concurrency control Julian Reschke (julian.reschke@gmx.de) (2008-05-02)
  - Re: ETags and concurrency control Werner Baumann (werner.baumann@onlinehome.de) (2008-05-02)
    - Re: ETags and concurrency control Jamie Lokier (jamie@shareable.org) (2008-05-02)
    - Re: ETags and concurrency control Mark Nottingham (mnot@mnot.net) (2008-05-08)
Re: PROPOSAL: Weak Validator definition [i101] #101 Julian Reschke (julian.reschke@gmx.de) (2008-07-29)
- Re: PROPOSAL: Weak Validator definition [i101] #101 Julian Reschke (julian.reschke@gmx.de) (2008-07-29)
  - RE: PROPOSAL: Weak Validator definition [i101] #101 Brian Smith (brian@briansmith.org) (2008-07-29)
    - Re: PROPOSAL: Weak Validator definition [i101] Julian Reschke (julian.reschke@gmx.de) (2008-07-29)
      - Issue 124: "entity value" terminology, was: PROPOSAL: Weak Validator definition [i101] #124 Julian Reschke (julian.reschke@gmx.de) (2008-07-29)
      - RE: PROPOSAL: Weak Validator definition [i101] Brian Smith (brian@briansmith.org) (2008-07-29)

History

: comment added (Thu, 28 Feb 2008 00:18:43 GMT)

remove the notion of "semantic equivalence" and replace it with "good enough, from the server's point of view". That is, a server is free to report a "match" on a weak validator if the server thinks an entity previously served with that validator is "good enough", from the server's point of view. Whether that's semantically equivalent doesn't need to come into the picture, except as an example of one reason why, even if something has changed, you might be content to let the client use old content.

-- http://www.w3.org/mid/002001c84d84$13e4c240$3bae46c0$@org

: comment added (Tue, 11 Mar 2008 05:52:41 GMT)

Proposal:

Remove the notion of "semantic equivalence" and replace it with (roughly) "good enough, from the server's point of view".

: comment added; milestone changed (Wed, 26 Mar 2008 00:23:51 GMT)

milestone changed from unassigned to 03.

Proposal accepted.

: comment added; milestone changed (Thu, 17 Jul 2008 14:23:29 GMT)

milestone changed from 03 to 04.

: comment added (Mon, 28 Jul 2008 14:07:03 GMT)

Suggestion:

P4, section 3:

A "weak entity tag," indicated by the "W/" prefix, MAY be shared by

two entities of a resource only if the entities are equivalent and could be substituted for each other with no significant change in semantics.

to:

... MAY be shared by two representations of a resource only if the origin server considers them to be semantically equivalent.

Section 5:

However, there might be cases when a server prefers to change the validator only on semantically significant changes, and not when insignificant aspects of the entity change. A validator that does not always change when the resource changes is a "weak validator."

Entity tags are normally "strong validators," but the protocol provides a mechanism to tag an entity tag as "weak." One can think of a strong validator as one that changes whenever the bits of an entity changes, while a weak value changes whenever the meaning of an entity changes. Alternatively, one can think of a strong validator as part of an identifier for a specific entity, while a weak validator is part of an identifier for a set of semantically equivalent entities.

Section 6:

In order to be legal, a strong entity tag MUST change whenever the associated entity value changes in any way. A weak entity tag SHOULD change whenever the associated entity changes in a semantically significant way.

to:

... A weak entity tag SHOULD change whenever the associated entity changes in a way that the server determines is semantically significant.

: comment added; owner set (Mon, 28 Jul 2008 14:09:58 GMT)

owner set to julian.reschke@gmx.de

: comment added; attachment set (Tue, 29 Jul 2008 18:13:07 GMT)

attachment set to i101.diff

Proposed change for part 4.

: comment added (Tue, 29 Jul 2008 23:14:08 GMT)

I don't see how that addresses the issue of "semantically equivalent" being an untestable quality. What it should say is

A weak entity tag SHOULD change whenever the origin server considers prior representations to be unacceptable as a substitute for the current representation. In other words, an entity tag SHOULD change whenever the origin server wants caches to invalidate old responses.

Note that this same pattern needs to be used in other cache descriptions, because 2616 mistakenly defines HTTP in terms of implementation rather than in terms of what the interface describes to the message recipient.

: comment added; milestone changed (Fri, 29 Aug 2008 08:44:06 GMT)

milestone changed from 04 to unassigned.

Related Information

Issues List Index