<?xml version='1.0' encoding='utf-8'?>

<!DOCTYPE rfc SYSTEM "rfc2629-xhtml.ent">

<rfc xmlns:xi="http://www.w3.org/2001/XInclude" number="8673" consensus="true"
     submissionType="IETF" category="exp" ipr="trust200902"
     docName="draft-ietf-httpbis-rand-access-live-04" tocInclude="true"
     symRefs="true" sortRefs="true" version="3" xml:lang="en">

  <!-- xml2rfc v2v3 conversion 2.28.0 -->

  <!-- ***** FRONT MATTER ***** -->
  <front>
    <title>HTTP Random Access and Live Content</title>

  <seriesInfo name="RFC" value="8673" />

    <author fullname="Craig Pratt" initials="C." surname="Pratt">
      <address>
        <postal>
          <street/>
          <city>Portland</city>
          <region>OR</region>
          <code>97229</code>
          <country>United States of America</country>
        </postal>
        <email>pratt@acm.org</email>
      </address>
    </author>
    <author fullname="Darshak Thakore" initials="D." surname="Thakore">
      <organization abbrev="CableLabs">CableLabs</organization>
      <address>
        <postal>
          <street>858 Coal Creek Circle</street>
          <city>Louisville</city>
          <region>CO</region>
          <code>80027</code>
          <country>United States of America</country>
        </postal>
        <email>d.thakore@cablelabs.com</email>
      </address>
    </author>
    <author fullname="Barbara Stark" initials="B." surname="Stark">
      <organization>AT&amp;T</organization>
      <address>
        <postal>
          <street/>
          <city>Atlanta</city>
          <region>GA</region>
          <country>United States of America</country>
        </postal>
        <email>barbara.stark@att.com</email>
      </address>
    </author>
    <date year="2019" month="November"/>

    <area>Applications and Real-Time</area>

    <workgroup>HTTP</workgroup>

    <keyword>http</keyword>
    <keyword>range</keyword> 
    <keyword>live</keyword> 
    <keyword>aggregation</keyword> 

    <abstract>
      <t>
            To accommodate byte-range requests for content that has
            data appended over time, this document defines semantics
            that allow an HTTP client and a server to perform byte-range
      GET and HEAD requests that start at an arbitrary byte offset
      within the representation and end at an indeterminate offset.
      </t>
    </abstract>
  </front>
  <middle>


    <section anchor="introduction">
      <name>Introduction</name>
      <t>
      Some Hypertext Transfer Protocol (HTTP) clients use byte-range requests
      (range requests using the "bytes" range unit) to transfer select
      portions of large representations <xref target="RFC7233"/>. In some
      cases, large representations require content to be continuously or
      periodically appended, such as representations consisting of live audio 
      or video sources, blockchain databases, and log files. Clients cannot 
      access the appended/live content using a range request with the "bytes"
      range unit using the currently defined byte-range semantics without 
      accepting performance or behavior sacrifices that are not acceptable for 
      many applications.
      </t>
      <t>
	For instance, HTTP clients have the ability to access appended content 
	on an indeterminate-length resource by transferring the entire 
	representation from the beginning and continuing to read the appended 
	content as it's made available. Obviously, this is highly inefficient 
	for cases where the representation is large and only the most recently 
	appended content is needed by the client.
      </t>
      <t>		
	Alternatively, clients can access appended
	content by sending periodic, open-ended byte-range
	requests using the last known end byte position as the
	range start. Performing low-frequency periodic
	byte-range requests in this fashion (polling) introduces
	latency since the client will necessarily be somewhat
	behind in transferring the aggregated content, effectively 
	resulting in the same kind of latency issues with the segmented content 
	transfer mechanisms in "HTTP Live Streaming" (HLS) <xref
	target="RFC8216"/> and "Dynamic Adaptive Streaming
	over HTTP" <xref target="MPEG-DASH"/>.
	While performing 
        these range requests at higher frequency can reduce this latency, 
        it also incurs more processing overhead and HTTP exchanges as 
        many of the requests will return no content, since content is 
	usually aggregated in groups of bytes (e.g., a video frame, audio 
	sample, block, or log entry).
      </t>
      <t>
           This document describes a usage model for range requests that enables
           efficient retrieval of representations that are appended to over time
           by using large values and associated semantics for communicating
           range end positions. This model allows representations to be
           progressively delivered by servers as new content is added. It also
           ensures compatibility with servers and intermediaries that don't
           support this technique.
      </t>


      <section>
        <name>Notational Conventions</name>

        <t>This document cites Augmented Backus-Naur Form (ABNF) productions
            from <xref target="RFC7233"/>, using the notation defined in <xref target="RFC5234"/>.
        </t>
      </section>
    </section>
    <section anchor="definition">
      <name>Performing Range Requests on Random-Access Aggregating (Live) Content</name>
      <t>
            This document recommends a two-step process for accessing resources
            that have indeterminate-length representations.
      </t>
      <t>
            Two steps are necessary because of limitations with the range request
            header fields and the Content-Range response header fields. A server cannot
            know from a range request that a client wishes to receive a response
            that does not have a definite end. More critically, the header fields
            do not allow the server to signal that a resource has indeterminate
            length without also providing a fixed portion of the resource.
      </t>
      <t>
            A client first learns that the resource has a representation of
            indeterminate length by requesting a range of the resource. The server
            responds with the range that is available but indicates that the
            length of the representation is unknown using the existing
            Content-Range syntax. See <xref target="establishing-range"/>
            for details and examples.
      </t>
      <t>
            Once the client knows the resource has indeterminate length, it can
            request a range with a very large end position from the resource. The
            client chooses an explicit end value larger than can be transferred in
            the foreseeable term. A server that supports range requests of
            indeterminate length signals its understanding of the client's
            indeterminate range request by indicating that the range it is
            providing has a range end that exactly matches the client's requested
            range end rather than a range that is bounded by what is currently
            available. See <xref target="live-range-requests"/> for details.
      </t>
      <section anchor="establishing-range">
       <name>Establishing the Randomly Accessible Byte Range</name>

        <t>
        Determining if a representation is continuously aggregating ("live")
	and determining the randomly accessible byte range can both be
	performed using the existing definition for an open-ended byte-range
	request. Specifically, <xref target="RFC7233"
	sectionFormat="of" section="2.1"/> defines a byte-range request of the form:
        </t>

   <sourcecode type="abnf"><![CDATA[
   byte-range-spec = first-byte-pos "-" [ last-byte-pos ]
]]></sourcecode >
        <t>
      which allows a client to send a HEAD request with a first-byte-pos and 
      leave last-byte-pos absent. A server that receives a satisfiable
      byte-range request (with first-byte-pos smaller than the current
      representation length) may respond with a 206 status code (Partial
      Content) with a Content-Range
      header field indicating the currently satisfiable byte range. For example:
        </t>
        <artwork type="message/http; msgtype=&quot;request&quot;"><![CDATA[
HEAD /resource HTTP/1.1
Host: example.com
Range: bytes=0-
]]></artwork>
        <t>
      returns a response of the form:
        </t>
        <artwork type="message/http; msgtype=&quot;response&quot;"><![CDATA[
HTTP/1.1 206 Partial Content
Content-Range: bytes 0-1234567/*
]]></artwork>
        <t>
      from the server indicating that (1) the complete representation length
      is unknown (via the "*" in place of the complete-length field) and (2)
      only bytes 0-1234567 were accessible at the time the request was
      processed by the server. The client can infer from this response that
      bytes 0-1234567 of the representation can be requested and transfer
      can be performed immediately.

        </t>
      </section>
      <section anchor="live-range-requests">
        <name>Byte-Range Requests beyond the Randomly Accessible Byte Range</name>
        <t>
        Once a client has determined that a representation has an
	indeterminate length and established the byte range that can be
	accessed, it may want to perform a request with a start position
	within the randomly accessible content range and an end position 
	at an indefinite/live point -- a point where the byte-range GET 
	request is fulfilled on-demand as the content is aggregated.
        </t>
        <t>
        For example, for a large video asset, a client may wish to start a 
	content transfer from the video "key" frame immediately before the 
	point of aggregation and continue the content transfer indefinitely 
	as content is aggregated, in order to support low-latency startup 
	of a live video stream.
        </t>

        <t>
        Unlike a byte-range request header field, a byte-content-range response
        header field cannot be "open-ended", per <xref
	target="RFC7233" sectionFormat="of" section="4.2"/>:
        </t>
        <sourcecode type="abnf"><![CDATA[
   byte-content-range  = bytes-unit SP
                        ( byte-range-resp / unsatisfied-range )

   byte-range-resp     = byte-range "/" ( complete-length / "*" )
   byte-range          = first-byte-pos "-" last-byte-pos
   unsatisfied-range   = "*/" complete-length

   complete-length     = 1*DIGIT
]]></sourcecode>
        <t>
        Specifically, last-byte-pos is required in byte-range. So, in order 
	to preserve interoperability with existing HTTP clients, servers, 
	proxies, and caches, this document proposes a mechanism for a client 
	to indicate support for handling an indeterminate-length byte-range 
	response and a mechanism for a server to indicate if/when it's 
	providing an indeterminate-length response.
        </t>
        <t>
        A client can indicate support for handling indeterminate-length 
	byte-range responses by providing a very large value for the 
	last-byte-pos in the byte-range request. For example, a client can 
	perform a byte-range GET request of the form:
        </t>
        <artwork type="message/http; msgtype=&quot;request&quot;"><![CDATA[
GET /resource HTTP/1.1
Host: example.com
Range: bytes=1230000-999999999999
]]></artwork>
        <t>
        where the last-byte-pos in the request is much larger than the 
	last-byte-pos returned in response to an open-ended byte-range 
	HEAD request, as described above, and much larger than the expected 
	maximum size of the representation. See <xref target="Security"/> for 
	range value considerations.
        </t>
        <t>
        In response, a server may indicate that it is supplying a continuously 
	aggregating/live response by supplying the client request's 
	last-byte-pos in the Content-Range response header field.
        </t>
        <t>
        For example:
        </t>
        <artwork type="message/http; msgtype=&quot;request&quot;"><![CDATA[
GET /resource HTTP/1.1
Host: example.com
Range: bytes=1230000-999999999999
]]></artwork>
        <t>
        returns
        </t>
        <artwork type="message/http; msgtype=&quot;response&quot;"><![CDATA[
HTTP/1.1 206 Partial Content
Content-Range: bytes 1230000-999999999999/*
]]></artwork>
        <t>
        from the server to indicate that the response will start at byte
	1230000 and continue indefinitely to include all aggregated content, as it becomes available.
        </t>
        <t>
        A server that doesn't support or supply a continuously aggregating/live 
	response will supply the currently satisfiable byte range, 
	as it would with an open-ended byte request.
        </t>
        <t>
        For example:
        </t>
        <artwork type="message/http; msgtype=&quot;request&quot;"><![CDATA[
GET /resource HTTP/1.1
Host: example.com
Range: bytes=1230000-999999999999
]]></artwork>
        <t>
        returns
        </t>
        <artwork type="message/http; msgtype=&quot;response&quot;"><![CDATA[
HTTP/1.1 206 Partial Content
Content-Range: bytes 1230000-1234567/*
]]></artwork>
        <t>
        from the server to indicate that the response will start at byte
	1230000, end at byte 1234567, and not include any aggregated content. 
	This is the response expected from a typical HTTP server -- one that 
	doesn't support byte-range requests on aggregating content.
        </t>
        <t>
        A client that doesn't receive a response indicating it is continuously 
	aggregating must use other means to access aggregated content (e.g., 
	periodic byte-range polling).
        </t>
        <t>
        A server that does return a continuously aggregating/live response
	should return data using chunked transfer coding and not provide a
	Content-Length header field. A 0-length chunk indicates the end of the
	transfer, per <xref target="RFC7230" sectionFormat="of"
	section="4.1"/>.
        </t>
      </section>
    </section>
    <section anchor="other-applications">
      <name>Other Applications of Random-Access Aggregating Content</name>
      <section anchor="starting-at-live">
        <name>Requests Starting at the Aggregation/Live Point</name>
        <t>
          A client that wishes to only receive newly aggregated portions of a
          resource (i.e., start at the live point) can use a HEAD request to
          learn what range the server has currently available and initiate an
          indeterminate-length transfer. For example:
        </t>
        <artwork type="message/http; msgtype=&quot;request&quot;"><![CDATA[
HEAD /resource HTTP/1.1
Host: example.com
Range: bytes=0-
]]></artwork>
        <t>
            with the Content-Range response header field indicating the range
	    (or ranges) available. For example:
        </t>
        <artwork type="message/http; msgtype=&quot;response&quot;"><![CDATA[
206 Partial Content
Content-Range: bytes 0-1234567/*
]]></artwork>


        <t>
            The client can then issue a request for a range starting at the end
            value (using a very large value for the end of a range) and receive
            only new content.
        </t>
        <t>
          For example:
        </t>
        <artwork type="message/http; msgtype=&quot;request&quot;"><![CDATA[
GET /resource HTTP/1.1
Host: example.com
Range: bytes=1234567-999999999999
]]></artwork>
        <t>
          with a server returning a Content-Range response indicating that an
	  indeterminate-length response body will be provided:
        </t>
        <artwork type="message/http; msgtype=&quot;response&quot;"><![CDATA[
206 Partial Content
Content-Range: bytes 1234567-999999999999/*
]]></artwork>
      </section>

      <section anchor="shift-buffers">
        <name>Shift-Buffer Representations</name>
        <t>
        Some representations lend themselves to front-end content removal in
	addition to aggregation. While still supporting random access,
	representations of this type have a portion at the beginning (the "0"
	end) of the randomly accessible region that becomes inaccessible over
	time. Examples of this kind of representation would be an audio-video 
	time-shift buffer or a rolling log file.
        </t>
        <t>
        For example, a range request containing:
        </t>
        <artwork type="message/http; msgtype=&quot;request&quot;"><![CDATA[
HEAD /resource HTTP/1.1
Host: example.com
Range: bytes=0-
]]></artwork>
        <t>
        returns
        </t>
        <artwork type="message/http; msgtype=&quot;response&quot;"><![CDATA[
206 Partial Content
Content-Range: bytes 1000000-1234567/*
]]></artwork>
        <t>
        indicating that the first 1000000 bytes were not accessible at the
	time the HEAD request was processed. Subsequent HEAD requests could return:
        </t>


        <artwork type="example"><![CDATA[
Content-Range: bytes 1000000-1234567/*
]]></artwork>
        <artwork type="example"><![CDATA[
Content-Range: bytes 1010000-1244567/*
]]></artwork>
        <artwork type="example"><![CDATA[
Content-Range: bytes 1020000-1254567/*
]]></artwork>
        <t>
        Note though that the difference between the first-byte-pos and 
	last-byte-pos need not be constant.
        </t>
        <t>
        The client could then follow up with a GET range request containing:
        </t>
        <artwork type="message/http; msgtype=&quot;request&quot;"><![CDATA[
GET /resource HTTP/1.1
Host: example.com
Range: bytes=1020000-999999999999
]]></artwork>
        <t>
        with the server returning
        </t>
        <artwork type="message/http; msgtype=&quot;response&quot;"><![CDATA[
206 Partial Content
Content-Range: bytes 1020000-999999999999/*
]]></artwork>
        <t>
        with the response body returning bytes 1020000-1254567 immediately 
	and aggregated/live data being returned as the content is aggregated.
        </t>
        <t>
        A server that doesn't support or supply a continuously aggregating/live
	response will supply the currently satisfiable byte range, as
	it would with an open-ended byte request. For example:
        </t>
        <artwork type="message/http; msgtype=&quot;request&quot;"><![CDATA[
GET /resource HTTP/1.1
Host: example.com
Range: bytes=0-999999999999
]]></artwork>
        <t>
        returns
        </t>
        <artwork type="message/http; msgtype=&quot;response&quot;"><![CDATA[
HTTP/1.1 206 Partial Content
Content-Range: bytes 1020000-1254567/*
]]></artwork>
        <t>
        from the server to indicate that the response will start at byte
	1020000, end at byte 1254567, and not include any aggregated 
	content. This is the response expected from a typical HTTP 
	server -- one that doesn't support byte-range requests on aggregating content.
        </t>
        <t>
          Note that responses to GET requests performed on shift-buffer
          representations using Range headers can be cached by intermediaries, since
          the Content-Range response header indicates which portion of the
          representation is being returned in the response body. However, GET
          requests without a Range header cannot be cached since the first
          byte of the response body can vary from request to request. To
          ensure GET requests without Range headers on shift-buffer representations
          are not cached, servers hosting a shift-buffer representation should
          either not return a 200-level response (e.g., send a 300-level
          redirect response with a URI that represents the current start of
          the shift buffer) or indicate the response is non-cacheable. See
          <xref target="RFC7234"/> for details on HTTP cache control.
        </t>
      </section>
    </section>

    <section anchor="RecommendedVLV">
      <name>Recommendations for Byte-Range Request last-byte-pos Values</name>
      <t>
            While it would be ideal to define a single large last-byte-pos
            value for byte-range requests, there's no single value that would work for all
            applications and platforms. For example, JavaScript numbers cannot
            represent all integer values above 2^^53, so a JavaScript
            application may want to use 2^^53-1 for last-byte-pos. This
            value, however, would not be sufficient for all applications, such
            as long-duration high-bitrate streams. So 
            2^^53-1 (9007199254740991) is recommended as a last-byte-pos
            unless an application has a good justification to use a smaller or
            larger value. For example, if it is always known that the resource won't
            exceed a value smaller than the recommended last-byte-pos for
            an application, a smaller value can be used. If it's likely
            that an application will utilize resources larger than the
            recommended last-byte-pos (such as a continuously aggregating
            high-bitrate media stream), a larger value should be used.
      </t>
      <t>
            Note that, in accordance with the semantics defined above, servers
	    that support random-access live content will need to return the 
	    last-byte-pos provided in the byte-range request in some cases -- even
	    if the last-byte-pos cannot be represented as a numerical value 
	    internally by the server. As is the case with any
	    continuously aggregating/live resource, the server should
	    terminate the content transfer when the end of the resource is 
	    reached -- whether the end is due to termination of the content
	    source or the content length exceeds the server's maximum representation length.
      </t>
    </section>
    <section anchor="IANA">
      <name>IANA Considerations</name>
      <t>This document has no IANA actions.</t>
    </section>
    <section anchor="Security">
      <name>Security Considerations</name>
      <t>
            As described above, servers need to be prepared to receive 
	    last-byte-pos values in range requests that are numerically larger 
	    than the server implementation supports and return these values in 
	    Content-Range response header fields. Servers should check the 
	    last-byte-pos value before converting and storing them into
	    numeric form to ensure the value doesn't cause an overflow or
	    index incorrect data. The simplest way to satisfy the live-range 
	    semantics defined in this document without potential overflow
	    issues is to store the last-byte-pos as a string value and return 
	    it in the byte-range Content-Range response header's last-byte-pos field.
      </t>
    </section>
  </middle>
  <!--  *****BACK MATTER ***** -->

  <back>
    <references>
      <name>References</name>

      <references>

        <name>Normative References</name>


<xi:include
    href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.7230.xml"/>

<xi:include
    href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.7233.xml"/>

<xi:include
    href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.7234.xml"/>

<xi:include
    href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.5234.xml"/>

      </references>

      <references>
        <name>Informative References</name>


<xi:include
    href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.8216.xml"/>



        <reference anchor="MPEG-DASH"
		   target="https://www.iso.org/standard/75485.html">

          <front>
            <title>Information technology -- Dynamic adaptive streaming over HTTP (DASH) -- Part 1: Media presentation description and segment formats
            </title>
            <seriesInfo name="ISO/IEC" value="23009-1"/>
            <author>
              <organization>ISO</organization>
            </author>
            <date month="August" year="2019"/>
          </front>
        </reference>
      </references>
    </references>


    <section anchor="Acknowledgements" numbered="false">
      <name>Acknowledgements</name>
      <t>
      The authors would like to thank Mark Nottingham, Patrick McManus, Julian Reschke, Remy Lebeau, Rodger
      Combs, Thorsten Lohmar, Martin Thompson, Adrien de Croy, K.&nbsp;Morgan, Roy
      T.&nbsp;Fielding, and Jeremy Poulter.
      </t>
    </section>

  </back>
</rfc>
