Syntax of temporal URI fragment specifications Commonwealth Scientific and Industrial Research Organisation CSIRO, Australia
Locked Bag 17 North Ryde NSW 2113 Australia +61 2 9325 3141 Silvia.Pfeiffer@csiro.au http://www.cmis.csiro.au/Silvia.Pfeiffer/
Commonwealth Scientific and Industrial Research Organisation CSIRO, Australia
Locked Bag 17 North Ryde NSW 2113 Australia +61 2 9325 3133 Conrad.Parker@csiro.au http://www.cmis.csiro.au/Conrad.Parker/
This document specifies a syntax for temporal offsets and intervals as URI fragments. Such fragment identifiers are useful to directly access temporal offset points and intervals in time-continuous resources such as audio and video. The URI fragment syntax specified in this document is comformant to the Generic URI Syntax as specified in RFC 2396. The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119.
Many resources on the Internet are time-continuous data such as audio or video files or streams. This document describes a standard way of addressing temporal offsets into and temporal intervals of such resources through a temporal URI fragment syntax. In this way, points of interest and intervals in time-continuous files or streams can be directly accessed. The aim is to make it simple to incorporate infrastructure into the Web which supports the browsing and searching of time-continuous media. The interpretation of the temporal fragment is however dependent on the URI scheme in use and the content type of the resource referenced.
Fragments are generally specified in a URI after the crosshatch ("#") character. The format of fragment identifiers specified in this document is conformant to the URI RFC 2396 for fragment identifiers, but retricts their use to an interpretation on the user agent only. See section 5 for a proposed alternative usage. Temporal fragments start with the reserved character "@", representing the time-continuous resource "at" a certain temporal offset. The "@" character is reserved and this specification is giving it a reserved purpose. Having the "@" character at the start simplifies parsing of a temporal fragment specification, helping to e.g. distinguish between a fragment given by name as "#smpte-25" and a fragment given as a temporal offset as "#@smpte-25=01:01:01:01". The specification of a temporal fragment offset itself is given as a name-value pair, where the name specifies a time scheme to use and the value is the time specification itself. The syntax is closely related to the specification of relative timestamps of the RTSP protocol parameters as given in RFC 2326. Temporal intervals can be specified as well. This is achieved by adding the reserved character "-" and another time specification that adheres to the time scheme used for the specification of the first time point. The BNF for temporal fragment offsets and temporal intervals is: There are several time schemes that can be used. The default time scheme is "npt" (normal play time). The available time schemes and their specifications are described in the next section.
A time scheme is a short identifier for a type of time specification. The following general time schemes are specified in this document. Further time schemes are expected to emerge and should probably be registered through IANA (XXX this needs to be discussed XXX). "npt" (Normal Play Time; base of seconds as used in the RTSTP standard) "smpte" (Society of Motion Picture and Television Engineers time codes as specified in the SMPTE time and control code standard) "utc" (Universal Time Code giving wall-clock time as specified in the ISO 8601 standard). Thus, the available time schemes are: NPT time has a second or subsecond resolution. It is specified as H:M:S.h (npt-hhmmss) or S.h (npt-sec), where H=hours, M=minutes, S=second and h=fractions of a second. Negative values are not allowed. Specification as BNF: SMPTE time codes are optimized for frame level accuracy. SMPTE is specified as HH:MM:SS:FF, where HH=hours, MM=minutes, SS=second, FF=frames. The drop-frame algorithms for calculating the exact times can be found in the references SMPTE standard. Negative values are not allowed. "smpte-24=" SMPTE time with a 24 fps basis "smpte-24-drop=" SMPTE time with a 24/1.001 fps basis "smpte-25=" SMPTE time with a 25 fps basis "smpte-30=" SMPTE time with a 30 fps basis "smpte-30-drop=" SMPTE time with a 30/1.001 fps basis "smpte-50=" SMPTE time with a 50 fps basis "smpte-60=" SMPTE time with a 60 fps basis "smpte-60-drop=" SMPTE time with a 60/1.001 fps basis Specification as BNF: UTC time has a second or subsecond resolution. It is given as YYYYMMDDTHHmmss.hhZ, where Y=year, M=month, D=day, H=hour, m=minute, s=second, h=subseconds (one-hundredth of a second). Specification as BNF: Examples for specifications of temporal fragment offsets are:
The semantic interpretation of time specifications given with any of the schemes depends on the resource. With every resource there are two associated timebases: a UTC timebase which may e.g. specify the creation time of the resource, and a playback timebase used for display in a user agent while viewing the resource. The playback timebase of a resource defaults to 0 seconds if the resource has no other timebase associated with it. For example, with professional video production, the first frame of video of a program normally refers to a SMPTE timebase of 01:00:00:00, not 00:00:00:00. This practice arose from the requirements of program production and analog videotape recording technology, and it has subsequently become a uniform, almost ironclad practice worldwide. Associating such a practice to a digital video resource requires a way to store that timebase with the resource, which may or may not be possible, depending on the content type of the resource. Examples: If a resource has an associated timebase of 3600 seconds, and the given temporal fragment offset is 4000 sec, a seek time 400 sec into the resource is requested. If the timebase is given as clock time 20001010T142211.23Z and the temporal offset specified is 20001010T145511.23Z, the time 33 minutes into the resource is requested. The UTC timebase of a resource defaults to non-specified. Associating such a UTC timebase with a resource requires a way to store that timebase with the resource. For example, for a resource that is a file on a server, it may be chosen to be the time of storage of that resource. Examples for specifications of temporal intervals are:
The semantic interpretation of these temporal intervals depends on the time scheme. Unless specified differently, the temporal intervals given are closed intervals, i.e. they start at the first time point and finish at the second time point: [time_from;time_to]. For SMPTE timecodes, however, it is conventional to express such temporal intervals as IN and OUT times for editing. Thus, the IN time specifies the first frame that is included in the interval and the OUT time specifies the first frame that is not included in the interval. Therefore, a SMPTE interval is specified as [time_from;time_to[, which explains the additional frame in the above example.
The temporal fragment specification scheme is intended to be used on time-continuous resources. An example of such resources are all resources of MIME types "audio/*" and "video/*". The protocol through which these resources are accessed are expected to be mainly http or rtp/rtsp, which are especially suited for such resources. It is RECOMMENDED that user agents do not strip off the temporal fragment from a given URI before forwarding it to a server. [XXX: This is contrary to the current prescription in the URI standard and needs to get resolved -> see Section 6.] A retrieval action on a URI that includes a temporal fragment SHOULD result in a time-continuous resource that starts at the given temporal offset. As time-continuous resources often come with high bandwidth requirements, this avoids unnecessary network load. For example, a 1 hour Digital Video (DV format) requires about 25 GB (MPEG-2 reduces that to about 3 GB, but this format must be prepared for addressing high-quality, high-resolution time-continuous bitstreams of the future). Serving out only the requested interval of a resource also significantly reduces the delay for the user agent for receiving relevant data. Alternatively, the user agent MAY wait until the retrieval action has failed, then resend the URI with the fragment stripped off and perform the offset action locally on the retrieved resource. Servers that support the temporal fragment offset MUST implement a retrieval action of time-continuous resources with such fragment specifications by serving the requested resource from the temporal offset onwards. For many time-continuous resources - especially when in compressed format - this means that the server has to parse the structure of the resource and construct another valid resource from the original resource's header information and data frames. If a server cannot perform the fragment offset, it MUST return an error as otherwise the user agent cannot identify if the offset action was performed or not. It is expected that over time more servers and client applications understand and handle the temporal fragment offset and thus enable direct networked access to content in time-continuous resources. Also network proxies may begin to understand such temporal offsets and can exploit them for caching.
This specification does not create any new security issues beyond the ones already specified for URIs in RFC 2396.
In the current version of the URI standard, it is prescribed that fragment specifications get interpreted by the user agent. Therefore, the intended use of fragments to retrieve only a temporal interval of the time-continuous data or the data from a certain offset point onwards is not generally allowed. However, there are ways to effectively get around this restriction by using (mis-using?) communication protocols. When using http, we can invent a new protocol parameter that gets filled by the user agent with the temporal fragment specification and that gets interpreted by the server. If we don't do that, we seriously defect http from being usable for time-continuous media in the future. When using rtp/rtsp, a client can strip off the fragment specification and map it onto the Range header field of the rtsp protocol, which will then tell the server which subpart of the time-continuous data bistream to serve out. A much cleaner way to resolve this problem would be to change the URI standard to allow for a server-side interpretation of fragment offsets after all other actions have been performed on the resource. This will make it independent of the protocol in use and it will enable intermediate proxies to store and forward parts of a media resource. It is then the user agent's choice whether or not to strip off the fragment offset and interpret it locally after the retrieval action or to forward it to the server with an expectation to receive only that subpart of the resource. Servers are not forced to implement that specification. It is however necessary to return an error if they cannot handle temporal fragment specifications to avoid a double offset action by both server and client.
draft-pfeiffer-temporal-fragments-01: Extension of the number of available SMPTE time-schemes. Many thanks to Bill Miller and Oliver Morgan of the SMPTE for their input on these changes. Deleted "start" and "now" as time specification values. Extension of the temporal fragment addressing to also address temporal intervals, not only time points. Added section that includes some key points of discussion where the existing URI standard contradicts the use of fragments for time-continuous data.
Key words for use in RFCs to Indicate Requirements Levels Harvard University
29 Oxford Street Cambridge MA 02138 US +1 617 495 3864 sob@harvard.edu
Uniform Resource Identifiers (URI): Generic Syntax World Wide Web Consortium
MIT Laboratory for Computer Science 545 Technology Square Cambridge MA 02139 US +1 617 253 5702 +1 617 258 8682 timbl@w3.org
University of California, Irvine
Department of Information and Computer Science University of California, Irvine Irvine CA 92697-3425 US +1 949 824 7403 +1 949 824 1715 fielding@ics.uci.edu
Xerox PARC
3333 Coyote Hill Road Palo Alto CA 94304 US +1 650 812 4365 +1 650 812 4333 masinter@parc.xerox.com
Real Time Streaming Protocol (RTSP) Columbia University
Dept. of Computer Science 1214 Amsterdam Avenue New York NY 10027 US schulzrinne@cs.columbia.edu
Netscape Communications Corp.
501 E. Middlefield Road Mountain View CA 94043 US anup@netscape.com
RealNetworks
1111 Third Avenue Suite 2900 Seattle WA 98101 US robla@real.com
SMPTE STANDARD for Television, Audio and Film - Time and Control Code The Society of Motion Picture and Television Engineers
595 W. Hartsdale Ave. White Plains NY 10607 USA smpte@smpte.org
Data elements and interchange formats -- Information interchange -- Representation of dates and times International Organization for Standardization
1 rue de Varembre Case Postale 56 Geneva 20 1211 CH central@iso.org
The authors greatly acknowledge the contributions of Andre Pang and Andrew Nesbit in developing this syntax. We also thank the SMPTE for their contributions and the URI discussion group at the W3C (uri@w3.org) for their many comments on this document.