GB2517060A - Adaptive data streaming method with push messages control - Google Patents

Adaptive data streaming method with push messages control Download PDF

Info

Publication number
GB2517060A
GB2517060A GB1410540.7A GB201410540A GB2517060A GB 2517060 A GB2517060 A GB 2517060A GB 201410540 A GB201410540 A GB 201410540A GB 2517060 A GB2517060 A GB 2517060A
Authority
GB
United Kingdom
Prior art keywords
media data
push
client
server
client device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB1410540.7A
Other versions
GB201410540D0 (en
GB2517060B (en
Inventor
Na L Ouedraogo
Franck Denoual
Herv Ruellan
Youenn Fablet
Romain Bellessort
Fr D Ric Maze
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc filed Critical Canon Inc
Priority to EP14737279.1A priority Critical patent/EP3020208B1/en
Priority to US14/903,989 priority patent/US10104190B2/en
Priority to JP2016524842A priority patent/JP6419173B2/en
Priority to CN201480050434.0A priority patent/CN105532013B/en
Priority to PCT/EP2014/064949 priority patent/WO2015004276A2/en
Priority to KR1020187029384A priority patent/KR102024311B1/en
Priority to KR1020197027171A priority patent/KR102264477B1/en
Priority to RU2016104523A priority patent/RU2625328C1/en
Priority to KR1020167003172A priority patent/KR101909160B1/en
Priority to CN201811637213.XA priority patent/CN109842613B/en
Publication of GB201410540D0 publication Critical patent/GB201410540D0/en
Publication of GB2517060A publication Critical patent/GB2517060A/en
Priority to RU2017123329A priority patent/RU2659041C1/en
Application granted granted Critical
Publication of GB2517060B publication Critical patent/GB2517060B/en
Priority to RU2018119682A priority patent/RU2683595C1/en
Priority to US16/107,093 priority patent/US10728353B2/en
Priority to JP2018192305A priority patent/JP6632682B2/en
Priority to JP2019223944A priority patent/JP6918910B2/en
Priority to US16/908,418 priority patent/US11375031B2/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/61Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio
    • H04L65/613Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio for the control of the source by the destination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • H04N21/2355Processing of additional data, e.g. scrambling of additional data or processing content descriptors involving reformatting operations of additional data, e.g. HTML pages
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/61Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/61Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio
    • H04L65/612Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio for unicast
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/65Network streaming protocols, e.g. real-time transport protocol [RTP] or real-time control protocol [RTCP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/762Media network packet handling at the source 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/60Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/254Management at additional data server, e.g. shopping server, rights management server
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/80Responding to QoS
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/55Push-based network services

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Databases & Information Systems (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention is based up DASH (dynamic adaptive streaming over HTTP) or similar in combination with the push function from HTTP/2. The invention proposes a system with a push policy to push additional media data (e.g. segments 2.1) along with explicitly requested media data 1.1. Both client and server know the policy, and this allows the client to know what data will be transmitted before it (or push promises related to it) are received. Therefore, the client may refuse or request changes to upcoming data in a timely fashion. The push policy may be implicit, or explicitly provided in a media presentation description (MPD) or similar. The client may also request changes to the push policy 1805.

Description

ADAPTIVE DATA STREAMING METHOD WITH PUSH MESSAGES
CONTROL
FIELD OF THE INVENTION
The present invention relates to data streaming over HTTP communication networks.
More particularly, the present invention relates to adaptive data streaming for satisfying network constraints. The invention may have applications in DASH networks.
DASH (acronym for Dynamic Adaptive Streaming over HTTP) is a communication standard allowing media content streaming (typically audio/video content) over HTTP. According to DASH, media presentations are described as XML files, called media presentation description" files (MPD in what follows). MPD files provide client devices with information allowing them to request and control the delivery of media contents.
BACKGROUND OF THE INVENTION
The general principle of Media streaming over HTTP is illustrated in Figure 3. Most of the new protocols and standards for adaptive media streaming over HTTP are based on this principle.
A media server 300 streams data to a client 310. The media server stores media presentations. For example, media presentation 301 contains audio and video data. Audio and video may be interleaved in a same file. The way the media presentation is built is described in what follows with reference to Figure 4a. The media presentation is temporally split into small independent and consecutive temporal segments 302a, 302b and 302c, such as MP4 segments, that can be addressed and downloaded independently. The downloading addresses (HTTP URL5) of the media content for each of these temporal segments are set by the server to the client. Each temporal segment of the audio/video media content is associated with one HTTP address.
The media server also stores a manifest file document 304 (described in what follows with reference to Figure 5) that describes the content of the media presentation including the media content characteristics (e.g. the type of media: audio, video, audio-video, text etc.), the encoding format (e.g. the bitrate, the timing information etc.), the list of temporal media segments and associated URLs. Alternatively, the document contains template information that makes it possible to rebuild the explicit list of the temporal media segments and associated URLs. This document may be written using the eXtensible Markup Language (XML).
The manifest file is sent to the client. Upon receipt of the manifest file during a step 305, the client is informed of the association between temporal segments of the media contents and HTTP addresses. Also, the manifest file provides the client with the information concerning the content of the media presentation (interleaved audio/video in the present example). The information may include the resolution, the bit-rate etc. Based on the information received, the HTTF' client module 311 of client can emit HTTP requests 306 for downloading temporal segments of the media content described in the manifest file. The server's HTTP responses 307 convey the requested temporal segments. The HTTP client module 311 extracts from the responses the temporal media segments and provides them to the input buffer 307 of the media engine 312. Finally, the media segments can be decoded and displayed during respective steps 308 and 309.
The media engine 312 interacts with the DASH control engine 313 in order to have the requests for next temporal segments to be issued at the appropriate time. The next segment is identified from the manifest file. The time at which the request is issued depends on whether or not the reception buffer 307 is full. The DASH control engine 313 controls the buffer in order to prevent it from being overloaded or completely empty.
The generation of the media presentation and the manifest file is described with reference to Figure 4a. During steps 400 and 401, audio and video data are acquired. Next, the audio data are compressed during 402. For example, the MP3 standard can be used. Also, the video data are compressed in parallel during step 403. Video compression algorithms such as MPEG4, MPEG/AVG SVG, HEVC or scalable HEVG can be used. Once compression of audio and video data has been performed, audio and video elementary streams 404, 405 are available. The elementary streams are encapsulated during a step 406 into a global media presentation. For example, the ISO BMFF standard (or the extension of the ISO BMFF standard to AVC, SVC, HEVC, scalable extension of HEVC etc.) can be used for describing the content of the encoded audio and video elementary streams as a global media presentation. The encapsulated media presentation 407 thereby obtained is used for generating, during step 408, an XML manifest file 409. Several representations of video data 401 and audio data 400 can be acquired, compressed, encapsulated and described in the media presentation 407.
For the specific case of MPEG/DASH streaming protocol illustrated in Figure 4b, the manifest file is called "Media Presentation Description" (or "MPD" file). The root element of the file is the MPD element that contains attributes applying to all the presentation plus DASH information like profile or schema. The media presentation is split into temporal periods represented by a Period element. The MPD file 410 contains all the data related to each temporal period. By receiving this information, the client is aware of the content for each period of time. For each Period 411, AdaptationSet elements are defined.
A possible organization is to have one or more Adaptationset per media type contained in the presentation. An AdaptationSet 412 related to video contains information about the different possible representations of the encoded videos available at the server. Each representation is described in a Representation element. For example, a first representation can be a video encoded with a spatial resolution of 640x480 and compressed with a bit rate of 500 kbits/s. A second representation can be the same video but compressed with a bit rate of 250 kbits/s. Each video can then be downloaded by HTTP requests if the client knows the HTTP addresses related to the video. The association between the content of each representation and the HTTP addresses is done by using an additional level of description: the temporal segments. Each video representation is split into temporal segments 413 (typically few seconds). Each temporal segment comprises content stored at the server that is accessible via an HTTP address (URL or URL with one byte range). Several elements can be used for describing the temporal segments in the MPD file: SegmentList, Segmentflase or SegmentTemp/ate. In addition, a specific segment is available: the initialization segment. The initialization segment contains MP4 initialization information (if the video has been encapsulated using the ISO BMFF or extensions thereof) that describes the encapsulated video stream. For example, it helps the client to instantiate the decoding algorithms related to the video. The HTTP addresses of the initialization segment and the media segments are indicated in the MPD file.
In Figure 5, there is shown an exemplary MPD file. Two media are described in the MPD file shown. The first one is an English audio stream and the second one is a video stream. The English audio stream is introduced using the Adaptati onSet tag 500. Two alternative representations are available for this audio stream: * the first representation 501 is an MP4 encapsulated elementary audio stream with a bit-rate of 64000 bits/sec. The codec to be used for handling this elementary stream (after MP4 parsing) is defined in the standard by the attribute codecs having the value: mp4a.0x40'. It is accessible via a request at the address formed by the concatenation of the BaseURL elements in the segment hierarchy: <BaseURL>7657412348.mp4</BaseURL>, which is a relative URI.
The <BaseURL> being defined at the top level in the MPD element by http://cdnl.example.com/' or by http://cdn2.example.com/' (two servers are available for streaming the same content) is the absolute URI. The client can then request the English audio stream from the request to the address http.//cdnl.example.com/7657412348.mp4' or to the address http://cdn2.examp/acornf/6574 12348. mp4'.
* the second representation 502 is an MP4 encapsulated elementary audio stream with a bit-rate of 32000 bits/sec.
The adaptation set 503 related to the video contains six representations. These representations contain videos with different spatial resolutions (320x240, 640x480, 1280x720) and with different bit rates (from 256000 to 2048000 bits per second). For each of these representations, a respective URL is associated through a BaseURL element. The client can therefore choose between these alternative representations of the same video according to different criteria like estimated bandwidth, screen resolution etc. The current DASH version does not provide description of Region-Of-Interest within the manifest files. Several approaches have been proposed
for such description.
In particular, components of media contents can be described using SubRepresentation elements. These elements describe the properties of one or several components that are embedded in a Representation. In Figure 6, there is shown an example of a DASH manifest file describing tile tracks as components of a video. For the sake of conciseness and clarity, only one Period 600 is represented. However, subsequent period elements would be organized in a same fashion. In part 601, a first adaptation set element is used for describing a base layer of the scalable video. For example, the video is encoded according to SVC or HEVC scalable. In part 602, a second adaptation set is used for describing the highest resolution layer of the scalable video. For non-scalable video, only the second adaptation set 602 would be present, without dependency to the base layer, i.e. the dependencyld attribute. In this second adaptation set 602, a single representation 603 is described, namely the one that corresponds to the displayable video. The representation is described as a list of segments 610 with respective URLs for client requests.
Thus, the representation depends on another representation identified by Ri' (dependencyld attribute), actually the base layer representation from the first adaptation set 601. The dependency forces the streaming client to first request the current segment for base layer before getting the current segment for the enhancement layer. This cannot be used to express dependencies with respect to tile tracks because the tracks that would be referenced this way would be automatically loaded by the client. This is something to be avoided, since it is up to the user to select the tiles of interest for him anytime during the media presentation. Therefore, in order to indicate the dependencies between the composite track and the tile tracks the SubRepresentation element is used. The displayable video is described as a list of sub-representations 604 to 608. Each sub representation actually represents a track in the encapsulated MP4 file. Thus, there is one sub-representation per tile (four tiles in the present example) plus one sub-representation for the composite track 608. Each sub-representation is described by a content component element 614 to 618 in order to indicate whether it corresponds to a tile track 614, 615, 616 and 617 or to the composite track 618. The Role descriptor type available in DASH/MPD is used with a specific scheme for tiling.
The Role descriptor also indicates the position of the tile in the full-frame video.
For example the component 614 describes the tile located at the top left of the video (1:1 for first in row and first in column). The dimensions of the tiles, width and height, are specified as attributes of the sub representation as made possible by MPD. Bandwidth information can also be put here for helping the DASH client in the determination of the number of tiles and the selection of the tiles, according to its bandwidth. Concerning the composite track, it has to be signalled in a different way than the tile tracks since it is mandatory to be able, at the end of the download, to build a video stream that can be decoded. To that purpose, two elements are added into the description. Firstly, the descriptor in the related content component 618 indicates that it is the main component among all the components. Secondly, in the sub representation, a new attribute required' is added in order to indicate to the client that the corresponding data have to be requested. All requests for the composite track or for one or more of the tile tracks are computed from the URL provided in the segment list 610 (one per time interval). In the example, "URL_X' combined with "BaseURL" at the beginning of the MPD provides a complete URL which the client can use for performing an HTTP GET request. With this request, the client would get the data for the composite track and all the data for all the tile tracks. In order to optimize the transmission, instead of the request, the client can first request the segment index information (typically the "ss/f and/or "sidx" information in ISO BMFF well known by the man skilled in the art), using the data available from the index_range attribute 620. This index information makes it possible to determine the byte ranges for each of the component. The DASH client can then send as many HTTP GET requests with appropriate byte range as selected tracks (including the required composite track).
When starting a streaming session, a DASH client requests the manifest file. Once received, the client analyzes the manifest file, selects a set of AdaptationSets suitable for its environment. Next, the client selects in the MPD, within each AdaptationSet, a Representation compatible with its bandwidth, decoding and rendering capabilities. Next, it builds in advance the list of segments to be requested, starting with initialization information for the media decoders. When initialization information is received by the decoders, they are initialized and the client requests first media data and buffers a minimum data amount before actually starting the display.
These multiple requests/responses may introduce delay in the startup of the streaming session. The risk is for service providers to see their clients leaving the service without starting to watch the video. It is common to name this time between the initial HTTP request for the first media data chunk, performed by the client, and the time when the media data chunk actually starts playing as the start-up delay. It depends on the network round-trip time but also on the size of the media segments.
Server Push is a useful feature for decreasing web resource loading time. Such servers are discussed with reference to Figures Ia to le.
In Figure lb, there is shown that in HTTP 2.0 exchanges, a request must be sent for every resource needed: resources Ri to R4 and sub-resources A to I (as shown in Figure Ia). However, when using the push feature by servers, as illustrated in Figure ic, the number of requests is limited to elements Ri to R4. Elements A to I are pushed" by the server to the client based on the dependencies shown in Figure Ia, thereby making the associated requests unnecessary.
Thus, as illustrated in Figures lb and Ic, when servers use the push feature, the number of HTTP round-trips (request + response) necessary for loading a resource with its sub-resources is reduced. This is particularly interesting for high-latency networks such as mobile networks.
B
HTTP is the protocol used for sending web resources, typically web pages. HTTP implies a client and a server: * The client sends a request to the server; * The server replies to the client's request with a response that contains a representation of the web resource.
Requests and responses are messages comprising various parts, notably the HTTP headers. An HTTP header comprises a name along with a value. For instance, Host: en.wikipedia.org" is the Host" header, and its value is "en.wikipedia.org". It is used for indicating the host of the resource queried (for instance, the Wikipedia page describing HTTP is available at http://en.wikipedia.org/wiki/HTTP). HTTP headers appear on client requests and server responses.
HTTP/2 makes it possible to exchange requests/responses through streams. A stream is created inside an HTTP/2 connection for every HTTP request and response. Frames are exchanged within a stream in order to convey the content and headers of the requests and responses.
HTTP/2 defines a limited set of frames with different meanings, such as: -HEADERS: which is provided for transmission of HTTP headers -DATA: which is provided for transmission of HTTP message content -PUSH_PROMISE: which is provided for announcing pushed content -PRIORITY: which is provided for setting the priority of a stream -WINDOW_UPDATE: which is provided for updating the value of the control flow window -SETTINGS: which is provided for conveying configuration parameters -CONTINUATION: which is provided for continuing a sequence of header block fragments -RST_STREAM: which is provided for terminating or cancelling a stream.
Push by servers has been introduced in HTTP/2 for allowing servers to send unsolicited web resource representations to clients. Web resources such as web pages generally contain links to other resources, which themselves may contain links to other resources. To fully display a web page, all the linked and sub-linked resources generally need to be retrieved by a client. This incremental discovery may lead to a slow display of a web page, especially on high latency networks such as mobile networks.
When receiving a request for a given web page, the server may know which other resources are needed for the full processing of the requested resource. By sending the requested resource and the linked resources at the same time, the server allows reducing the load time of the web page. Thus, using the push feature, a server may send additional resource representations at the time it is requested a given resource.
With reference to the flowchart of Figure le, an exemplary mode of operation of a server implementing the push feature is described.
During step 100, the server receives an initial request. Next, the server identifies during step 101 the resources to push as part of the response and starts sending the content response during step 102. In parallel, the server sends push promise messages to the client during step 103. These messages identify the other resources that the server is planning to push, for instance based on the dependencies shown in Figure Ia. These messages are sent in order to let the client know in advance which pushed resources will be sent. In particular, this reduces the risk that a client sends a request for a resource that is being pushed at the same time or about to be pushed. In order to further reduce this risk, a server should send a push promise message before sending any part of the response referring to the resource described in the push promise. This also allows clients to request cancellation of the push of the promised resources if clients do not want those resources. Next, the server sends the response and all promised resources during step 104. The process ends during a step 105.
The flowchart of Figure Id illustrates the process on the client side.
When the client has identified a resource to retrieve from the server, it first checks during a step 106 whether or not the corresponding data is already in its cache memory. In case the resource is already in the cache memory (Yes), it is retrieved from it during a step 107. Cached data may be either data retrieved from previous requests or data that were pushed by the server previously. In case it is not in the cache memory (No), the client sends a request during step 108 and waits for the server's response. Upon receipt of a frame from the server, the client checks during step 109 whether or not the frame corresponds to a PUSH promise. If the data frame corresponds to the PUSH promise (Yes), during step 110, the client processes the push promise.
The client identifies the resource to be pushed. If the client does not wish to receive the resource, the client may send an error message to the server so the server does not push that resource. Otherwise, the client stores the push promise until receiving the corresponding push content. The push promise is used so that the client does not request the promised resource while the server is pushing it. In case the data frame does not correspond to the PUSH promise (No), it is checked, during step 111, whether or not, the frame is a data frame related to push data. In case it is related to push data (Yes), the client processes the pushed data during step 112. The pushed data is stored within the client cache. In case the frame is not a data frame related to push data (No), it is checked, during step 113, whether it corresponds to a response received from the server. In case the frame corresponds to a response from the server (Yes), the response is processed during step 114 (e.g. sent to the application).
Otherwise (No), it is checked during step 115 whether or not the frame identifies the end of a response (Yes). In this case, the process is terminated during step 116. Otherwise, the process goes back to step 109.
Thus, it appears that the client receives the response and the promised resources. The promised resources are therefore generally stored in the client cache while the response is used by the application such as a browser displaying a retrieved web page. When a client application requests one of the resources that were pushed, the resource is immediately retrieved from the client cache, without incurring any network delay.
The storage of pushed resources in the cache is controlled using the cache control directives. The cache control directives are also used for controlling of the responses. These directives are in particular applicable to proxies: any resource pushed or not, may be stored by proxies or by the client only.
Figure Ia is a graph of a set of resources owned by a server with their relationships. The set of resources is intertwined: R1, R2, R3, and R4 are resources that need to be downloaded together to be properly processed by a client. In addition, sub-resources A to H are defined. These sub-resources are related to 1, 2 or 3 resources. For instance, A is linked to R1 and C is linked to R1, R2 and R4.
Figure Ib, already discussed hereinabove, shows an HTTP exchange without using the server PUSH feature: the client requests R1, next it discovers R2, A, B, C and D and request them. After receiving them, the client requests R3, R.4, F and G. Finally the client requests H and I sub-resources. This requires four round-trips to retrieve the whole set of resources.
Figure Ic, already discussed hereinabove, illustrates the HTTP exchange using the feature of pushing directly connected sub-resources by the server. After requesting R1, the server sends R1 and pushes A, B, C and D. The client identifies R2 and requests it. The server sends R2 and pushes F and G. Finally the client identifies R3, R4 and requests these resources. The server sends R3, R4 and pushes H and I. This process requires three round-trips to retrieve the whole set of resources.
In order to decrease the loading time of a set of resources, typically a web page and its sub-resources, HTTP/2 allows exchanging multiple request and response priorities in parallel. As illustrated in Figure 2, a web page may require the download of several resources, like JavaScript, images etc. During an initial HTTP exchange 200, the client retrieves an HTML file. This HTML file contains links to two JavaScript files (JS1, JS2), two images (IMG1, IMG2), one CSS file and one HTML file. During an exchange 201, the client sends a request for each file. The order given in the exchange 201 of Figure 2 is based on the web page order: the client sends a request as soon as a link is found.
The server then receives requests for JS1, CSS, IMG1, HTML, IMG2 and JS2 and processes these requests according that order. The client then retrieves these resources in that order.
HTTP priorities make it possible for the client to state which requests are more important and should be treated sooner than other requests. A particular use of priorities is illustrated in exchange 202. JavaScript files are assigned the highest priority. CSS and HTML files are assigned medium priority and images are assigned low priority. This approach allows receiving blocking files or files that may contain references to other resources sooner than other files. In response, the server is expected to try sending sooner the JavaScript files, the CSS and HTML files afterwards and the images at the end, as described in exchange 202. Servers are not mandated to follow client priorities.
In addition to priorities, HTTP/2 provides that the amount of data being exchanged simultaneously can be controlled. Client and server can specify which amount of data they can buffer on a per connection basis and a per stream basis. This is similar to TOP congestion control: a window size, which specifies an available buffer size, is initialized to a given value; each time the emitter sends data, the window size is decremented; the emitter must stop sending data so that the window size never goes below zero. The receiver receives the data and sends messages to acknowledge that the data was received and removed from the buffer; the message contains the amount of data that was removed from the buffer; the window size is then increased from the given value and the emitter can restart sending data.
In view of the above, it appears that DASH is based on the assumption that the client leads the streaming since the client can generally select the best representation of the content for the purpose of the application it is performing. For instance, a client may know whether to request High-Definition or Small-Definition content based on its form-factor and screen resolution.
Server-based streaming is typically done using RTP. Contrary to DASH, RTP does not use HTTP and cannot directly benefit from the web infrastructures, in particular proxies and caches. Web socket based media streaming has the same drawbacks. With HTTP/1.1, server-based streaming cannot be easily implemented since the server can generally only answer to client requests. With HTTP/2, in particular with the introduction of the push feature, DASH-based servers can lead the streaming. Thus, servers can use their knowledge of the characteristics of the content they are streaming for optimizing the user experience. For instance, a server may push a film as SD (due to limited bandwidth) but advertisements as HD since advertisements take an additional limited amount of bandwidth. Another example is the case of a server that starts to do fast start with a low-resolution video and switches to the best possible representation once bandwidth is well estimated.
In order to enable a server to lead the streaming, one approach is to let the server push data (in particular DASH data) as preferred. The client then uses whatever data is available to display the video. The server typically announces the push of several segments at once. The server then sends the segments in parallel or successively.
A problem that occurs is that client and server may not know if the promised data will be transmitted and received at the desired time: the client may not know when and in which order the video segments will be sent.
Also, the promised data pushed or announced by the server may mismatch the client needs, thus leading to resource wasting in particular at the server end.
Thus, there is a need for enhancing data streaming especially in the context of DASH-based communications.
SUMMARY OF THE INVENTION
The present invention lies within this context.
According to a first aspect of the invention corresponding to the server's perspective, a method of streaming media data by a server device to a client device, comprises the following steps: -receiving, from the client device, a request relating to first media data, -identifying second media data to be sent to the client device without having been requested, and -in response to said request, transmitting to said client device, data relating to said first media data, and preparing at least one announcement message respectively identifying said second media data with a view to transmit the announcement message or messages to the client device, and wherein the method further comprises the step of using a push policy shared with the client device and defining how to determine second media data, for the server device to determine the second non-requested media data to be sent to the client device.
According to a second aspect of the invention corresponding to the client's perspective, a method of accessing by a client device, media data streamed by a server device, the method comprises the following steps: -transmitting, to the server device, a request relating to first media data, -receiving from said server device, in response to said request, data relating to said first media data, wherein the method further comprises the step of using a push policy shared with the server device and defining how to determine second media data, for the client device to determine second media data to be sent by the server device without having been requested by the client device.
Thanks to the invention, the mismatch between the server's decision regarding the media data to be pushed and the client's needs can be decreased and thus resources can be saved.
This is achieved by using a shared push policy that makes it possible for the client to anticipate the server's behaviour, and thus the second media data that are about to be pushed. Thanks to the shared push policy than may be used for several client's subsequent requests, the client can anticipate the server's behaviour even before the requests are sent to the server.
As a consequence of the anticipation, the client can prepare and request the cancellation of such second media data that are not needed, in an anticipated fashion with respect to the announcement by the server.
The request relating to first media data may concern first media data and/or other data related to this first media data.
The second media data may be associated with said first media data, for example by the server device.
Embodiments of the invention provide a lightweight mechanism for server-guided streaming. Embodiments may be implemented in the context of DASH networks.
Server devices can make content recommendations to the client devices. Also, they can optimize the network usage.
Embodiments of the invention are compatible with existing HTTP/2 features. These features can advantageously be used for implementing embodiments of the invention.
Network performances are generally increased.
Correspondingly, the invention also regards a server device for streaming media data to a client device, the device comprising: -a receiver configured for receiving, from the client device, a request relating to first media data, -a control unit configured for identifying second media data to be sent to the client device without having been requested, and -a transmitter configured for, in response to said request, transmitting to said client device, data relating to said first media data, and preparing at least one announcement message respectively identifying said second media data with a view to transmit the announcement message or messages to the client device, and wherein the control unit is further configured for using a push policy shared with the client device and defining how to determine second media data, in order to determine the second non-requested media data to be sent to the client device.
The invention also regards a client device for accessing media data streamed by a server device, the device comprising: -a transmitter configured for transmitting, to the server device, a request relating to first media data, and -a receiver configured for receiving from said server device, in response to said request, data relating to said first media data, wherein the client device is configured for using a push policy shared with the server device and defining how to determine second media data, in order to determine second media data to be sent by the server device without having been requested by the client device.
The server and client devices have the same advantages as the corresponding methods as described above.
Optional features of the methods and devices are defined in the dependent claims. Some of them are explained below with respect to the methods. However they can also apply to the corresponding device.
In some embodiments referred below as to an explicit approach, the method from the server's perspective further comprises: determining by the server device a push policy, and transmitting, from the server device to the client device, push policy information describing the determined push policy to share the push policy with the client device.
Correspondingly at the client side, the method may further comprise receiving, from the server device, push policy information describing the shared push policy.
As described in some examples below, the push policy information describing the shared push policy is inserted in a description file that is transmitted from the server device to the client device, the description file containing description information that concerns media data including the first media data, the method further comprising determining the second non-requested media data based on said description file using the shared push policy.
In a specific embodiment, the description file describes the media data using a plurality of media data attribute levels, and various shared push policies are defined at various respective levels of the description file.
In other examples, the push policy information describing the shared push policy is embedded in a header of an HTTP frame transmitted from the server device to the client device.
According to specific features, the method may further comprise, at the server device, receiving push policy update information embedded in a header of an HTTP frame from the client device, and updating accordingly the shared push policy before determining non-requested media data from other media data requested by the client device.
Correspondingly, the method may further comprise, at the client device, sending push policy update information embedded in a header of an HTTP frame to the server device.
According to a hybrid approach, the push policy information describing the shared push policy is defined by a first push policy part and a second push policy part, the first push policy part being inserted in a description file that is transmitted from the server device to the client device, the description file containing description information that concerns media data including the first media data, the method further comprising determining the second non-requested media data based on said description file using the shared push policy, and the second push policy part being embedded in a header of an HTTP frame transmitted from the server device to the client device.
For example, the second push policy part may comprise one or more values for one or more associated variables defined in the first push policy part.
Also, the description file may include the description of a plurality of candidate push policies, and the second push policy part may thus comprise an identifier of a candidate push policy from said plurality, which identified candidate push policy thereby forming the first push policy part.
In other embodiments, the push policy information includes a JavaScript program embedded in a web page transmitted from the server device to the client device.
In yet other embodiments, the method further comprises determining the second non-requested media data based on a structured document (such as a description file described above or an HTML page introduced in examples below), the structured document containing description information that concerns media data including the first media data, and the push policy information includes an XPath expression to be evaluated on a tree representation of the structured document to identify the second non-requested media data.
Regarding the syntax of the push policy information, embodiments provide that the push policy information includes a first push attribute defining an amount of second non-requested media data to be identified in a description file, the description file containing description information that concerns media data including the first media data, and the method further comprising determining the second non-requested media data based on said description file using the shared push policy.
According to specific features, the first push attribute identifies the second non-requested media data relatively to the first media data requested within the description file. This may be done using operators as described below.
In a variant, the first push attribute is an identifier of specific media
data within the description file.
According to specific features, the description information in the description file describes media data according to at least one media data attribute from amongst a period attribute defining a time period to which the media data belong, an adaptation attribute defining a media type of the media data, a representation attribute defining an encoding version (e.g. bitrate, frame rate, frame resolution, timing information, etc.) of the media data and a segment attribute defining, and the push policy information includes at least a second push attribute defining a constraint on the media data attribute or attributes, for identifying the second non-requested media data.
This makes it possible to have very selective push policies
throughout the description file.
In particular, the push attribute or attributes may define a media data attribute or attributes of the second non-requested media data relatively to the corresponding media data attribute or attributes of the first media data within the
description file.
Alternatively, the push attribute or attributes may identify a node in the description file in which the second non-requested media data have to be retrieved.
In some embodiments, the description information in the description file includes priority attributes associated with the media data, one priority attribute for each media data, and an order of transmission of the second media data is based on the associated priority attributes. This is to define an order of transmission of the push data.
In embodiments, the shared push policy identifies the second media data from the first media data requested.
In embodiments referred below as to an implicit approach, the shared push policy is implemented using the same second media data determining algorithm at both the server device and the client device, the algorithm enabling the server device and the client device to determine the same second media data from the first media data requested.
In some embodiments adapted to both the implicit and explicit approaches, if the identified second media data comprise a plurality of media segments, each requiring an announcement message, the method may further comprise merging the corresponding plurality of announcement messages into a single announcement message to be transmitted to the client device. This is to reduce bandwidth consumption since less announcement messages will be sent.
To take actual advantage of the shared push policy and consequent anticipation of pushes by the client device, the method may further comprise receiving, from the client device, a cancellation request requesting to cancel transmission of a part of the second non-requested media data so that the server device does not transmit the corresponding prepared announcement message.
Correspondingly at the client, the method may further comprise sending, to the server device, a cancellation request requesting to cancel transmission of a part of the second non-requested media data, in order to drive the server device not to transmit an announcement message identifying the part of the second non-requested media data.
In embodiments of the invention, the second non-requested media data are determined by the client device independently of at least one announcement message prepared by (and possibly received from) the server device and identifying the second non-requested media data the server device intends to send to the client device without having been requested. Here, "independently" means that the client device is able to make the determination of the second non-requested data without being aware of such announcement message (i.e. PUSH_PROMISE) which is dedicated to inform the client device of the future transmission of such non-requested data.
In other embodiments of the invention, the same shared push policy is used for determining respective non-requested media data from a plurality of requests relating to respective first media data. By using the same push policy over time and successive requests, the client is even more in good position to efficiently anticipate the transmission of useless data by the server, and thus in position to efficiently cancel their transmission and the transmission of corresponding announcement messages.
Regarding the notification of an order of transmission of the push data from the server to the client, a method of streaming media data by a server device to a client device, may comprise the following steps: -receiving, from the client device, a request relating to first media data, -identifying second media data to be sent to the client device without having been requested, -transmitting to said client device, in response to said request, data relating to said first media data, and at least one announcement message respectively identifying said second media data, and wherein the method further comprises the following steps: -defining by the server device an order of transmission of the second media data, -transmitting information related to the order of transmission with said announcement messages, said information enabling the client device to determine the order of transmission defined by the server.
For example, the order of transmission of said second media is defined according to priority values according to the client device, the media data having the highest priority value being transmitted first.
Said priority values may be defined according to the HTTP/2 protocol.
According to embodiments, at least one priority value is associated with a network bandwidth estimation mechanism, and the method further comprises the following steps: -transmitting to the client device second media data with a priority value associated with said mechanism, -receiving from the client device, in response to said second media data, at least one control flow message, and -estimating an available bandwidth based on said at least one control flow message received.
For example, the server device transmits said second media data according to a plurality of data frames having respective and different sizes.
The method may further comprise defining by the server device, based on said bandwidth estimation, an updated order of transmission of the second media data.
According to embodiments said request from the client device comprises a request for receiving a description file related to media data comprising said first media data, the description file containing description information concerning said first media data, the method further comprising determining the second non-requested media data based on said description file.
For example, requested first media data are video segments.
The streaming may be performed according to the DASH standard.
For example, the method further comprises the following steps: -receiving, from the client device, an ordering update request, -defining, based on said ordering update request, a new order of transmission of the second media data and updating the information related to said new order of transmission of the second media data, and -transmitting said second media data to the client according to said updated information related to the order of transmission.
The method may further comprise transmitting to the client device, an ordering update confirmation message.
For example, said updated order is defined for the second media data for which transmission to the client device has not been initiated at the time of receipt of said ordering update request.
For example, said ordering update request comprises an ordering value for at least part of second media data.
According to embodiments, the order of transmission of said second media is defined according to priority values, and when a priority value is updated for at least part of a first media data, the priority values for at least part of second media data to be sent to the client device without having been requested and associated with said at least part of first media data, are updated accordingly.
For example, said first and second media are associated according to at least one of a temporal relationship, a spatial relationship and a quality relationship.
According to embodiments: -said second media data comprises enhancement data for enhancing quality of the first media data, and -when a priority value is updated for a media data of an enhancement layer, priority values are updated for all the media data of said enhancement layer.
For example, the first and second media data comprise video temporal segments, and the starting time of the enhancement media data is based on information related to a video content of the first media data.
For example, said information related to a video content of the first
media data is stored in said description file.
For example, said order of transmission is based at least on decoding relationships between first a second media data.
For example, said order of transmission is based at least on statistical popularities of the media data.
For example, said order of transmission is based at least on a playing time of the media data on the client device's end.
For example, said order of transmission is based at least on an estimated transmission time of the media data.
For example, said order of transmission is based at least on user-defined interests for the media data.
The method may further comprise the following steps: -receiving, from the client device, control messages, said control messages enabling the server device to identify media data currently being played, -defining by the server, based on said control messages, an updated order of transmission of the second media data, and -transmitting said second media data to the client according to said updated order of transmission.
The method may further comprise a step of transmitting to the client device, an ordering update confirmation message.
For example, said control messages relate to a use of a buffer memory of the client device, said buffer memory storing media data for them to be played by the client.
For example, the server device keeps record of first requested media data sent, and identification of the second media data is performed based on said use of the buffer memory and said record.
For example, said order of transmission information is transmitted within said announcement messages.
For example, said order of transmission information is transmitted within dedicated messages after said announcement messages.
From the client's perspective, a method of accessing by a client device, media data streamed by a server device, may comprise the following steps: -transmitting, to the server device, a request relating to first media data, -receiving from said server device, in response to said request, data relating to said first media data, and at least one announcement message respectively identifying second media to be sent to the client device without having been requested, wherein the method further comprises the following step: -receiving information related to an order of transmission of the second media data with said announcement messages, said information enabling the client device to determine an order of transmission of the second media data defined by the server.
The method may further comprise determining by the client device whether the order of transmission of the second media data defined by the server device satisfies streaming constraints at the client device's end, and if said constraints are not satisfied, transmitting, to the server device, an ordering update request.
For example, the order of transmission of said second media data is defined according to priority values according to the client device, the media data having the highest priority value being transmitted first.
For example, said priority values are defined according to the HTTP/2 protocol.
According to embodiments, at least one priority value is associated with a network bandwidth estimation mechanism, the method further comprises the following steps: -receiving from the server device second media data with a priority value associated with said mechanism, -transmitting to said server device, in response to said second media data, at least one control flow message, thereby enabling the server device to estimate an available bandwidth based on said at least one control flow message transmitted.
For example, the client device receives said second media data according to a plurality of data frames having respective and different sizes.
For example, an updated order of transmission of the second media data is defined, by the server device, based on said bandwidth estimation.
For example, said request from the client device comprises a request for receiving a description file related to media data comprising said first media data, the description file containing description information concerning said first media data, the method further comprising determining the second non-requested media data based on said description file.
For example, requested first media data are video segments.
For example, said streaming is performed according to the DASH standard.
The method may further comprise receiving said second media data from the server device according to updated information related to a new order of transmission of the second media data defined by the server device.
The method may further comprise a step of receiving from the server device, an ordering update confirmation message.
According to embodiments, said updated order is defined for the second media data for which transmission from the server device has not been initiated at the time of receipt of said ordering update request by the server device.
According to embodiments, said ordering update request comprises an ordering value for at least part of the second media data.
According to embodiments, the order of transmission of said second media is defined according to priority values, and when a priority value is updated for at least part of a first media data, the priority values for at least part of second media data to be sent to the client device without having been requested and associated with said at least part of first media data, are updated accordingly.
For example, said first and second media data are related according to at least one of a temporal relationship, a spatial relationship and a quality relationship.
According to embodiments: -said second media data comprise enhancement data for enhancing quality of the first media data, and -when a priority value is updated for at least pad of first media data of an enhancement layer, priority values are updated for all the media data of said enhancement layer.
For example, the first and second media data comprise video temporal segments, and the starting time of the enhancement media data is based on information related to a video content of the first media data.
According to embodiments, said information related to a video content of the first media data is stored in said description file.
According to embodiments, said order of transmission is based at least on decoding relationships between first and second media data.
According to embodiments, said order of transmission is based at least on statistical popularities of the media data.
According to embodiments, said order of transmission is based at least on a playing time of the media data on the client device's end.
According to embodiments, said order of transmission is based at least on an estimated transmission time of the media data.
According to embodiments, said order of transmission is based at least on user-defined interests for the media data.
The method may comprise the following steps: -transmitting, to the server device, control messages, said control message enabling the server device to identify a media data currently being played, and -receiving said second media data from the server device according to an updated order of transmission defined, by the server device, based on said control messages.
The method may comprise a step of receiving from the server device, an ordering update confirmation message.
For example, said control messages relate to a use of a buffer memory of the client device, said buffer memory storing media data for them to be played by the client device.
According to embodiments, the server device keeps record of first media data sent, and identification of the media being currently played is performed based on said use of the buffer memory and said record.
For example, said order of transmission information is received within said announcement messages.
For example, said order of transmission information is received within dedicated messages after said announcement messages.
Still referring to the order of transmission, a method of managing, by a proxy server, data exchanges between client devices and server devices, may comprise the following steps: -receiving, from a server implementing a method according to the first aspect, media data to be retransmitted to a client device, -determining, based on the order of transmission of the media data, a retransmission priority for the media data, and -performing retransmission of the media data received to the client device, based on said transmission priority determined.
The method may further comprise storing said media data received, based on said retransmission priority determined.
The method may further comprise the following steps: -receiving, from a client device implementing a method according to the second aspect, an ordering update request, -updating said retransmission priority according to said ordering update request, if said request is related to a media data to be retransmitted, and -performing retransmission of the media data according to the updated retransmission priority.
The method may further comprise the following steps: -receiving from a first client device, a request to a first server device, for media data, wherein said media data is stored by the proxy server for retransmission to a second client device from a second server device, -determining priority values respectively associated with said media data by said first and second server devices, -updating said priority values according to respective streaming constraints for the first and second client devices, and -retransmitting said media data to said first and second client devices according to said updated priority values, wherein said first and second server devices implement a method according to the first aspect and said first and second client devices implement a method according to the second aspect.
The method may further comprise sending to the first and second server devices update notifications relating to the updated priority values.
According to another aspect of the invention there is provided a method of streaming data between a server device and a client device comprising: -performing a method according to the first aspect by a server device, and -performing a method according to the second aspect by a client device.
According to yet another aspect of the invention there are provided computer programs and computer program products comprising instructions for implementing methods according to the first and/or second aspect(s) of the invention, when loaded and executed on computer means of a programmable apparatus.
According to yet another sixth aspect of the invention, there is provided a server device configured for implementing methods according to the first aspect.
According to yet another aspect of the invention, there is provided a client device configured for implementing methods according to the second aspect.
BRIEF DESCRIPTION OF THE DRAWINGS
Other features and advantages of the invention will become apparent from the following description of non-limiting exemplary embodiments, with reference to the appended drawings, in which, in addition to Figures 1 a to 6: -Figures 7a and 7b illustrate media segment reordering according to embodiments; -Figure 8 is a flowchart of exemplary steps performed by servers according to embodiments; -Figure 9 is a flowchart of exemplary steps performed by clients according to embodiments; -Figure 10 is a flowchart of exemplary steps performed by proxies according to embodiments; -Figure 11 illustrates bandwidth measurement according to embodiments; -Figure 12 illustrates video playing initialization according to embodiments; -Figure 13 is a schematic illustration of devices according to embodiments; -Figure 14a illustrates, using a flowchart, general steps of the invention at the client side; -Figure 14b illustrates, using a flowchart, general steps of the invention at the server side; -Figure 15a illustrates, using a flowchart, steps of determining the shared push policy at the client side based on the explicit approach; -Figure 15b illustrates, using a flowchart, steps of determining the push policy at the server side when the explicit approach is used; -Figure 16 shows an MPD document in which a PushPolicy node is used to specify a push policy applied by the server; -Figure 17 illustrates, using a flowchart, steps for identifying and marking some segments as ready to be pushed according to the shared push policy PushPolicy"; -Figure 18a illustrates an example of communication between a server and client with a push policy transmitted in a HTTP push-policy" header; -Figure 18b illustrates the same example with a client's request to change the push policy; -Figure 20 illustrates, using a flowchart, steps of the process at the server side according to embodiments merging announcement messages; -Figure 21 illustrates, using a flowchart, steps of the process at the server side when using HTTP headers to declare the push policy -Figure 22 illustrates, using a flowchart, steps of the process at the client side when using HTTP request to declare and share the push policy; -Figure 23 shows an MPD document in which a SupplementalProperty element is used to specify a push policy applied by the server at a hierarchical level of the document; -Figure 24 shows an MPD document used as example for an XPath based push policy; and -Figure 25 illustrates the reordering of elements in a priority tree, for example in a webpage, before applying a push policy.
DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION
In what follows, embodiments of the invention are described in the context of DASH-based networks implementing the HTTP 2.0 protocol. The data streamed is, for example, video data. Embodiments of the invention are not limited to DASH networks.
A server device of a communication network that streams data to a client device implements a push feature according to which it can transmit data elements to the client without explicit requests from the client for the data elements transmitted.
The server and the client may share push policies that drive the server to determine the push promises and to actually transmit the corresponding data. Thanks to this sharing, the client may anticipate the push of some useless data, to cancel such push. This results in reducing the server's processing, as well as the network use since PUSH_PROMISE frames may be cancelled before being sent.
In specific embodiments, the server can indicate in its push promises, by which it announces transmission of the not explicitly requested data elements, ordering information concerning the order in which the server intend to transmit the data elements. The order of the data elements may be defined using priority values, for example the priority values according to HTTP/2.
Upon receipt of the push promises, the client device can determine in advance the order of transmission intended by the server, thereby enabling the client to react to the proposed order in case it does not match its own desired order. For example, the client device can update the priority values and send the updated priority values to the server. The server can thus change the transmission ordering based on the new priority values in order to better match the client's needs. The server can use the updated priorities into account for future data transmissions.
According to embodiments, the client may request a full reordering or a partial reordering of the transmission of the data elements to the server.
Full reordering is described with reference to Figure 7a. A client requests, during a step 700, a Media Presentation Description (MPD hereinafter) to a server. The server retrieves the MPD to send back to the client and identifies corresponding data elements to push during a step 701. In the example of Figure 7a, the server identifies "Data 1.1", "Data 1.2" and Data 1.3" as data elements to push. These elements are for example data segments.
Element Data Xl" represents the base layer for data X, element Data X.2" represents the enhancement layer for data X and "Data X.3" represents the additional enhancement layer for data X. The server defines a specific order of transmission for the data elements. The server associates respective priority values with the PUSH_PROMISE frames to be sent to the client for announcing the upcoming push data elements. The server then sends the PUSH_PROMISE frames "P1.1", "P1.2" and "P1.3" with the associated priorities and the MPD during a step 702. Next, shortly after sending the MPD and the push promise, during a step 703, the server sends to the client a data frame corresponding to the "Data 1.1" element and a PUSH_PROMISE messages "P2.1", "P2.2" and "P2.3" respectively corresponding to the elements "Data 2.1", Data 2.2" and "Data 2.3", which are segments following "Data 1.1", "Data 1.2" and "Data 1.3" in the transmission order defined. In parallel to the receipt of the data frame and the push promise of step 703, the client decides, after receipt of the MPD and the "P1.1", "P1.2" and "P1.3" PUSH_PROMISE frames, that the enhancement layer "Data 1.2" is of lower priority compared to the additional enhancement layer "Data 1.3". Thus, the client sends a priority update frame to lower "Data 1.2" priority during a step 704. Upon receipt of the priority update request, the server changes the schedule of the transmission during a step 705. Hence, transmission of "Data 1.2" is postponed after "Data 1.3" is transmitted. In addition, the server uses the MPD to link the segments associated with "Data 1.2". It identifies "Data 2.2" and lowers its priority as well.
Partial reordering is described with reference to Figure lb. Steps 710 to 714 of Figure lb are substantially the same as steps 700 to 704 of Figure 7a. After receipt of the priority update frame, the server behaviour differs as compared to step 705 previously described. During step 715, the server already started transmission of "Data 1.2" and proceeds further with the transmission. For that segment, there is no change in the priority. The server nevertheless updates the priority of the connected segments, namely "Data 2.2" in the present example. In order to announce the fact that the priority change has been taken into account, the server may send a priority update massage for "Data 2.2". The client can thus be informed of the change.
Embodiments of the invention may be implemented in use cases wherein servers can push high quality video parts well enough in advance so that the whole part of the video can be played as high quality. For instance, the video can be split into a part 1, played as low quality, a part 2, played as high quality and a part 3 played as low quality. The bandwidth between the client and server allows real-time streaming of low quality but not high quality. In that case, the server may interleave part 1 with the enhancement of part 2. Once part 1 has been played, the enhanced part 2 is also available and the server sends the base layer of part 2 to be played as high quality jointly with the enhancement of the same part 2. Thus, the server makes sure that the whole part 2 is played as high quality. Part 3 is sent thereafter. Quality flickering, which disturbs the user experience, can be alleviated and quality switching only occurs at a limited number of moments. The server is in the best position to know when to switch to a different quality level since it knows the video content.
Figure 8 is a flowchart of steps performed by a server implementing a push-based DASH media streaming according to embodiments. Steps 800 to 812 describe the general principles. Steps 820 to 827 more specifically deal with the management of the priority feedback from the client.
During a step 800, the server receives a request R from the client.
This request identifies a specific media, typically by referring to an MPD file.
Next, the server performs an iterative process comprising steps 801 to 810. The process comprises sending data according to a defined order. The order of transmission is updated according to the client's feedback. Once the data are sent, they are received and played by the client. Next the server identifies new data to send and the process continues so on.
The first iteration starts with step 801, during which the data to be sent are identified. In case of the first performance of the iterative process, a fast start approach may be used in order to enable the client to start video playing as quickly as possible. In addition, the server may also identify subdivision of the media into chapters. In case the server knows that the client generally navigates using chapters, the server may actually select not only the segments that correspond to the beginning of the media but also the segments corresponding to the start of the first chapters in the media. After the first performance of the iteration, the server may also detect that the connection may support the transmission of a higher quality representation of the media. Thus, the server may identify when the resolution or quality switch should be done.
Once the server identified a list of segments to push, the server defines a transmission order for these segments. The transmission order is used for computing initial priority values for each pushed segment during a step 802. The ordering may be based on several parameters.
A first parameter may be the relationships between the different segments: for example some segments must be available for correctly decoding other segments. The segments that must be available are thus assigned higher priorities than said other segments.
A second parameter may be the popularity of video segments, which may be gathered from past statistics. As an example, with YouTube URL5 specific times in a video may be addressed. When clicking on the links associated with these URLS, only the video needed to start the video playing at the specified time is retrieved. In addition, if a video is being chaptered, the beginning of each chapter is generally more often retrieved from users than segments in between chapter starts. Segments of the chapter beginning are thus assigned higher priorities than the in-between chapter segments.
A third parameter may be the timeline: the priority of a video segment that is closer to being played is higher than the priority of a video segment that is to be played later.
A fourth parameter may be the estimated time spent to actually transmit the segment. When the video segment is large, it takes a long time to be transmitted and therefore, transmission should start as soon as possible, i.e. with a high priority.
In case two segments have identical priorities, the corresponding data frames can be interleaved during transmission.
In case regions of interests are identified in the media content, if the bandwidth is not large enough for a high quality representation but is large enough for a low quality representation, the server may select an enhancement layer only for the region of interest.
Once the priorities are computed, the server sends PUSH_PROMISE frames containing the priority values during step 803. Identification of all segments is not needed for starting transmission of the PUSH_PROMISE frames. In case an MPD is to be sent for the segments to be pushed (step 804), the MPD is sent (step 805). The segment transmission starts in parallel during step 806.
Once the PUSH_PROMISE frames are received by the client, the server may receive priority update changes and then change its transmission schedule accordingly (steps 807 to 808 and steps 820 to 828). While sending segments, the server awaits receipt of priority change messages. In case a priority change message is received (step 807), the server reorders the segments accordingly and continue the segment transmission (step 808). Once all segments are sent (step 809-1), the server restarts an iteration process in order to continue streaming the media until the end of the media. When the end of a media is reached (step 809-2), the server checks whether or not it should automatically start streaming another media (step 810). In case another media should be streamed (Yes), the server identifies the new media to stream (step 811) and restarts the process from step 801. In case no new data should be streamed, the process is stopped (step 812).
The management of the priority feedback from the client, i.e. of step 808, starts with the receipt of a priority update change message during step 820. The following steps may also be performed in case the client cancels a segment push: this case may be seen in practice as equivalent to assigning the lowest priority to that segment.
Upon receipt of the priority update change message, the server identifies the related segment during step 821. The server then proceeds with the reordering of the segment transmission (steps 822, 823). If the segment is already transmitted, the process ends. If the segment is being transmitted, depending on the server implementation, it may refuse to change the transmission (for example because it is too complex) or it may actually reschedule the remaining data to be sent.
The rescheduling of the data may be performed as follows. The server stores a list of video segments to push (and/or video segments that are being pushed). This list is ordered according to the priorities set by the server.
The server then sets the new priority value for the segment. The list is then reordered and the corresponding video segment transmission is made earlier or later accordingly.
Once the video segment is reordered, the server may actually decide to apply this priority change to other related video segments. If a client raised the priority of a video segment which is pad of an enhancement layer, the server may raise the priority of all the segments of this enhancement layer.
Conversely, if the client lowers the priority of a base video segment layer, the priority of all segments temporally related to this segment may be lowered. This process is described in steps 824 to 827. Based on the MPD and the rescheduled video segment, the server identifies a list of related segments (step 824). The relationship may be temporal, spatial, quality-based etc. The MPD may be enhanced in order to better show the potential relationships. In particular, when the priority of an initialization segment (which is necessary to play more than one video segment) is lowered or raised, all related segments may be rescheduled. This can be the case as well for base layer segments and enhancement segments. For each identified related segment, the server tests whether or not the transmission of the related segment should be changed (step 825). In case it should be changed, the server computes the new priority value for each segment (step 826) and reschedules the segment transmission accordingly (step 827). The new priority value may be computed by adding to the old value the difference between the new priority value received during step 820 and the initial priority value of the segment identified during step 821. The process stops when each related segment has been tested (step 828).
The server may also receive control flow messages, such as WINDOW_SIZE frames. These messages may enable the server to identify what the client is currently playing. When some additional buffer space is available on the client's end, it may be inferred that some data have been removed from the buffer, typically the oldest data. If the server keeps a history of the data sent, the server is able to identify which data have been removed.
Thus, provided the server knows the client's cache ordering, the server can have knowledge of which video segments the client is currently playing. This ordering may be based on the MPD that makes it possible to order the cached data according to the timeline. A server may then detect client time skipping for instance. The server may react by quickly sending the start of the next chapter in advance so that the client can continue skipping video chapters.
It should be noted that the sending of a PUSH_PROMISE frame with priorities may be done in various ways. A PUSH_PROMISE frame must relate to an opened stream which is initiated by the client. According to embodiments, the initial stream made by the client during step 800 may be always left open.
According to other embodiments, a PUSH_PROMISE frame is sent within a stream opened by the server. In this case, the client considers the PUSH_PROMISE frame as it is sent by the parent client-initiated stream. Thus, it can compute the right headers of the virtual request corresponding to the particular PUSH_PROMISE frame.
According to other embodiments, a priority message is sent jointly with a PUSH_PROMISE. A first possibility is to send it as a header within the PUSH_PROMISE frame. Another possibility is to send a PRIORITY frame with the stream ID reserved by the corresponding PUSH_PROMISE frame. A third possibility is to send the PUSH_PROMISE frame, then the corresponding HEADERS frame (to open the stream) and then the PRIORITY frame on this newly opened stream.
In order to further control the client's buffer, the server may send a new representation of a segment cached by the client. Within the headers sent as part of this new representation, HTTP cache directives may be used to request the client to actually remove the segment, for instance by marking it as not cacheable. This may make it possible to recover buffer space on the client's end. HTTP/2 control flow may be used. The server can then push additional data.
A server may send priority values for each video segment. The server may also send priority values for specific segments. In case the server did not send a priority value for a current PUSH_PROMISE frame, the client can compute a priority value from the last priority value sent from the server. For instance, the client may increment the priority value each time a new PUSH_PROMISE frame with no priority value associated with is received.
Hence, the PUSH_PROMISE frames can be grouped so that updating the priority of the specific segment will also update the priorities of all segments of the group.
The process on the client's side is described with reference to Figure 9.
The client should be able to play the content available at a given time. However, the client has to cope with potential buffer limitations and processing time. The client has to check whether or not the transmission ordering proposed by the server matches the memory space available in the client's buffer and matches the content currently played by the client.
During a first step 900, the client connects to the server and requests an MPD file. The client then retrieves the MPD file during a step 901 and waits (step 902) for the receipt of data. When data are received, the client checks (step 903) whether the data are a push promise. In case a push promise has been received, this means that a new video segment is being sent by the server. The client processes the push promise. In particular, the client may validate the priority values proposed by the server during step 904. In case the client wishes to change the priority values (step 905) for the current segment or another promised segment, the client compute a new priority value and sends it to the server (step 906).
In case the client receives video data (step 907), the client links the video segment to the MPD file (step 908) and stores the video data (step 909).
Linking the video data to the MPD file makes it possible for the client to retrieve the video segment when it will be further used for decoding the video (step 911). This may also provide efficient storage of the video data (step 909), for example if contiguous video segments are grouped.
The buffer storage constraints may further change the priority. Thus, the client may check again whether a priority value has to be changed and may communicate with the server if needed (steps 905, 906).
Once the client is ready to start or continue playing video (step 910), the client retrieves from its cache the next time slot video segments (step 911) and decodes and plays the video (step 912). As part of step 911, the client may query its cache in order to know which video segments are available. By default, the client may use all video segments available, in particular all enhancement segments if any. The client may let the server select the content: generally speaking, all segments should be used by the client. If some segments cannot be used jointly (like audio English tracks and French tracks), the client should dismiss the unused segments in the first place. It should be noted that not all clients may get access to the cache state: web applications in particular do not usually have access to the web browser cache. In such a case, the server may directly send the list of pushed segments to the web application client. For instance, this information may be exchanged from the server to the client using a web socket connection.
As the video is played and decoded, the corresponding video segments may be removed from the buffer. Hence, the client updates its available buffer size using a WINDOW_SIZE frame. The client may keep video segments that have been recently played in order to enable the user to rewind the video during a limited period of time. The flow control update mechanism may also be used when the user does a fast forward/time skip. The client may remove old stored video content to make room for new content and announces this change to the server using a WINDOW_SIZE frame. When the server receives the WINDOW_SIZE frame, the server may be able to compute which video segments were removed and then identify what the client is actually playing, as discussed above.
In what follows, step 904 is described in more details.
The client holds a list of all push promised video segments. This list is ordered according to the priority information found in the push promise frames. First, it is checked for potential frozen video issues. Based on an estimation of the available bandwidth and the ordered video segment list, transmission beginning and end times of each segment can be estimated.
Based on these times, it may be tested whether each video segment will be available at the time it should be used for video playing. If a promised video segment is expected to be delivered after its corresponding video playing use, its priority should be increased. Thus, the video segment is moved up in the push promised video segment list order. In order to compute the exact priority value, it is searched for the position in the video segment list that makes it possible to have the video segment delivered on time and that is the closest to the current video segment position. The priority is then set to a value between the priorities of the video segments in the list that are before and after the video segment new position.
Other factors may also be used by the client for changing the video segment priorities. For instance, if the client is expecting to do some chapter-switching, the client may actually increase the priority of all video segments that start the chapters, in particular the corresponding initialization segments.
According to embodiments, the client-side flow control comprises disabling the per-stream flow control and keeping only a per-connection flow control. The per-connection window size defines the maximum amount of video that a client may actually store at any given time. The client and the server may negotiate at initialization time and during the connection in order to decrease or increase this window size. If the server wants to push some HD content, the server may request the client to increase the window size. If the connection bandwidth is low, the server may need to anticipate well in advance the sending of HD content for a specific part of the video] in which case the buffer size should be made larger.
The order of transmission may be an important issue when the buffer has a single size. In particular, as the buffer is filled with data, the priority ordering becomes more and more important. An important constraint is that the video never freezes. As long as the buffer is largely empty, the server may push various video segments, like segments largely in advance in order to provide an efficient fast forward or chapter skipping. Once the buffer is almost fully filled, the video segments to push should be as close as possible to the video segments being played. This push behaviour may be done by the server if the server has accurate information concerning the client buffer. It may also be implemented by the client using the priority update mechanism.
In case of automated video switching, the flowchart of Figure 9 may be extended by detecting the push of a new MPD as part of the push promise check (step 903). When an MPD push is detected, the client may start receiving segments of a new video as part of step 908. The client must therefore identify the MPD related to the video data. Once the video playing is finished for a given MPD (step 902), the new MPD may be used for continuing video playing. The client may actually flush all video segments linked to the previous MPD.
With reference to Figure 10, the behaviour of a DASH-aware proxy is described. When receiving a segment pushed from a server, a proxy is not mandated to push it to the end-client. In case of DASH streaming though, it can be considered good practice (or default behaviour) to do so.
The proxy may be able to adjust the server and client behaviours, both in terms of priority processing as well as pushed data to be sent. A proxy may in fact handle independently the priorities with the client from the priorities with the server. In addition, the server may push more data than needed for a given client and the proxy may retrieve the additional pushed data to fulfil requests from other clients.
A server may push a video segment for several reasons. For example, a video segment may be pushed in case it is believed to be useful for the end-client. A video segment may also be pushed in case it is believed that the video segment can be used several times and that it is worth pushing it to proxies.
In the first case, proxies generally send the video segment to the client. Proxies may postpone its transmission in order to optimize the client or proxy network state, for instance the client radio state. An exemplary case may be the segment push for fast start video playing and bandwidth estimation, in which case data should be sent as fast as possible to the client. In case the server is interested in pushing data to proxies, proxies may not automatically send the video segment to the client, except if they have means to know that the video segment will be useful to the client. In order to make possible the identification of video segments that may not be sent to clients, a specific priority value may be used. Using a priority value makes it possible to have the proxy always check the priority value for optimizing the processing of the various frames that arrive.
Figure 10 comprises three flowcharts. One flowchart relates to the process of filtering pushed segments (steps 1000 to 1008). Another flowchart relates to the process performed when a segment is requested by a client while it is already promised to another client (steps 1010 to 1015). Another flowchart relates to the management of priority changes (steps 1020 to 1026).
The process of filtering pushed segments starts with the receipt (step 1000) of a pushed data event, typically when receiving a PUSH_PROMISE frame or a related DATA frame. The proxy checks whether the data are of high priority or not (step 1001). Data may be considered as of high priority if their priority value is much larger than priority values of other segments being transmitted. Data may also be considered as of high priority if its priority value has a special meaning, such as fast start or bandwidth estimation. If the data are of high priority, they are sent as quickly as possible to the client (step 1002).
The proxy then decides whether or not to store the data (steps 1003, 1004).
This decision may be made once when receiving the corresponding PUSH_PROMISE frame or the corresponding HEADERS frame that opens the pushed data stream. This decision may also be based on the proxy cache state, the envisioned use of the video, the popularity of the video source or other criteria. The proxy stores the video segment if the segment is pushed while being requested by one or more clients at the same time. The video segments may also be stored if segments are identified as fast start.
If the data are not of high priority, the proxy checks whether it is of low priority (step 1005). Data of low priority may be data for which transmission to the client may be skipped but that are considered by the server as interesting for network intermediaries like proxies. The proxy first decides whether or not to send the data to the client (step 1006). This decision may be made once when receiving the corresponding PUSH_PROMISE frame or the corresponding HEADERS frame that opens the pushed data stream. If it is decided so, the proxy sends the corresponding frame to the client (step 1002). The process then stops after deciding whether or not to store the data.
The priority value negotiated between the server and proxy may be different from the priority value negotiated between the client and proxy.
Therefore, in case the data is of usual priority (i.e. not of low priority and not of high priority), the proxy checks whether the segment priority value is managed by the proxy. As illustrated in Figure 10 (steps 1020 to 1026), the proxy uses the client-to-proxy value for scheduling the time when the data should be transmitted: the proxy holds a list of all to-be-transmitted video-related frames.
These frames are ordered according to the priority values before being sent following that order.
In the case the proxy is receiving a priority update frame (step 1010), the proxy identifies the related video segment (step 1011). If its priority value is not being managed by the proxy (step 1012) the proxy forwards the priority update frame to the server (step 1013). Otherwise, the proxy stores this new priority value and reorders the video segment transmission (step 1014) accordingly. In case a potential conflict appears, in particular in case the video segment delivery from the server is expected to be too late for the client needs, the proxy can then forward the priority value to the server.
Steps 1020 to 1026 relate to the case of a proxy that receives a request from a client to a video segment (step 1020) that is already promised by the server to another client (step 1021). Depending on the priority given to that request, the proxy computes the minimum proxy-to-server priority that would fulfil the client's request (step 1022). This computation is done by computing the proxy-to-server priority value that will ensure that the server-to-proxy delivery time is earlier than the proxy-to-client expected delivery time. The priority is changed if the computed priority is below the currently set priority (step 1023), in which case the proxy will send a priority update message to the server (step 1024) and the proxy will mark this video segment priority as managed by the proxy so that the proxy sends the video segment to its two clients at the best time for their needs. Similarly to this process, a proxy may receive several priority updates to the same segment from several clients] in which case the proxy may actually send the lowest priority value that satisfies all clients.
With reference to Figure 11 there is described an embodiment according to which a client receives a pushed data event whose priority value indicates that the server wants to use it for measuring bandwidth. Measuring bandwidth may be done using TCP/IP packets through active or passive measurements for computing round trip times. Based on round trip times, the available bandwidth may be computed as found in document Saubhasik et al. Bandwidth Estimation and Rate Control in Bit Vampire". This computation may potentially take into account effects of HTTP/2 control flow. By making notification that some data frames are used for bandwidth estimation possible, the bandwidth available without HTTP/2 control flow can be estimated.
The process starts with step 1100 during which a pushed data frame is received from the server. Next, it is checked whether the associated priority of the stream indicates that the server is measuring bandwidth (step 1101). In that case, the dedicated buffer is maximized (step 1102). Alternatively the stream flow control can be disabled. If the receiving node is a proxy (step 1103), it may forward the segment data. Otherwise, the client decides whether to store the segment (step 1104). The client stores the pushed segment (step 1105). In any case, the client sends an acknowledgement to the server in the form of a WINDOWS_UPDATE (step 1106) for the per-connection window. This acknowledgment will then be used by the server for estimating the connection bandwidth. In the case the client is a proxy, it forwards the pushed data (step 1108) as quickly as possible. When receiving an acknowledgment from the end-client, the proxy forwards it back to the server as well (steps 1109, 1110).
In order to estimate the available bandwidth, the server may use the round trip time of the sent data frame that is computed as the difference between the sending time of the data frame and the reception time of the acknowledgment message, the pairing between the two being based for instance on the data frame size which should be equal to the window size update. Round trip times can be computed from various data frames of one or more video segment. In order to increase accuracy, the data frames may have various sizes. Splitting a video segment into several DATA frames of different sizes can be performed by the server. The server only needs to ensure that the network layer will not split DATA frames into several TCP/IP packets (hence smaller DATA frames) or not buffer content to be sent and merge several DATA frames into a TCP/IP packet. Based on those measurements, standard techniques can be used for computing the available bandwidth (an example can be found in the above-mentioned document) that the server may use to actually decide which video representation to use.
With reference to Figure 12, there is described the case of an initial video playing. The server pushes data using the fast start priority. It is considered that the data probably have a low-bit rate and that the client will receive those data and send acknowledgments to the server so that the server can estimate the bandwidth and switch to the optimal representation. The client-side process is described in steps 1200 to 1207. The server-side process is described in steps 1210 to 1215.
The client process starts with a step 1200 of receipt of pushed data.
The client then checks whether the priority has the fast start value (step 1201).
In that case, the client typically maximizes the dedicated buffer (step 1202).
This maximization is performed when receiving the PUSH_PROMISE of the pushed data. The data are then stored (step 1203) and the client sends an acknowledgement to the server using the WINDOW_UPDATE frame (step 1204). The client then checks whether enough data are available to start playing the video (step 1205). If they are, the video playing starts (step 1206).
Otherwise the client waits for more data (step 1207), until enough data are available for starting playing the data.
The server process starts with a step 1211 of sending segment data frames with the fast start priority (step 1210). The server then receives acknowledgments (step 1211) that will allow computing the available bandwidth (step 1212). Once enough measurements are obtained, the server selects the optimal representation (step 1213) and starts pushing optimal representation segments (step 1214). The server decides when to switch representation. This has at least two benefits. First the server may know when the measurements are accurate enough and may switch from one resolution to another as soon as this is the case, while the client will need to handle some delay. Second, the server may decide to switch from one resolution to another at the time that is less disturbing for the user experience. Indeed, the server has the knowledge of the video content. In particular, the MPD may be augmented with information on the times at which resolution switch can be best envisioned.
The present invention relates to an enhanced streaming method where, at the server side, a request relating to first media data is received, from the client device; second media data to be sent to the client device without having been requested are identified; and then data relating to said first media data are transmitted to said client device, in response to said request, and at least one announcement message respectively identifying said second media data is prepared with a view to transmit the announcement message or messages to the client device.
At the client side, a request relating to first media data is transmitted to the server device; and data relating to said first media data are received from said server device, in response to said request, The enhanced streaming method reduces the mismatches between the servers decisions to push some media data and the client's needs for such data. As will be apparent from below, the server and the client share a push policy so that they both determine the same media data to be pushed from any media data requested by the client. A push policy defines how to determine the data to push, and may be seen as a rule for determining which resources linked to requested data are about to be pushed after the requested data are processed (after a GET request), and possibly how they are pushed (e.g. in which order). Usually, the linked resources are determined using one document, for instance a manifest file, such as an MPD file (in the DASH context for multimedia data), or an HTML document.
As a consequence, based on the shared push policy, the client is able to anticipate the server's behaviour to avoid, and more precisely to cancel, the transmission of useless media data from the server. Use of bandwidth in the communication network between the client and the server is thus reduced.
Furthermore, the number of HTTP requests and PUSH_PROMISE cancellation is reduced, which lowers the latency of the application in particular for low latency live video streaming.
According to the invention, the server may use a push policy shared with the client device for the server device to drive the identification and the transmission of the second non-requested media data to the client device. In particular, it may use a push policy shared with the client device and defining how to determine second media data, for the server device to determine the second non-requested media data to be sent to the client device.
Correspondingly, the client may use a push policy shared with the server device and defining how to determine second media data, for the client device to determine second media data to be sent by the server device without having been requested by the client device.
Figure 14a illustrates, using a flowchart, general steps of the invention at the client side, while Figure 14b illustrates, using a flowchart, general steps of the invention at the server side.
In comparison to the process described with reference to Figures Id and le, additional stages 1400 and 1402 make it possible for respectively the server and the client to determine the push strategy that is shared with the other, and thus to be used.
According to first embodiments, the shared push policy is an implicit push policy, meaning that the client and server do not exchange (explicit) policy data to tell the other what is the push policy to be shared. An implementation of the implicit approach for the shared push policy includes using the same algorithm, referred to as "second media data determining algorithm", at both the server device and the client device, the algorithm enabling the server device and the client device to determine the same second media data from the first media data requested.
For instance, the algorithm is predetermined either during the set-up of the client and server or relatively to a specific standard. A typical example of algorithm may consist in pushing the N resources following the requested resource in a parsing order of the manifest file, where N is a predetermined number, for instance 4.
Referring to the Figures, the steps 1400 and 1402 consist, in case of implicit push policy, in loading in memory the predetermined algorithm for identifying the resources to be pushed (step 1403 at the server side).
The client may efficiently use the so determined push policy for estimating the number of PUSH_PROMISE that are expected and for preparing cancellation messages for unwanted push data, for instance in step 1401.
For instance, this would result for the server in receiving, from the client device, a cancellation request requesting to cancel transmission of a part of the second non-requested media data so that the server device does not transmit the corresponding prepared announcement message. For its part, the client would thus send, to the server device, a cancellation request requesting to cancel transmission of a part of the second non-requested media data, in order to drive the server device not to transmit an announcement message identifying the part of the second non-requested media data. One may understand that such cancellation can occur before the announcement message is transmitted from the server device or received by the client device.
This approach may be useful for example, when the client decides to switch to another version of a medium. In such situation, it can decide to cancel the segments pushed for the previous version.
It may also be noted that, thanks to the knowledge of the resources to be pushed using the algorithm, the client can make a second request to the server in parallel, in order to retrieve subsequent resources without having to wait for the corresponding PUSH_PROMISE from the server. In case of DASH, this possibility for the client makes it possible to reduce the latency of the client while ensuring that the second request will not interfere with a PUSH_PROMISE that will be received later.
The client may also request other resources that it needs, if it determines from the results of the algorithm that these other needed resources are not about to be pushed.
According to second embodiments, the shared push policy is defined in exchanges between the client and server, either explicitly by defining the whole rule (i.e. algorithm or parameters of the algorithm), or using references to push policies predefined at both sides. This requires for the server to first determine a push policy information describing the push policy of the server.
Then the push policy information is transmitted to the client to share the push policy with the client. Correspondingly, the client thus receives, from the server device, push policy information describing the shared push policy.
One advantage of the explicit approach relies on the fact that a different push policy could be used by the server for each client or for each multimedia presentation (e.g. each MPD), in order to better meet their processing characteristics. Figure iSa illustrates, using a flowchart, step 1400 of determining the shared push policy at the client side based on the explicit approach, while Figure 15b illustrates, using a flowchart, step 1402 of determining the push policy at the server side when the explicit approach is used.
As shown in Figure 15b, the server generates at step 1504 a message to declare the push policy and then sends it to the client at step 1505, in order to share it. The information describing the push policy in the declaration message is referred to as push policy information.
Figures 16 to 18 described below give exemplary details on how the push policy is declared and transmitted to the client.
The resources to be pushed using the push policy as determined at step 1402 are then identified at step 1403 with the selection algorithm (or second media data determining algorithm) defined in the push policy declaration message generated in step 1504.
On the client side as shown in Figure 15a, the client is able to pre-identify the resources to be pushed for a given resource request by applying the same selection algorithm. This makes it possible for the client to predetermine the data that will be pushed by the server and thus ensure an efficient management of push data and a reduction in the number of GET requests if appropriate.
To apply the same selection algorithm, the client receives the push policy information describing the push policy applied by the server.
Various push policy declaration methods may be used.
In one embodiment, the push policy declaration is shared thanks to a JavaScript program that takes, as input parameters, a request R and a DOM tree corresponding to the document containing the resources to be pushed (typically the manifest file for DASH) and that outputs an ordered list of resources to be pushed. In this embodiment, the push policy information includes a JavaScript program embedded in a web page transmitted from the server device to the client device.
In other embodiments, the push policy is described within the manifest file. That is the push policy information describing the shared push policy is inserted in a description file that is transmitted from the server device to the client device using the shared push policy. The description file contains description information that concerns media data including the first media data, and is used by both sides to determine the second non-requested media data to be pushed.
In DASH, the description file is for instance the MAD file. The description below is mainly based on DASH and MAD files. However, the same approach applies to other manifest-based streaming methods like Smooth Streaming or HTTP Live Streaming.
According to specific embodiments, the push policy information includes a first push attribute defining an amount of second non-requested media data to be identified in a description file. This makes it possible to specify the number of segments to be pushed after one request R is received from the client.
This is illustrated by Figure 16 which shows an MAD document in which a PushPolicy node 1600 is used to specify the push policy applied by the server.
In this example, the PushPolicy node 1600 includes a push attribute, namely "Segmentldx", to declare the number of segments to be pushed after a GET request is received. For instance, if the client requests segment 1601 in its GET request, it will receive, as a response, a PUSH_PROMISE frame for the next two segments in parsing order of the MPD document. In this example, the first push attribute identifies the second non-requested media data relatively to the first media data requested within the description file. More generally, a predetermined number of K segments to be pushed is used to define the push policy value. Consequently, for each segment requested by the client, the server will push the K next segments.
Whereas the example 1600 of Figure 16 shows a single push attribute, there may be several push attributes. Each push attribute may represent a constraint on nodes of the DOM (Document Object Model) tree representing the manifest for selecting the segments to be pushed. Referring to the previous example of Figure 4b, the push policy node 1600 can refer to media data described in the description file (MPD file) using media data attributes (the MPD elements and/or attributes) including a period attribute "Periodldx" that refers to a Period element to which the media data belong, an adaptation attribute "AdaptationSetldx" that refers to an AdaptationSet element of the media data, a representation attribute "Representationldx" that refers to a Representation element, i.e. an encoding version (specific codec, resolution or bitrate...) of the media data and a segment attribute "Segmentldx" that refers to a segment in a given Representation.
Based on these existing media data attributes, the push policy information may include at least a second push attribute defining a constraint on the media data attribute or attributes, for identifying the second non-requested media data.
For instance, a push attribute may be related to the Periodldx attribute to specify a constraint on the period for selecting the segment to push; another one may be related to the AdaptationSetldx attribute to specify a constraint on the adaptation; another one may be related to the Representationldx attribute to specify a constraint on the representation; in addition to the above first push attribute related to the Segmentldx attribute.
When a push attribute is not present or void, the related media data attribute must be considered as unconstrained.
The value of the push attributes may use the following syntax: push attribute= [operator] operand where operator" is optional and takes the value + or -to define the segments to be pushed relatively (+" means after and "-" means prior to) to the requested segment, and where "operand" is either an integer value superior or equal to 0 or as wildcard parameter.
Figure 17 illustrates, using a flowchart, steps for identifying and marking some segments as ready to be pushed according to the shared push policy "PushPolicy". This flowchart illustrates step 1403.
First, the server identifies at step 1700 the segment requested in the manifest file. The request includes an identified "reqSegldx" of this segment.
For each node type in the manifest file MPD, an index value is attributed to each node. The value is incremented for each Node in the order of appearance in the manifest file.
Next, the indexes of the Period, AdaptationSet, Representation and SegmentURL that correspond to the requested segment (i.e. the segment specified in the GET request) are retrieved by parsing the whole MPD until the requested segment is reached.
The operator and operand values of the push attributes defined in the push policy are used to identify in which nodes the segments to be pushed are defined (except for the Segmentldx attribute that defines the amount of segments to be pushed, when associated with "+" or "-" operator).
When no operator is specified, the operand value identifies the index of the Node within which the data to be pushed have to be retrieved. For example, where the first push attribute "Segmentldx" has no operator, it is an identifier, within the description file, of a specific segment to be pushed. In one alternative, when the operator is not specified the operand value may identify range values, for example "Segmentldx=25" would return segments with index equal to 2, 3, 4 and 5.
Otherwise (an operator is specified), the operand value represents an offset value (named "idxOffset") to apply to the index of the requested segment ("reqSegldx" obtained in step 1700). In such case, the segments to be pushed should be in the Nodes with indexes comprised in the [reqsegldx, reqSegldx+idxOffset] range if the operator is "i-" and in [regSegldx-idxOffset, regSegldx] if the operator is "! The use of an operator makes it possible to define a media data attribute or attributes of the second non-requested media data relatively to the corresponding media data attribute or attributes of the first
media data within the description file.
For instance, let's consider the following push policies: 1. .cPushPolicy Representationldx="1" Segmentldx="2"/> 2. cPushPolicy Periodldx="-'-l" Segmentldx="-'-2"/> 3. cPushPolicy Periodldx="+O" Segmentldx="+2"/> PushPolicy #1 specifies that the server will push the segment of index 2 in the representation node preceding the representation node of the requested segment.
With PushPolicy #2, the server will push the two segments following the requested segment, either in current period or in the following. For example, when requesting segment 2401 on Figure 24, segments 2405 and 2402 would be pushed.
PushPolicy #3 is very similar to the PushPolicy#2, the main difference is when the requested segment is the penultimate of the Period. For example, when requesting 2401, only the last segment 2405 in the current Period (instead of two segments) would be pushed. With PushPolicy #3, Periodldx restricts the segment search to the requested segment's Period node and thus only the last segment of the Period is pushed (because the segment requested is the penultimate segment in the Period). On the contrary, with PushPolicy #2 the segments can be retrieved from the next period.
In an alternative or as an optional value, the value of the operand may be also * (wildcard meaning) which means that any segment should be pushed. When it is associated with operator + (respectively "-"), it means that all subsequent (resp. preceding) segments to the requested one should be pushed.
This alternative allows the client to send only a single HTTP request to retrieve all segments of one Period for instance with the following PushPolicy: <PushPolicy Periodldx="+O" Segmentldx=+*>.
In these examples, the use of the Segmentldx attribute to identify the second media data (to be pushed) relatively to the requested first media data requires that the second media data are adjacent to the first media data. In an embodiment, the Segmentldx attribute may include an offset (in addition to the operand) to apply to the index of the requested segment. This shifts the index of a reference segment from which a specified amount of segments has to be pushed. As an example, the syntax of the Segmentldx attribute may be: push attribute:[operator]operand[ offset] where offset" is a positive or negative integer different from 0 to apply to the requested segment index. In such case the search range is [reqSegldx+offset, reqSegldx+idxOffset+offset] when the operator is + and [reqSegldx-idxOffset +offset, reqSegldx +offset] when the operator is The syntax of the push policy can also contain conditions like (non restrictive) a maximum size of data or a time in the presentation being pushed, respectively. For instance: cPushPolicy Segmentldx=f*[sizec500O00]> defines a push policy to push no more than 500 kilo bytes of segments data.
<PushPolicy Segmentldx=÷*[timecO:O1:3O] > defines a push policy to push no more than 1 minute and 30 seconds of next segments data.
While the above examples show how to declare the push policy that determines which segments have to be pushed, there may be a need to also specify in which preferred order the segments will be pushed. This information should also be shared between the client and the server.
As an example, the declaration of an order of transmission of the pushed segments as described above with reference to Figures 7 to 12 could apply.
In one alternative embodiment for an order of transmission of pushed segments, the description information in the description file includes priority attributes associated with the media data, one priority attribute (for example "priorityldx") for each media data, and an order of transmission of the second media data is based on the associated priority attributes. Thanks to the transmission of the description file, the client is also aware of the values taken by these priority attributes, and is thus able to determine the intended order of transmission.
As shown in the example of Figure 16, each segment (for instance identified by one SegmentURL Node) described in the manifest file includes a priorityldx attribute (1604) which specifies the push order of the segment. In the example of Figure 16, segment 1603 is pushed before segment 1602. These priorities are computed during the media segments preparation at the server side. Different priority values can be used: a relative priority value in a given Representation (as on Figure 16) or an absolute priority value either as a 32 bit number with 4 most significant bits for the Period priority, 4 next MSBs for the AdaptationSet priority value, next 8 bits for the Representation priority value and least 16 significant bits for the segment priority. An alternative way of signalling the absolute priority value is to use a comma separated list of priority values, one for each of the above cited levels, for example: priorityldx='l, 1, 2, 1' to define successively the Period priority, the AdaptationSet priority, the Representation priority and then the segment priority. First embodiment with 32 bit value would give (in binary): priorityldx='OOOl 0001000000100000000000000001'.
A main advantage of using priorityldx values is to make it possible to define a priority order between segments from different Representation (typically an associate representation such as an alternate view of the video). It is useful when the push policy consists in sending segments of different Representation sets. A typical use case is for streaming of layered video (a layer being a view in multi-view or a scalability layer in scalable video) where segments from one layer would be interleaved with segments with one or more other layers.
Back to Figure 17, based on the push policy as defined in the MPD file, the server determines at step 1701 the number of segments to be pushed.
This number is directly inferred from the Segmentldx attribute value: if the operator is not used in the attribute value, this number is equal to 1; otherwise (operator is or "i-') the number is equal to the operand value and is assumed infinite when the operand is (but limited by other constraints and by the number of existing segments).
Next, an iterative process composed of steps 1702 to 1705 is applied by streaming server until the number of segments to push is reached (test 1702) to mark each of the segments to be pushed.
For each iteration, the server retrieves at step 1703 the list of segments defined in the MPD file that respect the PushPolicy constraints (Adaptation Set, Representation, Period and Segment constraints and optional conditions).
If the list of segment is empty or all its segments are already marked (test 1704) the process ends and the server starts sending (step 102 above) the response to client's request.
Otherwise, the first segment of the list is marked in step 1705 as to be pushed during steps 103 (PUSH_PROMISE) and 104 (promised segments).
In these MPD-based examples of declaring the push policy, one push policy is defined using a PushPolicy element (see 1600 in Figure 16).
It is recalled here that the description file describes the media data using a plurality of media data attribute levels, namely the Period, AdaptationSet and Representation elements defined above.
As a slight variant to the above, various shared push policies may be defined at various respective levels of the description file. This is to be able to define various push policies depending on the level (Adaptation Set, Representation, Period) concerned, so as to adapt the push strategy to the content of the media stream.
This is illustrated through Figure 23 in which the push policy is defined using for example the "SupplementalProperty" descriptor at the desired level, here at the Representation level.
Using a push policy per <MPD> level makes it possible to have a constant and same push strategy across media.
Using a push policy per <Period> level makes it possible to have a push strategy that can vary along time.
Using a push policy per <AdaptationSet> level makes it possible to have a media-adapted push strategy.
Using a push policy per <Representation> level makes it possible to have a push strategy that can be adapted to the media characteristics (bandwidth...).
In the example of Figure 23, the push policy specified at the Representation level is configured to push more segments for low bit rate video segments (2300) than for high bitrate video (2301), so to avoid using too much bandwidth with push data.
Note that the explanations above with respect to the syntax of the push attributes may also be applied to this slight variant. In particular the push policy can be signalled in a manifest as a new element (as in Figure 16), or using an existing descriptor with a new schemeldUri (as in Figure 23) or as a new descriptor (not represented) or any means compliant with MPD schema or MPD schema extension points.
The MPD could also contain a list of alternative PUSH policies each one having a unique identifier (see below for more explanation about the list).
In other alternative embodiments, a push policy may define that segments for complementary Representations are systematically pushed, for example using the following syntax: <push_policy Segments='+complementary'> or value='complementary' when using a DASH descriptor.
In case of layered video, this means that for a requested video segment, each segment at the same time from all Representations declared as complementary Representations (typically through a dependencyld attribute in the MPD signalling dependencies between different Representations) would also be pushed.
Another push policy could also consist in pushing segments from associated Representations, signalled either with the @associationld attribute or with a role='supplementary'.
In case of fully server driven streaming, the push policy could provide information on whether the server behaviour has to be aggressive' (or optimistic') or conservative', i.e. respectively trying to push segments of higher quality or trying to push at the same quality level (preserving bandwidth).
In other embodiments, the push policy is transmitted in a dedicated HTTP header, referred to as "push-policy" header. That is the push policy information describing the shared push policy is embedded in a header of an HTTP frame transmitted from the server device to the client device.
These embodiments make it possible to vary the push policy over time, since they are no longer dependent on the transmission of the MPD file as above and the client and the server exchange using HTTP/2 protocol.
Figure 18 is an example of communication between a server and client with a push policy transmitted in a HTTP "push-policy" header (the header name "push-policy" is just an example).
The push-policy header includes a list of push attributes, each defining a constraint on the data to be pushed. In particular, the syntax of the PushPolicy described previously may be transcribed to HTTP header syntax.
In Figure 18a, the server in response to a MPD request from the client (arrow 1800) transmits (step 1801) the push-policy in an HTTP header accompanying the MPD sent, to share the push policy.
For instance, the push policy specifies that the segment following a requested segment will be pushed. As consequence, when the client requests (arrow 1802) segment Datal.1, the server sends (arrow 1803) PUSH PROMISE for segment Data2.1 and then the data of segment Datal.1 (arrow 1804).
Any syntax could be used to define which data is about to be transmitted for subsequent segment request: a MPD-specific one or a more abstract one based on DOM tree node traversal.
In a specific embodiment dedicated to dynamic shared push policies, the client may request a specific push policy, i.e. may update the shared push policy, for example if the current shared push policy is not adapted to its needs or may be improved.
That means that the client device sends push policy update information embedded in a header of an HTTP frame to the server device.
Correspondingly, the server device receives push policy update information embedded in a header of an HTTP frame from the client device. The server device may thus update accordingly the shared push policy before determining non-requested media data from other media data requested by the client device (for instance, for the next request).
In an embodiment, the push policy request from the client is conveyed in an HTTP header or request named push-policy-request" (name here is just an example).
Figure 18b illustrates client-server exemplary exchanges when the client requests a new push policy.
The beginning of the exchanges is the same as in Figure 18a.
After receiving segment Data2.1, the client identifies that the current push policy should be modified, for instance because the available bandwidth is stable enough to let the server push more segments in response to a segment request.
As a consequence, the client sends at step 1805 a push-policy-request that asks the server to push more segments (3 instead of 1) for each new request.
The server positively answers this push policy request with an OK 200, at step 1806. This positive answer means the server will use the new push-policy described in push-policy-request for any new request from the same client.
If the server does not want to change its push-policy, it returns an error code answer to notify the client that the push policy request is rejected.
Next, when the client requests at step 1807 a next segment Data3.1, the server answers at step 1808 with PUSH PROMISE for the next three segments Data 4.1, Data5.1 and Data 6.1.
Figure 21 illustrates, using a flowchart, steps of the process at the server side when using the HTTP request for sharing push policy, while Figure 22 illustrates, using a flowchart, steps of the process at the client side when using the HTTP request for sharing push policy.
In comparison to the process of the Figure 14, the server includes new processing steps (2100 to 2105) to handle a push policy request from the client and also to send the initial push policy and updates thereof If the request received by the server is a push policy request from the client (test 2100), the server first parses the push policy request at step 2101 in order to extract the constraints of data push proposed by the client.
During this step, the server may decide to follow the push policy requested by the client. In such case the server updates its internal push policy (step 2102) and sends an OK 200 response to the client at step 2103, in order to validate the proposed push policy.
Otherwise, when the server discards the push policy (for instance because the proposed policy is too costly in terms of resources or cannot be applied), step 2102 does not modify the internal push policy at the server and an error code is transmitted to the client at step 2103.
According to a specific embodiment, the server may in addition update its push policy independently of the client's requests. In such case, the server determines a push policy during step 1402 and may either decide to change its characteristics (for instance by analysing the requests performed by the client and network characteristics) or see that the determined push policy is different from the current one. In such a situation, the server has to share the new push policy with the client if the latter is not already aware of it (test 2104), in which case the new push policy is transmitted in an HTTP header in step 2105.
The corresponding process at the client side is explained with reference to Figure 22. As for the server processing, new processing steps (2200 to 2204) are added in comparison to the process of Figure 14 in order to process push policy messages and perform push policy requests.
After having determined the current shared push policy (i.e. the push policy of the server) in step 1400, the client may desire a new push policy, for instance to reduce the number of HTTP requests to send for retrieving segments of a media stream. Thus, when a new push policy is required by the client (test 2200), the client sends at step 2201 an HTTP request with a "push-policy-request" as described previously.
The response to this request is processed in step 2204 in which the client checks whether or not the server validates the request by returning an OK response or otherwise an error code.
If the server returns an OK 200 response, the current push policy determined in step 1400 is replaced by the requested policy. Otherwise it is unchanged.
In addition to the process of Figure 14, when the client receives a frame with a new push policy from the server (test 2202), the push policy is parsed and stored (step 2203) in memory in order to be retrieved in a next occurrence of step 1400.
It has to be noted that when the push-policy request is in a frame that also includes other data (e.g. media data), the other data are processed through steps 109-1 11-113-115.
Whereas the above HTTP-based examples use an HTTP request to fully define the push policy to be applied, one specific embodiment may rely on having a set of the same predefined push policies defined at both client and server sides, and each having a unique identifier. In this case, the HTTP request is only used to specify an identifier of the push policy to be used from amongst the set. This specific embodiment reduces the size of the HTTP request.
In one embodiment, the push policy request is sent as an additional HEADER of one of the HTTP requests used for requesting one of the server resource: typically, the push policy request is sent in an "Accept-Push-Policy" HTTP header in the GET request for the MPD file.
In another embodiment, the client specifies several "Accept-Push-Policy" in one HTJP request to indicate the list of push policies supported (or required) by the client. In response to the HTTP request the server may either choose one of the push policy in the proposed list and then specify the push policy in the HTTP response or respond by a new push policy if none is supported.
In yet another embodiment, the push policy request is sent in a dedicated HTTP request independent of any of the resource known by the server. For example, a GET (or POST) request is formed with an URL corresponding to none of the resource of the web page, for instance http://server/push_policy and also with at least one Accept-Push-Policy header.
In yet another specific embodiment, the set of alternative push policies may be defined in the MPD file exchanged between the server and the client, each having a unique identifier. One of the push policies may be marked as default push policy selected by the server. The client may specify which push policy should be used by sending a new push policy request which includes the identifier of the push policy to be used in replacement of the default push policy.
In one embodiment, a specific push policy is defined to indicate which segment will be pushed just after the request to the MPD document for fast start.
In a hybrid approach, the push policy information describing the shared push policy is defined by a first push policy part and a second push policy part, the first push policy part being inserted in the description file (MPD), and the second push policy part being embedded in a header of an HTTP frame transmitted from the server device to the client device.
For instance, the MPD may define the push policy with template arguments that are then defined (or even overloaded) by the server thanks to push-policy HTTP request. As an example, the push policy defined in the MPD file may be: cPushPolicy Segmentldx="parameter"/> and the value of the variable "parameter" may be defined in a push-policy HTTP request. In this example, the second push policy part comprises (only) one or more values for one or more associated variables defined in the first push policy part.
Using the push-policy-identifier-based approach described above, the description file may include the description of a plurality of candidate push policies, and the second push policy part may thus comprise an identifier of a candidate push policy from said plurality, which identified candidate push policy thereby forming the first push policy part.
In another embodiment to declare the push policy to the client, the push policy relies on the <Role> descriptor defined in the MPD to indicate in which representation the push data will be selected. Typically, the push policy may specify that the push strategy will use a segment in a Representation with an "alternate' or "supplementary' role value.
In another embodiment, the document of resources, for example a streaming manifest or an HTML page, is transformed into a priority tree that is browsed to determine the resources to be pushed after a GET request is received. The navigation within the priority tree may be performed thanks to an XPath request. In this approach, the push policy information includes an XPath expression to be evaluated on a tree representation of the document of resources to identify the second non-requested media data.
For example, in the streaming manifest a "following[nameQ="SegmentURL"][2]" XPath expression could be used to select, as segments to be pushed, the next two segments following the segment requested by the client in the GET request. Also for the chapter-switching use case, a "((following[name="Period"]//SegmentU RL)[2])" XPath expression makes it possible to select two first segments of the following Period for preloading the first two segments of each chapter. For example, when the client requests segment 2401 in the MPD file of Figure 24, segments 2402 and 2403 of the following Period are also transmitted by the server as pushed data.
In addition, the priority tree could be first re-ordered for example using an XSLT instruction in order to simplify the XPath expression writing for advanced push policy rules. The XSLT instruction makes it possible to reorganise the tree before applying a push policy. The XPath expressions is preferably transmitted to the client, for instance in one HTTP header and the XSLT stylesheet is defined in a web page. This applies in particular to HTML documents for example to group all pictures declared in the document, all CSS resources as consecutive nodes at the same level of the DOM tree.
For instance, the tree 2501 of Figure 25 represents an HTML page with different resources of different types: hashed nodes (2511-2514) correspond to image resources and nodes in plain color (2521-2524) are scripted resources (CSS or Javascript). The tree 2502 is an example of XSLT transformation result to group the resources by type (images in 2530 and scripted resources in 2540). A simple XPath expression could thus be defined to indicate that some resources for a given type will be pushed once the first resource of this given type is requested.
In all the embodiments described above, it is very likely that for each client request the server replies with several PUSH PROMISE if the push policy requires several segments to be pushed.
For instance, MPD 1900 of Figure 19 has a push policy which indicates that the three segments following a requested segment will be pushed (see cPushPolicy> element). Consequently, if the client requests the initialization segment with a GET request for media 1901 with a byte range equal to 0-999, the server will send the three PUSH_PROMISE messages 1902 during step 103.
In one embodiment, if the identified second media data comprise a plurality of media segments, each requiring an announcement message (i.e. a PUSH_PROMISE), the corresponding plurality of announcement messages may be merged into a single announcement message to be transmitted to the client device.
To achieve this situation, as shown in Figure 20, the processing at the server preferably includes a pre-processing step 2000 just before sending the push promises in step 103, compared to the general process of Figure 14.
The pre-processing step seeks to perform the above-mentioned merger of announcement messages.
When the push promises include byte range requests as in 1902, the list of push promises 1902 is browsed to generate a reduced set of push promises 1903 that contains consecutive byte range addresses. Next, each set of push promises 1902 is replaced by a reduced set of push promises 1903 with a contiguous byte range equal to the concatenation of the byte ranges in the push promises set or by a single push promise with a list of non-contiguous byte ranges for example 1905.
For instance, the three push promises 1902 are replaced by the single push promise 1903 shown in Figure 19.
This approach of merging push promises makes it possible for the client to cancel the send of push data in a simpler way and at lower bandwidth and processing costs. This is because the client just has to close a single stream for the single push promise instead of closing several streams for each of the non-merged push promises.
In an alternative, even if the push promises have disjoint byte range intervals, all push promises may be replaced by a list of byte ranges (where consecutive byte range intervals have been concatenated).
In addition, if the push promises do not include byte range intervals but rather different SegmentlJRL values, the push promises may also be concatenated to generate a single push promise message as follows: the method of the generated push promise message is defined as MGET (for multiple GET) and the path field is a list of segment URLs as represented in 1904. Similar to the previous embodiment, the client has to close the single stream corresponding to the generated push promise to cancel the push of all segments.
Note that the server may include END_SEGMENT flags at the end of each segment in the data then transmitted, in order to ensure that the client is able to parse and identify each pushed segment.
In addition, the SETTINGS frame of HTTP/2 is extended to include a new SETTINGS_ENABLE_GROUP_PUSH_PROMISE parameter that makes it possible to indicate if the grouping of push promises is allowed for the streaming session.
Figure 13 is a schematic illustration of a device according to embodiments. The device may be a server, a client or a proxy. The device comprises a RAM memory 1302 which may be used as a working memory for a control unit 1301 configured for implementing a method according to embodiments. For example, the control unit may be configured to execute instructions of a computer program loaded from a ROM memory 1303. The program may also be loaded from a hard drive 1306. For example, the computer program is designed based on the flowcharts of figures 8-12 and the
above description.
The device also comprises a network interface 1304 which may be a single network interface, or comprise a set of network interfaces (for instance several wireless interfaces, or several types of wired or wireless interfaces). The device may comprise a user interface 1305 for displaying information to a user and for receiving inputs from the user.
The device may also comprise an input/output module 1307 for receiving and/or sending data from/to external devices.
While the invention has been illustrated and described in detail in the drawings and foregoing description, such illustration and description are to be considered illustrative or exemplary and not restrictive, the invention being not restricted to the disclosed embodiment. Other variations to the disclosed embodiment can be understood and effected by those skilled in the art in practicing the claimed invention, from a study of the drawings, the disclosure and the appended claims.
In the claims, the word "comprising" does not exclude other elements or steps, and the indefinite article "a" or "an" does not exclude a plurality. A single processor or other unit may fulfill the functions of several items recited in the claims. The mere fact that different features are recited in mutually different dependent claims does not indicate that a combination of these features cannot be advantageously used. Any reference signs in the claims should not be construed as limiting the scope of the invention.

Claims (36)

  1. CLAIMS1. A method of streaming media data by a server device to a client device, the method comprising the following steps: -receiving, from the client device, a request relating to first media data, -identifying second media data to be sent to the client device without having been requested, and -in response to said request, transmitting to said client device, data relating to said first media data, and preparing at least one announcement message respectively identifying said second media data with a view to transmit the announcement message or messages to the client device, and wherein the method further comprises the step of using a push policy shared with the client device and defining how to determine second media data, for the server device to determine the second non-requested media data to be sent to the client device.
  2. 2. A method of accessing by a client device, media data streamed by a server device, the method comprising the following steps: -transmitting, to the server device, a request relating to first media data, -receiving from said server device, in response to said request, data relating to said first media data, wherein the method further comprises the step of using a push policy shared with the server device and defining how to determine second media data, for the client device to determine second media data to be sent by the server device without having been requested by the client device.
  3. 3. The method of Claim 1, further comprising: determining by the server device a push policy, and transmitting, from the server device to the client device, push policy information describing the determined push policy to share the push policy with the client device.
  4. 4. The method of Claim 2, further comprising receiving, from the server device, push policy information describing the shared push policy.
  5. 5. The method of Claim 3 or 4, wherein the push policy information describing the shared push policy is inserted in a description file that is transmitted from the server device to the client device, the description file containing description information that concerns media data including the first media data, the method further comprising determining the second non-requested media data based on said description file using the shared push policy.
  6. 6. The method of Claim 5, wherein the description file describes the media data using a plurality of media data attribute levels, and various shared push policies are defined at various respective levels of the description file.
  7. 7. The method of Claim 3 or 4, wherein the push policy information describing the shared push policy is embedded in a header of an HTTP frame transmitted from the server device to the client device.
  8. 8. The method of Claim 7, further comprising, at the server device, receiving push policy update information embedded in a header of an HTTP frame from the client device, and updating accordingly the shared push policy before determining non-requested media data from other media data requested by the client device.
  9. 9. The method of Claim 7, further comprising, at the client device, sending push policy update information embedded in a header of an HTTP frame to the server device.
  10. 10. The method of Claim 3 or 4, wherein the push policy information describing the shared push policy is defined by a first push policy part and a second push policy part, the first push policy part being inserted in a description file that is transmitted from the server device to the client device, the description file containing description information that concerns media data including the first media data, the method further comprising determining the second non-requested media data based on said description file using the shared push po1 icy, and the second push policy part being embedded in a header of an HTTP frame transmitted from the server device to the client device.
  11. 11. The method of Claim 10, wherein the second push policy part comprises one or more values for one or more associated variables defined in the first push policy part.
  12. 12. The method of Claim 10, wherein the description file includes the description of a plurality of candidate push policies, and the second push policy part comprises an identifier of a candidate push policy from said plurality, which identified candidate push policy thereby forming the first push policy part.
  13. 13. The method of Claim 3 or 4, wherein the push policy information includes a JavaScript program embedded in a web page transmitted from the server device to the client device.
  14. 14. The method of Claim 3 or 4, wherein the method further comprises determining the second non-requested media data based on a structured document using the shared push policy, the structured document containing description information that concerns media data including the first media data, and the push policy information includes an XPath expression to be evaluated on a tree representation of the structured document to identify the second non-requested media data.
  15. 15. The method of any of Claims 5 to 13, wherein the push policy information includes a first push attribute defining an amount of second non-requested media data to be identified in a description file, the description file containing description information that concerns media data including the first media data, and the method further comprising determining the second non-requested media data based on said description file using the shared push policy.
  16. 16. The method of Claim 15, wherein the first push attribute identifies the second non-requested media data relatively to the first media datarequested within the description file.
  17. 17. The method of Claim 15, wherein the first push attribute is an identifier of specific media data within the description file.
  18. 18. The method of Claim 15, wherein the description information in the description file describes media data according to at least one media data attribute from amongst a period attribute defining a time period to which the media data belong, an adaptation attribute defining a media type of the media data, a representation attribute defining an encoding version of the media data and a segment attribute defining, and the push policy information includes at least a second push attribute defining a constraint on the media data attribute or attributes, for identifying the second non-requested media data.
  19. 19. The method of Claim 18, wherein the push attribute or attributes define a media data attribute or attributes of the second non-requested media data relatively to the corresponding media data attribute or attributes of the firstmedia data within the description file.
  20. 20. The method of Claim 18, wherein the push attribute or attributes identify a node in the description file in which the second non-requested media data have to be retrieved.
  21. 21. The method of Claim 15, wherein the description information in the description file includes priority attributes associated with the media data, one priority attribute for each media data, and an order of transmission of the second media data is based on the associated priority attributes.
  22. 22. The method of Claim 1 or 2, wherein the shared push policy identifies the second media data from the first media data requested.
  23. 23. The method of Claim 1 or 2, wherein the shared push policy is implemented using the same second media data determining algorithm at both the server device and the client device, the algorithm enabling the server device and the client device to determine the same second media data from the first media data requested.
  24. 24. The method of Claim 1, wherein if the identified second media data comprise a plurality of media segments, each requiring an announcement message, the method further comprises merging the corresponding plurality of announcement messages into a single announcement message to be transmitted to the client device.
  25. 25. The method of Claim 1, further comprising receiving, from the client device, a cancellation request requesting to cancel transmission of a part of the second non-requested media data so that the server device does not transmit the corresponding prepared announcement message.
  26. 26. The method of Claim 2, further comprising sending, to the server device, a cancellation request requesting to cancel transmission of a part of the second non-requested media data, in order to drive the server device not to transmit an announcement message identifying the part of the second non-requested media data.
  27. 27. The method of Claim 2, wherein the second non-requested media data are determined by the client device independently of at least one announcement message prepared by the server device and identifying the second non-requested media data the server device intends to send to the client device without having been requested.
  28. 28. The method of Claim 1 or 2, wherein the same shared push policy is used for determining respective non-requested media data from a plurality of requests relating to respective first media data.
  29. 29. A method of streaming data between a server device and a client device comprising: -performing a method according to Claim 1 by a server device, and -performing a method according to Claim 2 by a client device.
  30. 30. A server device for streaming media data to a client device, the device comprising: -a receiver configured for receiving, from the client device, a request relating to first media data, -a control unit configured for identifying second media data to be sent to the client device without having been requested, and -a transmitter configured for, in response to said request, transmitting to said client device, data relating to said first media data, and preparing at least one announcement message respectively identifying said second media data with a view to transmit the announcement message or messages to the client device, and wherein the control unit is further configured for using a push policy shared with the client device and defining how to determine second media data, in order to determine the second non-requested media data to be sent to the client device.
  31. 31. A client device for accessing media data streamed by a server device, the device comprising: -a transmitter configured for transmitting, to the server device, a request relating to first media data, and -a receiver configured for receiving from said server device, in response to said request, data relating to said first media data, wherein the client device is configured for using a push policy shared with the server device and defining how to determine second media data, in order to determine second media data to be sent by the server device without having been requested by the client device.
  32. 32. A system comprising: -at least one server device according to Claim 30, and -at least one client device according to Claim 31.
  33. 33. A computer program product comprising instructions for implementing a method according to any one of Claims 1 to 29 when the program is loaded and executed by a programmable apparatus.
  34. 34. A non-transitory information storage means readable by a computer or a microprocessor storing instructions of a computer program, for implementing a method according to any one of Claims 1 to 29, when the program is loaded and executed by the computer or microprocessor.
  35. 35. A device substantially as hereinbefore described with reference to, and as shown in, Figure 13 of the accompanying drawings.
  36. 36. A method substantially as hereinbefore described with reference to, and as shown in, Figure 14a; Figures 14a and 15a; Figures 14a and 18a; Figures 14a and 18b; Figure 22; Figure 14b; Figures 14b and 15b; Figures 14b, 15b and 17; Figures 14b and 18a; Figures 14b and 18b; Figure 21; Figure 20 of the accompanying drawings.
GB1410540.7A 2013-07-12 2014-06-12 Adaptive data streaming method with push messages control Active GB2517060B (en)

Priority Applications (16)

Application Number Priority Date Filing Date Title
CN201811637213.XA CN109842613B (en) 2013-07-12 2014-07-11 Method and apparatus for providing and receiving media data and storage medium
US14/903,989 US10104190B2 (en) 2013-07-12 2014-07-11 Adaptive data streaming method with push messages control
CN201480050434.0A CN105532013B (en) 2013-07-12 2014-07-11 The adaptive data stream transmission method controlled using PUSH message
PCT/EP2014/064949 WO2015004276A2 (en) 2013-07-12 2014-07-11 Adaptive data streaming method with push messages control
KR1020187029384A KR102024311B1 (en) 2013-07-12 2014-07-11 Adaptive data streaming method with push messages control
KR1020197027171A KR102264477B1 (en) 2013-07-12 2014-07-11 Adaptive data streaming method with push messages control
RU2016104523A RU2625328C1 (en) 2013-07-12 2014-07-11 Method of adaptive data streaming with active delivery message management
KR1020167003172A KR101909160B1 (en) 2013-07-12 2014-07-11 Adaptive data streaming method with push messages control
EP14737279.1A EP3020208B1 (en) 2013-07-12 2014-07-11 Adaptive data streaming with push messages control
JP2016524842A JP6419173B2 (en) 2013-07-12 2014-07-11 An Adaptive Data Streaming Method with Push Message Control
RU2017123329A RU2659041C1 (en) 2013-07-12 2017-07-03 Method of adaptive flow transfer of data with management of active delivery communications
RU2018119682A RU2683595C1 (en) 2013-07-12 2018-05-29 Method of adaptive flow transfer of data with management of active delivery communications
US16/107,093 US10728353B2 (en) 2013-07-12 2018-08-21 Adaptive data streaming method with push messages control
JP2018192305A JP6632682B2 (en) 2013-07-12 2018-10-11 Media data providing device, providing method, control device, control method, and program
JP2019223944A JP6918910B2 (en) 2013-07-12 2019-12-11 Media data provision method, media data reception method, and program
US16/908,418 US11375031B2 (en) 2013-07-12 2020-06-22 Adaptive data streaming method with push messages control

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB1312561.2A GB2516116B (en) 2013-07-12 2013-07-12 Adaptive data streaming method with push messages control

Publications (3)

Publication Number Publication Date
GB201410540D0 GB201410540D0 (en) 2014-07-30
GB2517060A true GB2517060A (en) 2015-02-11
GB2517060B GB2517060B (en) 2018-03-28

Family

ID=49081228

Family Applications (2)

Application Number Title Priority Date Filing Date
GB1312561.2A Active GB2516116B (en) 2013-07-12 2013-07-12 Adaptive data streaming method with push messages control
GB1410540.7A Active GB2517060B (en) 2013-07-12 2014-06-12 Adaptive data streaming method with push messages control

Family Applications Before (1)

Application Number Title Priority Date Filing Date
GB1312561.2A Active GB2516116B (en) 2013-07-12 2013-07-12 Adaptive data streaming method with push messages control

Country Status (1)

Country Link
GB (2) GB2516116B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017114393A1 (en) * 2015-12-31 2017-07-06 中兴通讯股份有限公司 Http streaming media transmission method and device
US10084884B2 (en) 2015-07-31 2018-09-25 At&T Intellectual Property I, L.P. Facilitation of efficient web site page loading
US11665380B2 (en) 2017-12-20 2023-05-30 Telefonaktiebolaget Lm Ericsson (Publ) Methods and apparatus for receiving adaptive bit rate content and manifest for adaptive bit rate content

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3142335A1 (en) * 2015-09-11 2017-03-15 Thomson Licensing Method and cache for caching multimedia content
CN114363667B (en) * 2016-02-01 2024-01-02 松下电器(美国)知识产权公司 Client, server, receiving method and transmitting method
CN107171926B (en) * 2017-03-23 2020-08-14 深圳市口袋网络科技有限公司 Switching method and device of multi-platform message push service
US11659057B2 (en) * 2017-04-19 2023-05-23 Comcast Cable Communications, Llc Methods and systems for content delivery using server push
CN110730160B (en) * 2019-09-06 2022-02-22 北京钛星数安科技有限公司 Self-adaptive remote browsing data stream transmission method
US20230300409A1 (en) * 2022-03-21 2023-09-21 Amdocs Development Limited System, method, and computer program for online data transfer

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100161756A1 (en) * 2008-12-23 2010-06-24 At&T Mobility Ii Llc Streaming enhancements through pre-fetch background

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005012685A (en) * 2003-06-20 2005-01-13 Canon Inc Image processing method and image processing apparatus
US7953882B2 (en) * 2007-07-26 2011-05-31 Realnetworks, Inc. Adaptive variable fidelity media distribution system and method

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100161756A1 (en) * 2008-12-23 2010-06-24 At&T Mobility Ii Llc Streaming enhancements through pre-fetch background

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10084884B2 (en) 2015-07-31 2018-09-25 At&T Intellectual Property I, L.P. Facilitation of efficient web site page loading
US11356533B2 (en) 2015-07-31 2022-06-07 At&T Intellectual Property I, L.P. Facilitation of efficient web site page loading
WO2017114393A1 (en) * 2015-12-31 2017-07-06 中兴通讯股份有限公司 Http streaming media transmission method and device
CN106936808A (en) * 2015-12-31 2017-07-07 中兴通讯股份有限公司 HTTP flow-medium transmission methods and device
CN106936808B (en) * 2015-12-31 2021-01-26 中兴通讯股份有限公司 HTTP streaming media transmission method and device
US11665380B2 (en) 2017-12-20 2023-05-30 Telefonaktiebolaget Lm Ericsson (Publ) Methods and apparatus for receiving adaptive bit rate content and manifest for adaptive bit rate content

Also Published As

Publication number Publication date
GB201410540D0 (en) 2014-07-30
GB2516116A (en) 2015-01-14
GB2517060B (en) 2018-03-28
GB201312561D0 (en) 2013-08-28
GB2516116B (en) 2017-10-25

Similar Documents

Publication Publication Date Title
US11375031B2 (en) Adaptive data streaming method with push messages control
GB2538832B (en) Adaptive client-driven push of resources by a server device
JP2016531466A5 (en)
GB2517060A (en) Adaptive data streaming method with push messages control
CN112106375B (en) Differential media presentation description for video streaming
GB2575189A (en) Adaptive client-driven push of resources by a server device
GB2551674A (en) Adaptive data streaming method with push messages control