WO2016107192A1 - Adaptive streaming media processing method and apparatus

Adaptive streaming media processing method and apparatus

Info

Publication number
WO2016107192A1
Authority
WO
WIPO (PCT)
Prior art keywords
media
adaptive streaming
stream
time
transport stream
Prior art date
Application number
PCT/CN2015/087615
Other languages
English (en)
French (fr)
Inventor
黄成�
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Priority to EP15874878.0A (patent EP3242490B1)
Publication of WO2016107192A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236 Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/2362 Generation or processing of Service Information [SI]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302 Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307 Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H04N21/43076 Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of the same content streams on multiple devices, e.g. when family members are watching the same movie on different devices
    • H04N21/45 Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/462 Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
    • H04N21/4622 Retrieving content or additional data from different sources, e.g. from a broadcast channel and the Internet
    • H04N21/60 Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
    • H04N21/61 Network physical structure; Signal processing
    • H04N21/6106 Network physical structure; Signal processing specially adapted to the downstream path of the transmission network
    • H04N21/6112 Network physical structure; Signal processing specially adapted to the downstream path of the transmission network involving terrestrial transmission, e.g. DVB-T
    • H04N21/6125 Network physical structure; Signal processing specially adapted to the downstream path of the transmission network involving transmission via Internet
    • H04N21/63 Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/631 Multimode Transmission, e.g. transmitting basic layers and enhancement layers of the content over different transmission paths or transmitting with different error corrections, different keys or with different transmission protocols
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83 Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845 Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456 Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
    • H04N21/85 Assembly of content; Generation of multimedia applications
    • H04N21/854 Content authoring
    • H04N21/8547 Content authoring involving timestamps for synchronizing content

Definitions

  • the present invention relates to the field of communications, and in particular to an adaptive streaming media processing method and apparatus.
  • users access the communication services developed and deployed by service providers and/or content providers through fixed or mobile communication terminals such as televisions, desktop computers, tablet computers, and mobile phones. A customized communication client on the user terminal establishes a communication link with the service servers (content servers). Such links are usually built on wired access networks, such as the various types of digital subscriber line (xDSL) and fiber access (Fiber-to-the-x, FTTx), or on wireless communication networks such as Long Term Evolution (LTE).
  • Digital Video Broadcasting (DVB) is one of the world's mainstream digital TV standards. Based on the Moving Picture Experts Group 2 (MPEG-2) audio and video compression algorithms and multiplexing technology, the DVB standard provides a complete set of specifications for digital television systems over different media. It defines the transmission channels and modulation methods for media streams over cable (DVB-C), satellite (DVB-S), and terrestrial (DVB-T) networks, extends the system layer of the MPEG-2 standard, and enriches the network interfaces.
  • Dynamic Adaptive Streaming over HTTP (DASH) is an adaptive bit rate streaming technology that delivers high-quality streaming content over the Internet from conventional HyperText Transfer Protocol (HTTP) web servers.
  • DASH divides a video into many small file segments stored on HTTP servers, each containing a short interval of playback time. These small segments may belong to content lasting several hours, such as a movie (on demand) or a live sports event.
  • The video may be encoded into segments at multiple bit rates to cover different network bandwidth conditions.
  • Over-the-top (OTT) technology, represented by adaptive streaming media, offers the flexibility to consume video anytime and anywhere, while traditional broadcasting technologies such as IPTV, digital television, and satellite television offer high definition and assured quality on large screens. Integrating OTT technology with broadcasting technologies such as digital television can therefore provide a comprehensive solution suited to a variety of scenarios.
  • Second Screen applications are currently receiving attention: users rely on "TV buddies" such as smartphones and tablets to consume personalized media information associated with the programs broadcast on the TV screen. In such applications, content is delivered in a hybrid mode over broadcast networks (such as DVB-C digital TV networks) and OTT-mode Internet broadband networks.
  • the above hybrid delivery mode is characterized by independent content sources, heterogeneous transport networks, and differentiated playback terminals. Whether a time correlation can be established between a broadcast program, or primary media content (Primary Media), and a broadband program, or auxiliary media content (Ancillary Media), that is, whether synchronous playback can be guaranteed between media content from different sources, in different transmission formats, and on different playback terminals, has become the key to whether users accept such applications.
  • audio and video feature recognition is currently one research direction for solving media content synchronization in the above hybrid delivery mode. It uses the digital features of the audio and video media content as the time synchronization reference between media. Two techniques are used:
  • 1) digital watermarking technology. The digital watermark in the audio signal of the television program is captured, and the program channel identifier and timestamp information in the watermark are used to detect the playback progress of the program, so that the appropriate auxiliary media content can be obtained and played synchronously;
  • 2) digital fingerprint technology. Digital fingerprint information is extracted from the audio and video samples, and a fingerprint retrieval database is queried to obtain the media synchronization information.
  • PCR/PTS: Program Clock Reference / Presentation Time Stamp
  • IP Internet Protocol
  • the present invention provides an adaptive streaming media processing method and apparatus to at least solve the problem of media synchronous playback in a hybrid delivery mode of a broadcast network and a broadband network in the prior art.
  • an adaptive streaming media processing method is provided, including: acquiring location information and time information of adaptive streaming media carried in a transport stream; determining, according to the location information, a media segment in the adaptive streaming media that needs to be time-axis aligned with the transport stream; and determining, according to the time information, a presentation timestamp (PTS) of the media presentation time of the media segment in the adaptive streaming media relative to the time axis of the transport stream.
  • acquiring the location information of the adaptive streaming media carried in the transport stream includes: parsing the program map table (PMT) obtained from the transport stream; determining, from the PMT, the packet identifier (PID) of the Timeline and External Media Information (TEMI) PES stream, whose stream type value is a predetermined value; filtering the TEMI PES stream according to the PID; and extracting the location descriptor of the adaptive streaming media carried in the TEMI PES stream as the location information.
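The PMT lookup described above can be sketched as follows. This is a minimal illustration that assumes the standard MPEG-2 PMT section layout; the function name, the example section bytes, and the stream type value 0x27 used in the demo are assumptions for illustration only, since the text says only that the stream type is "a predetermined value". The CRC is not verified.

```python
def find_pids_by_stream_type(pmt_section: bytes, wanted_stream_type: int) -> list[int]:
    """Return the elementary PIDs in a PMT section whose stream_type matches."""
    if len(pmt_section) < 16 or pmt_section[0] != 0x02:
        raise ValueError("not a PMT section (table_id must be 0x02)")
    section_length = ((pmt_section[1] & 0x0F) << 8) | pmt_section[2]
    end = 3 + section_length - 4  # stop before the 4-byte CRC_32 (not checked here)
    if end > len(pmt_section):
        raise ValueError("truncated PMT section")
    program_info_length = ((pmt_section[10] & 0x0F) << 8) | pmt_section[11]
    pos = 12 + program_info_length  # first entry of the elementary stream loop
    pids = []
    while pos + 5 <= end:
        stream_type = pmt_section[pos]
        elementary_pid = ((pmt_section[pos + 1] & 0x1F) << 8) | pmt_section[pos + 2]
        es_info_length = ((pmt_section[pos + 3] & 0x0F) << 8) | pmt_section[pos + 4]
        if stream_type == wanted_stream_type:
            pids.append(elementary_pid)
        pos += 5 + es_info_length  # skip this entry and its descriptors
    return pids


# Hypothetical PMT with two streams: type 0x1B on PID 0x100, type 0x27 on PID 0x101.
EXAMPLE_PMT = bytes([
    0x02, 0xB0, 0x17,              # table_id, section_length = 23
    0x00, 0x01, 0xC1, 0x00, 0x00,  # program_number, version, section numbers
    0xE1, 0x00, 0xF0, 0x00,        # PCR_PID = 0x100, program_info_length = 0
    0x1B, 0xE1, 0x00, 0xF0, 0x00,  # stream_type 0x1B, PID 0x100
    0x27, 0xE1, 0x01, 0xF0, 0x00,  # stream_type 0x27 (assumed value), PID 0x101
    0x00, 0x00, 0x00, 0x00,        # CRC_32 placeholder
])
TEMI_PIDS = find_pids_by_stream_type(EXAMPLE_PMT, 0x27)
```

The returned PID would then be used to filter the TS packets that carry the TEMI PES stream.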
  • acquiring the time information of the adaptive streaming media carried in the transport stream includes: extracting the presentation timestamp (PTS) contained in the extension header field of the TEMI PES stream; and determining the time information of the adaptive streaming media according to the PTS value.
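As a rough sketch of the PTS extraction step, the following code reads the 33-bit PTS from a PES packet header. It assumes the standard MPEG-2 PES optional header layout, which is taken here to be what the text calls the extension header field; the helper names and the stream_id 0xBD in the demo header are illustrative assumptions.

```python
from typing import Optional


def encode_pts(pts: int) -> bytes:
    """Pack a 33-bit PTS into the 5-byte PES form ('0010' prefix, marker bits set)."""
    return bytes([
        0x21 | ((pts >> 29) & 0x0E),  # '0010', PTS[32..30], marker
        (pts >> 22) & 0xFF,           # PTS[29..22]
        0x01 | ((pts >> 14) & 0xFE),  # PTS[21..15], marker
        (pts >> 7) & 0xFF,            # PTS[14..7]
        0x01 | ((pts << 1) & 0xFE),   # PTS[6..0], marker
    ])


def parse_pes_pts(pes: bytes) -> Optional[int]:
    """Return the PTS in 90 kHz ticks, or None if the PES header carries no PTS."""
    if len(pes) < 9 or pes[0:3] != b"\x00\x00\x01":
        raise ValueError("not a PES packet start")
    pts_dts_flags = (pes[7] >> 6) & 0x03
    if not pts_dts_flags & 0x02:
        return None
    if len(pes) < 14:
        raise ValueError("truncated PES header")
    b = pes[9:14]
    return (
        (((b[0] >> 1) & 0x07) << 30)
        | (b[1] << 22)
        | ((b[2] >> 1) << 15)
        | (b[3] << 7)
        | (b[4] >> 1)
    )


def make_pes_header(pts: int) -> bytes:
    """Build a minimal PES header with a PTS only (demo helper, stream_id assumed)."""
    return b"\x00\x00\x01\xBD\x00\x00\x80\x80\x05" + encode_pts(pts)


DEMO_PTS = parse_pes_pts(make_pes_header(900_000))  # 10 seconds at 90 kHz
```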
  • acquiring the location information of the adaptive streaming media carried in the transport stream includes: acquiring, from the transport stream, the specific media component transport stream (TS) packets whose extension descriptor sub-tag value is a predetermined value; and extracting the adaptive streaming media location descriptor carried in the adaptation field of the specific media component TS packets as the location information.
  • acquiring the time information of the adaptive streaming media carried in the transport stream includes: extracting the PTS contained in the extension header field of the packetized elementary stream (PES) carried in the payload of the specific media component TS packets; and determining the time information of the adaptive streaming media according to the PTS value.
  • acquiring the time information of the adaptive streaming media carried in the transport stream includes: acquiring the first TS packet after the specific media component TS packet in the transport stream that has the same PID value as the specific media component TS packet and whose payload unit start indicator in the TS header field is a predetermined value; extracting the PTS contained in the PES extension header field carried in the payload of the acquired TS packet; and determining the time information of the adaptive streaming media according to the PTS value.
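The search for the next packet with a matching PID and a set payload unit start indicator can be sketched as below. This assumes standard 188-byte MPEG-2 TS packets and takes the "predetermined value" of the indicator to be 1 (the start of a PES packet); the function names, the packet builder, and the PIDs are illustrative assumptions.

```python
from typing import NamedTuple, Optional, Sequence

TS_PACKET_SIZE = 188
SYNC_BYTE = 0x47


class TsHeader(NamedTuple):
    pid: int
    payload_unit_start: bool
    has_adaptation_field: bool
    has_payload: bool


def parse_ts_header(packet: bytes) -> TsHeader:
    """Decode the fixed 4-byte header of one TS packet."""
    if len(packet) != TS_PACKET_SIZE or packet[0] != SYNC_BYTE:
        raise ValueError("not a 188-byte TS packet")
    pid = ((packet[1] & 0x1F) << 8) | packet[2]
    payload_unit_start = bool(packet[1] & 0x40)
    adaptation_field_control = (packet[3] >> 4) & 0x03
    return TsHeader(
        pid,
        payload_unit_start,
        bool(adaptation_field_control & 0x02),
        bool(adaptation_field_control & 0x01),
    )


def first_pes_start_after(packets: Sequence[bytes], index: int, pid: int) -> Optional[int]:
    """Index of the first packet after `index` with this PID that starts a PES packet."""
    for i in range(index + 1, len(packets)):
        header = parse_ts_header(packets[i])
        if header.pid == pid and header.payload_unit_start and header.has_payload:
            return i
    return None


def make_packet(pid: int, start: bool, adaptation_field_control: int = 0x1) -> bytes:
    """Build a dummy TS packet with a zeroed body (demo helper only)."""
    byte1 = (0x40 if start else 0x00) | ((pid >> 8) & 0x1F)
    return bytes([SYNC_BYTE, byte1, pid & 0xFF, adaptation_field_control << 4]) + bytes(184)


DEMO_PACKETS = [
    make_packet(0x101, False, 0x2),  # adaptation field only: carries the descriptor
    make_packet(0x100, True),        # another component, ignored
    make_packet(0x101, False),       # same PID but continues an earlier PES packet
    make_packet(0x101, True),        # same PID, starts a new PES packet
]
NEXT_START = first_pes_start_after(DEMO_PACKETS, 0, 0x101)
```

The PTS would then be read from the PES header at the start of the payload of the packet found.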
  • extracting the location descriptor of the adaptive streaming media includes: extracting, from the payload of the TEMI PES stream or from the adaptation field of the specific media component TS packets, the adaptation field (AF) descriptor whose tag field value is a predetermined value and whose service type field value is a predetermined value, and using it as the location descriptor of the adaptive streaming media.
  • the location descriptor includes at least one of an event stream scheme identifier field, an event stream value field, and an event identifier field, where the event stream scheme identifier field value is set to uniquely identify an event stream scheme, the event stream value field value is set to uniquely identify an event stream in the event stream scheme, and the event identifier field value is set to uniquely identify an event in the event stream.
  • the location descriptor further includes a media presentation description uniform resource locator URL field, wherein the media presentation description URL field value is set to determine a media presentation description URL of the adaptive streaming media.
  • the method further includes: extracting a forced reload flag from the location information; and, if the value of the forced reload flag is a predetermined value, reloading the media presentation description of the adaptive streaming media.
  • the method further includes: extracting, from the location information of the adaptive streaming media, the pre-announcement flag; and if the value of the pre-announcement flag is a predetermined value, extracting from the location information The activation countdown field and the time stamp field.
  • determining, according to the location information, the media segment in the adaptive streaming media that needs to be time-axis aligned with the transport stream includes: determining the media presentation description (MPD) of the adaptive streaming media corresponding to the location information; and determining, according to the location information and the MPD, the media segment that needs to be time-axis aligned with the transport stream.
  • determining the MPD of the adaptive streaming media corresponding to the location information includes: extracting the MPD descriptor contained in the program map table (PMT) of the transport stream and obtaining, from the MPD descriptor, the uniform resource locator (URL) of the MPD corresponding to the location information; or acquiring the URL of the MPD from the location information.
  • determining, according to the location information and the MPD, the media segment that needs to be time-axis aligned with the transport stream includes: replacing the $EventStream$ identifier in the @media attribute of the media segment template element in the MPD with the event stream scheme identifier field value and the event stream value field value carried in the location information; and replacing the $EventID$ identifier in the @media attribute of the media segment template element in the MPD with the event identifier field value carried in the location information, so as to construct the uniform resource locator (URL) of the media segment of the adaptive streaming media that needs to be time-axis aligned with the transport stream.
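The template substitution above might look like the following sketch. The text does not say how the scheme identifier and the stream value are combined into the $EventStream$ replacement, so joining them with a separator is an assumption, as are the function name, the template, and all example values.

```python
def build_segment_url(media_template: str, scheme_id: str, stream_value: str,
                      event_id: int, joiner: str = "/") -> str:
    """Fill $EventStream$ and $EventID$ in a segment template @media value.

    The joiner between scheme_id and stream_value is an assumption; the source
    only states that both field values are used for $EventStream$.
    """
    event_stream = f"{scheme_id}{joiner}{stream_value}"
    url = media_template.replace("$EventStream$", event_stream)
    return url.replace("$EventID$", str(event_id))


# Hypothetical template and descriptor field values.
DEMO_URL = build_segment_url(
    "http://example.com/ads/$EventStream$/seg-$EventID$.mp4",
    scheme_id="scheme1",
    stream_value="2",
    event_id=7,
)
```

A real scheme identifier is often a URN or URI, so a deployment would also need an agreed escaping rule before placing it in a URL path.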
  • determining, according to the location information and the MPD, the media segment that needs to be time-axis aligned with the transport stream includes: retrieving a period element in the MPD using the event stream scheme identifier field value and the event stream value field value carried in the location information; retrieving a media segment URL sub-element in the period element using the event identifier field value carried in the location information; and constructing, from the @media and @mediaRange attribute values of the media segment URL element, the URL of the media segment of the adaptive streaming media that needs to be time-axis aligned with the transport stream.
  • determining, according to the location information and the MPD, the media segment that needs to be time-axis aligned with the transport stream includes: retrieving a period element in the MPD using the event stream scheme identifier field value and the event stream value field value carried in the location information; retrieving an event sub-element in the period element using the event identifier field value carried in the location information; and constructing, from the @media and @mediaRange attribute values of the event element, the URL of the media segment of the adaptive streaming media that needs to be time-axis aligned with the transport stream.
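Once the matching element has been found, turning its @media and @mediaRange values into an HTTP request could be sketched as follows. The sketch assumes that @media may be relative to the MPD URL and that @mediaRange is a byte range of the form "first-last", as in DASH segment URLs; the function name and example URLs are hypothetical.

```python
from typing import Optional
from urllib.parse import urljoin


def build_segment_request(mpd_url: str, media: str,
                          media_range: Optional[str] = None) -> tuple[str, dict[str, str]]:
    """Resolve @media against the MPD URL and map @mediaRange to a Range header."""
    url = urljoin(mpd_url, media)
    headers: dict[str, str] = {}
    if media_range:
        headers["Range"] = f"bytes={media_range}"
    return url, headers


# Hypothetical attribute values taken from the retrieved element.
DEMO_REQUEST = build_segment_request(
    "http://example.com/content/manifest.mpd", "ad/seg1.mp4", "0-999"
)
```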
  • determining, according to the time information, the presentation timestamp (PTS) of the media presentation time of the media segment in the adaptive streaming media relative to the time axis of the transport stream includes: determining, according to the time information, the PTS of the media presentation time of the media segment that needs to be time-axis aligned, relative to the transport stream timeline; and calculating the PTS of the media segment for the time period during which the media segment is time-axis aligned with the transport stream.
  • determining, according to the time information, the presentation timestamp (PTS) of the media presentation time of the media segment in the adaptive streaming media relative to the time axis of the transport stream further includes: using the PTS value corresponding to the location descriptor of the adaptive streaming media as the PTS of the media presentation time of the media segment relative to the time axis of the transport stream.
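One way to picture the PTS calculation is the sketch below. It assumes the usual MPEG-2 conventions of a 90 kHz PTS clock and a 33-bit counter that wraps around, and it treats the PTS associated with the location descriptor as the anchor for the start of the aligned media segment. The offset arithmetic and function names are illustrative assumptions, not the method stated in the source.

```python
PTS_CLOCK_HZ = 90_000   # MPEG-2 PTS tick rate
PTS_MODULUS = 1 << 33   # PTS is a 33-bit counter and wraps around


def pts_at_offset(anchor_pts: int, offset_seconds: float) -> int:
    """PTS on the transport stream timeline for a point offset_seconds after the anchor."""
    return (anchor_pts + round(offset_seconds * PTS_CLOCK_HZ)) % PTS_MODULUS


def pts_delta_seconds(later_pts: int, earlier_pts: int) -> float:
    """Elapsed seconds between two PTS values, tolerating a single wrap-around."""
    return ((later_pts - earlier_pts) % PTS_MODULUS) / PTS_CLOCK_HZ


# A point 2 seconds into a segment anchored at PTS 900000 (10 s on the TS timeline).
DEMO_SAMPLE_PTS = pts_at_offset(900_000, 2.0)
```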
  • an adaptive streaming media processing method is provided, including: adding location information and/or time information of adaptive streaming media to a transport stream, where the location information is set to determine the media segment in the adaptive streaming media that needs to be time-axis aligned with the transport stream, and the time information is set to determine the presentation timestamp (PTS) of the media presentation time of the media segment in the adaptive streaming media relative to the time axis of the transport stream; and sending the transport stream to the user terminal.
  • adding the location information that carries the adaptive streaming media in the transport stream includes: adding a packet identifier PID in a program mapping table PMT of the transport stream, where the PID is set to acquire a stream type.
  • adding the time information carrying the adaptive streaming media to the transport stream includes: carrying the time information in the TEMI PES code stream extended header field.
  • adding the location information of the adaptive streaming media to the transport stream includes: carrying, in the transport stream, specific media component TS packets whose extension descriptor sub-tag value is a predetermined value, where the location information is carried in the adaptation field of those TS packets.
  • adding the time information carrying the adaptive streaming media to the transport stream includes: carrying a PTS in a packet code stream PES extension header field of the TS code stream payload, where the PTS value is set to be determined The time information of the adaptive streaming media.
  • adding the time information of the adaptive streaming media to the transport stream includes: setting the first TS packet after the specific media component TS packet in the transport stream to have the same PID value as the specific media component TS packet and a payload unit start indicator in the TS header field equal to a predetermined value; and carrying a PTS in the PES extension header field of the payload of that subsequent TS packet, where the PTS value determines the time information of the adaptive streaming media.
  • the AF descriptor is used as a position descriptor of the adaptive streaming medium, and the location descriptor includes the location information.
  • the location descriptor includes at least one of an event stream scheme identifier field, an event stream value field, and an event identifier field, where the event stream scheme identifier field value is set to uniquely identify an event stream scheme, the event stream value field value is set to uniquely identify an event stream in the event stream scheme, and the event identifier field value is set to uniquely identify an event in the event stream.
  • the location descriptor further includes a media presentation description URL field, wherein the media presentation description URL field value is set to determine a media presentation description URL of the adaptive streaming media.
  • the MPD descriptor is carried in the program map table (PMT) of the transport stream, where the media presentation description URL field is included in the MPD descriptor; or the media presentation description URL field is carried in the location information.
  • an adaptive streaming media processing apparatus is provided, including: an obtaining module, configured to acquire location information and time information of the adaptive streaming media carried in the transport stream; a first determining module, configured to determine, according to the location information, a media segment in the adaptive streaming media that needs to be time-axis aligned with the transport stream; and a second determining module, configured to determine, according to the time information, the presentation timestamp (PTS) of the media presentation time of the media segment in the adaptive streaming media relative to the time axis of the transport stream.
  • an adaptive streaming media processing apparatus comprising: an adding module configured to add location information and/or time information of an adaptive streaming medium to a transport stream, wherein the location The information is configured to determine a media segment in the adaptive streaming media that needs to be time aligned with the transport stream, the time information being set to determine a media presentation time of the media segment in the adaptive streaming media relative to the transmission A display timestamp of the stream timeline; a sending module configured to send the transport stream to the user terminal.
  • FIG. 1 is a schematic diagram of a broadcast/broadband hybrid delivery media synchronization system architecture in accordance with an embodiment of the present invention
  • FIG. 2 is a flowchart of a dynamic adaptive streaming media processing method according to an embodiment of the present invention
  • FIG. 3 is a schematic diagram of acquiring an adaptive streaming media location descriptor according to an alternative embodiment of the present invention.
  • FIG. 4 is a schematic diagram of another method for acquiring an adaptive streaming media location according to an alternative embodiment of the present invention.
  • FIG. 5 is a diagram showing an example of determining media segments that require time axis alignment, in accordance with an embodiment of the present invention
  • FIG. 6 is another example diagram of determining media segments that require time axis alignment, in accordance with an embodiment of the present invention.
  • FIG. 7 is a further example diagram of determining media segments that require time axis alignment, in accordance with an embodiment of the present invention.
  • FIG. 8 is a flowchart of a method for processing a transport stream according to an embodiment of the present invention.
  • FIG. 9 is a structural block diagram of an adaptive streaming media processing apparatus according to an embodiment of the present invention.
  • FIG. 10 is a structural block diagram of another adaptive streaming media processing apparatus according to an embodiment of the present invention.
  • FIG. 1 is a schematic diagram of a broadcast/broadband hybrid delivery media synchronization system architecture. As shown in FIG. 1, the system includes a broadcast content source 100, an MPD server 200, a streaming media server 300, a first terminal 410, and a second terminal 420. The first terminal and the second terminal may be two independent terminals, or the functions of both terminals may be implemented in one physical terminal.
  • the system shown in Fig. 1 will be described below.
  • the broadcast content source 100 may use the MPEG-2 standard, or other standards may be used.
  • MPEG-2 will be described as an example.
  • at the transmitting end, the broadcast content source 100 feeds audio data and video data into their respective encoders for compression and encoding to obtain the audio and video elementary streams (Elementary Stream, ES).
  • each ES is packetized to form a packetized elementary stream (Packetized Elementary Stream, PES), and the PES enters the stream multiplexer together with the program content information and the conditional access information to generate a program stream (Program Stream, PS) or a transport stream (Transport Stream, TS).
  • the payload of the MPEG-2 TS sent by the broadcast content source 100 carries PES packet streams with different packet identifiers (Packet Identifier, PID).
  • the TS packets with a PID value of 0x0000 carry the Program Association Table (PAT). The PAT defines all programs in the TS and provides program initialization information: it is a complete list of all programs in the TS and indicates the program number of each program and the PID of the Program Map Table (PMT) associated with its content.
  • the PMT indicates the correspondence between a program number and the audio, video, and other streams it comprises. It is the complete definition of a program, including the program number, the PCR PID, the stream types (such as audio, video, and other data), and the elementary stream PIDs (the PIDs of the TS packets carrying the audio and video content).
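To make the PAT lookup concrete, the sketch below reads the program number to PMT PID mapping out of a PAT section. It assumes the standard MPEG-2 PAT section layout and skips CRC verification; the function name and the example section bytes are hypothetical.

```python
def parse_pat(section: bytes) -> dict[int, int]:
    """Map program_number -> PMT PID for one PAT section (CRC_32 not verified)."""
    if len(section) < 12 or section[0] != 0x00:
        raise ValueError("not a PAT section (table_id must be 0x00)")
    section_length = ((section[1] & 0x0F) << 8) | section[2]
    end = 3 + section_length - 4  # stop before the 4-byte CRC_32
    if end > len(section):
        raise ValueError("truncated PAT section")
    programs = {}
    for pos in range(8, end, 4):
        program_number = (section[pos] << 8) | section[pos + 1]
        pid = ((section[pos + 2] & 0x1F) << 8) | section[pos + 3]
        if program_number != 0:  # program_number 0 points at the network PID instead
            programs[program_number] = pid
    return programs


# Hypothetical PAT with a single program 1 whose PMT is on PID 0x1000.
EXAMPLE_PAT = bytes([
    0x00, 0xB0, 0x0D,        # table_id, section_length = 13
    0x00, 0x01,              # transport_stream_id
    0xC1, 0x00, 0x00,        # version/current_next, section_number, last_section_number
    0x00, 0x01, 0xF0, 0x00,  # program_number 1 -> PID 0x1000
    0x00, 0x00, 0x00, 0x00,  # CRC_32 placeholder
])
PMT_PIDS = parse_pat(EXAMPLE_PAT)
```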
  • the MPEG-2 TS can also carry a Timeline and External Media Information (TEMI) PES stream, which is set to indicate the time information and location information of the external media resources carried in the MPEG-2 TS.
  • the adaptive streaming media (also referred to as dynamic adaptive streaming media) involved may be implemented in various manners; for example, DASH may be employed.
  • the following embodiments are equally applicable to other adaptive streaming technologies.
  • the MPD server 200 provides MPD file storage in dynamic adaptive streaming media.
  • The MPD file describes the duration, URL, and media attributes (such as video resolution and adapted bit rate) of each media segment.
  • The MPD file provides enough HTTP-URL information for the DASH client to access the streaming media content resources.
  • When the DASH client starts playing a video, it uses the information in the MPD file to select, download, and play the media segments that suit the currently available network bandwidth and the terminal's buffer.
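  • The selection step above can be sketched as follows. This is only a minimal illustration under stated assumptions: the function name `select_bandwidth` and the idea of choosing purely on measured throughput are hypothetical, and a real DASH client would also weigh buffer occupancy and other signals.

```python
def select_bandwidth(representation_bandwidths, available_bps):
    """Return the highest advertised bandwidth that fits the measured throughput.

    Falls back to the lowest representation when none fits, so that playback
    can still start on a slow link. Both arguments are in bits per second.
    """
    if not representation_bandwidths:
        raise ValueError("the MPD lists no representations")
    fitting = [b for b in representation_bandwidths if b <= available_bps]
    return max(fitting) if fitting else min(representation_bandwidths)
```

  • The fallback to the lowest bit rate is a design choice of this sketch, not something the description prescribes.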
  • The streaming server 300 segments the original video and generates the MPD file. Video files are segmented and stored in a hierarchy of periods, representations, and segments. Upon request, the streaming server 300 returns the appropriate media segments to the terminal.
  • The terminal 400 may include a first terminal 410, such as a digital television set-top box or a broadcast/broadband dual-mode set-top box, and a second terminal 420, such as a tablet computer, a smartphone, or a handheld television device.
  • After receiving the MPEG-2 TS stream, the terminal 400 splits it into different parsing channels according to the packet identifiers (PID), parses the audio PES and the video PES, and reassembles the ES streams into the corresponding audio and video buffers.
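  • As a rough illustration of splitting a stream by PID, the sketch below reads the fixed 4-byte header of each 188-byte TS packet and groups packets per PID. The names `TsHeader`, `parse_ts_header`, and `split_by_pid` are hypothetical; the bit positions follow the standard MPEG-2 TS packet header layout.

```python
from typing import NamedTuple

TS_PACKET_SIZE = 188
SYNC_BYTE = 0x47


class TsHeader(NamedTuple):
    pid: int
    payload_unit_start: bool
    adaptation_field_control: int
    continuity_counter: int


def parse_ts_header(packet: bytes) -> TsHeader:
    """Decode the 4-byte header of one transport stream packet."""
    if len(packet) != TS_PACKET_SIZE or packet[0] != SYNC_BYTE:
        raise ValueError("not a 188-byte TS packet starting with 0x47")
    return TsHeader(
        pid=((packet[1] & 0x1F) << 8) | packet[2],          # 13-bit PID
        payload_unit_start=bool(packet[1] & 0x40),           # payload_unit_start_indicator
        adaptation_field_control=(packet[3] >> 4) & 0x03,   # 2 bits
        continuity_counter=packet[3] & 0x0F,                 # 4 bits
    )


def split_by_pid(packets):
    """Group packets into per-PID lists, one list per parsing channel."""
    channels = {}
    for packet in packets:
        channels.setdefault(parse_ts_header(packet).pid, []).append(packet)
    return channels
```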
  • A channel of the demultiplexing module is bound to the TS packets with a PID value of 0x0000, and the PAT is parsed from them to obtain the PID of the PMT of the desired program.
  • The PMT is then located, and the program clock reference (PCR) in the stream, together with the PTS and other time information of the audio and video PES streams of the related program, is extracted.
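  • The PAT lookup described above can be sketched as below. The function name `parse_pat` is hypothetical, the input is assumed to be one complete, already reassembled PAT section, and the CRC_32 is skipped rather than verified to keep the sketch short.

```python
def parse_pat(section: bytes) -> dict:
    """Map program_number -> PMT PID for one complete PAT section."""
    if section[0] != 0x00:
        raise ValueError("not a PAT section (table_id must be 0x00)")
    section_length = ((section[1] & 0x0F) << 8) | section[2]
    end = 3 + section_length - 4  # stop before the 4-byte CRC_32
    programs = {}
    # The program loop starts after the 8-byte section header; 4 bytes per entry.
    for pos in range(8, end, 4):
        program_number = (section[pos] << 8) | section[pos + 1]
        pid = ((section[pos + 2] & 0x1F) << 8) | section[pos + 3]
        if program_number != 0:  # program 0 points at the network PID, not a PMT
            programs[program_number] = pid
    return programs
```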
  • The terminal 400 may further extract a specific type of adaptation field descriptor (AF Descriptor) carried in the TEMI PES stream, together with the presentation time stamp (Presentation Time Stamp, PTS) in the header field of the corresponding PES stream, to obtain the location information and time information of the dynamic adaptive streaming media.
  • the terminal may request the MPD server 200 to acquire the MPD file of the dynamic adaptive streaming media according to the location information.
  • The terminal 400 parses the acquired MPD file to determine the media segment of the dynamic adaptive streaming media that needs to be time-axis aligned with the MPEG-2 TS stream.
  • The terminal 400 may determine, according to the time information, the PTS, relative to the MPEG-2 TS stream timeline, of the media presentation time (Media Presentation Time, MPT) of media segments in the dynamic adaptive streaming media (for example, the time-axis-aligned media segment and the subsequent media segments in the same period).
  • The terminal may periodically send HTTP requests, driven by a quality adaptation (Quality Adaptation) algorithm, to obtain media segments of a suitable bit rate or quality version in the dynamic adaptive streaming media, and perform audio and video synchronization according to the PTS information of the audio and video PES streams of the related program in the MPEG-2 TS stream and the PTS information of the media segments in the dynamic adaptive streaming media.
  • FIG. 2 is a flowchart of a dynamic adaptive streaming media processing method according to an embodiment of the present invention. The method includes the following steps:
  • Step S202: obtain the location information and time information of the adaptive streaming media carried in the transport stream;
  • Step S204: determine, according to the location information, the media segment in the adaptive streaming media that needs to be time-axis aligned with the transport stream;
  • Step S206: determine, according to the time information, the presentation time stamp (PTS), relative to the time axis of the transport stream, of the media presentation time of media segments in the adaptive streaming media.
  • Through the above steps, the location information and the time information of the adaptive streaming media carried in the transport stream are used to determine the media segments in the adaptive streaming media that need to be time-axis aligned with the transport stream and their corresponding presentation time stamps. This solves the problem caused by using a single reference clock for hybrid media delivery over a broadcast network and a broadband network, makes media synchronization in the hybrid delivery mode more efficient, and improves the scalability of service deployment.
  • Step S202 involves acquiring the location information and the time information; this acquisition is described below in conjunction with optional implementations.
  • the location information in step S202 may be a location descriptor (Location Descriptor).
  • FIG. 3 is a schematic diagram of acquiring an adaptive streaming media location descriptor according to an optional embodiment of the present invention. As shown in FIG. 3, the descriptor may be an AF descriptor carried in the PES data payload, and the PTS time value may be carried in the PES extension header field.
  • The terminal first parses the Program Map Table (PMT) in the obtained transport stream and determines the PID of the TEMI PES stream, whose stream type value is a predetermined value (for example, 0x26). The terminal binds this PID to a demultiplexing module channel and filters out the TS packets containing the TEMI PES stream, where the stream identifier value of the TEMI PES stream is private_stream_1.
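  • A minimal sketch of the PMT lookup follows. The names `parse_pmt_streams` and `find_temi_pid` are hypothetical; the value 0x26 is simply the example value quoted in the text, and the section is assumed to be complete, with its CRC_32 skipped rather than verified.

```python
TEMI_STREAM_TYPE = 0x26  # example value quoted in the text, used here as a default


def parse_pmt_streams(section: bytes):
    """Return (stream_type, elementary_PID) pairs from one complete PMT section."""
    if section[0] != 0x02:
        raise ValueError("not a PMT section (table_id must be 0x02)")
    section_length = ((section[1] & 0x0F) << 8) | section[2]
    end = 3 + section_length - 4  # stop before the 4-byte CRC_32
    program_info_length = ((section[10] & 0x0F) << 8) | section[11]
    pos = 12 + program_info_length  # skip the program-level descriptors
    streams = []
    while pos + 5 <= end:
        stream_type = section[pos]
        pid = ((section[pos + 1] & 0x1F) << 8) | section[pos + 2]
        es_info_length = ((section[pos + 3] & 0x0F) << 8) | section[pos + 4]
        streams.append((stream_type, pid))
        pos += 5 + es_info_length  # skip the per-stream descriptors
    return streams


def find_temi_pid(section: bytes, stream_type: int = TEMI_STREAM_TYPE):
    """Return the PID of the first stream with the given type, or None."""
    for found_type, pid in parse_pmt_streams(section):
        if found_type == stream_type:
            return pid
    return None
```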
  • The terminal extracts, from the data payload (TEMI_AU) of the TEMI PES stream, the adaptation field (AF) descriptor whose tag field (af_descr_tag) value is a predetermined value (for example, 0x05) and whose service type field (service_type) value is a predetermined value (for example, 0x01). This AF descriptor is the location descriptor (Location Descriptor) of the dynamic adaptive streaming media, from which the terminal obtains the location information of the dynamic adaptive streaming media.
  • The terminal acquires the TEMI PES stream in the transport stream and extracts the PTS included in the extension header field of the TEMI PES stream as the PTS value corresponding to the dynamic adaptive streaming media location descriptor, thereby acquiring the time information of the dynamic adaptive streaming media.
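  • The PTS extraction can be sketched as follows, assuming the PES packet uses the standard optional PES header (as private_stream_1 does) and that the buffer starts at the PES start code. The function name `extract_pes_pts` is hypothetical, and marker bits are not validated.

```python
def extract_pes_pts(pes: bytes):
    """Return the 33-bit PTS (90 kHz ticks) of a PES packet, or None if absent."""
    if pes[0:3] != b"\x00\x00\x01":
        raise ValueError("missing PES start code prefix")
    pts_dts_flags = (pes[7] >> 6) & 0x03
    if not pts_dts_flags & 0x02:  # PTS is present only for flags '10' or '11'
        return None
    b = pes[9:14]  # 5 bytes: 3 + 15 + 15 PTS bits interleaved with marker bits
    return (
        (((b[0] >> 1) & 0x07) << 30)
        | (b[1] << 22)
        | ((b[2] >> 1) << 15)
        | (b[3] << 7)
        | (b[4] >> 1)
    )
```

  • The returned value is in 90 kHz clock ticks; dividing it by 90000 gives seconds.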
  • FIG. 4 is a schematic diagram of another method for obtaining an adaptive streaming media location descriptor according to an optional embodiment of the present invention. As shown in FIG. 4, the AF descriptor may be carried in the adaptation field of the TS stream.
  • the stream ID indicates whether the PES payload is audio, video or data stream.
  • The terminal acquires the specific media component (video or audio) TS stream in the transport stream whose extension descriptor tag (extension_descriptor_tag) has a predetermined value (for example, 0x04).
  • The terminal extracts the AF descriptor in the adaptation field (Adaptation Field) of the specific media component whose tag field (af_descr_tag) value is a predetermined value (for example, 0x05) and whose service type field (service_type) value is a predetermined value (for example, 0x01). This AF descriptor is the location descriptor (Location Descriptor) of the dynamic adaptive streaming media, from which the terminal obtains the location information of the dynamic adaptive streaming media.
  • The terminal acquires the specific media component (video or audio) TS stream in the transport stream and extracts the PTS included in the extension header field of the PES stream carried in its payload as the PTS value corresponding to the dynamic adaptive streaming media location descriptor, thereby obtaining the time information of the dynamic adaptive streaming media.
  • Alternatively, the terminal acquires the first subsequent TS stream that has the same PID value as the specific media component (video or audio) TS stream and whose payload unit start indicator (payload_unit_start_indicator) in the TS header field takes a predetermined value. The PTS included in the extension header field of the PES stream carried in the payload of that TS stream is extracted as the PTS value corresponding to the dynamic adaptive streaming media location descriptor, to obtain the time information of the dynamic adaptive streaming media.
  • Table 1 shows a syntax structure chart of the adaptive streaming media location descriptor according to an embodiment of the present invention.
  • the location descriptor (Location Descriptor) of the adaptive streaming media in this embodiment is defined as follows:
  • Taking DASH as the service type, for example:
  • timeline_id: the unique identifier of the location descriptor in the MPEG-2 TS stream;
  • force_reload: a 1-bit flag indicating whether the MPD needs to be reloaded before DASH synchronization (that is, media segment timeline alignment);
  • is_announcement: a 1-bit flag indicating that the DASH media segment corresponding to the location descriptor is not yet in the playing (activated) state;
  • time_before_activation: the time remaining before the resource (DASH media segment) referenced by the location descriptor enters the playing state;
  • timescale: the time scale unit used to express the time value of the time_before_activation field;
  • service_type: the service type of the external resource referenced by the location descriptor. In this example, the service type is "MPEG-DASH".
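  • For readers who prefer code, the fields listed above could be held in a small record like the one below. This is a hypothetical in-memory representation only: the class name, the `is_dash` helper, and the choice of 0x01 as the DASH service type (the example value quoted earlier in the text) are assumptions, and the sketch says nothing about the binary layout of Table 1.

```python
from dataclasses import dataclass

DASH_SERVICE_TYPE = 0x01  # example service_type value quoted in the text


@dataclass
class LocationDescriptor:
    timeline_id: int
    force_reload: int            # 1-bit flag: reload the MPD before alignment
    is_announcement: int         # 1-bit flag: segment not yet in the playing state
    time_before_activation: int  # remaining time, expressed in timescale units
    timescale: int
    service_type: int

    def __post_init__(self):
        # The two flags are described as 1-bit fields, so only 0 and 1 are valid.
        for name in ("force_reload", "is_announcement"):
            if getattr(self, name) not in (0, 1):
                raise ValueError(f"{name} is a 1-bit flag and must be 0 or 1")

    @property
    def is_dash(self) -> bool:
        """True when the referenced external resource is an MPEG-DASH service."""
        return self.service_type == DASH_SERVICE_TYPE
```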
  • The location information carried in the location descriptor is used to map the media presentation time of the DASH media segments to the PTS in the MPEG-2 TS stream, that is, to synchronize the DASH streaming media with the MPEG-2 TS stream.
  • The location information that describes the mapping between the DASH streaming media time information and the transport stream time information is referred to as a timeline alignment event stream (Event Stream), and is defined as follows:
  • The event message scheme identification field, together with the event stream value field, uniquely identifies the timeline alignment event stream.
  • The event ID field uniquely identifies a specific timeline alignment event within the timeline alignment event stream.
  • The terminal obtains the MPD descriptor of the adaptive streaming media included in the Program Map Table (PMT). The descriptor can be defined as shown in Table 2, taking DASH as an example:
  • The mpd_url value is the URL of the MPD of the DASH media corresponding to the adaptive streaming media location descriptor.
  • Step S204 involves determining, according to the location information, the media segment in the adaptive streaming media that needs to be time-axis aligned with the transport stream; this is described below in conjunction with optional implementations.
  • the MPD is a document that includes the metadata required by the DASH client to construct the HTTP-URL of the media segment, so that the client can access the media segment and provide streaming services for the user.
  • The MPD may include events used to notify the DASH client or the DASH application of aperiodic information.
  • Events are time-controlled, that is, events start at a particular media presentation time point and usually last for a while. Events include DASH-specific notification events and application-specific events.
  • The Period element in the MPD may include a timeline alignment event stream (EventStream) sub-element, wherein:
  • the EventStream@schemeIdUri attribute uniquely identifies the message scheme (Scheme) of the timeline alignment event stream;
  • the EventStream@value attribute is the value of the timeline alignment event stream; its value space is defined by the message scheme of the timeline alignment event stream identified by event_schemeIdUri;
  • the EventStream element includes a series of timeline alignment Event elements of the same type, each uniquely identified by its Event@id attribute value.
  • The terminal may acquire the MPD corresponding to the location information in different ways:
  • the terminal extracts the MPD descriptor of the dynamic adaptive streaming media included in the PMT, and obtains the URL of the MPD of the dynamic adaptive streaming media corresponding to the location information from the mpd_url field value it contains; or
  • the terminal extracts the dynamic adaptive streaming media location descriptor, and obtains the URL of the MPD of the dynamic adaptive streaming media corresponding to the location information from the mpd_url field value it contains.
  • FIG. 5 is a diagram showing an example of determining a media segment that needs time-axis alignment according to an embodiment of the present invention. As shown in FIG. 5, the example includes:
  • The terminal locates the Period element in the MPD according to the event stream scheme identifier field with the value "urn:uuid:XYZY" and the event stream value field with the value "ad" carried in the location descriptor, that is, the Period element whose @id attribute value is "1" in FIG. 5.
  • The terminal replaces the $EventStream$ identifier of the @media attribute in the media segment template (SegmentTemplate) sub-element of the Period element using the event stream scheme identifier field value "urn:uuid:XYZY" and the event stream value field value "ad" carried in the location descriptor, and replaces the $EventID$ identifier of the same @media attribute using the event identifier (event_id) field value "1" carried in the location descriptor.
  • The resulting HTTP URL of the media segment of the dynamic adaptive streaming media that needs to be time-axis aligned with the transport stream is "http://www.example.com/uuid:XYZY/ad_00002.ts", and so on.
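  • The identifier substitution can be sketched as below. The actual @media template of FIG. 5 is not reproduced in this text, so the template string in the usage note, the way the scheme and value are joined into one token (`event_stream_token`), and the support for a DASH-style `%0Nd` width tag are all assumptions made for illustration.

```python
import re

# Matches $EventStream$ and $EventID$, optionally with a DASH-style %0Nd width tag.
_IDENTIFIER = re.compile(r"\$(EventStream|EventID)(%0(\d+)d)?\$")


def event_stream_token(scheme_id_uri: str, value: str) -> str:
    """Join scheme and value into one path token (assumed convention)."""
    scheme = scheme_id_uri[4:] if scheme_id_uri.startswith("urn:") else scheme_id_uri
    return f"{scheme}/{value}"


def fill_media_template(template: str, event_stream: str, event_id: int) -> str:
    """Replace the $EventStream$ and $EventID$ identifiers in a @media template."""
    def substitute(match):
        name, _, width = match.groups()
        if name == "EventStream":
            return event_stream
        # Zero-pad the event id when the template carries a width tag.
        return str(event_id).zfill(int(width)) if width else str(event_id)

    return _IDENTIFIER.sub(substitute, template)
```

  • With the hypothetical template "http://www.example.com/$EventStream$_$EventID%05d$.ts", the token built from "urn:uuid:XYZY" and "ad", and event id 1, the sketch yields a URL of the same shape as the one quoted above.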
  • FIG. 6 is a diagram showing another example of determining a media segment that needs time-axis alignment according to an embodiment of the present invention. As shown in FIG. 6, the example includes:
  • The terminal locates the Period element in the MPD according to the event stream scheme identifier field with the value "urn:uuid:XYZY" and the event stream value field with the value "ad" carried in the location descriptor, that is, the Period element whose @id attribute value is "1" in FIG. 6.
  • The terminal locates the media segment URL (SegmentURL) sub-element under the Period element in the MPD according to the event identifier field with the value "1" carried in the location descriptor, that is, the SegmentURL element whose @eventID attribute value is "1" in FIG. 6.
  • The next constructed HTTP URL of the media segment of the dynamic adaptive streaming media that needs to be time-axis aligned with the transport stream is "http://www.example.com/seg205.mp4", with the optional range request header field "Range:bytes 500-999", and so on.
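  • The lookup in this example can be sketched with the standard library XML parser. The sample MPD below is made up for illustration (FIG. 6 is not reproduced here), it omits the DASH XML namespace for brevity, and the function name `find_aligned_segment` is hypothetical. The Range header is emitted in the standard HTTP form `bytes=first-last`.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urljoin

# Made-up MPD fragment; the eventID attribute follows the extension described above.
SAMPLE_MPD = """<MPD>
  <BaseURL>http://www.example.com/</BaseURL>
  <Period id="1">
    <EventStream schemeIdUri="urn:uuid:XYZY" value="ad"/>
    <AdaptationSet>
      <Representation id="v1">
        <SegmentList>
          <SegmentURL media="seg204.mp4" mediaRange="0-499"/>
          <SegmentURL media="seg205.mp4" mediaRange="500-999" eventID="1"/>
        </SegmentList>
      </Representation>
    </AdaptationSet>
  </Period>
</MPD>"""


def find_aligned_segment(mpd_xml, scheme_id_uri, value, event_id):
    """Return (url, headers) for the segment tied to the event, or None."""
    root = ET.fromstring(mpd_xml)
    base_url = root.findtext("BaseURL", default="")
    for period in root.findall("Period"):
        stream = period.find("EventStream")
        if stream is None:
            continue
        if stream.get("schemeIdUri") != scheme_id_uri or stream.get("value") != value:
            continue
        for segment in period.iter("SegmentURL"):
            if segment.get("eventID") == event_id:
                url = urljoin(base_url, segment.get("media"))
                media_range = segment.get("mediaRange")
                headers = {"Range": f"bytes={media_range}"} if media_range else {}
                return url, headers
    return None
```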
  • FIG. 7 is a diagram showing another example of determining a media segment that needs time-axis alignment. As shown in FIG. 7, the example includes:
  • The terminal locates the Period element in the MPD according to the event stream scheme identifier field with the value "urn:uuid:XYZY" and the event stream value field with the value "ad" carried in the location descriptor, that is, the Period element whose @id attribute value is "1" in FIG. 7.
  • The terminal locates the event (Event) sub-element under the Period element in the MPD according to the event identifier field with the value "1" carried in the location descriptor, that is, the Event element whose @id attribute value is "1" in FIG. 7.
  • The next constructed HTTP URL of the media segment of the dynamic adaptive streaming media that needs to be time-axis aligned with the transport stream is "http://www.example.com/seg305.mp4", with the optional range request header field "Range:bytes 500-999", and so on.
  • When determining a media segment that needs to be time-axis aligned with the transport stream, the method further includes: the terminal extracts the forced reload (force_reload) flag from the dynamic adaptive streaming media location descriptor; if the value of the flag is 0x01, the MPD of the dynamic adaptive streaming media is reloaded.
  • When determining a media segment that needs to be time-axis aligned with the transport stream, the method further includes: the terminal extracts the is_announcement flag from the dynamic adaptive streaming media location descriptor; if the value of the flag is 0x01, the terminal extracts the time_before_activation field and the timescale field from the location descriptor.
  • Stated more generally, when determining a media segment that needs to be time-axis aligned with the transport stream, the method further includes: extracting a forced reload flag from the location information; and, if the value of the forced reload flag is a predetermined value, reloading the MPD of the DASH media.
  • Likewise, the method further includes: extracting a pre-announcement flag from the location information; and, if the value of the pre-announcement flag is a predetermined value, extracting the activation countdown field and the time scale field from the location information.
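  • The flag handling above can be sketched as a small decision function. The function name, the action labels it returns, and the reading of timescale as ticks per second are assumptions made for illustration; the text itself only states which fields are extracted when each flag is set.

```python
def plan_descriptor_actions(force_reload, is_announcement,
                            time_before_activation=0, timescale=1):
    """List the steps a terminal would take for one location descriptor."""
    actions = []
    if force_reload == 0x01:
        # The MPD must be fetched again before timeline alignment.
        actions.append("reload_mpd")
    if is_announcement == 0x01:
        if timescale <= 0:
            raise ValueError("timescale must be positive")
        # Assumed interpretation: countdown in seconds = ticks / ticks-per-second.
        actions.append(("wait_for_activation", time_before_activation / timescale))
    else:
        actions.append("align_now")
    return actions
```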
  • In step S206, the PTS of the media presentation time of the media segments in the adaptive streaming media relative to the time axis of the transport stream is determined based on the time information; this is described below in conjunction with optional embodiments.
  • One of the main features of DASH is that the encoded versions of different media components share a globally uniform timeline.
  • The presentation time of the access units in the media content is mapped to a globally unified presentation timeline, which facilitates synchronization of different media components and enables seamless switching between different encoded versions of the same media component.
  • The media presentation times of the media segments in a period have a predetermined relationship (for example, they may be continuous); therefore, the media presentation time of the subsequent media segments in the period can be derived from the media presentation time of one media segment.
  • the terminal determines the media presentation time of the subsequent media segments in the period in which the media segment is located according to the media presentation time of the media segment that needs time axis alignment with the transport stream.
  • If the media presentation time of the media segment requiring time-axis alignment is expressed as a PTS relative to the time axis of the transport stream, the following relationship holds (all PTS values in seconds):
  • $$\mathrm{PTS}_{S} = \frac{(S - S_{syn}) \times \text{@duration}}{\text{SegmentBase.timescale}} + \mathrm{PTS}_{syn}$$
  • where $S_{syn}$ is the number, within the period, of the media segment of the dynamic adaptive streaming media that needs time-axis alignment;
  • $S$ is the number of a subsequent media segment in the period;
  • $\mathrm{PTS}_{syn}$ (Segment_syn PTS) is the PTS, relative to the transport stream time axis, of the media presentation time of the media segment of the dynamic adaptive streaming media that needs time-axis alignment;
  • $\mathrm{PTS}_{S}$ (Segment_S PTS) is the PTS, relative to the transport stream time axis, of the media presentation time of subsequent media segment $S$ in the period;
  • @duration and SegmentBase.timescale are the @duration attribute value and the @timescale attribute value of the Period element that contains the media segment, respectively.
  • The terminal determines $\mathrm{PTS}_{syn}$ according to the PTS value corresponding to the dynamic adaptive streaming media location descriptor; that is, $\mathrm{PTS}_{syn}$ is taken from PTSi, where PTSi is the PTS value corresponding to the dynamic adaptive streaming media location descriptor.
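  • As a worked illustration of the relationship above, the sketch below computes the PTS of a subsequent segment in seconds. The function names are hypothetical, and converting a raw PTS to seconds by dividing by the 90 kHz PTS clock is an assumption about how PTSi would be brought into seconds; the text does not spell that step out.

```python
PTS_CLOCK_HZ = 90_000  # MPEG-2 PTS values count ticks of a 90 kHz clock


def pts_ticks_to_seconds(pts_ticks: int) -> float:
    """Convert a raw PTS value (90 kHz ticks) to seconds."""
    return pts_ticks / PTS_CLOCK_HZ


def segment_pts_seconds(s, s_syn, duration, timescale, pts_syn_seconds):
    """PTS (seconds) of segment number s, given the aligned segment s_syn.

    Implements ((s - s_syn) * duration) / timescale + pts_syn_seconds, where
    duration and timescale are the @duration and @timescale attribute values.
    """
    if timescale <= 0:
        raise ValueError("timescale must be positive")
    return (s - s_syn) * duration / timescale + pts_syn_seconds
```

  • For example, with @duration = 180000 and @timescale = 90000 (2-second segments), an aligned segment numbered 5 at 10.0 s places segment 7 at 14.0 s.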
  • FIG. 8 is a flowchart of a method for processing a transport stream according to an embodiment of the present invention. As shown in FIG. 8, the process includes the following steps:
  • Step S802: add location information and/or time information of the adaptive streaming media to the transport stream, where the location information is used to determine the media segment in the adaptive streaming media that needs to be time-axis aligned with the transport stream, and the time information is used to determine the presentation time stamp, relative to the time axis of the transport stream, of the media presentation time of media segments in the adaptive streaming media;
  • Step S804: send the transport stream to the user terminal.
  • To add the location information of the adaptive streaming media to the transport stream, the following method may be adopted: adding a PID to the PMT of the transport stream, where the PID is used to obtain the TEMI PES stream whose stream type value is a predetermined value and which carries the location information.
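  • On the sending side, adding a PID to the PMT amounts to appending one more entry to the PMT's elementary stream loop. The sketch below builds such a 5-byte entry using the standard PMT field layout; the function name is hypothetical, 0x26 is the example stream type quoted in the text, and updating section_length and CRC_32 of the enclosing section is left out.

```python
def build_pmt_stream_entry(stream_type: int, elementary_pid: int,
                           es_info: bytes = b"") -> bytes:
    """Encode one elementary stream entry for the PMT stream loop."""
    if not 0 <= stream_type <= 0xFF:
        raise ValueError("stream_type must fit in 8 bits")
    if not 0 <= elementary_pid <= 0x1FFF:
        raise ValueError("PID must fit in 13 bits")
    if len(es_info) > 0x03FF:
        raise ValueError("ES_info_length is limited to 10 usable bits")
    length = len(es_info)
    return bytes([
        stream_type,
        0xE0 | (elementary_pid >> 8),  # 3 reserved bits set to 1 + PID high bits
        elementary_pid & 0xFF,
        0xF0 | (length >> 8),          # 4 reserved bits set to 1 + length high bits
        length & 0xFF,
    ]) + es_info
```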
  • Adding the time information of the adaptive streaming media to the transport stream includes: carrying the time information in the extension header field of the TEMI PES stream.
  • To add the location information of the adaptive streaming media to the transport stream, the following method may also be adopted: carrying, in the transport stream, a specific media component TS stream whose extension descriptor tag has a predetermined value, wherein the adaptation field of that TS stream carries the location information.
  • In that case, adding the time information of the adaptive streaming media to the transport stream includes: carrying a PTS in the extension header field of the PES carried in the TS stream payload, wherein the PTS value is used to determine the time information of the adaptive streaming media.
  • The time information of the adaptive streaming media may also be added to the transport stream as follows: for the first TS stream that follows the specific media component TS in the transport stream and has the same PID value as the specific media component TS stream, setting the payload unit start indicator in the TS header field to a predetermined value; and carrying a PTS in the PES extension header field of the payload of that subsequent TS stream, wherein the PTS value determines the time information of the adaptive streaming media.
  • The AF descriptor carried in the transport stream whose tag field value is a predetermined value and whose service type field value is a predetermined value is used as the location descriptor of the adaptive streaming media, and the location descriptor includes the location information.
  • The location descriptor further includes at least one of the following: a unique identifier of the location descriptor in the transport stream, a forced reload flag, a pre-announcement flag, an activation countdown field, a time scale, and a service type, wherein: the forced reload flag indicates whether the media presentation description (MPD) needs to be reloaded before media segment timeline alignment is performed; the pre-announcement flag indicates that the media segment of the adaptive streaming media corresponding to the location descriptor is not yet in the playing state; the activation countdown field indicates the time remaining before the media segment of the adaptive streaming media corresponding to the location descriptor enters the playing state; the time scale indicates the unit of the time value of the activation countdown field; and the service type indicates the service type of the external resource referenced by the location descriptor.
  • The transport stream may further carry information used to determine the media presentation description (MPD) of the adaptive streaming media corresponding to the location information.
  • An MPD descriptor is carried in the PMT of the transport stream, wherein the MPD descriptor includes the URL of the MPD corresponding to the location information; alternatively, the URL of the MPD is obtained from the location information.
  • An adaptive streaming media processing device is also provided in this embodiment.
  • The device is configured to implement the foregoing embodiments and preferred embodiments; what has already been described is not repeated here.
  • As used below, the term "module" may be a combination of software and/or hardware that implements a predetermined function.
  • Although the apparatus described in the following embodiments is preferably implemented in software, implementation in hardware, or in a combination of software and hardware, is also possible and contemplated.
  • FIG. 9 is a structural block diagram of an adaptive streaming media processing apparatus according to an embodiment of the present invention. As shown in FIG. 9, the apparatus includes an obtaining module 92, a first determining module 94, and a second determining module 96. The apparatus is described below.
  • The obtaining module 92 is configured to obtain the location information and the time information of the adaptive streaming media carried in the transport stream.
  • The first determining module 94 is connected to the obtaining module 92 and is configured to determine, according to the location information, the media segment in the adaptive streaming media that needs to be time-axis aligned with the transport stream. The second determining module 96 is connected to the first determining module 94 and is configured to determine, according to the time information, the presentation time stamp, relative to the time axis of the transport stream, of the media presentation time of media segments in the adaptive streaming media.
  • FIG. 10 is a structural block diagram of another adaptive streaming media processing apparatus according to an embodiment of the present invention. As shown in FIG. 10, the apparatus includes an adding module 102 and a sending module 104. The apparatus is described below.
  • The adding module 102 is configured to add location information and/or time information of the adaptive streaming media to the transport stream, wherein the location information is used to determine the media segment in the adaptive streaming media that needs to be time-axis aligned with the transport stream, and the time information is used to determine the presentation time stamp, relative to the time axis of the transport stream, of the media presentation time of media segments in the adaptive streaming media.
  • The sending module 104 is connected to the adding module 102 and is configured to send the transport stream to the user terminal.
  • The present invention uses the location information and time information of the adaptive streaming media carried in the transport stream to determine the media segments of the adaptive streaming media that need to be time-axis aligned with the transport stream and their corresponding presentation time stamps. This solves the problem caused in the related art by using a single reference clock for hybrid media delivery over a broadcast network and a broadband network, makes media synchronization in the hybrid delivery mode more efficient, and improves the scalability of service deployment.
  • The modules or steps of the present invention described above can be implemented by a general-purpose computing device; they can be concentrated on a single computing device or distributed across a network of multiple computing devices. Optionally, they may be implemented as program code executable by a computing device, so that they may be stored in a storage device and executed by the computing device; in some cases, the steps shown or described may be performed in an order different from the one given here. Alternatively, they may be fabricated as individual integrated circuit modules, or several of the modules or steps may be fabricated as a single integrated circuit module.
  • the invention is not limited to any specific combination of hardware and software.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Computer Security & Cryptography (AREA)
  • Information Transfer Between Computers (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

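The present invention provides an adaptive streaming media processing method and device. The method includes: obtaining location information and time information of adaptive streaming media carried in a transport stream; determining, according to the location information, a media segment in the adaptive streaming media that needs to be time-axis aligned with the transport stream; and determining, according to the time information, the presentation time stamp, relative to the time axis of the transport stream, of the media presentation time of media segments in the adaptive streaming media. The present invention solves the problem caused in the related art by using a single reference clock for hybrid media delivery over a broadcast network and a broadband network, making media synchronization in the hybrid delivery mode more efficient.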

Description

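Adaptive streaming media processing method and device
Technical Field
The present invention relates to the field of communications, and in particular to an adaptive streaming media processing method and device.
Background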
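Users access communication services developed and deployed by service providers and/or content providers through fixed and mobile communication terminals such as televisions, desktop computers, tablet computers, and mobile phones. This access is accomplished by establishing a communication link between the user terminal and the service server (content server) with the help of customized communication client capabilities. The communication link is typically established over wired communication networks, such as the various types of digital subscriber line (X Digital Subscriber Line, xDSL), fiber access (Fiber-to-the-x, FTTx), and cable television cable (Cable), and over wireless communication networks, such as wireless fidelity (Wi-Fi), second-generation mobile communication networks (2G), third-generation mobile communication networks (3G), and Long Term Evolution (LTE).
Digital Video Broadcasting (DVB) is one of the world's mainstream digital television standards. Building on the audio and video data compression algorithms and multiplexing technology of the Moving Picture Experts Group 2 standard (MPEG-2), the DVB standard provides a complete set of specifications for digital television systems applicable to different media. It defines three channel coding and modulation schemes for transmitting media data streams, namely cable (DVB-C), satellite (DVB-S), and terrestrial (DVB-T), and it extends the system layer of the MPEG-2 standard and enriches the network interfaces.
Dynamic Adaptive Streaming over HTTP (DASH) is an adaptive bit rate streaming technology that delivers high-quality streaming content using conventional HTTP web servers on the Internet. DASH splits a video into many small file chunks on a HyperText Transfer Protocol (HTTP) server, each containing video content with a short playback time. These small video chunks may be part of a movie several hours long (on demand) or of the live video of a sports event. The video may be encoded into segments at multiple bit rates to cover different network bandwidth requirements.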
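As these communication services have developed, terminal capabilities and network environments have continued to improve, and users' expectations of service experience have risen as well. OTT technologies, represented by adaptive streaming media, offer the flexibility to consume video anytime and anywhere, while traditional broadcast technologies such as IPTV and digital television offer large-screen high definition and quality assurance. Only by converging OTT technology with broadcast technologies such as digital television can a comprehensive solution suitable for many scenarios be provided. A typical case is the second screen (Second Screen) application, which is currently attracting wide attention: users use smartphones, tablet computers, and other "TV buddies" to consume personalized media information associated with the broadcast program on the TV screen. Meanwhile, service operators use a broadcast network (for example, a DVB-C digital television network) to deliver traditional television programs to the mass audience, and use the Internet broadband network (OTT mode) to additionally deliver long-tail Internet on-demand content to specific user groups. This hybrid delivery (Hybrid Delivery) mode saves network investment costs and combines the technical advantages of broadcast and broadband networks, providing users with a content-rich, personalized service experience.
The hybrid delivery mode is characterized by independent content sources, heterogeneous transmission networks, and differentiated playback terminals. Whether a temporal association can be established between the broadcast program, or primary media content (Primary Media), and the broadband program, or ancillary media content (Ancillary Media), that is, whether synchronized playback can be guaranteed across media content from different sources, in different transport formats, and on different playback terminals, has become the key to whether users will accept such applications.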
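Audio and video feature recognition and its applications are one current research direction for solving the media content synchronization problem in the hybrid delivery mode. This approach uses the digital features of the audio and video content itself as the time synchronization reference between media. It includes: 1) Digital watermarking. The digital watermark in the audio signal of a television program is captured, and the program channel identifier and timestamp information in the watermark are used to detect the playback progress of the program, so that appropriate ancillary media content can be obtained and played in synchronization with it. 2) Digital fingerprinting. Digital fingerprint information is extracted from audio and video samples, and the fingerprint is used to query a database to obtain the media synchronization information.
However, digital watermarking and digital fingerprinting are computationally intensive, and both depend on the quality of the audio and video samples captured by the second-screen terminal and on the performance of the feature extraction algorithm. These techniques are also strongly affected by ambient noise and terminal capabilities, so the scenarios in which they apply are limited.
Another research direction for the synchronized media playback problem in the hybrid delivery mode is to use a single reference clock for media delivery over the broadcast network and the broadband network. One method is to also carry the program clock reference/presentation time stamp (PCR/PTS) clock information from the transport stream, for example an MPEG-2 TS stream, in Internet Protocol (IP) packets such as those of the Real-time Transport Protocol (RTP). The main drawback of this scheme is that the PCR clock information in the transport stream may be changed during demultiplexing (and the broadband operator has no way of knowing this), so the continuity of the clock cannot be maintained.
To address the synchronized media playback problem in the hybrid delivery mode over broadcast and broadband networks, an improved signaling mechanism is needed.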
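Summary of the Invention
The present invention provides an adaptive streaming media processing method and device, to at least solve the problem in the prior art of synchronized media playback in the hybrid delivery mode over broadcast and broadband networks.
According to one aspect of the present invention, an adaptive streaming media processing method is provided, including: obtaining location information and time information of adaptive streaming media carried in a transport stream; determining, according to the location information, a media segment in the adaptive streaming media that needs to be time-axis aligned with the transport stream; and determining, according to the time information, the presentation time stamp, relative to the time axis of the transport stream, of the media presentation time of media segments in the adaptive streaming media.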
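Optionally, obtaining the location information of the adaptive streaming media carried in the transport stream includes: parsing the Program Map Table (PMT) in the obtained transport stream, and determining from the PMT the packet identifier (PID) of the Timeline and External Media Information PES stream (TEMI PES stream) whose stream type value is a predetermined value; filtering out the TEMI PES stream according to the PID; and extracting the location descriptor of the adaptive streaming media carried in the TEMI PES stream as the location information.
Optionally, obtaining the time information of the adaptive streaming media carried in the transport stream includes: extracting the presentation time stamp (PTS) included in the extension header field of the TEMI PES stream; and determining the time information of the adaptive streaming media according to the PTS value.
Optionally, obtaining the location information of the adaptive streaming media carried in the transport stream includes: obtaining the specific media component transport stream (TS) stream in the transport stream whose extension descriptor tag has a predetermined value; and extracting the adaptive streaming media location descriptor carried in the adaptation field of the specific media component TS stream as the location information.
Optionally, obtaining the time information of the adaptive streaming media carried in the transport stream includes: extracting the PTS included in the extension header field of the packetized elementary stream (PES) carried in the payload of the specific media component TS stream; and determining the time information of the adaptive streaming media according to the PTS value.
Optionally, obtaining the time information of the adaptive streaming media carried in the transport stream includes: obtaining the first TS stream that follows the specific media component TS in the transport stream, has the same PID value as the specific media component TS stream, and whose payload unit start indicator in the TS header field takes a predetermined value; extracting the PTS included in the PES extension header field carried in the payload of the obtained TS stream; and determining the time information of the adaptive streaming media according to the PTS value.
Optionally, extracting the location descriptor of the adaptive streaming media includes: extracting, from the payload of the TEMI PES stream or from the adaptation field of the specific media component TS stream, the adaptation field (AF) descriptor whose tag field value is a predetermined value and whose service type field value is a predetermined value, as the location descriptor of the adaptive streaming media.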
可选地,所述位置描述子包括以下至少之一:事件流方案标识字段、事件流值字段、事件标识字段,其中,所述事件流方案标识字段值设置为唯一标识事件流方案,所述事件流值字段值设置为唯一标识所述事件流方案中的事件流,所述事件标识字段值设置为唯一标识所述事件流中的事件。
可选地,所述位置描述子还包括媒体呈现描述统一资源定位符URL字段,其中,所述媒体呈现描述URL字段值设置为确定所述自适应流媒体的媒体呈现描述URL。
可选地,所述方法还包括:从所述位置信息中提取所述强制重载标志;在所述强制重载标志的值为预定值的情况下,重新加载所述自适应流媒体的媒体呈现描述。
可选地,所述方法还包括:从所述自适应流媒体的位置信息中提取所述预宣告标志;在所述预宣告标志的值为预定值的情况下,从所述位置信息中提取所述激活倒计时字段和所述时标字段。
可选地,根据所述位置信息确定所述自适应流媒体中需要与所述传输流进行时间轴对齐的媒体片段包括:确定与所述位置信息对应的自适应流媒体的媒体呈现描述MPD;根据所述位置信息和所述MPD确定与所述传输流进行时间轴对齐的所述媒体片段。
可选地,确定与所述位置信息对应的自适应流媒体的所述MPD包括:提取所述传输流中的节目映射表PMT中包括的MPD描述子;从所述MPD描述子中获取与所述位置信息对应的所述MPD的统一资源定位符;或者,从所述位置信息中获取所述MPD的统一资源定位符。
可选地,根据所述位置信息和所述MPD确定与所述传输流进行时间轴对齐的所述媒体片 段包括:使用所述位置信息中携带的事件流方案标识字段值和所述事件流值字段值,替换所述MPD中的媒体片段模板元素@media属性的$EventStream$标识符;使用所述位置信息中携带的事件标识字段值,替换所述MPD中的媒体片段模板元素@media属性的$EventID$标识符,构造设置为标识与所述传输流进行时间轴对齐的所述自适应流媒体的媒体片段的统一资源定位符URL。
可选地,根据所述位置信息和所述MPD确定与所述传输流进行时间轴对齐的所述媒体片段包括:使用所述位置信息中携带的事件流方案标识字段值以及事件流值字段值,检索确定所述MPD中的时段元素;使用所述位置信息中携带的事件标识字段值检索确定所述时段元素中的媒体片段URL子元素;根据所述媒体片段URL元素的@media及@mediaRange属性值,构造需要与所述传输流进行时间轴对齐的自适应流媒体的媒体片段的URL。
可选地,根据所述位置信息和所述MPD确定与所述传输流进行时间轴对齐的所述媒体片段包括:使用所述位置信息中携带的事件流方案标识字段值以及事件流值字段值,检索确定所述MPD中的时段元素;使用所述位置信息中携带的事件标识字段值检索确定所述时段元素的事件子元素;根据所述事件元素的@media及@mediaRange属性值,构造需要与所述传输流进行时间轴对齐的自适应流媒体的媒体片段的URL。
可选地,根据所述时间信息确定所述自适应流媒体中媒体片段的媒体呈现时间相对于所述传输流时间轴的显示时间戳包括:根据所述时间信息确定所述需要与所述传输流进行时间轴对齐的媒体片段的媒体呈现时间相对于所述传输流时间轴的PTS;根据所述需要与所述传输流进行时间轴对齐的媒体片段的PTS计算所述媒体片段所在时段中后续媒体片段的PTS。
可选地,根据所述时间信息确定所述自适应流媒体中媒体片段的媒体呈现时间相对于所述传输流时间轴的显示时间戳还包括:将所述自适应流媒体的位置描述子对应的PTS值作为所述媒体片段的媒体呈现时间相对于所述传输流时间轴的PTS。
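According to another aspect of the present invention, an adaptive streaming media processing method is further provided, comprising: adding location information and/or time information of adaptive streaming media to a transport stream, wherein the location information is used to determine the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream, and the time information is used to determine the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media; and sending the transport stream to a user terminal.
Optionally, adding the location information of the adaptive streaming media to the transport stream comprises: adding a packet identifier (PID) to the Program Map Table (PMT) of the transport stream, wherein the PID is used to obtain the Timeline and External Media Information (TEMI) PES stream whose stream type has a predetermined value, and the TEMI PES stream carries the location information.
Optionally, adding the time information of the adaptive streaming media to the transport stream comprises: carrying the time information in the extension header field of the TEMI PES stream.
Optionally, adding the location information of the adaptive streaming media to the transport stream comprises: carrying in the transport stream the TS stream of a specific media component for which the extension descriptor tag has a predetermined value, wherein the adaptation field of that TS stream carries the location information.
Optionally, adding the time information of the adaptive streaming media to the transport stream comprises: carrying a PTS in the extension header field of the PES in the payload of that TS stream, wherein the PTS value is used to determine the time information of the adaptive streaming media.
Optionally, adding the time information of the adaptive streaming media to the transport stream comprises: setting a TS stream that follows the specific media component TS in the transport stream to have the same PID value as the specific media component TS stream and a payload unit start indicator in the TS header equal to a predetermined value; and carrying a PTS in the PES extension header field in the payload of that subsequent TS stream, wherein the PTS value determines the time information of the adaptive streaming media.
Optionally, the transport stream carries, as the location descriptor of the adaptive streaming media, an AF descriptor whose tag field has a predetermined value and whose service type field has a predetermined value, located in the payload of the TEMI PES stream or in the adaptation field of the specific media component TS stream; the location descriptor includes the location information.
Optionally, the location descriptor includes at least one of: an event stream scheme identifier field, an event stream value field and an event identifier field, wherein the value of the event stream scheme identifier field uniquely identifies an event stream scheme, the value of the event stream value field uniquely identifies an event stream within the event stream scheme, and the value of the event identifier field uniquely identifies an event within the event stream.
Optionally, the location descriptor further includes a media presentation description URL field, whose value is used to determine the media presentation description URL of the adaptive streaming media.
Optionally, an MPD descriptor is carried in the PMT of the transport stream, wherein the MPD descriptor includes the media presentation description URL field; or the media presentation description URL field is carried in the location information.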
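According to another aspect of the present invention, an adaptive streaming media processing apparatus is further provided, comprising: an obtaining module, configured to obtain location information and time information of adaptive streaming media carried in a transport stream; a first determining module, configured to determine, according to the location information, the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream; and a second determining module, configured to determine, according to the time information, the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media.
According to another aspect of the present invention, an adaptive streaming media processing apparatus is further provided, comprising: an adding module, configured to add location information and/or time information of adaptive streaming media to a transport stream, wherein the location information is used to determine the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream, and the time information is used to determine the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media; and a sending module, configured to send the transport stream to a user terminal.
With the solution of the present invention, location information and time information of adaptive streaming media carried in a transport stream are obtained; the media segment that needs to be timeline-aligned with the transport stream is determined according to the location information; and the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments is determined according to the time information. This solves the prior-art problems caused by using a single reference clock and achieves synchronized media playback in the hybrid broadcast/broadband delivery mode.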
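Brief Description of the Drawings
The drawings described here provide a further understanding of the present invention and form part of this application. The illustrative embodiments of the invention and their description explain the invention and do not unduly limit it. In the drawings:
FIG. 1 is a schematic diagram of the architecture of a hybrid broadcast/broadband delivery media synchronization system according to an embodiment of the present invention;
FIG. 2 is a flowchart of a dynamic adaptive streaming media processing method according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of one way of obtaining the adaptive streaming media location descriptor according to an optional embodiment of the present invention;
FIG. 4 is a schematic diagram of another way of obtaining the adaptive streaming media location descriptor according to an optional embodiment of the present invention;
FIG. 5 is an example of one way of determining the media segment that needs timeline alignment according to an embodiment of the present invention;
FIG. 6 is an example of another way of determining the media segment that needs timeline alignment according to an embodiment of the present invention;
FIG. 7 is an example of yet another way of determining the media segment that needs timeline alignment according to an embodiment of the present invention;
FIG. 8 is a flowchart of a transport stream processing method according to an embodiment of the present invention;
FIG. 9 is a structural block diagram of an adaptive streaming media processing apparatus according to an embodiment of the present invention;
FIG. 10 is a structural block diagram of another adaptive streaming media processing apparatus according to an embodiment of the present invention.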
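Detailed Description
The present invention is described in detail below with reference to the drawings and in conjunction with embodiments. Provided there is no conflict, the embodiments in this application and the features in the embodiments may be combined with one another.
An embodiment of the present invention provides a media synchronization system for hybrid broadcast/broadband delivery. FIG. 1 is a schematic diagram of the architecture of this system. As shown in FIG. 1, it comprises a broadcast content source 100, an MPD server 200, a streaming media server 300, a first terminal 410 and a second terminal 420. The first and second terminals may be two separate devices, or their functions may be implemented in a single physical terminal. The system shown in FIG. 1 is described below.
The broadcast content source 100 may use the MPEG-2 standard or another standard; MPEG-2 is used as the example below. Following the MPEG-2 coding model, at the sending end the broadcast content source 100 feeds the audio data and video data into their respective encoders for compression and coding, producing audio and video Elementary Streams (ES). Each ES is passed to a packetizer, which splits and groups it into a Packetized Elementary Stream (PES) of variable-length packets. The PES then enters the multiplexer together with program content information, conditional access information and so on, producing a Program Stream (PS) or a Transport Stream (TS).
The payload of the MPEG-2 TS sent by the broadcast content source 100 carries PES streams with different packet identifiers (PIDs). The TS packets with PID 0x0000 carry the Program Association Table (PAT). The PAT defines all programs in the TS and provides program initialization information; it is a complete list of the programs in the TS and indicates, for each program, its program number and the PID of the Program Map Table (PMT) associated with its content. The PMT gives the correspondence between a program number and the transport streams of the audio, video and other content it comprises. It fully defines one program, including the program number, the PCR PID, the stream types (audio, video, other data and so on) and the elementary stream PIDs (the PIDs of the TS packets that carry the audio/video content). In addition, the MPEG-2 TS may carry a PES stream for Timeline and External Media Information (TEMI), referred to here as the TEMI PES stream, which indicates the time information and location information of the external media resources signaled in the MPEG-2 TS.
The adaptive streaming media (also called dynamic adaptive streaming media) in the following embodiments can be implemented in several ways, for example with DASH. The MPD server and the streaming media server are described below using DASH as the example; the embodiments apply equally to other adaptive streaming technologies.
The MPD server 200 stores the MPD files of the dynamic adaptive streaming media. An MPD file describes the duration, URL and media attributes (such as video resolution and available bitrates) of each media segment. It gives the DASH client enough HTTP-URL information to access the streaming content. When the DASH client starts playing a video, it uses the information in the MPD file to select, download and play the media segments that suit the currently available network bandwidth and the terminal's buffer state.
The streaming media server 300 slices the original video and generates the MPD file. The video file is split and stored in a hierarchy of Periods, Representations and media Segments. On request, the streaming media server 300 returns the appropriate media segments to the terminal.
The terminal 400 may comprise a first terminal 410, for example a digital TV set-top box or a dual-mode broadcast/broadband set-top box, and a second terminal 420, for example a tablet, a smartphone or a handheld TV device.
After receiving the MPEG-2 TS, the terminal 400 (the first terminal 410 and/or the second terminal 420; simply "the terminal" below) routes the stream to different parsing channels according to PID, parses out the audio PES and video PES, and reassembles them into ES data stored in the respective audio and video buffers. First, one channel of the demultiplexer is bound to the TS packets with PID 0x0000, from which the PAT is parsed to obtain the PID of the PMT of the desired program. By binding a channel to that PID, the terminal finds the PMT and extracts timing information such as the program clock reference (PCR) of the stream and the PTS of the audio/video PES streams of the program.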
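The PID-based routing described above can be sketched as follows. This is a minimal illustration that assumes standard 188-byte MPEG-2 TS packets; the names `TsHeader`, `parse_ts_header` and `route_by_pid` are hypothetical and do not come from the application.

```python
from dataclasses import dataclass

TS_PACKET_SIZE = 188   # fixed MPEG-2 TS packet length in bytes
SYNC_BYTE = 0x47       # every TS packet starts with this byte


@dataclass
class TsHeader:
    pid: int                    # 13-bit packet identifier
    payload_unit_start: bool    # payload_unit_start_indicator
    has_adaptation_field: bool  # adaptation_field_control bit 1
    has_payload: bool           # adaptation_field_control bit 0


def parse_ts_header(packet: bytes) -> TsHeader:
    """Decode the 4-byte TS packet header."""
    if len(packet) != TS_PACKET_SIZE or packet[0] != SYNC_BYTE:
        raise ValueError("not a valid 188-byte TS packet")
    pusi = bool(packet[1] & 0x40)
    pid = ((packet[1] & 0x1F) << 8) | packet[2]
    afc = (packet[3] >> 4) & 0x03
    return TsHeader(pid, pusi, bool(afc & 0x2), bool(afc & 0x1))


def route_by_pid(packets):
    """Group packets into per-PID 'channels', as a demultiplexer would."""
    channels = {}
    for packet in packets:
        channels.setdefault(parse_ts_header(packet).pid, []).append(packet)
    return channels
```

A terminal would then read the PAT from `channels[0x0000]`, look up the PMT PID for the desired program, and continue with that channel.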
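In an optional implementation, after receiving the MPEG-2 TS the terminal 400 may also extract the adaptation field descriptor (AF descriptor) of a specific type carried in the TEMI PES stream, together with the corresponding Presentation Time Stamp (PTS) in the PES header, in order to obtain the location information and time information of the dynamic adaptive streaming media.
In an optional implementation, the terminal may request the MPD file of the dynamic adaptive streaming media from the MPD server 200 according to the location information. The terminal 400 parses the MPD file it obtains to determine the media segment of the dynamic adaptive streaming media that needs to be timeline-aligned with the MPEG-2 TS.
In an optional implementation, the terminal 400 may determine, according to the time information, the presentation timestamp (PTS), relative to the MPEG-2 TS timeline, of the Media Presentation Time (MPT) of media segments in the dynamic adaptive streaming media (both the timeline-aligned segment and the subsequent segments in the same Period).
In an optional implementation, the terminal may use a quality adaptation algorithm to issue HTTP requests periodically and obtain media segments of the dynamic adaptive streaming media at a suitable bitrate or quality. It then synchronizes audio and video using the PTS of the audio/video PES streams of the program in the MPEG-2 TS together with the PTS of the media segments of the dynamic adaptive streaming media.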
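The dynamic adaptive streaming media processing method provided by this embodiment is described below with reference to the architecture in FIG. 1. FIG. 2 is a flowchart of the method according to an embodiment of the present invention. As shown in FIG. 2, the flow comprises the following steps:
Step S202: obtain the location information and time information of the adaptive streaming media carried in the transport stream;
Step S204: determine, according to the location information, the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream;
Step S206: determine, according to the time information, the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media.
Through these steps, the location information and time information carried in the transport stream are used to determine the media segment that needs to be timeline-aligned with the transport stream and its presentation timestamp. This solves the problems in the related art caused by using a single reference clock for hybrid broadcast/broadband media delivery, makes media synchronization in the hybrid delivery mode more efficient, and improves the scalability of service deployment.
Step S202 obtains the location information and time information; how they are obtained is described below with optional implementations.
The location information in step S202 may be a location descriptor. FIG. 3 is a schematic diagram of one way of obtaining the adaptive streaming media location descriptor according to an optional embodiment of the present invention. As shown in FIG. 3, the location descriptor may be an AF descriptor carried in the PES data payload, and the PTS time value may be carried in the PES extension header field.
In an optional implementation, the terminal first parses the PMT in the transport stream and determines from it the PID of the TEMI PES stream, whose stream type (stream_type) has a predetermined value (for example 0x26). The terminal binds this PID to a demultiplexer channel and filters out the TS packets containing the TEMI PES stream, whose stream identifier (stream_id) is private_stream_1.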
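The PMT lookup in this step can be sketched as below. The sketch assumes a complete PMT section with the standard MPEG-2 layout (pointer field already stripped) and does not verify the CRC_32; `pmt_streams` and `find_temi_pids` are hypothetical helper names, and 0x26 is the example stream type given in the text.

```python
TEMI_STREAM_TYPE = 0x26  # example predetermined value from the text


def pmt_streams(section: bytes):
    """Return (stream_type, elementary_PID) pairs from one PMT section.

    Assumes `section` starts at table_id and contains the whole section;
    the trailing CRC_32 is skipped, not checked.
    """
    section_length = ((section[1] & 0x0F) << 8) | section[2]
    end = 3 + section_length - 4          # stop before the 4-byte CRC_32
    program_info_length = ((section[10] & 0x0F) << 8) | section[11]
    pos = 12 + program_info_length        # first entry of the stream loop
    streams = []
    while pos + 5 <= end:
        stream_type = section[pos]
        pid = ((section[pos + 1] & 0x1F) << 8) | section[pos + 2]
        es_info_length = ((section[pos + 3] & 0x0F) << 8) | section[pos + 4]
        streams.append((stream_type, pid))
        pos += 5 + es_info_length         # skip this entry's descriptors
    return streams


def find_temi_pids(section: bytes):
    """PIDs whose stream_type marks them as TEMI PES streams."""
    return [pid for stype, pid in pmt_streams(section) if stype == TEMI_STREAM_TYPE]
```

The PIDs returned would then be bound to a demultiplexer channel to filter out the TEMI PES packets.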
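The terminal extracts, from the payload TEMI_AU of the TEMI PES stream, the adaptation field (AF) descriptor whose tag field (af_descr_tag) has a predetermined value (for example 0x05) and whose service type field (service_type) has a predetermined value (for example 0x01). This is the location descriptor of the dynamic adaptive streaming media, from which the terminal obtains the location information.
Referring to FIG. 3, the terminal obtains the TEMI PES stream from the transport stream and extracts the PTS included in its PES extension header field as the PTS value corresponding to the location descriptor of the dynamic adaptive streaming media, thereby obtaining the time information.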
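Reading the PTS can be sketched as follows. The sketch assumes the standard MPEG-2 PES layout, in which the 33-bit PTS sits in the optional PES header (the "extension header field" of the text) at bytes 9 to 13 and counts 90 kHz ticks; `extract_pts` and `pts_seconds` are hypothetical names.

```python
from typing import Optional


def extract_pts(pes: bytes) -> Optional[int]:
    """Return the 33-bit PTS of a PES packet, or None if no PTS is present."""
    if len(pes) < 9 or pes[0:3] != b"\x00\x00\x01":
        raise ValueError("missing PES start code prefix")
    if not (pes[7] & 0x80):               # PTS_DTS_flags: top bit set means PTS present
        return None
    if len(pes) < 14:
        raise ValueError("PES header truncated before the PTS field")
    b = pes[9:14]
    # The PTS is split 3/15/15 bits across five bytes, with marker bits in between.
    return (((b[0] >> 1) & 0x07) << 30) | (b[1] << 22) | ((b[2] >> 1) << 15) \
        | (b[3] << 7) | (b[4] >> 1)


def pts_seconds(pts: int) -> float:
    """Convert 90 kHz PTS ticks to seconds."""
    return pts / 90000
```

For a TEMI PES packet the stream_id byte would be 0xBD (private_stream_1), which uses this optional header.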
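The location information can also be carried in another way. FIG. 4 is a schematic diagram of another way of obtaining the adaptive streaming media location descriptor according to an optional embodiment of the present invention. As shown in FIG. 4, the AF descriptor may be carried in the adaptation field of the TS stream.
According to the PES syntax of the MPEG-2 standard, the stream identifier (stream_id) indicates whether the PES payload is audio, video or a data stream. Referring to FIG. 4, the terminal obtains the TS stream of the specific media component (video or audio) in the transport stream for which the extension descriptor tag (Extension_descriptor_tag) has a predetermined value (for example 0x04).
The terminal extracts, from the adaptation field of the specific media component TS stream, the AF descriptor whose tag field (af_descr_tag) has a predetermined value (for example 0x05) and whose service type field (service_type) has a predetermined value (for example 0x01). This is the location descriptor of the dynamic adaptive streaming media, from which the terminal obtains the location information.
Referring to FIG. 4, the terminal obtains the TS stream of the specific media component (video or audio) from the transport stream and extracts the PTS included in the extension header field of the PES carried in its payload as the PTS value corresponding to the location descriptor of the dynamic adaptive streaming media, thereby obtaining the time information.
Alternatively, the terminal obtains the first subsequent TS stream that has the same PID value as the specific media component (video or audio) TS stream and whose payload unit start indicator (payload_unit_start_indicator) in the TS header is 1, and extracts the PTS included in the extension header field of the PES carried in its payload as the PTS value corresponding to the location descriptor of the dynamic adaptive streaming media, thereby obtaining the time information.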
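The alternative just described, scanning forward for the next packet on the same PID that starts a new PES packet, might look like this. It is a sketch over a list of 188-byte TS packets held in memory; `next_pes_start` and its helpers are hypothetical names.

```python
def pid_of(packet: bytes) -> int:
    """13-bit PID from the TS packet header."""
    return ((packet[1] & 0x1F) << 8) | packet[2]


def starts_payload_unit(packet: bytes) -> bool:
    """True when payload_unit_start_indicator is 1."""
    return bool(packet[1] & 0x40)


def next_pes_start(packets, index):
    """Index of the first packet after `index` with the same PID and
    payload_unit_start_indicator == 1, or None if there is none."""
    pid = pid_of(packets[index])
    for i in range(index + 1, len(packets)):
        if pid_of(packets[i]) == pid and starts_payload_unit(packets[i]):
            return i
    return None
```

The PES header that begins in the payload of the returned packet is where the PTS associated with the location descriptor would be read.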
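In the above embodiments and optional implementations the existing location descriptor is extended. Table 1 shows the syntax of the adaptive streaming media location descriptor according to an embodiment of the present invention.
As shown in Table 1, the location descriptor of the adaptive streaming media in this embodiment is extended as follows, taking the DASH service type as the example:
uimsbf: unsigned integer, most significant bit first;
timeline_id: the unique identifier of the location descriptor in the MPEG-2 TS;
force_reload: 1-bit flag indicating whether the MPD is to be reloaded before DASH synchronization (that is, before media segment timeline alignment);
is_announcement: 1-bit flag indicating that the DASH media segment corresponding to the location descriptor is not yet in the playing (active) state;
time_before_activation: the time remaining before the resource referenced by the location descriptor (the DASH media segment) enters the playing state;
timescale: the time scale unit used to express the time value of the time_before_activation field;
Table 1
[Table 1 is provided as image PCTCN2015087615-appb-000001 in the original publication and is not reproduced here.]
service_type: indicates the service type of the external resource referenced by the location descriptor. When service_type == 1, the service type is "MPEG-DASH". In that case the location information carried in the location descriptor is used to map the media presentation time of DASH media segments to the PTS in the MPEG-2 TS, that is, to synchronize the DASH stream with the MPEG-2 TS.
As shown in Table 1, this embodiment refers to the location information that establishes the mapping between DASH time information and transport stream time information as a timeline alignment event stream, defined as follows:
event_stream_schemeIdUri: when service_type == 1, the location descriptor includes the event stream scheme identifier field event_stream_schemeIdUri, which follows Uniform Resource Name (URN) or Uniform Resource Locator (URL) syntax and uniquely identifies the scheme of the timeline alignment event stream;
event_stream_value: when service_type == 1, the location descriptor includes the event stream value field event_stream_value, whose value space is defined by the message scheme of the timeline alignment event stream identified by event_stream_schemeIdUri, and which identifies the event stream within the event stream scheme;
event_id: when service_type == 1, the location descriptor includes the event identifier field event_id, which identifies an event in the timeline alignment event stream;
The event stream scheme identifier field and the event stream value field together uniquely identify a timeline alignment event stream. The event identifier field uniquely identifies one particular timeline alignment event within that event stream.
Optionally, mpd_url: when service_type == 1, mpd_url is the URL of the DASH MPD corresponding to the location descriptor.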
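In an optional embodiment, the terminal obtains the MPD descriptor of the adaptive streaming media included in the PMT. Taking DASH as the example, it may be defined as shown in Table 2:
Table 2
[Table 2 is provided as image PCTCN2015087615-appb-000002 in the original publication and is not reproduced here.]
bslbf: bit string, left bit first.
mpd_url is the URL of the DASH MPD corresponding to the adaptive streaming media location descriptor.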
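Step S204 above determines, according to the location information, the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream; this is described below with optional implementations.
In dynamic adaptive streaming, the MPD is a document containing the metadata that a DASH client needs to construct the HTTP URLs of media segments, so that the client can access the segments and provide the streaming service to the user.
The MPD may include Events, which notify the DASH client or a DASH application of aperiodic information. Events are timed: an event starts at a particular media presentation time and usually lasts for some duration. Events include DASH-specific notifications and application-specific events.
Corresponding to the timeline alignment event included in the location descriptor above, a Period element in the MPD may include a timeline alignment EventStream child element, in which:
The EventStream@schemeIdUri attribute uniquely identifies the message scheme of the timeline alignment event stream. The EventStream@value attribute is the value of the timeline alignment event stream; its value space is defined by the message scheme identified by event_stream_schemeIdUri. The EventStream element contains a series of timeline alignment Event elements of the same type, each uniquely identified by its Event@id attribute value.
In this embodiment the terminal can obtain the MPD corresponding to the location information in different ways.
The terminal extracts the MPD descriptor of the dynamic adaptive streaming media included in the PMT and obtains, from its mpd_url field, the URL of the MPD corresponding to the location information. Alternatively,
the terminal extracts the location descriptor of the dynamic adaptive streaming media and obtains, from its mpd_url field, the URL of the MPD corresponding to the location information.
There are several ways to determine, from the location information and the corresponding MPD, the media segment of the dynamic adaptive streaming media that needs to be timeline-aligned with the transport stream. They are described in turn below: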
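FIG. 5 is an example of one way of determining the media segment that needs timeline alignment according to an embodiment of the present invention. As shown in FIG. 5:
Using the event stream scheme identifier field with value "urn:uuid:XYZY" and the event stream value field with value "ad" carried in the location descriptor, the terminal locates the Period element in the MPD, namely the Period element in FIG. 5 whose @id attribute is "1".
The terminal replaces the $EventStream$ identifier in the @media attribute of the SegmentTemplate child element of that Period element with the event stream scheme identifier field value "urn:uuid:XYZY" and the event stream value field value "ad" carried in the location descriptor, and replaces the $EventID$ identifier in the same @media attribute with the event identifier (event_id) field value "1" carried in the location descriptor. Combining the result with the BaseURL element defined at another level of the MPD, for example "http://www.example.com/", it constructs the HTTP URL of the media segment of the dynamic adaptive streaming media that needs to be timeline-aligned with the transport stream, namely "http://www.example.com/uuid:XYZY/ad_00001.ts".
For the event identifier (event_id) field value "2" in the next location descriptor of the same type carried in the transport stream, the HTTP URL constructed for the next media segment that needs to be timeline-aligned with the transport stream is "http://www.example.com/uuid:XYZY/ad_00002.ts", and so on.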
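The template substitution can be illustrated with the sketch below. Because FIG. 5 is not reproduced here, both the template string `"$EventStream$_$EventID%05d$.ts"` and the rule that `$EventStream$` expands to the scheme identifier without its "urn:" prefix, followed by "/" and the event stream value, are assumptions chosen only so that the output matches the URLs quoted in the text. `expand_event_template` is a hypothetical name.

```python
import re

# Hypothetical @media template; the real one appears only in FIG. 5.
ASSUMED_TEMPLATE = "$EventStream$_$EventID%05d$.ts"


def expand_event_template(base_url, template, scheme_id_uri, value, event_id):
    """Substitute $EventStream$ and $EventID$ and prepend the BaseURL."""
    # Assumed expansion rule: drop a leading "urn:" and append "/<value>".
    scheme = scheme_id_uri[4:] if scheme_id_uri.startswith("urn:") else scheme_id_uri
    path = template.replace("$EventStream$", scheme + "/" + value)

    def format_id(match):
        # Honour an optional printf-style width such as %05d.
        return (match.group(1) or "%d") % event_id

    path = re.sub(r"\$EventID(%0\d+d)?\$", format_id, path)
    # Plain concatenation: urllib.parse.urljoin would misread "uuid:" as a URL scheme.
    return base_url + path


first = expand_event_template("http://www.example.com/", ASSUMED_TEMPLATE,
                              "urn:uuid:XYZY", "ad", 1)
```

With these assumptions, event_id values 1 and 2 produce the two URLs given in the example above.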
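FIG. 6 is an example of another way of determining the media segment that needs timeline alignment according to an embodiment of the present invention. As shown in FIG. 6:
Using the event stream scheme identifier field with value "urn:uuid:XYZY" and the event stream value field with value "ad" carried in the location descriptor, the terminal locates the Period element in the MPD, namely the Period element in FIG. 6 whose @id attribute is "1".
Using the event identifier field with value "1" carried in the location descriptor, the terminal locates the SegmentURL child element under that Period element, namely the SegmentURL element in FIG. 6 whose @eventID attribute is "1".
From the @media and @mediaRange attribute values of that SegmentURL element, combined with the BaseURL element defined at another level of the MPD, for example "http://www.example.com/", the terminal constructs the HTTP URL of the media segment of the dynamic adaptive streaming media that needs to be timeline-aligned with the transport stream, namely "http://www.example.com/seg201.mp4", together with the optional range request header "Range: bytes=0-499".
For the event identifier (event_id) field value "2" in the next location descriptor of the same type carried in the transport stream, the HTTP URL constructed for the next media segment that needs to be timeline-aligned with the transport stream is "http://www.example.com/seg205.mp4", together with the optional range request header "Range: bytes=500-999", and so on.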
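The SegmentURL lookup can be sketched with Python's standard `xml.etree.ElementTree`. The MPD fragment below is a hypothetical stand-in for FIG. 6 (which is not reproduced here): it omits the XML namespace that real MPDs declare, and the `eventID` attribute on SegmentURL is the extension described in the text. `find_aligned_segment` is a hypothetical name.

```python
import xml.etree.ElementTree as ET

# Hypothetical, namespace-free MPD fragment standing in for FIG. 6.
SAMPLE_MPD = """<MPD>
  <BaseURL>http://www.example.com/</BaseURL>
  <Period id="1">
    <EventStream schemeIdUri="urn:uuid:XYZY" value="ad"/>
    <AdaptationSet>
      <Representation id="v1">
        <SegmentList>
          <SegmentURL eventID="1" media="seg201.mp4" mediaRange="0-499"/>
          <SegmentURL eventID="2" media="seg205.mp4" mediaRange="500-999"/>
        </SegmentList>
      </Representation>
    </AdaptationSet>
  </Period>
</MPD>"""


def find_aligned_segment(mpd_xml, scheme_id_uri, value, event_id):
    """Return (url, headers) for the segment tied to the event, or None."""
    root = ET.fromstring(mpd_xml)
    base_url = root.findtext("BaseURL", default="")
    for period in root.iter("Period"):
        # Pick the Period whose EventStream matches scheme and value.
        if not any(es.get("schemeIdUri") == scheme_id_uri and es.get("value") == value
                   for es in period.iter("EventStream")):
            continue
        for seg in period.iter("SegmentURL"):
            if seg.get("eventID") == str(event_id):
                headers = {}
                if seg.get("mediaRange"):
                    headers["Range"] = "bytes=" + seg.get("mediaRange")
                return base_url + seg.get("media"), headers
    return None
```

The same lookup shape would apply to the Event-based variant, matching on the Event element's @id instead of SegmentURL@eventID.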
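FIG. 7 is an example of yet another way of determining the media segment that needs timeline alignment according to an embodiment of the present invention. As shown in FIG. 7:
Using the event stream scheme identifier field with value "urn:uuid:XYZY" and the event stream value field with value "ad" carried in the location descriptor, the terminal locates the Period element in the MPD, namely the Period element in FIG. 7 whose @id attribute is "1".
Using the event identifier field with value "1" carried in the location descriptor, the terminal locates the Event child element under that Period element, namely the Event element in FIG. 7 whose @id attribute is "1".
From the @media and @mediaRange attribute values of that Event element, combined with the BaseURL element defined at another level of the MPD, for example "http://www.example.com/", the terminal constructs the HTTP URL of the media segment of the dynamic adaptive streaming media that needs to be timeline-aligned with the transport stream, namely "http://www.example.com/seg301.mp4", together with the optional range request header "Range: bytes=0-499".
For the event identifier (event_id) field value "2" in the next location descriptor of the same type carried in the transport stream, the HTTP URL constructed for the next media segment that needs to be timeline-aligned with the transport stream is "http://www.example.com/seg305.mp4", together with the optional range request header "Range: bytes=500-999", and so on.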
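In an optional embodiment, determining the media segment that needs to be timeline-aligned with the transport stream further includes:
the terminal extracts the force-reload (force_reload) flag from the location descriptor of the dynamic adaptive streaming media; if the value of the flag is 0x01, the terminal reloads the MPD of the dynamic adaptive streaming media.
In an optional embodiment, determining the media segment that needs to be timeline-aligned with the transport stream further includes:
the terminal extracts the announcement (is_announcement) flag from the location descriptor of the dynamic adaptive streaming media; if the value of the flag is 0x01, the terminal extracts the time-before-activation (time_before_activation) field and the timescale (timescale) field from the location descriptor.
More generally, in an optional embodiment, determining the media segment that needs to be timeline-aligned with the transport stream further includes: extracting the force-reload flag from the location information; and, when the force-reload flag has a predetermined value, reloading the DASH MPD.
In another optional embodiment, determining the media segment that needs to be timeline-aligned with the transport stream further includes: extracting the announcement flag from the location information; and, when the announcement flag has a predetermined value, extracting the time-before-activation field and the timescale field from the location information.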
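How a terminal might act on these flags is sketched below. The sketch assumes that timescale is expressed in ticks per second, so that the countdown in seconds is time_before_activation divided by timescale; the text only says that timescale is the unit for that field. `LocationFlags` and `seconds_until_activation` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class LocationFlags:
    """Subset of the location descriptor fields discussed in the text."""
    force_reload: bool
    is_announcement: bool
    time_before_activation: int = 0
    timescale: int = 1


def seconds_until_activation(flags: LocationFlags) -> float:
    """Countdown before the referenced segment becomes active.

    Assumes timescale is in ticks per second; returns 0.0 when the
    descriptor is not an announcement.
    """
    if not flags.is_announcement:
        return 0.0
    if flags.timescale <= 0:
        raise ValueError("timescale must be positive")
    return flags.time_before_activation / flags.timescale


def must_reload_mpd(flags: LocationFlags) -> bool:
    """True when the MPD should be fetched again before alignment."""
    return flags.force_reload
```

A terminal could use the countdown to prefetch the MPD and the first segment before the alignment point arrives.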
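Step S206 determines, according to the time information, the PTS, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media; this is described below with optional embodiments.
One of the main features of DASH is that the encoded versions of different media components share a globally unified timeline. The presentation times of access units in the media content are mapped onto this global presentation timeline, which makes it easy to synchronize different media components and to switch seamlessly between different encoded versions of the same media component.
Presentation times within each Period are relative to the PeriodStart time of that Period minus the @presentationTimeOffset value, T_O, of the Representation concerned. That is, for an access unit whose presentation time in the media stream is T_P, the media presentation time relative to PeriodStart is T_M = T_P - T_O.
Under the media segment timeline rules defined by the MPD, the media presentation times of the segments in a Period have a predetermined relationship; for example, they may be contiguous. The media presentation times of the subsequent segments in a Period can therefore be derived from the media presentation time of one segment.
In an optional embodiment of the present invention, the terminal determines the media presentation times of the subsequent media segments in a Period from the media presentation time of the media segment that needs to be timeline-aligned with the transport stream. In particular, when the media presentation time of the segment to be aligned is expressed as a PTS relative to the transport stream timeline, the following relationship holds:
\[ \text{Segment}_{S}\ \text{PTS (in seconds)} = \frac{(S - S_{syn}) \times \text{@duration}}{\text{SegmentBase.timescale}} + \text{Segment}_{syn}\ \text{PTS (in seconds)} \]
Here S_syn is the number, within the Period, of the media segment of the dynamic adaptive streaming media that needs timeline alignment, and S is the number of a subsequent media segment in the Period.
Segment_syn PTS is the PTS, relative to the transport stream timeline, of the media presentation time of the media segment that needs timeline alignment. Segment_S PTS is the PTS, relative to the transport stream timeline, of the media presentation time of a subsequent media segment in the Period.
@duration and SegmentBase.timescale are, respectively, the @duration and @timescale attribute values of the Period element that contains the media segment.
In an optional embodiment of the present invention, the terminal determines the PTS, relative to the transport stream timeline, of the media presentation time of the media segment that needs timeline alignment from the PTS value corresponding to the location descriptor of the dynamic adaptive streaming media, that is:
\[ \text{Segment}_{syn}\ \text{PTS (in seconds)} = \frac{PTS_i}{90000} \]
Here PTS_i is the PTS value corresponding to the location descriptor of the dynamic adaptive streaming media.
Segment_syn PTS is the PTS, relative to the transport stream timeline, of the media presentation time of the media segment of the dynamic adaptive streaming media that needs timeline alignment.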
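The two relationships above translate directly into code. The sketch below uses hypothetical function names and takes the 90 kHz PTS clock from the divisor 90000 in the text.

```python
MPEG2_PTS_CLOCK_HZ = 90000  # PTS ticks per second, per the formula above


def sync_segment_pts_seconds(pts_i: int) -> float:
    """Segment_syn PTS: PTS of the aligned segment, in seconds."""
    return pts_i / MPEG2_PTS_CLOCK_HZ


def segment_pts_seconds(s: int, s_syn: int, duration: int, timescale: int,
                        sync_pts_seconds: float) -> float:
    """Segment_S PTS for a later segment number `s` in the same Period.

    `duration` and `timescale` are the @duration and @timescale values
    from the MPD; `s_syn` is the number of the aligned segment.
    """
    return (s - s_syn) * duration / timescale + sync_pts_seconds


# Worked example: descriptor PTS of 900000 ticks, 2-second segments.
sync = sync_segment_pts_seconds(900000)
later = segment_pts_seconds(s=5, s_syn=3, duration=180000, timescale=90000,
                            sync_pts_seconds=sync)
```

In the worked example the aligned segment starts at 10 seconds on the transport stream timeline, and the segment two positions later starts at 14 seconds.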
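FIG. 8 is a flowchart of a transport stream processing method according to an embodiment of the present invention. As shown in FIG. 8, the flow comprises the following steps:
Step S802: add location information and/or time information of adaptive streaming media to a transport stream, wherein the location information is used to determine the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream, and the time information is used to determine the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media;
Step S804: send the transport stream to a user terminal.
Through these steps, a transport stream to which a stream of a predetermined type has been added is sent to the user terminal. This effectively solves the problems in the related art caused by using a single reference clock for hybrid broadcast/broadband media delivery, makes media synchronization in the hybrid delivery mode more efficient, and improves the scalability of service deployment.
In an optional embodiment, the location information of the adaptive streaming media may be added to the transport stream as follows: a PID is added to the PMT of the transport stream, wherein the PID is used to obtain the TEMI PES stream whose stream type has a predetermined value, and the TEMI PES stream carries the location information.
After the location information has been added, the time information of the adaptive streaming media may also be added. In an optional embodiment, adding the time information to the transport stream comprises: carrying the time information in the extension header field of the TEMI PES stream.
In an optional embodiment, the location information of the adaptive streaming media may also be added to the transport stream as follows: the transport stream carries the TS stream of a specific media component for which the extension descriptor tag has a predetermined value, and the adaptation field of that TS stream carries the location information.
After the location information has been added, the time information of the adaptive streaming media may also be added. In an optional embodiment, adding the time information to the transport stream comprises: carrying a PTS in the PES extension header field in the payload of the TS stream, wherein the PTS value is used to determine the time information of the adaptive streaming media.
The time information may also be added to the transport stream as follows: a TS stream that follows the specific media component TS in the transport stream is set to have the same PID value as the specific media component TS stream and a payload unit start indicator in the TS header equal to a predetermined value; a PTS is carried in the PES extension header field in the payload of that subsequent TS stream, wherein the PTS value determines the time information of the adaptive streaming media.
In an optional embodiment, the transport stream carries, as the location descriptor of the adaptive streaming media, an AF descriptor whose tag field has a predetermined value and whose service type field has a predetermined value; the location descriptor includes the location information.
In an optional embodiment, the location descriptor further includes at least one of: the unique identifier of the location descriptor in the transport stream, a force-reload flag, an announcement flag, a time-before-activation field, a timescale unit and a service type. The force-reload flag indicates whether the media presentation description (MPD) needs to be reloaded before media segment timeline alignment; the announcement flag indicates that the media segment of the adaptive streaming media corresponding to the location descriptor is not yet in the playing state; the time-before-activation field indicates the time remaining before that media segment enters the playing state; the timescale unit indicates the unit of the time value in the time-before-activation field; and the service type indicates the service type of the external resource referenced by the location descriptor.
In an optional embodiment, the transport stream may also carry information used to determine the MPD of the adaptive streaming media corresponding to the location information.
In an optional embodiment, an MPD descriptor is carried in the PMT of the transport stream, wherein the MPD descriptor includes the URL of the MPD corresponding to the location information; or the URL of the MPD is carried in the location information.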
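This embodiment also provides an adaptive streaming media processing apparatus, which implements the above embodiments and preferred implementations; what has already been described is not repeated. As used below, the term "module" may be a combination of software and/or hardware that implements a predetermined function. Although the apparatus described in the following embodiments is preferably implemented in software, implementation in hardware, or in a combination of software and hardware, is also possible and contemplated.
FIG. 9 is a structural block diagram of an adaptive streaming media processing apparatus according to an embodiment of the present invention. As shown in FIG. 9, the apparatus comprises an obtaining module 92, a first determining module 94 and a second determining module 96, which are described below.
The obtaining module 92 is configured to obtain the location information and time information of adaptive streaming media carried in a transport stream. The first determining module 94, connected to the obtaining module 92, is configured to determine, according to the location information, the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream. The second determining module 96, connected to the first determining module 94, is configured to determine, according to the time information, the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media.
FIG. 10 is a structural block diagram of another adaptive streaming media processing apparatus according to an embodiment of the present invention. As shown in FIG. 10, the apparatus comprises an adding module 102 and a sending module 104, which are described below.
The adding module 102 is configured to add location information and/or time information of adaptive streaming media to a transport stream, wherein the location information is used to determine the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream, and the time information is used to determine the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media. The sending module 104, connected to the adding module 102, is configured to send the transport stream to a user terminal.
Industrial applicability: as the above description shows, the present invention uses the location information and time information of adaptive streaming media carried in a transport stream to determine the media segment that needs to be timeline-aligned with the transport stream and its presentation timestamp. This solves the problems in the related art caused by using a single reference clock for hybrid broadcast/broadband media delivery, makes media synchronization in the hybrid delivery mode more efficient, and improves the scalability of service deployment.
Those skilled in the art will appreciate that the modules or steps of the present invention described above can be implemented with general-purpose computing devices. They may be concentrated on a single computing device or distributed over a network of several computing devices. Optionally, they may be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by the computing device; in some cases the steps shown or described may be performed in an order different from the one given here. Alternatively, they may each be fabricated as an individual integrated circuit module, or several of the modules or steps may be fabricated as a single integrated circuit module. The present invention is thus not limited to any specific combination of hardware and software.
The above are only preferred embodiments of the present invention and do not limit it. Those skilled in the art may make various modifications and variations to the invention. Any modification, equivalent substitution or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.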

Claims (30)

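  1. An adaptive streaming media processing method, comprising:
    obtaining location information and time information of adaptive streaming media carried in a transport stream;
    determining, according to the location information, the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream;
    determining, according to the time information, the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media.
  2. The method according to claim 1, wherein obtaining the location information of the adaptive streaming media carried in the transport stream comprises:
    parsing the Program Map Table (PMT) in the obtained transport stream, and determining from the PMT the packet identifier (PID) of the Timeline and External Media Information (TEMI) PES stream whose stream type has a predetermined value;
    filtering out the TEMI PES stream according to the PID;
    extracting the location descriptor of the adaptive streaming media carried in the TEMI PES stream as the location information.
  3. The method according to claim 2, wherein obtaining the time information of the adaptive streaming media carried in the transport stream comprises:
    extracting the presentation timestamp (PTS) included in the extension header field of the TEMI PES stream;
    determining the time information of the adaptive streaming media according to the PTS value.
  4. The method according to claim 1, wherein obtaining the location information of the adaptive streaming media carried in the transport stream comprises:
    obtaining the transport stream (TS) stream of a specific media component in the transport stream for which the extension descriptor tag has a predetermined value;
    extracting the adaptive streaming media location descriptor carried in the adaptation field of the specific media component TS stream as the location information.
  5. The method according to claim 4, wherein obtaining the time information of the adaptive streaming media carried in the transport stream comprises:
    extracting the PTS included in the extension header field of the PES carried in the payload of the specific media component TS stream;
    determining the time information of the adaptive streaming media according to the PTS value.
  6. The method according to claim 4, wherein obtaining the time information of the adaptive streaming media carried in the transport stream comprises:
    obtaining the first TS stream following the specific media component TS in the transport stream that has the same PID value as the specific media component TS stream and whose payload unit start indicator in the TS header has a predetermined value;
    extracting the PTS included in the PES extension header field carried in the payload of the obtained TS stream;
    determining the time information of the adaptive streaming media according to the PTS value.
  7. The method according to any one of claims 2 to 6, wherein extracting the location descriptor of the adaptive streaming media comprises:
    extracting, from the payload of the TEMI PES stream or from the adaptation field of the specific media component TS stream, the adaptation field (AF) descriptor whose tag field has a predetermined value and whose service type field has a predetermined value, as the location descriptor of the adaptive streaming media.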
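  8. The method according to claim 7, wherein the location descriptor includes at least one of:
    an event stream scheme identifier field, an event stream value field and an event identifier field, wherein
    the value of the event stream scheme identifier field uniquely identifies an event stream scheme, the value of the event stream value field uniquely identifies an event stream within the event stream scheme, and the value of the event identifier field uniquely identifies an event within the event stream.
  9. The method according to claim 8, wherein the location descriptor further includes a media presentation description uniform resource locator (URL) field, whose value is used to determine the media presentation description URL of the adaptive streaming media.
  10. The method according to claim 8, wherein the method further comprises:
    extracting the force-reload flag from the location information;
    when the force-reload flag has a predetermined value, reloading the media presentation description of the adaptive streaming media.
  11. The method according to claim 8, wherein the method further comprises:
    extracting the announcement flag from the location information of the adaptive streaming media;
    when the announcement flag has a predetermined value, extracting the time-before-activation field and the timescale field from the location information.
  12. The method according to any one of claims 1 to 11, wherein determining, according to the location information, the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream comprises:
    determining the media presentation description (MPD) of the adaptive streaming media corresponding to the location information;
    determining, according to the location information and the MPD, the media segment to be timeline-aligned with the transport stream.
  13. The method according to claim 12, wherein determining the MPD of the adaptive streaming media corresponding to the location information comprises:
    extracting the MPD descriptor included in the PMT of the transport stream, and obtaining from the MPD descriptor the URL of the MPD corresponding to the location information; or
    obtaining the URL of the MPD from the location information.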
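  14. The method according to claim 12 or 13, wherein determining, according to the location information and the MPD, the media segment to be timeline-aligned with the transport stream comprises:
    replacing the $EventStream$ identifier in the @media attribute of the segment template element in the MPD with the event stream scheme identifier field value and the event stream value field value carried in the location information; and replacing the $EventID$ identifier in the @media attribute of the segment template element in the MPD with the event identifier field value carried in the location information, thereby constructing the URL that identifies the media segment of the adaptive streaming media to be timeline-aligned with the transport stream.
  15. The method according to claim 12 or 13, wherein determining, according to the location information and the MPD, the media segment to be timeline-aligned with the transport stream comprises:
    locating the Period element in the MPD using the event stream scheme identifier field value and the event stream value field value carried in the location information; locating the SegmentURL child element of that Period element using the event identifier field value carried in the location information;
    constructing, from the @media and @mediaRange attribute values of the SegmentURL element, the URL of the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream.
  16. The method according to claim 12 or 13, wherein determining, according to the location information and the MPD, the media segment to be timeline-aligned with the transport stream comprises:
    locating the Period element in the MPD using the event stream scheme identifier field value and the event stream value field value carried in the location information; locating the Event child element of that Period element using the event identifier field value carried in the location information;
    constructing, from the @media and @mediaRange attribute values of the Event element, the URL of the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream.
  17. The method according to any one of claims 1 to 11, wherein determining, according to the time information, the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media comprises:
    determining, according to the time information, the PTS, relative to the transport stream timeline, of the media presentation time of the media segment that needs to be timeline-aligned with the transport stream;
    calculating, from the PTS of the media segment that needs to be timeline-aligned with the transport stream, the PTS of the subsequent media segments in the Period that contains that media segment.
  18. The method according to claim 17, wherein determining, according to the time information, the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media further comprises:
    taking the PTS value corresponding to the location descriptor of the adaptive streaming media as the PTS, relative to the transport stream timeline, of the media presentation time of the media segment.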
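  19. An adaptive streaming media processing method, comprising:
    adding location information and/or time information of adaptive streaming media to a transport stream, wherein the location information is used to determine the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream, and the time information is used to determine the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media;
    sending the transport stream to a user terminal.
  20. The method according to claim 19, wherein adding the location information of the adaptive streaming media to the transport stream comprises:
    adding a packet identifier (PID) to the Program Map Table (PMT) of the transport stream, wherein the PID is used to obtain the Timeline and External Media Information (TEMI) PES stream whose stream type has a predetermined value, and the TEMI PES stream carries the location information.
  21. The method according to claim 20, wherein adding the time information of the adaptive streaming media to the transport stream comprises:
    carrying the time information in the extension header field of the TEMI PES stream.
  22. The method according to claim 19, wherein adding the location information of the adaptive streaming media to the transport stream comprises:
    carrying in the transport stream the TS stream of a specific media component for which the extension descriptor tag has a predetermined value, wherein the adaptation field of the TS stream carries the location information.
  23. The method according to claim 22, wherein adding the time information of the adaptive streaming media to the transport stream comprises:
    carrying a PTS in the extension header field of the PES in the payload of the TS stream, wherein the PTS value is used to determine the time information of the adaptive streaming media.
  24. The method according to claim 22, wherein adding the time information of the adaptive streaming media to the transport stream comprises:
    setting a TS stream that follows the specific media component TS in the transport stream to have the same PID value as the specific media component TS stream and a payload unit start indicator in the TS header equal to a predetermined value;
    carrying a PTS in the PES extension header field in the payload of the subsequent TS stream, wherein the PTS value determines the time information of the adaptive streaming media.
  25. The method according to any one of claims 20 to 24, wherein the transport stream carries, as the location descriptor of the adaptive streaming media, an AF descriptor whose tag field has a predetermined value and whose service type field has a predetermined value, located in the payload of the TEMI PES stream or in the adaptation field of the specific media component TS stream, the location descriptor including the location information.
  26. The method according to any one of claims 20 to 24, wherein the location descriptor includes at least one of:
    an event stream scheme identifier field, an event stream value field and an event identifier field, wherein
    the value of the event stream scheme identifier field uniquely identifies an event stream scheme, the value of the event stream value field uniquely identifies an event stream within the event stream scheme, and the value of the event identifier field uniquely identifies an event within the event stream.
  27. The method according to claim 26, wherein the location descriptor further includes a media presentation description URL field, whose value is used to determine the media presentation description URL of the adaptive streaming media.
  28. The method according to claim 27, wherein
    an MPD descriptor is carried in the Program Map Table (PMT) of the transport stream, wherein the MPD descriptor includes the media presentation description URL field; or
    the media presentation description URL field is carried in the location information.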
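  29. An adaptive streaming media processing apparatus, comprising:
    an obtaining module, configured to obtain location information and time information of adaptive streaming media carried in a transport stream;
    a first determining module, configured to determine, according to the location information, the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream;
    a second determining module, configured to determine, according to the time information, the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media.
  30. An adaptive streaming media processing apparatus, comprising:
    an adding module, configured to add location information and/or time information of adaptive streaming media to a transport stream, wherein the location information is used to determine the media segment of the adaptive streaming media that needs to be timeline-aligned with the transport stream, and the time information is used to determine the presentation timestamp, relative to the transport stream timeline, of the media presentation time of media segments in the adaptive streaming media;
    a sending module, configured to send the transport stream to a user terminal.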
PCT/CN2015/087615 2014-12-31 2015-08-20 自适应流媒体处理方法及装置 WO2016107192A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP15874878.0A EP3242490B1 (en) 2014-12-31 2015-08-20 Self-adaptive streaming media processing method and device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410856623.9 2014-12-31
CN201410856623.9A CN105812961B (zh) 2014-12-31 2014-12-31 自适应流媒体处理方法及装置

Publications (1)

Publication Number Publication Date
WO2016107192A1 true WO2016107192A1 (zh) 2016-07-07

Family

ID=56284128

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/087615 WO2016107192A1 (zh) 2014-12-31 2015-08-20 自适应流媒体处理方法及装置

Country Status (3)

Country Link
EP (1) EP3242490B1 (zh)
CN (1) CN105812961B (zh)
WO (1) WO2016107192A1 (zh)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108111899B (zh) 2017-12-29 2021-05-11 中兴通讯股份有限公司 视频传输方法与装置、客户端、服务器及存储介质
US11470136B2 (en) * 2020-10-07 2022-10-11 Tencent America LLC URL customization using the session-based dash operations
CN117978992A (zh) * 2022-06-09 2024-05-03 腾讯科技(深圳)有限公司 点云媒体的数据处理方法、装置、设备、存储介质及产品
CN117676219A (zh) * 2022-08-29 2024-03-08 华为技术有限公司 一种数据传输方法及装置

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101179484A (zh) * 2006-11-09 2008-05-14 华为技术有限公司 一种不同媒体流间的同步方法及***
CN101547355A (zh) * 2008-03-24 2009-09-30 展讯通信(上海)有限公司 富媒体电视的解码方法
US20130291040A1 (en) * 2011-01-18 2013-10-31 Samsung Electronics Co. Ltd Transmission method and transmission apparatus for a combined broadcasting and communication service
US20140002598A1 (en) * 2012-06-29 2014-01-02 Electronics And Telecommunications Research Institute Transport system and client system for hybrid 3d content service
US20140059180A1 (en) * 2012-08-22 2014-02-27 Futurewei Technologies, Inc. Carriage of ISO-BMFF Event Boxes in an MPEG-2 Transport Stream
CN103875252A (zh) * 2011-10-13 2014-06-18 三星电子株式会社 内容显示方法、内容同步方法、广播内容显示方法及显示装置

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101887506B1 (ko) * 2011-09-22 2018-08-13 삼성전자주식회사 방송 시스템에서 방송 콘텐츠 재생 방법 및 장치
US9246971B2 (en) * 2012-09-07 2016-01-26 Futurewei Technologies, Inc. System and method for segment demarcation and identification in adaptive streaming
KR20160006164A (ko) * 2013-04-16 2016-01-18 엘지전자 주식회사 방송 전송 장치, 방송 수신 장치, 방송 전송 장치의 동작 방법 및 방송 수신 장치의 동작 방법

Also Published As

Publication number Publication date
EP3242490A1 (en) 2017-11-08
CN105812961A (zh) 2016-07-27
EP3242490B1 (en) 2020-12-16
EP3242490A4 (en) 2017-12-20
CN105812961B (zh) 2019-09-10

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15874878

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

REEP Request for entry into the european phase

Ref document number: 2015874878

Country of ref document: EP