WO2000041395A1 - Latency-based statistical multiplexing - Google Patents

Latency-based statistical multiplexing Download PDF

Info

Publication number
WO2000041395A1
WO2000041395A1 PCT/US2000/000249 US0000249W WO0041395A1 WO 2000041395 A1 WO2000041395 A1 WO 2000041395A1 US 0000249 W US0000249 W US 0000249W WO 0041395 A1 WO0041395 A1 WO 0041395A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
applications
encoders
latency
compressed
Prior art date
Application number
PCT/US2000/000249
Other languages
French (fr)
Inventor
Ravi Krishnamurthy
Sriram Sethuraman
Xiaobing Lee
Tihao Chiang
Original Assignee
Sarnoff Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/478,127 external-priority patent/US6665872B1/en
Priority claimed from US09/478,128 external-priority patent/US6754241B1/en
Application filed by Sarnoff Corporation filed Critical Sarnoff Corporation
Publication of WO2000041395A1 publication Critical patent/WO2000041395A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/2365Multiplexing of several video streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/115Selection of the code volume for a coding unit prior to coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • H04N19/14Coding unit complexity, e.g. amount of activity or edge presence estimation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/15Data rate or code amount at the encoder output by monitoring actual compressed data size at the memory before deciding storage at the transmission buffer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/152Data rate or code amount at the encoder output by measuring the fullness of the transmission buffer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/159Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/177Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/179Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scene or a shot
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/2365Multiplexing of several video streams
    • H04N21/23655Statistical multiplexing, e.g. by controlling the encoder to alter its bitrate to optimize the bandwidth utilization
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/238Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4347Demultiplexing of several video streams

Definitions

  • the present invention relates to the compression and transmission of video signals, and, in particular, to the compression and transmission of multiple compressed video streams over a single, shared communication channel.
  • each application is assigned a fixed - although possibly different - allocation of the total available bandwidth, where the sum of the fixed bandwidth allocations is not greater than total channel bandwidth.
  • Fixed multiplexing schemes are appropriate for video applications having constant or at least relatively constant bit rates, or for situations in which the total available channel bandwidth is greater than the sum of the maximum bandwidth requirements for all of the video applications.
  • Many video applications have variable bit rates.
  • a conventional MPEG-2 video encoder encodes sequences of video images by applying a repeating pattern of frame types referred to as the group of picture (GOP) structure.
  • a typical 15-frame GOP structure may be (IBBPBBPBBPBBPBB), where I represents a frame encoded using only intra-frame encoding techniques, P represents a frame encoded using inter-frame encoding techniques based on a previous anchor (i.e., the previous I or P) frame, and B represents a frame encoded using inter-frame encoding techniques based on either a previous anchor frame (forward prediction), a subsequent anchor frame (backward prediction), or an average of previous and subsequent anchor frames (interpolated prediction).
  • B frames are never used as anchor (i.e., reference) frames for encoding other frames.
  • I frames require significantly more bits to encode than P and B frames.
  • predictive encoding schemes like MPEG-2, take advantage of similarities between frames, frames associated with scene changes in video imagery, where frame-to-frame similarity is often low, will also typically require more bits to encode than those frames in the middle of a scene.
  • the compressed video bitstream for a typical video sequence encoded based on a video compression scheme like MPEG-2 that relies on a relatively steady GOP structure will have a variable bit rate profile typically consisting of relatively narrow "peaks" of high bit rate corresponding to I frames and/or scene changes, separated by relatively wide “valleys” of lower, more uniform bit rate corresponding to sequences of P and B frames.
  • Statistical multiplexing can be defined as:
  • Statistical multiplexing schemes often involve the implementation of a dynamic bandwidth manager that controls the allocation of bandwidth to the various video applications in real time. Such bandwidth managers are able to monitor the real-time bit-rate demands of the different video applications to control the transmission of data from those different applications over the shared communication channel.
  • the quality of service depends on the fidelity and uniformity of the video playback, where collectively high fidelity and high uniformity typical mean (1) uniform, full frame rates and (2) uniform high image quality both within each frame and between consecutive frames.
  • the end-to-end latency involved in the processing is of less importance.
  • the primary concern of bandwidth managers for conventional statistical multiplexing schemes involving conventional video applications is to ensure that there will always be sufficient data in the receiver buffer at each user node to provide high fidelity, uniform video playback to each user.
  • High levels of latency are not acceptable for all video applications.
  • Many interactive video applications such as video conferencing and distributed video games where two or more remotely located users compete against each other, require relatively low levels of latency - in addition to high levels of uniformity and fidelity — for acceptable QoS levels.
  • different video applications will have different latency requirements.
  • the latency requirements of even some individual video applications, such as web browsing may vary over time, when the type of video service changes during the application session.
  • conventional multiplexing schemes ⁇ even conventional statistical multiplexing schemes — will not provide acceptable QoS levels, because they do not take into account the different and varying levels of latency required by the different video applications being multiplexed for transmission over a shared communication channel.
  • the present invention is directed to statistical multiplexing schemes that do take into account the corresponding latency requirements of different video applications (in addition to other factors such as uniformity and fidelity of video playback) when managing the bandwidth of a shared communication channel.
  • the statistical multiplexing takes latency into account to provide (a) traffic control (i.e., the control of how the data for multiple compressed video bitstreams is transmitted over the shared communication channel) as well as (b) some level of control over the actual compression processing used to generate those bitstreams for the different video applications.
  • the present invention is a method for controlling transmission over a shared communication channel of multiple compressed video bitstreams generated by a plurality of video encoders and corresponding to multiple video applications, comprising the steps of (a) receiving information for each compressed video bitstream wherein at least two of the video applications have different latency requirements; (b) controlling the transmission of data from the multiple compressed video bitstreams over the shared communication channel taking into account the information for each compressed video bitstream and the latency requirement of each corresponding video application; and (c) adaptively controlling compression processing of at least one of the video encoders taking into account the information for the corresponding compressed video bitstream and the latency requirement of the corresponding video application.
  • the present invention is a video processing system for controlling transmission of multiple compressed video bitstreams corresponding to multiple video applications over a shared communication channel, comprising (a) a plurality of video encoders, each configured to generate a different compressed video bitstream for a different video application, wherein at least two of the video applications have different latency requirements; and (b) a controller, configured to (1) receive the compressed video bitstreams from the video encoders; (2) control the transmission of data from the multiple compressed video bitstreams over the shared communication channel taking into account information for each compressed video bitstream and the latency requirement of each corresponding video application; and (3) adaptively control the compression processing of at least one of the video encoders taking into account the information for the corresponding compressed video bitstream and the latency requirement of the corresponding video application.
  • the present invention is a controller for controlling transmission of multiple compressed video bitstreams corresponding to multiple video applications over a shared communication channel, in a video processing system further comprising a plurality of video encoders, each configured to generate a different compressed video bitstream for a different video application, wherein at least two of the video applications have different latency requirements, wherein the controller is configured to (1) receive the compressed video bitstreams from the video encoders; (2) control the transmission of data from the multiple compressed video bitstreams over the shared communication channel taking into account information for each compressed video bitstream and the latency requirement of each corresponding video application; and (3) adaptively control the compression processing of at least one of the video encoders taking into account the information for the corresponding compressed video bitstream and the latency requirement of the corresponding video application.
  • FIG. 1 shows a block diagram of a video processing system, according to one embodiment of the present invention
  • Fig. 2 shows an assumed piecewise linear cost function based on latency
  • Fig. 3 shows a system-level block diagram of computer system, according to one embodiment of the present invention
  • Fig. 4 shows a board-level block diagram of each encoder board of the computer system of Fig. 3;
  • Fig. 5 shows a board-level block diagram of the statistical multiplexing board of the computer system of Fig. 3.
  • DETAILED DESCRIPTION Fig. 1 shows a block diagram of a video processing system 100, according to one embodiment of the present invention.
  • Video processing system 100 compresses multiple video streams corresponding to different video applications for transmission over a single shared communication channel 116.
  • the different video applications may include any suitable combination of different type of video applications including video conferencing, interactive video games having different levels of sophistication, web browsing, and electronic mail.
  • the shared communication channel may be any suitable transmission path that supports the concurrent transmission of multiple data streams, such as Ethernet, TCP/IP, Broadband networks, satellite, cable transmission, ADSL, DSL, and cable modem.
  • one or more application servers 102 provide multiple video streams to a service admission manager 104, which manages the admission of new video application sessions onto the system.
  • service admission manager 104 In response to a request for admission by a new video application (received from application request queue 106), service admission manager 104 (a) determines whether to accept the request and admit the new video application and, if so, (b) assigns the new video application to an appropriate video encoder.
  • video processing systems in accordance with the present invention have multiple video encoders available to perform the required video compression processing for the different video applications, where different video encoders may provide different levels of video compression processing power (e.g., in terms of frame rate and image fidelity).
  • different video encoders may provide different levels of video compression processing power (e.g., in terms of frame rate and image fidelity).
  • differing levels of video compression processing power make these different video encoders more or less suitable for different video applications having differing bandwidth and latency requirements.
  • High-demand video applications such as high-end interactive video games, typically have high bandwidth and low latency requirements.
  • low-demand video applications such as web browsing and electronic mail, typically have low bandwidth and high latency requirements.
  • video applications such as video streaming and video conferencing, that typically have intermediate to high bandwidth requirements and intermediate to low latency requirements.
  • video encoders may also differ in the degree to which external processors are able to control the details of their internal compression processing. For example, some video encoders may provide external control only at the frame level (e.g., in terms of specifying target bit rates and/or average quantization levels per frame). Other video encoders may also provide external control at the sub-frame level (e.g., in terms of specifying target bit rates and quantization levels at the slice or even macroblock level within each frame).
  • Class 1 encoders 108 provide relatively high levels of video compression processing power (e.g., in terms of high frame rates and high image fidelity), while providing relatively low levels of external control over their internal video compression processing.
  • Class 1 video encoders such as typical hardware encoders, are suitable for video applications requiring both high bandwidth and low latency, such as high-end interactive video games.
  • Class 2 encoders 110 provide slightly lower levels of video compression processing power than Class 1 encoders 108, but higher levels of external control over their internal video compression processing.
  • Class 2 video encoders which are typically high-end software encoders, are suitable for (a) video applications requiring slightly lower bandwidth and/or slightly higher latency, such as video streaming applications and low-end interactive video games.
  • Class 3 encoders 112 provide even lower levels of video compression processing power than Class 2 video encoders 110 with similar or higher levels of external control over their internal video compression processing.
  • Class 3 encoders which are typically low-end software encoders, are suitable for non-time-critical (i.e., high latency) applications with either high or low bandwidth requirements, such as web browsing and electronic mail.
  • video processing system 100 also has a multiplexer (mux) and traffic controller 114 (also referred to herein simply as the multiplexer), which controls the transmission of data from the compressed video bitstreams generated by the various video encoders over the shared communication channel 116.
  • controller 114 uses information corresponding to the various compressed video bitstreams (generated by the various video encoders) to generate control signals that are transmitted back to one or more of the video encoders to adaptively control - at least at some level - the video compression processing performed by those video encoders.
  • the information may include current frame rate, number of bits per frame, picture type, picture duration, picture capture time, and other statistics, such as scene change information, picture variance, motion-compensated error variance, and mode statistics (e.g., number of intra vs. inter macroblocks).
  • scene change information e.g., scene change information
  • picture variance e.g., motion-compensated error variance
  • mode statistics e.g., number of intra vs. inter macroblocks.
  • different types of information can be generated and reported at frame level, slice level, or picture unit level.
  • controller 114 generates two types of video compression control signals: (1) coarse control signals used to control video compression processing, e.g., at the frame level and (2) fine control signals used to control video compression processing at a finer level, e.g., at the sub- frame level. Controller 114 transmits specific coarse video compression control signals to any of the individual video encoders over a coarse control bus 118. In addition, controller 114 transmits specific fine video compression control signals to any individual video encoders (e.g., Class 1 encoders 108 and Class 2 encoders 110) that provide finer external control (e.g., at the sub-frame level) over their internal video compression processing over a fine control bus 120.
  • coarse control signals used to control video compression processing
  • fine control signals used to control video compression processing at a finer level
  • Controller 114 transmits specific coarse video compression control signals to any of the individual video encoders over a coarse control bus 118.
  • controller 114 transmits specific fine video compression control signals to any individual video encoders (e.g.,
  • Coarse video compression control signals correspond to relatively high-level control over video compression processing and may include frame rates, target numbers of bits per frame, and/or average quantization levels over a frame.
  • Fine video compression control signals correspond to relatively low-level control over video compression processing and may include target numbers of bits per slice within a frame, average quantization levels per slice or even per macroblock within a frame. Other types of fine video compression control signals will be described later in this specification.
  • controller 114 takes into account both bandwidth and latency requirements of the various corresponding video applications when performing both its traffic control and compression control functions.
  • Video processing system 100 also has an off-line profiling tool 122, which analyzes, in non-real- time, typical sets of video sequences corresponding to different types of video applications and stores the results of those analyses in an application profiles server 124.
  • the service admission manager 104 accesses information in the application profiles server 124 in order (1) to determine whether to admit a particular new video application and, if so, (2) to determine to which video encoder to assign the newly admitted video application.
  • controller 114 also accesses information in the application profiles server 124 in order to (1) determine an acceptable level of buffering for at least one video application and (2) order packets of data from different video applications.
  • the controller can attempt to maintain this constraint on all the encoders.
  • region of interest information is available, and slice level MQUANT setting is possible, the controller can intelligently trade-off and change the MQUANT over a frame. Similar control for frame- rate and spatial resolution is also possible.
  • video processing system 100 has one or more Class 1 encoders 108, one or more Class 2 encoders 110, and one or more Class 3 encoders 112. It will be understood that, in alternative implementations of the present invention, video processing systems may have fewer or more different classes of encoders available, including those (hardware or software) encoders that provide no degree of external control over their internal video compression processing. With this latter class of "uncontrolled" encoders, the traffic controller processes the corresponding received compressed video bitstreams for transmission over the shared communication channel in an open-loop manner.
  • the traffic controller may be able to exercise some "post-processing" control by altering the bitstream before transmission by dropping frames or even replacing portions of frames such as slices or individual macroblocks with special skip codes. Since the encoders will be unaware of these changes, such post-processing control may adversely affect the quality of the video playback at the end users.
  • video processing system 100 corresponds to three different generic functions: (1) off-line application profiling for content classification (implemented by off-line profiling tool 122), (2) service admission processing (implemented by service admission manager 104), and (3) traffic and compression control (implemented by controller 114). Each of these three functions is described in further detail in the following sections.
  • off-line profiling tool 122 analyzes, in non-real-time, typical sets of video sequences corresponding to different types of video applications and stores the results of those analyses in application profiles server 124.
  • the profiling is semi-automatic and each video application is characterized according to the following parameters: (a) Level of interactivity (related to latency tolerance); (b) Extent of frame-to-frame motion (both peak and average);
  • the peak can be obtained by performing I-frame-only encoding at an acceptable average frame-level quantization (MQUANT) level and picking its peak.
  • the average bit rate can be obtained by IP-only encoding (i.e., no B frames) at the same MQUANT level.
  • Rol Region of Interest
  • Objectionable artifacts Some applications may be very sensitive to frame dropping, others may be sensitive to slice dropping, and still others may be sensitive to spatial adaptation of the quantizer.
  • profiling tool 122 processes the various results to make generalizations about groups of video applications based on their collective similarities and respective differences in order to generate rules used by video processing system 100 in real-time processing of other video applications.
  • profiling can be relatively simple, such as characterizing the level of interactivity of different video applications as either "high,” “intermediate,” or “low.” Alternatively, more and more sophisticated schemes can be implemented.
  • the resulting profile information is stored in application profiles server 124 for eventual use by service admission manager 104 for initial service admission as well as by controller 114 for traffic control and multiplexing.
  • the service provider for a particular video application may be able to maintain user profiles which indicate the behavior of particular users (such as type of games played, levels reached, typical browsing patterns, etc.). This information might only be used as a second-order control, since there may be multiple users with access to a particular user node.
  • Service admission manager 104 determines the mix of the active applications at any given time. The main task of this tool will be to ensure that only services for which (a) the required encoder resources are available and (b) a minimum QoS can be guaranteed for the entire session, are admitted into a multiplex pool.
  • the service admission decision is based on the profiles of the applications that are requested. In one possible implementation, the different video applications are divided into the following classes:
  • service admission manager 104 obtains the following information from application profiles server 124:
  • Class of application e.g., video game (high-end, intermediate, or low-end), web browsing, e- mail, etc.);
  • Motion extent used to determine the frame rate required for the application, which is used by the controller to control the encoders. It can also be used for resource allocation to assign an encoder to the application; (4) Peak bandwidth required; and
  • service admission manager 104 will admit the new application, if and only if both of the following two rules would be satisfied after admitting the new application:
  • the first rule is fairly conservative, and applies to relatively simple implementations of the present invention.
  • controller 114 is provided a high degree of control (i.e., more fine control) over the video compression processing implemented within the various video encoders
  • the first rule can be relaxed.
  • Such fine control may involve control of slice-level and even macroblock-level quantizers as well as the staggering of intra frames across different applications (to ensure that a limited number of applications have intra frame within the same frame time).
  • service admission manager 104 can use a more complicated formula depending on the QoS requirements of the various video applications and take further advantage of the statistical nature of video streams.
  • more applications across the various types may be able to be admitted, as compared to the above solution, which is constrained based on the peak bandwidths of the Cl applications.
  • the motion extent and interactivity can also be used to allocate encoding resources to application.
  • An alternative call admission strategy would be to replace the stringent first condition by: Maximum of the sum of the peak bandwidths of concurrent I frames possible at a time based on the GOP structures for Cl applications + the sum of the average bandwidths of the remaining applications is less than the total channel bandwidth.
  • Such a policy would allow more Cl applications.
  • the probability of not meeting the minimum QoS at a given time instant increases as the number of active applications increase.
  • low latency applications are assigned to video encoders that use only short GOP structures having only I and P (and no B) frames, such as IPPP, where every fourth frame is an I frame.
  • IPPP IP packet
  • Using shorter GOP structures supports interactivity.
  • hardware encoders may be required for such applications.
  • the GOP period should be less than two seconds to handle errors as well as to allow decoder resynchronization when the user flips through channels.
  • an adaptive intra-refresh strategy can be used to avoid having to send I frames so frequently. Instead, different parts of eacii frame are intra-refreshed in different P pictures over a period corresponding to a chosen GOP size.
  • Multiplexer and traffic controller 114 handles the following tasks:
  • the bit allocation and buffer control range from a mere frame-level interaction between controller 114 and each encoder to finer levels, such as at the slice- or even macroblock-level.
  • the fact that the different applications are not frame synchronized can be exploited to provide frame- (or finer) level control of other services, while responding to an unexpectedly high instantaneous bit rate from a particular service.
  • the individual encoders can be staggered with respect to one another over the frame time to allow controller 114 to control the compression processing for certain applications based on the results of compression processing for other applications that fall later within the same frame time.
  • o Class Cl applications These are encoded using hardware encoders that may provide external control over only the specification of frame- level target number of bits and average MQUANT over the frame.
  • o Class C2 applications These are games that are software encoded and do not take a very large bandwidth.
  • the applications are encoded without B frames using GOP structures in which I frames may be encoded at relatively large intervals.
  • Implementing an adaptive macroblock refresh strategy that will intra-code a fraction of the macroblocks in every P frame can support switching back and forth between applications while containing error propagation as well. This will smooth out the bit profile. Any variations will come from content and not from the GOP structure and picture types.
  • Controller 114 acts as a video rate controller and controls the picture type, rate, etc.
  • the control is hierarchical: at one level, picture type and frame- level targets are controlled; at another level, slice-level targets are controlled.
  • the adaptive refresh strategy is also staggered across the different mid-range encoders and are scheduled to coincide with the valleys between the peaks of the Class Cl applications whenever possible.
  • o Class C3 applications It is assumed that web browsing and email applications have virtually no QoS requirements compared to Class Cl and C2 applications. Class C3 applications can be scheduled in the gaps and valleys of the bit profiles of the other services, so as to increase channel utilization. Hence, their latencies can be quite high (of the order of several frame times). For more sophisticate encoding and multiplexing strategies, a dynamic QoS for thess services can be determined on the fly and bandwidth allocation proportional to this dynamic QoS can be made.
  • Advance bit allocation refers to allocation o ' a fraction of the instantaneous bandwidth to each encoder based on its past statistics without actually l nowing the actual complexity of the current frame. This is important for applications having low-latency requirements, which preclude look-ahead based bit allocation.
  • the advance bit allocation for each encoder is implemented based on:
  • control can also decide to skip frames based on the quality requirements.
  • a frame-level target is computed for the encoder that will start encoding a frame next (at any given time), based on the average MQUANT chosen for that encoder.
  • the bit count for a frame can be estimated from prior data.
  • An example of the constraint on MQUANT can be that the quality is uniform across the applications, while ensuring that the temporal rate of change of average MQUANT is within a tolerance threshold.
  • the channel bit rate is divided between the applications according to their respective complexities and relative significance. The complexities are updated on the fly, and the relative significance can be obtained from the results of off-line profiling stored in application profiles server 124.
  • the frame-level target (or average MQUANT) might be able to be communicated to the encoder.
  • the basic unit of operation will be a slice (e.g., a row of macroblocks). Because the encoders are not synchronized, this will require a worst-case buffer requirement of 2 slices.
  • a slice-level target is computed for each controllable encoder based on the frame target, the buffer fullness for that encoder (which is indicative of the buffer delay), and the instantaneous bit rate available after deducting the bits (within a latency window) from the less controllable encoders.
  • the slice targets are also constrained by the fact that MQUANTs cannot change too much within a frame.
  • a one-frame bit buffer is used for Class C3 applications.
  • the encoders encode a new frame only after all the bits for the frame that was encoded before the last frame have been transmitted by controller 114.
  • This on-demand encoding eliminates the possibility of congestion due to Class C3 services.
  • Other strategies to tune the encoding to suit the application's demands are discussed in the following section.
  • Channel Bandwidth Allocation - Embodiment #1 Channel bandwidth allocation is different from the instantaneous bit rate from each encoder because of the mux buffer in controller 114.
  • a certain amount of mux buffering is needed to prevent the individual rate controllers from entering into an oscillatory mode, constantly correcting the allocation and ending up with a highly varying spatial quality rcross a frame.
  • the statistical multiplexing gain tends to be higher as multiplex is performed at a finer level.
  • the actual amount of buffering has to be chosen carefully.
  • the exact amount of buffering at controller 114 for particular applications depends on their latency requirements and the strategies used for handling pathological cases.
  • the channel bandwidth allocation step implemented by controller 114 ensures that the latency requirement for each application is met.
  • each application in Classes Cl and C2 is allocated a bandwidth that is the minimum of the buffer occupancy and the slice-level bit rate used by service admission manager 104, and (2) the remaining bandwidth, if any, is then distributed among all the applications, in turn, to meet their latency requirements.
  • Class Cl applications take precedence over Class C2
  • Class C2 takes precedence over Class C3.
  • Class C3 bits are transmitted only when bits remain in an allocation after the latency requirement for Classes Cl and C2 are met.
  • the buffer occupancy is maintained below the maximum allowed buffer delay for a service during normal operation.
  • the exceptions i.e., when the requirement for Classes Cl and C2 cannot be met) are handled under the pathological cases.
  • Lnom Nominal Latency
  • Lmax Maximum Latency
  • Fig. 2 shows an assumed piecewise linear cost function based on latency. This is the quality measure in terms of latency for a frame that will be used in statistical multiplexing.
  • the costs Ca and Cb in Fig. 2 are obtained from off-line profiling.
  • the following measurement data is received from the encoders: o Picture capture time; o Picture type; o Picture duration; o Average MQUANT used to encode the picture; o Number of bits used to encode the picture; o Advanced statistics such as macroblock variance and other macroblock activity measures; o Whether the picture corresponds to a scene change; and o Similar information for different groups of macroblocks within a picture.
  • the collection of such information over an interval ⁇ Tcurr-M*Tfi, Tcurr ⁇ is denoted as Mij for all the frames in that interval.
  • Each encoder has Mi frames in the buffer, some of which may be partial frames.
  • Let bij be the bits transmitted from each frame of each encoder i.
  • the following iterative procedure provides this:
  • Mij ⁇ This is a modeling problem that estimates the time spent by frame ij in the physical multiplexer, given the current state and current measurements of the system, and the current allocation. This is accomplished by simulating the action of the physical MUX over the next few time-grains (until the frame is transmitted). This involves prediction of future values of bij, which can use the same formula as the initialization step 1.
  • Controller 114 may drop packets, but then processing cannot recover till the next I frame. If picture types can be requested, then controller 114 can request an I frame from the encoder after dropping packets. If picture type cannot be dictated, it may be preferable to delay the frames instead of allowing packet dropping. Then, at the next I frame, the buffer can be flushed thereby dropping packets right before the I frame and resynchronization can then be established with the I frame.
  • controller 114 drops slices and communicates that information back to the encoder.
  • the encoder can keep track of the decoder state.
  • a good strategy at the multiplexer is to drop the whole slice, and instead send a slice with all skipped macroblocks instead. If the encoder knows this information, it can refresh these macroblocks so that the decoder can recover.
  • the encoder may have the ability to save a previous reference frame. In that case, when controller 114 drops a P frame or even just a slice of a P frame, it can inform the encoder so that the encoder will use the previous P frame for subsequent encoding, thereby avoiding prediction errors between the encoder and decoder.
  • the overall system latency is the sum of the latencies introduced by the following components: (1) Decoder Latency: At worst case, this is a delay of 2-frame duration, including the decoding delay and the display delay. A higher frame rate will lead to reduced decoder latency. For frame pictures, this delay will be 66 ms for 30 frames per second (fps). This latency can be reduced by up to 16.5 ms by using field pictures, instead of frame pictures.
  • Encoder latency The encoder is assumed to be reasonably pipelined and the delay is assumed to be about 40% of frame delay. In that case, the delay is roughly 15 ms for a 30-fps transmission. Further computational pipelining of the encoder can reduce this number.
  • Mux buffering at controller 114 This is a buffering delay that can be used for rate control. It is expected to have about 5- 10ms of buffering that can be used for this purpose.
  • the latency estimates for video processing system 100 total less than 100 ms.
  • o Simple Profile Encoding Since B pictures lead to re-ordering delays, in order to maintain low latency, encoding is performed with only I and P pictures. In addition, using dual-prime motion vectors can result in improved compression efficiency for IP-only encoding.
  • o Pipelining the encoder Computational pipelining refers to performing all the encoding tasks on a minimum unit of encoding, e.g., macroblock, slice. Typical hardware encoders use hierarchical motion search and cannot be pipelined entirely.
  • the hierarchical motion estimator can be tailored to start a slice-level pipeline after 3 rows of macroblocks are available.
  • o Field pictures One possibility is to perform field-picture encoding (even though material is progressive). The decoding delay will only be one field interval and this will save l A frame interval in decoding delay.
  • the encoding algorithm would have to be tailored for this coding mode.
  • the fields can either be from the same progressive frames at 30 frames/sec in which case the top and bottom fields are at the same time instant, or they can come by sampling at 60 frames/sec and throwing away alternate fields. The latter solution may better match the interlaced display in the home. In both cases, special preprocessing may then become necessary.
  • the algorithms can be tailored to enable good quality while using this field-picture mode. Algorithmic Improvements for Game/Web Content Encoding
  • Region-of-Interest Encoding Many games have specific regions of interest that are of more importance to the player. For example, most games have a center-weighted region of attention. This can be exploited in the bit- allocation strategy within a frame. Furthermore, it can also be used for intelligent packet-dropping at controller 114 when buffer or latency requirements are not met.
  • the following encoding parameters can be tuned to improve the compression efficiency for game/web content. Note that hardware encoders are usually tuned for natural video scenes and hence might not perform as well on graphics and text content.
  • Rate control initialization A careful initialization of the rate control to match the multiplexer operation as well as the GOP structure can provide substantial improvements in quality.
  • Quantizer matrix selection The quantizer matrices commonly used are tailored to natural video. Matrices can be developed that are tailored to graphics and text.
  • Pre-processing The final output display de-ice is interlaced, even though the encoded material is progressive. Further, field picture coding modes are proposed to reduce latency. Thus, suitable preprocessing by vertical filtering, etc. is essential for good display quality.
  • Rate-distortion models enable prediction of encoding complexities for a frame based on distortion parameters. These models will be useful for the advance allocation statistical multiplexer. However, the models have mostly been developed for natural video and need modifications for game and web content.
  • motion is typically very even and translational across a region of the image. This assumption can be used to reduce the complexity of motion estimation. For example, within a row, motion estimation could be performed on a subset of the macroblocks and if the motion is determined to be similar, the same motion vector can be used for the other macroblocks.
  • Motion estimation complexity can be reduced by exploiting the knowledge about the graphics commands. Intercepted graphics commands can be used to quickly and accurately estimate motion without going through the complete search process. Again, this may lead to significant computational savings.
  • the frame rate can be dynamically adjusted based on the content and the state of controller 114. In cases where the channel is overloaded, frame rates could be reduced to maintain acceptable spatial quality. Note that this solution will mainly work for intermediate- to low-interactivity applications. Another innovation would involve dynamic changes is spatial resolution (to half-horizontal, for example), whenever the content is less detailed or whenever channel constraints so dictate. In MPEG-2 encoding, this is done at the GOP level, rather than at the picture level. However, this is a better response to channel congestion than the catastrophic case handling described in the previous section.
  • the GOP structure can be limited to a relatively simple structure consisting of an I frame followed by a number of consecutive P frames.
  • the frequency of I frames can be dynamically adjusted by controller 114 across the encoders in order to stagger the I frames to take advantage of statistical multiplexing gains.
  • controller 114 should delay and reschedule I frames for the other encoders in order to maintain QoS across different applications.
  • controller 114 may be able to perform one or more of the following miscellaneous features: o Scheduling I frames based on advance knowledge acquired from the application.
  • controller 114 uses advance knowledge from a video application to control the encoding process for that application.
  • One method is when an application like a web-browser can anticipate a scene change when a user clicks a new page, and inform controller 114.
  • Controller 114 can anticipate a large bit rate for the frame and use it to control the compression processing of the other video applications as well as this particular application. For example, any scheduled I frames of the other applications can be switched to adaptive refresh mode. o Use of adaptive intra-refresh for handling scene changes.
  • controller 114 can include the use of intra- macroblocks in the region of interest as a means of control when a scene change has occurred.
  • controller 114 cannot match the latency requirement for a particular video application, it sends a signal back to the application delaying the application. Thus, the application knows that the user has not been given a chance to respond and thus pauses. This is useful in high-interactivity services like video games. This delay can be achieved by using the pause command available on many applications.
  • ROI information region-of- interest
  • controller 114 uses pre-encoded portions of the bitstream and does some bit-stream manipulation. This can be used in web-browsing and for backgrounds of games. In particular, the pre-encoded portions will be used for sections outside the ROIs, as a special method for handling ROI-based control.
  • the proposed statistical multiplexer tools offer the following advantages over other off-the-shelf multiplexers: 1. Exploiting the varying QoS requirements to improve channel utilization while providing an acceptable quality for all applications;
  • a user may decide to keep his initial application running on one channel while surfing other channels in order to return to the initial application. Or, he may run two sessions in parallel and switch between sessions. These cases should be handled effectively, including taking advantage of these situations to reduce transmission bandwidth. For example, after detecting that the user has moved to another channel (e.g., based on monitoring the return path and the content served), a low-bit-rate slide show (e.g., I frames spaced relatively far apart) can be sent for decoder resynchronization when the user comes back to the original interactive application. If the slide show lasts longer than a certain timeout period, the user's session can be automatically terminated. An alternative can be to save the game for later resumption.
  • a low-bit-rate slide show e.g., I frames spaced relatively far apart
  • Computer systems in accordance with the present invention avoid PCI bus delay by using the built-in multi-channel Synchronized Serial Interface (SSI) ports of multiple Digital Signal Processors (DSPs), where each DSP performs video and audio encoder control, PES/TS layer multiplexing, and computation of statistical measurements of its corresponding video stream payload.
  • the DSPs' on-chip memories may also eliminate the need for bitstream First-In, First-Out (FIFO) chips and some common SDRAM (Synchronized Dynamic Random Access Memory) chips.
  • FIFO First-In, First-Out
  • SDRAM Synchronized Dynamic Random Access Memory
  • Fig. 3 shows a system-level block diagram of computer system 300, according to one embodiment of the present invention.
  • Computer system 300 is a PCI bus-based industrial PC (Personal Computer) enclosure with multiple PCI boards.
  • CPU Central Processing Unit
  • Stat-mux statistical multiplexing
  • CPU board 304 is a conventional industrial PC motherboard having a suitable central processor, such as an Intel Pentium IIITM microprocessor by Intel Corporation of Santa Clara, California.
  • CPU board 304 has a conventional PCI interface 310, an ISA (Industry Standards Association) bus interface 312, RS232 ports 314, a (e.g., 100-MHz) Local Area Network (LAN) interface 316, a hard disk / floppy disk (HD/FD) controller, and other standard PC periphery interfaces.
  • Software e.g., in the "C" programming language
  • the Pentium processor may provide main system controls, fault-tolerant controls, and/or statistical multiplexing of those bitstreams that do not have low- latency requirements.
  • Each encoder board 306 is an integrated video/audio encoder with an SDI (Serial Digital Interface or Serial Dl) or ASI (Asynchronous Serial Interface) input port 318, a video encoder 320, an audio encoder 322, a PCI bus interface 324, and a DSP controller 326 (with an SSI port 328) for board- level sub-system control and low-delay PES / TS multiplexing plus bitstream statistics parameter measurement.
  • SDI Serial Digital Interface or Serial Dl
  • ASI Asynchronous Serial Interface
  • Stat-mux board 308 has a PCI bus interface 330 and four DSP chips 332, where each DSP chip 332 has a six-channel SSI DMA (Direct Memory Address) 334 with six SSI ports 336, SRAMs 338, two DSP cores 340, and an ASI/TAXITM chip set from Advanced Micro Devices, Inc., of Sunnyvale, California, and, in block 342, a DHEI (Digital High-speed Expansion Interface) I/O port from General Instrument Corporation (GI) of Horsham, Pennsylvania, for GI's modulator and CA (Conditional
  • stat-mux board 308 can support up to 24 channels of low-delay MPEG2 video/audio input bitstreams.
  • PCI bus 302 is used for power supply and system control for each PCI board.
  • a DSP chip on each encoder board 306 will directly transfer low-delay MPEG2 bitstreams to a corresponding DSP on stat-mux board 308.
  • each low-delay MPEG2 video/audio bitstream will be directly transmitted from the SSI port 328 of the corresponding encoder board 306 to an SSI port 336 on stat- mux board 308.
  • the associated delay can be controlled to correspond to as few as four transport packet delays, with a two-packet delay in the encoder DSP 326, a one-packet delay at an input port 336 of stat-mux board 308, and a one-packet delay at an output port 342 of stat-mux board 308.
  • PCI bus 302 can be used to transmit additional MPEG2 video/audio bitstreams that do not have low-latency requirements.
  • these high-latency bitstreams may be generated by video/audio encoders implemented in software within the central processor on CPU board 304.
  • Fig. 4 shows a board-level block diagram of each encoder board 306 of computer system 300 of Fig. 3, according to one embodiment of the present invention.
  • Encoder board 306 comprises an internal board bus 402 configured with an input interface module 318, an MPEG2 video encoder 320, an AC3 or MP3 audio encoder module 322, a DSP controller 326 with PES/TS-layer multiplexing firmware, and 27-MHz SCR/PCR circuits 408, where SCR is the System Clock Reference in an MPEG video decoder and PCR is the Program Clock Reference in an MPEG transport decoder.
  • Input interface module 318 can support both SDI and ASI circuits with a 270-MHz or 180-MHz line-coded clock, respectively.
  • the SDI or ASI signals can be customized to interlace the uncompressed digital video data and multi-channel audio data.
  • CPLD Complex Programmable Logic Device
  • FPGA Field-Programmable Gate Array
  • MPEG2 video encoder 320 can be any suitable single-chip encoder, such as those supplied by
  • the glue logic can be combined within the input CPLD firmware. There are also some downloadable micro-codes from the MPEG2 chip manufacturer.
  • Audio encoder 322 can be any suitable off-shelf DSP-based sub-system that can support either the AC3 or MP3 encoding function depending on the DSP software. If a TMS320c5410 DSP chip from Texas Instruments Incorporated of Dallas, Texas is used, then the audio encoding functions of audio encoder 322 can be combined with DSP controller 326, shared memories 406, and the PES/TS multiplexing firmware for less board area and lower integration costs.
  • DSP 326 may be a TMS320c5402 DSP from Texas Instruments. DSP 326 will provide of video encoder control, audio encoder control, the SCR/PCR time-base controls, and the overall board-level controls. It will also perform the PES/TS multiplexing of compressed video and audio bitstreams, and the statistical parameter measurements of the video stream. It will also execute the commands of statistical multiplexing controls received from PCI bus 302 of Fig. 3.
  • DSP on-chip SSI output port 328 can be directly connected to an SSI input port of a DSP on stat-mux board 308 of Fig. 3.
  • the on-chip DMA will automatically move data from the TS output buffer of on-chip memory to the serial output port.
  • the TMS320c5410 DSP has 128 Kbytes of on-chip memory and a DMA-controlled host interface port, such that external SRAM and FIFO devices may be eliminated.
  • video encoder 320 is an IBM39 MPEGS422 video encoder chip
  • the video encoder can directly write its compressed video data into the TMS320c5410 on-chip SRAM with a simple CPLD to emulate the FIFO signals.
  • the PES/TS MUX delay can be within transmitting two TS packets of video streams, such as 2 x 188 x 8 * video rate delay.
  • DSP on-chip timer 408 can also be programmed for the 27-MHz SCR/PCR time-base by incorporating on-chip PLL (Phase-Locked Loop) circuits. All of the 27-MHz clocks will be derived from the same 27-MHz clock on stat-mux board 308 through the clocks of the SSI ports connected to all of the encoder boards 306.
  • Fig. 5 shows a board-level block diagram of statistical multiplexing board 308 of computer system 300 of Fig. 3, according to one embodiment of the present invention.
  • Stat-mux board 308 is a low-delay Input/Output (I/O) interface PCI board with the statistical multiplexing system and PCR time- base correction firmware.
  • I/O Input/Output
  • Stat-mux board 308 comprises an internal sub-system bus 502 configured with four Texas Instruments TMS320c5420 DSP chips 332, each having six SSI serial ports 336 and 512 Kbytes of on-chip SRAM memory 338, such that stat-mux board 308 can receive up to 24 different channels of transport bitstreams.
  • Each SSI serial input port 336 has three wires carrying a clock signal (sclk), a data signal (sdat), and a frame signal. All 24 clock signals sclk should be configured as the input clock signals and connected to an on-board 27-MHz clock oscillator 504. 27-MHz clock 504 will also be used as the DSP clock, and on-chip PLL circuits will generate a 90-MHz DSP clock. In that case, on-chip timers can be used for the PCR time-base corrections.
  • the frame signals will indicate whether or not the data signal sdat carries meaningful data.
  • the data signals sdat are burst with a maximum rate of 27 Mbps.
  • the frame signals can also be programmed in a "multi-channel mode" to send multiple packets into assigned on-chip buffers for transmitting the individual encoders' statistical parameters.
  • ASI interface 506 uses a TAXI transmitter chip with parallel interface from Advanced Micro Devices, such that there are FIFO and CPLD control circuits to handle the TAXI interface and ASI controls.
  • a DHEI interface 508 from GI will need additional PLL circuits to generate the output clock, if there is no available input clock signal from DHE T port 510.
  • each of the central processing sub-system, the statistical multiplexing sub-system, and each encoding sub-system is implemented on a separate computer board of the computer system
  • the present invention is not so limited.
  • two or more of the different sub-systems could be implemented on a single board.
  • any of the sub-systems could be implemented on more than one board.
  • the important characteristics of the present invention relate to how the various components of the different sub-systems communicate with one another, rather than where those components are physically located.
  • the present invention has been described in the context of a system having a central processing sub-system, in addition to the statistical multiplexing sub-system and multiple encoding subsystems, all of which are configured to a PCI bus, it will be understood that the present invention is not so limited.
  • the present invention can also be implemented in computer systems in which there is no separate central processing sub-system, but where all of the centralized control functions are implemented in the DSPs of the statistical multiplexing sub-system.
  • such a computer system may be implemented with or without a system bus, such as a PCI bus.
  • the present invention can be embodied in the form of methods and apparatuses for practicing those methods.
  • the present invention can also be embodied in the form of program code embodied in tangible media, such as floppy diskettes, CD-ROMs, hard drives, or any other machine-readable storage medium, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the invention.
  • the present invention can also be embodied in the form of program code, for example, whether stored in a storage medium, loaded into and/or executed by a machine, or transmitted over some transmission medium or carrier, such as over electrical wiring or cabling, through fiber optics, or via electromagnetic radiation, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the invention.
  • the program code segments combine with the processor to provide a unique device that operates analogously to specific logic circuits.

Abstract

When two or more different video streams are compressed for concurrent transmission of multiple compressed video bitstreams over a single shared communication channel, control over both (1) the transmission of data over the shared channel and (2) the compression processing that generates the bitstreams is exercised taking into account the differing levels of latency required for the corresponding video applications. For example, interactive video games typically require lower latency than other video applications such as video streaming, web browsing, and electronic mail. A multiplexer and traffic controller takes these differing latency requirements, along with bandwidth and image fidelity requirements, into account when controlling both traffic flow and compression processing. In addition, an off-line profiling tool analyzes typical video applications off-line in order to generate profiles of different types of video applications that are then accessed in real-time by a call admission manager responsible to controlling the admission of new video application sessions as well as the assignment of admitted applications to specific available video encoders, which themselves may differ in video compression processing power as well as in the degree to which they allow external processors (like the multiplexer and traffic controller) to control their internal compression processing.

Description

LATENCY-BASED STATISTICAL MULTIPLEXING
BACKGROUND OF THE INVENTION Field of the Invention The present invention relates to the compression and transmission of video signals, and, in particular, to the compression and transmission of multiple compressed video streams over a single, shared communication channel.
Cross-Reference to Related Applications This application claims the benefit of the filing dates of U.S. provisional application no.
60/114,834, filed on 01/06/99 as attorney docket no. SAR 13370P, U.S. provisional application no.
60/114,842, filed on 01/06/99 as attorney docket no. SAR 13409P, and U.S. provisional no. 60/xxx,xxx, filed on 12/15/99 as attorney docket no. SAR 13614P using U.S. Express Mail Label No.
EL416189565US.
Description of the Related Art
Whenever two or more different video applications share a single communication channel having a finite bandwidth, management of the allocation of that bandwidth to those different applications needs to be performed, at least at some level. In a fixed multiplexing scheme, each application is assigned a fixed - although possibly different - allocation of the total available bandwidth, where the sum of the fixed bandwidth allocations is not greater than total channel bandwidth. Fixed multiplexing schemes are appropriate for video applications having constant or at least relatively constant bit rates, or for situations in which the total available channel bandwidth is greater than the sum of the maximum bandwidth requirements for all of the video applications. Many video applications, on the other hand, have variable bit rates. A conventional MPEG-2 video encoder, for example, encodes sequences of video images by applying a repeating pattern of frame types referred to as the group of picture (GOP) structure. For example, a typical 15-frame GOP structure may be (IBBPBBPBBPBBPBB), where I represents a frame encoded using only intra-frame encoding techniques, P represents a frame encoded using inter-frame encoding techniques based on a previous anchor (i.e., the previous I or P) frame, and B represents a frame encoded using inter-frame encoding techniques based on either a previous anchor frame (forward prediction), a subsequent anchor frame (backward prediction), or an average of previous and subsequent anchor frames (interpolated prediction). B frames are never used as anchor (i.e., reference) frames for encoding other frames. In typical video sequences, I frames require significantly more bits to encode than P and B frames. In addition, since predictive encoding schemes, like MPEG-2, take advantage of similarities between frames, frames associated with scene changes in video imagery, where frame-to-frame similarity is often low, will also typically require more bits to encode than those frames in the middle of a scene. As such, the compressed video bitstream for a typical video sequence encoded based on a video compression scheme like MPEG-2 that relies on a relatively steady GOP structure will have a variable bit rate profile typically consisting of relatively narrow "peaks" of high bit rate corresponding to I frames and/or scene changes, separated by relatively wide "valleys" of lower, more uniform bit rate corresponding to sequences of P and B frames.
For such non-uniform bit-rate video applications, fixed multiplexing schemes which allocate bandwidth based on peak bit-rate requirements may be inefficient, because most of the time (i.e., the time corresponding to the lower bit-rate valleys), any given video application will not be using its full allocation of bandwidth. For such applications, statistical multiplexing may be applied to improve the efficiency of bandwidth usage.
Statistical multiplexing can be defined as:
(a) the control required for allocation of bits in proportion to the complexity and importance of each video application within the limits of control allowed by each video encoder, such that: (i) the aggregate instantaneous bit rate is less than or equal to the channel capacity;
(ii) the minimum quality of service (QoS) requirements for all applications are met; and (iii)the quality is maximized for applications in the order of their importance; and
(b) the control required in pathological cases, where the aggregate instantaneous bit rate is greater than the channel capacity, to minimize the loss in QoS for as minimal a number of applications as possible.
To achieve these levels of control, statistical multiplexing takes into account the variations in bit rate of different video applications when allocating transmission bandwidth.
Statistical multiplexing schemes often involve the implementation of a dynamic bandwidth manager that controls the allocation of bandwidth to the various video applications in real time. Such bandwidth managers are able to monitor the real-time bit-rate demands of the different video applications to control the transmission of data from those different applications over the shared communication channel.
For conventional video applications, such as video streaming which involves the one-way transmission of a compressed video bitstream from a video server to one or more remote users, the quality of service depends on the fidelity and uniformity of the video playback, where collectively high fidelity and high uniformity typical mean (1) uniform, full frame rates and (2) uniform high image quality both within each frame and between consecutive frames. For these applications, the end-to-end latency involved in the processing is of less importance. As such, the primary concern of bandwidth managers for conventional statistical multiplexing schemes involving conventional video applications is to ensure that there will always be sufficient data in the receiver buffer at each user node to provide high fidelity, uniform video playback to each user.
High levels of latency, however, are not acceptable for all video applications. Many interactive video applications, such as video conferencing and distributed video games where two or more remotely located users compete against each other, require relatively low levels of latency - in addition to high levels of uniformity and fidelity — for acceptable QoS levels. Moreover, in many multiplexing situations, different video applications will have different latency requirements. Furthermore, the latency requirements of even some individual video applications, such as web browsing, may vary over time, when the type of video service changes during the application session. For all these situations, conventional multiplexing schemes ~ even conventional statistical multiplexing schemes — will not provide acceptable QoS levels, because they do not take into account the different and varying levels of latency required by the different video applications being multiplexed for transmission over a shared communication channel.
SUMMARY OF THE INVENTION
The present invention is directed to statistical multiplexing schemes that do take into account the corresponding latency requirements of different video applications (in addition to other factors such as uniformity and fidelity of video playback) when managing the bandwidth of a shared communication channel. According to embodiments of the present invention, the statistical multiplexing takes latency into account to provide (a) traffic control (i.e., the control of how the data for multiple compressed video bitstreams is transmitted over the shared communication channel) as well as (b) some level of control over the actual compression processing used to generate those bitstreams for the different video applications.
According to one embodiment, the present invention is a method for controlling transmission over a shared communication channel of multiple compressed video bitstreams generated by a plurality of video encoders and corresponding to multiple video applications, comprising the steps of (a) receiving information for each compressed video bitstream wherein at least two of the video applications have different latency requirements; (b) controlling the transmission of data from the multiple compressed video bitstreams over the shared communication channel taking into account the information for each compressed video bitstream and the latency requirement of each corresponding video application; and (c) adaptively controlling compression processing of at least one of the video encoders taking into account the information for the corresponding compressed video bitstream and the latency requirement of the corresponding video application.
According to another embodiment, the present invention is a video processing system for controlling transmission of multiple compressed video bitstreams corresponding to multiple video applications over a shared communication channel, comprising (a) a plurality of video encoders, each configured to generate a different compressed video bitstream for a different video application, wherein at least two of the video applications have different latency requirements; and (b) a controller, configured to (1) receive the compressed video bitstreams from the video encoders; (2) control the transmission of data from the multiple compressed video bitstreams over the shared communication channel taking into account information for each compressed video bitstream and the latency requirement of each corresponding video application; and (3) adaptively control the compression processing of at least one of the video encoders taking into account the information for the corresponding compressed video bitstream and the latency requirement of the corresponding video application. According to yet another embodiment, the present invention is a controller for controlling transmission of multiple compressed video bitstreams corresponding to multiple video applications over a shared communication channel, in a video processing system further comprising a plurality of video encoders, each configured to generate a different compressed video bitstream for a different video application, wherein at least two of the video applications have different latency requirements, wherein the controller is configured to (1) receive the compressed video bitstreams from the video encoders; (2) control the transmission of data from the multiple compressed video bitstreams over the shared communication channel taking into account information for each compressed video bitstream and the latency requirement of each corresponding video application; and (3) adaptively control the compression processing of at least one of the video encoders taking into account the information for the corresponding compressed video bitstream and the latency requirement of the corresponding video application.
BRIEF DESCRIPTION OF THE DRAWINGS Other aspects, features, and advantages of the present invention will become more fully apparent from the following detailed description, the appended claims, and the accompanying drawings in which: Fig. 1 shows a block diagram of a video processing system, according to one embodiment of the present invention;
Fig. 2 shows an assumed piecewise linear cost function based on latency; Fig. 3 shows a system-level block diagram of computer system, according to one embodiment of the present invention;
Fig. 4 shows a board-level block diagram of each encoder board of the computer system of Fig. 3; and
Fig. 5 shows a board-level block diagram of the statistical multiplexing board of the computer system of Fig. 3. DETAILED DESCRIPTION Fig. 1 shows a block diagram of a video processing system 100, according to one embodiment of the present invention. Video processing system 100 compresses multiple video streams corresponding to different video applications for transmission over a single shared communication channel 116. The different video applications may include any suitable combination of different type of video applications including video conferencing, interactive video games having different levels of sophistication, web browsing, and electronic mail. Depending on the implementation, the shared communication channel may be any suitable transmission path that supports the concurrent transmission of multiple data streams, such as Ethernet, TCP/IP, Broadband networks, satellite, cable transmission, ADSL, DSL, and cable modem.
In particular, one or more application servers 102 provide multiple video streams to a service admission manager 104, which manages the admission of new video application sessions onto the system. In response to a request for admission by a new video application (received from application request queue 106), service admission manager 104 (a) determines whether to accept the request and admit the new video application and, if so, (b) assigns the new video application to an appropriate video encoder.
As indicated in Fig. 1 , video processing systems in accordance with the present invention have multiple video encoders available to perform the required video compression processing for the different video applications, where different video encoders may provide different levels of video compression processing power (e.g., in terms of frame rate and image fidelity). In general, differing levels of video compression processing power make these different video encoders more or less suitable for different video applications having differing bandwidth and latency requirements. High-demand video applications, such as high-end interactive video games, typically have high bandwidth and low latency requirements. At the other end of the spectrum, low-demand video applications, such as web browsing and electronic mail, typically have low bandwidth and high latency requirements. In between are video applications, such as video streaming and video conferencing, that typically have intermediate to high bandwidth requirements and intermediate to low latency requirements.
In addition to video compression processing power, video encoders may also differ in the degree to which external processors are able to control the details of their internal compression processing. For example, some video encoders may provide external control only at the frame level (e.g., in terms of specifying target bit rates and/or average quantization levels per frame). Other video encoders may also provide external control at the sub-frame level (e.g., in terms of specifying target bit rates and quantization levels at the slice or even macroblock level within each frame).
Although video compression processing power and the degree of external control over internal compression processing are technically both continuous and independent parameters, video encoders can be grouped into three basic classes, as shown in Fig. 1. Class 1 encoders 108 provide relatively high levels of video compression processing power (e.g., in terms of high frame rates and high image fidelity), while providing relatively low levels of external control over their internal video compression processing. Class 1 video encoders, such as typical hardware encoders, are suitable for video applications requiring both high bandwidth and low latency, such as high-end interactive video games.
Class 2 encoders 110 provide slightly lower levels of video compression processing power than Class 1 encoders 108, but higher levels of external control over their internal video compression processing. Class 2 video encoders, which are typically high-end software encoders, are suitable for (a) video applications requiring slightly lower bandwidth and/or slightly higher latency, such as video streaming applications and low-end interactive video games.
Lastly, Class 3 encoders 112 provide even lower levels of video compression processing power than Class 2 video encoders 110 with similar or higher levels of external control over their internal video compression processing. Class 3 encoders, which are typically low-end software encoders, are suitable for non-time-critical (i.e., high latency) applications with either high or low bandwidth requirements, such as web browsing and electronic mail.
As shown in Fig. 1, video processing system 100 also has a multiplexer (mux) and traffic controller 114 (also referred to herein simply as the multiplexer), which controls the transmission of data from the compressed video bitstreams generated by the various video encoders over the shared communication channel 116. In addition, controller 114 uses information corresponding to the various compressed video bitstreams (generated by the various video encoders) to generate control signals that are transmitted back to one or more of the video encoders to adaptively control - at least at some level - the video compression processing performed by those video encoders. The information may include current frame rate, number of bits per frame, picture type, picture duration, picture capture time, and other statistics, such as scene change information, picture variance, motion-compensated error variance, and mode statistics (e.g., number of intra vs. inter macroblocks). Depending on the implementation, different types of information can be generated and reported at frame level, slice level, or picture unit level.
As indicated in Fig. 1, controller 114 generates two types of video compression control signals: (1) coarse control signals used to control video compression processing, e.g., at the frame level and (2) fine control signals used to control video compression processing at a finer level, e.g., at the sub- frame level. Controller 114 transmits specific coarse video compression control signals to any of the individual video encoders over a coarse control bus 118. In addition, controller 114 transmits specific fine video compression control signals to any individual video encoders (e.g., Class 1 encoders 108 and Class 2 encoders 110) that provide finer external control (e.g., at the sub-frame level) over their internal video compression processing over a fine control bus 120. Coarse video compression control signals correspond to relatively high-level control over video compression processing and may include frame rates, target numbers of bits per frame, and/or average quantization levels over a frame. Fine video compression control signals, on the other hand, correspond to relatively low-level control over video compression processing and may include target numbers of bits per slice within a frame, average quantization levels per slice or even per macroblock within a frame. Other types of fine video compression control signals will be described later in this specification.
In addition to information for each compressed video bitstream, controller 114 takes into account both bandwidth and latency requirements of the various corresponding video applications when performing both its traffic control and compression control functions.
Video processing system 100 also has an off-line profiling tool 122, which analyzes, in non-real- time, typical sets of video sequences corresponding to different types of video applications and stores the results of those analyses in an application profiles server 124. The service admission manager 104 accesses information in the application profiles server 124 in order (1) to determine whether to admit a particular new video application and, if so, (2) to determine to which video encoder to assign the newly admitted video application. In addition, controller 114 also accesses information in the application profiles server 124 in order to (1) determine an acceptable level of buffering for at least one video application and (2) order packets of data from different video applications. Moreover, if there is profile information on the nominal MQUANT and MQUANT tolerance that can be used to encode a particular application, the controller can attempt to maintain this constraint on all the encoders. As another example, if region of interest information is available, and slice level MQUANT setting is possible, the controller can intelligently trade-off and change the MQUANT over a frame. Similar control for frame- rate and spatial resolution is also possible.
According to the embodiment shown in Fig. 1, video processing system 100 has one or more Class 1 encoders 108, one or more Class 2 encoders 110, and one or more Class 3 encoders 112. It will be understood that, in alternative implementations of the present invention, video processing systems may have fewer or more different classes of encoders available, including those (hardware or software) encoders that provide no degree of external control over their internal video compression processing. With this latter class of "uncontrolled" encoders, the traffic controller processes the corresponding received compressed video bitstreams for transmission over the shared communication channel in an open-loop manner. Nevertheless, even in these situations, the traffic controller may be able to exercise some "post-processing" control by altering the bitstream before transmission by dropping frames or even replacing portions of frames such as slices or individual macroblocks with special skip codes. Since the encoders will be unaware of these changes, such post-processing control may adversely affect the quality of the video playback at the end users.
Furthermore, as new and improved software encoders provide higher and higher levels of video compression processing power, not to mention greater and greater levels of external control, hardware encoders might not be needed at all in video processing system 100, even for high-end interactive video games.
The main operations of video processing system 100 correspond to three different generic functions: (1) off-line application profiling for content classification (implemented by off-line profiling tool 122), (2) service admission processing (implemented by service admission manager 104), and (3) traffic and compression control (implemented by controller 114). Each of these three functions is described in further detail in the following sections.
Off-Line Application Profiling for Content Classification As mentioned earlier, off-line profiling tool 122 analyzes, in non-real-time, typical sets of video sequences corresponding to different types of video applications and stores the results of those analyses in application profiles server 124. In a preferred implementation, the profiling is semi-automatic and each video application is characterized according to the following parameters: (a) Level of interactivity (related to latency tolerance); (b) Extent of frame-to-frame motion (both peak and average);
(c) Encoding resource requirement (i.e., identification of acceptable classes of encoders) and the levels of external control offered by those encoders;
(d) Type of graphics driver and ability to intercept the graphics commands;
(e) Bit rates required (both peak and average) for acceptable quality. The peak can be obtained by performing I-frame-only encoding at an acceptable average frame-level quantization (MQUANT) level and picking its peak. The average bit rate can be obtained by IP-only encoding (i.e., no B frames) at the same MQUANT level.
(f) Minimum frame rate required to achieve acceptable quality for the application.
(g) Required spatial resolution determined by identifying the highest spatial frequency present (e.g., from quantized DCT coefficients) and characterizing how critical the high-frequency components are for the application.
(h) Region of Interest (Rol): In many applications, especially video games, the Rol can be bounded within a region. Knowledge of this can help the encoder as well as the multiplexer.
(i) Objectionable artifacts: Some applications may be very sensitive to frame dropping, others may be sensitive to slice dropping, and still others may be sensitive to spatial adaptation of the quantizer.
This profile will suggest the best overflow handling strategy at the multiplexer as well as the best way to control the encoder.
After a sufficient number of video applications have been analyzed off-line according to the preceding parameters, profiling tool 122 processes the various results to make generalizations about groups of video applications based on their collective similarities and respective differences in order to generate rules used by video processing system 100 in real-time processing of other video applications. Such profiling can be relatively simple, such as characterizing the level of interactivity of different video applications as either "high," "intermediate," or "low." Alternatively, more and more sophisticated schemes can be implemented. The resulting profile information is stored in application profiles server 124 for eventual use by service admission manager 104 for initial service admission as well as by controller 114 for traffic control and multiplexing. In addition, the service provider for a particular video application may be able to maintain user profiles which indicate the behavior of particular users (such as type of games played, levels reached, typical browsing patterns, etc.). This information might only be used as a second-order control, since there may be multiple users with access to a particular user node.
Service Admission Processing
Service admission manager 104 determines the mix of the active applications at any given time. The main task of this tool will be to ensure that only services for which (a) the required encoder resources are available and (b) a minimum QoS can be guaranteed for the entire session, are admitted into a multiplex pool. The service admission decision is based on the profiles of the applications that are requested. In one possible implementation, the different video applications are divided into the following classes:
(Cl) High-end video games having very stringent latency requirements, high motion, and high spatial complexity, requiring hardware encoders to achieve high bandwidth and low latency, even though there is little external control over the video compression processing;
(C2) Low-end video games having moderate to high latency requirements and lower encoding complexity, that can be processed using high-end software encoders to achieve low latency; and
(C3) Web browsing and e-mail applications with high latency requirements that can be processed using low-end software encoders.
When a request is made to add a new application, service admission manager 104 obtains the following information from application profiles server 124:
(1) Class of application (e.g., video game (high-end, intermediate, or low-end), web browsing, e- mail, etc.);
(2) Interactivity of application (usually represented as latency requirement and classification in the profiles server) used in classifying the service, service admission, assignment of resources, control of encoder, and traffic control;
(3) Motion extent used to determine the frame rate required for the application, which is used by the controller to control the encoders. It can also be used for resource allocation to assign an encoder to the application; (4) Peak bandwidth required; and
(5) Average bandwidth required.
Based on this information, service admission manager 104 will admit the new application, if and only if both of the following two rules would be satisfied after admitting the new application:
(a) Sum of the peak bandwidths for all Cl applications plus sum of the average bandwidths for all C2 applications is less than the total channel bandwidth; and
(b) Sum of average bit rates of all applications (i.e., Cl, C2, and C3) is less than the total channel bandwidth; and (c) Encoding resources are available for the new application.
The first rule is fairly conservative, and applies to relatively simple implementations of the present invention. For more sophisticated implementations in which controller 114 is provided a high degree of control (i.e., more fine control) over the video compression processing implemented within the various video encoders, the first rule can be relaxed. Such fine control may involve control of slice-level and even macroblock-level quantizers as well as the staggering of intra frames across different applications (to ensure that a limited number of applications have intra frame within the same frame time). In that case, service admission manager 104 can use a more complicated formula depending on the QoS requirements of the various video applications and take further advantage of the statistical nature of video streams. Thus, more applications across the various types may be able to be admitted, as compared to the above solution, which is constrained based on the peak bandwidths of the Cl applications. Note that the motion extent and interactivity can also be used to allocate encoding resources to application.
An alternative call admission strategy would be to replace the stringent first condition by: Maximum of the sum of the peak bandwidths of concurrent I frames possible at a time based on the GOP structures for Cl applications + the sum of the average bandwidths of the remaining applications is less than the total channel bandwidth. Such a policy would allow more Cl applications. However, it should be noted that the probability of not meeting the minimum QoS at a given time instant increases as the number of active applications increase.
GOP Structure and Big Picture Handling
In one implementation of video processing system 100, low latency applications are assigned to video encoders that use only short GOP structures having only I and P (and no B) frames, such as IPPP, where every fourth frame is an I frame. Using shorter GOP structures supports interactivity. However, since I frames appear so frequently, hardware encoders may be required for such applications. In any case, the GOP period should be less than two seconds to handle errors as well as to allow decoder resynchronization when the user flips through channels. For some software encoders that provide a high degree of external control, an adaptive intra-refresh strategy can be used to avoid having to send I frames so frequently. Instead, different parts of eacii frame are intra-refreshed in different P pictures over a period corresponding to a chosen GOP size.
Traffic and Compression Control
Multiplexer and traffic controller 114 handles the following tasks:
(a) advance bit allocation for each video encoder based on the spatial and temporal quality desired for the corresponding application,
(b) multiplexing the different bitstreams while meeting the latency requirements of each application, and
(c) handling the pathological cases in such a way to minimize noticeable QoS degradation and to communicate the handling strategy to the controllable encoders.
Due to the varying degrees of control available at th. different encoders, the bit allocation and buffer control range from a mere frame-level interaction between controller 114 and each encoder to finer levels, such as at the slice- or even macroblock-level. In addition, the fact that the different applications are not frame synchronized can be exploited to provide frame- (or finer) level control of other services, while responding to an unexpectedly high instantaneous bit rate from a particular service. In other words, the individual encoders can be staggered with respect to one another over the frame time to allow controller 114 to control the compression processing for certain applications based on the results of compression processing for other applications that fall later within the same frame time.
For one implementation of video processing system 100, the impact of the varying degrees of control and the varying QoS requirements for each class are briefly summarized below: o Class Cl applications: These are encoded using hardware encoders that may provide external control over only the specification of frame- level target number of bits and average MQUANT over the frame. o Class C2 applications: These are games that are software encoded and do not take a very large bandwidth. The applications are encoded without B frames using GOP structures in which I frames may be encoded at relatively large intervals. Implementing an adaptive macroblock refresh strategy that will intra-code a fraction of the macroblocks in every P frame can support switching back and forth between applications while containing error propagation as well. This will smooth out the bit profile. Any variations will come from content and not from the GOP structure and picture types. Note that Class C2 applications require low latency encoding/multiplexing. Controller 114 acts as a video rate controller and controls the picture type, rate, etc. The control is hierarchical: at one level, picture type and frame- level targets are controlled; at another level, slice-level targets are controlled. The adaptive refresh strategy is also staggered across the different mid-range encoders and are scheduled to coincide with the valleys between the peaks of the Class Cl applications whenever possible. o Class C3 applications: It is assumed that web browsing and email applications have virtually no QoS requirements compared to Class Cl and C2 applications. Class C3 applications can be scheduled in the gaps and valleys of the bit profiles of the other services, so as to increase channel utilization. Hence, their latencies can be quite high (of the order of several frame times). For more sophisticate encoding and multiplexing strategies, a dynamic QoS for thess services can be determined on the fly and bandwidth allocation proportional to this dynamic QoS can be made.
Advance Bit-allocation to Various Sources
Advance bit allocation refers to allocation o 'a fraction of the instantaneous bandwidth to each encoder based on its past statistics without actually l nowing the actual complexity of the current frame. This is important for applications having low-latency requirements, which preclude look-ahead based bit allocation. The advance bit allocation for each encoder is implemented based on:
(a) the minimum spatial quality setting needed for the corresponding application;
(b) the complexity and average MQUANT for tlie previous frame of the same picture type; and
(c) the encoder buffer fullness.
In addition, the control can also decide to skip frames based on the quality requirements.
Since the applications are not synchronized at the frame level, a frame-level target is computed for the encoder that will start encoding a frame next (at any given time), based on the average MQUANT chosen for that encoder. Using a rate-distortion model linking bit consumption, average MQUANT, and motion compensated distortion, and enforcing constraints on MQUANT, the bit count for a frame can be estimated from prior data. An example of the constraint on MQUANT can be that the quality is uniform across the applications, while ensuring that the temporal rate of change of average MQUANT is within a tolerance threshold. The channel bit rate is divided between the applications according to their respective complexities and relative significance. The complexities are updated on the fly, and the relative significance can be obtained from the results of off-line profiling stored in application profiles server 124.
For the less controllable encoders, only the frame-level target (or average MQUANT) might be able to be communicated to the encoder. For the more controllable encoders, the basic unit of operation will be a slice (e.g., a row of macroblocks). Because the encoders are not synchronized, this will require a worst-case buffer requirement of 2 slices. A slice-level target is computed for each controllable encoder based on the frame target, the buffer fullness for that encoder (which is indicative of the buffer delay), and the instantaneous bit rate available after deducting the bits (within a latency window) from the less controllable encoders. The slice targets are also constrained by the fact that MQUANTs cannot change too much within a frame.
For Class C3 applications, a one-frame bit buffer is used. In other words, the encoders encode a new frame only after all the bits for the frame that was encoded before the last frame have been transmitted by controller 114. This on-demand encoding eliminates the possibility of congestion due to Class C3 services. Other strategies to tune the encoding to suit the application's demands are discussed in the following section.
Channel Bandwidth Allocation - Embodiment #1 Channel bandwidth allocation is different from the instantaneous bit rate from each encoder because of the mux buffer in controller 114. A certain amount of mux buffering is needed to prevent the individual rate controllers from entering into an oscillatory mode, constantly correcting the allocation and ending up with a highly varying spatial quality rcross a frame. However, the statistical multiplexing gain tends to be higher as multiplex is performed at a finer level. Hence, the actual amount of buffering has to be chosen carefully. The exact amount of buffering at controller 114 for particular applications depends on their latency requirements and the strategies used for handling pathological cases. The channel bandwidth allocation step implemented by controller 114 ensures that the latency requirement for each application is met. For example, up to 10-ms latencies can be allowed for the multiplexing delay for Class 1 and 2 encoders. Alternatively, mux buffering can be tailored based on actual data. The allocation decisions for all applications are made at the slice level. After all the bits for a slice in each encoder arrive at controller 114, the allocation is made based on the buffer fullness and the latency requirement for the application. This can be done in two steps: (1) each application in Classes Cl and C2 is allocated a bandwidth that is the minimum of the buffer occupancy and the slice-level bit rate used by service admission manager 104, and (2) the remaining bandwidth, if any, is then distributed among all the applications, in turn, to meet their latency requirements. Class Cl applications take precedence over Class C2, and Class C2 takes precedence over Class C3. Hence, Class C3 bits are transmitted only when bits remain in an allocation after the latency requirement for Classes Cl and C2 are met. The buffer occupancy is maintained below the maximum allowed buffer delay for a service during normal operation. The exceptions (i.e., when the requirement for Classes Cl and C2 cannot be met) are handled under the pathological cases.
Channel Bandwidth Allocation - Embodiment #2
Assume that the following profile is available for each frame (or data unit) of the source: (1) Lnom (Nominal Latency): This is the latency up to which the user will not perceive any appreciable decrease in quality; and (2) Lmax (Maximum Latency): This is the latency above which quality is completely unacceptable to the user. As such, if latency will exceed this, the frame might as well be dropped.
Fig. 2 shows an assumed piecewise linear cost function based on latency. This is the quality measure in terms of latency for a frame that will be used in statistical multiplexing. The costs Ca and Cb in Fig. 2 are obtained from off-line profiling.
For the control system, the following variables are described. Assume that the current time is Tcurr, and let the time for encoding a frame of encoder i be Tfi.
Definitions State of system
The state of the system is described by a set of vectors, Pij = {Nij = number of bits in frame j of encoder i, Tij = time spent by frame so far in physical multiplexer (PM) buffer}, where i=0,l,2,...N, where N = number of encoders, j runs over the fram3s in PM buffer for encoder i.
Input Measurements
In the control system, the following measurement data is received from the encoders: o Picture capture time; o Picture type; o Picture duration; o Average MQUANT used to encode the picture; o Number of bits used to encode the picture; o Advanced statistics such as macroblock variance and other macroblock activity measures; o Whether the picture corresponds to a scene change; and o Similar information for different groups of macroblocks within a picture. The collection of such information over an interval {Tcurr-M*Tfi, Tcurr} , is denoted as Mij for all the frames in that interval.
Output Measurements
Output measurements are derived from input measurements and from the state of the system. Essentially this measurement is the latency of a frame Lij = {Tij when the last bit leaves buffer} and {Spatial quality measured by average MQUANT} . The controller attempts to control and minimize these costs.
Traffic Control and Allocation of Channel Bandwidth Among the Sources Each encoder has Mi frames in the buffer, some of which may be partial frames. Let bij be the bits transmitted from each frame of each encoder i. The problem is then to allocate bij such that Σbij<=Bagg, while ensuring that the frame latency is met. The following iterative procedure provides this:
(1) Initialize bij. If Bagg is the total bits available, bij is chosen to be proportional to Cost (time_spent_so_far) . (2) Given bij, calculate the expected frame latency αij = Expected Value {frame latency | bij, Pij,
Mij } . This is a modeling problem that estimates the time spent by frame ij in the physical multiplexer, given the current state and current measurements of the system, and the current allocation. This is accomplished by simulating the action of the physical MUX over the next few time-grains (until the frame is transmitted). This involves prediction of future values of bij, which can use the same formula as the initialization step 1.
(3) Update bij in proportion to the expected latencies of the sources.
(4) Repeat Steps 2 and 3 until convergence when bij is stable, i.e., does not change by a large amount. A formula ||Δbij||<x*||bij|| is used, where x is nominally 10%.
Congestion Control
Good service admission procedures can reduce the number of pathological cases for hardware encoders. Still, pathological cases will happen due to the fact that profiling cannot provide accurate slice peaks. Small deviations in latency requiremen s can be relaxed, hoping that the rest of the frame will not be equally hard to code. Controller 114 may drop packets, but then processing cannot recover till the next I frame. If picture types can be requested, then controller 114 can request an I frame from the encoder after dropping packets. If picture type cannot be dictated, it may be preferable to delay the frames instead of allowing packet dropping. Then, at the next I frame, the buffer can be flushed thereby dropping packets right before the I frame and resynchronization can then be established with the I frame. For software encoders, the tighter control explained before will significantly reduce catastrophic breakdowns. However, in case it occurs, controller 114 drops slices and communicates that information back to the encoder. The encoder can keep track of the decoder state. A good strategy at the multiplexer is to drop the whole slice, and instead send a slice with all skipped macroblocks instead. If the encoder knows this information, it can refresh these macroblocks so that the decoder can recover. Alternatively, the encoder may have the ability to save a previous reference frame. In that case, when controller 114 drops a P frame or even just a slice of a P frame, it can inform the encoder so that the encoder will use the previous P frame for subsequent encoding, thereby avoiding prediction errors between the encoder and decoder.
Encoder Optimizations and Tuning For Low-Latency Applications The overall system latency is the sum of the latencies introduced by the following components: (1) Decoder Latency: At worst case, this is a delay of 2-frame duration, including the decoding delay and the display delay. A higher frame rate will lead to reduced decoder latency. For frame pictures, this delay will be 66 ms for 30 frames per second (fps). This latency can be reduced by up to 16.5 ms by using field pictures, instead of frame pictures.
(2) Encoder latency: The encoder is assumed to be reasonably pipelined and the delay is assumed to be about 40% of frame delay. In that case, the delay is roughly 15 ms for a 30-fps transmission. Further computational pipelining of the encoder can reduce this number.
(3) Mux buffering at controller 114: This is a buffering delay that can be used for rate control. It is expected to have about 5- 10ms of buffering that can be used for this purpose.
Of these latencies, it is assumed that only the encoder and mux buffer latencies can be controlled. Increased buffering latency at controller 114 is desirable from the rate-control point of view since it gives more time for controller 114 and the encoder to respond to changing traffic conditions. It is assumed that the latency at the decoder cannot be controlled, although this knowledge can be used to design coding modes that reduce this latency.
The latency estimates for video processing system 100 total less than 100 ms.
Strategies for Reducing Latency
As latency is reduced in specific components, greater ability is obtained to fine-tune the encoder and adapt to changing content and traffic conditions using some or all of the following strategies: o Simple Profile Encoding: Since B pictures lead to re-ordering delays, in order to maintain low latency, encoding is performed with only I and P pictures. In addition, using dual-prime motion vectors can result in improved compression efficiency for IP-only encoding. o Pipelining the encoder: Computational pipelining refers to performing all the encoding tasks on a minimum unit of encoding, e.g., macroblock, slice. Typical hardware encoders use hierarchical motion search and cannot be pipelined entirely. On the other hand, in software encoders, the hierarchical motion estimator can be tailored to start a slice-level pipeline after 3 rows of macroblocks are available. o Field pictures: One possibility is to perform field-picture encoding (even though material is progressive). The decoding delay will only be one field interval and this will save lA frame interval in decoding delay. The encoding algorithm would have to be tailored for this coding mode. The fields can either be from the same progressive frames at 30 frames/sec in which case the top and bottom fields are at the same time instant, or they can come by sampling at 60 frames/sec and throwing away alternate fields. The latter solution may better match the interlaced display in the home. In both cases, special preprocessing may then become necessary. The algorithms can be tailored to enable good quality while using this field-picture mode. Algorithmic Improvements for Game/Web Content Encoding
In addition to the above-mentioned low latency improvements, a number of other possible improvements can be implemented to improve the coding performance, as well as reduce latencies for graphics and web content.
Pre-Encoding of Static Portions of Web/Email Frowsers
If browser signals were intercepted, it would be possible to pre-encode the various options and pop-up menus. This can lead to better I-frame coding of the static portions and so will require fewer bits subsequently. The constancy in the quality of the browser menus and icons will improve the perceptual quality considerably. The encoding latency will be reduced, though this is not a major issue in these applications. However, the savings in cycles could be significant enough to allow more web/email users to be admitted at the same time.
Region-of-Interest Encoding Many games have specific regions of interest that are of more importance to the player. For example, most games have a center-weighted region of attention. This can be exploited in the bit- allocation strategy within a frame. Furthermore, it can also be used for intelligent packet-dropping at controller 114 when buffer or latency requirements are not met.
Encoder Parameter Tuning
The following encoding parameters can be tuned to improve the compression efficiency for game/web content. Note that hardware encoders are usually tuned for natural video scenes and hence might not perform as well on graphics and text content.
(a) Rate control initialization: A careful initialization of the rate control to match the multiplexer operation as well as the GOP structure can provide substantial improvements in quality.
(b) Quantizer matrix selection: The quantizer matrices commonly used are tailored to natural video. Matrices can be developed that are tailored to graphics and text.
(c) Perceptually adaptive quantization: In MPEG-2 encoding, the complexity or activity of a block is used for perceptually adaptive quantization. These computations should be modified for graphics and text content, and different measures of activity and distortion should be used.
(d) Pre-processing: The final output display de-ice is interlaced, even though the encoded material is progressive. Further, field picture coding modes are proposed to reduce latency. Thus, suitable preprocessing by vertical filtering, etc. is essential for good display quality.
(e) Low-latency scene change detection: If scene changes are quickly detected, controller 114 can be provided with this information to allow it to respond by changing the allocations for various applications and perhaps postponing intra frames on other channels whenever possible. (f) Encoding complexity estimation: Rate-distortion models enable prediction of encoding complexities for a frame based on distortion parameters. These models will be useful for the advance allocation statistical multiplexer. However, the models have mostly been developed for natural video and need modifications for game and web content.
Distributed Intra-Refresh Strategies
A large amount of application bit-rate fluctuations come from changes in picture types with I frames typically using more bits than P frames. This fluctuation can be reduced by distributing the intra- coding of macroblocks over a number of P frames. In the absence of scene changes, this strategy can yield a relatively smooth bit-rate profile. This choice can easily be implemented on software encoders, but not on hardware encoders.
Motion Estimation Complexity Reduction
In text browsing application, motion is typically very even and translational across a region of the image. This assumption can be used to reduce the complexity of motion estimation. For example, within a row, motion estimation could be performed on a subset of the macroblocks and if the motion is determined to be similar, the same motion vector can be used for the other macroblocks.
Motion estimation complexity can be reduced by exploiting the knowledge about the graphics commands. Intercepted graphics commands can be used to quickly and accurately estimate motion without going through the complete search process. Again, this may lead to significant computational savings.
Dynamic Frame Rate Selection and Spatial Resolution Change
The frame rate can be dynamically adjusted based on the content and the state of controller 114. In cases where the channel is overloaded, frame rates could be reduced to maintain acceptable spatial quality. Note that this solution will mainly work for intermediate- to low-interactivity applications. Another innovation would involve dynamic changes is spatial resolution (to half-horizontal, for example), whenever the content is less detailed or whenever channel constraints so dictate. In MPEG-2 encoding, this is done at the GOP level, rather than at the picture level. However, this is a better response to channel congestion than the catastrophic case handling described in the previous section.
Dynamic GOP Structure
The GOP structure can be limited to a relatively simple structure consisting of an I frame followed by a number of consecutive P frames. The frequency of I frames can be dynamically adjusted by controller 114 across the encoders in order to stagger the I frames to take advantage of statistical multiplexing gains. In many cases, due to scene changes, an encoder might start I-frame encoding at instances when it was not scheduled. In those cases, controller 114 should delay and reschedule I frames for the other encoders in order to maintain QoS across different applications.
Miscellaneous Features In addition, depending on the implementation, controller 114 may be able to perform one or more of the following miscellaneous features: o Scheduling I frames based on advance knowledge acquired from the application. In general, controller 114 uses advance knowledge from a video application to control the encoding process for that application. One method is when an application like a web-browser can anticipate a scene change when a user clicks a new page, and inform controller 114. Controller 114 can anticipate a large bit rate for the frame and use it to control the compression processing of the other video applications as well as this particular application. For example, any scheduled I frames of the other applications can be switched to adaptive refresh mode. o Use of adaptive intra-refresh for handling scene changes. This can include the use of intra- macroblocks in the region of interest as a means of control when a scene change has occurred. o In case controller 114 cannot match the latency requirement for a particular video application, it sends a signal back to the application delaying the application. Thus, the application knows that the user has not been given a chance to respond and thus pauses. This is useful in high-interactivity services like video games. This delay can be achieved by using the pause command available on many applications. o Use of region-of- interest (ROD information by controller 114. One way is for the encoder to send priority information on groups of macroblocks. Controller 114 then drops the low-priority regions in case of congestion. In addition or alternatively, controller 114 uses pre-encoded portions of the bitstream and does some bit-stream manipulation. This can be used in web-browsing and for backgrounds of games. In particular, the pre-encoded portions will be used for sections outside the ROIs, as a special method for handling ROI-based control.
Summary
The proposed statistical multiplexer tools offer the following advantages over other off-the-shelf multiplexers: 1. Exploiting the varying QoS requirements to improve channel utilization while providing an acceptable quality for all applications;
2. Reacting to the less controllable encoders by exercising rate control measures on the more controllable software encoders;
3. Taking advantage of the knowledge about the software encoder to improve perceptual quality; 4. Achieving low latency through advance allocation of bit budget and through proper buffer management at the multiplexer; 5. Making frame-level bit allocation proportional to content complexity; and
6. Performing graceful degradation of quality during congestion through better understanding of the effect of packet dropping from profiling and by effectively communicating with the controllable encoders.
Channel Surfing
In some cases, a user may decide to keep his initial application running on one channel while surfing other channels in order to return to the initial application. Or, he may run two sessions in parallel and switch between sessions. These cases should be handled effectively, including taking advantage of these situations to reduce transmission bandwidth. For example, after detecting that the user has moved to another channel (e.g., based on monitoring the return path and the content served), a low-bit-rate slide show (e.g., I frames spaced relatively far apart) can be sent for decoder resynchronization when the user comes back to the original interactive application. If the slide show lasts longer than a certain timeout period, the user's session can be automatically terminated. An alternative can be to save the game for later resumption.
Possible System Architecture
Low-delay MPEG2 video/audio encoding and statistical multiplexing are key technical requirements for many Digital Television (DTV) and digital cable TV applications. In a conventional low-cost PCI (Peripheral Component Interconnect) bus-based computer system, significant processing delays are contributed by the system control, program layer PES (Packetized Elementary Stream) and transport TS (Transport Stream) multiplexing, and the PCI bus. In particular, the PCI bus delay will introduce uncertain delays based on the PCI-BIOS (PCI Basic Input / Output System and the Windows™ operating system from Microsoft Corporation of Redmond, Washington. Computer systems in accordance with the present invention avoid PCI bus delay by using the built-in multi-channel Synchronized Serial Interface (SSI) ports of multiple Digital Signal Processors (DSPs), where each DSP performs video and audio encoder control, PES/TS layer multiplexing, and computation of statistical measurements of its corresponding video stream payload. The DSPs' on-chip memories may also eliminate the need for bitstream First-In, First-Out (FIFO) chips and some common SDRAM (Synchronized Dynamic Random Access Memory) chips.
Fig. 3 shows a system-level block diagram of computer system 300, according to one embodiment of the present invention. Computer system 300 is a PCI bus-based industrial PC (Personal Computer) enclosure with multiple PCI boards. In particular, computer system 300 comprises a PCI bus 302 configured with a Central Processing Unit (CPU) board 304, up to «=24 encoder boards 306, and a statistical multiplexing (stat-mux) board 308. Although computer system 300 relies on a PCI bus, it will be understood that any other suitable system bus could be used in alternative embodiments of the present invention.
CPU board 304 is a conventional industrial PC motherboard having a suitable central processor, such as an Intel Pentium III™ microprocessor by Intel Corporation of Santa Clara, California. In addition, CPU board 304 has a conventional PCI interface 310, an ISA (Industry Standards Association) bus interface 312, RS232 ports 314, a (e.g., 100-MHz) Local Area Network (LAN) interface 316, a hard disk / floppy disk (HD/FD) controller, and other standard PC periphery interfaces. Software (e.g., in the "C" programming language) implemented by the Pentium processor may provide main system controls, fault-tolerant controls, and/or statistical multiplexing of those bitstreams that do not have low- latency requirements.
Each encoder board 306 is an integrated video/audio encoder with an SDI (Serial Digital Interface or Serial Dl) or ASI (Asynchronous Serial Interface) input port 318, a video encoder 320, an audio encoder 322, a PCI bus interface 324, and a DSP controller 326 (with an SSI port 328) for board- level sub-system control and low-delay PES / TS multiplexing plus bitstream statistics parameter measurement.
Stat-mux board 308 has a PCI bus interface 330 and four DSP chips 332, where each DSP chip 332 has a six-channel SSI DMA (Direct Memory Address) 334 with six SSI ports 336, SRAMs 338, two DSP cores 340, and an ASI/TAXI™ chip set from Advanced Micro Devices, Inc., of Sunnyvale, California, and, in block 342, a DHEI (Digital High-speed Expansion Interface) I/O port from General Instrument Corporation (GI) of Horsham, Pennsylvania, for GI's modulator and CA (Conditional
Access) equipment. As such, stat-mux board 308 can support up to 24 channels of low-delay MPEG2 video/audio input bitstreams.
PCI bus 302 is used for power supply and system control for each PCI board. A DSP chip on each encoder board 306 will directly transfer low-delay MPEG2 bitstreams to a corresponding DSP on stat-mux board 308. In particular, each low-delay MPEG2 video/audio bitstream will be directly transmitted from the SSI port 328 of the corresponding encoder board 306 to an SSI port 336 on stat- mux board 308. The associated delay can be controlled to correspond to as few as four transport packet delays, with a two-packet delay in the encoder DSP 326, a one-packet delay at an input port 336 of stat-mux board 308, and a one-packet delay at an output port 342 of stat-mux board 308. In addition, PCI bus 302 can be used to transmit additional MPEG2 video/audio bitstreams that do not have low-latency requirements. Depending on the implementation, these high-latency bitstreams may be generated by video/audio encoders implemented in software within the central processor on CPU board 304.
Fig. 4 shows a board-level block diagram of each encoder board 306 of computer system 300 of Fig. 3, according to one embodiment of the present invention. Encoder board 306 comprises an internal board bus 402 configured with an input interface module 318, an MPEG2 video encoder 320, an AC3 or MP3 audio encoder module 322, a DSP controller 326 with PES/TS-layer multiplexing firmware, and 27-MHz SCR/PCR circuits 408, where SCR is the System Clock Reference in an MPEG video decoder and PCR is the Program Clock Reference in an MPEG transport decoder. Input interface module 318 can support both SDI and ASI circuits with a 270-MHz or 180-MHz line-coded clock, respectively. The SDI or ASI signals can be customized to interlace the uncompressed digital video data and multi-channel audio data. There is CPLD (Complex Programmable Logic Device) or FPGA (Field-Programmable Gate Array) based deframing firmware to split the video and audio data, and to reproduce the video synchronization signals for the MPEG2 video encoder chip. MPEG2 video encoder 320 can be any suitable single-chip encoder, such as those supplied by
IBM, C-Cube, or Philips, with supporting SDRAM, SRAM, and/or flash memories 404 and necessary glue logic circuits. The glue logic can be combined within the input CPLD firmware. There are also some downloadable micro-codes from the MPEG2 chip manufacturer.
Audio encoder 322 can be any suitable off-shelf DSP-based sub-system that can support either the AC3 or MP3 encoding function depending on the DSP software. If a TMS320c5410 DSP chip from Texas Instruments Incorporated of Dallas, Texas is used, then the audio encoding functions of audio encoder 322 can be combined with DSP controller 326, shared memories 406, and the PES/TS multiplexing firmware for less board area and lower integration costs.
Alternatively, DSP 326 may be a TMS320c5402 DSP from Texas Instruments. DSP 326 will provide of video encoder control, audio encoder control, the SCR/PCR time-base controls, and the overall board-level controls. It will also perform the PES/TS multiplexing of compressed video and audio bitstreams, and the statistical parameter measurements of the video stream. It will also execute the commands of statistical multiplexing controls received from PCI bus 302 of Fig. 3.
DSP on-chip SSI output port 328 can be directly connected to an SSI input port of a DSP on stat-mux board 308 of Fig. 3. The on-chip DMA will automatically move data from the TS output buffer of on-chip memory to the serial output port. The TMS320c5410 DSP has 128 Kbytes of on-chip memory and a DMA-controlled host interface port, such that external SRAM and FIFO devices may be eliminated. For example, when video encoder 320 is an IBM39 MPEGS422 video encoder chip, the video encoder can directly write its compressed video data into the TMS320c5410 on-chip SRAM with a simple CPLD to emulate the FIFO signals. The PES/TS MUX delay can be within transmitting two TS packets of video streams, such as 2 x 188 x 8 * video rate delay.
DSP on-chip timer 408 can also be programmed for the 27-MHz SCR/PCR time-base by incorporating on-chip PLL (Phase-Locked Loop) circuits. All of the 27-MHz clocks will be derived from the same 27-MHz clock on stat-mux board 308 through the clocks of the SSI ports connected to all of the encoder boards 306. Fig. 5 shows a board-level block diagram of statistical multiplexing board 308 of computer system 300 of Fig. 3, according to one embodiment of the present invention. Stat-mux board 308 is a low-delay Input/Output (I/O) interface PCI board with the statistical multiplexing system and PCR time- base correction firmware. Stat-mux board 308 comprises an internal sub-system bus 502 configured with four Texas Instruments TMS320c5420 DSP chips 332, each having six SSI serial ports 336 and 512 Kbytes of on-chip SRAM memory 338, such that stat-mux board 308 can receive up to 24 different channels of transport bitstreams.
Each SSI serial input port 336 has three wires carrying a clock signal (sclk), a data signal (sdat), and a frame signal. All 24 clock signals sclk should be configured as the input clock signals and connected to an on-board 27-MHz clock oscillator 504. 27-MHz clock 504 will also be used as the DSP clock, and on-chip PLL circuits will generate a 90-MHz DSP clock. In that case, on-chip timers can be used for the PCR time-base corrections. The frame signals will indicate whether or not the data signal sdat carries meaningful data. The data signals sdat are burst with a maximum rate of 27 Mbps. The frame signals can also be programmed in a "multi-channel mode" to send multiple packets into assigned on-chip buffers for transmitting the individual encoders' statistical parameters.
ASI interface 506 uses a TAXI transmitter chip with parallel interface from Advanced Micro Devices, such that there are FIFO and CPLD control circuits to handle the TAXI interface and ASI controls. A DHEI interface 508 from GI will need additional PLL circuits to generate the output clock, if there is no available input clock signal from DHET port 510. There are also the DHEI line drive chips for the proper bi-level output interface.
Although the present invention has been described in the context of a computer system in which each of the central processing sub-system, the statistical multiplexing sub-system, and each encoding sub-system is implemented on a separate computer board of the computer system, the present invention is not so limited. In particular, two or more of the different sub-systems could be implemented on a single board. Alternatively or in addition, any of the sub-systems could be implemented on more than one board. The important characteristics of the present invention relate to how the various components of the different sub-systems communicate with one another, rather than where those components are physically located.
Although the present invention has been described in the context of a system having a central processing sub-system, in addition to the statistical multiplexing sub-system and multiple encoding subsystems, all of which are configured to a PCI bus, it will be understood that the present invention is not so limited. In particular, the present invention can also be implemented in computer systems in which there is no separate central processing sub-system, but where all of the centralized control functions are implemented in the DSPs of the statistical multiplexing sub-system. Moreover, such a computer system may be implemented with or without a system bus, such as a PCI bus. The present invention can be embodied in the form of methods and apparatuses for practicing those methods. The present invention can also be embodied in the form of program code embodied in tangible media, such as floppy diskettes, CD-ROMs, hard drives, or any other machine-readable storage medium, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the invention. The present invention can also be embodied in the form of program code, for example, whether stored in a storage medium, loaded into and/or executed by a machine, or transmitted over some transmission medium or carrier, such as over electrical wiring or cabling, through fiber optics, or via electromagnetic radiation, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the invention. When implemented on a general-purpose processor, the program code segments combine with the processor to provide a unique device that operates analogously to specific logic circuits.
It will be further understood that various changes in the details, materials, and arrangements of the parts which have been described and illustrated in order to explain the nature of this invention may be made by those skilled in the art without departing from the principle and scope of the invention as expressed in the following claims.

Claims

CLAIMS What is claimed is:
1. A method for controlling transmission over a shared communication channel of multiple compressed video bitstreams generated by a plurality of video encoders and corresponding to multiple video applications, comprising the steps of:
(a) receiving information for each compressed video bitstream wherein at least two of the video applications have different latency requirements;
(b) controlling the transmission of data from the multiple compressed video bitstreams over the shared communication channel taking into account the information for each compressed video bitstream and the latency requirement of each corresponding video application; and
(c) adaptively controlling compression processing of at least one of the video encoders taking into account the information for the corresponding compressed video bitstream and the latency requirement of the corresponding video application.
2. The invention of claim 1 , further comprising the step of (d) controlling admission of a new video application for transmission of a corresponding compressed video bitstream over the shared communication channel.
3. The invention of claim 1, wherein at least two of the video encoders have different video compression capabilities.
4. The invention of claim 1, wherein the processing of at least two of the video encoders is staggered within a frame time and step (c) comprises the step of controlling the process of encoding at least one compressed video bitstream taking into account the information for at least one other compressed video bitstream earlier in the same frame time.
5. The invention of claim 1 , further comprising the step of (d) performing off-line profiling of typical video streams corresponding to different classifications of video applications to generate profiling results for use during at least one of steps (b) and (c).
6. The invention of claim 1, wherein step (b) comprises the step of dropping data from a compressed video bitstream.
7. The invention of claim 1, wherein step (b) comprises the step of scheduling transmission of compressed data corresponding to a video application having a relatively high latency requirement to coincide with relatively low-bit-rate periods of one or more other video applications having a relatively low latency requirement.
8. The invention of claim 1 , further comprising the step of instructing at least one of the video applications to delay processing when the latency requirement for the video application is not met.
9. A video processing system for controlling transmission of multiple compressed video bitstreams corresponding to multiple video applications over a shared communication channel, comprising:
(a) a plurality of video encoders, each configured to generate a different compressed video bitstream for a different video application, wherein at least two of the video applications have different latency requirements; and
(b) a controller, configured to:
(1) receive the compressed video bitstreams from the video encoders;
(2) control the transmission of data from the multiple compressed video bitstreams over the shared communication channel taking into account information for each compressed video bitstream and the latency requirement of each corresponding video application; and
(3) adaptively control the compression processing of at least one of the video encoders taking into account the information for the corresponding compressed video bitstream and the latency requirement of the corresponding video application.
10. A controller for controlling transmission of multiple compressed video bitstreams corresponding to multiple video applications over a shared communication channel, in a video processing system further comprising a plurality of video encoders, each configured to generate a different compressed video bitstream for a different video application, wherein at least two of the video applications have different latency requirements, wherein the controller is configured to:
(1) receive the compressed video bitstreams from the video encoders;
(2) control the transmission of data from the multiple compressed video bitstreams over the shared communication channel taking into account information for each compressed video bitstream and the latency requirement of each corresponding video application; and (3) adaptively control the compression processing of at least one of the video encoders taking into account the information for the corresponding compressed video bitstream and the latency requirement of the corresponding video application.
PCT/US2000/000249 1999-01-06 2000-01-06 Latency-based statistical multiplexing WO2000041395A1 (en)

Applications Claiming Priority (10)

Application Number Priority Date Filing Date Title
US11484299P 1999-01-06 1999-01-06
US11483499P 1999-01-06 1999-01-06
US60/114,834 1999-01-06
US60/114,842 1999-01-06
US17088399P 1999-12-15 1999-12-15
US60/170,883 1999-12-15
US09/478,127 US6665872B1 (en) 1999-01-06 2000-01-05 Latency-based statistical multiplexing
US09/478,128 2000-01-05
US09/478,128 US6754241B1 (en) 1999-01-06 2000-01-05 Computer system for statistical multiplexing of bitstreams
US09/478,127 2000-01-05

Publications (1)

Publication Number Publication Date
WO2000041395A1 true WO2000041395A1 (en) 2000-07-13

Family

ID=27537393

Family Applications (2)

Application Number Title Priority Date Filing Date
PCT/US2000/000245 WO2000041397A1 (en) 1999-01-06 2000-01-06 Computer system for statistical multiplexing of bitstreams
PCT/US2000/000249 WO2000041395A1 (en) 1999-01-06 2000-01-06 Latency-based statistical multiplexing

Family Applications Before (1)

Application Number Title Priority Date Filing Date
PCT/US2000/000245 WO2000041397A1 (en) 1999-01-06 2000-01-06 Computer system for statistical multiplexing of bitstreams

Country Status (1)

Country Link
WO (2) WO2000041397A1 (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007014384A2 (en) * 2005-07-28 2007-02-01 Thomson Licensing Method and apparatus for transmitting multiple video streams over a video channel
WO2007114862A3 (en) * 2005-12-02 2008-04-24 Qualcomm Inc Time slicing techniques for variable data rate encoding
EP1983756A1 (en) * 2007-04-18 2008-10-22 St Microelectronics S.A. Portable device with video output
WO2009120782A1 (en) * 2008-03-26 2009-10-01 Cisco Technology, Inc Distributing digtal video content to multiple end-user devices
US9264533B2 (en) 2000-01-07 2016-02-16 Tq Delta, Llc Systems and methods for establishing a diagnostic transmission mode and communicating over the same
US9286251B2 (en) 2004-10-12 2016-03-15 Tq Delta, Llc Resource sharing in a telecommunications environment
US9300601B2 (en) 2001-10-05 2016-03-29 Tq Delta, Llc Bonding device and method
US9300324B2 (en) 2004-09-25 2016-03-29 Tq Delta, Llc CRC counter normalization
US9485128B2 (en) 1999-11-09 2016-11-01 Tq Delta, Llc System and method for scrambling using a bit scrambler and a phase scrambler
US9485055B2 (en) 2006-04-12 2016-11-01 Tq Delta, Llc Packet retransmission and memory sharing
US9521003B2 (en) 1998-01-26 2016-12-13 Tq Delta, Llc Multicarrier transmission system with low power sleep mode and rapid-on capability
US9621198B2 (en) 2004-10-15 2017-04-11 Tq Delta, Llc DMT symbol repetition in the presence of impulse noise
US10567112B2 (en) 2004-03-03 2020-02-18 Tq Delta, Llc Impulse noise management
WO2021021550A1 (en) * 2019-08-01 2021-02-04 Qualcomm Incorporated Intra refresh and error tracking based on feedback information

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6859496B1 (en) 1998-05-29 2005-02-22 International Business Machines Corporation Adaptively encoding multiple streams of video data in parallel for multiplexing onto a constant bit rate channel
US8284842B2 (en) 2005-07-08 2012-10-09 Activevideo Networks, Inc. Video game system using pre-encoded macro-blocks and a reference grid
US7936819B2 (en) 2005-07-08 2011-05-03 Tag Networks, Inc. Video encoder with latency control
US8118676B2 (en) 2005-07-08 2012-02-21 Activevideo Networks, Inc. Video game system using pre-encoded macro-blocks
US9061206B2 (en) 2005-07-08 2015-06-23 Activevideo Networks, Inc. Video game system using pre-generated motion vectors
US8074248B2 (en) 2005-07-26 2011-12-06 Activevideo Networks, Inc. System and method for providing video content associated with a source image to a television in a communication network
US9826197B2 (en) 2007-01-12 2017-11-21 Activevideo Networks, Inc. Providing television broadcasts over a managed network and interactive content over an unmanaged network to a client device
US9042454B2 (en) 2007-01-12 2015-05-26 Activevideo Networks, Inc. Interactive encoded content system including object models for viewing on a remote device
AU2011315950B2 (en) 2010-10-14 2015-09-03 Activevideo Networks, Inc. Streaming digital video between video devices using a cable television system
WO2012138660A2 (en) 2011-04-07 2012-10-11 Activevideo Networks, Inc. Reduction of latency in video distribution networks using adaptive bit rates
US10409445B2 (en) 2012-01-09 2019-09-10 Activevideo Networks, Inc. Rendering of an interactive lean-backward user interface on a television
US9800945B2 (en) 2012-04-03 2017-10-24 Activevideo Networks, Inc. Class-based intelligent multiplexing over unmanaged networks
US9123084B2 (en) 2012-04-12 2015-09-01 Activevideo Networks, Inc. Graphical application integration with MPEG objects
WO2014145921A1 (en) 2013-03-15 2014-09-18 Activevideo Networks, Inc. A multiple-mode system and method for providing user selectable video content
US9219922B2 (en) 2013-06-06 2015-12-22 Activevideo Networks, Inc. System and method for exploiting scene graph information in construction of an encoded video sequence
EP3005712A1 (en) 2013-06-06 2016-04-13 ActiveVideo Networks, Inc. Overlay rendering of user interface onto source video
US9294785B2 (en) 2013-06-06 2016-03-22 Activevideo Networks, Inc. System and method for exploiting scene graph information in construction of an encoded video sequence
US9788029B2 (en) 2014-04-25 2017-10-10 Activevideo Networks, Inc. Intelligent multiplexing using class-based, multi-dimensioned decision logic for managed networks

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1995029545A1 (en) * 1994-04-20 1995-11-02 Thomson Consumer Electronics, Inc. Asynchronous control signal generating apparatus
US5506844A (en) * 1994-05-20 1996-04-09 Compression Labs, Inc. Method for configuring a statistical multiplexer to dynamically allocate communication channel bandwidth
WO1997018676A1 (en) * 1995-11-15 1997-05-22 Philips Electronics N.V. Method and device for global bitrate control of a plurality of encoders
US5796724A (en) * 1995-12-28 1998-08-18 Intel Corporation Method and apparatus for partitioning transmission bandwidth among different data streams

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5657091A (en) * 1995-11-01 1997-08-12 International Business Machines Corporation Video display controller, user interface and programming structure for such interface
JPH1066125A (en) * 1996-08-21 1998-03-06 Nec Shizuoka Ltd Radio selective calling receiver
US5793425A (en) * 1996-09-13 1998-08-11 Philips Electronics North America Corporation Method and apparatus for dynamically controlling encoding parameters of multiple encoders in a multiplexed system
BR9706810A (en) * 1996-10-03 1999-08-31 Motorola Inc Videophone apparatus, process and system for telephony and audio and video conference with wire line.

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1995029545A1 (en) * 1994-04-20 1995-11-02 Thomson Consumer Electronics, Inc. Asynchronous control signal generating apparatus
US5506844A (en) * 1994-05-20 1996-04-09 Compression Labs, Inc. Method for configuring a statistical multiplexer to dynamically allocate communication channel bandwidth
WO1997018676A1 (en) * 1995-11-15 1997-05-22 Philips Electronics N.V. Method and device for global bitrate control of a plurality of encoders
US5796724A (en) * 1995-12-28 1998-08-18 Intel Corporation Method and apparatus for partitioning transmission bandwidth among different data streams

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
BENNY BING ET AL: "A new multiaccess technique for multimedia wireless LANs", GLOBECOM 97. IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE. CONFERENCE RECORD (CAT. NO.97CH36125), GLOBECOM 97. IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE. CONFERENCE RECORD, PHOENIX, AZ, USA, 3-8 NOV. 1997, 1997, New York, NY, USA, IEEE, USA, pages 1318 - 1322 vol.3, XP002139188, ISBN: 0-7803-4198-8 *
KNIGHTLY E W: "On the accuracy of admission control tests", PROCEEDINGS. 1997 INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (CAT. NO.97TB100174), PROCEEDINGS 1997 INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS, ATLANTA, GA, USA, 28-31 OCT. 1997, 1997, Los Alamitos, CA, USA, IEEE Comput. Soc, USA, pages 125 - 133, XP002139186, ISBN: 0-8186-8061-X *
KWOK T: "Communications requirements of multimedia applications: a preliminary study", 1992 IEEE INTERNATIONAL CONFERENCE ON SELECTED TOPICS IN WIRELESS COMMUNICATIONS. CONFERENCE PROCEEDINGS (CAT. NO.92TH0462-2), VANCOUVER, BC, CANADA, 25-26 JUNE 1992, 1992, New York, NY, USA, IEEE, USA, pages 138 - 142, XP002139187, ISBN: 0-7803-0723-2 *
REININGER D J ET AL: "BANDWIDTH RENEGOTIATION FOR VBR VIDEO OVER ATM NETWORKS", IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS,US,IEEE INC. NEW YORK, vol. 14, no. 6, 1 August 1996 (1996-08-01), pages 1076 - 1085, XP000620594, ISSN: 0733-8716 *

Cited By (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9521003B2 (en) 1998-01-26 2016-12-13 Tq Delta, Llc Multicarrier transmission system with low power sleep mode and rapid-on capability
US9485128B2 (en) 1999-11-09 2016-11-01 Tq Delta, Llc System and method for scrambling using a bit scrambler and a phase scrambler
US9755876B2 (en) 1999-11-09 2017-09-05 Tq Delta, Llc System and method for scrambling the phase of the carriers in a multicarrier communications system
US10187240B2 (en) 1999-11-09 2019-01-22 Tq Delta, Llc System and method for scrambling the phase of the carriers in a multicarrier communications system
US9264533B2 (en) 2000-01-07 2016-02-16 Tq Delta, Llc Systems and methods for establishing a diagnostic transmission mode and communicating over the same
US10264119B2 (en) 2000-01-07 2019-04-16 Tq Delta, Llc Systems and methods for establishing a diagnostic transmission mode and communicating over the same
US10623559B2 (en) 2000-01-07 2020-04-14 Tq Delta, Llc Systems and methods for establishing a diagnostic transmission mode and communicating over the same
US9973624B2 (en) 2000-01-07 2018-05-15 Tq Delta, Llc Systems and methods for establishing a diagnostic transmission mode and communicating over the same
US9479637B2 (en) 2000-01-07 2016-10-25 Tq Delta, Llc Systems and methods for establishing a diagnostic transmission mode and communicating over the same
US9838531B2 (en) 2000-01-07 2017-12-05 Tq Delta, Llc Systems and methods for establishing a diagnostic transmission mode and communicating over the same
US9319512B2 (en) 2000-01-07 2016-04-19 Tq Delta, Llc Systems and methods for establishing a diagnostic transmission mode and communicating over the same
US10341261B2 (en) 2001-10-05 2019-07-02 Tq Delta, Llc Bonding device and method
US9894014B2 (en) 2001-10-05 2018-02-13 Tq Delta, Llc Bonding device and method
US9300601B2 (en) 2001-10-05 2016-03-29 Tq Delta, Llc Bonding device and method
US10567112B2 (en) 2004-03-03 2020-02-18 Tq Delta, Llc Impulse noise management
US10805040B2 (en) 2004-03-03 2020-10-13 Tq Delta, Llc Impulse noise management
US11005591B2 (en) 2004-03-03 2021-05-11 Tq Delta, Llc Impulse noise management
US10346243B2 (en) 2004-09-25 2019-07-09 Tq Delta, Llc CRC counter normalization
US9300324B2 (en) 2004-09-25 2016-03-29 Tq Delta, Llc CRC counter normalization
US10049003B2 (en) 2004-09-25 2018-08-14 Tq Delta, Llc CRC counter normalization
US9286251B2 (en) 2004-10-12 2016-03-15 Tq Delta, Llc Resource sharing in a telecommunications environment
US10579291B2 (en) 2004-10-12 2020-03-03 Tq Delta, Llc Resource sharing in a telecommunications environment
US11543979B2 (en) 2004-10-12 2023-01-03 Tq Delta, Llc Resource sharing in a telecommunications environment
US9547608B2 (en) 2004-10-12 2017-01-17 Tq Delta, Llc Resource sharing in a telecommunications environment
US10409510B2 (en) 2004-10-12 2019-09-10 Tq Delta, Llc Resource sharing in a telecommunications environment
US9898220B2 (en) 2004-10-12 2018-02-20 Tq Delta, Llc Resource sharing in a telecommunications environment
US11010073B2 (en) 2004-10-12 2021-05-18 Tq Delta, Llc Resource sharing in a telecommunications environment
US9621198B2 (en) 2004-10-15 2017-04-11 Tq Delta, Llc DMT symbol repetition in the presence of impulse noise
US11394485B2 (en) 2004-10-15 2022-07-19 Tq Delta, Llc DMT symbol repetition in the presence of impulse noise
WO2007014384A3 (en) * 2005-07-28 2007-05-31 Thomson Licensing Method and apparatus for transmitting multiple video streams over a video channel
US8160140B2 (en) 2005-07-28 2012-04-17 Thomson Licensing Method and apparatus for transmitting multiple video streams over a video channel
CN102833548B (en) * 2005-07-28 2015-07-22 汤姆森特许公司 Method and apparatus for transmitting multiple video streams over video channel
WO2007014384A2 (en) * 2005-07-28 2007-02-01 Thomson Licensing Method and apparatus for transmitting multiple video streams over a video channel
US8451937B2 (en) 2005-12-02 2013-05-28 Qualcomm Incorporated Time slicing techniques for variable data rate encoding
TWI383681B (en) * 2005-12-02 2013-01-21 Qualcomm Inc Time slicing techniques for variable data rate encoding
CN101322415B (en) * 2005-12-02 2010-12-08 高通股份有限公司 Time slicing method and system for variable data rate encoding
US7839948B2 (en) 2005-12-02 2010-11-23 Qualcomm Incorporated Time slicing techniques for variable data rate encoding
WO2007114862A3 (en) * 2005-12-02 2008-04-24 Qualcomm Inc Time slicing techniques for variable data rate encoding
US10833809B2 (en) 2006-04-12 2020-11-10 Tq Delta, Llc Techniques for packet and message communication in a multicarrier transceiver environment
US10498495B2 (en) 2006-04-12 2019-12-03 Tq Delta, Llc Packet retransmission
US10484140B2 (en) 2006-04-12 2019-11-19 Tq Delta, Llc Packet retransmission and memory sharing
US10044473B2 (en) 2006-04-12 2018-08-07 Tq Delta, Llc Packet retransmission and memory sharing
US11362765B2 (en) 2006-04-12 2022-06-14 Tq Delta, Llc Packet retransmission using one or more delay requirements
US9485055B2 (en) 2006-04-12 2016-11-01 Tq Delta, Llc Packet retransmission and memory sharing
US9749235B2 (en) 2006-04-12 2017-08-29 Tq Delta, Llc Packet retransmission
EP1983756A1 (en) * 2007-04-18 2008-10-22 St Microelectronics S.A. Portable device with video output
WO2009120782A1 (en) * 2008-03-26 2009-10-01 Cisco Technology, Inc Distributing digtal video content to multiple end-user devices
US8468572B2 (en) 2008-03-26 2013-06-18 Cisco Technology, Inc. Distributing digital video content to multiple end-user devices
WO2021021550A1 (en) * 2019-08-01 2021-02-04 Qualcomm Incorporated Intra refresh and error tracking based on feedback information
US11039149B2 (en) 2019-08-01 2021-06-15 Qualcomm Incorporated Dynamic video insertion based on feedback information

Also Published As

Publication number Publication date
WO2000041397A8 (en) 2001-06-21
WO2000041397A1 (en) 2000-07-13

Similar Documents

Publication Publication Date Title
US6665872B1 (en) Latency-based statistical multiplexing
US6754241B1 (en) Computer system for statistical multiplexing of bitstreams
WO2000041395A1 (en) Latency-based statistical multiplexing
Lee et al. Scalable rate control for MPEG-4 video
KR100927083B1 (en) Quasi-constant-quality rate control with look-ahead
US7652993B2 (en) Multi-stream pro-active rate adaptation for robust video transmission
US7876821B2 (en) Method and an apparatus for controlling the rate of a video sequence; a video encoding device
JP5676705B2 (en) Improved video rate control for video coding standards
JP5180294B2 (en) Buffer-based rate control that utilizes frame complexity, buffer level, and intra-frame location in video encoding
US9225980B2 (en) Video quality of sevice management and constrained fidelity constant bit rate video encoding systems and methods
US9325998B2 (en) Wireless video transmission system
US6324217B1 (en) Method and apparatus for producing an information stream having still images
Ramanujan et al. Adaptive streaming of MPEG video over IP networks
US7843824B2 (en) Method and apparatus for statistically multiplexing services
Van der Auwera et al. Implications of smoothing on statistical multiplexing of H. 264/AVC and SVC video streams
US8254449B2 (en) Video traffic bandwidth prediction
US7826530B2 (en) Use of out of order encoding to improve video quality
CN110381316A (en) A kind of method for controlling video transmission, device, equipment and storage medium
Yang et al. Rate control for VBR video over ATM: simplification and implementation
Ringenburg et al. Global MINMAX interframe bit allocation for embedded video coding
KR100949755B1 (en) A method and an apparatus for controlling the rate of a video sequence, a video encoding device
Overmeire et al. Constant quality video coding using video content analysis
Fang et al. Hardware oriented rate control algorithm and implementation for realtime video coding
Ozcelebi et al. An analysis of constant bitrate and constant PSNR video encoding for wireless networks
Lu et al. A complexity-aware video adaptation mechanism for live streaming systems

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): CA JP

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
122 Ep: pct application non-entry in european phase