WO2003090207A1 - Parametric multi-channel audio representation - Google Patents

Parametric multi-channel audio representation Download PDF

Info

Publication number
WO2003090207A1
WO2003090207A1 PCT/IB2003/001591 IB0301591W WO03090207A1 WO 2003090207 A1 WO2003090207 A1 WO 2003090207A1 IB 0301591 W IB0301591 W IB 0301591W WO 03090207 A1 WO03090207 A1 WO 03090207A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio signal
information
channel audio
encoded
parameters
Prior art date
Application number
PCT/IB2003/001591
Other languages
French (fr)
Inventor
Arnoldus W. J. Oomen
Erik G. P. Schuijers
Dirk J. Breebaart
Steven L. J. D. E. Van De Par
Original Assignee
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V. filed Critical Koninklijke Philips Electronics N.V.
Priority to KR1020047017069A priority Critical patent/KR101021079B1/en
Priority to DE60306512T priority patent/DE60306512T2/en
Priority to US10/511,806 priority patent/US8498422B2/en
Priority to EP03712597A priority patent/EP1500083B1/en
Priority to BRPI0304542A priority patent/BRPI0304542B1/en
Priority to BR0304542-0A priority patent/BR0304542A/en
Priority to CN038089777A priority patent/CN1647156B/en
Priority to AU2003216686A priority patent/AU2003216686A1/en
Priority to JP2003586872A priority patent/JP4714415B2/en
Publication of WO2003090207A1 publication Critical patent/WO2003090207A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03Application of parametric coding in stereophonic audio systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels

Definitions

  • the invention relates to a method of encoding a multi-channel audio signal, an encoder for encoding a multi-channel audio signal, an apparatus for supplying an audio signal, an encoded audio signal, a storage medium on which the encoded audio signal is stored, a method of decoding an encoded audio signal, a decoder for decoding an encoded audio signal, and an apparatus for supplying a decoded audio signal.
  • EP-A-1107232 discloses a parametric coding scheme to generate a representation of a stereo audio signal which is composed of a left channel signal and a right channel signal.
  • a representation contains information concerning only a monaural signal which is either the left channel signal or the right channel signal, and parametric information.
  • the other stereo signal can be recovered based on the monaural signal together with the parametric information.
  • the parametric information comprises localization cues of the stereo audio signal, including intensity and phase characteristics of the left and the right channel.
  • a first aspect of the invention provides a method of encoding a multi-channel audio signal as claimed in claim 1.
  • a second aspect of the invention provides a method of encoding a multi-channel audio signal as claimed in claim 2.
  • a third aspect of the invention provides an encoder for encoding a multi-channel audio signal as claimed in claim 14.
  • a fourth aspect of the invention provides an encoder for encoding a multi-channel audio signal as claimed in claim 15.
  • a fifth aspect of the invention provides an apparatus for supplying an audio signal as claimed in claim 16.
  • a sixth aspect of the invention provides an encoded audio signal as claimed in claim 17.
  • a seventh aspect of the invention provides a storage medium on which the encoded signal is stored is claimed in claim 18.
  • An eight aspect of the invention provides a method of decoding as claimed in claim 19.
  • a ninth aspect of the invention provides a decoder for decoding an encoded audio signal as claimed in claim 20.
  • a tenth aspect of the invention provides an apparatus for supplying a decoded audio signal as claimed in claim 21.
  • Advantageous embodiments are defined in the dependent claims.
  • a single channel audio signal is generated.
  • information is generated from the multi-channel audio signal allowing recovering, with a required quality level, the multi-channel audio signal from the single channel audio signal and the information.
  • the information comprises sets of parameters, for example, as known from EP-A-1107232.
  • the information is generated by determining a first portion of the information for a first frequency region of the multi-channel audio signal, and by determining a second portion of the information for a second frequency region of the multi-channel audio signal.
  • the second frequency region is a portion of the first frequency region and thus is a sub-range of the first frequency region.
  • the first portion may comprise a single set of parameters determined within a frequency region which covers the full bandwidth of the multi-channel audio signal.
  • the second portion may comprise several sets of parameters, each set of parameters being determined for a sub-range or portion of the full bandwidth. Together, the portions preferably cover the full bandwidth.
  • the first portion may comprise two sets of parameters, the first set being determined for a frequency region which covers a lower part of the full bandwidth, and the second set being determined for a frequency region covering the other part of the full bandwidth.
  • the second portion may comprise two sets of parameters determined for two frequency regions within the lower part of the full bandwidth. It is not required that the number of sets of parameters for the lower part and the higher part of the full bandwidth are equal.
  • This representation of the encoded audio signal allows a quality of the decoded audio signal to depend on the complexity of the decoder. For example, in a simple portable decoder a low complexity decoder may be used which has a low power consumption and which is therefore able to use only part of the information. In a high end application, a complex decoder is used which uses all the information available in the coded signal. The quality of the decoded audio can also depend on the available transmission bandwidth. If the transmission bandwidth is high the decoder can decode all available layers, since they are all transmitted.
  • the encoder receives a maximum allowable bit rate of the encoded multi-channel audio signal.
  • This maximum allowable bit rate may be defined by the available bit rate of a transmission channel such as Internet, or of a storage medium.
  • the encoder encodes all available layers. It is decided at the transmitting- end what layers to transmit, depending on the available channel capacity.
  • the encoder only adds the second portion of the information for the second frequency region of the multi-channel audio signal to the encoded audio signal if a bit rate of the encoded multi-channel audio signal which comprises the single channel audio signal, and the first and second portion of the information is not higher than the maximum allowable bit rate.
  • the second portion is not present in the coded audio signal if the transmission bandwidth is not large enough to support the transmission of the second portion.
  • the information comprises sets of parameters, each one of the portions of the information is represented by one or more sets of parameters.
  • the number of sets of parameters depending on the number of frequency regions present in the portions of the information.
  • the sets of parameters comprise at least one of the localization cues.
  • the first frequency region substantially covers the full bandwidth of the multi-channel audio signal. In this way, one set of parameters suffices to provide the basic information required to decode the single channel audio signal into the multi-channel audio signal. In this way a basic level of quality of the decoded audio signal is guaranteed.
  • the second frequency range covers part of the full bandwidth. In this way, the second portion when present in the coded audio signal improves the quality of the decoded audio signal in this frequency range.
  • the second portion of the information comprises at least two frequency ranges which together substantially cover the full bandwidth of the multi-channel audio signal. In this way, the quality improvement provided by the second portion is present over the complete bandwidth.
  • the base layer which comprises the single channel audio signal and the first portion of the information is always present in the encoded audio signal.
  • the enhancement layer which comprises the second portion of the information is encoded only if the bit rate of the encoded audio signal does not exceed the maximally allowable bit rate. In this way, the quality of the decoded audio signal will depend on the maximally allowable bit rate. If the maximally allowable bit rate is too low to accommodate the enhancement layer, the decoded audio signal will be obtained from the base layer which will produce a better quality of the decoded audio than will be the case if unpredictable parts of the coded audio will not reach the decoder.
  • the portions of the information (usually containing sets of parameters, one set for each frequency band represented) in a next frame are coded based on the parameters of the previous frame.
  • this reduces the bit rate of the encoded portions of the information, because, due to correlation, the information in two successive frames will not differ substantially.
  • the difference of the parameters of two successive frames is coded instead of the parameters itself.
  • Prior solutions in audio coders that have been suggested to reduce the bit rate of stereo program material include intensity stereo and M/S stereo.
  • high frequencies are represented by a single audio signal (i.e., mono) combined with time-varying and frequency- dependent scale factors or intensity factors which allow to recover an decoded audio signal which resembles the original stereo signal for these frequency regions.
  • the signal is decomposed into a sum (or mid, or common) signal and a difference (or side, or uncommon) signal. This decomposition is sometimes combined with principle component analysis or time- varying scale factors.
  • These signals are then coded independently, either by a transform coder or sub-band coder [which are both waveform coders]. The amount of information reduction achieved by this algorithm strongly depends on the spatial properties of the source signal.
  • the difference signal is zero and can be discarded.
  • this scheme offers only little bit rate reduction. For the lower frequency regions M/S coding generally provides significant merit.
  • Fig. 1 shows a block diagram of a multi-channel encoder for stereo audio
  • Fig. 2 shows a block diagram of a multi-channel decoder for stereo audio
  • Fig. 3 shows a representation of the encoded data stream
  • Fig. 4 shows an embodiment of the frequency ranges in accordance with the invention
  • Fig. 5 shows another embodiment of the frequency ranges in accordance with the invention
  • Fig. 6 shows the determination of the sets of parameters based on parameters in a previous frame in accordance with an embodiment of the invention
  • Fig. 7 shows a set of parameters
  • Fig. 8 shows the differential determination of the parameters of the base layer
  • Fig. 9 shows the differential determination of the parameters corresponding to a frequency region of an enhancement layer.
  • Fig. 1 shows a block diagram of a multi-channel encoder. The encoder receives a multi-channel audio signal which is shown as a stereo signal RI, LI and the encoder supplies the encoded multi-channel audio signal EBS.
  • the down mixer 1 combines the stereo signal or stereo channels RI, LI into a single channel audio signal (also referred to as monaural signal) SC.
  • the down mixer 1 may determine the average of the input audio signals RI, LI.
  • the encoder 3 encodes the monaural signal SC to obtain an encoded monaural signal ESC.
  • the encoder 3 may be of a known kind, for example, an MPEG coder (MPEG- LII, MPEG-LIII (mp3), or MPEG2-AAC).
  • MPEG- LII MPEG- LII
  • MPEG-LIII MPEG-LIII
  • MPEG2-AAC MPEG2-AAC
  • the parameter determining circuit 2 receives the maximum allowable bit rate MBR to only determine the parameter sets SI, S2, ... which when coded by the parameter coder 4, together with the encoded monaural signal ESC do not exceed the maximum allowable bit rate MBR.
  • the encoded parameters are denoted by EIN.
  • the formatter 5 combines the encoded monaural signal SC and the encoded parameters EIN in a data stream in a desired format to obtain the encoded multi-channel audio signal EBS.
  • the operation of the encoder is elucidated in more detail in the now following, by way of example, with respect to an embodiment.
  • the multi-channel audio signal LI, RI is encoded in a single monaural signal SC (further also referred to as single channel audio signal).
  • the parameterization of spatial attributes of the multi-channel audio signals LI, RI is performed by the parameter determining circuit 2.
  • the parameters contain information on how to restore the multi-channel audio signal LI, RI from the monaural signal SC.
  • the parameters are usually encoded by the parameter encoder 4 before combining them with the encoded single monaural signal ESC. Thus, for general audio coding applications, these parameters combined with only one monaural audio signal are transmitted or stored.
  • the combined coded signal is the encoded multi-channel audio signal EBS.
  • the transmission or storage capacity necessary to transmit or store the encoded multi-channel audio signal EBS is strongly reduced compared to audio coders that process the multi-channels independently. Nevertheless, the original spatial impression is maintained by the information INF which contains the (sets of) parameters.
  • the parametric description of multi-channel audio RI, LI is related to a binaural processing model which aims at describing the effective signal processing of the binaural auditory system.
  • the model splits the incoming audio LI, RI into several band-limited signals, which, preferably, are spaced linearly at an ERB-rate scale.
  • the bandwidth of tliese signals depends on the center frequency, following the ERB-rate. Subsequently, preferably, for every frequency band, the following properties of the incoming signals are analyzed:
  • the interaural level difference, or ILD defined by the relative levels of the band- limited signal stemming from the left and right ears
  • ITD interaural time (or phase) difference
  • IPD interaural delay
  • FR1, FR2, ... vary over time.
  • the update rate of these properties is rather low (typically tens of milliseconds). It may be assumed that the (slowly) time-varying parameters are the only spatial signal properties that the binaural auditory system has available, and that from these time and frequency dependent parameters, the perceived auditory world is reconstructed by higher levels of the auditory system.
  • Fig. 2 shows a block diagram of a multi-channel decoder.
  • the decoder receives the encoded multi-channel audio signal EBS and supplies the recovered decoded multi-channel audio signal which is shown as a stereo signal RO, LO.
  • the deformatter 6 retrieves the encoded monaural signal ESC and the encoded parameters EIN' from the data stream EBS.
  • the decoder 7 decodes the encoded monaural signal ESC into the output monaural signal SCO.
  • the decoder 7 may be of any known kind (of course matched to the encoder that has been used), for example, the decoder
  • the decoder 7 is an MPEG decoder.
  • the decoder 8 decodes the encoded parameters EIN' into output parameters INO.
  • the demultiplexer 9 recovers the output stereo audio signals LO and RO by applying the parameter sets SI, S2, ... of the output parameters INO on the output monaural signal SCO.
  • Fig. 3 shows a representation of the encoded data stream.
  • the data package starts with a header H followed by the coded monaural signal ECS now indicated by A, a first portion PI of the encoded information EIN, a second portion P2 of the encoded information EIN, and a third portion P3 of the encoded information EIN.
  • the frame FI, F2, ... only comprises the header H and the coded monaural signal ECS, only the monaural signal SC is transmitted.
  • the full frequency band in which the input audio signal occurs is divided into a plurality of sub-frequency bands, which together cover the full frequency band.
  • the multichannel information INF is encoded in a plurality of parameter sets SI, S2,... one set for each sub-frequency band FR1 , FR2, ....
  • This plurality of parameter sets S 1 , S2, ... is coded in the first portion PI of the encode information EIN.
  • the bit stream comprises the header H, the portion A which is the coded monaural signal ECS and the first portion PI .
  • the first portion PI consists of a single set parameters SI, only.
  • the single set being determined for the full bandwidth FR1.
  • This bit stream which comprises the header H and the portions A and PI provides a basic layer of quality, indicated by BL in Fig. 3.
  • further portions P2, P3 of the coded information EIN are present in the bit stream. These further portions form an enhancement layer EL.
  • the bit stream may comprise a single further portion P2 or more than 1 further portion.
  • the further portion P2 preferably comprises a plurality of sets S2, S3,... of parameters, one set for each sub-frequency band FR2, FR3, ..., the sub- frequency bands FR2, FR3, ... preferably covering the full frequency band FR1.
  • the enhanced quality may also be present in a step- wise manner, a first enhancement level is provided by the enhancement layer ELI which comprises the first portion.
  • a second enhancement layer EL comprises the first enhancement layer ELI and the second enhancement layer EL2 which comprises the portion P3.
  • the further portion P2 may also comprise a single set S2 of parameters corresponding to a single frequency band FR2 which is a sub-band of the full frequency band FRl.
  • the further portion P2 may also comprise a number of sets of parameters S2, S3, ... which correspond to frequency bands FR2, FR3, ... which together do not cover the complete full frequency band FRl.
  • the further portion P3 preferably contains parameter sets for frequency bands which sub-divide at least one of the sub-bands of the further portion P2.
  • This format of the bit stream in accordance with the invention allows at the transmission channel, or at the decoder to scale the quality of the decoded audio signal with the bit rate of the transmission channel, or the decoding complexity of the decoder. For example, if the audio decoder should have a low power consumption, as is important in portable applications, the decoder may have a low complexity and only uses the portions H, A and PI . It would even be possible that the decoder is able to perform more complex operations at a higher power consumption if the user indicates that he desires a higher quality of the decoded audio.
  • the encoder is aware of the maximum allowable bit rate MBR which may be transmitted via the transmission channel or which may be stored on a storage medium. Now, the encoder is able to decide on how many, if any, further portions PI, P2, ... fit within the maximum allowable bit rate MBR. The encoder codes only these allowable portions PI , P2, ... in the bit stream.
  • Fig. 4 shows an embodiment of the frequency ranges in accordance with the invention.
  • the frequency band FRl is equal to the full bandwidth FBW of the multi-channel audio signal LI, RI, and the frequency band FR2 is a sub-frequency band of the full bandwidth FBW.
  • a single parameter set S 1 is determined for the frequency band FRl and is present in the portion PI
  • a single parameter set S2 is determined for the frequency band FR2 and is present in the portion P2.
  • the quality scaling is possible by either using or not using the portion P2.
  • Fig. 5 shows another embodiment of the frequency ranges in accordance with the invention.
  • the frequency band FRl is again equal to the full bandwidth FBW, and the sub-frequency bands FR2 and FR3 together cover the full bandwidth FBW.
  • the frequency band FRl is subdivided into the sub- frequency bands FR2 and FR3.
  • the portion PI comprises a single parameter set SI determined for de frequency band FRl
  • the portion P2 comprises two parameter sets S2 and S3 determined for the frequency band FR2 and FR3, respectively.
  • the quality scaling is possible by either using or not using the portion P2.
  • Fig. 6 shows the determination of the sets of parameters based on parameters in a previous frame in accordance with an embodiment of the invention.
  • Fig. 6 shows a data stream which comprises in each frame FI, F2, ... the coded information EIN which comprises the portion PI which is part of the base layer BL and the portion P2 which forms the enhancement layer EL.
  • the portion PI comprises a single set of parameters SI which are determined for the full bandwidth FRl .
  • the portion P2 by way of example, comprises four sets of parameters S2, S3, S4, S5 which are determined for the sub-frequency bands FR2, FR3, FR4, FR5, respectively.
  • the four sub-frequency bands FR2, FR3, FR4, FR5 subdivide the frequency band FRl .
  • the portion PI comprises a single set of parameters S 1 ' which are determined for the full bandwidth FRl and are part of the base layer BL'.
  • the portion P2 comprises four sets of parameters S2', S3', S4', S5' which are again determined for the sub-frequency bands FR2, FR3, FR4, FR5, respectively and which form the enhancement layer EL'.
  • each of the sets of parameters SI, S2, ... for each one of the frames FI, F2, ... separately. It is also possible to code the sets of parameters of the portion P2 with respect to the parameters of the portion PI. This is indicated by the arrows starting at SI and ending at S2 to S5 in the frame FI. Of course this is also possible in the other frames F2, ... (not shown). In the same manner, it is possible to code the set of parameters SI' with respect to SI. And finally, the sets of parameters S2', S3', S4', S5' may be coded with respect to the sets of parameters S2, S3, S4, S5.
  • bit rate of the encoded information EIN can be reduced as the redundancy or correlation between sets of parameters Si is used.
  • the new parameters of the new sets of parameters SI', S2', S3', S4', S5' are coded as the difference of their value and the value of the parameters of the previous sets of parameters SI, S2, S3, S4, S5.
  • Fig. 7 shows a set of parameters.
  • Each set of parameters Si may comprise one or more parameters.
  • the parameters are localization cues which provide information about the localization of sound objects in the audio information.
  • the localization cues are the interaural level difference ILD, the interaural time or phase difference ITD or IPD, and the interaural cross-correlation IC. More detailed information on these parameters is provided in the Audio Engineering Society Convention Paper 5574 "Binaural Cue Coding Applied to Stereo and Multi-channel Audio Compression" presented at the 112 th Convention 2002 May 10-13 Kunststoff, Germany, by Christof Faller et al.
  • Fig. 8 shows the differential determination of a parameter of the base layer.
  • the horizontal axis indicates successive frames FI to F5.
  • the vertical axis shows the value PNG of a parameter of the set of parameters SI of the base layer BL.
  • This parameter has the values Al to A5 for the frames FI to F5 respectively.
  • the contribution of this parameter to the bit rate of the coded information EI ⁇ will decrease if not the actual values A2 to A5 of the parameter are coded but the smaller differences Dl, D2 ,....
  • Fig. 9 shows the differential determination of the parameters corresponding to a frequency region of an enhancement layer.
  • the horizontal axis indicates two successive frames FI and F2.
  • the vertical axis indicates the values of a particular parameter of the base layer BL and the enhancement layer EL.
  • the base layer BL comprises the portion PI of information I ⁇ F with a single set of parameters determined for the full frequency range FBW
  • the particular parameter of the portion PI has the value Al for the frame FI and A2 for the frame F2.
  • the enhancement layer EL comprises the portion P2 of information I ⁇ F with three sets of parameters determined for three respective frequency ranges FR2, FR3, FR4 which together fill the full frequency range FBW.
  • the three particular parameters (for example, the parameter representing the ILD) have a value B11, B12, B13 in the frame FI and a value B21, B22, B23 in the frame F2.
  • a base layer BL contains one set of parameters (preferably the time/level difference and the correlation) SI which is determined for the full bandwidth FBW of the multi-channel audio signal LI, RI.
  • the enhancement layer EL contains multiple sets of parameters S2, S3, ... which correspond to subsequent frequency intervals FR2, FR3, ... within the full bandwidth FBW.
  • the sets of parameters S2, S3, ... in the enhancement layer EL can be differentially encoded with respect to the set of parameters SI in the base layer BL.
  • the information INF is encoded in a multi-layered manner to enable a scaling of the decoding quality versus bit rate.
  • the data ESC for the monaural representation SC the data EIN for the set of stereo parameters SI for the full bandwidth FBW, and the stereo parameters S2, S3, ... for the frequency bins (or regions) FR2, FR3, ... is determined.
  • the program code is shown at the left hand side, and an elucidation of the program code is provided under description at the right hand side.
  • example_mono_frame(f) get data for monaural signal representation (the portion A in Fig. 3)
  • example_stereo_extension_layer_l(f) get data stereo parameters full bandwidth (the portion PI)
  • example_stereo_extension_layer_2(f) get data stereo parameters frequency bins (the portion P2)
  • the stereo parameters for all of the frequency bins are coded absolutely (the actual value is coded) or the difference with the corresponding parameters for the full bandwidth is coded.
  • the following code is valid for the interaural level difference ILD.
  • ⁇ ild_bin_diff[f, b] code the ild within a particular bin relative to the value in that bin in the previous frame
  • ild_global[sf] represents the Huffman encoded absolute representation level of the ILD for the whole frequency area for frame f.
  • ild_global_diff[f] represents the Huffman encoded relative representation level of the ILD for the whole frequency area for frame f.
  • ild_bin[f, b] represents the Huffman encoded absolute representation level of the ILD for frame f and bin b.
  • ild_bin_diff[f, b] represents the Huffman encoded relative representation level of the ILD for frame f and bin b.
  • any reference signs placed between parentheses shall not be construed as limiting the claim.
  • the word "comprising” does not exclude the presence of elements or steps other than those listed in a claim.
  • the invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the device claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.
  • multi-channel audio signals are coded into a monaural audio signal and information allowing to recover the multi-channel audio signal from the monaural audio signal and the information.
  • the information is generated by determining a first portion of the information for a first frequency region of the multi-channel audio signal, and by determining a second portion of the information for a second frequency region of the multichannel audio signal.
  • the second frequency region is a portion of the first frequency region and thus is a sub-range of the first frequency region.
  • the information is multi-layered enabling a scaling of the decoding quality versus bit rate.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereophonic System (AREA)
  • Stereo-Broadcasting Methods (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)

Abstract

Multi-channel audio signals are coded into a monaural audio signal and information allowing to recover the multi-channel audio signal from the monaural audio signal and the information. The information is generated by determining a first portion of the information for a first frequency region of the multi-channel audio signal, and by determining a second portion of the information for a second frequency region of the multi-channel audio signal. The second frequency region is a portion of the first frequency region and thus is a sub-range of the first frequency region. The information is multi-layered enabling a scaling of the decoding quality versus bit rate.

Description

Parametric multi-channel audio representation
The invention relates to a method of encoding a multi-channel audio signal, an encoder for encoding a multi-channel audio signal, an apparatus for supplying an audio signal, an encoded audio signal, a storage medium on which the encoded audio signal is stored, a method of decoding an encoded audio signal, a decoder for decoding an encoded audio signal, and an apparatus for supplying a decoded audio signal.
EP-A-1107232 discloses a parametric coding scheme to generate a representation of a stereo audio signal which is composed of a left channel signal and a right channel signal. To efficiently utilize transmission bandwidth, such a representation contains information concerning only a monaural signal which is either the left channel signal or the right channel signal, and parametric information. The other stereo signal can be recovered based on the monaural signal together with the parametric information. The parametric information comprises localization cues of the stereo audio signal, including intensity and phase characteristics of the left and the right channel.
It is an object of the invention to provide a parametric multi-channel audio system which is able to scale the quality of the encoded audio signal with the available bit rate or to scale the quality of the decoded audio signal with the complexity of the decoder or the available transmission bandwidth.
A first aspect of the invention provides a method of encoding a multi-channel audio signal as claimed in claim 1. A second aspect of the invention provides a method of encoding a multi-channel audio signal as claimed in claim 2. A third aspect of the invention provides an encoder for encoding a multi-channel audio signal as claimed in claim 14. A fourth aspect of the invention provides an encoder for encoding a multi-channel audio signal as claimed in claim 15. A fifth aspect of the invention provides an apparatus for supplying an audio signal as claimed in claim 16. A sixth aspect of the invention provides an encoded audio signal as claimed in claim 17. A seventh aspect of the invention provides a storage medium on which the encoded signal is stored is claimed in claim 18. An eight aspect of the invention provides a method of decoding as claimed in claim 19. A ninth aspect of the invention provides a decoder for decoding an encoded audio signal as claimed in claim 20. A tenth aspect of the invention provides an apparatus for supplying a decoded audio signal as claimed in claim 21. Advantageous embodiments are defined in the dependent claims.
In the method of encoding a multi-channel audio signal in accordance with the first aspect of the invention, a single channel audio signal is generated. Further, information is generated from the multi-channel audio signal allowing recovering, with a required quality level, the multi-channel audio signal from the single channel audio signal and the information. Preferably, the information comprises sets of parameters, for example, as known from EP-A-1107232.
In accordance with the first aspect of the invention, the information is generated by determining a first portion of the information for a first frequency region of the multi-channel audio signal, and by determining a second portion of the information for a second frequency region of the multi-channel audio signal. The second frequency region is a portion of the first frequency region and thus is a sub-range of the first frequency region. Now, two levels of quality of decoding are possible. For a low quality level of the decoded multi-channel audio signal, the decoder uses the encoded single channel audio signal, and the first portion of the information. For a higher quality level, the decoder uses the encoded single channel audio signal, and both the first and the second portion of the information. Of course, it is possible to select the decoding quality out of a multitude of levels if a multitude of portions of information each being associated with a different frequency region are present. For example, the first portion may comprise a single set of parameters determined within a frequency region which covers the full bandwidth of the multi-channel audio signal. And the second portion may comprise several sets of parameters, each set of parameters being determined for a sub-range or portion of the full bandwidth. Together, the portions preferably cover the full bandwidth. But many other possibilities exist. For example, the first portion may comprise two sets of parameters, the first set being determined for a frequency region which covers a lower part of the full bandwidth, and the second set being determined for a frequency region covering the other part of the full bandwidth. The second portion may comprise two sets of parameters determined for two frequency regions within the lower part of the full bandwidth. It is not required that the number of sets of parameters for the lower part and the higher part of the full bandwidth are equal. This representation of the encoded audio signal allows a quality of the decoded audio signal to depend on the complexity of the decoder. For example, in a simple portable decoder a low complexity decoder may be used which has a low power consumption and which is therefore able to use only part of the information. In a high end application, a complex decoder is used which uses all the information available in the coded signal. The quality of the decoded audio can also depend on the available transmission bandwidth. If the transmission bandwidth is high the decoder can decode all available layers, since they are all transmitted. If the transmission bandwidth is low the transmitter can decide to only transmit a limited number of layers. In a second aspect of the invention, the encoder receives a maximum allowable bit rate of the encoded multi-channel audio signal. This maximum allowable bit rate may be defined by the available bit rate of a transmission channel such as Internet, or of a storage medium. In applications wherein the transmission bandwidth is variable and thus the maximum allowable bit rate changes in time, it is important to be able to adapt to these fluctuations of the transmission bandwidth to prevent a very low quality of the decoded audio signal. Normally, the encoder encodes all available layers. It is decided at the transmitting- end what layers to transmit, depending on the available channel capacity. It is possible to do this with the encoder in the loop, but this is more complicated that just stripping some layers prior to transmission. The encoder only adds the second portion of the information for the second frequency region of the multi-channel audio signal to the encoded audio signal if a bit rate of the encoded multi-channel audio signal which comprises the single channel audio signal, and the first and second portion of the information is not higher than the maximum allowable bit rate. Thus, the second portion is not present in the coded audio signal if the transmission bandwidth is not large enough to support the transmission of the second portion.
In an embodiment as defined in claim 4, the information comprises sets of parameters, each one of the portions of the information is represented by one or more sets of parameters. The number of sets of parameters depending on the number of frequency regions present in the portions of the information. In an embodiment as defined in claim 6, the sets of parameters comprise at least one of the localization cues.
In an embodiment as defined in claim 7, the first frequency region substantially covers the full bandwidth of the multi-channel audio signal. In this way, one set of parameters suffices to provide the basic information required to decode the single channel audio signal into the multi-channel audio signal. In this way a basic level of quality of the decoded audio signal is guaranteed. The second frequency range covers part of the full bandwidth. In this way, the second portion when present in the coded audio signal improves the quality of the decoded audio signal in this frequency range. In an embodiment as defined in claim 8, the second portion of the information comprises at least two frequency ranges which together substantially cover the full bandwidth of the multi-channel audio signal. In this way, the quality improvement provided by the second portion is present over the complete bandwidth.
In an embodiment as defined in claim 9, the base layer which comprises the single channel audio signal and the first portion of the information is always present in the encoded audio signal. The enhancement layer which comprises the second portion of the information is encoded only if the bit rate of the encoded audio signal does not exceed the maximally allowable bit rate. In this way, the quality of the decoded audio signal will depend on the maximally allowable bit rate. If the maximally allowable bit rate is too low to accommodate the enhancement layer, the decoded audio signal will be obtained from the base layer which will produce a better quality of the decoded audio than will be the case if unpredictable parts of the coded audio will not reach the decoder.
In the embodiments as defined in any one of the claims 10 to 12, the portions of the information (usually containing sets of parameters, one set for each frequency band represented) in a next frame are coded based on the parameters of the previous frame.
Usually, this reduces the bit rate of the encoded portions of the information, because, due to correlation, the information in two successive frames will not differ substantially.
In the embodiments as defined in claim 13, the difference of the parameters of two successive frames is coded instead of the parameters itself. Prior solutions in audio coders that have been suggested to reduce the bit rate of stereo program material include intensity stereo and M/S stereo.
In the intensity stereo algorithm, high frequencies (typically above 5 kHz) are represented by a single audio signal (i.e., mono) combined with time-varying and frequency- dependent scale factors or intensity factors which allow to recover an decoded audio signal which resembles the original stereo signal for these frequency regions. In the M/S algorithm, the signal is decomposed into a sum (or mid, or common) signal and a difference (or side, or uncommon) signal. This decomposition is sometimes combined with principle component analysis or time- varying scale factors. These signals are then coded independently, either by a transform coder or sub-band coder [which are both waveform coders]. The amount of information reduction achieved by this algorithm strongly depends on the spatial properties of the source signal. For example, if the source signal is monaural, the difference signal is zero and can be discarded. However, if the correlation of the left and right audio signals is low (which is often the case for the higher frequency regions), this scheme offers only little bit rate reduction. For the lower frequency regions M/S coding generally provides significant merit.
Parametric descriptions of audio signals have gained interest during the last years, especially in the field of audio coding. It has been shown that transmitting (quantized) parameters that describe audio signals requires only little transmission capacity to re- synthesize a perceptually equal signal at the receiving end. However, current parametric audio coders focus on coding monaural signals, and stereo signals are processed as dual mono signals.
These and other aspects of the invention are apparent from and will be elucidated with reference to the embodiments described hereinafter.
In the drawings:
Fig. 1 shows a block diagram of a multi-channel encoder for stereo audio, Fig. 2 shows a block diagram of a multi-channel decoder for stereo audio, Fig. 3 shows a representation of the encoded data stream,
Fig. 4 shows an embodiment of the frequency ranges in accordance with the invention,
Fig. 5 shows another embodiment of the frequency ranges in accordance with the invention, Fig. 6 shows the determination of the sets of parameters based on parameters in a previous frame in accordance with an embodiment of the invention, Fig. 7 shows a set of parameters,
Fig. 8 shows the differential determination of the parameters of the base layer, and Fig. 9 shows the differential determination of the parameters corresponding to a frequency region of an enhancement layer. Fig. 1 shows a block diagram of a multi-channel encoder. The encoder receives a multi-channel audio signal which is shown as a stereo signal RI, LI and the encoder supplies the encoded multi-channel audio signal EBS.
The down mixer 1 combines the stereo signal or stereo channels RI, LI into a single channel audio signal (also referred to as monaural signal) SC. For example, the down mixer 1 may determine the average of the input audio signals RI, LI.
The encoder 3 encodes the monaural signal SC to obtain an encoded monaural signal ESC. The encoder 3 may be of a known kind, for example, an MPEG coder (MPEG- LII, MPEG-LIII (mp3), or MPEG2-AAC). The parameter determining circuit 2 determines the sets of parameters SI, S2,
... characterizing the information INF based on the input audio signals RI, LI. Optionally, the parameter determining circuit 2 receives the maximum allowable bit rate MBR to only determine the parameter sets SI, S2, ... which when coded by the parameter coder 4, together with the encoded monaural signal ESC do not exceed the maximum allowable bit rate MBR. The encoded parameters are denoted by EIN.
The formatter 5 combines the encoded monaural signal SC and the encoded parameters EIN in a data stream in a desired format to obtain the encoded multi-channel audio signal EBS.
The operation of the encoder is elucidated in more detail in the now following, by way of example, with respect to an embodiment. The multi-channel audio signal LI, RI is encoded in a single monaural signal SC (further also referred to as single channel audio signal). The parameterization of spatial attributes of the multi-channel audio signals LI, RI is performed by the parameter determining circuit 2. The parameters contain information on how to restore the multi-channel audio signal LI, RI from the monaural signal SC. The parameters are usually encoded by the parameter encoder 4 before combining them with the encoded single monaural signal ESC. Thus, for general audio coding applications, these parameters combined with only one monaural audio signal are transmitted or stored. The combined coded signal is the encoded multi-channel audio signal EBS. The transmission or storage capacity necessary to transmit or store the encoded multi-channel audio signal EBS is strongly reduced compared to audio coders that process the multi-channels independently. Nevertheless, the original spatial impression is maintained by the information INF which contains the (sets of) parameters. In particular, the parametric description of multi-channel audio RI, LI is related to a binaural processing model which aims at describing the effective signal processing of the binaural auditory system.
The model splits the incoming audio LI, RI into several band-limited signals, which, preferably, are spaced linearly at an ERB-rate scale. The bandwidth of tliese signals depends on the center frequency, following the ERB-rate. Subsequently, preferably, for every frequency band, the following properties of the incoming signals are analyzed:
- The interaural level difference, or ILD, defined by the relative levels of the band- limited signal stemming from the left and right ears, - The interaural time (or phase) difference ITD (or IPD), defined by the interaural delay
(or phase shift) corresponding to the peak in the interaural cross-correlation function, and
- The (dis)similarity of the waveforms that can not be accounted for by ITDs or ILDs, which can be parameterized by the maximum interaural cross-correlation IC (for example, the value of the cross-correlation at the position of the maximum peak).
The sets SI, S2, ... of the three parameters, one set for each frequency band
FR1, FR2, ..., vary over time. However, since the binaural auditory system is very sluggish in its processing, the update rate of these properties is rather low (typically tens of milliseconds). It may be assumed that the (slowly) time-varying parameters are the only spatial signal properties that the binaural auditory system has available, and that from these time and frequency dependent parameters, the perceived auditory world is reconstructed by higher levels of the auditory system.
Fig. 2 shows a block diagram of a multi-channel decoder. The decoder receives the encoded multi-channel audio signal EBS and supplies the recovered decoded multi-channel audio signal which is shown as a stereo signal RO, LO.
The deformatter 6 retrieves the encoded monaural signal ESC and the encoded parameters EIN' from the data stream EBS. The decoder 7 decodes the encoded monaural signal ESC into the output monaural signal SCO. The decoder 7 may be of any known kind (of course matched to the encoder that has been used), for example, the decoder
7 is an MPEG decoder. The decoder 8 decodes the encoded parameters EIN' into output parameters INO. The demultiplexer 9 recovers the output stereo audio signals LO and RO by applying the parameter sets SI, S2, ... of the output parameters INO on the output monaural signal SCO.
Fig. 3 shows a representation of the encoded data stream. For example, in each frame FI, F2, ..., the data package starts with a header H followed by the coded monaural signal ECS now indicated by A, a first portion PI of the encoded information EIN, a second portion P2 of the encoded information EIN, and a third portion P3 of the encoded information EIN.
If the frame FI, F2, ... only comprises the header H and the coded monaural signal ECS, only the monaural signal SC is transmitted.
As disclosed in EP-A-1107232, the full frequency band in which the input audio signal occurs is divided into a plurality of sub-frequency bands, which together cover the full frequency band. In the terminology in accordance with the invention, the multichannel information INF is encoded in a plurality of parameter sets SI, S2,... one set for each sub-frequency band FR1 , FR2, .... This plurality of parameter sets S 1 , S2, ... is coded in the first portion PI of the encode information EIN. Thus, to transmit a basic level quality multichannel audio signal, the bit stream comprises the header H, the portion A which is the coded monaural signal ECS and the first portion PI .
In the bit stream in accordance with an embodiment of the invention, the first portion PI consists of a single set parameters SI, only. The single set being determined for the full bandwidth FR1. This bit stream which comprises the header H and the portions A and PI provides a basic layer of quality, indicated by BL in Fig. 3.
To support an enhanced quality, further portions P2, P3 of the coded information EIN are present in the bit stream. These further portions form an enhancement layer EL. The bit stream may comprise a single further portion P2 or more than 1 further portion. The further portion P2 preferably comprises a plurality of sets S2, S3,... of parameters, one set for each sub-frequency band FR2, FR3, ..., the sub- frequency bands FR2, FR3, ... preferably covering the full frequency band FR1. The enhanced quality may also be present in a step- wise manner, a first enhancement level is provided by the enhancement layer ELI which comprises the first portion. And a second enhancement layer EL comprises the first enhancement layer ELI and the second enhancement layer EL2 which comprises the portion P3.
The further portion P2 may also comprise a single set S2 of parameters corresponding to a single frequency band FR2 which is a sub-band of the full frequency band FRl. The further portion P2 may also comprise a number of sets of parameters S2, S3, ... which correspond to frequency bands FR2, FR3, ... which together do not cover the complete full frequency band FRl.
The further portion P3 preferably contains parameter sets for frequency bands which sub-divide at least one of the sub-bands of the further portion P2.
This format of the bit stream in accordance with the invention allows at the transmission channel, or at the decoder to scale the quality of the decoded audio signal with the bit rate of the transmission channel, or the decoding complexity of the decoder. For example, if the audio decoder should have a low power consumption, as is important in portable applications, the decoder may have a low complexity and only uses the portions H, A and PI . It would even be possible that the decoder is able to perform more complex operations at a higher power consumption if the user indicates that he desires a higher quality of the decoded audio.
It is also possible that the encoder is aware of the maximum allowable bit rate MBR which may be transmitted via the transmission channel or which may be stored on a storage medium. Now, the encoder is able to decide on how many, if any, further portions PI, P2, ... fit within the maximum allowable bit rate MBR. The encoder codes only these allowable portions PI , P2, ... in the bit stream.
Fig. 4 shows an embodiment of the frequency ranges in accordance with the invention. In this embodiment, the frequency band FRl is equal to the full bandwidth FBW of the multi-channel audio signal LI, RI, and the frequency band FR2 is a sub-frequency band of the full bandwidth FBW.
If these are the only frequency ranges for which parameter sets SI, S2, ... are determined, a single parameter set S 1 is determined for the frequency band FRl and is present in the portion PI , and a single parameter set S2 is determined for the frequency band FR2 and is present in the portion P2. The quality scaling is possible by either using or not using the portion P2.
Fig. 5 shows another embodiment of the frequency ranges in accordance with the invention. In this embodiment, the frequency band FRl is again equal to the full bandwidth FBW, and the sub-frequency bands FR2 and FR3 together cover the full bandwidth FBW. Or said in other words, the frequency band FRl is subdivided into the sub- frequency bands FR2 and FR3.
If these are the only frequency ranges for which parameter sets SI, S2, ... are determined, the portion PI comprises a single parameter set SI determined for de frequency band FRl, and the portion P2 comprises two parameter sets S2 and S3 determined for the frequency band FR2 and FR3, respectively. The quality scaling is possible by either using or not using the portion P2.
Fig. 6 shows the determination of the sets of parameters based on parameters in a previous frame in accordance with an embodiment of the invention.
Fig. 6 shows a data stream which comprises in each frame FI, F2, ... the coded information EIN which comprises the portion PI which is part of the base layer BL and the portion P2 which forms the enhancement layer EL.
In the frame FI, the portion PI comprises a single set of parameters SI which are determined for the full bandwidth FRl . The portion P2, by way of example, comprises four sets of parameters S2, S3, S4, S5 which are determined for the sub-frequency bands FR2, FR3, FR4, FR5, respectively. The four sub-frequency bands FR2, FR3, FR4, FR5 subdivide the frequency band FRl .
In the frame F2 which succeeds the frame FI, the portion PI comprises a single set of parameters S 1 ' which are determined for the full bandwidth FRl and are part of the base layer BL'. The portion P2 comprises four sets of parameters S2', S3', S4', S5' which are again determined for the sub-frequency bands FR2, FR3, FR4, FR5, respectively and which form the enhancement layer EL'.
It is possible to code each of the sets of parameters SI, S2, ... for each one of the frames FI, F2, ... separately. It is also possible to code the sets of parameters of the portion P2 with respect to the parameters of the portion PI. This is indicated by the arrows starting at SI and ending at S2 to S5 in the frame FI. Of course this is also possible in the other frames F2, ... (not shown). In the same manner, it is possible to code the set of parameters SI' with respect to SI. And finally, the sets of parameters S2', S3', S4', S5' may be coded with respect to the sets of parameters S2, S3, S4, S5.
In this manner, the bit rate of the encoded information EIN can be reduced as the redundancy or correlation between sets of parameters Si is used.
Preferably, the new parameters of the new sets of parameters SI', S2', S3', S4', S5' are coded as the difference of their value and the value of the parameters of the previous sets of parameters SI, S2, S3, S4, S5.
At regular time intervals, at least the parameter set SI has to be coded absolutely and not differential to prevent errors to propagate too long.
Fig. 7 shows a set of parameters. Each set of parameters Si may comprise one or more parameters. Usually the parameters are localization cues which provide information about the localization of sound objects in the audio information. Usually the localization cues are the interaural level difference ILD, the interaural time or phase difference ITD or IPD, and the interaural cross-correlation IC. More detailed information on these parameters is provided in the Audio Engineering Society Convention Paper 5574 "Binaural Cue Coding Applied to Stereo and Multi-channel Audio Compression" presented at the 112th Convention 2002 May 10-13 Munich, Germany, by Christof Faller et al.
Fig. 8 shows the differential determination of a parameter of the base layer. The horizontal axis indicates successive frames FI to F5. The vertical axis shows the value PNG of a parameter of the set of parameters SI of the base layer BL. This parameter has the values Al to A5 for the frames FI to F5 respectively. The contribution of this parameter to the bit rate of the coded information EIΝ will decrease if not the actual values A2 to A5 of the parameter are coded but the smaller differences Dl, D2 ,....
Fig. 9 shows the differential determination of the parameters corresponding to a frequency region of an enhancement layer. The horizontal axis indicates two successive frames FI and F2. The vertical axis indicates the values of a particular parameter of the base layer BL and the enhancement layer EL. In this example, the base layer BL comprises the portion PI of information IΝF with a single set of parameters determined for the full frequency range FBW, the particular parameter of the portion PI has the value Al for the frame FI and A2 for the frame F2. The enhancement layer EL comprises the portion P2 of information IΝF with three sets of parameters determined for three respective frequency ranges FR2, FR3, FR4 which together fill the full frequency range FBW. The three particular parameters (for example, the parameter representing the ILD) have a value B11, B12, B13 in the frame FI and a value B21, B22, B23 in the frame F2.
The contribution of these parameters to the bit rate of the coded information EIΝ will decrease if not the actual values Bl 1 to B23 of the particular parameter are coded but the differences Dl 1, D12,..., because these differences can be encoded more efficiently than the actual values.
To summarize, in a preferred embodiment in accordance with the invention, it is proposed to organize the stereo parameter information IΝF such that a base layer BL contains one set of parameters (preferably the time/level difference and the correlation) SI which is determined for the full bandwidth FBW of the multi-channel audio signal LI, RI. The enhancement layer EL contains multiple sets of parameters S2, S3, ... which correspond to subsequent frequency intervals FR2, FR3, ... within the full bandwidth FBW. For bit-rate efficiency, the sets of parameters S2, S3, ... in the enhancement layer EL can be differentially encoded with respect to the set of parameters SI in the base layer BL.
The information INF is encoded in a multi-layered manner to enable a scaling of the decoding quality versus bit rate.
To conclude, in the now following, an preferred embodiment in accordance with the invention is elucidated with respect to program code and its elucidation.
First, for all subframes (the portions PI, P2, ...) in the frames FI, F2, ... the data ESC for the monaural representation SC, the data EIN for the set of stereo parameters SI for the full bandwidth FBW, and the stereo parameters S2, S3, ... for the frequency bins (or regions) FR2, FR3, ... is determined.
The program code is shown at the left hand side, and an elucidation of the program code is provided under description at the right hand side.
code description {
{ for (f = 0; f < nrof_frames; f++) for all frames do:
{ example_mono_frame(f) get data for monaural signal representation (the portion A in Fig. 3)
example_stereo_extension_layer_l(f) get data stereo parameters full bandwidth (the portion PI) example_stereo_extension_layer_2(f) get data stereo parameters frequency bins (the portion P2)
} }
Secondly, depending on the value of the bit refresh_stereo the stereo parameters for the full bandwidth are coded absolutely (the actual value is coded) or the difference with previous values is coded. The following code is valid for the interaural level difference ILD. code description example_stereo_extension_layer_l(f)
{ refresh stereo 1 bit denoting whether or not data is to be absolutely coded or not
if (refresh_stereo == 1) if data is to be coded absolutely
{ ild_global[fJ code the actual interaural intensity difference(ild) for the whole frequency area (global)
} else if not a refresh
{ ild_global_diff[f] code ild with respect to the previous frame }
}
Thirdly, depending on the value of the bit refresh_stereo the stereo parameters for all of the frequency bins are coded absolutely (the actual value is coded) or the difference with the corresponding parameters for the full bandwidth is coded. The following code is valid for the interaural level difference ILD.
code description example_stereo,_extension_layer_2(f)
{ if(refresh_stereo== 1 ) if refresh
{ for(b=0; b<nrof_bins; b++) for all f equency bins { ild_bin[f, b] code the ild in that bin relative to the global value
} } else if no refresh
{ for(b=0; b<nrof_bins; b++) for all bins
{ ild_bin_diff[f, b] code the ild within a particular bin relative to the value in that bin in the previous frame
} }
Wherein:
The term "refresh_stereo" is a flag denoting whether or not the stereo parameters should be refreshed (0 = FALSE, 1 = TRUE).
The term "ild_global[sf]" represents the Huffman encoded absolute representation level of the ILD for the whole frequency area for frame f.
The term "ild_global_diff[f]" represents the Huffman encoded relative representation level of the ILD for the whole frequency area for frame f.
The term "ild_bin[f, b]" represents the Huffman encoded absolute representation level of the ILD for frame f and bin b. The term "ild_bin_diff[f, b]" represents the Huffman encoded relative representation level of the ILD for frame f and bin b.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design many alternative embodiments without departing from the scope of the appended claims. Although the invention is elucidated in the Figs, with respect to a stereo signal, the extension to a more than two channel audio signal can easily be accomplished by the skilled person.
In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps other than those listed in a claim. The invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the device claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.
In summary, multi-channel audio signals are coded into a monaural audio signal and information allowing to recover the multi-channel audio signal from the monaural audio signal and the information. The information is generated by determining a first portion of the information for a first frequency region of the multi-channel audio signal, and by determining a second portion of the information for a second frequency region of the multichannel audio signal. The second frequency region is a portion of the first frequency region and thus is a sub-range of the first frequency region. The information is multi-layered enabling a scaling of the decoding quality versus bit rate.

Claims

CLAIMS:
1. A method of encoding a multi-channel audio signal comprising at least two audio channels, the method comprising, generating a single channel audio signal and encoding the single channel audio signal into a bit stream as an encoded single channel audio signal, generating information from the at least two audio channels allowing to recover with a required quality level the multi-channel audio signal from the single channel audio signal and the information, the generating of the information comprising, determining a first portion of the information for a first frequency region of the multi-channel audio signal, and encoding the first portion of the information into the bit stream as an encoded first portion of the information, and determining a second portion of the information for a second frequency region of the multi-channel audio signal, the second frequency region being a portion of the first frequency region, and encoding the second portion of the information into the bit stream as an encoded second portion of the information.
2. A method of encoding a multi-channel audio signal comprising at least two audio channels, the method comprising, generating a single channel audio signal, generating information from the at least two audio channels allowing to recover with a required quality level the multi-channel audio signal from the single channel audio signal and the information, the generating of the information comprising, receiving a maximum allowable bit rate of the encoded multi-channel audio signal, and only determining a first portion of the information for a first frequency region of the multi-channel audio signal if a bit rate of the encoded multi-channel audio signal comprising the single channel audio signal and the first portion of the information is not higher than the maximum allowable bit rate.
3. A method of encoding as claimed in claim 1 or 2, wherein the single channel audio signal is a particular combination of the at least two audio channels.
4. A method of encoding as claimed in claim 1, characterized in that the information comprises sets of parameters, the first portion comprises at least a first one of the sets of parameters, the second portion comprises at least a second one of the sets of parameters, wherein each set of parameters is associated with a corresponding frequency region.
5. A method of encoding as claimed in claim 4, characterized in that the sets of parameters comprise at least one localization cue.
6. A method of encoding as claimed in claim 5, characterized in that the at least one localization cue is selected from: an interaural level difference, an interaural time or phase difference, or an interaural cross-correlation.
7. A method of encoding as claimed in claim 1 or 2, characterized in that the first frequency region covers a full bandwidth of the multi-channel audio signal.
8. A method of encoding as claimed in claim 1 , characterized in that the first frequency region substantially covers a full bandwidth of the multi-channel audio signal, the second frequency region covers a portion of the full bandwidth, and in that the determining of the second portion of the information is adapted to determine sets of parameters for both the second frequency region and a set of further frequency regions, the second frequency region and the set of further frequency regions substantially covering the full bandwidth, where in the set of further frequency regions comprises at least one further frequency region.
9. A method of encoding as claimed in claim 8, characterized in that the single channel audio signal and the first portion of the information form a base layer of information which is always present in the encoded multi-channel audio signal, and in that the method comprises receiving a maximum allowable bit rate of the encoded multi-channel audio signal, the second portion of the information forming an enhancement layer of information which is encoded only if the bit rate of the encoded base layer and enhancement layer is not higher than the maximum allowable bit rate.
10. A method of encoding as claimed in claim 4, characterized in that the determining of the first portion of information in a particular frame of encoded information comprises determining the first one of the sets of parameters in the particular frame, and coding the first one of the sets of parameters based on the first one of the sets of parameters of a frame preceding the particular frame.
11. A method of encoding as claimed in claim 8, characterized in that the determining of the second portion of information in a particular frame of the encoded information comprises determining the sets of parameters of the second portion in the particular frame and coding the sets of parameters of the second portion in the particular frame based on the sets of parameters of a frame preceding the particular frame.
12. A method of encoding as claimed in claim 8, characterized in that the determining of the second portion of information in a particular frame of the encoded information comprises determining the sets of parameters of the second portion in the particular frame and coding the sets of parameters of the second portion in the particular frame based on the first one of the sets of parameters of a frame preceding the particular frame.
13. A method of encoding as claimed in any one of the claims 10 to 12, characterized in that the determining comprises calculating a difference between the corresponding parameters in the particular frame and the frame preceding the particular frame.
14. An encoder for coding a multi-channel audio signal comprising at least two audio channels, the encoder comprising: means for generating a single channel audio signal, means for generating information from the at least two audio channels allowing to recover with a required quality level the multi-channel audio signal from the single channel audio signal and the information, the generating of the information comprising, means for determining a first portion of the information for a first frequency region of the multi-channel audio signal, and means for determining a second portion of the information for a second frequency region of the multi-channel audio signal, the second frequency region being a portion of the first frequency region.
15. An encoder for encoding a multi-channel audio signal comprising at least two audio channels, the encoder comprising, means for generating a single channel audio signal, means for generating information from the at least two audio channels allowing to recover with a required quality level the multi-channel audio signal from the single channel audio signal and the information, the generating of the information comprising, means for receiving a maximum allowable bit rate of the encoded multichannel audio signal, and means for only determining a first portion of the information for a first frequency region of the multi-channel audio signal if a bit rate of the encoded multi-channel audio signal comprising the single channel audio signal and the first portion of the information is not higher than the maximum allowable bit rate.
16. An apparatus for supplying an audio signal, the apparatus comprising: an input for receiving an audio signal, an encoder as claimed in claim 14 or 15 for encoding the audio signal to obtain an encoded audio signal, and an output for supplying the encoded audio signal.
17. An encoded audio signal comprising: a single channel audio signal, information from the at least two audio channels allowing to recover with a required quality level the multi-channel audio signal from the single channel audio signal and the information, the information comprising, a first portion of the information for a first frequency region of the multichannel audio signal, and a second portion of the information for a second frequency region of the multichannel audio signal, the second frequency region being a portion of the first frequency region.
18. A storage medium on which the encoded audio signal as claimed in claim 17 has been stored.
19. A method of decoding a multi-channel audio signal being encoded as claimed in claim 17, the method of decoding comprising: obtaining a decoded single channel audio signal, obtaining decoded information from the information allowing to recover the multi-channel audio signal from the decoded single channel audio signal and the decoded information, the decoded information comprises the first portion of the information and the second portion of the information, and applying either the first portion of the information or the first portion and the second portion of the information on the single channel audio signal to generate the decoded multi-channel audio signal.
20. A decoder for decoding an encoded audio signal, the decoder comprising: means for obtaining a decoded single channel audio signal, means for obtaining decoded information from the information allowing to recover the multi-channel audio signal from the decoded single channel audio signal and the decoded information, the decoded information comprises the first portion of the information and the second portion of the information, and means for applying the first portion of the information and the second portion of the information on the single channel audio signal to generate the decoded multi-channel audio signal.
21. An apparatus for supplying a decoded audio signal, the apparatus comprising: an input for receiving an encoded audio signal, a decoder as claimed in claim 20 for decoding the encoded audio signal to obtain a multi-channel output signal, and an output for supplying or reproducing the multi-channel output signal.
PCT/IB2003/001591 2002-04-22 2003-04-22 Parametric multi-channel audio representation WO2003090207A1 (en)

Priority Applications (9)

Application Number Priority Date Filing Date Title
KR1020047017069A KR101021079B1 (en) 2002-04-22 2003-04-22 Parametric multi-channel audio representation
DE60306512T DE60306512T2 (en) 2002-04-22 2003-04-22 PARAMETRIC DESCRIPTION OF MULTI-CHANNEL AUDIO
US10/511,806 US8498422B2 (en) 2002-04-22 2003-04-22 Parametric multi-channel audio representation
EP03712597A EP1500083B1 (en) 2002-04-22 2003-04-22 Parametric multi-channel audio representation
BRPI0304542A BRPI0304542B1 (en) 2002-04-22 2003-04-22 “Method and encoder for encoding a multichannel audio signal, encoded multichannel audio signal, and method and decoder for decoding an encoded multichannel audio signal”
BR0304542-0A BR0304542A (en) 2002-04-22 2003-04-22 Method and encoder for encoding a multichannel audio signal, apparatus for providing an audio signal, encoded audio signal, storage medium, and method and decoder for decoding an audio signal
CN038089777A CN1647156B (en) 2002-04-22 2003-04-22 Parameter coding method, parameter coder, device for providing audio frequency signal, decoding method, decoder, device for providing multi-channel audio signal
AU2003216686A AU2003216686A1 (en) 2002-04-22 2003-04-22 Parametric multi-channel audio representation
JP2003586872A JP4714415B2 (en) 2002-04-22 2003-04-22 Multi-channel audio display with parameters

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
EP02076588.9 2002-04-22
EP02076588 2002-04-22
EP02077869.2 2002-07-16
EP02077869 2002-07-16

Publications (1)

Publication Number Publication Date
WO2003090207A1 true WO2003090207A1 (en) 2003-10-30

Family

ID=29252214

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2003/001591 WO2003090207A1 (en) 2002-04-22 2003-04-22 Parametric multi-channel audio representation

Country Status (11)

Country Link
US (1) US8498422B2 (en)
EP (1) EP1500083B1 (en)
JP (1) JP4714415B2 (en)
KR (1) KR101021079B1 (en)
CN (1) CN1647156B (en)
AT (1) ATE332003T1 (en)
AU (1) AU2003216686A1 (en)
BR (2) BRPI0304542B1 (en)
DE (1) DE60306512T2 (en)
ES (1) ES2268340T3 (en)
WO (1) WO2003090207A1 (en)

Cited By (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005069274A1 (en) * 2004-01-20 2005-07-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal
WO2005083679A1 (en) * 2004-02-17 2005-09-09 Koninklijke Philips Electronics N.V. An audio distribution system, an audio encoder, an audio decoder and methods of operation therefore
WO2005094125A1 (en) * 2004-03-04 2005-10-06 Agere Systems Inc. Frequency-based coding of audio channels in parametric multi-channel coding systems
WO2006003891A1 (en) * 2004-07-02 2006-01-12 Matsushita Electric Industrial Co., Ltd. Audio signal decoding device and audio signal encoding device
US7006636B2 (en) 2002-05-24 2006-02-28 Agere Systems Inc. Coherence-based audio coding and synthesis
JP2006166447A (en) * 2004-12-01 2006-06-22 Samsung Electronics Co Ltd Apparatus and method for processing multi-channel audio signal, compression efficiency improving method and system for processing multi-channel audio signal
WO2006089570A1 (en) * 2005-02-22 2006-08-31 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Near-transparent or transparent multi-channel encoder/decoder scheme
US7116787B2 (en) 2001-05-04 2006-10-03 Agere Systems Inc. Perceptual synthesis of auditory scenes
WO2007004831A1 (en) * 2005-06-30 2007-01-11 Lg Electronics Inc. Method and apparatus for encoding and decoding an audio signal
WO2007013781A1 (en) * 2005-07-29 2007-02-01 Lg Electronics Inc. Method for generating encoded audio signal and method for processing audio signal
KR100773539B1 (en) * 2004-07-14 2007-11-05 삼성전자주식회사 Multi channel audio data encoding/decoding method and apparatus
US7292901B2 (en) 2002-06-24 2007-11-06 Agere Systems Inc. Hybrid multi-channel/cue coding/decoding of audio signals
EP1852850A1 (en) * 2005-02-01 2007-11-07 Matsushita Electric Industrial Co., Ltd. Scalable encoding device and scalable encoding method
EP1851866A1 (en) * 2005-02-23 2007-11-07 TELEFONAKTIEBOLAGET LM ERICSSON (publ) Adaptive bit allocation for multi-channel audio encoding
WO2008000901A1 (en) * 2006-06-30 2008-01-03 Nokia Corporation Scalable audio coding
WO2008039045A1 (en) * 2006-09-29 2008-04-03 Lg Electronics Inc., Apparatus for processing mix signal and method thereof
JP2008512708A (en) * 2004-09-08 2008-04-24 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ Apparatus and method for generating a multi-channel signal or parameter data set
JP2008512890A (en) * 2004-09-06 2008-04-24 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Audio signal enhancement
JPWO2006022308A1 (en) * 2004-08-26 2008-05-08 松下電器産業株式会社 Multi-channel signal encoding apparatus and multi-channel signal decoding apparatus
WO2008060111A1 (en) * 2006-11-15 2008-05-22 Lg Electronics Inc. A method and an apparatus for decoding an audio signal
JPWO2006070751A1 (en) * 2004-12-27 2008-06-12 松下電器産業株式会社 Speech coding apparatus and speech coding method
JPWO2006070757A1 (en) * 2004-12-28 2008-06-12 松下電器産業株式会社 Speech coding apparatus and speech coding method
JP2008522243A (en) * 2004-11-30 2008-06-26 アギア システムズ インコーポレーテッド Synchronization of spatial audio parametric coding with externally supplied downmix
JP2008522214A (en) * 2004-11-29 2008-06-26 ナショナル ユニバーシティ オブ シンガポール Perceptually conscious low-power audio decoder for portable devices
JP2008522244A (en) * 2004-11-30 2008-06-26 アギア システムズ インコーポレーテッド Parametric coding of spatial audio using object-based side information
JP2008532395A (en) * 2005-03-04 2008-08-14 フラウンホーファーゲゼルシャフト ツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー. Apparatus and method for generating an encoded stereo signal of an audio fragment or audio data stream
KR100855561B1 (en) * 2004-04-16 2008-09-01 코딩 테크놀러지스 에이비 Scheme for generating a parametric representation for low-bit rate applications
KR100857118B1 (en) * 2005-10-05 2008-09-05 엘지전자 주식회사 Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7583805B2 (en) 2004-02-12 2009-09-01 Agere Systems Inc. Late reverberation-based synthesis of auditory scenes
US7644001B2 (en) 2002-11-28 2010-01-05 Koninklijke Philips Electronics N.V. Differentially coding an audio signal
US7644003B2 (en) 2001-05-04 2010-01-05 Agere Systems Inc. Cue-based audio coding/decoding
US7653533B2 (en) 2005-10-24 2010-01-26 Lg Electronics Inc. Removing time delays in signal paths
US7693183B2 (en) 2005-07-29 2010-04-06 Lg Electronics Inc. Method for signaling of splitting information
AU2006233512B2 (en) * 2005-04-13 2010-04-22 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Adaptive grouping of parameters for enhanced coding efficiency
US7715569B2 (en) 2006-12-07 2010-05-11 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US7720230B2 (en) 2004-10-20 2010-05-18 Agere Systems, Inc. Individual channel shaping for BCC schemes and the like
US7756701B2 (en) 2005-10-05 2010-07-13 Lg Electronics Inc. Audio signal processing using pilot based coding
US7787631B2 (en) 2004-11-30 2010-08-31 Agere Systems Inc. Parametric coding of spatial audio with cues based on transmitted channels
US7881817B2 (en) 2006-02-23 2011-02-01 Lg Electronics Inc. Method and apparatus for processing an audio signal
EP2291007A1 (en) * 2006-05-04 2011-03-02 LG Electronics Inc. Enhancing audio with remixing capability
EP2296142A2 (en) 2005-08-02 2011-03-16 Dolby Laboratories Licensing Corporation Controlling spatial audio coding parameters as a function of auditory events
US7945449B2 (en) 2004-08-25 2011-05-17 Dolby Laboratories Licensing Corporation Temporal envelope shaping for spatial audio coding using frequency domain wiener filtering
US8015018B2 (en) 2004-08-25 2011-09-06 Dolby Laboratories Licensing Corporation Multichannel decorrelation in spatial audio coding
JP4794448B2 (en) * 2004-08-27 2011-10-19 パナソニック株式会社 Audio encoder
JP4809234B2 (en) * 2004-09-17 2011-11-09 パナソニック株式会社 Audio encoding apparatus, decoding apparatus, method, and program
US8145498B2 (en) 2004-09-03 2012-03-27 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Device and method for generating a coded multi-channel signal and device and method for decoding a coded multi-channel signal
US8180061B2 (en) 2005-07-19 2012-05-15 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
US8184817B2 (en) 2005-09-01 2012-05-22 Panasonic Corporation Multi-channel acoustic signal processing device
US8204261B2 (en) 2004-10-20 2012-06-19 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Diffuse sound shaping for BCC schemes and the like
US8265941B2 (en) 2006-12-07 2012-09-11 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US8391513B2 (en) 2007-10-16 2013-03-05 Panasonic Corporation Stream synthesizing device, decoding unit and method
US20150221319A1 (en) * 2012-09-21 2015-08-06 Dolby International Ab Methods and systems for selecting layers of encoded audio signals for teleconferencing
US9384742B2 (en) 2006-09-29 2016-07-05 Lg Electronics Inc. Methods and apparatuses for encoding and decoding object-based audio signals
US9418667B2 (en) 2006-10-12 2016-08-16 Lg Electronics Inc. Apparatus for processing a mix signal and method thereof
US9626973B2 (en) 2005-02-23 2017-04-18 Telefonaktiebolaget L M Ericsson (Publ) Adaptive bit allocation for multi-channel audio encoding

Families Citing this family (61)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040264713A1 (en) * 2003-06-27 2004-12-30 Robert Grzesek Adaptive audio communication code
SE0400998D0 (en) 2004-04-16 2004-04-16 Cooding Technologies Sweden Ab Method for representing multi-channel audio signals
KR20070056081A (en) * 2004-08-31 2007-05-31 마츠시타 덴끼 산교 가부시키가이샤 Stereo signal generating apparatus and stereo signal generating method
EP1818910A4 (en) * 2004-12-28 2009-11-25 Panasonic Corp Scalable encoding apparatus and scalable encoding method
US7903824B2 (en) * 2005-01-10 2011-03-08 Agere Systems Inc. Compact side information for parametric coding of spatial audio
PL1839297T3 (en) * 2005-01-11 2019-05-31 Koninklijke Philips Nv Scalable encoding/decoding of audio signals
JP4887288B2 (en) * 2005-03-25 2012-02-29 パナソニック株式会社 Speech coding apparatus and speech coding method
US8270439B2 (en) * 2005-07-08 2012-09-18 Activevideo Networks, Inc. Video game system using pre-encoded digital audio mixing
US8074248B2 (en) 2005-07-26 2011-12-06 Activevideo Networks, Inc. System and method for providing video content associated with a source image to a television in a communication network
DE602006019762D1 (en) * 2005-11-04 2011-03-03 Univ Singapore DEVICE AND METHOD FOR PLAYING AUDIO CLIPS
KR100888474B1 (en) * 2005-11-21 2009-03-12 삼성전자주식회사 Apparatus and method for encoding/decoding multichannel audio signal
KR101218776B1 (en) * 2006-01-11 2013-01-18 삼성전자주식회사 Method of generating multi-channel signal from down-mixed signal and computer-readable medium
WO2007088853A1 (en) * 2006-01-31 2007-08-09 Matsushita Electric Industrial Co., Ltd. Audio encoding device, audio decoding device, audio encoding system, audio encoding method, and audio decoding method
CN101390443B (en) * 2006-02-21 2010-12-01 皇家飞利浦电子股份有限公司 Audio encoding and decoding
US8949120B1 (en) 2006-05-25 2015-02-03 Audience, Inc. Adaptive noise cancelation
US9826197B2 (en) 2007-01-12 2017-11-21 Activevideo Networks, Inc. Providing television broadcasts over a managed network and interactive content over an unmanaged network to a client device
EP2632165B1 (en) 2007-01-12 2015-09-30 ActiveVideo Networks, Inc. Interactive encoded content system including object models for viewing on a remote device
KR101453732B1 (en) 2007-04-16 2014-10-24 삼성전자주식회사 Method and apparatus for encoding and decoding stereo signal and multi-channel signal
US8521540B2 (en) * 2007-08-17 2013-08-27 Qualcomm Incorporated Encoding and/or decoding digital signals using a permutation value
KR101464977B1 (en) * 2007-10-01 2014-11-25 삼성전자주식회사 Method of managing a memory and Method and apparatus of decoding multi channel data
US8665914B2 (en) * 2008-03-14 2014-03-04 Nec Corporation Signal analysis/control system and method, signal control apparatus and method, and program
JP5773124B2 (en) * 2008-04-21 2015-09-02 日本電気株式会社 Signal analysis control and signal control system, apparatus, method and program
WO2009135532A1 (en) * 2008-05-09 2009-11-12 Nokia Corporation An apparatus
US8233629B2 (en) * 2008-09-04 2012-07-31 Dts, Inc. Interaural time delay restoration system and method
WO2010036059A2 (en) * 2008-09-25 2010-04-01 Lg Electronics Inc. A method and an apparatus for processing a signal
EP2169664A3 (en) * 2008-09-25 2010-04-07 LG Electronics Inc. A method and an apparatus for processing a signal
EP2169665B1 (en) * 2008-09-25 2018-05-02 LG Electronics Inc. A method and an apparatus for processing a signal
KR20100035121A (en) * 2008-09-25 2010-04-02 엘지전자 주식회사 A method and an apparatus for processing a signal
JP5309944B2 (en) * 2008-12-11 2013-10-09 富士通株式会社 Audio decoding apparatus, method, and program
EP2214162A1 (en) * 2009-01-28 2010-08-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Upmixer, method and computer program for upmixing a downmix audio signal
US8666752B2 (en) * 2009-03-18 2014-03-04 Samsung Electronics Co., Ltd. Apparatus and method for encoding and decoding multi-channel signal
US7975063B2 (en) * 2009-05-10 2011-07-05 Vantrix Corporation Informative data streaming server
CN101556799B (en) 2009-05-14 2013-08-28 华为技术有限公司 Audio decoding method and audio decoder
US8194862B2 (en) * 2009-07-31 2012-06-05 Activevideo Networks, Inc. Video game system with mixing of independent pre-encoded digital audio bitstreams
US8718290B2 (en) 2010-01-26 2014-05-06 Audience, Inc. Adaptive noise reduction using level cues
US8473287B2 (en) 2010-04-19 2013-06-25 Audience, Inc. Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system
US9378754B1 (en) * 2010-04-28 2016-06-28 Knowles Electronics, Llc Adaptive spatial classifier for multi-microphone systems
AU2011315950B2 (en) 2010-10-14 2015-09-03 Activevideo Networks, Inc. Streaming digital video between video devices using a cable television system
EP2477418B1 (en) * 2011-01-12 2014-06-04 Nxp B.V. Signal processing method
EP2695388B1 (en) 2011-04-07 2017-06-07 ActiveVideo Networks, Inc. Reduction of latency in video distribution networks using adaptive bit rates
KR101930907B1 (en) * 2011-05-30 2019-03-12 삼성전자주식회사 Method for audio signal processing, audio apparatus thereof, and electronic apparatus thereof
WO2013106390A1 (en) 2012-01-09 2013-07-18 Activevideo Networks, Inc. Rendering of an interactive lean-backward user interface on a television
US9800945B2 (en) 2012-04-03 2017-10-24 Activevideo Networks, Inc. Class-based intelligent multiplexing over unmanaged networks
JP2015517121A (en) * 2012-04-05 2015-06-18 ホアウェイ・テクノロジーズ・カンパニー・リミテッド Inter-channel difference estimation method and spatial audio encoding device
US9123084B2 (en) 2012-04-12 2015-09-01 Activevideo Networks, Inc. Graphical application integration with MPEG objects
TWI505262B (en) 2012-05-15 2015-10-21 Dolby Int Ab Efficient encoding and decoding of multi-channel audio signal with multiple substreams
WO2014013294A1 (en) 2012-07-19 2014-01-23 Nokia Corporation Stereo audio signal encoder
WO2014145921A1 (en) 2013-03-15 2014-09-18 Activevideo Networks, Inc. A multiple-mode system and method for providing user selectable video content
EP2976768A4 (en) * 2013-03-20 2016-11-09 Nokia Technologies Oy Audio signal encoder comprising a multi-channel parameter selector
US9219922B2 (en) 2013-06-06 2015-12-22 Activevideo Networks, Inc. System and method for exploiting scene graph information in construction of an encoded video sequence
WO2014197879A1 (en) 2013-06-06 2014-12-11 Activevideo Networks, Inc. Overlay rendering of user interface onto source video
US9294785B2 (en) 2013-06-06 2016-03-22 Activevideo Networks, Inc. System and method for exploiting scene graph information in construction of an encoded video sequence
WO2015104447A1 (en) 2014-01-13 2015-07-16 Nokia Technologies Oy Multi-channel audio signal classifier
KR102244612B1 (en) * 2014-04-21 2021-04-26 삼성전자주식회사 Appratus and method for transmitting and receiving voice data in wireless communication system
US9788029B2 (en) 2014-04-25 2017-10-10 Activevideo Networks, Inc. Intelligent multiplexing using class-based, multi-dimensioned decision logic for managed networks
US10553228B2 (en) * 2015-04-07 2020-02-04 Dolby International Ab Audio coding with range extension
CN107358961B (en) * 2016-05-10 2021-09-17 华为技术有限公司 Coding method and coder for multi-channel signal
CN107358960B (en) * 2016-05-10 2021-10-26 华为技术有限公司 Coding method and coder for multi-channel signal
CN107731238B (en) * 2016-08-10 2021-07-16 华为技术有限公司 Coding method and coder for multi-channel signal
US10063612B2 (en) * 2016-09-30 2018-08-28 Amazon Technologies, Inc. Request-based encoding for streaming content portions
JP7092050B2 (en) * 2019-01-17 2022-06-28 日本電信電話株式会社 Multipoint control methods, devices and programs

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5890125A (en) * 1997-07-16 1999-03-30 Dolby Laboratories Licensing Corporation Method and apparatus for encoding and decoding multiple audio channels at low bit rates using adaptive selection of encoding method
EP1107232A2 (en) * 1999-12-03 2001-06-13 Lucent Technologies Inc. Joint stereo coding of audio signals

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5583962A (en) * 1991-01-08 1996-12-10 Dolby Laboratories Licensing Corporation Encoder/decoder for multidimensional sound fields
DE4409368A1 (en) * 1994-03-18 1995-09-21 Fraunhofer Ges Forschung Method for encoding multiple audio signals
IT1281001B1 (en) * 1995-10-27 1998-02-11 Cselt Centro Studi Lab Telecom PROCEDURE AND EQUIPMENT FOR CODING, HANDLING AND DECODING AUDIO SIGNALS.
US5812971A (en) * 1996-03-22 1998-09-22 Lucent Technologies Inc. Enhanced joint stereo coding method using temporal envelope shaping
JPH09274500A (en) 1996-04-09 1997-10-21 Matsushita Electric Ind Co Ltd Coding method of digital audio signals
EP0990368B1 (en) * 1997-05-08 2002-04-24 STMicroelectronics Asia Pacific Pte Ltd. Method and apparatus for frequency-domain downmixing with block-switch forcing for audio decoding functions
SE0202159D0 (en) * 2001-07-10 2002-07-09 Coding Technologies Sweden Ab Efficientand scalable parametric stereo coding for low bitrate applications
EP1442453B1 (en) * 2001-10-19 2006-09-06 Koninklijke Philips Electronics N.V. Frequency-differential encoding of sinusoidal model parameters
EP1440433B1 (en) * 2001-11-02 2005-05-04 Matsushita Electric Industrial Co., Ltd. Audio encoding and decoding device
US6934677B2 (en) * 2001-12-14 2005-08-23 Microsoft Corporation Quantization matrices based on critical band pattern information for digital audio wherein quantization bands differ from critical bands
DE60307252T2 (en) * 2002-04-11 2007-07-19 Matsushita Electric Industrial Co., Ltd., Kadoma DEVICES, METHODS AND PROGRAMS FOR CODING AND DECODING

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5890125A (en) * 1997-07-16 1999-03-30 Dolby Laboratories Licensing Corporation Method and apparatus for encoding and decoding multiple audio channels at low bit rates using adaptive selection of encoding method
EP1107232A2 (en) * 1999-12-03 2001-06-13 Lucent Technologies Inc. Joint stereo coding of audio signals

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
BOSI M ET AL: "ISO/IEC MPEG-2 ADVANCED AUDIO CODING", JOURNAL OF THE AUDIO ENGINEERING SOCIETY, AUDIO ENGINEERING SOCIETY. NEW YORK, US, vol. 45, no. 10, 1 October 1997 (1997-10-01), pages 789 - 812, XP000730161, ISSN: 0004-7554 *
FALLER C ET AL: "Efficient representation of spatial audio using perceptual parametrization", IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, XX, XX, 21 October 2001 (2001-10-21), pages 199 - 202, XP002245584 *
VAN DER WAAL R G ET AL: "Subband coding of stereophonic digital audio signals", SPEECH PROCESSING 2, VLSI, UNDERWATER SIGNAL PROCESSING. TORONTO, MAY 14 - 17, 1991, INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH & SIGNAL PROCESSING. ICASSP, NEW YORK, IEEE, US, vol. 2 CONF. 16, 14 April 1991 (1991-04-14), pages 3601 - 3604, XP010043648, ISBN: 0-7803-0003-3 *

Cited By (127)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7693721B2 (en) 2001-05-04 2010-04-06 Agere Systems Inc. Hybrid multi-channel/cue coding/decoding of audio signals
US8200500B2 (en) 2001-05-04 2012-06-12 Agere Systems Inc. Cue-based audio coding/decoding
US7644003B2 (en) 2001-05-04 2010-01-05 Agere Systems Inc. Cue-based audio coding/decoding
US7941320B2 (en) 2001-05-04 2011-05-10 Agere Systems, Inc. Cue-based audio coding/decoding
US7116787B2 (en) 2001-05-04 2006-10-03 Agere Systems Inc. Perceptual synthesis of auditory scenes
US7006636B2 (en) 2002-05-24 2006-02-28 Agere Systems Inc. Coherence-based audio coding and synthesis
US7292901B2 (en) 2002-06-24 2007-11-06 Agere Systems Inc. Hybrid multi-channel/cue coding/decoding of audio signals
US7644001B2 (en) 2002-11-28 2010-01-05 Koninklijke Philips Electronics N.V. Differentially coding an audio signal
US7394903B2 (en) 2004-01-20 2008-07-01 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal
JP2007519349A (en) * 2004-01-20 2007-07-12 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ Apparatus and method for constructing a multi-channel output signal or apparatus and method for generating a downmix signal
WO2005069274A1 (en) * 2004-01-20 2005-07-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal
NO337395B1 (en) * 2004-01-20 2016-04-04 Fraunhofer Ges Forschung Build-up of multi-channel output and generation of down-mix signal
US7583805B2 (en) 2004-02-12 2009-09-01 Agere Systems Inc. Late reverberation-based synthesis of auditory scenes
WO2005083679A1 (en) * 2004-02-17 2005-09-09 Koninklijke Philips Electronics N.V. An audio distribution system, an audio encoder, an audio decoder and methods of operation therefore
NO340421B1 (en) * 2004-03-04 2017-04-18 Fraunhofer Ges Forschung Frequency-based coding of audio channels in parametric multi-channel coding system
KR100717598B1 (en) * 2004-03-04 2007-05-15 에이저 시스템즈 인크 Frequency-based coding of audio channels in parametric multi-channel coding systems
US7805313B2 (en) 2004-03-04 2010-09-28 Agere Systems Inc. Frequency-based coding of channels in parametric multi-channel coding systems
WO2005094125A1 (en) * 2004-03-04 2005-10-06 Agere Systems Inc. Frequency-based coding of audio channels in parametric multi-channel coding systems
AU2005226536B2 (en) * 2004-03-04 2008-09-04 Dolby Laboratories Licensing Corporation Frequency-based coding of audio channels in parametric multi-channel coding systems
KR100855561B1 (en) * 2004-04-16 2008-09-01 코딩 테크놀러지스 에이비 Scheme for generating a parametric representation for low-bit rate applications
KR101120911B1 (en) 2004-07-02 2012-02-27 파나소닉 주식회사 Audio signal decoding device and audio signal encoding device
JP4934427B2 (en) * 2004-07-02 2012-05-16 パナソニック株式会社 Speech signal decoding apparatus and speech signal encoding apparatus
WO2006003891A1 (en) * 2004-07-02 2006-01-12 Matsushita Electric Industrial Co., Ltd. Audio signal decoding device and audio signal encoding device
US7756713B2 (en) 2004-07-02 2010-07-13 Panasonic Corporation Audio signal decoding device which decodes a downmix channel signal and audio signal encoding device which encodes audio channel signals together with spatial audio information
JPWO2006003891A1 (en) * 2004-07-02 2008-04-17 松下電器産業株式会社 Speech signal decoding apparatus and speech signal encoding apparatus
KR100773539B1 (en) * 2004-07-14 2007-11-05 삼성전자주식회사 Multi channel audio data encoding/decoding method and apparatus
US8255211B2 (en) 2004-08-25 2012-08-28 Dolby Laboratories Licensing Corporation Temporal envelope shaping for spatial audio coding using frequency domain wiener filtering
EP3279893A1 (en) 2004-08-25 2018-02-07 Dolby Laboratories Licensing Corporation Temporal envelope shaping for spatial audio coding using frequency domain wiener filtering
EP3940697A1 (en) 2004-08-25 2022-01-19 Dolby Laboratories Licensing Corp. Temporal envelope shaping for spatial audio coding using frequency domain wiener filtering
EP4036914A1 (en) 2004-08-25 2022-08-03 Dolby Laboratories Licensing Corporation Temporal envelope shaping for spatial audio coding using frequency domain wiener filtering
US8015018B2 (en) 2004-08-25 2011-09-06 Dolby Laboratories Licensing Corporation Multichannel decorrelation in spatial audio coding
US7945449B2 (en) 2004-08-25 2011-05-17 Dolby Laboratories Licensing Corporation Temporal envelope shaping for spatial audio coding using frequency domain wiener filtering
JPWO2006022308A1 (en) * 2004-08-26 2008-05-08 松下電器産業株式会社 Multi-channel signal encoding apparatus and multi-channel signal decoding apparatus
JP4794448B2 (en) * 2004-08-27 2011-10-19 パナソニック株式会社 Audio encoder
US8145498B2 (en) 2004-09-03 2012-03-27 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Device and method for generating a coded multi-channel signal and device and method for decoding a coded multi-channel signal
JP2008512890A (en) * 2004-09-06 2008-04-24 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Audio signal enhancement
JP2008512708A (en) * 2004-09-08 2008-04-24 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ Apparatus and method for generating a multi-channel signal or parameter data set
US8731204B2 (en) 2004-09-08 2014-05-20 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Device and method for generating a multi-channel signal or a parameter data set
JP4809234B2 (en) * 2004-09-17 2011-11-09 パナソニック株式会社 Audio encoding apparatus, decoding apparatus, method, and program
US7720230B2 (en) 2004-10-20 2010-05-18 Agere Systems, Inc. Individual channel shaping for BCC schemes and the like
US8204261B2 (en) 2004-10-20 2012-06-19 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Diffuse sound shaping for BCC schemes and the like
US8238562B2 (en) 2004-10-20 2012-08-07 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Diffuse sound shaping for BCC schemes and the like
JP2008522214A (en) * 2004-11-29 2008-06-26 ナショナル ユニバーシティ オブ シンガポール Perceptually conscious low-power audio decoder for portable devices
JP2008522244A (en) * 2004-11-30 2008-06-26 アギア システムズ インコーポレーテッド Parametric coding of spatial audio using object-based side information
US7761304B2 (en) 2004-11-30 2010-07-20 Agere Systems Inc. Synchronizing parametric coding of spatial audio with externally provided downmix
US7787631B2 (en) 2004-11-30 2010-08-31 Agere Systems Inc. Parametric coding of spatial audio with cues based on transmitted channels
JP2008522243A (en) * 2004-11-30 2008-06-26 アギア システムズ インコーポレーテッド Synchronization of spatial audio parametric coding with externally supplied downmix
US8340306B2 (en) 2004-11-30 2012-12-25 Agere Systems Llc Parametric coding of spatial audio with object-based side information
US8824690B2 (en) 2004-12-01 2014-09-02 Samsung Electronics Co., Ltd. Apparatus and method for processing multi-channel audio signal using space information
US9552820B2 (en) 2004-12-01 2017-01-24 Samsung Electronics Co., Ltd. Apparatus and method for processing multi-channel audio signal using space information
JP2013251919A (en) * 2004-12-01 2013-12-12 Samsung Electronics Co Ltd Multi-channel audio signal processor, multi-channel audio signal processing method, compression efficiency improving method, and multi-channel audio signal processing system
JP2012070428A (en) * 2004-12-01 2012-04-05 Samsung Electronics Co Ltd Multi-channel audio signal processor, multi-channel audio signal processing method, compression efficiency improving method, and multi-channel audio signal processing system
US9232334B2 (en) 2004-12-01 2016-01-05 Samsung Electronics Co., Ltd. Apparatus and method for processing multi-channel audio signal using space information
JP2006166447A (en) * 2004-12-01 2006-06-22 Samsung Electronics Co Ltd Apparatus and method for processing multi-channel audio signal, compression efficiency improving method and system for processing multi-channel audio signal
JPWO2006070751A1 (en) * 2004-12-27 2008-06-12 松下電器産業株式会社 Speech coding apparatus and speech coding method
JP5046652B2 (en) * 2004-12-27 2012-10-10 パナソニック株式会社 Speech coding apparatus and speech coding method
JPWO2006070757A1 (en) * 2004-12-28 2008-06-12 松下電器産業株式会社 Speech coding apparatus and speech coding method
EP1852850A1 (en) * 2005-02-01 2007-11-07 Matsushita Electric Industrial Co., Ltd. Scalable encoding device and scalable encoding method
US8036390B2 (en) 2005-02-01 2011-10-11 Panasonic Corporation Scalable encoding device and scalable encoding method
EP1852850A4 (en) * 2005-02-01 2011-02-16 Panasonic Corp Scalable encoding device and scalable encoding method
NO339907B1 (en) * 2005-02-22 2017-02-13 Fraunhofer Ges Forschung Near transparent or transparent multichannel coding / decoding system
US7573912B2 (en) 2005-02-22 2009-08-11 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschunng E.V. Near-transparent or transparent multi-channel encoder/decoder scheme
WO2006089570A1 (en) * 2005-02-22 2006-08-31 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Near-transparent or transparent multi-channel encoder/decoder scheme
KR100954179B1 (en) 2005-02-22 2010-04-21 프라운호퍼-게젤샤프트 츄어 푀르더룽 데어 안게반텐 포르슝에.파우. Near-transparent or transparent multi-channel encoder/decoder scheme
US9626973B2 (en) 2005-02-23 2017-04-18 Telefonaktiebolaget L M Ericsson (Publ) Adaptive bit allocation for multi-channel audio encoding
EP1851866A4 (en) * 2005-02-23 2010-05-19 Ericsson Telefon Ab L M Adaptive bit allocation for multi-channel audio encoding
EP1851866A1 (en) * 2005-02-23 2007-11-07 TELEFONAKTIEBOLAGET LM ERICSSON (publ) Adaptive bit allocation for multi-channel audio encoding
US8553895B2 (en) 2005-03-04 2013-10-08 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Device and method for generating an encoded stereo signal of an audio piece or audio datastream
JP2008532395A (en) * 2005-03-04 2008-08-14 フラウンホーファーゲゼルシャフト ツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー. Apparatus and method for generating an encoded stereo signal of an audio fragment or audio data stream
AU2006233512C1 (en) * 2005-04-13 2012-01-12 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Adaptive grouping of parameters for enhanced coding efficiency
AU2006233512B8 (en) * 2005-04-13 2010-12-23 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Adaptive grouping of parameters for enhanced coding efficiency
AU2006233512B2 (en) * 2005-04-13 2010-04-22 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Adaptive grouping of parameters for enhanced coding efficiency
WO2007004831A1 (en) * 2005-06-30 2007-01-11 Lg Electronics Inc. Method and apparatus for encoding and decoding an audio signal
AU2006266579B2 (en) * 2005-06-30 2009-10-22 Lg Electronics Inc. Method and apparatus for encoding and decoding an audio signal
US8180061B2 (en) 2005-07-19 2012-05-15 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
US7761177B2 (en) 2005-07-29 2010-07-20 Lg Electronics Inc. Method for generating encoded audio signal and method for processing audio signal
WO2007013775A1 (en) * 2005-07-29 2007-02-01 Lg Electronics Inc. Mehtod for generating encoded audio signal and method for processing audio signal
KR100888970B1 (en) * 2005-07-29 2009-03-17 엘지전자 주식회사 Mehtod for generating encoded audio signal and method for processing audio signal
KR100841332B1 (en) * 2005-07-29 2008-06-25 엘지전자 주식회사 Method for signaling of splitting in-formation
US7706905B2 (en) 2005-07-29 2010-04-27 Lg Electronics Inc. Method for processing audio signal
WO2007013780A1 (en) * 2005-07-29 2007-02-01 Lg Electronics Inc. Method for signaling of splitting information
KR101162218B1 (en) 2005-07-29 2012-07-04 엘지전자 주식회사 Method for generating encoded audio signal and method for processing audio signal
WO2007013783A1 (en) * 2005-07-29 2007-02-01 Lg Electronics Inc. Method for processing audio signal
KR100857103B1 (en) * 2005-07-29 2008-09-08 엘지전자 주식회사 Method for processing audio signal
US7702407B2 (en) 2005-07-29 2010-04-20 Lg Electronics Inc. Method for generating encoded audio signal and method for processing audio signal
US7693183B2 (en) 2005-07-29 2010-04-06 Lg Electronics Inc. Method for signaling of splitting information
KR100857102B1 (en) * 2005-07-29 2008-09-08 엘지전자 주식회사 Method for generating encoded audio signal and method for processing audio signal
WO2007013784A1 (en) * 2005-07-29 2007-02-01 Lg Electronics Inc. Method for generating encoded audio signal amd method for processing audio signal
KR100857104B1 (en) * 2005-07-29 2008-09-05 엘지전자 주식회사 Method for generating encoded audio signal and method for processing audio signal
WO2007013781A1 (en) * 2005-07-29 2007-02-01 Lg Electronics Inc. Method for generating encoded audio signal and method for processing audio signal
US7693706B2 (en) 2005-07-29 2010-04-06 Lg Electronics Inc. Method for generating encoded audio signal and method for processing audio signal
EP2296142A2 (en) 2005-08-02 2011-03-16 Dolby Laboratories Licensing Corporation Controlling spatial audio coding parameters as a function of auditory events
US8184817B2 (en) 2005-09-01 2012-05-22 Panasonic Corporation Multi-channel acoustic signal processing device
KR100857118B1 (en) * 2005-10-05 2008-09-05 엘지전자 주식회사 Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7756701B2 (en) 2005-10-05 2010-07-13 Lg Electronics Inc. Audio signal processing using pilot based coding
US7653533B2 (en) 2005-10-24 2010-01-26 Lg Electronics Inc. Removing time delays in signal paths
US8095357B2 (en) 2005-10-24 2012-01-10 Lg Electronics Inc. Removing time delays in signal paths
US8095358B2 (en) 2005-10-24 2012-01-10 Lg Electronics Inc. Removing time delays in signal paths
US7991495B2 (en) 2006-02-23 2011-08-02 Lg Electronics Inc. Method and apparatus for processing an audio signal
US7881817B2 (en) 2006-02-23 2011-02-01 Lg Electronics Inc. Method and apparatus for processing an audio signal
US7991494B2 (en) 2006-02-23 2011-08-02 Lg Electronics Inc. Method and apparatus for processing an audio signal
US8213641B2 (en) 2006-05-04 2012-07-03 Lg Electronics Inc. Enhancing audio with remix capability
EP2291007A1 (en) * 2006-05-04 2011-03-02 LG Electronics Inc. Enhancing audio with remixing capability
WO2008000901A1 (en) * 2006-06-30 2008-01-03 Nokia Corporation Scalable audio coding
US9792918B2 (en) 2006-09-29 2017-10-17 Lg Electronics Inc. Methods and apparatuses for encoding and decoding object-based audio signals
US9384742B2 (en) 2006-09-29 2016-07-05 Lg Electronics Inc. Methods and apparatuses for encoding and decoding object-based audio signals
WO2008039045A1 (en) * 2006-09-29 2008-04-03 Lg Electronics Inc., Apparatus for processing mix signal and method thereof
CN101652810B (en) * 2006-09-29 2012-04-11 Lg电子株式会社 Apparatus for processing mix signal and method thereof
US9418667B2 (en) 2006-10-12 2016-08-16 Lg Electronics Inc. Apparatus for processing a mix signal and method thereof
US7672744B2 (en) 2006-11-15 2010-03-02 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
WO2008060111A1 (en) * 2006-11-15 2008-05-22 Lg Electronics Inc. A method and an apparatus for decoding an audio signal
AU2007320218B2 (en) * 2006-11-15 2010-08-12 Lg Electronics Inc. A method and an apparatus for decoding an audio signal
US7715569B2 (en) 2006-12-07 2010-05-11 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US8428267B2 (en) 2006-12-07 2013-04-23 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US8340325B2 (en) 2006-12-07 2012-12-25 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US7986788B2 (en) 2006-12-07 2011-07-26 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US8311227B2 (en) 2006-12-07 2012-11-13 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US8265941B2 (en) 2006-12-07 2012-09-11 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US8488797B2 (en) 2006-12-07 2013-07-16 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US8005229B2 (en) 2006-12-07 2011-08-23 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US7783051B2 (en) 2006-12-07 2010-08-24 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US7783049B2 (en) 2006-12-07 2010-08-24 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US7783048B2 (en) 2006-12-07 2010-08-24 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US7783050B2 (en) 2006-12-07 2010-08-24 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US8391513B2 (en) 2007-10-16 2013-03-05 Panasonic Corporation Stream synthesizing device, decoding unit and method
US9858936B2 (en) * 2012-09-21 2018-01-02 Dolby Laboratories Licensing Corporation Methods and systems for selecting layers of encoded audio signals for teleconferencing
US20150221319A1 (en) * 2012-09-21 2015-08-06 Dolby International Ab Methods and systems for selecting layers of encoded audio signals for teleconferencing

Also Published As

Publication number Publication date
KR101021079B1 (en) 2011-03-14
DE60306512D1 (en) 2006-08-10
EP1500083A1 (en) 2005-01-26
JP4714415B2 (en) 2011-06-29
ATE332003T1 (en) 2006-07-15
AU2003216686A1 (en) 2003-11-03
US8498422B2 (en) 2013-07-30
ES2268340T3 (en) 2007-03-16
US20050226426A1 (en) 2005-10-13
CN1647156A (en) 2005-07-27
CN1647156B (en) 2010-05-26
BR0304542A (en) 2004-07-20
BRPI0304542B1 (en) 2018-05-08
DE60306512T2 (en) 2007-06-21
JP2005523479A (en) 2005-08-04
EP1500083B1 (en) 2006-06-28
KR20040102163A (en) 2004-12-03

Similar Documents

Publication Publication Date Title
EP1500083B1 (en) Parametric multi-channel audio representation
US8359113B2 (en) Method and an apparatus for processing an audio signal
CN105580073B (en) Audio decoder, audio encoder, method, and computer-readable storage medium
KR101315077B1 (en) Scalable multi-channel audio coding
EP2461321B1 (en) Coding device and decoding device
US7848931B2 (en) Audio encoder
JP2021503628A (en) Devices and methods for encoding or decoding directional audio coding parameters using quantization and entropy coding
WO2003069954A2 (en) Parametric audio coding
KR20050021484A (en) Audio coding
KR20070001139A (en) An audio distribution system, an audio encoder, an audio decoder and methods of operation therefore
US7644001B2 (en) Differentially coding an audio signal
US8194754B2 (en) Method for processing a signal and apparatus for processing a signal
Chon et al. Variable bit quantization for virtual source location information in spatial audio coding
Breebaart et al. 19th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007
PNS Sascha Disch1, Jürgen Herre1

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2003712597

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 10511806

Country of ref document: US

Ref document number: 2353/CHENP/2004

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 20038089777

Country of ref document: CN

Ref document number: 2003586872

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 1020047017069

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 1020047017069

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2003712597

Country of ref document: EP

WWG Wipo information: grant in national office

Ref document number: 2003712597

Country of ref document: EP