WO2008069593A1 - A method and an apparatus for processing an audio signal - Google Patents

A method and an apparatus for processing an audio signal Download PDF

Info

Publication number
WO2008069593A1
WO2008069593A1 PCT/KR2007/006315 KR2007006315W WO2008069593A1 WO 2008069593 A1 WO2008069593 A1 WO 2008069593A1 KR 2007006315 W KR2007006315 W KR 2007006315W WO 2008069593 A1 WO2008069593 A1 WO 2008069593A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
channel
signal
downmix
decoder
Prior art date
Application number
PCT/KR2007/006315
Other languages
French (fr)
Inventor
Hyen O Oh
Yang Won Jung
Original Assignee
Lg Electronics Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lg Electronics Inc. filed Critical Lg Electronics Inc.
Priority to CN2007800454197A priority Critical patent/CN101553868B/en
Priority to KR1020097014216A priority patent/KR101128815B1/en
Priority to EP07851286.0A priority patent/EP2122612B1/en
Priority to JP2009540163A priority patent/JP5209637B2/en
Publication of WO2008069593A1 publication Critical patent/WO2008069593A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03Application of parametric coding in stereophonic audio systems

Definitions

  • the present invention relates to a method and an apparatus for processing an audio signal, and more particularly, to a method and an apparatus for decoding an audio signal received on a digital medium., as a broadcast signal, and so on.
  • an object parameter must be converted flexibly to a multi-channel parameter required in upmixing process.
  • the present invention is directed to a method and an apparatus for processing an audio signal that substantially obviates one or more problems due to limitations and disadvantages of the related art.
  • the present invention provides the following effects or advantages. First of all, the present invention is able to provide a method and an apparatus for processing an audio signal to control object gain and panning unrestrictedly.
  • the present invention is able to provide a method and an apparatus for processing an audio signal to control object gain and panning based on user selection.
  • FIG. 1 is an exemplary block diagram to explain to basic concept of rendering a downmix signal based on playback configuration and user control.
  • FIG. 2 is an exemplary block diagram of an apparatus for processing an audio signal according to one embodiment of the present invention corresponding to the first scheme.
  • FIG. 3 is an exemplary block diagram of an apparatus for processing an audio signal according to another embodiment of the present invention corresponding to the first scheme.
  • FIG. 4 is an exemplary block diagram of an apparatus for processing an audio signal according to one embodiment of present invention corresponding to the second scheme.
  • FIG. 5 is an exemplary block diagram of an apparatus for processing an audio signal according to another embodiment of present invention corresponding to the second scheme.
  • FIG. 6 is an exemplary block diagram of an apparatus for processing an audio signal according to the other embodiment of present invention corresponding to the second scheme.
  • FIG. 7 is an exemplary block diagram of an apparatus for processing an audio signal according to one embodiment of the present invention corresponding to the third scheme.
  • FIG. 8 is an exemplary block diagram of an apparatus for processing an audio signal according to another embodiment of the present invention corresponding to the third scheme.
  • FIG. 9 is an exemplary block diagram to explain to basic concept of rendering unit.
  • FIGS. 1OA to 1OC are exemplary block diagrams of a first embodiment of a downmix processing unit illustrated in FIG. 7.
  • FIG. 11 is an exemplary block diagram of a second embodiment of a downmix processing unit illustrated in FIG. 7.
  • FIG. 12 is an exemplary block diagram of a third embodiment of a downmix processing unit illustrated in FIG. 7.
  • FIG. 13 is an exemplary block diagram of a fourth embodiment of a downmix processing unit illustrated in FIG. 7.
  • FIG. 14 is an exemplary block diagram of a bitstream structure of a compressed audio signal according to a second embodiment of present invention.
  • FIG. 15 is an exemplary block diagram of an apparatus for processing an audio signal according to a second embodiment of present invention.
  • FIG. 16 is an exemplary block diagram of a bitstream structure of a compressed audio signal according to a third embodiment of present invention.
  • FIG. 17 is an exemplary block diagram of an apparatus for processing an audio signal according to a fourth embodiment of present invention.
  • FIG. 18 is an exemplary block diagram to explain transmitting scheme for variable type of object.
  • FIG. 19 is an exemplary block diagram to an apparatus for processing an audio signal according to a fifth embodiment of present invention.
  • a method for processing an audio signal comprising: receiving a downmix signal, a first multichannel information, and an object information; processing the downmix signal using the object information and a mix information; and, transmitting one of the first multi-channel information and a second multi-channel information according to the mix information, wherein the second channel information is generated using the object information and the mix information.
  • the downmix signal contains a plural channel and a plural object.
  • the first multi-channel information is applied to the downmix signal to generate a plural channel signal.
  • the object information corresponds to an information for controlling the plural object.
  • the mix information includes a mode information indicating whether the first multi-channel information is applied to the processed downmix.
  • processing the downmix signal comprising: determining a processing scheme according to the mode information; and, processing the dowmmix signal using the object information and using the mix information according to the determined processing scheme.
  • the transmitting one of the first multi-channel information and a second multi-channel information is performed according to the mode information included in the mix information.
  • the present invention further comprising, generating a multichannel signal using the processed downmix signal and one of the first multi- channel information and the second multi-channel information.
  • the receiving a downmix signal, a first multi-channel information, an object information, and a mix information comprising: receiving the downmix signal and, a bitstream including the first multi-channel information and the object information; and, extracting the multi- channel information and the object information from the received bitstream.
  • the downmix signal is received as a broadcast signal.
  • a computer-readable medium having instructions stored thereon, which, when executed by a processor, causes the processor to perform operations, comprising: receiving a downmix signal, a first multi-channel information, and an object information; processing the downmix signal using the object information and a mix information; and, transmitting one of the first multi-channel information and a second multi-channel information according to the mix information, wherein the second channel information is generated using the object information and the mix information.
  • an apparatus for processing an audio signal comprising: a bitstream de-multiplexer receiving a downmix signal, a first multi-channel information, and an object information; and, an object decoder processing the downmix signal using the object information and a mix information, and transmitting one of the first multi-channel information and a second multichannel information according to the mix information, wherein the second channel information is generated using the object information and the mix information.
  • a data structure of audio signal comprising: a downmix signal having a plural object and a plural channel; an object information for controlling the plural object; and, a multi-channel information for decoding the plural channel, wherein the object information includes an object parameter, and the multi-channel information includes at least one of channel level information and channel correlation information.
  • 'parameter' in the following description means information including values, parameters of narrow sense, coefficients, elements, and so on.
  • 'parameter' term will be used instead of 'information' term like an object parameter, a mix parameter, a downmix processing parameter, and so on, which does not put limitation on the present invention.
  • an object parameter and a spatial parameter can be extracted.
  • a decoder can generate output signal using a downmix signal and the object parameter (or the spatial parameter).
  • the output signal may be rendered based on playback configuration and user control by the decoder. The rendering process shall be explained in details with reference to the FIG. 1 as follow.
  • FIG. 1 is an exemplary diagram to explain to basic concept of rendering downmix based on playback configuration and user control.
  • a decoder 100 may include a rendering information generating unit 110 and a rendering unit 120, and also may include a renderer 110a and a synthesis 120a instead of the rendering information generating unit 110 and the rendering unit 120.
  • a rendering information generating unit 110 can be configured to receive a side information including an object parameter or a spatial parameter from an encoder, and also to receive a playback configuration or a user control from a device setting or a user interface.
  • the object parameter may correspond to a parameter extracted in downmixing at least one object signal
  • the spatial parameter may correspond to a parameter extracted in downmixing at least one channel signal.
  • type information and characteristic information for each object may be included in the side information. Type information and characteristic information may describe instrument name, player name, and so on.
  • the playback configuration may include speaker position and ambient information (speaker's virtual position), and the user control may correspond to a control information inputted by a user in order to control object positions and object gains, and also may correspond to a control information in order to the playback configuration.
  • the payback configuration and user control can be represented as a mix information, which does not put limitation on the present invention.
  • a rendering information generating unit 110 can be configured to generate a rendering information using a mix information (the playback configuration and user control) and the received side information.
  • a rendering unit 120 can configured to generate a multi-channel parameter using the rendering information in case that the downmix of an audio signal (abbreviated 'downmix signal') is not transmitted, and generate multi-channel signals using the rendering information and downmix in case that the downmix of an audio signal is transmitted.
  • a renderer HOa can be configured to generate multi-channel signals using a mix information (the playback configuration and the user control) and the received side information.
  • a synthesis 120a can be configured to synthesis the multi-channel signals using the multi-channel signals generated by the renderer HOa.
  • the decoder may render the downmix signal based on playback configuration and user control.
  • a decoder can receive an object parameter as a side information and control object panning and object gain based on the transmitted object parameter.
  • Variable methods for controlling the individual object signals may be provided. First of all, in case that a decoder receives an object parameter and generates the individual object signals using the object parameter, then, can control the individual object signals base on a mix information (the playback configuration, the object level, etc.)
  • the multi-channel decoder can upmix a downmix signal received from an encoder using the multi-channel parameter.
  • the above-mention second method may be classified into three types of scheme. In particular, 1) using a conventional multi-channel decoder, 2) modifying a multichannel decoder, 3) processing downmix of audio signals before being inputted to a multi-channel decoder may be provided.
  • the conventional multi-channel decoder may correspond to a channel-oriented spatial audio coding (ex: MPEG Surround decoder), which does not put limitation on the present invention. Details of three types of scheme shall be explained as follow. 1.1 Using a multi-channel decoder
  • FIG. 2 is an exemplary block diagram of an apparatus for processing an audio signal according to one embodiment of the present invention corresponding to first scheme.
  • an apparatus for processing an audio signal 200 may include an information generating unit 210 and a multi-channel decoder 230.
  • the information generating unit 210 may receive a side information including an object parameter from an encoder and a mix information from a user interface, and may generate a multi-channel parameter including a arbitrary downmix gain or a gain modification gain(hereinafter simple 'ADG').
  • the ADG may describe a ratio of a first gain estimated based on the mix information and the obejct information over a second gain extimated based on the object information.
  • the information generating unit 210 may generate the ADG only if the downmix signal corresponds to a mono signal.
  • the multi-channel decoder 230 may receive a downmix of an audio signal from an encoder and a multi-channel parameter from the information generating unit 210, and may generate a multi-channel output using the downmix signal and the multi-channel parameter.
  • the multi-channel parameter may include a channel level difference (hereinafter abbreviated 'CLD'), an inter channel correlation (hereinafter abbreviated TCC), a channel prediction coefficient (hereinafter abbreviated 'CPC).
  • 'CLD' channel level difference
  • TCC inter channel correlation
  • 'CPC channel prediction coefficient
  • CLD CLD
  • ICC CPC
  • CPC CLD
  • ICC CPC
  • CPC C-PC
  • intensity difference or correlation between two channels It is able to control object positions and object diffuseness (sonority) using the CLD, the ICC, etc.
  • the CLD describe the relative level difference instead of the absolute level, and energy of the splitted two channels is conserved. Therefore it is unable to control object gains by handling CLD, etc. In other words, specific object cannot be mute or volume up by using the CLD, etc.
  • the ADG describes time and frequency dependent gain for controlling correction factor by a user. If this correction factor be applied, it is able to handle modification of down-mix signal prior to a multi-channel upmixing. Therefore, in case that ADG parameter is received from the information generating unit 210, the multi-channel decoder 230 can control object gains of specific time and frequency using the ADG parameter.
  • a case that the received stereo downmix signal outputs as a stereo channel can be defined the following formula 1.
  • x[] is input channels
  • y[] is output channels
  • g x gains
  • w xx weight.
  • W12 and W2i may be a cross-talk component (in other words, cross-term).
  • the above-mentioned case corresponds to 2-2-2 configuration, which means
  • 2-channel input, 2-channel transmission, and 2-channel output In order to perform the 2-2-2 configuration, 5-2-5 configuration (2-channel input, 5-channel transmission, and 2 channel output) of conventional channel-oriented spatial audio coding (ex: MPEG surround) can be used. At first, in order to output 2 channels for 2-2-2 configuration, certain channel among 5 output channels of 5-2-5 configuration can be set to a disable channel (a fake channel). In order to give cross-talk between 2-transmitted channels and 2-output channels, the above-mentioned CLD and CPC may be adjusted. In brief, gain factor g x in the formula 1 is obtained using the above mentioned ADG, and weighting factor w ⁇ W22 in the formula 1 is obtained using CLD and CPC.
  • default mode of conventional spatial audio coding may be applied. Since characteristic of default CLD is supposed to output 2-channel, it is able to reduce computing amount if the default CLD is applied. Particularly, since there is no need to synthesis a fake channel, it is able to reduce computing amount largely. Therefore, applying the default mode is proper. In particular, only default CLD of 3 CLDs (corresponding to 0, 1, and 2 in MPEG surround standard) is used for decoding. On the other hand, 4 CLDs among left channel, right channel, and center channel (corresponding to 3, 4, 5, and 6 in MPEG surround standard) and 2 ADGs (corresponding to 7 and 8 in MPEG surround standard) is generated for controlling object.
  • 3 CLDs corresponding to 0, 1, and 2 in MPEG surround standard
  • 4 CLDs among left channel, right channel, and center channel corresponding to 3, 4, 5, and 6 in MPEG surround standard
  • 2 ADGs corresponding to 7 and 8 in MPEG surround standard
  • CLDs corresponding 3 and 5 describe channel level difference between left channel plus right channel and center channel ((l+r)/c) is proper to set to 15OdB (approximately infinite) in order to mute center channel.
  • energy based up-mix or prediction based up-mix may be performed, which is invoked in case that TTT mode ('bsTttModeLow' in the MPEG surround standard) corresponds to energy-based mode (with subtraction, matrix compatibility enabled) (3 rd mode), or prediction mode (1 st mode or 2 nd mode).
  • FIG. 3 is an exemplary block diagram of an apparatus for processing an audio signal according to another embodiment of the present invention corresponding to first scheme.
  • an apparatus for processing an audio signal according to another embodiment of the present invention 300 may include a information generating unit 310, a scene rendering unit 320, a multi-channel decoder 330, and a scene remixing unit 350.
  • the information generating unit 310 can be configured to receive a side information including an object parameter from an encoder if the downmix signal corresponds to mono channel signal (i.e., the number of downmix channel is 'V), may receive a mix information from a user interface, and may generate a multi- channel parameter using the side information and the mix information.
  • the number of downmix channel can be estimated based on a flag information included in the side information as well as the downmix signal itself and user selection.
  • the information generating unit 310 may have the same configuration of the former information generating unit 210.
  • the multi-channel parameter is inputted to the multi-channel decoder 330, the multi-channel decoder 330 may have the same configuration of the former multi-channel decoder 230.
  • the scene rendering unit 320 can be configured to receive a side information including an object parameter from and encoder if the downmix signal corresponds to non-mono channel signal (i.e., the number of downmix channel is more than '2'), may receive a mix information from a user interface, and may generate a remixing parameter using the side information and the mix information.
  • the remixing parameter corresponds to a parameter in order to remix a stereo channel and generate more than 2-channel outputs.
  • the remixing parameter is inputted to the scene remixing unit 350.
  • the scene remixing unit 350 can be configured to remix the downmix signal using the remixing parameter if the downmix signal is more than 2-channel signal.
  • Second scheme may modify a conventional multi-channel decoder.
  • a case of using virtual output for controlling object gains and a case of modifying a device setting for controlling object panning shall be explained with reference to FIG. 4 as follow.
  • a case of Performing TBT(2x2) functionality in a multi-channel decoder shall be explained with reference to FIG. 5.
  • FIG. 4 is an exemplary block diagram of an apparatus for processing an audio signal according to one embodiment of present invention corresponding to the second scheme. Referring to FIG.
  • an apparatus for processing an audio signal according to one embodiment of present invention corresponding to the second scheme 400 may include an information generating unit 410, an internal multi-channel synthesis 420, and an output mapping unit 430.
  • the internal multi-channel synthesis 420 and the output mapping unit 430 may be included in a synthesis unit.
  • the information generating unit 410 can be configured to receive a side information including an object parameter from an encoder, and a mix parameter from a user interface. And the information generating unit 410 can be configured to generate a multi-channel parameter and a device setting information using the side information and the mix information.
  • the multi-channel parameter may have the same configuration of the former multi-channel parameter. So, details of the multi- channel parameter shall be omitted in the following description.
  • the device setting information may correspond to parameterized HRTF for binaural processing, which shall be explained in the description of '1.2.2 Using a device setting information'.
  • the internal multi-channel synthesis 420 can be configured to receive a multi-channel parameter and a device setting information from the parameter generation unit 410 and downmix signal from an encoder.
  • the internal multichannel synthesis 420 can be configured to generate a temporal multi-channel output including a virtual output, which shall be explained in the description of '1.2.1 Using a virtual output'. 1.2.1 Using a virtual output
  • multi-channel parameter can control object panning, it is hard to control object gain as well as object panning by a conventional multichannel decoder.
  • the decoder 400 may map relative energy of object to a virtual channel (ex: center channel).
  • the relative energy of object corresponds to energy to be reduced.
  • the decoder 400 may map more than 99.9% of object energy to a virtual channel.
  • the decoder 400 (especially, the output mapping unit 430) does not output the virtual channel to which the rest energy of object is mapped. In conclusion, if more than 99.9% of object is mapped to a virtual channel which is not outputted, the desired object can be almost mute.
  • the decoder 400 can adjust a device setting information in order to control object panning and object gain.
  • the decoder can be configured to generate a parameterized HRTF for binaural processing in MPEG Surround standard.
  • the parameterized HRTF can be variable according to device setting. It is able to assume that object signals can be controlled according to the following formula 2.
  • Rnew bl * ⁇ bji + b 2 * ⁇ bJ2 + b3 * ⁇ bJ3 + .. + b n * ⁇ bjn, where objk is object signals, L ne w and R ne w is a desired stereo signal, and ak and bk are coefficients for object control.
  • An object information of the object signals objk may be estimated from an object parameter included in the transmitted side information.
  • the coefficients ak, bk which are defined according to object gain and object panning may be estimated from the mix information.
  • the desired object gain and object panning can be adjusted using the coefficients ak, bk.
  • the coefficients ak, bk can be set to correspond to HRTF parameter for binaural processing, which shall be explained in details as follow.
  • HRTF parameter for binaural processing In MPEG Surround standard (5-l-5i configuration) (from ISO/ IEC FDIS 23003-1 :2006(E), Information Technology - MPEG Audio Technologies - Parti: MPEG Surround), binaural processing is as below.
  • FIG. 5 is an exemplary block diagram of an apparatus for processing an audio signal according to another embodiment of present invention corresponding to the second scheme.
  • FIG. 5 is an exemplary block diagram of TBT functionality in a multi-channel decoder.
  • a TBT module 510 can be configured to receive input signals and a TBT control information, and generate output signals.
  • the TBT module 510 may be included in the decoder 200 of the FIG. 2 (or in particular, the multi-channel decoder 230).
  • the multi-channel decoder 230 may be implemented according to the MPEG Surround standard, which does not put limitation on the present invention.
  • the TBT control information inputted in the TBT module 510 includes elements which can compose the weight w (W 1 I, W 12 , W2i, W22).
  • TBT (2x2) module 510 (hereinafter abbreviated 'TBT module 510') may be provided.
  • the TBT module 510 may can be figured to receive a stereo signal and output the remixed stereo signal.
  • the weight w may be composed using CLD(s) and ICC(s).
  • the decoder may control object gain as well as object panning using the received weight term.
  • variable scheme may be provided.
  • a TBT control information includes cross term like the W12 and W21.
  • a TBT control information does not include the cross term like the W12 and W21.
  • the number of the term as a TBT control information varies adaptively.
  • the terms which number is NxM may be transmitted as TBT control information.
  • the terms can be quantized based on a CLD parameter quantization table introduced in a MPEG Surround, which does not put limitation on the present invention.
  • left object is shifted to right position, (i.e. when left object is moved to more left position or left position adjacent to center position, or when only level of the object is adjusted), there is no need to use the cross term. In the case, it is proper that the term except for the cross term is transmitted.
  • N input channels and M output channels the terms which number is just N may be transmitted.
  • the number of the TBT control information varies adaptively according to need of cross term in order to reduce the bit rate of a TBT control information.
  • a flag information 'cross_flag' indicating whether the cross term is present or not is set to be transmitted as a TBT control information. Meaning of the flag information 'crossjQag' is shown in the following table 1. [table 1] meaning of cross_flag
  • the TBT control information does not include the cross term, only the non-cross term like the W 1 I and W22 is present. Otherwise ( / cross_flag / is equal to 1), the TBT control information includes the cross term.
  • flag information / reverse_flag' indicating whether cross term is present or non-cross term is present is set to be transmitted as a TBT control information. Meaning of flag information / reverse_flag' is shown in the following table 2.
  • the TBT control information does not include the cross term, only the non-cross term like the Wi 1 and W22 is present. Otherwise ( / reverse_flag' is equal to 1), the TBT control information includes only the cross term.
  • Futhermore a flag information 'side_flag' indicating whether cross term is present and non-cross is present is set to be transmitted as a TBT control information. Meaning of flag information / side_flag / is shown in the following table
  • FIG. 6 is an exemplary block diagram of an apparatus for processing an audio signal according to the other embodiment of present invention corresponding to the second scheme.
  • an apparatus for processing an audio signal 630 shown in the FIG. 6 may correspond to a binaural decoder included in the multi-channel decoder 230 of FIG. 2 or the synthesis unit of FIG. 4, which does not put limitation on the present invention.
  • An apparatus for processing an audio signal 630 may include a QMF analysis 632, a parameter conversion 634, a spatial synthesis 636, and a QMF synthesis 638.
  • Elements of the binaural decoder 630 may have the same configuration of MPEG Surround binaural decoder in MPEG Surround standard.
  • the spatial synthesis 636 can be configured to consist of 1 2x2 (filter) matrix, according to the following formula 10: [formula 10] with yo being the QMF-domain input channels and VB being the binaural output channels, k represents the hybrid QMF channel index, and i is the HRTF filter tap index, and n is the QMF slot index.
  • the binaural decoder 630 can be configured to perform the above-mentioned functionality described in subclause '1.2.2 Using a device setting information'. However, the elements hij may be generated using a multi-channel parameter and a mix information instead of a multi-channel parameter and HRTF parameter. In this case, the binaural decoder 600 can perform the functionality of the TBT module 510 in the FIG. 5. Details of the elements of the binaural decoder 630 shall be omitted.
  • the binaural decoder 630 can be operated according to a flag information 'binauraLflag'. In particular, the binaural decoder 630 can be skipped in case that a flag information binaural_flag is O', otherwise (the binaural_flag is 'Y), the binaural decoder 630 can be operated as below.
  • FIG. 7 is an exemplary block diagram of an apparatus for processing an audio signal according to one embodiment of the present invention corresponding to the third scheme.
  • FIG. 8 is an exemplary block diagram of an apparatus for processing an audio signal according to another embodiment of the present invention corresponding to the third scheme.
  • an apparatus for processing an audio signal 700 may include an information generating unit 710, a downmix processing unit 720, and a multi-channel decoder 730.
  • an apparatus for processing an audio signal 800 (hereinafter simply 'a decoder 800') may include an information generating unit 810 and a multi-channel synthesis unit 840 having a multi-channel decoder 830.
  • the decoder 800 may be another aspect of the decoder 700.
  • the information generating unit 810 has the same configuration of the information generating unit 710
  • the multi-channel decoder 830 has the same configuration of the multi-channel decoder 730
  • the multi-channel synthesis unit 840 may has the same configuration of the downmix processing unit 720 and multi-channel unit 730. Therefore, elements of the decoder 700 shall be explained in details, but details of elements of the decoder 800 shall be omitted.
  • the information generating unit 710 can be configured to receive a side information including an object parameter from an encoder and a mix information from an user-interface, and to generate a multi-channel parameter to be outputted to the multi-channel decoder 730. From this point of view, the information generating unit 710 has the same configuration of the former information generating unit 210 of FIG. 2.
  • the downmix processing parameter may correspond to a parameter for controlling object gain and object panning. For example, it is able to change either the object position or the object gain in case that the object signal is located at both left channel and right channel. It is also able to render the object signal to be located at opposite position in case that the object signal is located at only one of left channel and right channel.
  • the downmix processing unit 720 can be a TBT module (2x2 matrix operation).
  • the information generating unit 710 can be configured to generate ADG described with reference to FIG 2.
  • the downmix processing parameter may include parameter for controlling object panning but object gain.
  • the information generating unit 710 can be configured to receive HRTF information from HRTF database, and to generate an extra multichannel parameter including a HRTF parameter to be inputted to the multi-channel decoder 730.
  • the information generating unit 710 may generate multichannel parameter and extra multi-channel parameter in the same subband domain and transmit in syncronization with each other to the multi-channel decoder 730.
  • the extra multi-channel parameter including the HRTF parameter shall be explained in details in subclause '3. Processing Binaural Mode'.
  • the downmix processing unit 720 can be configured to receive downmix of an audio signal from an encoder and the downmix processing parameter from the information generating unit 710, and to decompose a subband domain signal using subband analysis filter bank.
  • the downmix processing unit 720 can be configured to generate the processed downmix signal using the downmix signal and the downmix processing parameter. In these processing, it is able to pre-process the downmix signal in order to control object panning and object gain.
  • the processed downmix signal may be inputted to the multi-channel decoder 730 to be upmixed. Furthermore, the processed downmix signal may be outputted and play backed via speaker as well.
  • the downmix processing unit 720 may perform synthesis filterbank using the prepossed subband domain signal and output a time-domain PCM signal. It is able to select whether to directly output as PCM signal or input to the multi- channel decoder by user selection.
  • the multi-channel decoder 730 can be configured to generate multi-channel output signal using the processed downmix and the multi-channel parameter.
  • the multi-channel decoder 730 may introduce a delay when the processed downmix signal and the multi-channel parameter are inputted in the multi-channel decoder 730.
  • the processed downmix signal can be synthesized in frequency domain (ex: QMF domain, hybrid QMF domain, etc), and the multi-channel parameter can be synthesized in time domain.
  • delay and synchronization for connecting HE-AAC is introduced. Therefore, the multichannel decoder 730 may introduce the delay according to MPEG Surround standard.
  • the configuration of downmix processing unit 720 shall be explained in detail with reference to FIG. 9 ⁇ FIG. 13.
  • FIG. 9 is an exemplary block diagram to explain to basic concept of rendering unit.
  • a rendering module 900 can be configured to generate M output signals using N input signals, a playback configuration, and a user control.
  • the N input signals may correspond to either object signals or channel signals.
  • the N input signals may correspond to either object parameter or multi-channel parameter.
  • Configuration of the rendering module 900 can be implemented in one of downmix processing unit 720 of FIG. 7, the former rendering unit 120 of FIG. 1, and the former renderer 110a of FIG. 1, which does not put limitation on the present invention.
  • the rendering module 900 can be configured to directly generate M channel signals using N object signals without summing individual object signals corresponding certain channel, the configuration of the rendering module 900 can be represented the following formula 11.
  • Ci is a i ⁇ channel signal
  • Oj is j* input signal
  • R j i is a matrix mapping j* input signal to i Ul channel.
  • cij_i is gain portion mapped to j* channel
  • ⁇ kj is gain portion mapped to k ⁇ > channel
  • is diffuseness level
  • D(o/) is de-correlated output.
  • weight values for all inputs mapped to certain channel are estimated according to the above-stated method, it is able to obtain weight values for each channel by the following method.
  • the dominant channel pair may correspond to left channel and center channel in case that certain input is positioned at point between left and center.
  • FIGS. 1OA to 1OC are exemplary block diagrams of a first embodiment of a downmix processing unit illustrated in FIG. 7.
  • a first embodiment of a downmix processing unit 720a (hereinafter simply 'a downmix processing unit 720a') may be implementation of rendering module 900.
  • the downmix processing unit according to the formula 15 is illustrated FIG. 1OA.
  • a downmix processing unit 720a can be configured to bypass input signal in case of mono input signal (m), and to process input signal in case of stereo input signal (L, R).
  • the downmix processing unit 720a may include a de-correlating part 722a and a mixing part 724a.
  • the de-correlating part 722a has a de-correlator aD and de-correlator bD which can be configured to de-correlate input signal.
  • the de-correlating part 722a may correspond to a 2x2 matrix.
  • the mixing part 724a can be configured to map input signal and the de-correlated signal to each channel.
  • the mixing part 724a may correspond to a 2x4 matrix.
  • the downmix processing unit according to the formula 15 is illustrated FIG. 1OB.
  • a de-correlating part 722' including two de-correlators Di, D 2 can be configured to generate de-correlated signals Di(a*Oi+b*O 2 ) / D 2 (c*Oi+d*O 2 ).
  • the downmix processing unit according to the formula 15 is illustrated FIG. IOC.
  • a de-correlating part 722" including two de-correlators Di, D 2 can be configured to generate de-correlated signals Di(Oi), D 2 (O 2 ).
  • downmix processing unit includes a mixing part corresponding to 2x3 matrix
  • the foregoing formula 15 can be represented as follow: [formula 16]
  • the matrix R is a 2x3 matrix
  • the matrix O is a 3x1 matrix
  • the C is a 2x1 matrix.
  • FIG. 11 is an exemplary block diagram of a second embodiment of a downmix processing unit illustrated in FIG. 7.
  • a second embodiment of a downmix processing unit 720b (hereinafter simply 'a downmix processing unit 720b') may be implementation of rendering module 900 like the downmix processing unit 720a.
  • a downmix processing unit 720b can be configured to skip input signal in case of mono input signal (m), and to process input signal in case of stereo input signal (L, R).
  • the downmix processing unit 720b may include a de-correlating part 722b and a mixing part 724b.
  • the de- correlating part 722b has a de-correlator D which can be configured to de-correlate input signal O 1 , O2 and output the de-correlated signal O(O ⁇ +Oi).
  • the de- correlating part 722b may correspond to a 1x2 matrix.
  • the mixing part 724b can be configured to map input signal and the de-correlated signal to each channel.
  • the mixing part 724b may correspond to a 2x3 matrix which can be shown as a matrix R in the formula 16.
  • the de-correlating part 722b can be configured to de-correlate a difference signal O1-O2 as common signal of two input signal O 1 , O2.
  • the mixing part 724b can be configured to map input signal and the de-correlated common signal to each channel.
  • a case that downmix processing unit includes a mixing part with several matrixes
  • Certain object signal can be audible as a similar impression anywhere without being positioned at a specified position, which may be called as a 'spatial sound signal'.
  • a 'spatial sound signal' For example, applause or noises of a concert hall can be an example of the spatial sound signal.
  • the spatial sound signal needs to be playback via all speakers. If the spatial sound signal playbacks as the same signal via all speakers, it is hard to feel spatialness of the signal because of high inter-correlation (IC) of the signal. Hence, there's need to add correlated signal to the signal of each channel signal.
  • IC inter-correlation
  • FIG. 12 is an exemplary block diagram of a third embodiment of a downmix processing unit illustrated in FIG. 7.
  • a third embodiment of a downmix processing unit 720c (hereinafter simply 'a downmix processing unit 720c') can be configured to generate spatial sound signal using input signal Oi, which may include a de-correlating part 722c with N de-correlators and a mixing part 724c.
  • the de-correlating part 722c may have N de-correlators Di, D2, • •• , DN which can be configured to de-correlate the input signal O,.
  • the mixing part 724c may have N matrix Rj, Rk, "", Ri which can be configured to generate output signals C ]V Ck, • • -, Ci using the input signal O/ and the de-correlated signal Dx(O/).
  • the R j N matrix Rj, Rk, "", Ri which can be configured to generate output signals C ]V Ck, • • -, Ci using the input signal O/
  • Oi is i* input signal
  • R/ is a matrix mapping i fll input signal O/ to ⁇ channel
  • Cj_i is j 1 * 1 output signal.
  • the ⁇ j_i value is de-correlation rate.
  • the ⁇ jj value can be estimated base on ICC included in multi-channel
  • the mixing part 724c can generate output signals base on
  • the number of de-correlators (N) can be equal to the number of output
  • the de-correlated signal can be added to output
  • channels selected by user For example, it is able to position certain spatial sound
  • FIG. 13 is an exemplary block diagram of a fourth embodiment of a downmix processing unit illustrated in FIG. 7.
  • a fourth embodiment of a downmix processing unit 72Od (hereinafter simply 'a downmix processing unit 72Od') can be configured to bypass if the input signal corresponds to a mono signal (m).
  • the downmix processing unit 72Od includes a further downmixing part 722d which can be configured to downmix the stereo signal to be mono signal if the input signal corresponds to a stereo signal.
  • the further downmixed mono channel (m) is used as input to the multi-channel decoder 730.
  • the multi-channel decoder 730 can control object panning (especially cross-talk) by using the mono input signal.
  • the information generating unit 710 may generate a multi-channel parameter base on 5-l-5i configuration of MPEG Surround standard.
  • the ADG may be generated by the information generating unit 710 based on mix information.
  • FIG. 14 is an exemplary block diagram of a bitstream structure of a compressed audio signal according to a second embodiment of present invention.
  • FIG. 15 is an exemplary block diagram of an apparatus for processing an audio signal according to a second embodiment of present invention.
  • downmix signal ⁇ , multi-channel parameter ⁇ , and object parameter ⁇ are included in the bitstream structure.
  • the multi-channel parameter ⁇ is a parameter for upmixing the downmix signal.
  • the object parameter ⁇ is a parameter for controlling object panning and object gain.
  • downmix signal ⁇ , a default parameter ⁇ ', and object parameter ⁇ are included in the bitstream structure.
  • the default parameter ⁇ ' may include preset information for controlling object gain and object panning.
  • the preset information may correspond to an example suggested by a producer of an encoder side. For example, preset information may describes that guitar signal is located at a point between left and center, and guitar's level is set to a certain volume, and the number of output channel in this time is set to a certain channel.
  • the default parameter for either each frame or specified frame may be present in the bitstream.
  • Flag information indicating whether default parameter for this frame is different from default parameter of previous frame or not may be present in the bitstream. By including default parameter in the bitstream, it is able to take less bitrates than side information with object parameter is included in the bitstream. Furthermore, header information of the bitstream is omitted in the FIG. 14.
  • an apparatus for processing an audio signal according to a second embodiment of present invention 1000 may include a bitstream de-multiplexer 1005, an information generating unit 1010, a downmix processing unit 1020, and a multil-channel decoder 1030.
  • the de- multiplexer 1005 can be configured to divide the multiplexed audio signal into a downmix ⁇ , a first multi-channel parameter ⁇ , and an object parameter ⁇ .
  • the information generating unit 1010 can be configured to generate a second multichannel parameter using an object parameter ⁇ and a mix parameter.
  • the mix parameter comprises a mode information indicating whether the first multi- channel information ⁇ is applied to the processed downmix.
  • the mode information may corresponds to an information for selecting by a user. According to the mode information, the information generating information 1020 decides whether to transmit the first multi-channel parameter ⁇ or the second multi-channel parameter.
  • the downmix processing unit 1020 can be configured to determining a processing scheme according to the mode information included in the mix information. Furthermore, the downmix processing unit 1020 can be configured to process the downmix ⁇ according to the determined processing scheme. Then the downmix processing unit 1020 transmits the processed downmix to multi-channel decoder 1030.
  • the multi-channel decoder 1030 can be configured to receive either the first multi-channel parameter ⁇ or the second multi-channel parameter. In case that default parameter ⁇ ' is included in the bitstream, the multi-channel decoder 1030 can use the default parameter ⁇ ' instead of multi-channel parameter ⁇ . Then, the multi-channel decoder 1030 can be configured to generate multichannel output using the processed downmix signal and the received multichannel parameter.
  • the multi-channel decoder 1030 may have the same configuration of the former multi-channel decoder 730, which does not put limitation on the present invention.
  • a multi-channel decoder can be operated in a binaural mode. This enables a multi-channel impression over headphones by means of Head Related Transfer
  • HRTF Function
  • FIG. 16 is an exemplary block diagram of an apparatus for processing an audio signal according to a third embodiment of present invention. Referring to
  • an apparatus for processing an audio signal according to a third embodiment may comprise an information generating unit 1110, a downmix processing unit 1120, and a multi-channel decoder 1130 with a sync matching part 1130a.
  • the information generating unit 1110 may have the same configuration of the information generating unit 710 of FIG. 7, with generating dynamic HRTF.
  • the downmix processing unit 1120 may have the same configuration of the downmix processing unit 720 of FIG. 7.
  • the dynamic HRTF describes the relation between object signals and virtual speaker signals corresponding to the HRTF azimuth and elevation angles, which is time-dependent information according to real-time user control.
  • the dynamic HRTF may correspond to one of HTRF filter coefficients itself, parameterized coefficient information, and index information in case that the multi-channel decoder comprise all HRTF filter set.
  • FIG. 17 is an exemplary block diagram of an apparatus for processing an audio signal according to a fourth embodiment of present invention.
  • the apparatus for processing an audio signal according to a fourth embodiment of present invention 1200 may comprise an encoder 1210 at encoder side 1200A, and a rendering unit 1220 and a synthesis unit 1230 at decoder side 1200B.
  • the encoder 1210 can be configured to receive multi-channel object signal and generate a downmix of audio signal and a side information.
  • the rendering unit 1220 can be configured to receive side information from the encoder 1210, playback configuration and user control from a device setting or a user- interface, and generate rendering information using the side information, playback configuration, and user control.
  • the synthesis unit 1230 can be configured to synthesis multi-channel output signal using the rendering information and the received downmix signal from an encoder 1210.
  • the effect-mode is a mode for remixed or reconstructed signal. For example, live mode, club band mode, karaoke mode, etc may be present.
  • the effect-mode information may correspond to a mix parameter set generated by a producer, other user, etc. If the effect-mode information is applied, an end user don't have to control object panning and object gain in full because user can select one of predetermined effect-mode informations.
  • Two methods of generating an effect-mode information can be distinguished.
  • the effect-mode information may be generated automatically at the decoder side. Details of two methods shall be described as follow.
  • the effect-mode information may be generated at an encoder 1200A by a producer.
  • the decoder 1200B can be configured to receive side information including the effect-mode information and output user- interface by which a user can select one of effect-mode informations.
  • the decoder 1200B can be configured to generate output channel base on the selected effect- mode information.
  • the effect-mode information may be generated at a decoder 1200B.
  • the decoder 1200B can be configured to search appropriate effect-mode informations for the downmix signal. Then the decoder 1200B can be configured to select one of the searched effect-mode by itself (automatic adjustment mode) or enable a user to select one of them (user selection mode). Then the decoder 1200B can be configured to obtain object information (number of objects, instrument names, etc) included in side information, and control object based on the selected effect-mode information and the object information.
  • Controlling in a lump means controlling each object simultaneously rather than controlling objects using the same parameter.
  • object corresponding to main melody may be emphasized in case that volume setting of device is low, object corresponding to main melody may be repressed in case that volume setting of device is high.
  • the input signal inputted to an encoder 1200A may be classified into three types as follow.
  • Mono object is most general type of object. It is possible to synthesis internal downmix signal by simply summing objects. It is also possible to synthesis internal downmix signal using object gain and object panning which may be one of user control and provided information. In generating internal downmix signal, it is also possible to generate rendering information using at least one of object characteristic, user input, and information provided with object.
  • multi-channel object it is able to perform the above mentioned method described with mono object and stereo object. Furthermore, it is able to input multi-channel object as a form of MPEG Surround. In this case, it is able to generate object-based downmix (ex: SAOC downmix) using object downmix channel, and use multi-channel information (ex: spatial information in MPEG Surround) for generating multi-channel information and rendering information.
  • object-based downmix (ex: SAOC downmix)
  • object downmix channel and use multi-channel information (ex: spatial information in MPEG Surround) for generating multi-channel information and rendering information.
  • SAOC encoder object- - oriented encoder
  • variable type of object may be transmitted from the encoder 1200A to the decoder. 1200B.
  • Transmitting scheme for variable type of object can be provided as follow:
  • a side information includes information for each object.
  • a side information includes information for 3 objects (A, B, C).
  • the side. information may comprise correlation flag information indicating whether an object is part of a stereo or multi-channel object, for example, mono object, one channel (L or R) of stereo object, and so on.
  • correlation flag information is '0' if mono object is present
  • correlation flag information is 'V if one channel of stereo object is present.
  • correlation flag information for other part of stereo object may be any value (ex: '0', '1', or whatever).
  • correlation flag information for other part of stereo object may be not transmitted.
  • correlation flag information for one part of multi-channel object may be value describing number of multi-channel object.
  • correlation flag information for left channel of 5.1 channel may be '5'
  • correlation flag information for the other channel (R, Lr, Rr, C, LFE) of 5.1 channel may be either O' or not transmitted.
  • Object may have the three kinds of attribute as follows: a) Single object
  • an encoder 1300 includes a grouping unit 1310 and a downmix unit 1320.
  • the grouping unit 1310 can be configured to group at least two objects among inputted multi-object input, base on a grouping information.
  • the grouping information may be generated by producer at encoder side.
  • the downmix unit 1320 can be configured to generate downmix signal using the grouped object generated by the grouping unit 1310.
  • the downmix unit 1320 can be configured to generate a side information for the grouped object.
  • Combination object is an object combined with at least one source. It is possible to control object panning and gain in a lump, but keep relation between combined objects unchanged. For example, in case of drum, it is possible to control drum, but keep relation between base drum, tam-tam, and symbol unchanged. For example, when base drum is located at center point and symbol is located at left point, it is possible to positioning base drum at right point and positioning symbol at point between center and right in case that drum is moved to right direction.
  • Relation information between combined objects may be transmitted to a decoder.
  • decoder can extract the relation information using combination object.
  • Information concerning element of combination object can be generated in either an encoder or a decoder.
  • Information concerning elements from an encoder can be transmitted as a different form from information concerning combination object.
  • the present invention is applicable to encode and decode an audio signal.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereo-Broadcasting Methods (AREA)

Abstract

A method for processing an audio signal, comprising: receiving a downmix signal, a first multi-channel information, and an object information; processing the downmix signal using the object information and a mix information; and, transmitting one of the first multi-channel information and a second multi-channel information according to the mix information, wherein the second channel information is generated using the object information and the mix information is disclosed.

Description

[DESCRIPTION] [Invention Title]
A METHOD AND AN APPARATUS FOR PROCESSING AN AUDIO SIGNAL [Technical Field]
The present invention relates to a method and an apparatus for processing an audio signal, and more particularly, to a method and an apparatus for decoding an audio signal received on a digital medium., as a broadcast signal, and so on.
[Background Art] While downmixing several audio objects to be a mono or stereo signal, parameters from the individual object signals can be extracted. These parameters can be used in a decoder of an audio signal, and repositioning/ panning of the individual sources can be controlled by user' selection.
[Disclosure] [ Technical Problem ]
However, in order to control the individual t object signals, repositioning/ panning of the individual sources included in a downmix signal must be performed suitably. However, for backward compatibility with respect to the channel-oriented decoding method (as a MPEG Surround), an object parameter must be converted flexibly to a multi-channel parameter required in upmixing process.
[Technical Solution] Accordingly, the present invention is directed to a method and an apparatus for processing an audio signal that substantially obviates one or more problems due to limitations and disadvantages of the related art.
An object of the present invention is to provide a method and an apparatus for processing an audio signal to control object gain and panning unrestrictedly. Another object of the present invention is to provide a method and an apparatus for processing an audio signal to control object gain and panning based on user selection.
Additional advantages, objects, and features of the invention will be set forth in part in the description which follows and in part will become apparent to those having ordinary skill in the art upon examination of the following or may be learned from practice of the invention. The objectives and other advantages of the invention may be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
[Advantageous Effects] The present invention provides the following effects or advantages. First of all, the present invention is able to provide a method and an apparatus for processing an audio signal to control object gain and panning unrestrictedly.
Secondly, the present invention is able to provide a method and an apparatus for processing an audio signal to control object gain and panning based on user selection. [Description of Drawings]
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiments of the invention and together with the description serve to explain the principle of the invention. In the drawings;
FIG. 1 is an exemplary block diagram to explain to basic concept of rendering a downmix signal based on playback configuration and user control.
FIG. 2 is an exemplary block diagram of an apparatus for processing an audio signal according to one embodiment of the present invention corresponding to the first scheme.
FIG. 3 is an exemplary block diagram of an apparatus for processing an audio signal according to another embodiment of the present invention corresponding to the first scheme. FIG. 4 is an exemplary block diagram of an apparatus for processing an audio signal according to one embodiment of present invention corresponding to the second scheme.
FIG. 5 is an exemplary block diagram of an apparatus for processing an audio signal according to another embodiment of present invention corresponding to the second scheme.
FIG. 6 is an exemplary block diagram of an apparatus for processing an audio signal according to the other embodiment of present invention corresponding to the second scheme. FIG. 7 is an exemplary block diagram of an apparatus for processing an audio signal according to one embodiment of the present invention corresponding to the third scheme.
FIG. 8 is an exemplary block diagram of an apparatus for processing an audio signal according to another embodiment of the present invention corresponding to the third scheme.
FIG. 9 is an exemplary block diagram to explain to basic concept of rendering unit.
FIGS. 1OA to 1OC are exemplary block diagrams of a first embodiment of a downmix processing unit illustrated in FIG. 7. FIG. 11 is an exemplary block diagram of a second embodiment of a downmix processing unit illustrated in FIG. 7.
FIG. 12 is an exemplary block diagram of a third embodiment of a downmix processing unit illustrated in FIG. 7. FIG. 13 is an exemplary block diagram of a fourth embodiment of a downmix processing unit illustrated in FIG. 7.
FIG. 14 is an exemplary block diagram of a bitstream structure of a compressed audio signal according to a second embodiment of present invention.
FIG. 15 is an exemplary block diagram of an apparatus for processing an audio signal according to a second embodiment of present invention.
FIG. 16 is an exemplary block diagram of a bitstream structure of a compressed audio signal according to a third embodiment of present invention.
FIG. 17 is an exemplary block diagram of an apparatus for processing an audio signal according to a fourth embodiment of present invention. FIG. 18 is an exemplary block diagram to explain transmitting scheme for variable type of object.
FIG. 19 is an exemplary block diagram to an apparatus for processing an audio signal according to a fifth embodiment of present invention. [Best Mode] To achieve these objects and other advantages and in accordance with the purpose of the invention/ as embodied and broadly described herein, a method for processing an audio signal, comprising: receiving a downmix signal, a first multichannel information, and an object information; processing the downmix signal using the object information and a mix information; and, transmitting one of the first multi-channel information and a second multi-channel information according to the mix information, wherein the second channel information is generated using the object information and the mix information.
According to the present invention, wherein the downmix signal contains a plural channel and a plural object.
According to the present invention, wherein the first multi-channel information is applied to the downmix signal to generate a plural channel signal.
According to the present invention, wherein the object information corresponds to an information for controlling the plural object. According to the present invention, wherein the mix information includes a mode information indicating whether the first multi-channel information is applied to the processed downmix.
According to the present invention, wherein the processing the downmix signal, comprising: determining a processing scheme according to the mode information; and, processing the dowmmix signal using the object information and using the mix information according to the determined processing scheme.
According to the present invention, wherein the transmitting one of the first multi-channel information and a second multi-channel information is performed according to the mode information included in the mix information.
According to the present invention, further comprising, transmitting the processed downmix signal.
According to the present invention, further comprising, generating a multichannel signal using the processed downmix signal and one of the first multi- channel information and the second multi-channel information.
According to the present invention, wherein the receiving a downmix signal, a first multi-channel information, an object information, and a mix information, comprising: receiving the downmix signal and, a bitstream including the first multi-channel information and the object information; and, extracting the multi- channel information and the object information from the received bitstream.
According to the present invention, wherein the downmix signal is received as a broadcast signal.
According to the present invention, wherein the downmix signal is received on a digital medium. An another aspect of the present invention, a computer-readable medium having instructions stored thereon, which, when executed by a processor, causes the processor to perform operations, comprising: receiving a downmix signal, a first multi-channel information, and an object information; processing the downmix signal using the object information and a mix information; and, transmitting one of the first multi-channel information and a second multi-channel information according to the mix information, wherein the second channel information is generated using the object information and the mix information.
An another aspect of the present invention, an apparatus for processing an audio signal, comprising: a bitstream de-multiplexer receiving a downmix signal, a first multi-channel information, and an object information; and, an object decoder processing the downmix signal using the object information and a mix information, and transmitting one of the first multi-channel information and a second multichannel information according to the mix information, wherein the second channel information is generated using the object information and the mix information.
An another aspect of the present invention, a data structure of audio signal, comprising: a downmix signal having a plural object and a plural channel; an object information for controlling the plural object; and, a multi-channel information for decoding the plural channel, wherein the object information includes an object parameter, and the multi-channel information includes at least one of channel level information and channel correlation information.
It is to be understood that both the foregoing general description and the following detailed description of the present invention are exemplary and explanatory and are intended to provide further explanation of the invention as claimed. [Mode for Invention]
Reference will now be made in detail to the preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings. Wherever possible, the same reference numbers will be used throughout the drawings to refer to the same or like parts.
Prior to describing the present invention, it should be noted that most terms disclosed in the present invention correspond to general terms well known in the art, but some terms have been selected by the applicant as necessary and will hereinafter be disclosed in the following description of the present invention. Therefore, it is preferable that the terms defined by the applicant be understood on the basis of their meanings in the present invention.
In particular, 'parameter' in the following description means information including values, parameters of narrow sense, coefficients, elements, and so on. Hereinafter 'parameter' term will be used instead of 'information' term like an object parameter, a mix parameter, a downmix processing parameter, and so on, which does not put limitation on the present invention.
In downmixing several channel signals or object signals, an object parameter and a spatial parameter can be extracted. A decoder can generate output signal using a downmix signal and the object parameter (or the spatial parameter). The output signal may be rendered based on playback configuration and user control by the decoder. The rendering process shall be explained in details with reference to the FIG. 1 as follow.
FIG. 1 is an exemplary diagram to explain to basic concept of rendering downmix based on playback configuration and user control. Referring to FIG. 1, a decoder 100 may include a rendering information generating unit 110 and a rendering unit 120, and also may include a renderer 110a and a synthesis 120a instead of the rendering information generating unit 110 and the rendering unit 120.
A rendering information generating unit 110 can be configured to receive a side information including an object parameter or a spatial parameter from an encoder, and also to receive a playback configuration or a user control from a device setting or a user interface. The object parameter may correspond to a parameter extracted in downmixing at least one object signal, and the spatial parameter may correspond to a parameter extracted in downmixing at least one channel signal. Furthermore, type information and characteristic information for each object may be included in the side information. Type information and characteristic information may describe instrument name, player name, and so on. The playback configuration may include speaker position and ambient information (speaker's virtual position), and the user control may correspond to a control information inputted by a user in order to control object positions and object gains, and also may correspond to a control information in order to the playback configuration. Meanwhile the payback configuration and user control can be represented as a mix information, which does not put limitation on the present invention. A rendering information generating unit 110 can be configured to generate a rendering information using a mix information (the playback configuration and user control) and the received side information. A rendering unit 120 can configured to generate a multi-channel parameter using the rendering information in case that the downmix of an audio signal (abbreviated 'downmix signal') is not transmitted, and generate multi-channel signals using the rendering information and downmix in case that the downmix of an audio signal is transmitted.
A renderer HOa can be configured to generate multi-channel signals using a mix information (the playback configuration and the user control) and the received side information. A synthesis 120a can be configured to synthesis the multi-channel signals using the multi-channel signals generated by the renderer HOa. As previously stated, the decoder may render the downmix signal based on playback configuration and user control. Meanwhile, in order to control the individual object signals, a decoder can receive an object parameter as a side information and control object panning and object gain based on the transmitted object parameter.
1. Controlling gain and panning of object signals
Variable methods for controlling the individual object signals may be provided. First of all, in case that a decoder receives an object parameter and generates the individual object signals using the object parameter, then, can control the individual object signals base on a mix information (the playback configuration, the object level, etc.)
Secondly, in case that a decoder generates the multi-channel parameter to be inputted to a multi-channel decoder, the multi-channel decoder can upmix a downmix signal received from an encoder using the multi-channel parameter. The above-mention second method may be classified into three types of scheme. In particular, 1) using a conventional multi-channel decoder, 2) modifying a multichannel decoder, 3) processing downmix of audio signals before being inputted to a multi-channel decoder may be provided. The conventional multi-channel decoder may correspond to a channel-oriented spatial audio coding (ex: MPEG Surround decoder), which does not put limitation on the present invention. Details of three types of scheme shall be explained as follow. 1.1 Using a multi-channel decoder
First scheme may use a conventional multi-channel decoder as it is without modifying a multi-channel decoder. At first, a case of using the ADG (arbitrary downmix gain) for controlling object gains and a case of using the 5-2-5 configuration for controlling object panning shall be explained with reference to FIG. 2 as follow. Subsequently, a case of being linked with a scene remixing unit will be explained with reference to FIG. 3. FIG. 2 is an exemplary block diagram of an apparatus for processing an audio signal according to one embodiment of the present invention corresponding to first scheme. Referring to FIG. 2, an apparatus for processing an audio signal 200 (hereinafter simply 'a decoder 200') may include an information generating unit 210 and a multi-channel decoder 230. The information generating unit 210 may receive a side information including an object parameter from an encoder and a mix information from a user interface, and may generate a multi-channel parameter including a arbitrary downmix gain or a gain modification gain(hereinafter simple 'ADG'). The ADG may describe a ratio of a first gain estimated based on the mix information and the obejct information over a second gain extimated based on the object information. In particular, the information generating unit 210 may generate the ADG only if the downmix signal corresponds to a mono signal. The multi-channel decoder 230 may receive a downmix of an audio signal from an encoder and a multi-channel parameter from the information generating unit 210, and may generate a multi-channel output using the downmix signal and the multi-channel parameter.
The multi-channel parameter may include a channel level difference (hereinafter abbreviated 'CLD'), an inter channel correlation (hereinafter abbreviated TCC), a channel prediction coefficient (hereinafter abbreviated 'CPC).
Since CLD, ICC, and CPC describe intensity difference or correlation between two channels, and is to control object panning and correlation. It is able to control object positions and object diffuseness (sonority) using the CLD, the ICC, etc. Meanwhile, the CLD describe the relative level difference instead of the absolute level, and energy of the splitted two channels is conserved. Therefore it is unable to control object gains by handling CLD, etc. In other words, specific object cannot be mute or volume up by using the CLD, etc.
Furthermore, the ADG describes time and frequency dependent gain for controlling correction factor by a user. If this correction factor be applied, it is able to handle modification of down-mix signal prior to a multi-channel upmixing. Therefore, in case that ADG parameter is received from the information generating unit 210, the multi-channel decoder 230 can control object gains of specific time and frequency using the ADG parameter.
Meanwhile, a case that the received stereo downmix signal outputs as a stereo channel can be defined the following formula 1. [formula 1] y[Q] = Wi r go *[0] + W 12 - Si - χM y[Α = w2i <?o 40] + W22 g, • χ[i] where x[] is input channels, y[] is output channels, gx is gains, and wxx is weight. It is necessary to control cross-talk between left channel and right channel in order to object panning. In particular, a part of left channel of downmix signal may output as a right channel of output signal, and a part of right channel of downmix signal may output as left channel of output signal. In the formula 1, W12 and W2i may be a cross-talk component (in other words, cross-term). The above-mentioned case corresponds to 2-2-2 configuration, which means
2-channel input, 2-channel transmission, and 2-channel output. In order to perform the 2-2-2 configuration, 5-2-5 configuration (2-channel input, 5-channel transmission, and 2 channel output) of conventional channel-oriented spatial audio coding (ex: MPEG surround) can be used. At first, in order to output 2 channels for 2-2-2 configuration, certain channel among 5 output channels of 5-2-5 configuration can be set to a disable channel (a fake channel). In order to give cross-talk between 2-transmitted channels and 2-output channels, the above-mentioned CLD and CPC may be adjusted. In brief, gain factor gx in the formula 1 is obtained using the above mentioned ADG, and weighting factor wπ~W22 in the formula 1 is obtained using CLD and CPC.
In implementing the 2-2-2 configuration using 5-2-5 configuration, in order to reduce complexity, default mode of conventional spatial audio coding may be applied. Since characteristic of default CLD is supposed to output 2-channel, it is able to reduce computing amount if the default CLD is applied. Particularly, since there is no need to synthesis a fake channel, it is able to reduce computing amount largely. Therefore, applying the default mode is proper. In particular, only default CLD of 3 CLDs (corresponding to 0, 1, and 2 in MPEG surround standard) is used for decoding. On the other hand, 4 CLDs among left channel, right channel, and center channel (corresponding to 3, 4, 5, and 6 in MPEG surround standard) and 2 ADGs (corresponding to 7 and 8 in MPEG surround standard) is generated for controlling object. In this case, CLDs corresponding 3 and 5 describe channel level difference between left channel plus right channel and center channel ((l+r)/c) is proper to set to 15OdB (approximately infinite) in order to mute center channel. And, in order to implement cross-talk, energy based up-mix or prediction based up-mix may be performed, which is invoked in case that TTT mode ('bsTttModeLow' in the MPEG surround standard) corresponds to energy-based mode (with subtraction, matrix compatibility enabled) (3rd mode), or prediction mode (1st mode or 2nd mode).
FIG. 3 is an exemplary block diagram of an apparatus for processing an audio signal according to another embodiment of the present invention corresponding to first scheme. Referring to FIG. 3, an apparatus for processing an audio signal according to another embodiment of the present invention 300 (hereinafter simply a decoder 300) may include a information generating unit 310, a scene rendering unit 320, a multi-channel decoder 330, and a scene remixing unit 350.
The information generating unit 310 can be configured to receive a side information including an object parameter from an encoder if the downmix signal corresponds to mono channel signal (i.e., the number of downmix channel is 'V), may receive a mix information from a user interface, and may generate a multi- channel parameter using the side information and the mix information. The number of downmix channel can be estimated based on a flag information included in the side information as well as the downmix signal itself and user selection. The information generating unit 310 may have the same configuration of the former information generating unit 210. The multi-channel parameter is inputted to the multi-channel decoder 330, the multi-channel decoder 330 may have the same configuration of the former multi-channel decoder 230.
The scene rendering unit 320 can be configured to receive a side information including an object parameter from and encoder if the downmix signal corresponds to non-mono channel signal (i.e., the number of downmix channel is more than '2'), may receive a mix information from a user interface, and may generate a remixing parameter using the side information and the mix information. The remixing parameter corresponds to a parameter in order to remix a stereo channel and generate more than 2-channel outputs. The remixing parameter is inputted to the scene remixing unit 350. The scene remixing unit 350 can be configured to remix the downmix signal using the remixing parameter if the downmix signal is more than 2-channel signal.
In brief, two paths could be considered as separate implementations for separate applications in a decoder 300. 1.2 Modifying a multi-channel decoder
Second scheme may modify a conventional multi-channel decoder. At first, a case of using virtual output for controlling object gains and a case of modifying a device setting for controlling object panning shall be explained with reference to FIG. 4 as follow. Subsequently, a case of Performing TBT(2x2) functionality in a multi-channel decoder shall be explained with reference to FIG. 5. FIG. 4 is an exemplary block diagram of an apparatus for processing an audio signal according to one embodiment of present invention corresponding to the second scheme. Referring to FIG. 4, an apparatus for processing an audio signal according to one embodiment of present invention corresponding to the second scheme 400 (hereinafter simply 'a decoder 400') may include an information generating unit 410, an internal multi-channel synthesis 420, and an output mapping unit 430. The internal multi-channel synthesis 420 and the output mapping unit 430 may be included in a synthesis unit.
The information generating unit 410 can be configured to receive a side information including an object parameter from an encoder, and a mix parameter from a user interface. And the information generating unit 410 can be configured to generate a multi-channel parameter and a device setting information using the side information and the mix information. The multi-channel parameter may have the same configuration of the former multi-channel parameter. So, details of the multi- channel parameter shall be omitted in the following description. The device setting information may correspond to parameterized HRTF for binaural processing, which shall be explained in the description of '1.2.2 Using a device setting information'.
The internal multi-channel synthesis 420 can be configured to receive a multi-channel parameter and a device setting information from the parameter generation unit 410 and downmix signal from an encoder. The internal multichannel synthesis 420 can be configured to generate a temporal multi-channel output including a virtual output, which shall be explained in the description of '1.2.1 Using a virtual output'. 1.2.1 Using a virtual output
Since multi-channel parameter (ex: CLD) can control object panning, it is hard to control object gain as well as object panning by a conventional multichannel decoder.
Meanwhile, in order to object gain, the decoder 400 (especially the internal multi-channel synthesis 420) may map relative energy of object to a virtual channel (ex: center channel). The relative energy of object corresponds to energy to be reduced. For example, in order to mute certain object, the decoder 400 may map more than 99.9% of object energy to a virtual channel. Then, the decoder 400 (especially, the output mapping unit 430) does not output the virtual channel to which the rest energy of object is mapped. In conclusion, if more than 99.9% of object is mapped to a virtual channel which is not outputted, the desired object can be almost mute.
1.2.2 Using a device setting information The decoder 400 can adjust a device setting information in order to control object panning and object gain. For example, the decoder can be configured to generate a parameterized HRTF for binaural processing in MPEG Surround standard. The parameterized HRTF can be variable according to device setting. It is able to assume that object signals can be controlled according to the following formula 2.
[formula 2] Lnew = ai * ObJ1 + a2 * obJ2 + a3 * obJ3 + .. + an * objn,
Rnew = bl * θbji + b2 * θbJ2 + b3 * θbJ3 + .. + bn *θbjn, where objk is object signals, Lnew and Rnew is a desired stereo signal, and ak and bk are coefficients for object control.
An object information of the object signals objk may be estimated from an object parameter included in the transmitted side information. The coefficients ak, bk which are defined according to object gain and object panning may be estimated from the mix information. The desired object gain and object panning can be adjusted using the coefficients ak, bk.
The coefficients ak, bk can be set to correspond to HRTF parameter for binaural processing, which shall be explained in details as follow. In MPEG Surround standard (5-l-5i configuration) (from ISO/ IEC FDIS 23003-1 :2006(E), Information Technology - MPEG Audio Technologies - Parti: MPEG Surround), binaural processing is as below.
[formula 3]
Figure imgf000024_0001
where yβ is output, the matrix H is conversion matrix for binaural processing.
[formula 4]
Figure imgf000024_0004
The elements of matrix H is defined as follows: [formula 5]
Figure imgf000024_0002
[formula 6]
Figure imgf000024_0003
[formula 7]
Figure imgf000025_0001
1.2.3 Performing TBT(2x2) functionality in a multi-channel decoder FIG. 5 is an exemplary block diagram of an apparatus for processing an audio signal according to another embodiment of present invention corresponding to the second scheme. FIG. 5 is an exemplary block diagram of TBT functionality in a multi-channel decoder. Referring to FIG. 5, a TBT module 510 can be configured to receive input signals and a TBT control information, and generate output signals. The TBT module 510 may be included in the decoder 200 of the FIG. 2 (or in particular, the multi-channel decoder 230). The multi-channel decoder 230 may be implemented according to the MPEG Surround standard, which does not put limitation on the present invention.
[formula 9]
Figure imgf000025_0002
where x is input channels, y is output channels, and w is weight. The output yi may correspond to a combination input xi of the downmix multiplied by a first gain W1I and input X2 multiplied by a second gain W12.
The TBT control information inputted in the TBT module 510 includes elements which can compose the weight w (W1I, W12, W2i, W22). In MEPG Surround standard, OTT(One-To-Two) module and TTT(T wo-To-
Three) module is not proper to remix input signal although OTT module and TTT module can upmix the input signal.
In order to remix the input signal, TBT (2x2) module 510 (hereinafter abbreviated 'TBT module 510') may be provided. The TBT module 510 may can be figured to receive a stereo signal and output the remixed stereo signal. The weight w may be composed using CLD(s) and ICC(s).
If the weight term wn ~ W22 is transmitted as a TBT control information, the decoder may control object gain as well as object panning using the received weight term. In transmitting the weight term w, variable scheme may be provided. At first, a TBT control information includes cross term like the W12 and W21. Secondly, a TBT control information does not include the cross term like the W12 and W21. Thirdly, the number of the term as a TBT control information varies adaptively.
At first, there is need to receive the cross term like the W12 and W2i in order to control object panning as left signal of input channel go to right of the output channel. In case of N input channels and M output channels, the terms which number is NxM may be transmitted as TBT control information. The terms can be quantized based on a CLD parameter quantization table introduced in a MPEG Surround, which does not put limitation on the present invention. Secondly, unless left object is shifted to right position, (i.e. when left object is moved to more left position or left position adjacent to center position, or when only level of the object is adjusted), there is no need to use the cross term. In the case, it is proper that the term except for the cross term is transmitted. In case of N input channels and M output channels, the terms which number is just N may be transmitted.
Thirdly, the number of the TBT control information varies adaptively according to need of cross term in order to reduce the bit rate of a TBT control information. A flag information 'cross_flag' indicating whether the cross term is present or not is set to be transmitted as a TBT control information. Meaning of the flag information 'crossjQag' is shown in the following table 1. [table 1] meaning of cross_flag
Figure imgf000027_0001
In case that 'cross_flag' is equal to 0, the TBT control information does not include the cross term, only the non-cross term like the W1I and W22 is present. Otherwise (/cross_flag/ is equal to 1), the TBT control information includes the cross term.
Besides, a flag information /reverse_flag' indicating whether cross term is present or non-cross term is present is set to be transmitted as a TBT control information. Meaning of flag information /reverse_flag' is shown in the following table 2.
[table 2] meaning of reverse_flag
Figure imgf000028_0001
In case that 'reverse_flag' is equal to 0, the TBT control information does not include the cross term, only the non-cross term like the Wi1 and W22 is present. Otherwise (/reverse_flag' is equal to 1), the TBT control information includes only the cross term.
Futhermore, a flag information 'side_flag' indicating whether cross term is present and non-cross is present is set to be transmitted as a TBT control information. Meaning of flag information /side_flag/ is shown in the following table
3.
[table 3] meaning of side_config
Figure imgf000029_0001
Since the table 3 corresponds to combination of the table 1 and the table 2, details of the table 3 shall be omitted.
1.2.4 Performing TBT(2x2) functionality in a multi-channel decoder by modifying a binaural decoder
The case of '1.2.2 Using a device setting information' can be performed without modifying the binaural decoder. Hereinafter, performing TBT fucnctionality by modifying a binaural decoder employed in a MPEG Surround decoder, with reference to FIG. 6.
FIG. 6 is an exemplary block diagram of an apparatus for processing an audio signal according to the other embodiment of present invention corresponding to the second scheme. In particular, an apparatus for processing an audio signal 630 shown in the FIG. 6 may correspond to a binaural decoder included in the multi-channel decoder 230 of FIG. 2 or the synthesis unit of FIG. 4, which does not put limitation on the present invention.
An apparatus for processing an audio signal 630 (hereinafter 'a binaural decoder 630') may include a QMF analysis 632, a parameter conversion 634, a spatial synthesis 636, and a QMF synthesis 638. Elements of the binaural decoder 630 may have the same configuration of MPEG Surround binaural decoder in MPEG Surround standard. For example, the spatial synthesis 636 can be configured to consist of 1 2x2 (filter) matrix, according to the following formula 10: [formula 10]
Figure imgf000030_0001
with yo being the QMF-domain input channels and VB being the binaural output channels, k represents the hybrid QMF channel index, and i is the HRTF filter tap index, and n is the QMF slot index. The binaural decoder 630 can be configured to perform the above-mentioned functionality described in subclause '1.2.2 Using a device setting information'. However, the elements hij may be generated using a multi-channel parameter and a mix information instead of a multi-channel parameter and HRTF parameter. In this case, the binaural decoder 600 can perform the functionality of the TBT module 510 in the FIG. 5. Details of the elements of the binaural decoder 630 shall be omitted. The binaural decoder 630 can be operated according to a flag information 'binauraLflag'. In particular, the binaural decoder 630 can be skipped in case that a flag information binaural_flag is O', otherwise (the binaural_flag is 'Y), the binaural decoder 630 can be operated as below.
[table 4] meaning of binaural_flag
Figure imgf000031_0001
1.3 Processing downmix of audio signals before being inputted to a multichannel decoder
The first scheme of using a conventional multi-channel decoder have been explained in subclause in '1.Y , the second scheme of modifying a multi-channel decoder have been explained in subclause in '1.2'. The third scheme of processing downmix of audio signals before being inputted to a multi-channel decoder shall be explained as follow.
FIG. 7 is an exemplary block diagram of an apparatus for processing an audio signal according to one embodiment of the present invention corresponding to the third scheme. FIG. 8 is an exemplary block diagram of an apparatus for processing an audio signal according to another embodiment of the present invention corresponding to the third scheme. At first, Referring to FIG. 7, an apparatus for processing an audio signal 700 (hereinafter simply 'a decoder 700') may include an information generating unit 710, a downmix processing unit 720, and a multi-channel decoder 730. Referring to FIG. 8, an apparatus for processing an audio signal 800 (hereinafter simply 'a decoder 800') may include an information generating unit 810 and a multi-channel synthesis unit 840 having a multi-channel decoder 830. The decoder 800 may be another aspect of the decoder 700. In other words, the information generating unit 810 has the same configuration of the information generating unit 710, the multi-channel decoder 830 has the same configuration of the multi-channel decoder 730, and, the multi-channel synthesis unit 840 may has the same configuration of the downmix processing unit 720 and multi-channel unit 730. Therefore, elements of the decoder 700 shall be explained in details, but details of elements of the decoder 800 shall be omitted.
The information generating unit 710 can be configured to receive a side information including an object parameter from an encoder and a mix information from an user-interface, and to generate a multi-channel parameter to be outputted to the multi-channel decoder 730. From this point of view, the information generating unit 710 has the same configuration of the former information generating unit 210 of FIG. 2. The downmix processing parameter may correspond to a parameter for controlling object gain and object panning. For example, it is able to change either the object position or the object gain in case that the object signal is located at both left channel and right channel. It is also able to render the object signal to be located at opposite position in case that the object signal is located at only one of left channel and right channel. In order that these cases are performed, the downmix processing unit 720 can be a TBT module (2x2 matrix operation). In case that the information generating unit 710 can be configured to generate ADG described with reference to FIG 2. in order to control object gain, the downmix processing parameter may include parameter for controlling object panning but object gain.
Furthermore, the information generating unit 710 can be configured to receive HRTF information from HRTF database, and to generate an extra multichannel parameter including a HRTF parameter to be inputted to the multi-channel decoder 730. In this case, the information generating unit 710 may generate multichannel parameter and extra multi-channel parameter in the same subband domain and transmit in syncronization with each other to the multi-channel decoder 730. The extra multi-channel parameter including the HRTF parameter shall be explained in details in subclause '3. Processing Binaural Mode'.
The downmix processing unit 720 can be configured to receive downmix of an audio signal from an encoder and the downmix processing parameter from the information generating unit 710, and to decompose a subband domain signal using subband analysis filter bank. The downmix processing unit 720 can be configured to generate the processed downmix signal using the downmix signal and the downmix processing parameter. In these processing, it is able to pre-process the downmix signal in order to control object panning and object gain. The processed downmix signal may be inputted to the multi-channel decoder 730 to be upmixed. Furthermore, the processed downmix signal may be outputted and play backed via speaker as well. In order to directly output the processed signal via speakers, the downmix processing unit 720 may perform synthesis filterbank using the prepossed subband domain signal and output a time-domain PCM signal. It is able to select whether to directly output as PCM signal or input to the multi- channel decoder by user selection.
The multi-channel decoder 730 can be configured to generate multi-channel output signal using the processed downmix and the multi-channel parameter. The multi-channel decoder 730 may introduce a delay when the processed downmix signal and the multi-channel parameter are inputted in the multi-channel decoder 730. The processed downmix signal can be synthesized in frequency domain (ex: QMF domain, hybrid QMF domain, etc), and the multi-channel parameter can be synthesized in time domain. In MPEG surround standard, delay and synchronization for connecting HE-AAC is introduced. Therefore, the multichannel decoder 730 may introduce the delay according to MPEG Surround standard. The configuration of downmix processing unit 720 shall be explained in detail with reference to FIG. 9 ~ FIG. 13.
1.3.1 A general case and special cases of downmix processing unit
FIG. 9 is an exemplary block diagram to explain to basic concept of rendering unit. Referring to FIG. 9, a rendering module 900 can be configured to generate M output signals using N input signals, a playback configuration, and a user control. The N input signals may correspond to either object signals or channel signals. Furthermore, the N input signals may correspond to either object parameter or multi-channel parameter. Configuration of the rendering module 900 can be implemented in one of downmix processing unit 720 of FIG. 7, the former rendering unit 120 of FIG. 1, and the former renderer 110a of FIG. 1, which does not put limitation on the present invention.
If the rendering module 900 can be configured to directly generate M channel signals using N object signals without summing individual object signals corresponding certain channel, the configuration of the rendering module 900 can be represented the following formula 11.
[formula 11]
C = RO
Figure imgf000036_0001
Ci is a iΛ channel signal, Oj is j* input signal, and Rji is a matrix mapping j* input signal to iUl channel.
If R matrix is separated into energy component E and de-correlation component, the formula 11 may be represented as follow, [formula 12]
C - RO = EO +DO
Figure imgf000036_0002
It is able to control object positions using the energy component E, and it is able to control object diffuseness using the de-correlation component D.
Assuming that only r* input signal is inputted to be outputted via j* channel and kft channel, the formula 12 may be represented as follow, [formula 13]
Figure imgf000036_0003
cij_i is gain portion mapped to j* channel, βkj is gain portion mapped to kύ~> channel, θ is diffuseness level, and D(o/) is de-correlated output.
Assuming that de-correlation is omitted, the formula 13 may be simplified as follow.
[formula 14]
C,t , = R,O,
Figure imgf000037_0001
If weight values for all inputs mapped to certain channel are estimated according to the above-stated method, it is able to obtain weight values for each channel by the following method.
1) Summing weight values for all inputs mapped to certain channel. For example, in case that input 1 Oi and input 2 O2 is inputted and output channel corresponds to left channel L, center channel C, and right channel R, a total weight values αL(tot), crc(tot) , ctR(tot) may be obtained as follows: [formula 15] aL(tot) = aL\ aC(tot) = aC\ + aC2 aR(tot) = aR2 where αu is a weight value for input 1 mapped to left channel L, αci is a weight value for input 1 mapped to center channel C, do is a weight value for input2 mapped to center channel C, and CIR2 is a weight value for input 2 mapped to right channel R. In this case, only input 1 is mapped to left channel, only input 2 is mapped to right channel, input 1 and input 2 is mapped to center channel together.
2) Summing weight values for all inputs mapped to certain channel, then dividing the sum into the most dominant channel pair, and mapping de- correlated signal to the other channel for surround effect. In this case, the dominant channel pair may correspond to left channel and center channel in case that certain input is positioned at point between left and center.
3) Estimating weight value of the most dominant channel, giving attenuated correlated signal to the other channel, which value is a relative value of the estimated weight value. 4) Using weight values for each channel pair, combining the de-correlated signal properly, then setting to a side information for each channel.
1.3.2 A case that downmix processing unit includes a mixing part corresponding to 2x4 matrix FIGS. 1OA to 1OC are exemplary block diagrams of a first embodiment of a downmix processing unit illustrated in FIG. 7. As previously stated, a first embodiment of a downmix processing unit 720a (hereinafter simply 'a downmix processing unit 720a') may be implementation of rendering module 900.
First of all, assuming that Dn = Al = aD and Dn = D22 = bD f the formula 12 is simplified as follow, [formula 15]
Figure imgf000039_0001
The downmix processing unit according to the formula 15 is illustrated FIG. 1OA. Referring to FIG. 1OA, a downmix processing unit 720a can be configured to bypass input signal in case of mono input signal (m), and to process input signal in case of stereo input signal (L, R). The downmix processing unit 720a may include a de-correlating part 722a and a mixing part 724a. The de-correlating part 722a has a de-correlator aD and de-correlator bD which can be configured to de-correlate input signal. The de-correlating part 722a may correspond to a 2x2 matrix. The mixing part 724a can be configured to map input signal and the de-correlated signal to each channel. The mixing part 724a may correspond to a 2x4 matrix.
Secondly, assuming that Dn = ClD1 f D21 = υDλ / D12 = cD2 t and D 22 = dD2 , the formula 12 is simplified as follow. formula 15-2]
Figure imgf000040_0001
The downmix processing unit according to the formula 15 is illustrated FIG. 1OB. Referring to FIG. 1OB, a de-correlating part 722' including two de-correlators Di, D2 can be configured to generate de-correlated signals Di(a*Oi+b*O2)/ D2(c*Oi+d*O2).
Thirdly, assuming that
Figure imgf000040_0002
> the formula 12 is simplified as follow.
[formula 15-3]
Figure imgf000040_0003
The downmix processing unit according to the formula 15 is illustrated FIG. IOC. Referring to FIG. 1OC, a de-correlating part 722" including two de-correlators Di, D2 can be configured to generate de-correlated signals Di(Oi), D2(O2).
1.3.2 A case that downmix processing unit includes a mixing part corresponding to 2x3 matrix
The foregoing formula 15 can be represented as follow: [formula 16]
Figure imgf000040_0004
Figure imgf000041_0001
The matrix R is a 2x3 matrix, the matrix O is a 3x1 matrix, and the C is a 2x1 matrix.
FIG. 11 is an exemplary block diagram of a second embodiment of a downmix processing unit illustrated in FIG. 7. As previously stated, a second embodiment of a downmix processing unit 720b (hereinafter simply 'a downmix processing unit 720b') may be implementation of rendering module 900 like the downmix processing unit 720a. Referring to FIG. 11, a downmix processing unit 720b can be configured to skip input signal in case of mono input signal (m), and to process input signal in case of stereo input signal (L, R). The downmix processing unit 720b may include a de-correlating part 722b and a mixing part 724b. The de- correlating part 722b has a de-correlator D which can be configured to de-correlate input signal O1, O2 and output the de-correlated signal O(Oι+Oi). The de- correlating part 722b may correspond to a 1x2 matrix. The mixing part 724b can be configured to map input signal and the de-correlated signal to each channel. The mixing part 724b may correspond to a 2x3 matrix which can be shown as a matrix R in the formula 16.
Furthermore, the de-correlating part 722b can be configured to de-correlate a difference signal O1-O2 as common signal of two input signal O1, O2. The mixing part 724b can be configured to map input signal and the de-correlated common signal to each channel.
1.3.3 A case that downmix processing unit includes a mixing part with several matrixes Certain object signal can be audible as a similar impression anywhere without being positioned at a specified position, which may be called as a 'spatial sound signal'. For example, applause or noises of a concert hall can be an example of the spatial sound signal. The spatial sound signal needs to be playback via all speakers. If the spatial sound signal playbacks as the same signal via all speakers, it is hard to feel spatialness of the signal because of high inter-correlation (IC) of the signal. Hence, there's need to add correlated signal to the signal of each channel signal.
FIG. 12 is an exemplary block diagram of a third embodiment of a downmix processing unit illustrated in FIG. 7. Referring to FIG.12, a third embodiment of a downmix processing unit 720c (hereinafter simply 'a downmix processing unit 720c') can be configured to generate spatial sound signal using input signal Oi, which may include a de-correlating part 722c with N de-correlators and a mixing part 724c. The de-correlating part 722c may have N de-correlators Di, D2, • ••, DN which can be configured to de-correlate the input signal O,. The mixing part 724c may have N matrix Rj, Rk, "", Ri which can be configured to generate output signals C]V Ck, • •-, Ci using the input signal O/ and the de-correlated signal Dx(O/). The Rj
matrix can be represented as the following formula,
[formula 17]
Cj_, = R,o,
Figure imgf000043_0001
Oi is i* input signal, R/ is a matrix mapping ifll input signal O/ to ^ channel,
and Cj_i is j1*1 output signal. The θj_i value is de-correlation rate.
The θjj value can be estimated base on ICC included in multi-channel
parameter. Furthermore, the mixing part 724c can generate output signals base on
spatialness information composing de-correlation rate θjj received from user-
interface via the information generating unit 710, which does not put limitation on
present invention.
The number of de-correlators (N) can be equal to the number of output
channels. On the other hand, the de-correlated signal can be added to output
channels selected by user. For example, it is able to position certain spatial sound
signal at left, right, and center and to output as a spatial sound signal via left
channel speaker.
1.3.4 A case that downmix processing unit includes a further downmixing
part FIG. 13 is an exemplary block diagram of a fourth embodiment of a downmix processing unit illustrated in FIG. 7. A fourth embodiment of a downmix processing unit 72Od (hereinafter simply 'a downmix processing unit 72Od') can be configured to bypass if the input signal corresponds to a mono signal (m). The downmix processing unit 72Od includes a further downmixing part 722d which can be configured to downmix the stereo signal to be mono signal if the input signal corresponds to a stereo signal. The further downmixed mono channel (m) is used as input to the multi-channel decoder 730. The multi-channel decoder 730 can control object panning (especially cross-talk) by using the mono input signal. In this case, the information generating unit 710 may generate a multi-channel parameter base on 5-l-5i configuration of MPEG Surround standard.
Furthermore, if gain for the mono downmix signal like the above-mentioned artistic downmix gain ADG of FIG. 2 is applied, it is able to control object panning and object gain more easily. The ADG may be generated by the information generating unit 710 based on mix information.
2. Upmixing channel signals and controlling object signals
FIG. 14 is an exemplary block diagram of a bitstream structure of a compressed audio signal according to a second embodiment of present invention. FIG. 15 is an exemplary block diagram of an apparatus for processing an audio signal according to a second embodiment of present invention. Referring to (a) of FIG. 14, downmix signal α, multi-channel parameter β, and object parameter γ are included in the bitstream structure. The multi-channel parameter β is a parameter for upmixing the downmix signal. On the other hand, the object parameter γ is a parameter for controlling object panning and object gain. Referring to (b) of FIG. 14, downmix signal α, a default parameter β', and object parameter γ are included in the bitstream structure. The default parameter β' may include preset information for controlling object gain and object panning. The preset information may correspond to an example suggested by a producer of an encoder side. For example, preset information may describes that guitar signal is located at a point between left and center, and guitar's level is set to a certain volume, and the number of output channel in this time is set to a certain channel. The default parameter for either each frame or specified frame may be present in the bitstream. Flag information indicating whether default parameter for this frame is different from default parameter of previous frame or not may be present in the bitstream. By including default parameter in the bitstream, it is able to take less bitrates than side information with object parameter is included in the bitstream. Furthermore, header information of the bitstream is omitted in the FIG. 14. Sequence of the bitstream can be rearranged. Referring to FIG. 15, an apparatus for processing an audio signal according to a second embodiment of present invention 1000 (hereinafter simply 'a decoder 1000') may include a bitstream de-multiplexer 1005, an information generating unit 1010, a downmix processing unit 1020, and a multil-channel decoder 1030. The de- multiplexer 1005 can be configured to divide the multiplexed audio signal into a downmix α, a first multi-channel parameter β, and an object parameter γ. The information generating unit 1010 can be configured to generate a second multichannel parameter using an object parameter γ and a mix parameter. The mix parameter comprises a mode information indicating whether the first multi- channel information β is applied to the processed downmix. The mode information may corresponds to an information for selecting by a user. According to the mode information, the information generating information 1020 decides whether to transmit the first multi-channel parameter β or the second multi-channel parameter.
The downmix processing unit 1020 can be configured to determining a processing scheme according to the mode information included in the mix information. Furthermore, the downmix processing unit 1020 can be configured to process the downmix α according to the determined processing scheme. Then the downmix processing unit 1020 transmits the processed downmix to multi-channel decoder 1030. The multi-channel decoder 1030 can be configured to receive either the first multi-channel parameter β or the second multi-channel parameter. In case that default parameter β' is included in the bitstream, the multi-channel decoder 1030 can use the default parameter β' instead of multi-channel parameter β. Then, the multi-channel decoder 1030 can be configured to generate multichannel output using the processed downmix signal and the received multichannel parameter. The multi-channel decoder 1030 may have the same configuration of the former multi-channel decoder 730, which does not put limitation on the present invention.
3. Binaural Processing
A multi-channel decoder can be operated in a binaural mode. This enables a multi-channel impression over headphones by means of Head Related Transfer
Function (HRTF) filtering. For binaural decoding side, the downmix signal and multi-channel parameters are used in combination with HRTF filters supplied to the decoder.
FIG. 16 is an exemplary block diagram of an apparatus for processing an audio signal according to a third embodiment of present invention. Referring to
FIG. 16, an apparatus for processing an audio signal according to a third embodiment (hereinafter simply 'a decoder 1100') may comprise an information generating unit 1110, a downmix processing unit 1120, and a multi-channel decoder 1130 with a sync matching part 1130a.
The information generating unit 1110 may have the same configuration of the information generating unit 710 of FIG. 7, with generating dynamic HRTF. The downmix processing unit 1120 may have the same configuration of the downmix processing unit 720 of FIG. 7. Like the preceding elements, multi-channel decoder
1130 except for the sync matching part 1130a is the same case of the former elements. Hence, details of the information generating unit 1110, the downmix processing unit 1120, and the multi-channel decoder 1130 shall be omitted. The dynamic HRTF describes the relation between object signals and virtual speaker signals corresponding to the HRTF azimuth and elevation angles, which is time-dependent information according to real-time user control.
The dynamic HRTF may correspond to one of HTRF filter coefficients itself, parameterized coefficient information, and index information in case that the multi-channel decoder comprise all HRTF filter set.
There's need to match a dynamic HRTF information with frame of downmix signal regardless of kind of the dynamic HRTF. In order to match HRTF information with downmix signal, it able to provide three type of scheme as follows: 1) Inserting a tag information into each HRTF information and bitstream downmix signal, then matching the HRTF with bitstream downmix signal based on the inserted tag information. In this scheme, it is proper that tag information may be included in ancillary field in MPEG Surround standard. The tag information may be represented as a time information, a counter information, a index information, etc.
2) Inserting HRTF information into frame of bitstream. In this scheme, it is possible to set to mode information indicating whether current frame corresponds to a default mode or not. If the default mode which describes HRTF information of current frame is equal to the HRTF information of previous frame is applied, it is able to reduce bitrates of HRTF information.
2-1) Furthermore, it is possible to define transmission information indicating whether HRTF information of current frame has already transmitted. If the transmission information which describes HRTF information of current frame is equal to the transmitted HRTF information of frame is applied, it is also possible to reduce bitrates of HRTF information.
3) Transmitting several HRTF informations in advance, then transmitting identifying information indicating which HRTF among the transmitted HRTF informations per each frame. Furthermore, in case that HTRF coefficient varies suddenly, distortion may be generated. In order to reduce this distortion, it is proper to perform smoothing of coefficient or the rendered signal.
4. Rendering
FIG. 17 is an exemplary block diagram of an apparatus for processing an audio signal according to a fourth embodiment of present invention. The apparatus for processing an audio signal according to a fourth embodiment of present invention 1200 (hereinafter simply 'a processor 1200') may comprise an encoder 1210 at encoder side 1200A, and a rendering unit 1220 and a synthesis unit 1230 at decoder side 1200B. The encoder 1210 can be configured to receive multi-channel object signal and generate a downmix of audio signal and a side information. The rendering unit 1220 can be configured to receive side information from the encoder 1210, playback configuration and user control from a device setting or a user- interface, and generate rendering information using the side information, playback configuration, and user control. The synthesis unit 1230 can be configured to synthesis multi-channel output signal using the rendering information and the received downmix signal from an encoder 1210. 4.1 Applying effect-mode The effect-mode is a mode for remixed or reconstructed signal. For example, live mode, club band mode, karaoke mode, etc may be present. The effect-mode information may correspond to a mix parameter set generated by a producer, other user, etc. If the effect-mode information is applied, an end user don't have to control object panning and object gain in full because user can select one of predetermined effect-mode informations.
Two methods of generating an effect-mode information can be distinguished.
First of all, it is possible that an effect-mode information is generated by encoder
1200A and transmitted to the decoder 1200B. Secondly, the effect-mode information may be generated automatically at the decoder side. Details of two methods shall be described as follow.
4.1.1 Transmitting effect-mode information to decoder side
The effect-mode information may be generated at an encoder 1200A by a producer. According to this method, the decoder 1200B can be configured to receive side information including the effect-mode information and output user- interface by which a user can select one of effect-mode informations. The decoder 1200B can be configured to generate output channel base on the selected effect- mode information.
Furthermore, it is inappropriate to hear downmix signal as it is for a listener in case that encoder 1200A downmix the signal in order to raise quality of object signals. However, if effect-mode information is applied in the decoder 1200B, it is possible to playback the downmix signal as the maximum quality.
4.1.2 Generating effect-mode information in decoder side
The effect-mode information may be generated at a decoder 1200B. The decoder 1200B can be configured to search appropriate effect-mode informations for the downmix signal. Then the decoder 1200B can be configured to select one of the searched effect-mode by itself (automatic adjustment mode) or enable a user to select one of them (user selection mode). Then the decoder 1200B can be configured to obtain object information (number of objects, instrument names, etc) included in side information, and control object based on the selected effect-mode information and the object information.
Furthermore, it is able to control similar objects in a lump. For example, instruments associated with a rhythm may be similar objects in case of 'rhythm impression mode'. Controlling in a lump means controlling each object simultaneously rather than controlling objects using the same parameter.
Furthermore, it is able to control object based on the decoder setting and device environment (including whether headphones or speakers). For example, object corresponding to main melody may be emphasized in case that volume setting of device is low, object corresponding to main melody may be repressed in case that volume setting of device is high. 4.2 Object type of input signal at encoder side
The input signal inputted to an encoder 1200A may be classified into three types as follow.
1) Mono object (mono channel object) Mono object is most general type of object. It is possible to synthesis internal downmix signal by simply summing objects. It is also possible to synthesis internal downmix signal using object gain and object panning which may be one of user control and provided information. In generating internal downmix signal, it is also possible to generate rendering information using at least one of object characteristic, user input, and information provided with object.
In case that external downmix signal is present, it is possible to extract and transmit information indicating relation between external downmix and object.
2) Stereo object (stereo channel object)
It is possible to synthesis internal downmix signal by simply summing objects like the case of the former mono object. It is also possible to synthesis internal downmix signal using object gain and object panning which may be one of user control and provided information. In case that downmix signal corresponds to a mono signal, it is possible that encoder 1200A use object converted into mono signal for generating downmix signal. In this case, it is able to extract and transfer information associated with object (ex: panning information in each time-frequency domain) in converting into mono signal. Like the preceding mono object, in generating internal downmix signal, it is also possible to generate rendering information using at least one of object characteristic, user input, and information provided with object. Like the preceding mono object, in case that external downmix signal is present, it is possible to extract and transmit information indicating relation between external downmix and object.
3) Multi-channel object
In case of multi-channel object, it is able to perform the above mentioned method described with mono object and stereo object. Furthermore, it is able to input multi-channel object as a form of MPEG Surround. In this case, it is able to generate object-based downmix (ex: SAOC downmix) using object downmix channel, and use multi-channel information (ex: spatial information in MPEG Surround) for generating multi-channel information and rendering information. Hence, it is possible to reduce computing amount because multi-channel object present in form of MPEG Surround don't have to decode and encode using object- - oriented encoder (ex: SAOC encoder). If object downmix corresponds to stereo and object-based downmix (ex: SAOC downmix) corresponds to mono in this case, it is possible to apply the above-mentioned method described with stereo object.
4) Transmitting scheme for variable type of object As stated previously, variable type of object (mono object, stereo object, and multi-channel object) may be transmitted from the encoder 1200A to the decoder. 1200B. Transmitting scheme for variable type of object can be provided as follow:
Referring to FIG. 18, when the downmix includes a plural object, a side information includes information for each object. For example, when a plural object consists of Nth mono object (A), left channel of N+lth object (B), and right channel of N+lth object (C), a side information includes information for 3 objects (A, B, C).
The side. information may comprise correlation flag information indicating whether an object is part of a stereo or multi-channel object, for example, mono object, one channel (L or R) of stereo object, and so on. For example, correlation flag information is '0' if mono object is present, correlation flag information is 'V if one channel of stereo object is present. When one part of stereo object and the other part of stereo object is transmitted in succession, correlation flag information for other part of stereo object may be any value (ex: '0', '1', or whatever). Furthermore, correlation flag information for other part of stereo object may be not transmitted.
Furthermore, in case of multi-channel object, correlation flag information for one part of multi-channel object may be value describing number of multi-channel object. For example, in case of 5.1 channel object, correlation flag information for left channel of 5.1 channel may be '5', correlation flag information for the other channel (R, Lr, Rr, C, LFE) of 5.1 channel may be either O' or not transmitted. 4.3 Object attribute
Object may have the three kinds of attribute as follows: a) Single object
Single object can be configured as a source. It is able to apply one parameter to single object for controlling object panning and object gain in generating downmix signal and reproducing. The 'one parameter' may mean not only one parameter for all time/ frequency domain but also one parameter for each time/ frequency slot. b) Grouped object Single object can be configured as more than two sources. It is able to apply one parameter to grouped object for controlling object panning and object gain although grouped object is inputted as at least two sources. Details of the grouped object shall be explained with reference to FIG. 19 as follows: Referring to FIG. 19, an encoder 1300 includes a grouping unit 1310 and a downmix unit 1320. The grouping unit 1310 can be configured to group at least two objects among inputted multi-object input, base on a grouping information. The grouping information may be generated by producer at encoder side. The downmix unit 1320 can be configured to generate downmix signal using the grouped object generated by the grouping unit 1310. The downmix unit 1320 can be configured to generate a side information for the grouped object. c) Combination object
Combination object is an object combined with at least one source. It is possible to control object panning and gain in a lump, but keep relation between combined objects unchanged. For example, in case of drum, it is possible to control drum, but keep relation between base drum, tam-tam, and symbol unchanged. For example, when base drum is located at center point and symbol is located at left point, it is possible to positioning base drum at right point and positioning symbol at point between center and right in case that drum is moved to right direction.
Relation information between combined objects may be transmitted to a decoder. On the other hand, decoder can extract the relation information using combination object.
4.4 Controlling objects hierarchically
It is able to control objects hierarchically. For example, after controlling a drum, it is able to control each sub-elements of drum. In order to control objects hierarchically, three schemes is provided as follows: a) UI (user interface)
Only representative element may be displayed without displaying all objects. If the representative element is selected by a user, all objects display. b) Object grouping After grouping objects in order to represent representative element, it is possible to control representative element to control all objects grouped as representative element. Information extracted in grouping process may be transmitted to a decoder. Also, the grouping information may be generated in a decoder. Applying control information in a lump can be performed based on predetermined control information for each element, c) Object configuration
It is possible to use the above-mentioned combination object. Information concerning element of combination object can be generated in either an encoder or a decoder. Information concerning elements from an encoder can be transmitted as a different form from information concerning combination object.
It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the spirit or scope of the inventions. Thus, it is intended that the present invention covers the modifications and variations of this invention provided they come within the scope of the appended claims and their equivalents. [Industrial Applicability]
Accordingly, the present invention is applicable to encode and decode an audio signal.

Claims

[CLAIMS]
1. A method for processing an audio signal, comprising: receiving a downmix signal, a first multi-channel information, and an object information; processing the downmix signal using the object information and a mix information; and, transmitting one of the first multi-channel information and a second multi-channel information according to the mix information, wherein the second channel information is generated using the object information and the mix information.
2. The method of claim 1, wherein the downmix signal contains a plural channel and a plural object.
3. The method of claim 2, wherein the first multi-channel information is applied to the downmix signal to generate a plural channel signal.
4. The method of claim 2, wherein the object information corresponds to an information for controlling the plural object.
5. The method of claim 1, wherein the mix information includes a mode information indicating whether the first multi-channel information is applied to the processed downmix.
6. The method of claim 5, wherein the processing the downmix signal, comprising: determining a processing scheme according to the mode information; and, processing the dowmmix signal using the object information and using the mix information according to the determined processing scheme.
7. The method of claim 5, wherein the transmitting one of the first multi-channel information and a second multi-channel information is performed according to the mode information included in the mix information.
8. The method of claim 1, further comprising: transmitting the processed downmix signal.
9. The method of claim 8, further comprising: generating a multi-channel signal using the processed downmix signal and one of the first multi-channel information and the second multi-channel information.
10. The method of claim 1, wherein the receiving a downmix signal, a first multichannel information, an object information, and a mix information, comprising: receiving the downmix signal and, a bitstream including the first multi-channel information and the object information; and, extracting the multi-channel information and the object information from the received bitstream.
11. The method of claim 1, wherein the downmix signal is received as a broadcast signal.
12. The method of claim 1, wherein the downmix signal is received on a digital medium.
13. A computer-readable medium having instructions stored thereon, which, when executed by a processor, causes the processor to perform operations, comprising: receiving a downmix signal, a first multi-channel information, and an object information; processing the downmix signal using the object information and a mix information; and, transmitting one of the first multi-channel information and a second multi-channel information according to the mix information, wherein the second channel information is generated using the object information and the mix information.
14. An apparatus for processing an audio signal, comprising: a bitstream de-multiplexer receiving a downmix signal, a first multichannel information, and an object information; and, an object decoder processing the downmix signal using the object information and a mix information, and transmitting one of the first multichannel information and a second multi-channel information according to the mix information, wherein the second channel information is generated using the object information and the mix information.
15. A data structure of audio signal, comprising: a downmix signal having a plural object and a plural channel; an object information for controlling the plural object; and, a multi-channel information for decoding the plural channel, wherein the object information includes an object parameter, and the multi-channel information includes at least one of channel level information and channel correlation information.
PCT/KR2007/006315 2006-12-07 2007-12-06 A method and an apparatus for processing an audio signal WO2008069593A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
CN2007800454197A CN101553868B (en) 2006-12-07 2007-12-06 A method and an apparatus for processing an audio signal
KR1020097014216A KR101128815B1 (en) 2006-12-07 2007-12-06 A method an apparatus for processing an audio signal
EP07851286.0A EP2122612B1 (en) 2006-12-07 2007-12-06 A method and an apparatus for processing an audio signal
JP2009540163A JP5209637B2 (en) 2006-12-07 2007-12-06 Audio processing method and apparatus

Applications Claiming Priority (20)

Application Number Priority Date Filing Date Title
US86907706P 2006-12-07 2006-12-07
US60/869,077 2006-12-07
US87713406P 2006-12-27 2006-12-27
US60/877,134 2006-12-27
US88356907P 2007-01-05 2007-01-05
US60/883,569 2007-01-05
US88404307P 2007-01-09 2007-01-09
US60/884,043 2007-01-09
US88434707P 2007-01-10 2007-01-10
US60/884,347 2007-01-10
US88458507P 2007-01-11 2007-01-11
US60/884,585 2007-01-11
US88534707P 2007-01-17 2007-01-17
US88534307P 2007-01-17 2007-01-17
US60/885,347 2007-01-17
US60/885,343 2007-01-17
US88971507P 2007-02-13 2007-02-13
US60/889,715 2007-02-13
US95539507P 2007-08-13 2007-08-13
US60/955,395 2007-08-13

Publications (1)

Publication Number Publication Date
WO2008069593A1 true WO2008069593A1 (en) 2008-06-12

Family

ID=39492395

Family Applications (5)

Application Number Title Priority Date Filing Date
PCT/KR2007/006316 WO2008069594A1 (en) 2006-12-07 2007-12-06 A method and an apparatus for processing an audio signal
PCT/KR2007/006317 WO2008069595A1 (en) 2006-12-07 2007-12-06 A method and an apparatus for processing an audio signal
PCT/KR2007/006318 WO2008069596A1 (en) 2006-12-07 2007-12-06 A method and an apparatus for processing an audio signal
PCT/KR2007/006319 WO2008069597A1 (en) 2006-12-07 2007-12-06 A method and an apparatus for processing an audio signal
PCT/KR2007/006315 WO2008069593A1 (en) 2006-12-07 2007-12-06 A method and an apparatus for processing an audio signal

Family Applications Before (4)

Application Number Title Priority Date Filing Date
PCT/KR2007/006316 WO2008069594A1 (en) 2006-12-07 2007-12-06 A method and an apparatus for processing an audio signal
PCT/KR2007/006317 WO2008069595A1 (en) 2006-12-07 2007-12-06 A method and an apparatus for processing an audio signal
PCT/KR2007/006318 WO2008069596A1 (en) 2006-12-07 2007-12-06 A method and an apparatus for processing an audio signal
PCT/KR2007/006319 WO2008069597A1 (en) 2006-12-07 2007-12-06 A method and an apparatus for processing an audio signal

Country Status (11)

Country Link
US (11) US8428267B2 (en)
EP (6) EP2122612B1 (en)
JP (5) JP5450085B2 (en)
KR (5) KR101128815B1 (en)
CN (5) CN101553868B (en)
AU (1) AU2007328614B2 (en)
BR (1) BRPI0719884B1 (en)
CA (1) CA2670864C (en)
MX (1) MX2009005969A (en)
TW (1) TWI371743B (en)
WO (5) WO2008069594A1 (en)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2140450A1 (en) * 2007-03-09 2010-01-06 LG Electronics Inc. A method and an apparatus for processing an audio signal
WO2010008198A2 (en) * 2008-07-15 2010-01-21 Lg Electronics Inc. A method and an apparatus for processing an audio signal
EP2158587A1 (en) * 2007-06-08 2010-03-03 Lg Electronics Inc. A method and an apparatus for processing an audio signal
EP2175670A1 (en) * 2008-10-07 2010-04-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Binaural rendering of a multi-channel audio signal
WO2010085083A2 (en) * 2009-01-20 2010-07-29 Lg Electronics Inc. An apparatus for processing an audio signal and method thereof
EP2273492A2 (en) * 2008-03-31 2011-01-12 Electronics and Telecommunications Research Institute Method and apparatus for generating additional information bit stream of multi-object audio signal
EP2279618A1 (en) * 2008-04-23 2011-02-02 Electronics and Telecommunications Research Institute Method for generating and playing object-based audio contents and computer readable recording medium for recoding data having file format structure for object-based audio service
KR101171314B1 (en) * 2008-07-15 2012-08-10 엘지전자 주식회사 A method and an apparatus for processing an audio signal
CN102292768B (en) * 2009-01-20 2013-03-27 Lg电子株式会社 A method and an apparatus for processing an audio signal
US8422688B2 (en) 2007-09-06 2013-04-16 Lg Electronics Inc. Method and an apparatus of decoding an audio signal
US8463413B2 (en) 2007-03-09 2013-06-11 Lg Electronics Inc. Method and an apparatus for processing an audio signal
WO2013108200A1 (en) * 2012-01-19 2013-07-25 Koninklijke Philips N.V. Spatial audio rendering and encoding
CN103354630A (en) * 2008-07-17 2013-10-16 弗朗霍夫应用科学研究促进协会 Apparatus and method for generating audio output signals using object based metadata
WO2014174344A1 (en) * 2013-04-26 2014-10-30 Nokia Corporation Audio signal encoder
EP3005352A1 (en) * 2013-05-24 2016-04-13 Dolby International AB Methods for audio encoding and decoding, corresponding computer-readable media and corresponding audio encoder and decoder
US9666198B2 (en) 2013-05-24 2017-05-30 Dolby International Ab Reconstruction of audio scenes from a downmix
US9911423B2 (en) 2014-01-13 2018-03-06 Nokia Technologies Oy Multi-channel audio signal classifier
US9947325B2 (en) 2013-11-27 2018-04-17 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder, encoder and method for informed loudness estimation employing by-pass audio object signals in object-based audio coding systems
US10431227B2 (en) 2013-07-22 2019-10-01 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Multi-channel audio decoder, multi-channel audio encoder, methods, computer program and encoded audio representation using a decorrelation of rendered audio signals
US10448185B2 (en) 2013-07-22 2019-10-15 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Multi-channel decorrelator, multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a premix of decorrelator input signals
EP3063955B1 (en) * 2013-10-31 2019-10-16 Dolby Laboratories Licensing Corporation Binaural rendering for headphones using metadata processing
US10468039B2 (en) 2013-05-24 2019-11-05 Dolby International Ab Decoding of audio scenes
US10672408B2 (en) 2015-08-25 2020-06-02 Dolby Laboratories Licensing Corporation Audio decoder and decoding method

Families Citing this family (80)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1691348A1 (en) * 2005-02-14 2006-08-16 Ecole Polytechnique Federale De Lausanne Parametric joint-coding of audio sources
JP4988716B2 (en) 2005-05-26 2012-08-01 エルジー エレクトロニクス インコーポレイティド Audio signal decoding method and apparatus
US8917874B2 (en) * 2005-05-26 2014-12-23 Lg Electronics Inc. Method and apparatus for decoding an audio signal
JP2009500656A (en) * 2005-06-30 2009-01-08 エルジー エレクトロニクス インコーポレイティド Apparatus and method for encoding and decoding audio signals
US8073702B2 (en) * 2005-06-30 2011-12-06 Lg Electronics Inc. Apparatus for encoding and decoding audio signal and method thereof
WO2007007500A1 (en) * 2005-07-11 2007-01-18 Matsushita Electric Industrial Co., Ltd. Ultrasonic flaw detection method and ultrasonic flaw detection device
TWI333386B (en) * 2006-01-19 2010-11-11 Lg Electronics Inc Method and apparatus for processing a media signal
TWI483244B (en) * 2006-02-07 2015-05-01 Lg Electronics Inc Apparatus and method for encoding/decoding signal
US8611547B2 (en) * 2006-07-04 2013-12-17 Electronics And Telecommunications Research Institute Apparatus and method for restoring multi-channel audio signal using HE-AAC decoder and MPEG surround decoder
JP5450085B2 (en) * 2006-12-07 2014-03-26 エルジー エレクトロニクス インコーポレイティド Audio processing method and apparatus
CN101578658B (en) * 2007-01-10 2012-06-20 皇家飞利浦电子股份有限公司 Audio decoder
WO2010041877A2 (en) * 2008-10-08 2010-04-15 Lg Electronics Inc. A method and an apparatus for processing a signal
CN102440003B (en) 2008-10-20 2016-01-27 吉诺迪奥公司 Audio spatialization and environmental simulation
US8861739B2 (en) 2008-11-10 2014-10-14 Nokia Corporation Apparatus and method for generating a multichannel signal
KR20100065121A (en) * 2008-12-05 2010-06-15 엘지전자 주식회사 Method and apparatus for processing an audio signal
EP2194526A1 (en) * 2008-12-05 2010-06-09 Lg Electronics Inc. A method and apparatus for processing an audio signal
JP5309944B2 (en) * 2008-12-11 2013-10-09 富士通株式会社 Audio decoding apparatus, method, and program
WO2010087631A2 (en) * 2009-01-28 2010-08-05 Lg Electronics Inc. A method and an apparatus for decoding an audio signal
US8139773B2 (en) * 2009-01-28 2012-03-20 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
KR101137361B1 (en) 2009-01-28 2012-04-26 엘지전자 주식회사 A method and an apparatus for processing an audio signal
US20100324915A1 (en) * 2009-06-23 2010-12-23 Electronic And Telecommunications Research Institute Encoding and decoding apparatuses for high quality multi-channel audio codec
MY165327A (en) * 2009-10-16 2018-03-21 Fraunhofer Ges Forschung Apparatus,method and computer program for providing one or more adjusted parameters for provision of an upmix signal representation on the basis of a downmix signal representation and a parametric side information associated with the downmix signal representation,using an average value
WO2011048067A1 (en) * 2009-10-20 2011-04-28 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E. V. Apparatus for providing an upmix signal representation on the basis of a downmix signal representation, apparatus for providing a bitstream representing a multichannel audio signal, methods, computer program and bitstream using a distortion control signaling
KR101106465B1 (en) * 2009-11-09 2012-01-20 네오피델리티 주식회사 Method for adjusting gain of multiband drc system and multiband drc system using the same
AU2010321013B2 (en) * 2009-11-20 2014-05-29 Dolby International Ab Apparatus for providing an upmix signal representation on the basis of the downmix signal representation, apparatus for providing a bitstream representing a multi-channel audio signal, methods, computer programs and bitstream representing a multi-channel audio signal using a linear combination parameter
EP2511908A4 (en) * 2009-12-11 2013-07-31 Korea Electronics Telecomm Audio authoring apparatus and audio playback apparatus for an object-based audio service, and audio authoring method and audio playback method using same
EP2522016A4 (en) 2010-01-06 2015-04-22 Lg Electronics Inc An apparatus for processing an audio signal and method thereof
EP2557190A4 (en) * 2010-03-29 2014-02-19 Hitachi Metals Ltd Initial ultrafine crystal alloy, nanocrystal soft magnetic alloy and method for producing same, and magnetic component formed from nanocrystal soft magnetic alloy
KR20120004909A (en) * 2010-07-07 2012-01-13 삼성전자주식회사 Method and apparatus for 3d sound reproducing
WO2012009851A1 (en) 2010-07-20 2012-01-26 Huawei Technologies Co., Ltd. Audio signal synthesizer
US8948403B2 (en) * 2010-08-06 2015-02-03 Samsung Electronics Co., Ltd. Method of processing signal, encoding apparatus thereof, decoding apparatus thereof, and signal processing system
JP5903758B2 (en) 2010-09-08 2016-04-13 ソニー株式会社 Signal processing apparatus and method, program, and data recording medium
AU2012279357B2 (en) * 2011-07-01 2016-01-14 Dolby Laboratories Licensing Corporation System and method for adaptive audio signal generation, coding and rendering
EP2560161A1 (en) 2011-08-17 2013-02-20 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Optimal mixing matrices and usage of decorrelators in spatial audio processing
CN103050124B (en) 2011-10-13 2016-03-30 华为终端有限公司 Sound mixing method, Apparatus and system
JP6096789B2 (en) * 2011-11-01 2017-03-15 コーニンクレッカ フィリップス エヌ ヴェKoninklijke Philips N.V. Audio object encoding and decoding
US9516446B2 (en) * 2012-07-20 2016-12-06 Qualcomm Incorporated Scalable downmix design for object-based surround codec with cluster analysis by synthesis
US9761229B2 (en) 2012-07-20 2017-09-12 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for audio object clustering
KR20140017338A (en) * 2012-07-31 2014-02-11 인텔렉추얼디스커버리 주식회사 Apparatus and method for audio signal processing
CN104541524B (en) 2012-07-31 2017-03-08 英迪股份有限公司 A kind of method and apparatus for processing audio signal
WO2014020181A1 (en) 2012-08-03 2014-02-06 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder and method for multi-instance spatial-audio-object-coding employing a parametric concept for multichannel downmix/upmix cases
BR112015005456B1 (en) * 2012-09-12 2022-03-29 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E. V. Apparatus and method for providing enhanced guided downmix capabilities for 3d audio
US9344050B2 (en) * 2012-10-31 2016-05-17 Maxim Integrated Products, Inc. Dynamic speaker management with echo cancellation
JP6169718B2 (en) * 2012-12-04 2017-07-26 サムスン エレクトロニクス カンパニー リミテッド Audio providing apparatus and audio providing method
MX347551B (en) 2013-01-15 2017-05-02 Koninklijke Philips Nv Binaural audio processing.
RU2656717C2 (en) 2013-01-17 2018-06-06 Конинклейке Филипс Н.В. Binaural audio processing
EP2757559A1 (en) * 2013-01-22 2014-07-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for spatial audio object coding employing hidden objects for signal mixture manipulation
US9208775B2 (en) 2013-02-21 2015-12-08 Qualcomm Incorporated Systems and methods for determining pitch pulse period signal boundaries
US9497560B2 (en) 2013-03-13 2016-11-15 Panasonic Intellectual Property Management Co., Ltd. Audio reproducing apparatus and method
CN108806704B (en) 2013-04-19 2023-06-06 韩国电子通信研究院 Multi-channel audio signal processing device and method
US10075795B2 (en) 2013-04-19 2018-09-11 Electronics And Telecommunications Research Institute Apparatus and method for processing multi-channel audio signal
KR20140128564A (en) * 2013-04-27 2014-11-06 인텔렉추얼디스커버리 주식회사 Audio system and method for sound localization
US9495968B2 (en) * 2013-05-29 2016-11-15 Qualcomm Incorporated Identifying sources from which higher order ambisonic audio data is generated
KR101454342B1 (en) * 2013-05-31 2014-10-23 한국산업은행 Apparatus for creating additional channel audio signal using surround channel audio signal and method thereof
CN105378826B (en) 2013-05-31 2019-06-11 诺基亚技术有限公司 Audio scene device
EP2830048A1 (en) 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for realizing a SAOC downmix of 3D audio content
EP2830045A1 (en) 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Concept for audio encoding and decoding for audio channels and audio objects
EP2830047A1 (en) 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for low delay object metadata coding
US9319819B2 (en) * 2013-07-25 2016-04-19 Etri Binaural rendering method and apparatus for decoding multi channel audio
KR102243395B1 (en) * 2013-09-05 2021-04-22 한국전자통신연구원 Apparatus for encoding audio signal, apparatus for decoding audio signal, and apparatus for replaying audio signal
TWI671734B (en) 2013-09-12 2019-09-11 瑞典商杜比國際公司 Decoding method, encoding method, decoding device, and encoding device in multichannel audio system comprising three audio channels, computer program product comprising a non-transitory computer-readable medium with instructions for performing decoding m
CA3122726C (en) 2013-09-17 2023-05-09 Wilus Institute Of Standards And Technology Inc. Method and apparatus for processing multimedia signals
WO2015059154A1 (en) * 2013-10-21 2015-04-30 Dolby International Ab Audio encoder and decoder
KR101804744B1 (en) 2013-10-22 2017-12-06 연세대학교 산학협력단 Method and apparatus for processing audio signal
EP2866227A1 (en) * 2013-10-22 2015-04-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Method for decoding and encoding a downmix matrix, method for presenting audio content, encoder and decoder for a downmix matrix, audio encoder and audio decoder
EP3934283B1 (en) 2013-12-23 2023-08-23 Wilus Institute of Standards and Technology Inc. Audio signal processing method and parameterization device for same
EP3122073B1 (en) 2014-03-19 2023-12-20 Wilus Institute of Standards and Technology Inc. Audio signal processing method and apparatus
CN106165452B (en) 2014-04-02 2018-08-21 韦勒斯标准与技术协会公司 Acoustic signal processing method and equipment
CN110636415B (en) 2014-08-29 2021-07-23 杜比实验室特许公司 Method, system, and storage medium for processing audio
CN106688253A (en) * 2014-09-12 2017-05-17 杜比实验室特许公司 Rendering audio objects in a reproduction environment that includes surround and/or height speakers
TWI587286B (en) 2014-10-31 2017-06-11 杜比國際公司 Method and system for decoding and encoding of audio signals, computer program product, and computer-readable medium
US9609383B1 (en) * 2015-03-23 2017-03-28 Amazon Technologies, Inc. Directional audio for virtual environments
KR102537541B1 (en) 2015-06-17 2023-05-26 삼성전자주식회사 Internal channel processing method and apparatus for low computational format conversion
CN109427337B (en) 2017-08-23 2021-03-30 华为技术有限公司 Method and device for reconstructing a signal during coding of a stereo signal
US11004457B2 (en) * 2017-10-18 2021-05-11 Htc Corporation Sound reproducing method, apparatus and non-transitory computer readable storage medium thereof
DE102018206025A1 (en) * 2018-02-19 2019-08-22 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for object-based spatial audio mastering
KR102471718B1 (en) * 2019-07-25 2022-11-28 한국전자통신연구원 Broadcastiong transmitting and reproducing apparatus and method for providing the object audio
JP2022544795A (en) * 2019-08-19 2022-10-21 ドルビー ラボラトリーズ ライセンシング コーポレイション Audio binaural steering
CN111654745B (en) * 2020-06-08 2022-10-14 海信视像科技股份有限公司 Multi-channel signal processing method and display device
CN117580779A (en) 2023-04-25 2024-02-20 马渊马达株式会社 Packaging structure

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005086139A1 (en) * 2004-03-01 2005-09-15 Dolby Laboratories Licensing Corporation Multichannel audio coding
US20060115100A1 (en) * 2004-11-30 2006-06-01 Christof Faller Parametric coding of spatial audio with cues based on transmitted channels
US20060133618A1 (en) * 2004-11-02 2006-06-22 Lars Villemoes Stereo compatible multi-channel audio coding
JP2006323408A (en) * 2006-07-07 2006-11-30 Victor Co Of Japan Ltd Audio encoding method and audio decoding method

Family Cites Families (67)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0079886B1 (en) 1981-05-29 1986-08-27 International Business Machines Corporation Aspirator for an ink jet printer
FR2567984B1 (en) * 1984-07-20 1986-08-14 Centre Techn Ind Mecanique PROPORTIONAL HYDRAULIC DISTRIBUTOR
DE69210689T2 (en) 1991-01-08 1996-11-21 Dolby Lab Licensing Corp ENCODER / DECODER FOR MULTI-DIMENSIONAL SOUND FIELDS
US6141446A (en) * 1994-09-21 2000-10-31 Ricoh Company, Ltd. Compression and decompression system with reversible wavelets and lossy reconstruction
US5838664A (en) * 1997-07-17 1998-11-17 Videoserver, Inc. Video teleconferencing system with digital transcoding
US5956674A (en) 1995-12-01 1999-09-21 Digital Theater Systems, Inc. Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
EP0798866A2 (en) 1996-03-27 1997-10-01 Kabushiki Kaisha Toshiba Digital data processing system
US6128597A (en) * 1996-05-03 2000-10-03 Lsi Logic Corporation Audio decoder with a reconfigurable downmixing/windowing pipeline and method therefor
US5912976A (en) 1996-11-07 1999-06-15 Srs Labs, Inc. Multi-channel audio enhancement system for use in recording and playback and methods for providing same
US6131084A (en) 1997-03-14 2000-10-10 Digital Voice Systems, Inc. Dual subframe quantization of spectral magnitudes
AU740617C (en) 1997-06-18 2002-08-08 Clarity, L.L.C. Methods and apparatus for blind signal separation
US6026168A (en) 1997-11-14 2000-02-15 Microtek Lab, Inc. Methods and apparatus for automatically synchronizing and regulating volume in audio component systems
WO1999053479A1 (en) * 1998-04-15 1999-10-21 Sgs-Thomson Microelectronics Asia Pacific (Pte) Ltd. Fast frame optimisation in an audio encoder
US6122619A (en) 1998-06-17 2000-09-19 Lsi Logic Corporation Audio decoder with programmable downmixing of MPEG/AC-3 and method therefor
FI114833B (en) * 1999-01-08 2004-12-31 Nokia Corp A method, a speech encoder and a mobile station for generating speech coding frames
US7103187B1 (en) * 1999-03-30 2006-09-05 Lsi Logic Corporation Audio calibration system
US6539357B1 (en) 1999-04-29 2003-03-25 Agere Systems Inc. Technique for parametric coding of a signal containing information
BR0109017A (en) 2000-03-03 2003-07-22 Cardiac M R I Inc Magnetic resonance specimen analysis apparatus
KR100809310B1 (en) * 2000-07-19 2008-03-04 코닌클리케 필립스 일렉트로닉스 엔.브이. Multi-channel stereo converter for deriving a stereo surround and/or audio centre signal
US7292901B2 (en) * 2002-06-24 2007-11-06 Agere Systems Inc. Hybrid multi-channel/cue coding/decoding of audio signals
US7583805B2 (en) 2004-02-12 2009-09-01 Agere Systems Inc. Late reverberation-based synthesis of auditory scenes
SE0202159D0 (en) * 2001-07-10 2002-07-09 Coding Technologies Sweden Ab Efficientand scalable parametric stereo coding for low bitrate applications
US7032116B2 (en) * 2001-12-21 2006-04-18 Intel Corporation Thermal management for computer systems running legacy or thermal management operating systems
KR101021079B1 (en) 2002-04-22 2011-03-14 코닌클리케 필립스 일렉트로닉스 엔.브이. Parametric multi-channel audio representation
KR101016982B1 (en) 2002-04-22 2011-02-28 코닌클리케 필립스 일렉트로닉스 엔.브이. Decoding apparatus
JP4013822B2 (en) 2002-06-17 2007-11-28 ヤマハ株式会社 Mixer device and mixer program
JP2005533271A (en) 2002-07-16 2005-11-04 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Audio encoding
KR100542129B1 (en) 2002-10-28 2006-01-11 한국전자통신연구원 Object-based three dimensional audio system and control method
JP4084990B2 (en) 2002-11-19 2008-04-30 株式会社ケンウッド Encoding device, decoding device, encoding method and decoding method
JP4496379B2 (en) 2003-09-17 2010-07-07 財団法人北九州産業学術推進機構 Reconstruction method of target speech based on shape of amplitude frequency distribution of divided spectrum series
US6937737B2 (en) 2003-10-27 2005-08-30 Britannia Investment Corporation Multi-channel audio surround sound from front located loudspeakers
TWI233091B (en) 2003-11-18 2005-05-21 Ali Corp Audio mixing output device and method for dynamic range control
US7394903B2 (en) * 2004-01-20 2008-07-01 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal
US7805313B2 (en) * 2004-03-04 2010-09-28 Agere Systems Inc. Frequency-based coding of channels in parametric multi-channel coding systems
SE0400997D0 (en) 2004-04-16 2004-04-16 Cooding Technologies Sweden Ab Efficient coding or multi-channel audio
SE0400998D0 (en) 2004-04-16 2004-04-16 Cooding Technologies Sweden Ab Method for representing multi-channel audio signals
US8843378B2 (en) 2004-06-30 2014-09-23 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Multi-channel synthesizer and method for generating a multi-channel output signal
JP4934427B2 (en) 2004-07-02 2012-05-16 パナソニック株式会社 Speech signal decoding apparatus and speech signal encoding apparatus
US7391870B2 (en) 2004-07-09 2008-06-24 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E V Apparatus and method for generating a multi-channel output signal
KR100663729B1 (en) 2004-07-09 2007-01-02 한국전자통신연구원 Method and apparatus for encoding and decoding multi-channel audio signal using virtual source location information
KR100745688B1 (en) 2004-07-09 2007-08-03 한국전자통신연구원 Apparatus for encoding and decoding multichannel audio signal and method thereof
WO2006006809A1 (en) 2004-07-09 2006-01-19 Electronics And Telecommunications Research Institute Method and apparatus for encoding and cecoding multi-channel audio signal using virtual source location information
PL1769655T3 (en) 2004-07-14 2012-05-31 Koninl Philips Electronics Nv Method, device, encoder apparatus, decoder apparatus and audio system
DE602005016931D1 (en) * 2004-07-14 2009-11-12 Dolby Sweden Ab TONKANALKONVERTIERUNG
JP4892184B2 (en) * 2004-10-14 2012-03-07 パナソニック株式会社 Acoustic signal encoding apparatus and acoustic signal decoding apparatus
US8204261B2 (en) 2004-10-20 2012-06-19 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Diffuse sound shaping for BCC schemes and the like
US7720230B2 (en) 2004-10-20 2010-05-18 Agere Systems, Inc. Individual channel shaping for BCC schemes and the like
SE0402652D0 (en) 2004-11-02 2004-11-02 Coding Tech Ab Methods for improved performance of prediction based multi-channel reconstruction
KR100682904B1 (en) 2004-12-01 2007-02-15 삼성전자주식회사 Apparatus and method for processing multichannel audio signal using space information
US7903824B2 (en) 2005-01-10 2011-03-08 Agere Systems Inc. Compact side information for parametric coding of spatial audio
EP1691348A1 (en) * 2005-02-14 2006-08-16 Ecole Polytechnique Federale De Lausanne Parametric joint-coding of audio sources
DE602006015294D1 (en) * 2005-03-30 2010-08-19 Dolby Int Ab MULTI-CHANNEL AUDIO CODING
US20060262936A1 (en) * 2005-05-13 2006-11-23 Pioneer Corporation Virtual surround decoder apparatus
KR20060122694A (en) * 2005-05-26 2006-11-30 엘지전자 주식회사 Method of inserting spatial bitstream in at least two channel down-mix audio signal
EP1905004A2 (en) 2005-05-26 2008-04-02 LG Electronics Inc. Method of encoding and decoding an audio signal
CA2610430C (en) 2005-06-03 2016-02-23 Dolby Laboratories Licensing Corporation Channel reconfiguration with side information
US20070055510A1 (en) * 2005-07-19 2007-03-08 Johannes Hilpert Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
RU2414741C2 (en) 2005-07-29 2011-03-20 ЭлДжи ЭЛЕКТРОНИКС ИНК. Method of generating multichannel signal
US20070083365A1 (en) * 2005-10-06 2007-04-12 Dts, Inc. Neural network classifier for separating audio sources from a monophonic audio signal
EP1640972A1 (en) 2005-12-23 2006-03-29 Phonak AG System and method for separation of a users voice from ambient sound
DE602006016017D1 (en) * 2006-01-09 2010-09-16 Nokia Corp CONTROLLING THE DECODING OF BINAURAL AUDIO SIGNALS
BRPI0713236B1 (en) * 2006-07-07 2020-03-10 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. CONCEPT FOR COMBINING MULTIPLE PARAMETRICALLY ENCODED AUDIO SOURCES
EP2067138B1 (en) 2006-09-18 2011-02-23 Koninklijke Philips Electronics N.V. Encoding and decoding of audio objects
KR20090013178A (en) * 2006-09-29 2009-02-04 엘지전자 주식회사 Methods and apparatuses for encoding and decoding object-based audio signals
EP2068307B1 (en) * 2006-10-16 2011-12-07 Dolby International AB Enhanced coding and parameter representation of multichannel downmixed object coding
RU2431940C2 (en) 2006-10-16 2011-10-20 Фраунхофер-Гезелльшафт цур Фёрдерунг дер ангевандтен Форшунг Е.Ф. Apparatus and method for multichannel parametric conversion
JP5450085B2 (en) * 2006-12-07 2014-03-26 エルジー エレクトロニクス インコーポレイティド Audio processing method and apparatus

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005086139A1 (en) * 2004-03-01 2005-09-15 Dolby Laboratories Licensing Corporation Multichannel audio coding
US20060133618A1 (en) * 2004-11-02 2006-06-22 Lars Villemoes Stereo compatible multi-channel audio coding
US20060115100A1 (en) * 2004-11-30 2006-06-01 Christof Faller Parametric coding of spatial audio with cues based on transmitted channels
JP2006323408A (en) * 2006-07-07 2006-11-30 Victor Co Of Japan Ltd Audio encoding method and audio decoding method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"Draft Call for Proposals on Spatial Audio Object Coding", JVT OF ISO/IEC MPEG & ITU-T VCEG, 27 October 2006 (2006-10-27)
HERRE J. ET AL.: "From Channel-Oriented to Object-Oriented Spatial Audio Coding", JVT OF ISO/IEC MPEG & ITU-T VCEG, 12 July 2006 (2006-07-12)

Cited By (75)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2008225321B2 (en) * 2007-03-09 2010-11-18 Lg Electronics Inc. A method and an apparatus for processing an audio signal
US8594817B2 (en) 2007-03-09 2013-11-26 Lg Electronics Inc. Method and an apparatus for processing an audio signal
EP2140450A4 (en) * 2007-03-09 2010-03-17 Lg Electronics Inc A method and an apparatus for processing an audio signal
US8463413B2 (en) 2007-03-09 2013-06-11 Lg Electronics Inc. Method and an apparatus for processing an audio signal
US8359113B2 (en) 2007-03-09 2013-01-22 Lg Electronics Inc. Method and an apparatus for processing an audio signal
EP2140450A1 (en) * 2007-03-09 2010-01-06 LG Electronics Inc. A method and an apparatus for processing an audio signal
EP2158587A1 (en) * 2007-06-08 2010-03-03 Lg Electronics Inc. A method and an apparatus for processing an audio signal
EP2158587A4 (en) * 2007-06-08 2010-06-02 Lg Electronics Inc A method and an apparatus for processing an audio signal
EP2278582A3 (en) * 2007-06-08 2011-02-16 Lg Electronics Inc. A method and an apparatus for processing an audio signal
US8422688B2 (en) 2007-09-06 2013-04-16 Lg Electronics Inc. Method and an apparatus of decoding an audio signal
EP2273492A4 (en) * 2008-03-31 2012-06-13 Korea Electronics Telecomm Method and apparatus for generating additional information bit stream of multi-object audio signal
EP3147899A1 (en) * 2008-03-31 2017-03-29 Electronics and Telecommunications Research Institute Method and apparatus for decoding a multi-object audio signal
US9299352B2 (en) 2008-03-31 2016-03-29 Electronics And Telecommunications Research Institute Method and apparatus for generating side information bitstream of multi-object audio signal
EP2273492A2 (en) * 2008-03-31 2011-01-12 Electronics and Telecommunications Research Institute Method and apparatus for generating additional information bit stream of multi-object audio signal
CN102800320A (en) * 2008-03-31 2012-11-28 韩国电子通信研究院 Method and apparatus for generating additional information bit stream of multi-object audio signal
US8976983B2 (en) 2008-04-23 2015-03-10 Electronics And Telecommunications Research Institute Method for generating and playing object-based audio contents and computer readable recording medium for recoding data having file format structure for object-based audio service
EP2279618A1 (en) * 2008-04-23 2011-02-02 Electronics and Telecommunications Research Institute Method for generating and playing object-based audio contents and computer readable recording medium for recoding data having file format structure for object-based audio service
EP2279618A4 (en) * 2008-04-23 2012-11-21 Korea Electronics Telecomm Method for generating and playing object-based audio contents and computer readable recording medium for recoding data having file format structure for object-based audio service
US8639368B2 (en) 2008-07-15 2014-01-28 Lg Electronics Inc. Method and an apparatus for processing an audio signal
WO2010008198A3 (en) * 2008-07-15 2010-06-03 Lg Electronics Inc. A method and an apparatus for processing an audio signal
US9445187B2 (en) 2008-07-15 2016-09-13 Lg Electronics Inc. Method and an apparatus for processing an audio signal
WO2010008198A2 (en) * 2008-07-15 2010-01-21 Lg Electronics Inc. A method and an apparatus for processing an audio signal
US8452430B2 (en) 2008-07-15 2013-05-28 Lg Electronics Inc. Method and an apparatus for processing an audio signal
KR101171314B1 (en) * 2008-07-15 2012-08-10 엘지전자 주식회사 A method and an apparatus for processing an audio signal
CN103354630A (en) * 2008-07-17 2013-10-16 弗朗霍夫应用科学研究促进协会 Apparatus and method for generating audio output signals using object based metadata
US8824688B2 (en) 2008-07-17 2014-09-02 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio output signals using object based metadata
RU2510906C2 (en) * 2008-07-17 2014-04-10 Фраунхофер-Гезелльшафт цур Фёрдерунг дер ангевандтен Форшунг Е.Ф. Apparatus and method of generating output audio signals using object based metadata
AU2009301467B2 (en) * 2008-10-07 2013-08-01 Dolby International Ab Binaural rendering of a multi-channel audio signal
EP2175670A1 (en) * 2008-10-07 2010-04-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Binaural rendering of a multi-channel audio signal
WO2010040456A1 (en) * 2008-10-07 2010-04-15 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Binaural rendering of a multi-channel audio signal
US8325929B2 (en) 2008-10-07 2012-12-04 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Binaural rendering of a multi-channel audio signal
US9484039B2 (en) 2009-01-20 2016-11-01 Lg Electronics Inc. Method and an apparatus for processing an audio signal
US8620008B2 (en) 2009-01-20 2013-12-31 Lg Electronics Inc. Method and an apparatus for processing an audio signal
CN102292768B (en) * 2009-01-20 2013-03-27 Lg电子株式会社 A method and an apparatus for processing an audio signal
WO2010085083A3 (en) * 2009-01-20 2010-10-21 Lg Electronics Inc. An apparatus for processing an audio signal and method thereof
US9542951B2 (en) 2009-01-20 2017-01-10 Lg Electronics Inc. Method and an apparatus for processing an audio signal
WO2010085083A2 (en) * 2009-01-20 2010-07-29 Lg Electronics Inc. An apparatus for processing an audio signal and method thereof
WO2013108200A1 (en) * 2012-01-19 2013-07-25 Koninklijke Philips N.V. Spatial audio rendering and encoding
US9584912B2 (en) 2012-01-19 2017-02-28 Koninklijke Philips N.V. Spatial audio rendering and encoding
US9659569B2 (en) 2013-04-26 2017-05-23 Nokia Technologies Oy Audio signal encoder
WO2014174344A1 (en) * 2013-04-26 2014-10-30 Nokia Corporation Audio signal encoder
US10468039B2 (en) 2013-05-24 2019-11-05 Dolby International Ab Decoding of audio scenes
US10726853B2 (en) 2013-05-24 2020-07-28 Dolby International Ab Decoding of audio scenes
US9666198B2 (en) 2013-05-24 2017-05-30 Dolby International Ab Reconstruction of audio scenes from a downmix
KR101761099B1 (en) 2013-05-24 2017-07-25 돌비 인터네셔널 에이비 Methods for audio encoding and decoding, corresponding computer-readable media and corresponding audio encoder and decoder
US9818412B2 (en) 2013-05-24 2017-11-14 Dolby International Ab Methods for audio encoding and decoding, corresponding computer-readable media and corresponding audio encoder and decoder
US11894003B2 (en) 2013-05-24 2024-02-06 Dolby International Ab Reconstruction of audio scenes from a downmix
US11682403B2 (en) 2013-05-24 2023-06-20 Dolby International Ab Decoding of audio scenes
US10290304B2 (en) 2013-05-24 2019-05-14 Dolby International Ab Reconstruction of audio scenes from a downmix
US11580995B2 (en) 2013-05-24 2023-02-14 Dolby International Ab Reconstruction of audio scenes from a downmix
EP3005352B1 (en) * 2013-05-24 2017-03-29 Dolby International AB Audio object encoding and decoding
US11315577B2 (en) 2013-05-24 2022-04-26 Dolby International Ab Decoding of audio scenes
EP3005352A1 (en) * 2013-05-24 2016-04-13 Dolby International AB Methods for audio encoding and decoding, corresponding computer-readable media and corresponding audio encoder and decoder
US10468041B2 (en) 2013-05-24 2019-11-05 Dolby International Ab Decoding of audio scenes
US10468040B2 (en) 2013-05-24 2019-11-05 Dolby International Ab Decoding of audio scenes
US10971163B2 (en) 2013-05-24 2021-04-06 Dolby International Ab Reconstruction of audio scenes from a downmix
US10448185B2 (en) 2013-07-22 2019-10-15 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Multi-channel decorrelator, multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a premix of decorrelator input signals
US11381925B2 (en) 2013-07-22 2022-07-05 Fraunhofer-Gesellschaft zur Foerderang der angewandten Forschung e.V. Multi-channel decorrelator, multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a premix of decorrelator input signals
US10431227B2 (en) 2013-07-22 2019-10-01 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Multi-channel audio decoder, multi-channel audio encoder, methods, computer program and encoded audio representation using a decorrelation of rendered audio signals
US11252523B2 (en) 2013-07-22 2022-02-15 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Multi-channel decorrelator, multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a premix of decorrelator input signals
US11240619B2 (en) 2013-07-22 2022-02-01 Fraunhofer-Gesellschaft zur Foerderang der angewandten Forschung e.V. Multi-channel decorrelator, multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a premix of decorrelator input signals
US11115770B2 (en) 2013-07-22 2021-09-07 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Multi-channel decorrelator, multi-channel audio decoder, multi channel audio encoder, methods and computer program using a premix of decorrelator input signals
EP3063955B1 (en) * 2013-10-31 2019-10-16 Dolby Laboratories Licensing Corporation Binaural rendering for headphones using metadata processing
US11875804B2 (en) 2013-11-27 2024-01-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder, encoder and method for informed loudness estimation employing by-pass audio object signals in object-based audio coding systems
US10891963B2 (en) 2013-11-27 2021-01-12 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder, encoder, and method for informed loudness estimation in object-based audio coding systems
US10699722B2 (en) 2013-11-27 2020-06-30 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder, encoder and method for informed loudness estimation employing by-pass audio object signals in object-based audio coding systems
US11423914B2 (en) 2013-11-27 2022-08-23 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Decoder, encoder and method for informed loudness estimation employing by-pass audio object signals in object-based audio coding systems
US10497376B2 (en) 2013-11-27 2019-12-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder, encoder, and method for informed loudness estimation in object-based audio coding systems
US9947325B2 (en) 2013-11-27 2018-04-17 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder, encoder and method for informed loudness estimation employing by-pass audio object signals in object-based audio coding systems
US11688407B2 (en) 2013-11-27 2023-06-27 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder, encoder, and method for informed loudness estimation in object-based audio coding systems
US9911423B2 (en) 2014-01-13 2018-03-06 Nokia Technologies Oy Multi-channel audio signal classifier
US10672408B2 (en) 2015-08-25 2020-06-02 Dolby Laboratories Licensing Corporation Audio decoder and decoding method
US11705143B2 (en) 2015-08-25 2023-07-18 Dolby Laboratories Licensing Corporation Audio decoder and decoding method
US11423917B2 (en) 2015-08-25 2022-08-23 Dolby International Ab Audio decoder and decoding method
US12002480B2 (en) 2015-08-25 2024-06-04 Dolby Laboratories Licensing Corporation Audio decoder and decoding method

Also Published As

Publication number Publication date
KR101111520B1 (en) 2012-05-24
BRPI0719884A2 (en) 2014-02-11
US8340325B2 (en) 2012-12-25
CN101553868A (en) 2009-10-07
EP2187386A2 (en) 2010-05-19
EP2122613A4 (en) 2010-01-13
US8311227B2 (en) 2012-11-13
TWI371743B (en) 2012-09-01
CN101553868B (en) 2012-08-29
CN101553866B (en) 2012-05-30
US20090281814A1 (en) 2009-11-12
CN101568958B (en) 2012-07-18
EP2122612A4 (en) 2010-01-13
US7783048B2 (en) 2010-08-24
US20080205670A1 (en) 2008-08-28
EP2102857B1 (en) 2018-07-18
KR20090098866A (en) 2009-09-17
CN101553867B (en) 2013-04-17
CN101553865B (en) 2012-01-25
KR20090098865A (en) 2009-09-17
US20100010818A1 (en) 2010-01-14
WO2008069594A1 (en) 2008-06-12
US20100010820A1 (en) 2010-01-14
EP2102856A1 (en) 2009-09-23
CN101553865A (en) 2009-10-07
WO2008069596A1 (en) 2008-06-12
US7715569B2 (en) 2010-05-11
KR101111521B1 (en) 2012-03-13
KR20090100386A (en) 2009-09-23
JP5450085B2 (en) 2014-03-26
US8005229B2 (en) 2011-08-23
US20100010819A1 (en) 2010-01-14
US7783050B2 (en) 2010-08-24
TW200834544A (en) 2008-08-16
US20080192941A1 (en) 2008-08-14
US8428267B2 (en) 2013-04-23
US20080199026A1 (en) 2008-08-21
EP2102857A4 (en) 2010-01-20
AU2007328614A1 (en) 2008-06-12
JP5290988B2 (en) 2013-09-18
EP2122612B1 (en) 2018-08-15
JP5270566B2 (en) 2013-08-21
KR20090098864A (en) 2009-09-17
US7986788B2 (en) 2011-07-26
JP2010511909A (en) 2010-04-15
AU2007328614B2 (en) 2010-08-26
US7783049B2 (en) 2010-08-24
WO2008069597A1 (en) 2008-06-12
JP2010511912A (en) 2010-04-15
US8488797B2 (en) 2013-07-16
CN101568958A (en) 2009-10-28
JP2010511911A (en) 2010-04-15
EP2122613A1 (en) 2009-11-25
CN101553867A (en) 2009-10-07
JP2010511908A (en) 2010-04-15
EP2122613B1 (en) 2019-01-30
EP2187386A3 (en) 2010-07-28
JP2010511910A (en) 2010-04-15
US20100010821A1 (en) 2010-01-14
WO2008069595A1 (en) 2008-06-12
EP2102858A4 (en) 2010-01-20
US20100014680A1 (en) 2010-01-21
BRPI0719884B1 (en) 2020-10-27
CA2670864A1 (en) 2008-06-12
KR101100222B1 (en) 2011-12-28
US7783051B2 (en) 2010-08-24
EP2122612A1 (en) 2009-11-25
JP5302207B2 (en) 2013-10-02
MX2009005969A (en) 2009-06-16
EP2187386B1 (en) 2020-02-05
JP5209637B2 (en) 2013-06-12
CN101553866A (en) 2009-10-07
US20080205657A1 (en) 2008-08-28
KR101128815B1 (en) 2012-03-27
EP2102858A1 (en) 2009-09-23
EP2102857A1 (en) 2009-09-23
EP2102856A4 (en) 2010-01-13
KR20090098863A (en) 2009-09-17
CA2670864C (en) 2015-09-29
US20080205671A1 (en) 2008-08-28
KR101100223B1 (en) 2011-12-28

Similar Documents

Publication Publication Date Title
US7783050B2 (en) Method and an apparatus for decoding an audio signal

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200780045419.7

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07851286

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2009540163

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 1020097014216

Country of ref document: KR

Ref document number: 2007851286

Country of ref document: EP