CN101341793A - Method to generate multi-channel audio signals from stereo signals - Google Patents

Method to generate multi-channel audio signals from stereo signals Download PDF

Info

Publication number
CN101341793A
CN101341793A CNA2006800322282A CN200680032228A CN101341793A CN 101341793 A CN101341793 A CN 101341793A CN A2006800322282 A CNA2006800322282 A CN A2006800322282A CN 200680032228 A CN200680032228 A CN 200680032228A CN 101341793 A CN101341793 A CN 101341793A
Authority
CN
China
Prior art keywords
band
sub
sound
signal
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2006800322282A
Other languages
Chinese (zh)
Other versions
CN101341793B (en
Inventor
克里斯托夫·法勒
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LG Electronics Inc
Original Assignee
LG Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LG Electronics Inc filed Critical LG Electronics Inc
Publication of CN101341793A publication Critical patent/CN101341793A/en
Application granted granted Critical
Publication of CN101341793B publication Critical patent/CN101341793B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/002Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S5/00Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation 

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Stereophonic System (AREA)

Abstract

A perceptually motivated spatial decomposition for two-channel stereo audio signals, capturing the information about the virtual sound stage, is proposed. The spatial decomposition allows to re-synthesize audio signals for playback over other sound systems than two-channel stereo. With the use of more front loudspeakers, the width of the virtual sound stage can be increased beyond +/- 30 DEG and the sweet spot region is extended. Optionally, lateral independent sound components can be played back separately over loudspeakers on the two sides of a listener to increase listener envelopment. It is also explained how the spatial decomposition can be used with surround sound and wavefield synthesis based audio system. According to the main embodiment of the invention applying to multiple audio signals, it is proposed to generate multiple output audio signals (y1,..., yM) from multiple input audio signals (x1, ..., xL), in which the number of output is equal or higher than the number of input signals , this method comprising the steps of: by means of linear combinations of the input subbands (X1(i), ..., XL(i)), computing one or more independent sound subbands representing signal components which are independent between the input subbands, by means of linear combinations of the input subbands (X1(i), ..., XL(i)), computing one or more localized direct sound subbands representing signal components which are contained in more than one of the input subbands and direction factors representing the ratios with which these signal components are contained in two or more input subbands, generating the output subband signals (Y1(i)...YM(i)), where each output subband signal is a linear combination of the independent sound subbands and the localized direct sound subbands, converting the output subband signals (Y1(i)...YM(i)), to time domain audio signals (y1,..., yM).

Description

Produce the method for multi-channel audio signal from stereophonic signal
Background technology
Many technological innovations beyond the stereophony are because cost, can't implementation (for example, the number of loud speaker) fail, and last but be not the demand that least importantly is used for back compatible.And 5.1 adopted by the user widely around the multichannel audio system, in addition, this system is with regard to the number of loud speaker, and since the restriction of back compatible (a preceding left side is placed on the angle identical with stereophony with right loud speaker, promptly+/-30 °, cause narrow positive virtual sound level) be half measure.
The fact further most of audio contents in two-channel stereo format are available.For the audio system that strengthens stereo sound experience in addition, compare with legacy system, can play the stereo audio content by means of the experience that improves therefore be crucial yearningly.
Realized using more preceding loud speaker to improve virtual sound level for a long time for the listener in the central point that not exclusively is arranged on club face in addition.For improving the purpose of result's existence through plural loudspeaker plays stereophonic signal.Especially, there are many concerns for utilizing extra center loudspeaker to play stereophonic signal.But the improvement of these technology on the played in stereo of routine is not clear enough, and they are widely used.The major limitation of these technology is that they only consider the position, and do not consider other aspect clearly, such as surrounding environment and listener's envelope.In addition, the situation theory after these technology is based on a virtual information source situation, also limits its performance when many information sources side by side come across different directions.
But these weakness are to excite the technology of the spatial decomposition of stereo audio signal to overcome by what propose in this manual with passing through the use perception.Provide this and decompose, can present audio signal for the capable array of loud speaker, loud speaker and the wave field synthesis system that increase number.
The technology of this proposition is for by means of more sound channel stereophonic signal (two sound channels) being converted to audio signal without limits.But the signal that normally, has L sound channel can be converted into the signal with M sound channel.This signal can or stereo, or purpose is the multi-channel audio signal that is used to play, perhaps they can be unprocessed microphone signals, perhaps the linear combination of microphone signal.It also illustrates this technology and how to be applied to microphone signal (for example, ambiophony sound B form), and matrix be used in various loud speaker general layouts, reproducing these around following mixed frequency signal.
When we mentioned stereo or have the multi-channel audio signal of many sound channels, we refer to when we mention many (monophone) audio signals was identical.
Summary of the invention
According to the main embodiment that is applied to a plurality of audio signals, it has proposed from a plurality of input audio signal (x 1..., x L) a plurality of output audio signal (y of middle generation 1..., y M), wherein Shu Chu number equals or is higher than the number of input signal, and this method comprises step:
-utilize and import sub-band X 1(i) ..., X LThe mode of linear combination (i) is calculated one or more independently sound sub-bands of representing signal component, and this signal component is independently between the input sub-band;
-utilize and import sub-band X 1(i) ..., X LThe mode of linear combination (i), calculate the direct sound wave sub-band of one or more parts of expression signal component, this signal component be comprised in the input sub-band more than one in, with the direction factor of expression ratio, these signal components are included in this ratio in two or more input sub-bands;
-produce and export sub-band signal Y 1(i) ... Y M(i), here each output sub-band signal is the independently linear combination of sound sub-band and local direct sound wave sub-band;
-will export sub-band signal Y 1(i) ... Y M(i) be converted to time-domain audio signal y 1... y M
This index i is the index of the sub-band of consideration.According to first embodiment, this method can each audio track a sub-frequency bands and using only, even the more sub-band of each sound channel provides the better sound result.
The scheme of this proposition is based on following reason.Many input audio signal x 1..., x LBe broken down into the signal component of expression sound, this sound be between audio track and signal component independently, this signal component is illustrated between the audio track relevant sound.This different consciousness effect that is the signal component by these two types has inspires.This independently signal component represent the information of relevant information source width, listener's envelope and surrounding environment, and should relevant (subordinate) signal component represent the position or the sense of hearing ground direct sound wave of auditory events.For each relevant signal component, there is relevant directional information, it can be by one than value representation, and this sound is included in many audio input signals with this ratio.When loud speaker (perhaps headphone) is gone up broadcast,, the purpose of reproducing specific auditory space image decomposes the many audio output signals of generation for can providing this.Should relevant signal component be rendered as output signal (y 1..., y M), make it by the directional perception of listener from expectation.This independently signal component be rendered as output signal (loud speaker), make it simulate the consciousness effect of non-direct sound wave and its expectation.This function of describing on high standard is to extract spatial information from input audio signal, and this spatial information is transformed to the spatial information that has the parameter of expectation in this output channels.
Description of drawings
Because additional accompanying drawing will be understood the present invention better, wherein:
Fig. 1 illustrates the setting of standard boombox;
Fig. 2 illustrates the position for the auditory events of the perception of the different level difference value of two relevant loudspeaker signals, determines the position of the auditory events between present two loud speakers at the level between a pair of relevant loudspeaker signal and time difference;
Fig. 3 (a) illustrates the early reflection of sending from the side loud speaker with auditory events expansion effect;
Fig. 3 (b) illustrates the later stage reflection of sending from the side loud speaker that relates to as more environment of listener's envelope;
Fig. 4 illustrates stereophonic signal and side-reflected mode of hybrid analog-digital simulation direct sound wave;
It is the T/F tiling demonstration of sub-band with signal decomposition that Fig. 5 illustrates expression as the function of time;
Fig. 6 illustrates the normalization power of direction factor A and S and AS;
Fig. 7 illustrates least square estimation weight w 1And W 2, and the back scale factor that is used to calculate estimation s;
Fig. 8 illustrates least square estimation weight w 3And w 4, and be used for calculating estimation N 1Back scale factor;
Fig. 9 illustrates least square estimation weight w 5And w 6, and be used for calculating estimation N 2Back scale factor;
Figure 10 illustrates s, A, the n of estimation 1And n 2
Figure 11 illustrates ± 30 ° of virtual sound levels (a) is converted to the have loudspeaker array virtual sound level of width in slit of (b);
Figure 12 illustrate loud speaker to select 1 with the factor a relevant with the stereophonic signal level difference 1And a 2
Figure 13 illustrates the plane wave that sends via a plurality of loud speakers;
Figure 14 illustrates the virtual sound level that ± 30 ° of virtual sound levels (a) is converted to the width in the slit with loudspeaker array, and sound improves listener's envelope by sending independently from side loudspeaker (b);
Figure 15 illustrate for as eight signals that generation is set in Figure 14 (b);
Figure 16 illustrates each signal corresponding to the preceding sound level that is interpreted as virtual source.This independently horizontal sound is used as plane wave (virtual source in the far field) and sends;
Figure 17 illustrates quadraphonic sound system (a) and expands to for more loud speaker (b) and use.
Embodiment
The space is listened attentively to boombox and is play
The scheme of this proposition inspires the description for the important condition of two input sound channels (stereo audio input) and M audio frequency output channels (M 〉=2).After a while, how its description will be applied to the situation of more conventional L input sound channel with the identical reason of deriving in the example of stereo input signal.
The most normally used user's Play System that is used for space audio is boombox setting as shown in Figure 1.Two loud speakers are placed on this listener's left side and front, right side.Usually, these loud speakers are arranged on the circle with angle-30 ° and+30 °.The width of the auditory space image of perception when listening to such stereo playing system is approximate to be limited between two loud speakers and the zone after two loud speakers.
When listening to naturally and working as the sound of listening to reproduction, the auditory space image of this perception mainly depends on the ears position indicating, that is, and and coherence (IC) between level difference (ILD) and ear between interaural difference (ITD), ear.In addition, its elevation angle that perception has been shown is relevant with monaural prompting.
Make that the ability of playing the auditory space image that generates the simulation sound level by means of boombox is possible by the consciousness phenomenon of position summation, promptly, by being controlled at level and/or the time difference between the signal of giving this loud speaker, auditory events can with the loud speaker of listener front between any angle occur.In nineteen thirty, Blumlein recognizes the power of this principle, and the present famous relevant stereophonic patent of his application.The position summation is based on the following fact, that is, the approximate crudely dominant prompting of ITD that on ear, causes and ILD prompting, if physical resource is located on the direction of the auditory events that occurs between the loud speaker, it will occur.
Fig. 2 illustrates the position for the auditory events of the different level difference perception of two relevant loudspeaker signals.When a left side and right loudspeaker signal are concerned with, have identical level, and when postpone differing from, an auditory events appears at two central authorities between the loud speaker, as illustrational by the zone among Fig. 21.By in a side, for example improve level on the right side, this auditory events moves to as by 2 illustrational those sides of the zone in Fig. 2.Under unusual situation, when only the signal on the left side is effective, appear at locational this auditory events of left speaker as illustrational by the zone in Fig. 23.Can control the position of this auditory events similarly by the delay of change between loudspeaker signal.When this loud speaker in the front not the listener, be controlled at loud speaker between the described principle in auditory events position also be applicatory.But, for some restricted applications of loud speaker in listener's side.
As in Fig. 2 illustrated, the position summation can be used for simulating a kind of situation, and different here instruments is being positioned on the virtual sound level on the different directions, that is, and and in the zone between two loud speakers.Hereinafter, except can the control position, how description can control other attribute.
As one man important venue acoustics is to consider the reflection that arrives on the listener from the side, that is, and and sideswipe.Original sideswipe has been shown has had the effect that enlarges auditory events.It is constant having effect less than the approximately primary reflection of the delay of 80ms approximate, and therefore, has been defined in the concrete measure of considering the lateral part that primary reflection is represented in this scope.This lateral part is the ratio of horizontal acoustic energy to total acoustic energy, and total acoustic energy is after direct sound wave arrives at, and obtains in initial 80ms, and measures the width of auditory events.
Be used to imitate early stage side-reflected experimental facilities in Fig. 3 (a) illustrated.This direct sound wave sends from center loudspeaker, and independently early reflection is sent from left side and right speaker.When early stage side-reflected relative intensity improved, the width of this auditory events increased.
After direct sound wave arrived at, the above sideswipe of 80ms tended to help more the perception of environment except that auditory events itself.This is tangible on the meaning of " envelope " of often representing listener's envelope or " broad environment ".Also measure the reflection in the later stage of listener's envelope degree applicable to confession as the similar measure of the lateral part that is used for early reflection.This measures the transverse energy part in expression later stage.Can be with the sideswipe that the simulation later stage is set shown in Fig. 3 (b).This direct sound wave sends from center loudspeaker, and independently the reflection in later stage is sent from left side and right speaker.When the side-reflected relative intensity in later stage improved, the sensing of this listener's envelope increased, simultaneously the width of this auditory events be expect affected hardly.
Stereophonic signal is recorded or mixes, make for each information source, this signal enters left side and right-side signal sound channel with specific direction prompting (level difference, time difference) consistently, and the independently signal of reflection/repercussion enters the sound channel of determining auditory events width and the prompting of listener's envelope.Further discussing mixing and recording technique is beyond the scope of this specification.
The spatial decomposition of stereophonic signal
With use on the contrary from the direct sound wave of true information source, as in Fig. 3 illustrated, people can use the direct sound wave corresponding to the virtual source of utilizing the position summation to produce.The auditory events of perception is represented in this shadow region.That is to say, can only realize by means of two loud speakers as experiment shown in Figure 3.These are in Fig. 4 illustrated, and signal s simulation here comes the direct sound wave of the definite direction of free factor a.This independently signal n1 and n2 corresponding to sideswipe.The situation of this description is by means of the natural decomposition of an auditory events for stereophonic signal,
x 1(n)=s(n)+n 1(n) x 2(n)=as(n)+n 2(n)
(1)
Catch the position and the width of this auditory events and listener's envelope.
In order to decompose, under an auditory events situation, it is not only effectively, but, having the non-static case of a plurality of effective information sources simultaneously, the decomposition of this description is independently at many frequency band ranges with realize in the time adaptively,
X 1(i,k)=S(i,k)+N 1(i,k) X 2(i,k)=A(i,k)S(i,k)+N 2(i,k)
(2)
Here i is the sub-band index, and k is the sub-band time index.This promptly, shows this signal S, N in each the T/F tiling with index i and k in Fig. 5 illustrated 1, N 2Estimated independently with direction factor A.For mark for simplicity, this sub-band and time index are left in the basket hereinafter usually.We use sub-band to decompose by means of consciousness ground exciton band bandwidth, that is, the bandwidth of sub-band is selected to equal a critical band.About every 20ms estimation S, N in each sub-band 1, N 2With direction factor A.
Notice that in general, people also can consider the time difference of direct sound wave in formula (2).That is to say that people are the service orientation factors A not only, and the service orientation delay, this direction postpones to be defined as having S and is included in X 1And X 2In delay.In the following description, we do not consider above-mentioned delay, still, should be understood that this analysis can easily expand to the above-mentioned delay of consideration.
Provide stereo sub-band signal X 1And X 2, this target is to calculate S, N 1, N 2Estimated value with A.X 1The estimated value in short-term of power be expressed P x 1 ( i , k ) = E { X 1 2 ( i , k ) } . For other signal, use identical agreement, that is, and P X2, Ps and P N=P N1=P N2It is power estimated value in short-term accordingly.N 1And N 2Power to be assumed to be identical, that is, suppose that laterally independently amount of sound is the same to a left side with the right side.
Notice, can use except that P N=P N1=P N2Outside other hypothesis.For example, A 2P N1=P N2
Estimation Ps, A and P N
The sub-band that provides this stereophonic signal is represented, calculates this power (P X1, P X2) and standardized cross-correlation.Standardized cross-correlation between a left side and the right side is:
Φ ( i , k ) = E { X 1 ( i , k ) X 2 ( i , k ) } E { X 1 2 ( i , k ) } E { X 2 2 ( i , k ) } - - - ( 3 )
A, Ps and P NBe calculated as the P of estimation X1, P X2Function with Ф.Relate to the sixth of the twelve Earthly Branches and know and three formula of unknown variable are:
Px 1=P S+P N Px 2=A 2P S+P N Φ = aS Px 1 Px 2 - - - ( 4 )
These formula are obtained A, P SAnd P N, obtain:
A = B 2 C P S = 2 C 2 B P N = X 1 - 2 C 2 B - - - ( 5 )
And
B = Px 2 - Px 1 + ( Px 1 - Px 2 ) 2 + 4 Px 1 Px 2 Φ 2 C = Φ Px 1 Px 2 - - - ( 6 )
S, N 1And N 2Least square estimation.
Next, S, N 1And N 2Least square estimation be calculated as A, Ps and P NFunction.For each i and k, this signal S is estimated as:
S ^ = ω 1 X 1 + ω 2 X 2 = ω 1 ( S + N 1 ) + ω 2 ( AS + N 2 ) - - - ( 7 )
Here ω 1And ω 2It is real-valued weight.This estimation error is:
E=(1-ω 12A)S-ω 1N 12N 2 (8)
When this error E is to be orthogonal to X 1And X 2The time, this weights omega 1And ω 2In the lowest mean square sensing, be best, that is,
E{EX 1}=0 E{EX 2}=0 (9)
Obtain two formula,
(1-ω 12A)P S1P N=0,
A(1-ω 12A)P S2P N=0 (10)
This is extremely heavily from wherein being calculated,
ω 1 = P S P N ( A 2 + 1 ) P S P N + P N 2 ω 2 = AP S P N ( A 2 + 1 ) P S P N + P N 2 - - - ( 11 )
Similarly, N 1And N 2Estimated.N 1Estimated value be:
N ^ 1 = ω 3 X 1 + ω 4 X 2 = ω 3 ( S + N 1 ) + ω 4 ( AS + N 2 ) - - - ( 12 )
This estimation error is:
E=(-ω 34A)S-(1-ω 3)N 12N 2 (13)
Equally, calculating this weight makes this estimation error be orthogonal to X 1And X 2, the result forms:
ω 3 = A 2 P S P N + P N 2 ( A 2 + 1 ) P S P N + P N 2 ω 4 = - AP S P N ( A 2 + 1 ) P S P N + P N 2 - - - ( 14 )
Be used to calculate N 2The weight of least square estimation be:
N ^ 2 = ω 5 X 1 + ω 6 X 2 = ω 5 ( S + N 1 ) + ω 6 ( AS + N 2 ) - - - ( 15 )
Be
ω 5 = - A P S P N ( A 2 + 1 ) P S P N + P N 2 ω 6 = P S P N + P N 2 ( A 2 + 1 ) P S P N + P N 2 - - - ( 16 )
Back scale
Provide the least square estimation, these are made and estimate by (selectively) back scale
Figure A20068003222800154
Figure A20068003222800155
Power equal Ps and PN=P N1=P N2
Figure A20068003222800156
Power be:
P S · = ( ω 1 + a ω 2 ) 2 P S + ( ω 1 2 + ω 2 2 ) P N - - - ( 17 )
Therefore, for by means of by the power Ps of scale,
Figure A20068003222800158
Obtain the estimated value of S:
S ^ ′ = P N ( ω 1 + a ω 2 ) 2 P S + ( ω 1 2 + ω 2 2 ) P N S ^ - - - ( 18 )
By means of similar reason,
Figure A200680032228001510
With
Figure A200680032228001511
By scale, that is,
N ^ ′ 1 = P N ( ω 3 + a ω 4 ) 2 P S + ( ω 3 2 + ω 4 2 ) P N N ^ 1
N ^ ′ 2 = P N ( ω 5 + a ω 6 ) 2 P S + ( ω 5 2 + ω 6 2 ) P N N ^ 2 - - - ( 19 )
Numerical examples
The normalization power of this direction factor A and S and AS is shown as the function of stereophonic signal level difference and Ф in Fig. 6.
Be used to calculate the weights omega of the least square estimated value of S 1And ω 2On Fig. 7, be illustrated as the function of stereophonic signal level difference and Ф in two plates.Be used for
Figure A200680032228001514
Back scale factor shown in the bottom end plate.
Be used to calculate N 1Least square estimation and the weights omega of corresponding back scale factor (19) 3And ω 2In Fig. 7, be shown the function of stereophonic signal level difference and Ф.
Be used to calculate N 2Least square estimation and the weights omega of corresponding back scale factor (19) 5And ω 6In Fig. 7, be illustrated as the function of stereophonic signal level difference and Ф.
An example that utilizes singer placed in the middle to be used for the stereo rock music folder of spatial decomposition shown in Figure 10.S, A, n 1And n 2Estimated value be illustrated.At this signal shown in the time domain, and for each T/F tiling illustrates A.With horizontal sound n independently 1And n 2Compare, the direct sound wave s of this estimation is relatively strong, because singer placed in the middle is in the highest flight.
The stereophonic signal of playing decomposition is set in different broadcasts
Provide the spatial decomposition of stereophonic signal, that is, and the direct sound wave of the part that is used to estimate
Figure A20068003222800161
Direction factor A and horizontal sound independently
Figure A20068003222800162
With
Figure A20068003222800163
Sub-band signal, people can define relevant how from different broadcast settings send corresponding to
Figure A20068003222800164
With
Figure A20068003222800165
The rule of signal component.
A plurality of loud speakers are in listener's front
Figure 11 illustrates the situation of illustrating.At the virtual sound level width φ shown in the part (a) of this accompanying drawing 0Virtual sound level width φ ' shown in=30 ° of parts (b) that are scaled at this accompanying drawing 0, this virtual sound level width φ ' 0Quilt is by means of a plurality of loudspeaker reproduction.
The independently horizontal sound of this estimation
Figure A20068003222800166
With
Figure A20068003222800167
Sent from the loud speaker on this side, for example, the loud speaker 1 and 6 in Figure 11 (b).That is to say that because the horizontal sound that sends from the side is high more, surrounding the listener into, this sound is effective more clearly.Provide the direction factor A of estimation, use " stereo sinusoidal law " (perhaps with other law of the angle of the relevant perception of A) estimation auditory events with respect to ± φ 0The angle φ of virtual sound level,
φ = sin - 1 ( A - 1 A + 1 sin φ 0 ) - - - ( 20 )
This angle by convergent-divergent linearly calculating angle with respect to the sound level that enlarges,
φ ′ = φ ′ 0 φ 0 φ - - - ( 21 )
The loud speaker that centers on φ ' is to selected.In the illustrational example of Figure 11 (b), this is to having sign 4 and 5.Be used for this loud speaker between shake the relevant angle γ of amplitude 0And γ 1Be defined as in the drawings and illustrate.If the loud speaker of this selection is to having sign 1 and 1+1, so, this signal that these loud speakers provide is:
a 1 1 + A 2 S
a 2 1 + A 2 S - - - ( 22 )
Here shake factor a by means of stereo sinusoidal law (perhaps other amplitude is shaken law) calculating and standardization amplitude 1And a 2, make a 1 2 + a 2 2 = 1 ,
a 1 = 1 1 + C 2 a 2 = C 1 + C 2 - - - ( 23 )
And
C = sin ( γ 0 + γ ) sin ( γ 0 - γ ) - - - ( 24 )
The factor in (22)
Figure A20068003222800178
Be such, the gross power of these signals equals in this stereophonic signal the gross power of relevant component S and AS.Alternatively, people can the use amplitude shake law, and it side by side gives signal to plural loud speaker.
Figure 12 illustrates and is used for loud speaker and goes up the φ ' for M=8 loud speaker to l and l+1 with in angle { 30 ° ,-20 ° ,-12 ° ,-4 °, 4 °, 12 °, 20 °, 30 ° } 00=30 ° amplitude is shaken factor a 1And a 2The example of selecting.
Provide above reason, each T/F tiling of this output signal sound channel shows that i and k are calculated as:
Y m = δ ( m - 1 ) N ^ ′ 1 + δ ( m - M ) N ^ ′ 2 + ( δ ( m - l ) a 1 + δ ( m - l - 1 ) a 2 ) 1 + A 2 S ^ ′ - - - ( 25 )
Here
δ ( m ) = 1 for m = 0 0 otherwise - - - ( 26 )
And m is output channels sign 1≤m≤M.The sub-band signal of this output channels is converted back to time domain, and forms output channels y 1To y MHereinafter, this last step is not always mentioned once more clearly.
The restriction of the scheme of this description is, when the listener is on a side, when for example approaching loud speaker 1, and relatively from the horizontal acoustic phase of opposite side, this laterally independently sound will arrive him with bigger intensity.In order to produce the purpose of two transverse plane ripples, this problem can be evaded by send laterally independently sound from all loud speakers.These are in Figure 13 illustrated.Be somebody's turn to do horizontal independently sound quilt along with the delay of simulating plane wave with certain direction is given to all loud speakers,
Y m ( i , k ) = N ^ ′ 1 ( i , k - ( m - 1 ) d ) M + N ^ ′ 2 ( i , k - ( M - m ) d ) M +
( δ ( m - l ) a 1 + δ ( m - l - 1 ) a 2 ) 1 + A 2 S ^ ′ - - - ( 27 )
Here d postpones,
d = sf s sin α v - - - ( 28 )
S is the distance between equally spaced loud speaker, and v is a velocity of sound, f sBe the sub-band sample frequency, and ± α is two plane directions of wave travel.In our system, this sub-band sample frequency is not sufficiently high, makes d can be expressed as an integer.Therefore, we at first will With
Figure A20068003222800187
Be converted to time domain, we add its various delay versions to this output channels then.
Loud speaker plus side loud speaker before a plurality of
Previously described broadcast situation purpose is to enlarge virtual sound level, and purpose is to produce the sound level with the perception of listener's location independent.
Optionally, people can utilize the independently horizontal sound of two the independent loudspeaker plays that are arranged at the listener side more With
Figure A20068003222800192
, as in Figure 14 illustrated.± 30 ° of virtual sound levels (a) are converted to the have loudspeaker array virtual sound level of width in slit of (b).In addition, this laterally independently sound by being play from the side by means of the independent loud speaker that is used for stronger listener's envelope.What people expected is that these results form the stronger impression of listener's envelope.In this case, this output signal is also calculated by (25), and the indication that has sign 1 and M here is the loud speaker on the side.This loud speaker is in this situation selecting l and l+1, makes S ' give never to the signal with index 1 and M, because the whole width of this vitual stage only is projected to preceding loud speaker 2≤m≤M-1.
Figure 15 illustrates an example that is used for being provided with for the identical Musical clip of confession shown in Figure 14 eight signals of generation, and this spatial decomposition that is used for Musical clip is shown in Figure 10.Notice that singer in the highest flight placed in the middle is at two loudspeaker signal y of central authorities 4And y 5Between amplitude shake.
5.1 conventional circulating loudspeaker settings
Stereophonic signal is converted to 5.1 possibilities around the multi-channel audio signal of compatibility is to use the setting of having shown in Figure 14 (b) as loud speaker behind three preceding loud speakers arranging with 5.1 standard codes and two.In this case, this back loud speaker sends independently laterally sound, should be used for the reproducing virtual sound level by preceding loud speaker simultaneously.Informally listen to expression and compare with played in stereo, it is more significant when playing the audio signal of describing as listener's envelope.
Stereophonic signal is converted to 5.1 another possibilities around the signal of compatibility is to use as shown in figure 11 setting, this loud speaker is rearranged to mate 5.1 structures here.In this case, ± 30 ° vitual stage be expanded into around the listener ± 110 ° of vitual stages.
Wave field synthesizes Play System
At first, signal y 1, y 2... y MBy with as generation similarly being provided with of Figure 14 (b) illustrated.Then, for each signal y 1, y 2... y M, virtual source is defined in wave field synthesis system.Horizontal sound y independently 1And y MBy as sending as plane wave or the information source in the far field illustrational for M=8 among Figure 16.For mutual signal, the virtual source quilt is according to requiring with location definition.In the example shown in Figure 16, this distance changes for different information sources, and some information sources to be defined as be the front of sending array at sound, that is, and can be with special distance this virtual sound level of direction definition for each qualification.
The unitized scheme that is used for 2 to M conversions
Generally speaking, the loudspeaker signal that is used for any description scheme can be illustrated for:
Y=MN (29)
Here N comprises signal With
Figure A20068003222800202
Vector.This vector Y comprises all loudspeaker signals.This matrix M has many elements, makes that this loudspeaker signal in vector Y will be identical with what calculated by (25) or (27).As what select, different matrix M can be used filtering and/or different amplitudes to shake law and (for example, use plural loud speaker
Figure A20068003222800203
Shake) realize.For wave field synthesis system, all loudspeaker signals that this vector Y can comprise this system (normally>M).In this case, this matrix M also comprises delay, all-pass filter, and filter go usually to realize corresponding to
Figure A20068003222800204
With
Figure A20068003222800205
Sending of the wave field of relevant virtual source.In the claims, have delay, all-pass filter and/or be illustrated in the linear combination of element among the N usually as the relational expression of similar (29) of the filter of the matrix element of M.
Revise the audio signal of decomposing
The width of guide sound base
By revising the direction factor of estimation, for example, (i, k), people can control the width of virtual sound level to A.By with greater than 1 factor linear scale direction factor, the instrument that belongs to this sound level is further moved to the side.On the contrary can be by realizing with factor scale less than 1.Alternatively, the people's amplitude that can revise the angle that is used to calculate local direct sound wave is shaken law (20).
Be modified in local direct sound wave and the ratio between the sound independently
In order to control the numerical value of surrounding environment, people can the independently horizontal voice signal of scale
Figure A20068003222800211
With
Figure A20068003222800212
So that obtain surrounding environment more or less.Similarly, can utilize scale Signal is revised local direct sound wave aspect intensity.
Revise stereophonic signal
The number that people can also need not to increase sound channel is used to revise the decomposition that stereophonic signal proposes.Here, this purpose only is or revises the width of virtual sound level, perhaps at the direct sound wave of part and the ratio between the sound independently.In this case, the sub-band that is used for this stereo output is:
Y 1 = v 1 N ^ ′ 1 + v 2 S ^ ′ Y 2 = v 1 N ^ ′ 2 + v 2 v 3 A S ^ ′ - - - ( 30 )
Here this factor v 1And v 2Be used to be controlled at the ratio between sound independently and the local sound.For v 3≠ 1, same, the width of this sound level is modified (v and in this case, 2Be modified with compensation for v 3≠ 1 level variation aspect the sound of part).
Generally turn to plural input sound channel
Show in a word, be used for two input sound channel situations
Figure A20068003222800216
With Generation following (this is the purpose of lowest mean square estimation).This is sound independently laterally
Figure A20068003222800218
Be by from X 1Remove and be included in X equally 2In signal component calculate.Similarly,
Figure A20068003222800219
Be by from X 1Remove and be included in X equally 1In signal component calculate.Calculate this local direct sound wave
Figure A200680032228002110
, make it comprise and be present in X 1And X 2Signal component among both, and A is the amplitude ratio that calculates, Be comprised in X with this ratio 1And X 2In.A represents the direction of local direct sound wave.
As an example, description now has the scheme of four input sound channels.Suppose the loudspeaker signal x that has as in Figure 17 (a) illustrated 1To x 4Quadraphonic system be considered to expand to more broadcast sound channel as in Figure 17 (b) illustrated.With similar under two input sound channel situations, calculate independently sound sound channel.In this case, this is four (if perhaps wanting still less) signals
Figure A200680032228002112
With These signals by with as above for two described identical spirit of input sound channel situation under calculate.That is to say that this is sound independently
Figure A20068003222800221
Be by from X 1Remove or be included in equally X 2Perhaps X 4Signal component in (signal of adjacent quadrasonics loud speaker) is calculated.Similarly, calculate
Figure A20068003222800222
With
Figure A20068003222800223
For each sound channel of adjacent loud speaker to calculating local direct sound wave, that is,
Figure A20068003222800224
With
Figure A20068003222800225
Calculate this local direct sound wave
Figure A20068003222800226
Make it comprise and be present in X 1And X 2Signal component among both, and A12 is the amplitude ratio that calculates,
Figure A20068003222800227
Be included in X with this ratio 1And X 2In.A12 represents the direction of local direct sound wave.Because similarly reason is calculated
Figure A20068003222800228
With
Figure A20068003222800229
A 23, A 34And A 41In order in the system shown in Figure 17 (b), to play with 12 sound channels, With
Figure A200680032228002211
By from loud speaker with signal y 1, y 4, y 7And y 12Send.For preceding loud speaker y 1To y 4, similar algorithms is applied to for sending
Figure A200680032228002212
Two input sound channel situations, that is, and the loud speaker of the direction that approaches most to limit by A12 on
Figure A200680032228002213
Amplitude shake.Similarly, With
Figure A200680032228002215
Be used as A 23, A 34And A 41Function send from the loudspeaker array that points to three other sides.Alternatively, as under two input sound channel situations, can be used as plane wave and send this independently sound sound channel.Equally, by using for each loud speaker in Figure 17 (b) for the synthetic similar spiritual defining virtual source of the wave field of two input sound channel situations, to play on the wave field synthesis system of listener's loudspeaker array be possible having.Equally, this scheme can be similar to generalization (29), here in this case, vector N comprise all calculating independently with the sub-band signal of the sound sound channel of part.
Because similar reason, 5.1 multichannels can expand to five above main loudspeakers around audio system and play.But center channel needs to note especially because generate content usually here, amplitude shake be applied in left front and right front between (without central authorities).Sometimes amplitude shake also be applied in left front and central between, and between right front and central, perhaps side by side between all three sound channels.Compare with previously described quadrasonics example, this is different, here we used signal imitation supposition only adjacent loud speaker between have public signal component.Therefore perhaps people consider that these remove to calculate local direct sound wave, and perhaps simpler solution is to be mixed into two sound channels under three sound channels with the front, and to use this system description then be quadrasonics.
A kind of be used for will have the scheme expansion of two input sound channels simpler solution that is used for more input sound channel be, some sound channel between use scheme for two input sound channels heuristicly, the synthetic then decomposition that produces with calculated example under quadraphonic situation as
Figure A20068003222800231
A 12, A 23, A 34And A 41These broadcast can be used as the description for the quadrasonics situation.
Be used for the calculating of ambiophony sound loudspeaker signal
It is irrelevant around audio system that this ambiophony sound system is that characteristics are that signal and specific broadcast are provided with.Single order ambiophony sound system is a characteristic with following signal, and it is defined with respect to some P specific in the space:
W=S
X=S?cosψcosФ
Y=S?sinψcosФ
Z=S?sin
(31)
Here W=S is (omnidirectional) sound pressure signal in P.This signal X, Y and Z are the signals that obtains from dipole antenna in P, that is, these signals (source point is at a P here) in Cartesian coordinate direction x, y and z are directly proportional with particle rapidity.Angle ψ and Ф represent azimuth and the elevation angle (spherical polar coordinates) respectively.So-called " B form " signal is in addition to be used for W, X, Y and Z
Figure A20068003222800232
The factor be characteristic.
Be used for M signal on the broadcast system of M accustic channel three-dimension, playing in order to produce, calculate expression from eight direction x ,-x, y ,-y, z ,-signal of the sound that z obtains.This is to finish to obtain directivity (for example, heart-shaped curve) response by synthetic W, X, Y and Z, for example, and (31)
x 1=W+X x 3=W+Y x 5=W+Z
x 2=W-X x 4=W-Y x 6=W-Z
Provide these signals, as being used to calculate eight independently sound sub-band signals (perhaps if desired still less) for describing the similar reason of above quadraphonic system
Figure A20068003222800241
For example, this sound independently
Figure A20068003222800242
Be by from X 1Remove or be included in equally ground, space adjacent channels X 3, X 4, X 5Perhaps X 6In signal component calculate.In addition, just by adjacent between or the direct sound wave of three times input signal part and the direction factor of representing its direction.Provide this and decompose, as what describe in the quadrasonics example formerly, on loud speaker, send this sound similarly, perhaps common (29).
For the ambiophony sound system of two dimension,
W=S
X=S?cos?ψ
Y=S?sin?ψ (33)
The result forms four input signal x 1To x 4, this processing is similar to the quadraphonic system of description.
Matrix ring around decoding
Matrix ring is mixed down stereophonic signal around encoder under with multi-channel audio signal (for example, 5.1 around signal).This form of expression multi-channel audio signal is represented " matrix ring around ".For example, 5.1 can be by matrix encoder with mixing under the following mode (for the sake of simplicity, we ignore the low-frequency effect sound channel) around the sound channel of signal:
x 1 ( n ) = l ( n ) + 1 2 c ( n ) + j 1 2 l s ( n ) + j 1 6 r s ( n )
x 2 ( n ) = r ( n ) + 1 2 c ( n ) - j 1 2 r s ( n ) - j 1 6 l s ( n )
Here I, r, c, l sAnd r sRepresent left front, right front, central, a left back and right back sound channel respectively.J represents 90 degree phase shifts, and-j is-90 degree phase shifts.Other matrix encoder can use the modification of the following mixing of description.
With before changed described similarly for 2 to M sound channels, people can be applicable to spatial decomposition that matrix ring is around following mixed frequency signal.Therefore, for each sub-band, sound sub-band independently calculates local sound sub-band and direction factor at every turn.Independently the linear combination of sound sub-band and local sound sub-band is sent by each loud speaker from this surrounding system, that is to say, send the matrix decoding around signal.
Notice and since matrix around the out-phase component in the following mixed frequency signal, the standardized relevant negative value that adopts equally probably.If this situation, corresponding direction factor will be a negative value, be illustrated in the sound channel that sound in the original multi-channel audio signal derives from the back (under matrix before the mixing).
This decoding matrix around mode be very attractive because it has low complexity, and abundant simultaneously surrounding environment is that independently sound sub-band by estimation reproduces.Do not need to produce artificial surrounding environment, it is complete computable aggregate.
The embodiment details
In order to calculate sub-band signal, can use discrete (fast) Fourier transform (DFT).Reduce and the better number of the frequency band that excites of audio quality in order to reduce by complexity, this DFT frequency band can be synthesized and make each synthetic frequency band have the frequency resolution that the frequency resolution by the human auditory system excites.The processing procedure of this description is carried out for each synthetic sub-band then.Alternatively, can use quadrature mirror filter (QMF) group or any other non-cascade or the cascaded filter group.
Two minimum detectable signal types are signals of transient state and static state/tone.In order to illustrate both effectively, can use bank of filters with adaptive T/F resolution mode.With detected transient, and the temporal resolution of this bank of filters (perhaps alternatively, only this processing procedure) will be increased to handle this transient state effectively.Static/the signal component of tone is equally with detected, and the temporal resolution of this bank of filters and/or processing procedure will be lowered for such signal.As the criterion of the signal component that is used to detect stable/tone, people can use " tone measurement ".
Our embodiment of this algorithm uses fast Fourier transform (FFT).For the 44.1kHz sampling rate, we use the FFT size between 256 and 1024.The sub-band that we synthesize has the bandwidth of about human auditory system's twice critical bandwidth.This causes using about 20 synthetic sub-bands for the 44.1kHz sampling rate.
Example application
Television set
In order to play, for the benefit that obtains " stable centers " (for example, the film dialogue appears at the central authorities of screen, is used for all locational listeners) can produce center channel based on stereo audiovisual TV content.Alternatively, if want, stereo can be converted to 5.1 around.
Stereo to the multichannel boxcar
It is a kind of form of playing on plural loud speaker that is applicable to that conversion equipment will be changed audio content.For example, this box can be used to the stereo music player, and is connected to 5.1 speaker units.This user can have multiple choices: the stereo+center channel 5.1 with preceding vitual stage around, with have around the listener ± surrounding environment 5.1 of 110 ° of virtual sound levels around, perhaps all loudspeaker arrangements are used for better/wideer preceding vitual stage in front.
Such boxcar can be input as characteristic with stereo analog line input audio frequency input and/or digital SP-DIF audio frequency.This output or the output of multichannel circuit, perhaps digital audio output alternatively, for example, SP-DIF.
Equipment and device with improved broadcast performance
Around with regard to the audio content, such equipment and device will be supported improved broadcast with comparing traditionally with regard to or multichannel stereo with more loudspeaker plays.In addition, they can to support to change stereo audio content be that multichannel is around content.
The multi-channel loudspeaker device
Multi-channel loudspeaker device prospect has its audio input signal of conversion is used for the signal of each loud speaker for its characteristics performance.
Automobile audio
Automobile audio is a challenge topic.Owing to listener's position with owing to barrier (seat, each listener's human body), and be used for the restriction that loud speaker is placed, it is difficult to play stereo or multi-channel audio signal, makes them reproduce virtual sound level well.The algorithm of this proposition can be used to calculate the signal that is used to be arranged on the loud speaker on the specific position, makes virtual sound level be enhanced for the listener in the central point of club face not.
Other use field
The spatial decomposition that excites with having described the consciousness that is used for stereo and multi-channel audio signal.Laterally independently sound and local sound with and specific angle (perhaps level difference) in many sub-bands and as the function of time, estimated.Provide the signal imitation of a hypothesis, calculate the lowest mean square estimation of these signals.
In addition, how its stereophonic signal of having described this decomposition can be play on a plurality of loud speakers, loudspeaker array and wave field synthesis system.In addition, it has been described the spatial decomposition that proposes and how to be applied to the ambiophony acoustical signal form that " decoding " is used for the multi-channel loudspeaker broadcast.In addition, its outline principle of describing how to be applied to microphone signal, ambiophony sound B format signal and matrix around signal.

Claims (22)

  1. One kind from a plurality of input audio tracks (x1 ..., xL) produce a plurality of output audio sound channels (y1 ..., method yM), wherein the number of output channels equals or is higher than the number of input sound channel, the method comprising the steps of:
    -utilize and import sub-band X1 (i) ..., one or more independently sound sub-bands of representing signal component are calculated in the linear combination of XL (i), and this signal component is independently between the input sub-band;
    -utilize and import sub-band X1 (i), ..., the linear combination of XL (i), calculate the direct sound wave sub-band of one or more parts, its expression is comprised in the signal component in the more than one input sub-band, with the corresponding direction factor that calculates the expression ratio, these signal components are included in this ratio in two or more input sub-bands;
    -producing and export sub-band Y1 (i) ... YM (i) comprises step:
    -export sub-band to be set to zero;
    -for each sound sub-band independently, select the subclass of output sub-band, and these are added give the corresponding independently zoom version of sound sub-band;
    -select a pair of output sub-band for each direction factor, and these are added the zoom version of the direct sound wave sub-band of giving corresponding part;
    -will export sub-band, Y1 (i) ... YM (i) is converted to time-domain audio signal y1...yM.
  2. 2. according to the method for claim 1, wherein, at least one independently sound sub-band N (i) calculate by remove the signal component that also is present among another input sub-band one or more the sub-band from input, and on a pair of input sub-band of at least one selection
    Local direct sound wave sub-band S (i) calculates according to the signal component that is included in the input sub-band that belongs to accordingly right, and direction factor A (i) is calculated as a ratio, and direct sound wave sub-band S (i) is included in this ratio to be belonged in the corresponding right input sub-band.
  3. 3. according to the method for claim 1 or 2, wherein, sound sub-band N (i) independently, local direct sound wave sub-band S (i) and the calculating of direction factor A (i) are calculated as input sub-band X i(i) ... X L(i), input sub-band power and the input sub-band between the function of standardization cross-correlation.
  4. 4. according to the method for claim 1 to 3, wherein, independently the calculating of the direct sound wave sub-band S (i) of sound sub-band N (i) and part is input sub-band X 1(i) ... X L(i) linear combination, the weight of linear combination is here determined by means of the lowest mean square criterion.
  5. 5. according to the method for claim 4, wherein, the sub-band power of the independently sound sub-band N (i) of estimation and local direct sound wave sub-band S (i) is adjusted, make its sub-band power equal to be calculated as input sub-band power and import sub-band between the corresponding sub-band power of function of standardized cross-correlation.
  6. 6. according to the method for claim 1 to 5, wherein, input sound channel x 1... x LOnly be multi-channel audio signal x 1... x DThe subclass of sound channel, output channels y here 1... y MReplenished by input sound channel with non-processor.
  7. 7. according to the process of claim 1 wherein input sound channel x 1... x LWith output channels y 1... y MCorresponding to the signal that is used to be positioned at respect to the loud speaker on the specific specific direction of listening to the position, and the generation of output signal sub-band is as follows:
    Independently the linear combination of sound sub-band N (i) and local direct sound wave sub-band S (i) makes this output sub-band Y 1(i) ... Y M(i) according to following generation:
    Independently sound sub-band N (i) is mixed in the output sub-band, makes the predefined direction of simulation send corresponding sound;
    Local direct sound wave sub-band S (i) is mixed in the output sub-band, makes simulation send corresponding sound by the definite direction of corresponding direction factor A (i).
  8. 8. according to the method for claim 7, wherein, sound by sub-band signal being applied to simulate specific direction corresponding to the output sub-band of the loud speaker that approaches specific direction most.
  9. 9. according to the method for claim 7, wherein, be applied to simulate specific direction by the identical sub-band signal that will have different gains and sound corresponding to the output sub-band of two loud speakers that directly are adjacent to specific direction.
  10. 10. according to the method for claim 7, wherein, be applied to a plurality of output sub-bands by the identical filtering sub-band signal that will have specific delay and gain factor and simulate specific direction with the simulated sound wave field and sound.
  11. 11. method according to claim 1 to 10, wherein, this independently sound sub-band N (i), local sound sub-band S (i) and direction factor A (i) be modified attribute with the such width of the virtual sound level of control reproduction, and point to independently sound ratio.
  12. 12. according to the method for claim 1 to 11, wherein, the function that all method steps are used as the time repeats.
  13. 13. according to the method for claim 12, wherein, the repetition rate of this processing is applicable to specific input signal characteristics, such as, the existence of transient state or static signal component.
  14. 14. according to the method for claim 1 to 13, wherein, the number of the criterion chooser frequency band of the frequency resolution of use simulating human auditory system and corresponding sub-band bandwidth.
  15. 15. according to any one method of previous claim, wherein, this input sound channel is represented stereophonic signal, and this output channels is represented multi-channel audio signal.
  16. 16. according to the method for claim 1 to 14, wherein, the ambient signal of this input stereo audio sound channel representing matrix coding, and this output channels is represented multi-channel audio signal.
  17. 17. according to the method for claim 1 to 14, wherein, this input sound channel is a microphone signal, and this output channels is represented multi-channel audio signal.
  18. 18. according to the method for claim 1 to 14, wherein, this input sound channel is the linear combination of ambiophony sound B format signal, and this output channels is represented multi-channel audio signal.
  19. 19. according to the method for claim 1 to 18, wherein, this output multi-channel audio signal is represented the signal that is used for resetting on wave field synthesis system.
  20. 20. an audio conversion devices, wherein this equipment comprises that enforcement of rights requires the device of the step of a method in 1 to 19 the method.
  21. 21. according to the audio conversion devices of claim 20, wherein, this equipment is embedded in the audio frequency automotive system.
  22. 22. according to the audio conversion devices of claim 20, wherein, this equipment is embedded in TV or the system of cinema.
CN2006800322282A 2005-09-02 2006-09-01 Method to generate multi-channel audio signals from stereo signals Expired - Fee Related CN101341793B (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP05108078.6 2005-09-02
EP05108078A EP1761110A1 (en) 2005-09-02 2005-09-02 Method to generate multi-channel audio signals from stereo signals
PCT/EP2006/065939 WO2007026025A2 (en) 2005-09-02 2006-09-01 Method to generate multi-channel audio signals from stereo signals

Publications (2)

Publication Number Publication Date
CN101341793A true CN101341793A (en) 2009-01-07
CN101341793B CN101341793B (en) 2010-08-04

Family

ID=35820407

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2006800322282A Expired - Fee Related CN101341793B (en) 2005-09-02 2006-09-01 Method to generate multi-channel audio signals from stereo signals

Country Status (5)

Country Link
US (1) US8295493B2 (en)
EP (1) EP1761110A1 (en)
KR (1) KR101341523B1 (en)
CN (1) CN101341793B (en)
WO (1) WO2007026025A2 (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103430574A (en) * 2011-03-02 2013-12-04 弗兰霍菲尔运输应用研究公司 Apparatus and method for determining a measure for a perceived level of reverberation, audio processor and method for processing a signal
CN104394498A (en) * 2014-09-28 2015-03-04 北京塞宾科技有限公司 A three-channel holographic sound field playback method and sound field collecting device
CN104919822A (en) * 2012-11-15 2015-09-16 弗兰霍菲尔运输应用研究公司 Segment-wise adjustment of spatial audio signal to different playback loudspeaker setup
CN106796794A (en) * 2014-10-07 2017-05-31 高通股份有限公司 The normalization of environment high-order ambiophony voice data
CN107135460A (en) * 2012-03-28 2017-09-05 杜比国际公司 From the method and apparatus of high-order ambiophony sound audio signals decoding stereoscopic sound loudspeaker signal
CN108156561A (en) * 2017-12-26 2018-06-12 广州酷狗计算机科技有限公司 Processing method, device and the terminal of audio signal
CN108737930A (en) * 2017-04-17 2018-11-02 哈曼国际工业有限公司 Audible prompting in Vehicular navigation system
CN109089203A (en) * 2018-09-17 2018-12-25 中科上声(苏州)电子有限公司 The multi-channel signal conversion method and car audio system of car audio system
CN110800048A (en) * 2017-05-09 2020-02-14 杜比实验室特许公司 Processing of input signals in multi-channel spatial audio format
WO2020057050A1 (en) * 2018-09-17 2020-03-26 中科上声(苏州)电子有限公司 Method for extracting direct sound and background sound, and loudspeaker system and sound reproduction method therefor

Families Citing this family (49)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2000001B1 (en) * 2006-03-28 2011-12-21 Telefonaktiebolaget LM Ericsson (publ) Method and arrangement for a decoder for multi-channel surround sound
US9014377B2 (en) * 2006-05-17 2015-04-21 Creative Technology Ltd Multichannel surround format conversion and generalized upmix
US8712061B2 (en) * 2006-05-17 2014-04-29 Creative Technology Ltd Phase-amplitude 3-D stereo encoder and decoder
US8379868B2 (en) 2006-05-17 2013-02-19 Creative Technology Ltd Spatial audio coding based on universal spatial cues
US8345899B2 (en) * 2006-05-17 2013-01-01 Creative Technology Ltd Phase-amplitude matrixed surround decoder
US8374365B2 (en) * 2006-05-17 2013-02-12 Creative Technology Ltd Spatial audio analysis and synthesis for binaural reproduction and format conversion
RU2454825C2 (en) * 2006-09-14 2012-06-27 Конинклейке Филипс Электроникс Н.В. Manipulation of sweet spot for multi-channel signal
US8908873B2 (en) 2007-03-21 2014-12-09 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and apparatus for conversion between multi-channel audio formats
US8290167B2 (en) 2007-03-21 2012-10-16 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and apparatus for conversion between multi-channel audio formats
US9015051B2 (en) 2007-03-21 2015-04-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Reconstruction of audio channels with direction parameters indicating direction of origin
KR101439205B1 (en) * 2007-12-21 2014-09-11 삼성전자주식회사 Method and apparatus for audio matrix encoding/decoding
WO2010000313A1 (en) * 2008-07-01 2010-01-07 Nokia Corporation Apparatus and method for adjusting spatial cue information of a multichannel audio signal
KR101599534B1 (en) 2008-07-29 2016-03-03 엘지전자 주식회사 A method and an apparatus for processing an audio signal
JP5520300B2 (en) * 2008-09-11 2014-06-11 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ Apparatus, method and apparatus for providing a set of spatial cues based on a microphone signal and a computer program and a two-channel audio signal and a set of spatial cues
US8023660B2 (en) 2008-09-11 2011-09-20 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus, method and computer program for providing a set of spatial cues on the basis of a microphone signal and apparatus for providing a two-channel audio signal and a set of spatial cues
WO2010045869A1 (en) 2008-10-20 2010-04-29 华为终端有限公司 Method, system and apparatus for processing 3d audio signal
KR101499785B1 (en) 2008-10-23 2015-03-09 삼성전자주식회사 Method and apparatus of processing audio for mobile device
KR101008060B1 (en) * 2008-11-05 2011-01-13 한국과학기술연구원 Apparatus and Method for Estimating Sound Arrival Direction In Real-Time
CN102577440B (en) * 2009-07-22 2015-10-21 斯托明瑞士有限责任公司 Improve apparatus and method that are stereo or pseudo-stereophonic audio signals
CA2767988C (en) 2009-08-03 2017-07-11 Imax Corporation Systems and methods for monitoring cinema loudspeakers and compensating for quality problems
KR101567461B1 (en) 2009-11-16 2015-11-09 삼성전자주식회사 Apparatus for generating multi-channel sound signal
EP2360681A1 (en) 2010-01-15 2011-08-24 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for extracting a direct/ambience signal from a downmix signal and spatial parametric information
US9264813B2 (en) * 2010-03-04 2016-02-16 Logitech, Europe S.A. Virtual surround for loudspeakers with increased constant directivity
US8542854B2 (en) * 2010-03-04 2013-09-24 Logitech Europe, S.A. Virtual surround for loudspeakers with increased constant directivity
KR101673232B1 (en) 2010-03-11 2016-11-07 삼성전자주식회사 Apparatus and method for producing vertical direction virtual channel
US9426574B2 (en) * 2010-03-19 2016-08-23 Bose Corporation Automatic audio source switching
EP2578000A1 (en) * 2010-06-02 2013-04-10 Koninklijke Philips Electronics N.V. System and method for sound processing
RU2589377C2 (en) * 2010-07-22 2016-07-10 Конинклейке Филипс Электроникс Н.В. System and method for reproduction of sound
US9271081B2 (en) 2010-08-27 2016-02-23 Sonicemotion Ag Method and device for enhanced sound field reproduction of spatially encoded audio input signals
EP2523472A1 (en) 2011-05-13 2012-11-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method and computer program for generating a stereo output signal for providing additional output channels
KR102160248B1 (en) 2012-01-05 2020-09-25 삼성전자주식회사 Apparatus and method for localizing multichannel sound signal
US9020623B2 (en) 2012-06-19 2015-04-28 Sonos, Inc Methods and apparatus to provide an infrared signal
EP2901667B1 (en) 2012-09-27 2018-06-27 Dolby Laboratories Licensing Corporation Spatial multiplexing in a soundfield teleconferencing system
KR102149046B1 (en) * 2013-07-05 2020-08-28 한국전자통신연구원 Virtual sound image localization in two and three dimensional space
KR102231755B1 (en) * 2013-10-25 2021-03-24 삼성전자주식회사 Method and apparatus for 3D sound reproducing
US9749747B1 (en) * 2015-01-20 2017-08-29 Apple Inc. Efficient system and method for generating an audio beacon
US9678707B2 (en) 2015-04-10 2017-06-13 Sonos, Inc. Identification of audio content facilitated by playback device
EP3357259B1 (en) 2015-09-30 2020-09-23 Dolby International AB Method and apparatus for generating 3d audio content from two-channel stereo content
US9956910B2 (en) * 2016-07-18 2018-05-01 Toyota Motor Engineering & Manufacturing North America, Inc. Audible notification systems and methods for autonomous vehicles
EP3297298B1 (en) 2016-09-19 2020-05-06 A-Volute Method for reproducing spatially distributed sounds
US10542153B2 (en) 2017-08-03 2020-01-21 Bose Corporation Multi-channel residual echo suppression
US10200540B1 (en) * 2017-08-03 2019-02-05 Bose Corporation Efficient reutilization of acoustic echo canceler channels
US10594869B2 (en) 2017-08-03 2020-03-17 Bose Corporation Mitigating impact of double talk for residual echo suppressors
US10863269B2 (en) 2017-10-03 2020-12-08 Bose Corporation Spatial double-talk detector
CN108319445B (en) * 2018-02-02 2020-05-22 维沃移动通信有限公司 Audio playing method and mobile terminal
US11385859B2 (en) 2019-01-06 2022-07-12 Silentium Ltd. Apparatus, system and method of sound control
US10964305B2 (en) 2019-05-20 2021-03-30 Bose Corporation Mitigating impact of double talk for residual echo suppressors
GB2588801A (en) * 2019-11-08 2021-05-12 Nokia Technologies Oy Determination of sound source direction
CN112135227B (en) * 2020-09-30 2022-04-05 京东方科技集团股份有限公司 Display device, sound production control method, and sound production control device

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2000226583A1 (en) * 2000-02-18 2001-08-27 Bang And Olufsen A/S Multi-channel sound reproduction system for stereophonic signals
WO2004019656A2 (en) * 2001-02-07 2004-03-04 Dolby Laboratories Licensing Corporation Audio channel spatial translation
US7583805B2 (en) * 2004-02-12 2009-09-01 Agere Systems Inc. Late reverberation-based synthesis of auditory scenes
DE602004005846T2 (en) * 2003-04-17 2007-12-20 Koninklijke Philips Electronics N.V. AUDIO SIGNAL GENERATION
US7394903B2 (en) * 2004-01-20 2008-07-01 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal
US8204261B2 (en) * 2004-10-20 2012-06-19 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Diffuse sound shaping for BCC schemes and the like

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9672806B2 (en) 2011-03-02 2017-06-06 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for determining a measure for a perceived level of reverberation, audio processor and method for processing a signal
CN103430574B (en) * 2011-03-02 2016-05-25 弗劳恩霍夫应用研究促进协会 For determining apparatus and method, audio process and the method for the treatment of signal for the tolerance of reverberation perception level
CN103430574A (en) * 2011-03-02 2013-12-04 弗兰霍菲尔运输应用研究公司 Apparatus and method for determining a measure for a perceived level of reverberation, audio processor and method for processing a signal
CN107135460A (en) * 2012-03-28 2017-09-05 杜比国际公司 From the method and apparatus of high-order ambiophony sound audio signals decoding stereoscopic sound loudspeaker signal
CN107135460B (en) * 2012-03-28 2019-11-15 杜比国际公司 From the method and apparatus of high-order ambiophony sound audio signals decoding stereoscopic sound loudspeaker signal
US11172317B2 (en) 2012-03-28 2021-11-09 Dolby International Ab Method and apparatus for decoding stereo loudspeaker signals from a higher-order ambisonics audio signal
US10433090B2 (en) 2012-03-28 2019-10-01 Dolby International Ab Method and apparatus for decoding stereo loudspeaker signals from a higher-order ambisonics audio signal
US9805726B2 (en) 2012-11-15 2017-10-31 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Segment-wise adjustment of spatial audio signal to different playback loudspeaker setup
CN104919822B (en) * 2012-11-15 2017-07-07 弗劳恩霍夫应用研究促进协会 Segmented adjustment to the spatial audio signal of different playback loudspeaker groups
CN104919822A (en) * 2012-11-15 2015-09-16 弗兰霍菲尔运输应用研究公司 Segment-wise adjustment of spatial audio signal to different playback loudspeaker setup
CN104394498B (en) * 2014-09-28 2017-01-18 北京塞宾科技有限公司 A three-channel holographic sound field playback method and sound field collecting device
CN104394498A (en) * 2014-09-28 2015-03-04 北京塞宾科技有限公司 A three-channel holographic sound field playback method and sound field collecting device
CN106796794A (en) * 2014-10-07 2017-05-31 高通股份有限公司 The normalization of environment high-order ambiophony voice data
CN108737930A (en) * 2017-04-17 2018-11-02 哈曼国际工业有限公司 Audible prompting in Vehicular navigation system
CN110800048A (en) * 2017-05-09 2020-02-14 杜比实验室特许公司 Processing of input signals in multi-channel spatial audio format
CN110800048B (en) * 2017-05-09 2023-07-28 杜比实验室特许公司 Processing of multichannel spatial audio format input signals
US10924877B2 (en) 2017-12-26 2021-02-16 Guangzhou Kugou Computer Technology Co., Ltd Audio signal processing method, terminal and storage medium thereof
CN108156561A (en) * 2017-12-26 2018-06-12 广州酷狗计算机科技有限公司 Processing method, device and the terminal of audio signal
CN109089203A (en) * 2018-09-17 2018-12-25 中科上声(苏州)电子有限公司 The multi-channel signal conversion method and car audio system of car audio system
WO2020057050A1 (en) * 2018-09-17 2020-03-26 中科上声(苏州)电子有限公司 Method for extracting direct sound and background sound, and loudspeaker system and sound reproduction method therefor

Also Published As

Publication number Publication date
WO2007026025A2 (en) 2007-03-08
US8295493B2 (en) 2012-10-23
WO2007026025A3 (en) 2007-04-26
EP1761110A1 (en) 2007-03-07
CN101341793B (en) 2010-08-04
KR101341523B1 (en) 2013-12-16
KR20080042160A (en) 2008-05-14
US20080267413A1 (en) 2008-10-30

Similar Documents

Publication Publication Date Title
CN101341793B (en) Method to generate multi-channel audio signals from stereo signals
CN101263741B (en) Method of and device for generating and processing parameters representing HRTFs
CN102395098B (en) Method of and device for generating 3D sound
CN101843114B (en) Method, apparatus and integrated circuit for focusing on audio signal
TWI646847B (en) Method and apparatus for enhancing directivity of a 1st order ambisonics signal
Avendano et al. Frequency domain techniques for stereo to multichannel upmix
Laitinen et al. Binaural reproduction for directional audio coding
CN113170271B (en) Method and apparatus for processing stereo signals
CN104170408A (en) A method of applying a combined or hybrid sound -field control strategy
JP2006519406A (en) Method for reproducing natural or modified spatial impressions in multi-channel listening
Lee Multichannel 3D microphone arrays: A review
US10375472B2 (en) Determining azimuth and elevation angles from stereo recordings
CN103535052A (en) Apparatus and method for a complete audio signal
Floros et al. Spatial enhancement for immersive stereo audio applications
CN109923877A (en) The device and method that stereo audio signal is weighted
Laitinen et al. Using spaced microphones with directional audio coding
Cobos et al. Interactive enhancement of stereo recordings using time-frequency selective panning
Peters et al. Sound spatialization across disciplines using virtual microphone control (ViMiC)
Kelly Subjective Evaluations of Spatial Room Impulse Response Convolution Techniques in Channel-and Scene-Based Paradigms
Lee et al. A 3-D sound orchestra in the cyber space
Zea Binaural monitoring for live music performances
Masiero et al. EUROPEAN SYMPOSIUM ON ENVIRONMENTAL ACOUSTICS AND ON BUILDINGS ACOUSTICALLY SUSTAINABLE
AUDIO—PART AES 40th INTERNATIONAL CONfERENCE

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20100804

Termination date: 20180901

CF01 Termination of patent right due to non-payment of annual fee