CN101977349A - Decoding optimizing and improving method of Ambisonic voice repeating system - Google Patents

Decoding optimizing and improving method of Ambisonic voice repeating system Download PDF

Info

Publication number
CN101977349A
CN101977349A CN2010102989354A CN201010298935A CN101977349A CN 101977349 A CN101977349 A CN 101977349A CN 2010102989354 A CN2010102989354 A CN 2010102989354A CN 201010298935 A CN201010298935 A CN 201010298935A CN 101977349 A CN101977349 A CN 101977349A
Authority
CN
China
Prior art keywords
theta
sigma
phi
function
optimization
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010102989354A
Other languages
Chinese (zh)
Inventor
龚惠哲
赵越喆
吴硕贤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
South China University of Technology SCUT
Original Assignee
South China University of Technology SCUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by South China University of Technology SCUT filed Critical South China University of Technology SCUT
Priority to CN2010102989354A priority Critical patent/CN101977349A/en
Publication of CN101977349A publication Critical patent/CN101977349A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Stereophonic System (AREA)

Abstract

The invention relates to a decoding optimizing and improving method of an Ambisonic voice repeating system, which comprises the steps of: respectively establishing a direction optimizing target function, a loudness optimizing target function and a definition optimizing target function through calculating velocity vectors, sound energy flow vectors and loudness of a virtual sound image according to a sound image positioning principle, and introducing an amplitude optimizing target function of directivity of a virtual microphone into a comprehensive optimizing target function; determining the comprehensive optimizing target function with optimal fitness value through an iteration operation by adopting an optimizing algorithm, and obtaining an optimal decoding result; controlling the introduction of the amplitude optimizing target function of directivity of the virtual microphone to obtain a decoding result the same phase with each speaker, and furthest reducing the mixing of repeated voice in the space direction, wherein the optimizing convergence speed is increased and the operation time is shortened. The invention is suitable for a decoding process of the Ambisonic voice repeating system in any order, any speaker quantity and any arrangement mode.

Description

The optimization of Ambisonic sound reproduction system decodes is improved one's methods
Technical field
The present invention relates to the improvement of Ambisonic multipath three dimensional sound repeater system decoding technique, specifically be meant and utilize synthetic virtual microphone and directive property control technology thereof, Ambisonic sound reproduction system decodes method based on the vector composition principle is optimized improvement, improve the synthetic precision of virtual sound image, improve the sound reproduction quality of Ambisonic system.
Background technology
Plane wave can be decomposed and reconstruct by a series of orthogonal sphere hamonic functions.Use spheric harmonic function
Figure BSA00000291746700011
It is the theoretical foundation of Ambisonic sound reproduction system that plane wave is decomposed with reconstruct.The exponent number of spheric harmonic function is high more, and the precision of decomposition or reconstruct is just high more.Use spheric harmonic function to s signal plane ripple S (θ S, φ S) sample, can obtain original Ambisonic code signal
Figure BSA00000291746700012
A m , n σ = Y m , n σ ( θ S , φ S ) · s - - - ( 1 )
In the formula, m is the spheric harmonic function exponent number, m 〉=0,0≤n≤m, σ=± 1, θ SAnd φ SGlancing incidence angle and vertical incidence angle for plane wave s.Utilize the loud speaker of some to form specific array, simultaneously Ambisonic code signal branch is equipped with different weights and reconfigures, flow to the loud speaker of diverse location, can reconstruct plane wave S (θ in the loudspeaker array center S, φ S).Flow to orientation angles and be (θ j, φ j) the recurrent signal g of j loud speaker jCan obtain by formula (2), wherein
Figure BSA00000291746700014
It is the code signal of corresponding j loud speaker
Figure BSA00000291746700015
The weight coefficient group, and
Figure BSA00000291746700016
g j = Σ 0 ≤ m ≤ M , 0 ≤ n ≤ m , σ = ± 1 c m , n , j σ · A m , n σ - - - ( 2 )
Wherein, M is that the spheric harmonic function that is used to calculate is ended exponent number.How according to the specific arrangements form of loudspeaker array, it is suitable to obtain
Figure BSA00000291746700018
Make the virtual sound image that reconstructs at the array center place near original plane ripple signal S (θ S, φ S) process, be exactly the decode procedure of Ambisonic sound reproduction system.
Gerzon serves as the vector composition principle that the basis proposes with the psychologic acoustics research that the human auditory system synthesizes virtual sound image to many sound sources, is applied to Ambisonic sound reproduction system decodes process.The original acoustical signal of different directions incident can produce the true acoustic image of different directions in people's auditory system.Gerzon thinks, as long as this virtual sound image is with true acoustic image direction unanimity, loudness is identical and have acoustic image clearly, can think the accurate reconstruct to original signal.Therefore the key of Ambisonic sound reproduction system is the weight coefficient group
Figure BSA00000291746700019
Determine.For this reason, at first need at specific loudspeaker array, respectively according to the psychoacoustic principle of the synthetic virtual sound image of lower frequency region and high-frequency domain, calculate the data such as direction, loudness and definition of virtual sound image, and then utilize the system optimization theory, set up the optimization aim function, come the different weight coefficient groups of comparison by the computer optimization program
Figure BSA000002917467000110
Fitness value, the final decoded result optimized of obtaining.
Craven and Wiggins are once based on this method and use 4 rank spheric harmonic functions respectively the horizontal multi-loudspeaker repeater system of ITU form to be optimized, the former uses the conjugate gradient optimization method, the latter as initial value, uses the TABU search method to be optimized computing with the former optimization result.The decoded result of Craven shows, the scope of the quality of virtual sound image between three loud speakers of the place ahead comparatively stable and accurately, the less stable of side direction and rear virtual sound image, the direction of virtual sound image are also not too accurate.Comparatively speaking, the decoded result of Wiggins, all better aspect the stability of virtual sound image and accuracy.But all only at the horizontal plane sound reproduction system of IUT form, the system to other types does not discuss in both research.David Moore and P.W.M Tsang have also announced them to how improving searching algorithm improving the research of 1 rank Ambisonic system optimization, but do not see to have and be higher than 1 rank systematic research report.And it is to the research of searching algorithm itself, do not relate to sound control, and can not effectively solve in optimizing process, because the nonlinear optimization that introducing caused of high-frequency domain acoustic energy flow vector is difficult to obtain globally optimal solution, and the problem of calculating length consuming time.
Summary of the invention
The objective of the invention is to overcome the deficiency that has now based on the Ambisonic sound reproduction system decodes method of virtual sound image vector composition principle, providing a kind of controls based on sound, utilize the optimization of the Ambisonic sound reproduction system decodes of virtual directivity of microphone control technology to improve one's methods, improve the synthetic precision of virtual sound image, improve the sound reproduction quality of Ambisonic system.。
The present invention realizes above-mentioned purpose by the following technical solutions:
The optimization of this Ambisonic sound reproduction system decodes is improved one's methods, and may further comprise the steps:
Step 1:, obtain the velocity r of decision low-frequency range and high band virtual sound image alignment quality respectively according to the acoustic image positioning principle VWith acoustic energy flow vector r E
Step 2:, calculate the loudness V of virtual sound image in low-frequency range according to the acoustic image positioning principle LiLoudness V with high band Hi
Step 3: ideally, the virtual sound image that is produced during Ambisonic system sound reproduction, should with the counterparty to true acoustic image direction unanimity, loudness identical, acoustic image is clear simultaneously.Set up direction optimization aim function Afit, loudness optimization aim function Lfit and resolution optimization target function Mfit thus respectively;
Step 4: the optimization aim function C fit that sets up virtual directivity of microphone control;
Step 5: synthetic complex optimum target function Tfit;
Step 6: in conjunction with the position of loud speaker in the Ambisonic sound reproduction system, adopt optimized Algorithm, obtain the complex optimum target function Tfit fitness value with optimum, its pairing spheric harmonic function weight coefficient group by interative computation Be optimum decoded result.
Compared with prior art, the present invention has following advantage and beneficial effect:
1) introduces the amplitude optimization aim function C fit that is used to control the virtual directivity of microphone.Virtual microphone with the single flap directive property feature that does not have secondary lobe, can suppress from the sound of corresponding repeating transmission loud speaker different directions, farthest reduce and retransmit sound obscuring in the direction in space sense;
2) decoded result can make each loudspeaker signal homophase (in phase), and the natural tone color and the maximum range of audibility that this has at utmost guaranteed the sound reproduction system are particularly suitable for many people and listen to, or head has the sound reproduction of rotation situation to use.
3) behind the introducing Cfit, optimization aim is more clear and definite, and the convergence rate of optimization aim function is accelerated, and shorten operation time greatly.
4) new optimization is improved one's methods all suitable to the Ambisonic system of any exponent number, any number of loudspeakers and arrangement.Wherein, for the Ambisonic system of asymmetric loudspeaker arrangement mode, can be by the independent directive property of adjusting the synthetic virtual microphone of the pairing difference of repeating transmission loud speaker of asymmetric distribution, the guiding optimizer obtains decoded result with fast speed.
5) proposed by the invention by the synthetic virtual directivity of microphone control method of spheric harmonic function, only relevant with spheric harmonic function itself, have nothing to do with adopting which kind of acoustic image orientation criterion.Therefore, the synthetic virtual directivity of microphone control method that the present invention proposes also can with other any acoustic image orientation criterion theories, as combining, the Ambisonic system is decoded based on the acoustic image positioning principle of number of people transfer function etc.
6) the invention solves prior art and be difficult to obtain the problem of globally optimal solution in the nonlinear optimization of using the vector composition principle to carry out occurring in the Ambisonic decode procedure; Though the overall merit target function value is better when having overcome computation optimization, the synthetic virtual directivity of microphone is undesirable, thus the physical property error result that acoustic image is obscured before and after when causing sound reproduction.
Description of drawings
Fig. 1 is the directive property parameter schematic diagram of synthetic virtual microphone;
Fig. 2 is a flow chart of the present invention;
Fig. 3 optimizes flow chart.
Embodiment
The present invention is described in further detail below in conjunction with embodiment and accompanying drawing, but embodiments of the present invention are not limited thereto.
Embodiment
Spheric harmonic function can be counted as the virtual microphone with particular orientation.At this moment, the cataloged procedure of Ambisonic system can be regarded the pick up process of the virtual microphone of a plurality of spheric harmonic functions to original sound wave of using as.During sound reproduction, the recurrent signal of each loud speaker of feeding can be by signal that these virtual microphones picked up by weight coefficient
Figure BSA00000291746700031
Stack obtains, and the stack back produces the synthetic virtual microphone SVM with new directive property, and anti-phase back is corresponding to a loud speaker of retransmitting in the loudspeaker array.
If the directive property of a certain synthetic virtual microphone SVM is undesirable, then when sound reproduction, the sound of all directions of being picked up all will be applied and retransmit the mistake that all causes the sound fore-and-aft direction to obscure by a certain loud speaker on physical significance and subjective sensation.If virtual microphone SVM is to other direction, particularly the inhibition of rear sound is very capable, the problem that the sound fore-and-aft direction that occurs easily in the time of can solving sound reproduction to a great extent during sound reproduction is obscured.Therefore, the synthetic virtual microphone SVM that utilizes the virtual microphone array of spheric harmonic function to produce, whether its directive property is desirable, with having a strong impact on original sound field carried out the quality that pick up in the space, and the subjectivity when influencing sound reproduction is simultaneously listened to effect.
The present invention has controlled the directive property of synthetic virtual microphone SVM effectively by setting up virtual directivity of microphone controlled target function C fit.
As Fig. 1, suppose that the azimuth of virtual microphone orientation of its axis is (θ Aim, φ Aim), its level is Δ θ and Δ φ with vertical effective angle of coverage, the start angle that calculates cone of coverage is respectively θ 1, θ 2, and φ 1, φ 2, and θ 1<θ 2, φ 1<φ 2Then under perfect condition, this virtual microphone should have good restraining ability to the outer sound of its effective coverage range.Microphone is picked up the power of ability by amplitude r to certain direction sound M(θ, φ) expression, r M(θ, φ) 〉=0.The directive property of virtual microphone should be subjected to effective control aspect following three:
(1) orientation of its axis of virtual microphone, i.e. azimuth (θ Aim, φ Aim);
(2) effective angle of coverage of virtual microphone, i.e. level and vertical angle of coverage Δ θ and Δ φ;
(3) the outer sound ability to accept of virtual microphone effective coverage range should be able to be subjected to the inhibition of maximum program.
For reaching above-mentioned controlled target, use the amplitude comparison method, set up relatively optimization aim function C fit of amplitude, the amplitude sum that the microphone effective coverage range is outer, divided by the amplitude sum in the effective coverage range, consider θ and φ in definition at the spherical coordinates of Ambisonic system, promptly-180 °≤θ≤180 °,-90 °≤φ≤90 °, have:
(1) at θ 1<θ Aim<θ 2And under ° situation of Δ φ≤180:
Cfit = Σ - 180 ≤ θ ≤ 180 ( Σ - 90 ≤ φ ≤ 90 r M ( θ , φ ) ) Σ θ 1 ≤ θ ≤ θ 2 ( Σ φ 1 ≤ φ ≤ φ 2 r M ( θ , φ ) ) - 1 - - - ( 3 )
(2) at θ 1<θ Aim<θ 2And under ° situation of Δ φ≤180:
Cfit = &Sigma; - 180 &le; &theta; &le; 180 ( &Sigma; - 90 &le; &phi; &le; 90 r M ( &theta; , &phi; ) ) &Sigma; &theta; 1 &le; &theta; &le; &theta; 2 ( &Sigma; - 90 &le; &phi; &le; 90 r M ( &theta; , &phi; ) ) + &Sigma; - 180 &le; &theta; < &theta; 1 , &theta; 2 < &theta; &le; 180 ( &Sigma; - 90 < &phi; &le; &phi; 1 , &phi; 2 &le; &phi; < 90 r M ( &theta; , &phi; ) ) - 1 - - - ( 4 )
(3) at θ Aim<θ 1Or θ Aim>θ 2And under ° situation of Δ φ≤180:
Cfit = &Sigma; - 180 &le; &theta; &le; 180 ( &Sigma; - 90 &le; &phi; &le; 90 r M ( &theta; , &phi; ) ) &Sigma; - 180 &le; &theta; < &theta; 1 , &theta; 2 < &theta; &le; 180 ( &Sigma; &phi; 1 &le; &phi; &le; &phi; 2 r M ( &theta; , &phi; ) ) - 1 - - - ( 5 )
(4) at θ Aim<θ 1Or θ Aim>θ 2And under ° situation of Δ φ≤180:
Cfit = &Sigma; - 180 &le; &theta; &le; 180 ( &Sigma; - 90 &le; &phi; &le; 90 r M ( &theta; , &phi; ) ) &Sigma; - 180 &le; &theta; < &theta; 1 , &theta; 2 < &theta; i &le; 180 ( &Sigma; - 90 &le; &phi; &le; 90 r M ( &theta; , &phi; ) ) + &Sigma; &theta; 1 &le; &theta; < &theta; 2 ( &Sigma; - 90 < &phi; &le; &phi; 1 , &phi; 2 &le; &phi; < 90 r M ( &theta; , &phi; ) ) - 1 - - - ( 6 )
r M(θ, φ) be equivalent to all loud speakers at the azimuth for (θ, the absolute value of the signal level value of closing on direction φ), as shown in the formula:
r M ( &theta; , &phi; ) = | &Sigma; j = 1 k g j | = | &Sigma; j = 1 k ( &Sigma; 0 &le; m &le; M , 0 &le; n &le; m , &sigma; = &PlusMinus; 1 c m , n , j &sigma; &CenterDot; Y m , n &sigma; ( &theta; , &phi; ) ) | - - - ( 7 )
In the formula (7), k represents the sum of loud speaker in the Ambisonic system, the recurrent signal g of loud speaker jCan obtain function by formula (1) and (2)
Figure BSA00000291746700053
Be spheric harmonic function,
Figure BSA00000291746700054
Be spheric harmonic function
Figure BSA00000291746700055
The weight coefficient group.Obviously, carry out the m rank Ambisonic system of encoding and decoding to adopting any m rank spheric harmonic function, formula (3) all is suitable for to formula (7).
Under any circumstance, must make that all Cfit has minimum value (under the perfect condition, Cfit=0), promptly when weakening microphone as far as possible sound pick up ability outside to effective coverage range, will be to the maximized of picking up of the sound in the effective coverage range, reach the purpose that the directive property of virtual microphone is controlled, can obtain comparatively desirable virtual microphone directional property this moment.
Performing step of the present invention and concrete scheme following (seeing accompanying drawing 2):
Step 1:, obtain the velocity r of decision low-frequency range and high band virtual sound image alignment quality respectively according to the acoustic image positioning principle VWith acoustic energy flow vector r E, so that carry out virtual sound image and the truly contrast of acoustic image.
r VAnd r EDirection, i.e. the deflection of virtual sound image is by θ ViAnd φ Vi, θ EiAnd φ EiExpression.Wherein, θ ViAnd φ ViBe the horizontal angle and the vertical angle of low-frequency range virtual sound image, θ EiAnd φ EiThen be the horizontal angle and the vertical angle of high band virtual sound image.r VAnd r EAmplitude can represent the definition of virtual sound image in the physical sense.Vector synthetic sound with Gerzon is an example as positioning principle, and these angles can be calculated by formula (8)~(11):
tan &theta; Vi = &Sigma; j = 1 k g j cos &phi; j sin &theta; j &Sigma; j = 1 k g j cos &phi; j cos &theta; j - - - ( 8 )
tan &phi; Vi = &Sigma; j = 1 k ( g j sin &phi; j ) ( &Sigma; j = 1 k g j cos &phi; j cos &theta; j ) 2 + ( &Sigma; j = 1 k g j cos &phi; j sin &theta; j ) 2 - - - ( 9 )
tan &theta; Ei = &Sigma; j = 1 k g j 2 cos &phi; j sin &theta; j &Sigma; j = 1 k g j 2 cos &phi; j cos &theta; j - - - ( 10 )
tan &phi; Ei = &Sigma; j = 1 k ( g j 2 sin &phi; j ) ( &Sigma; j = 1 k g j 2 cos &phi; j cos &theta; j ) 2 + ( &Sigma; j = 1 k g j 2 cos &phi; j sin &theta; j ) 2 - - - ( 11 )
With corresponding low-frequency range of true sound source and the high band virtual sound image definition evaluation of estimate r on a certain direction in space ViAnd r EiTry to achieve by formula (12)~(13)
r Vi = ( &Sigma; j = 1 k g j cos &phi; j cos &theta; j ) 2 + ( &Sigma; j = 1 k g j cos &phi; j sin &theta; j ) 2 + ( &Sigma; j = 1 k g j sin &phi; j ) 2 &Sigma; j = 1 k g j - - - ( 12 )
r Ei = ( &Sigma; j = 1 k g j 2 cos &phi; j cos &theta; j ) 2 + ( &Sigma; j = 1 k g j 2 cos &phi; j sin &theta; j ) 2 + ( &Sigma; j = 1 k g j 2 sin &phi; j ) 2 &Sigma; j = 1 k g j 2 - - - ( 13 )
Step 2:, calculate the loudness parameter V of virtual sound image respectively at low-frequency range and high band according to the acoustic image positioning principle LiAnd V HiVector synthetic sound with Gerzon is an example as positioning principle, V LiAnd V HiCan calculate by formula (14)~(15):
V Li = &Sigma; j = 1 k g j - - - ( 14 )
V Hi = &Sigma; j = 1 k g j 2 - - - ( 15 )
During with Ambisonic system sound reproduction, can be by the size of each loud speaker input signal of adjusted in concert Ambisonic system, the sound level of control virtual sound image.
Step 3: ideally, the virtual sound image that is produced during Ambisonic system sound reproduction, with the counterparty to true acoustic image identical on direction, loudness and definition, set up direction optimization aim function Afit, loudness optimization aim function Lfit and resolution optimization target function Mfit thus respectively.
The level angle and the vertical angle of the true sound source on the definition space direction are respectively θ OiAnd φ Oi, be example with the vector synthetic sound of Gerzon as positioning principle, then have:
r Vi=r Ei=1 (16)
θ Vi=θ Ei=θ Oi (17)
φ Vi=φ Ei=φ Oi (18)
In addition, when same sound when different direction in spaces occurs, with the loudness of the virtual sound image of its corresponding all directions all should be identical, have thus:
V Li=V L0 (19)
V Hi=V H0 (20)
To (23), the coupling system optimum theory can be set up relevant direction optimization aim function Afit, loudness optimization aim function Lfit and resolution optimization target function Mfit according to formula (19).
Afit = &Sigma; k = 1 6 ( &Sigma; i = 1 n ( &alpha; ki - &beta; ki ) 2 n ) - - - ( 21 )
Lfit = &Sigma; i = 1 n ( 1 - V Li / V L 0 ) 2 n + &Sigma; i = 1 n ( 1 - V Hi / V H 0 ) 2 n - - - ( 22 )
Mfit = &Sigma; i = 1 n ( 1 - r Vi ) 2 n + &Sigma; i = 1 n ( 1 - r Ei ) 2 n - - - ( 23 )
Wherein, n represents that virtual sound image is at space different directions hits.。
Step 4: the optimization aim function C fit that sets up the virtual directivity of microphone control of being calculated by formula (3) to (7);
Step 5: synthetic complex optimum target function Tfit;
Complex optimum target function Tfit is:
Tfit=b 1·Afit+b 2·Lfit+b 3·Mfit+b 4·Cfit (24)
In the formula, b j(j=1,2,3,4) are optimization aim function shared weight in total optimization aim function, and 0≤b j≤ 1.When Tfit has minimum value, show that virtual acoustic and actual sound that the Ambisonic system produced are the most approaching on subjective sensation, this moment, system reached optimum;
Step 6: retransmit loudspeaker array in conjunction with complex optimum target function Tfit and concrete Ambisonic system, adopt optimized Algorithm, calculate the fitness value of optimum optimization aim function T fit, its pairing spheric harmonic function weight coefficient group by interative computation
Figure BSA00000291746700074
Be optimum decoded result.The computation optimization flow process as shown in Figure 3.Optimized Algorithm can be selected calculus of finite differences, genetic algorithm, conjugate gradient method, simulated annealing or tabu search algorithm etc. for use.
The foregoing description is a preferred implementation of the present invention; but embodiments of the present invention are not restricted to the described embodiments; other any do not deviate from change, the modification done under spirit of the present invention and the principle, substitutes, combination, simplify; all should be the substitute mode of equivalence, be included within protection scope of the present invention.

Claims (5)

1.Ambisonic the optimization of sound reproduction system decodes is improved one's methods, and it is characterized in that, may further comprise the steps:
Step 1:, obtain the velocity r of decision low-frequency range and high band virtual sound image alignment quality respectively according to the acoustic image positioning principle VWith acoustic energy flow vector r E
Step 2:, calculate the loudness control parameter V of virtual sound image at low-frequency range and high band according to the acoustic image positioning principle LiAnd V Hi
Step 3: ideally, the virtual sound image that is produced during Ambisonic system sound reproduction, with the counterparty to true acoustic image identical on direction, loudness and definition.Set up direction optimization aim function Afit, loudness optimization aim function Lfit and resolution optimization target function Mfit thus respectively;
Step 4: the optimization aim function C fit that sets up virtual directivity of microphone control;
Step 5: set up complex optimum target function Tfit;
Step 6: in conjunction with the position of loud speaker in the Ambisonic sound reproduction system, adopt optimized Algorithm, by the complex optimum target function Tfit that interative computation determines to have adaptive optimal control degree value, its pairing spheric harmonic function weight coefficient group
Figure FSA00000291746600011
Be optimum decoded result.
2. the optimization of Ambisonic sound reproduction system decodes according to claim 1 is improved one's methods, and it is characterized in that, the described direction optimization aim of step 3 function Afit, loudness optimization aim function Lfit and resolution optimization target function Mfit are respectively:
Afit = &Sigma; k = 1 6 ( &Sigma; i = 1 n ( &alpha; ki - &beta; ki ) 2 n )
Lfit = &Sigma; i = 1 n ( 1 - V Li / V L 0 ) 2 n + &Sigma; i = 1 n ( 1 - V Hi / V H 0 ) 2 n
Mfit = &Sigma; i = 1 n ( 1 - r Vi ) 2 n + &Sigma; i = 1 n ( 1 - r Ei ) 2 n
Wherein, n represents that virtual sound image is at space different directions hits.
3. the optimization of Ambisonic sound reproduction system decodes according to claim 1 is improved one's methods, and it is characterized in that, the described complex optimum target function of step 5 Tfit is:
Tfit=b 1·Afit+b 2·Lfit+b 3·Mfit+b 4·Cfit
Wherein, b j(j=1,2,3,4) are different optimization aim function shared weights in the complex optimum target function, and 0≤b j≤ 1.
4. the optimization of Ambisonic sound reproduction system decodes according to claim 1 is improved one's methods, and it is characterized in that, the optimization aim function C fit of the virtual directivity of microphone control that step 4 is set up is:
(1) at θ 1<θ Aim<θ 2And under ° situation of Δ φ≤180:
Cfit = &Sigma; - 180 &le; &theta; &le; 180 ( &Sigma; - 90 &le; &phi; &le; 90 r M ( &theta; , &phi; ) ) &Sigma; &theta; 1 &le; &theta; &le; &theta; 2 ( &Sigma; &phi; 1 &le; &phi; &le; &phi; 2 r M ( &theta; , &phi; ) ) - 1
(2) at θ 1<θ Aim<θ 2And under ° situation of Δ φ≤180:
Cfit = &Sigma; - 180 &le; &theta; &le; 180 ( &Sigma; - 90 &le; &phi; &le; 90 r M ( &theta; , &phi; ) ) &Sigma; &theta; 1 &le; &theta; &le; &theta; 2 ( &Sigma; - 90 &le; &phi; &le; 90 r M ( &theta; , &phi; ) ) + &Sigma; - 180 &le; &theta; < &theta; 1 , &theta; 2 < &theta; &le; 180 ( &Sigma; - 90 < &phi; &le; &phi; 1 , &phi; 2 &le; &phi; < 90 r M ( &theta; , &phi; ) ) - 1
(3) at θ Aim<θ 1Or θ Aim>θ 2And under ° situation of Δ φ≤180:
Cfit = &Sigma; - 180 &le; &theta; &le; 180 ( &Sigma; - 90 &le; &phi; &le; 90 r M ( &theta; , &phi; ) ) &Sigma; - 180 &le; &theta; < &theta; 1 , &theta; 2 < &theta; &le; 180 ( &Sigma; &phi; 1 &le; &phi; &le; &phi; 2 r M ( &theta; , &phi; ) ) - 1
(4) at θ Aim<θ 1Or θ Aim>θ 2And under ° situation of Δ φ≤180:
Cfit = &Sigma; - 180 &le; &theta; &le; 180 ( &Sigma; - 90 &le; &phi; &le; 90 r M ( &theta; , &phi; ) ) &Sigma; - 180 &le; &theta; < &theta; 1 , &theta; 2 < &theta; i &le; 180 ( &Sigma; - 90 &le; &phi; &le; 90 r M ( &theta; , &phi; ) ) + &Sigma; &theta; 1 &le; &theta; < &theta; 2 ( &Sigma; - 90 < &phi; &le; &phi; 1 , &phi; 2 &le; &phi; < 90 r M ( &theta; , &phi; ) ) - 1
r M(θ, φ) be all loud speakers at the azimuth for (θ, the absolute value of the signal level value of closing on direction φ):
r M ( &theta; , &phi; ) = | &Sigma; j = 1 k g j | = | &Sigma; j = 1 k ( &Sigma; 0 &le; m &le; M , 0 &le; n &le; m , &sigma; = &PlusMinus; 1 c m , n , j &sigma; &CenterDot; Y m , n &sigma; ( &theta; , &phi; ) ) |
Wherein, the k representative constitutes the number of loudspeakers of loudspeaker array, g jBe the recurrent signal of loud speaker, Function Y is a spheric harmonic function,
Figure FSA00000291746600026
Mixed stocker array for spheric harmonic function Y.
5. the optimization of Ambisonic sound reproduction system decodes according to claim 1 is improved one's methods, and it is characterized in that, the described optimized Algorithm of step 6 can adopt calculus of finite differences, genetic algorithm, conjugate gradient method, simulated annealing or tabu search algorithm.
CN2010102989354A 2010-09-29 2010-09-29 Decoding optimizing and improving method of Ambisonic voice repeating system Pending CN101977349A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010102989354A CN101977349A (en) 2010-09-29 2010-09-29 Decoding optimizing and improving method of Ambisonic voice repeating system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010102989354A CN101977349A (en) 2010-09-29 2010-09-29 Decoding optimizing and improving method of Ambisonic voice repeating system

Publications (1)

Publication Number Publication Date
CN101977349A true CN101977349A (en) 2011-02-16

Family

ID=43577187

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010102989354A Pending CN101977349A (en) 2010-09-29 2010-09-29 Decoding optimizing and improving method of Ambisonic voice repeating system

Country Status (1)

Country Link
CN (1) CN101977349A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2623886C2 (en) * 2012-12-12 2017-06-29 Долби Интернэшнл Аб Method and device for compressing and restoring representation of high-order ambisonic system for sound field
CN107147975A (en) * 2017-04-26 2017-09-08 北京大学 A kind of Ambisonics matching pursuit coding/decoding methods put towards irregular loudspeaker
US9838822B2 (en) 2013-03-22 2017-12-05 Dolby Laboratories Licensing Corporation Method and apparatus for enhancing directivity of a 1st order ambisonics signal
CN105340008B (en) * 2013-05-29 2019-06-14 高通股份有限公司 The compression through exploded representation of sound field
CN110099351A (en) * 2019-04-01 2019-08-06 中车青岛四方机车车辆股份有限公司 A kind of sound field back method, device and system
US10499176B2 (en) 2013-05-29 2019-12-03 Qualcomm Incorporated Identifying codebooks to use when coding spatial components of a sound field
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
《华南理工大学博士学位论文》 20091121 龚惠哲 Ambisonic声重发***的优化改进 第20-31页 1-5 , 2 *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10609501B2 (en) 2012-12-12 2020-03-31 Dolby Laboratories Licensing Corporation Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
CN109545235B (en) * 2012-12-12 2023-11-17 杜比国际公司 Method and apparatus for compressing and decompressing higher order ambisonic representations of a sound field
US10038965B2 (en) 2012-12-12 2018-07-31 Dolby Laboratories Licensing Corporation Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
CN109545235A (en) * 2012-12-12 2019-03-29 杜比国际公司 The method and apparatus that the high-order ambiophony of sound field is indicated to carry out compression and decompression
US10257635B2 (en) 2012-12-12 2019-04-09 Dolby Laboratories Licensing Corporation Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
US11546712B2 (en) 2012-12-12 2023-01-03 Dolby Laboratories Licensing Corporation Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
US11184730B2 (en) 2012-12-12 2021-11-23 Dolby Laboratories Licensing Corporation Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
RU2623886C2 (en) * 2012-12-12 2017-06-29 Долби Интернэшнл Аб Method and device for compressing and restoring representation of high-order ambisonic system for sound field
US9838822B2 (en) 2013-03-22 2017-12-05 Dolby Laboratories Licensing Corporation Method and apparatus for enhancing directivity of a 1st order ambisonics signal
US10499176B2 (en) 2013-05-29 2019-12-03 Qualcomm Incorporated Identifying codebooks to use when coding spatial components of a sound field
US11146903B2 (en) 2013-05-29 2021-10-12 Qualcomm Incorporated Compression of decomposed representations of a sound field
CN105340008B (en) * 2013-05-29 2019-06-14 高通股份有限公司 The compression through exploded representation of sound field
US11962990B2 (en) 2013-05-29 2024-04-16 Qualcomm Incorporated Reordering of foreground audio objects in the ambisonics domain
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
CN107147975A (en) * 2017-04-26 2017-09-08 北京大学 A kind of Ambisonics matching pursuit coding/decoding methods put towards irregular loudspeaker
CN110099351B (en) * 2019-04-01 2020-11-03 中车青岛四方机车车辆股份有限公司 Sound field playback method, device and system
CN110099351A (en) * 2019-04-01 2019-08-06 中车青岛四方机车车辆股份有限公司 A kind of sound field back method, device and system

Similar Documents

Publication Publication Date Title
CN101977349A (en) Decoding optimizing and improving method of Ambisonic voice repeating system
US20220408209A1 (en) Method for and apparatus for decoding/rendering an ambisonics audio soundfield representation for audio playback using 2d setups
US9641929B2 (en) Audio signal processing method and apparatus and differential beamforming method and apparatus
US8213623B2 (en) Method to generate an output audio signal from two or more input audio signals
JP4987358B2 (en) Microphone modeling
CN103583054B (en) For producing the apparatus and method of audio output signal
CN101828407B (en) Based on the microphone array processor of spatial analysis
US10492000B2 (en) Cylindrical microphone array for efficient recording of 3D sound fields
CN102790931A (en) Distance sense synthetic method in three-dimensional sound field synthesis
CN104244164A (en) Method, device and computer program product for generating surround sound field
CN103180752B (en) For resolving equipment and the method for the fuzziness arriving direction estimation
CN107104720A (en) The relatively prime array adaptive beamforming method rebuild based on covariance matrix virtual Domain discretization
CN110784799B (en) Sound directional transmission method and system
CN100530350C (en) Sound radiant generation method to object
Epain et al. Ambisonic decoding with constant angular spread
CN106324602B (en) A kind of MIMO Sonar system
CN104768099B (en) Mode Beam-former and frequency domain bandwidth realization method for annular battle array
EP2755405A1 (en) Zonal sound distribution
Favrot et al. Metrics for performance assessment of mixed-order Ambisonics spherical microphone arrays
Shi et al. Design of a constant beamwidth beamformer for the parametric array loudspeaker
Poletti et al. Analysis of 2D sound reproduction with fixed-directivity loudspeakers
CN108476371A (en) Acoustic wavefield generates
Tseng A directional audible sound system using ultrasonic transducers
JP7373810B2 (en) Sound output device and sound output method
Zhang et al. Sound reproduction with a circular loudspeaker array using differential beamforming method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20110216