Surround sound matrix coding/decoding system and its implementation
Technical field
The application relates to a kind of multimedia treatment technology, particularly relates to a kind ofly improving the surround sound matrix decoder to the method and apparatus of the backwards compatibility of the original source of sound of binary channels with audio frequency watermark.
Background technology
Stereoly refer to have relief sound.The sound that nature sends is stereo, and people wish these stereo informations after record, transmission, retransmitting (playback), still accurate original stereo of perception.
A kind of existing stereo processing mode is the binary channels stereophonic sound system, referred to as binary channels, two-channel.2.0 system is typical binary channels playback system, only puts loudspeaker L and the right front loudspeaker R that puts forms by left front.2.1 system has increased bass unit LFE on the basis of 2.0 systems.
Another kind of existing stereo processing mode is the surround sound stereophonic sound system, referred to as surround sound.5.1 system is typical surround sound playback system, by left front put loudspeaker L, right front put loudspeaker R, in preposition loudspeaker C, bass unit LFE, left back circulating loudspeaker Ls, right back circulating loudspeaker Rs form.7.1 system has increased left speaker and right speaker on the basis of 5.1 systems.
Ambiophonic system has obtained Rapid Popularization and widespread use as than the better stereophonics mode of dual channel system in occasions such as home theater, electric athletic game, live reproduction, virtual realities.Yet dual channel system has the semicentennial applicating history of surpassing, and the quantity of hardware device and audio resource is all much larger than ambiophonic system.How to take full advantage of the equipment such as making, storage, transmission of existing double-channel signal, realize the performance of most of ambiophonic system, huge marketable value is just arranged.A typical application scenarios is that existing FM stereophonic broadcast is upgraded to accurate surround sound broadcasting easily.
Because the stereo information that ambiophonic system comprises is far more than dual channel system, in case when dual channel system can't be stored extra surround sound information by its original design, the conversion from the double-channel signal to the surround sound signal (being called surround sound matrix coder, blind decoding) just faced the difficult problem that part surround sound information can't be recovered.Be different from the pure surround sound codec of mainly bearing data compression function, industry will be called surround sound matrix solution code system to the system of surround sound signal from the double-channel signal audio mixing that makes progress traditionally, also claim surround sound blind decoding system.
See also Fig. 1, this is a kind of existing surround sound matrix coder system, only comprises surround sound matrix encoder 11.Described surround sound matrix encoder 11 is converted to double-channel signal with the surround sound signal of any sound channel (among Fig. 1 take 5.1 sound channels as example), so that using in making, storage and the transmission equipment of present a large amount of double-channel signals that exist.
See also Fig. 2, this is a kind of existing surround sound matrix solution code system, comprises surround sound matrix decoder 21, surround sound amplifier 22 etc.Described surround sound matrix decoder 21 is converted to surround sound signal (among Fig. 2 take 5.1 sound channels as example) with double-channel signal.Described surround sound amplifier 22 is play surround sound signal through exporting to surround sound playback system (among Fig. 2 take 5.1 surround sound playback systems as example) after amplifying, processing.
Because the information of original source of sound type is lost in storage, transmission, coding link, surround sound matrix solution code system can't learn that original source of sound type is binary channels or surround sound, so the repeating transmission mode of surround sound matrix solution code system may just be runed counter to the optimum retransmission mode of original source of sound type.For example, if the original source of sound type of the surround sound among Fig. 1 is double-channel signal.What listen that the hearer perceives at typical binary channels playback system is at left front loudspeaker L and the right front virtual mid-acoustic image M ' that puts between the loudspeaker R of putting.And through behind the surround sound matrix solution code system, listen the hearer 5.1 surround sound playback systems hear be by in the mid-acoustic image M of reality that sends of preposition loudspeaker C, this may increase because of amplitude cause width of cloth distortion.
From the angle of high-fidelity, when original source of sound is double-channel signal, listens the hearer and be more prone to the dual channel system playback that consisted of by L, R loudspeaker.If 5.1 surround sound playback systems among Fig. 1 can know exactly that original source of sound type is double-channel signal, just 5.1 surround sound playback systems degradation can be switched to 2.0 or 2.1 binary channels playback system, with the compatible binary channels stereo audio content of perfection.
Summary of the invention
The application's technical matters to be solved provides a kind of surround sound matrix coder system, can embed original source of sound type information in double-channel signal.For this reason, the application also will provide the implementation method of described surround sound matrix coder system.
Correspondingly, the application also will provide a kind of surround sound matrix solution code system, can carry out different processing according to different original source of sound type information entrained in the double-channel signal.For this reason, the application also will provide the implementation method of described surround sound matrix solution code system.
For solving the problems of the technologies described above, the application's surround sound matrix coder system comprises:
---the surround sound matrix encoder is converted to double-channel signal with surround sound signal;
---water mark embedding device, in double-channel signal, embed original source of sound type information in the audio frequency watermark mode, output has the double-channel signal of original source of sound type information.
The implementation method of described surround sound matrix coder system is: surround sound signal is converted into double-channel signal through the surround sound matrix encoder; Described double-channel signal passes through water mark embedding device again, be embedded into the expression original source of sound type information audio frequency watermark after export.
Correspondingly, the application's surround sound matrix solution code system comprises:
---the surround sound matrix decoder, the double-channel signal that will have original source of sound type information is converted to surround sound signal, exports to mixer;
---the watermark extracting device, from the double-channel signal with original source of sound type information, identify and extract original source of sound type information, the mode of operation of control mixer;
---mixer directly gathers the double-channel signal with original source of sound type information;
When original source of sound type was binary channels, the mode of operation that the watermark extracting device is set mixer was: the double-channel signal with original source of sound type information that gathers is exported to the surround sound amplifier;
When original source of sound type was surround sound, the mode of operation that the watermark extracting device is set mixer was: the surround sound signal of surround sound matrix decoder output is exported to the surround sound amplifier;
---the surround sound amplifier, double-channel signal or the surround sound signal that will have original source of sound type information are exported to the surround sound playback system plays after amplifying, processing.
The implementation method of described surround sound matrix solution code system is: the double-channel signal with original source of sound type type enters surround sound matrix decoder, watermark extracting device and mixer simultaneously; Described surround sound matrix decoder is converted into surround sound signal and exports to mixer; Described watermark extracting device is from wherein extracting original source of sound type information, and the mode of operation signal that generates mixer is exported to mixer; Described mixer is perhaps exported to surround sound signal the surround sound amplifier according to the mode of operation signal, and the double-channel signal that perhaps will have original source of sound type type is exported to the surround sound amplifier.
Compared with prior art, the application's surround sound matrix coding/decoding system and its implementation have the following advantages:
One, surround sound matrix solution code system is compatible original binary channels source of sound fully, so that the surround sound playback system can show original binary channels source of sound better.
Its two, do not change making, storage, the transmission system of existing double-channel signal, only add water mark embedding device at surround sound matrix encoder end, add watermark extracting device and mixer at surround sound matrix decoder end.The module that increases more can be incorporated into respectively among surround sound scrambler and the surround sound decoder, to save cost.
Its three, embed audio frequency watermark and can adopt the matrixing technology so that have, indication that the no-watermark signal can be used as binary channels program source and surround sound program source; Perhaps, also can so that the different information in the watermark signal as the indication of binary channels program source and surround sound program source.This is the most widely used and poor system, pilot in the compatible stereophonic FM broadcasting better.
Description of drawings
Fig. 1 is the structural representation of existing surround sound matrix coder system;
Fig. 2 is the structural representation of existing surround sound matrix solution code system;
Fig. 3 is the structural representation of the application's surround sound matrix coder system;
Fig. 4 is the structural representation of the application's surround sound matrix solution code system;
Fig. 5 is the structural representation of an embodiment of water mark embedding device among Fig. 3;
Fig. 6 is the structural representation of an embodiment of watermark embedded unit among Fig. 5;
Fig. 7 is the structural representation of another embodiment of watermark embedded unit among Fig. 5;
Fig. 8 is the structural representation of an embodiment of watermark extracting device among Fig. 4;
Fig. 9 is that the de-jitter module is processed forward and backward oscillogram to signal and intersected the audio mixing switch to signal C fades in, signal D fades out oscillogram among Fig. 8;
Figure 10 is the structural representation of an embodiment of mixer among Fig. 4.
Reference numeral is among the figure:
11 is the surround sound matrix encoder; 21 is the surround sound matrix decoder; 22 is the surround sound amplifier; 23 is 5.1 sound channel surround sound playback systems; 31 is water mark embedding device; 311 is matrix M; 312 are the watermark embedded unit; 313 is matrix M
-141 is the watermark extracting device; 411 is difference engine; 412 is watermark detection unit; 413 is the redundant correcting decoder module; 414 is the de-jitter module; 415 are the watermark discrimination module; 42 is mixer; 421 for intersecting the audio mixing switch; 422 are time-delay, gain regulation module; 51 is the redundant correcting coding module; 52 is carrier signal generator; 53 is modulator; 54 is mixer; 55 is wave filter; 56 is acoustic convolver.
Embodiment
See also Fig. 3, this is the structural representation of the application's surround sound matrix coder system, comprises surround sound matrix encoder 11 and water mark embedding device 31.Described surround sound matrix encoder 11 is converted to double-channel signal with surround sound signal (among Fig. 3 take 5.1 sound channels as example), in order to be applicable at present making, transmission and the memory device of a large amount of double-channel signals that exist.Water mark embedding device 31 then embeds original source of sound type information (uncorrelated with sample frequency) in double-channel signal, the mode of audio frequency watermark is adopted in this embedding.Described audio frequency watermark refers to hide a newly-increased signal in sound signal, and this newly-increased signal can not be perceived by the human ear, and only having Special Equipment just can find from sound signal and extract should newly-increased signal.Described original source of sound type comprises single channel, 2.0 binary channels, 2.1 binary channels, 5.1 surround sounds, 7.1 surround sounds, 9.2 surround sounds etc.Water mark embedding device 31 outputs have the double-channel signal of original source of sound type information.
The implementation method of above-mentioned surround sound matrix coder system is: surround sound signal is converted into double-channel signal through surround sound matrix encoder 11.Described double-channel signal passes through water mark embedding device 31 again, be embedded into the expression original source of sound type information audio frequency watermark after export.
See also Fig. 4, this is the structural representation of the application's surround sound matrix solution code system, is used with surround sound matrix coder system shown in Figure 2.It comprises surround sound matrix decoder 21, watermark extracting device 41, mixer 42 and surround sound amplifier 22.The double-channel signal that described surround sound matrix decoder 21 will have original source of sound type information is converted to surround sound signal (among Fig. 4 take 5.1 sound channels as example), exports to mixer 42.Original source of sound type information is identified and extracted to described watermark extracting device 41 from the double-channel signal with original source of sound type information, the mode of operation of control mixer 42.Mixer 42 directly gathers the double-channel signal with original source of sound type information.When the original source of sound type that extracts when watermark extracting device 41 was binary channels, setting mixer 42 be the binary channels mode of operation, and mixer 42 is just directly exported to surround sound amplifier 22 with the double-channel signal with original source of sound type information that gathers.When the original source of sound type that extracts when watermark extracting device 41 was surround sound, setting mixer 42 be the surround sound mode of operation, and the surround sound signal that mixer 42 is just directly exported surround sound matrix decoder 21 is exported to surround sound amplifier 22.Described surround sound amplifier 22 will have double-channel signal or the surround sound signal of original source of sound type information and export to the surround sound playback system plays after amplifying, processing.
The implementation method of above-mentioned surround sound matrix solution code system is: the double-channel signal with original source of sound type type enters surround sound matrix decoder 21, watermark extracting device 41 and mixer 42 simultaneously.Described surround sound matrix decoder 21 is converted into surround sound signal and exports to mixer 42.Described watermark extracting device 41 therefrom extracts original source of sound type information, and the mode of operation signal of generation mixer 42 is exported to mixer 42.Described mixer 42 is perhaps exported to surround sound signal surround sound amplifier 22 according to the mode of operation signal, and the double-channel signal that perhaps will have original source of sound type type is exported to surround sound amplifier 22.
In the first surround sound matrix coding and decoding scheme, when original source of sound type was binary channels, water mark embedding device 31 is embed watermark in double-channel signal not.When original source of sound type was surround sound, water mark embedding device 31 embedded the audio frequency watermark of the original source of sound type information of expression in double-channel signal.So just, whether has watermark in the double-channel signal of being exported with water mark embedding device 31, as the foundation of differentiating original source of sound type.
Correspondingly, when watermark extracting device 41 can not find watermark from the double-channel signal with original source of sound type type, show that original source of sound type is binary channels.This moment, watermark extracting device 41 sent binary channels mode of operation signal to mixer 42.When watermark extracting device 41 extracts watermark from the double-channel signal with original source of sound type type, show that original source of sound type is the surround sound of putting down in writing in the watermark.Watermark extracting device 41 just sends surround sound mode of operation signal to mixer 42 at this moment.
In the second surround sound matrix coding and decoding scheme, no matter which kind of original source of sound type is, water mark embedding device 31 is embed watermark in double-channel signal all, has put down in writing concrete original source of sound type in the watermark.
Correspondingly, watermark extracting device 41 finds watermark from the double-channel signal with original source of sound type type, and learns original source of sound type according to the information in the watermark, thereby sends corresponding mode of operation signal to mixer 42.
No matter be above-mentioned any scheme, the application can allow audio frequency obtain optimum playback system in the surround sound playback system according to original source of sound type.
Described water mark embedding device 31, watermark extracting device 41, mixer 42 can be realized with various ways, such as discrete component electronic equipment, composite signal integrated circuits, by digital signal processor or universal central processing unit are carried out the modes such as Software Coding, or the combination of these implementations.The below only exemplarily enumerates several embodiment.
See also Fig. 5, this is a specific embodiment of water mark embedding device 31, comprises matrixing unit 311, watermark embedded unit 312, matrix inversion converter unit 313.
Described matrixing unit 311 is converted to M signal (common-mode signal) and S signal (differential signal) with double-channel signal.Described matrixing unit 311 for example is 2 * 2 matrix M:
Described matrixing for example be multiply by 2 * 1 matrix with 2 * 2 matrix M
Wherein L, R represent double-channel signal.
Described watermark embedded unit 312 is modulated the watermark of the original source of sound type information of expression and is output as S ' signal (differential signal with original source of sound type information) in the S signal.If take have, no-watermark represents respectively original source of sound type as surround sound, binary channels, when original source of sound type was binary channels, watermark embedded unit 312 is modulation watermark in the S signal not so, directly the S signal was exported as S ' signal.
Described matrix inversion converter unit 313 is converted to the double-channel signal with original source of sound type information with M signal and S ' signal.Described matrix inversion converter unit for example is 2 * 2 matrix M
-1:
Described matrix inversion conversion is for example with 2 * 2 matrix M
-1Multiply by 2 * 1 matrix
Fig. 6 has provided an embodiment of watermark embedded unit 312, comprises redundant correcting coding module 51, carrier signal generator 52, modulator 53, mixer 54.Original source of sound type information becomes the redundant correcting code word behind redundant correcting coding module 51.In the time of major part, water mark embedding device 31 and watermark extracting device 41 are asynchronous duty, so also can embed necessary synchronization code word in the redundant correcting code word.Carrier signal generator 52 generates carrier signal, and carrier signal is generally broadband white noise, simple signal or multiple-frequency signal.Redundant correcting code word on modulator 53 is modulated in carrier signal, modulator approach commonly used are BPSK(Binary Phase Shift Keying, two-phase PSK) etc.Modulation signal and S signal are transformed to S ' signal after entering together mixer 54.Mixer 54 also need be finished attenuated output signal or frequency response shaping feature, makes modulation signal be sheltered and can not be perceived by the human ear by the S signal.
Fig. 7 has provided another embodiment of watermark embedded unit 45, comprises redundant correcting coding module 51, wave filter 55, acoustic convolver 56 etc.Original source of sound type information becomes the redundant correcting code word behind redundant correcting coding module 51.In the time of major part, water mark embedding device 31 and watermark extracting device 41 are asynchronous duty, so also can embed necessary synchronization code word in the redundant correcting code word.Wave filter 55 is pulse train with the redundant correcting code construction, and building method commonly used is that echo is hidden sequence, such as the positive and negative echo sequences of single echo, positive and negative echo, time shaft expansion etc.Pulse train and S signal enter together acoustic convolver 56 and are transformed to S ' signal.
See also Fig. 8, this is a specific embodiment of watermark extracting device 41, comprises difference engine 411, watermark detection unit 412, redundant correcting decoder module 413, de-jitter module 414, watermark judging module 415 etc.Described difference engine 411 extracts S ' signal from the double-channel signal with original source of sound type information.Described watermark detection unit 412 this locality generate with S ' signal in consistent synchronization code word, carry out related operation with this synchronization code word and S ' signal, thereby detect the sync bit of S ' signal.When correlation was maximum, expression had detected sync bit.The code word that 413 pairs of described redundant correcting decoder modules obtain in the S ' signal behind the sync bit is carried out the redundant correcting decoding, obtains the watermark of the original source of sound type information of expression that embeds in the S ' signal.The original source of sound type information that 414 pairs of described de-jitter modules obtain disturbs correction, the watermark information mistake that the erroneous judgement that reduction causes because of various interference certainly causes.Perhaps, the S ' signal that 414 pairs of described de-jitter modules do not detect sync bit disturbs correction, the information errors that the erroneous judgement that reduction causes because of various interference certainly causes.Signal A, B among Fig. 9 shows respectively de-jitter module 414 and processes forward and backward signal waveform.Described watermark judging module 415 is worked as and is had watermark in the S ' signal then to mixer output surround sound mode of operation signal in one case, and working as does not have watermark then to mixer output binary channels mode of operation signal in the S ' signal.Described watermark judging module 415 in another case, work as watermark in the S ' signal put down in writing original source of sound type be binary channels then to mixer output binary channels mode of operation signal, working as watermark in the S ' signal, to have put down in writing original source of sound type be that surround sound is then to mixer output surround sound mode of operation signal.
If take have, no-watermark represents that respectively original source of sound type is as surround sound, binary channels, do not detect when synchronous when watermark detection unit 412 so, just directly S ' signal is sent to de-jitter module 414, is processed by binary channels by watermark discrimination module 415 again.
If no matter which kind of original source of sound type all have watermark for, specifically judge with the information of putting down in writing in the watermark, after watermark detection module 412 must detect sync bit so, just S ' signal is sent to redundant correcting decoder module 413.Do not detect synchronously, then do not proceed down.
See also Figure 10, this is a specific embodiment of mixer 42, comprises intersection audio mixing switch 421, time-delay and gain regulation module 422 etc.What receive when mixer 42 is binary channels mode of operation signal, described intersection audio mixing switch 421 will have the double-channel signal of original source of sound type information and do the processing of fading in again after time-delay and gain regulation module 422 processing, simultaneously surround sound signal be done the processing of fading out.Receive it is surround sound mode of operation signal when mixer 42, described intersection audio mixing switch 421 is done the processing of fading in surround sound signal, and the double-channel signal that will have original source of sound type information is done the processing of fading out again after time-delay and gain regulation module processing.Signal C among Fig. 9 has expressed the gain trace of 5.1 sound channel surround sound signals, and signal D has expressed the gain trace of double-channel signal.(owing to one the tunnel fade in, the one tunnel fade out, roughly be equivalent to only export wherein one road signal) passes to surround sound amplifier 22 after the two paths of signals stack.Owing to have the double-channel signal of original source of sound type information, input, the output signal that surround sound signal is respectively surround sound matrix decoder 41, both are poor, the amplitude difference of life period inevitably.Described time-delay and gain regulation module 412 are used for mating double-channel signal and the mistiming between the surround sound signal and the amplitude difference with original source of sound type information, thereby weaken the mode of operation switching of mixer 42 to the perception degree of people's ear.
Be the application's preferred embodiment only below, and be not used in restriction the application.For a person skilled in the art, the application can have various modifications and variations.All within the application's spirit and principle, any modification of doing, be equal to replacement, improvement etc., all should be included within the application's the protection domain.