CN101065991A - Video-audio recording apparatus and method, and video-audio reproducing apparatus and method - Google Patents

Video-audio recording apparatus and method, and video-audio reproducing apparatus and method Download PDF

Info

Publication number
CN101065991A
CN101065991A CN 200580039496 CN200580039496A CN101065991A CN 101065991 A CN101065991 A CN 101065991A CN 200580039496 CN200580039496 CN 200580039496 CN 200580039496 A CN200580039496 A CN 200580039496A CN 101065991 A CN101065991 A CN 101065991A
Authority
CN
China
Prior art keywords
signal
sound
microphone
dual track
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 200580039496
Other languages
Chinese (zh)
Other versions
CN100553373C (en
Inventor
村田寿子
铃木琢磨
安良定浩
羽石实希
长谷川胜巳
相崎正子
藤浪喜久
武田正美
菅原隆幸
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
JVCKenwood Corp
Original Assignee
Victor Company of Japan Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Victor Company of Japan Ltd filed Critical Victor Company of Japan Ltd
Publication of CN101065991A publication Critical patent/CN101065991A/en
Application granted granted Critical
Publication of CN100553373C publication Critical patent/CN100553373C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Studio Devices (AREA)

Abstract

A video/audio recording/reproducing apparatus (101) has an internally-mounted stereo microphones (21a,21b) and an external microphone connection terminal (32). The external microphone connection terminal (32) is connected to binaural microphones (3) to be attached to the ears of a recording user (300). In a case of collecting ambient sounds by use of the binaural microphones (3), the user switches the audio signals to be recorded in a recording medium from the audio signals obtained from the internally-mounted stereo microphones (21a,21b) to the binaural audio signals obtained from the binaural microphones (3). The recording user (300) attaches the binaural microphones (3(31a,31b)) to his ears to collect sounds around him including sounds generated from an imaged subject. A camera part (11) images the subject. The binaural audio signals, video signals obtained by imaging the subject, and binaural flag signals are recorded into the recording medium.

Description

Video-audio recording apparatus and method and video-audio reproducing apparatus and method
Technical field
The present invention relates to a kind of video-audio recording apparatus and method that signal of video signal and voice signal are write down, and the video-audio reproducing apparatus of signal of video signal that writes down in the regenerative recording medium and voice signal and method, above-mentioned signal of video signal is the signal of video signal that object is made a video recording and obtained, the tut signal is to collect the voice signal that sound obtains to comprising the sound that object sends at the sound around the interior cameraman, be particularly related to a kind of video-audio recording apparatus and the method that can regenerate captured image and have the sound of presence, and video-audio reproducing apparatus and method.
Background technology
The signal of video signal that the reference object thing is obtained and collect the image and sound recording regenerating unit (so-called video camera) that voice signal that sound obtains writes down at the sound around the interior cameraman and popularize to comprising sound that object sends.This image and sound recording regenerating unit possesses the stereophony microphone that is used for sound is carried out stereophonic recording.In recent years, the just further miniaturization of image and sound recording regenerating unit exists the stereophony microphone of installing on the image and sound recording regenerating unit by miniaturization to be difficult to write down the problem of the sound with presence.Therefore, expectation can be write down the image and sound recording regenerating unit of the stronger sound of presence.
Image and sound recording regenerating unit below in the world discloses No. 96/10884 brochure, disclosing, it is by at the both sides of image and sound recording regenerating unit configuration ear tectosome, the signal of video signal that can obtain the reference object thing and the sound that the sound field around the cameraman is carried out dual track collection sound carried out record.
But, above-mentioned document disclosed at device from having installed in the image and sound recording regenerating unit of dual track microphone on one's body, if the width of device itself, promptly the interval of left and right sides microphone is not identical degree with people's head, then can't write down the sound with presence.The image and sound recording regenerating unit of Sheng Chaning is along with the progress of high density recording technology, digital signal record technology and image compression treatment technology is miniaturized in recent years, even so be the microphone of purpose, also can't obtain desired effect from installing with dual track collection sound on one's body at the image and sound recording regenerating unit.In addition, because the shape of device and people's head differ bigger, think and be difficult to obtain the disclosed effect of above-mentioned document.
Summary of the invention
The present invention invents in view of above problem, its purpose is to provide a kind of size or shape with device self irrelevant, can regenerate image of taking and video-audio recording apparatus and method and video-audio reproducing apparatus and method with sound of presence.
In addition, the object of the present invention is to provide a kind of when object being amplified (zoom up) shooting, can with the video-audio recording apparatus and the method for the stronger sound of the amplification linkage ground of object regeneration presence, and video-audio reproducing apparatus and method.
In addition, a kind ofly photograph, look arbitrarily under the situation of hearer's audiovisual even the object of the present invention is to provide cameraman arbitrarily, for example looking under hearer's situation different with the cameraman who has carried out dual track collection sound of audiovisual regenerated signal also can be listened to the video-audio reproducing apparatus and the method for the sound with presence that does not almost have inharmonic sensation.
The present invention is in order to solve above-mentioned existing technical task, a kind of video-audio recording apparatus is provided, it collects the voice signal that sound obtains at the sound around the interior cameraman and carries out record at the signal of video signal that the reference object thing is obtained and to comprising sound that described object sends, it is characterized in that possessing: image pickup part, it is made a video recording to described object; Switching part, switching are to use the dual track microphone that is worn on described cameraman's ear also to be to use dual track microphone microphone in addition to be used as described ambient sound is collected the microphone of sound; Video processing portion, its signal of video signal to described image pickup part output is handled; Audio treatment part, it handles the voice signal that the sound around described is collected the microphone output of sound; The flag information generating unit, the microphone that the sound around described is collected sound in conduct, when switching to described dual track microphone by described switching part, acoustic pattern when the sound around described is collected sound generates the dual track marking signal of expression dual track pattern as the dual track pattern; And recording portion, it writes down in recording medium by described Video processing portion and has carried out the signal of video signal of handling, carried out voice signal and the described dual track marking signal handled by described audio treatment part.
According to the present invention, irrelevant with the size and the shape of device itself, can regenerate image of taking and sound with presence.In addition, to the object amplifying camera time, can with the stronger sound of the amplification linkage ground of object regeneration presence.And, even photograph cameraman arbitrarily, look arbitrarily under the situation of hearer's audiovisual, for example looking under hearer's situation different with the cameraman who carries out dual track collection sound of audiovisual regenerated signal also can be listened to the sound with presence that does not almost have inharmonic sensation.
In this structure, preferably have: be built in the built-in microphone in the described video-audio recording apparatus; The external microphone splicing ear; Set the configuration part that described dual track microphone or dual track microphone microphone in addition is used as being connected with described external microphone splicing ear the external microphone that uses; The joint detection portion whether external microphone is connected and detects with described external microphone splicing ear; As the voice signal that described audio treatment part is provided, the switch that between the voice signal that the voice signal and the described external microphone of described built-in microphone output are exported, switches; And described external microphone is being set at the dual track microphone by described configuration part, and detect when on described external microphone splicing ear, being connected with external microphone by described joint detection portion, acoustic pattern is made as the control part of described dual track pattern, when described control part is described dual track pattern at acoustic pattern, described switch is carried out switching controls so that the voice signal of described external microphone output is offered described audio treatment part, and control described flag information generating unit and make described flag information generating unit generate described dual track marking signal.
In addition, preferably possess the display part that the signal of video signal to the output of described image pickup part shows, and possess display control unit, when it is described dual track pattern at acoustic pattern, make described display part show the dual track mark of the described dual track pattern of expression.
And preferably described image pickup part possesses the zoom function that described object is carried out amplifying camera, and possesses sound convergent-divergent handling part, and the magnification ratio of its corresponding described image pickup part amplifies the voice signal that uses described dual track microphone collection sound to obtain.
And, preferably described image pickup part possesses the zoom function that described object is carried out amplifying camera, and possesses sound convergent-divergent handling part, described sound convergent-divergent handling part possesses: the transfer function memory, a plurality of distances of its corresponding virtual source of sound and listener are stored head transfer functions, and described head transfer functions is used to make the source of sound of the voice signal that described dual track microphone collection sound obtains to form virtual source of sound near the listener virtually; Function is obtained device, and the magnification ratio of its corresponding described image pickup part is optionally obtained some in a plurality of head transfer functions of storing in this transfer function memory; And the convolution algorithm device, it obtains the head transfer functions that device is obtained to the voice signal convolution that is obtained by described dual track microphone collection sound by described function.
The present invention is in order to solve above-mentioned existing technical task, a kind of image and sound recording method is provided, the signal of video signal that this method obtains the reference object thing and collect the voice signal that sound obtains at the sound around the interior cameraman and carry out record to comprising sound that described object sends, it is characterized in that, possess: the shooting step, described object is made a video recording; Switch step, switching are to use the dual track microphone that is worn on described cameraman's ear also to be to use dual track microphone microphone in addition to be used as described ambient sound is collected the microphone of sound; The Video processing step is handled taking the signal of video signal that described object obtains; The Audio Processing step is handled the voice signal that the sound around described is collected the microphone output of sound; Flag information generates step, as the microphone that the sound around described is collected sound, when in described switch step, switching to described dual track microphone, acoustic pattern when the sound around described is collected sound is made as the dual track pattern, generates the dual track marking signal of expression dual track pattern; And recording step, in recording medium, be recorded in the signal of video signal of handling in the described Video processing step, the voice signal of in described Audio Processing step, handling and described dual track marking signal.
The present invention is in order to solve the problem of above-mentioned prior art, a kind of video-audio reproducing apparatus is provided, this device regenerative recording the signal of video signal that obtains of reference object thing and to comprising sound that described object sends collects the voice signal that sound obtains at the sound around the interior cameraman recording medium, it is characterized in that, possess: reproducing unit, the tracer signal that writes down in its described recording medium of regenerating; Separated part, its described tracer signal from described reproducing unit regeneration is separated described signal of video signal and described voice signal; Video processing portion, it is to being handled by the isolated described signal of video signal of described separated part; Audio treatment part, it is to being handled by the isolated described voice signal of described separated part; The flag information obtaining section, it records expression when having used the dual track marking signal of the dual track microphone on the ear that is worn on described cameraman as the microphone that the sound around described is collected sound in described recording medium, obtain described dual track marking signal; Crosstalk (cross talk) arrester, when it has obtained described dual track marking signal in described flag information obtaining section, described voice signal handled offset the crosstalk signal that produces when the described voice signal of described audio treatment part being handled by loud speaker pronounces; Described crosstalk arrester has filter, the filter characteristic that this filter is obtained in advance to described voice signal convolution, described filter characteristic is based on head transfer functions, and this head transfer functions is to use by a pair of microphone of installing in the cylindrical configuration body measured signal is collected the voice signal that sound obtains and the head transfer functions that measures.
In addition, a kind of video-audio renovation process is provided, this method regenerative recording the signal of video signal that obtains of reference object thing and to comprising sound that described object sends collects the voice signal that sound obtains at the sound around the interior cameraman recording medium, it is characterized in that possessing: regeneration step, the tracer signal that writes down in the described recording medium of regenerating; Separating step, the described tracer signal of regenerating from described regeneration step is separated described signal of video signal and described voice signal; The Video processing step is handled isolated described signal of video signal in the described separating step; The Audio Processing step is handled isolated described voice signal in described separating step; Flag information is obtained step, in described recording medium, record expression when having used the dual track marking signal of the dual track microphone on the ear that is worn on described cameraman, obtain described dual track marking signal as the microphone that the sound around described is collected sound; And crosstalk removal process, obtain when having obtained described dual track marking signal in the step at described flag information, described voice signal is handled offset the crosstalk signal that produces when the described voice signal of handling being pronounced in described Audio Processing step by loud speaker; Described crosstalk removal process is the step of filter characteristic that described voice signal convolution is obtained in advance, described filter characteristic is based on head transfer functions, and this head transfer functions is to use by a pair of microphone of installing in the cylindrical configuration body measured signal is collected the voice signal that sound obtains and the head transfer functions that measures.
Description of drawings
Fig. 1 is the outward appearance oblique view of the image and sound recording regenerating unit of expression first embodiment of the invention.
State when Fig. 2 represents to use the image and sound recording regenerating unit of first embodiment of the invention that object is made a video recording.
Fig. 3 is the block diagram of inside configuration example of the image and sound recording regenerating unit of expression first embodiment of the invention.
Fig. 4 is illustrated in the display frame of carrying out the initial setting of acoustic pattern in the image and sound recording regenerating unit of each execution mode of the present invention.
Fig. 5 represents the demonstration example of the dual track microphone in the image and sound recording regenerating unit of each execution mode of the present invention.
Fig. 6 is illustrated in the variation of the dual track microphone that uses in the image and sound recording regenerating unit of each execution mode of the present invention.
Fig. 7 is illustrated in the variation of the dual track microphone that uses in the image and sound recording regenerating unit of each execution mode of the present invention.
Fig. 8 is illustrated in the variation of the dual track microphone that uses in the image and sound recording regenerating unit of each execution mode of the present invention.
Fig. 9 represents an example of the record form of the dual track marking signal in the image and sound recording regenerating unit of each execution mode of the present invention.
Figure 10 represents other examples of the record form of the dual track marking signal in the image and sound recording regenerating unit of each execution mode of the present invention.
Figure 11 represents another example of the record form of the dual track marking signal in the image and sound recording regenerating unit of each execution mode of the present invention.
Figure 12 is the flow chart of operation of recording that is used to illustrate the image and sound recording regenerating unit of first embodiment of the invention.
Figure 13 is the flow chart of regeneration action that is used to illustrate the image and sound recording regenerating unit of first embodiment of the invention.
Figure 14 is the block diagram of the configuration example of the crosstalk arrester that uses in the image and sound recording regenerating unit of expression the present invention each execution mode.
Figure 15 represents to be used for to obtain the head transfer functions determinator of the employed head transfer functions characteristic of crosstalk arrester that the image and sound recording regenerating unit at each execution mode of the present invention uses.
Figure 16 has represented to install the cylindrical configuration body of the microphone assembly that uses in the head transfer functions determinator shown in Figure 15, and the dummy head who is used to compare (dummy head) wears microphone.
Figure 17 is the oscillogram of expression by the impulse response waveform of head transfer functions determinator mensuration shown in Figure 15.
Figure 18 is the oscillogram of the frequency characteristic that determined by head transfer functions determinator shown in Figure 15 of expression.
Figure 19 is the oscillogram that expression end user foreman wears the impulse response waveform that microphone determines.
Figure 20 is the oscillogram that expression end user foreman wears the frequency characteristic that microphone determines.
The crosstalk that Figure 21 is based on filter characteristic is eliminated the key diagram of characteristic, and this filter characteristic has been installed the head transfer functions that the cylindrical configuration body examination of microphone assembly is made based on use.
Figure 22 is based on the key diagram of the crosstalk elimination characteristic of filter characteristic, and this filter characteristic is worn the head transfer functions that microphone determines based on end user foreman.
The crosstalk that Figure 23 is based on filter characteristic is eliminated the key diagram of characteristic, and this filter characteristic has been installed the head transfer functions that the cylindrical configuration body examination of microphone assembly is made based on use.
Figure 24 is based on the key diagram of the crosstalk elimination characteristic of filter characteristic, and this filter characteristic is worn the head transfer functions that microphone determines based on end user foreman.
Figure 25 is the block diagram that is illustrated in other configuration examples of the crosstalk arrester that uses in the image and sound recording regenerating unit of each execution mode of the present invention.
Figure 26 is the block diagram that is illustrated in another configuration example of the crosstalk arrester that uses in the image and sound recording regenerating unit of each execution mode of the present invention.
Figure 27 is the flow chart of the regeneration action of the headgear in the image and sound recording regenerating unit of expression the present invention each execution mode.
Figure 28 is the block diagram of inside configuration example of the image and sound recording regenerating unit of expression second embodiment of the invention.
Figure 29 is the block diagram of configuration example of the sound convergent-divergent handling part in the image and sound recording regenerating unit of expression second embodiment of the invention.
Figure 30 is used for the image and sound recording regenerating unit in second embodiment of the invention is carried out the flow chart that the action of sound convergent-divergent when handling describes.
Figure 31 is the block diagram of other configuration examples of the sound convergent-divergent handling part in the image and sound recording regenerating unit of expression second embodiment of the invention.
Figure 32 represents to be used for to obtain the head transfer functions determinator of the head transfer functions of being used by other configuration examples of the sound convergent-divergent handling part of Figure 31.
Figure 33 is illustrated in the sectional view of manually wearing microphone that uses in the head transfer functions determinator of Figure 32.
Figure 34 represents to be measured by the head transfer functions determinator of Figure 32 the characteristic of the head transfer functions that obtains.
Figure 35 represents to be measured by the head transfer functions determinator of Figure 32 the characteristic of the head transfer functions that obtains.
Figure 36 represents to be measured by the head transfer functions determinator of Figure 32 the characteristic of the head transfer functions that obtains.
Figure 37 represents to be measured by the head transfer functions determinator of Figure 32 the characteristic of the head transfer functions that obtains.
Figure 38 represents to be measured by the head transfer functions determinator of Figure 32 the characteristic of the head transfer functions that obtains.
Figure 39 represents to be measured by the head transfer functions determinator of Figure 32 the characteristic of the head transfer functions that obtains.
Figure 40 is the flow chart that is used for illustrating the action when the image and sound recording regenerating unit of second embodiment of the invention uses other configuration examples of sound convergent-divergent handling part shown in Figure 31 to carry out the processing of sound convergent-divergent.
Figure 41 is the block diagram of inside configuration example of the image and sound recording regenerating unit of expression third embodiment of the invention.
Figure 42 is the block diagram of configuration example of the sound convergent-divergent handling part in the image and sound recording regenerating unit of expression third embodiment of the invention.
Figure 43 is the block diagram of other configuration examples of the sound convergent-divergent handling part in the image and sound recording regenerating unit of expression third embodiment of the invention.
Figure 44 is the block diagram of inside configuration example of the image and sound recording regenerating unit of expression four embodiment of the invention.
Figure 45 is the block diagram of configuration example of the sound convergent-divergent handling part in the image and sound recording regenerating unit of expression four embodiment of the invention.
Figure 46 is the flow chart that is used for illustrating that artificial (manual) sound convergent-divergent of the image and sound recording regenerating unit of four embodiment of the invention is handled.
Figure 47 is the block diagram of inside configuration example of the image and sound recording regenerating unit of expression fifth embodiment of the invention.
Figure 48 is the block diagram of configuration example of the sound convergent-divergent handling part in the image and sound recording regenerating unit of expression fifth embodiment of the invention.
Figure 49 is the outward appearance oblique view of the image and sound recording regenerating unit of expression sixth embodiment of the invention.
Figure 50 is the block diagram of inside configuration example of the image and sound recording regenerating unit of expression sixth embodiment of the invention.
Figure 51 is the plane graph of structure example of the code resettlement section in the image and sound recording regenerating unit of expression sixth embodiment of the invention.
Figure 52 is the outward appearance oblique view of the image and sound recording regenerating unit of expression seventh embodiment of the invention.
Figure 53 is the block diagram of inside configuration example of the image and sound recording regenerating unit of expression seventh embodiment of the invention.
Figure 54 is the radio-type dual track microphone in the image and sound recording regenerating unit of expression seventh embodiment of the invention and the block diagram of the concrete configuration example of radio receiving and transmitting part.
Figure 55 is the key diagram of the warning when radio-type dual track microphone surpasses can communicate by letter regional in the image and sound recording regenerating unit of seventh embodiment of the invention.
Figure 56 is illustrated in the image and sound recording regenerating unit of seventh embodiment of the invention, when radio-type dual track microphone surpasses can communicate by letter regional on display part the example of shown warning label.
Figure 57 is the flow chart of action that is used to illustrate the image and sound recording regenerating unit of seventh embodiment of the invention.
Embodiment
Following with reference to accompanying drawing to video-audio recording apparatus of the present invention and method, and the execution mode of video-audio reproducing apparatus and method describes.
First execution mode
Fig. 1 is the oblique view of outward appearance configuration example of the image and sound recording regenerating unit 101 of expression first execution mode.
Image and sound recording regenerating unit 101 shown in Figure 1 has image pickup part 11, display part 17, built-in stereophony microphone 21a, 21b and external microphone splicing ear 32.Externally connect removably on the microphone splicing ear 32 and have the left and right sides microphone 31a of astaticism and the earpiece type dual track microphone 3 of 31b.At this, illustrate the state that dual track microphone 3 is connected with external microphone splicing ear 32. Microphone 31a, 31b have oscillating plate in inside.As the back was described in detail, image and sound recording regenerating unit 101 can optionally use the photography (sound is included) of built-in stereophony microphone 21a, 21b and use the photography (sound is included) of dual track microphone 3.In addition, so-called photography singly is not that the image of object is made a video recording, and also represents sometimes the image of object is made a video recording and collected sound to comprising the sound that object sends at the sound around the interior cameraman.
State when Fig. 2 represents that cameraman 300 uses 101 pairs of image and sound recording regenerating units not have illustrated object to make a video recording.Using alliteration to say microphone 3 when collecting sound object is made a video recording, as shown in Figure 2, cameraman 300 wears left and right sides microphone 31a, 31b on left and right sides ear 302.Thus, the dual track acoustic characteristic that the position relation of the head 30 by cameraman 300 and microphone 31a, 31b gives collects sound to comprising the sound of sound around interior cameraman 300 that object sends.The cameraman 300 supervision image of the visual object that shows on display part 17 on one side makes a video recording by 11 pairs of objects of image pickup part on one side, and collects sound by 3 pairs of sound on every side of dual track microphone.As the back is described in detail, there are not to write down in the illustrated recording medium signal of video signal of taking by image pickup part 11 and the voice signal that collects sounds by dual track microphone 3 here.In addition, as the back is described in detail, with the signal of video signal that writes down in the recording medium with look the hearer just like in the photography environment identical, carry out the sound with presence of audiovisual and regenerate with cameraman 300.
Fig. 3 is the block diagram that the concrete inside of expression image and sound recording regenerating unit 101 constitutes example.
Image and sound recording regenerating unit 101 possesses: image pickup part 11, video coding portion 12, multiplexed 13, record regenerating portion 14, separated part 15, video decode portion 16, display part 17, built-in stereophony microphone 21 (21 is the general name of 21a and 21b), audio coding portion 22, audio decoder portion 26, crosstalk arrester 27, external microphone splicing ear 32, flag information obtaining section 36, image output terminal 37a, voice output terminal 37b, joint detection portion 41, flag information generating unit 42, recording medium 44, control part 47, operating portion 48, and switch Sw1, Sw2, Sw3.In addition, as recording medium 44, can be the recording medium that disc-shaped recording medium or cassette tape etc. can dismounting, also can be as hard disk, be installed in the recording medium in the image and sound recording regenerating unit 101 in advance.
On image output terminal 37a, connect TV and be subjected to displays 52 such as camera.On voice output terminal 37b, connect loud speaker 53,54 via amplifier 51.Listen to by looking hearer 59 from the sound that loud speaker 53,54 sends.In addition, in Fig. 3, for convenience, illustrate cameraman 300 simultaneously and look hearer 59, but can carry out cameraman 300 photography respectively usually and look the audiovisual of hearer 59 regeneration video-audio.
(operation of recording)
Operation of recording to image and sound recording regenerating unit 101 describes.
At first, when cameraman 300 showed the operation of initial setting image (window) of acoustic pattern by operating portion 48, as an example, control part 47 showed initial setting image 170 shown in Figure 4 on display part 17.As external microphone, any in connection layout 1 on the microphone splicing ear 32, dual track microphone 3 illustrated in fig. 2 and the common external microphone externally.By the predetermined operation of operating portion 48, cameraman 300 selects " dual track " when using dual track microphone 3 to collect sound as shown in Figure 4; When using common external microphone to collect sound, select " common ".Control part 47 becomes the configuration part, and the microphone beyond this configuration part setting dual track microphone 3 or the dual track microphone is used as being connected with external microphone splicing ear 32 external microphone of use.Selecting " dual track " as external microphone input, and when the plug that joint detection portion 41 detects external microphone has inserted in the external microphone splicing ear 32,47 pairs of each ones of circuit of control part control, and image and sound recording regenerating unit 101 is carried out and the corresponding operation of recording of photography of having used dual track microphone 3.Using 3 pairs of sound on every side of dual track microphone to collect sound, come the acoustic pattern that the voice signal that the collection sound obtains writes down is called the dual track pattern.Using built-in stereophony microphone 21 or common external microphone that sound is on every side collected sound, come the acoustic pattern that the voice signal that the collection sound obtains writes down is called general mode.
In addition, if the plug of dual track microphone 3 is made the shape of dual track microphone 3 special uses different with common external microphone, external microphone splicing ear 32 is made the splicing ear of dual track microphone 3 special uses, then can omit the initial setting of described acoustic pattern.
Specifically, in Fig. 3, when on joint detection portion 41 detects externally microphone splicing ear 32, having connected external microphone, detection signal is offered control part 47.Be illustrated in the detection signal that has connected external microphone under the state that external microphone is set at " dual track " if control part 47 is received, then switch Sw1 switched to the terminal b of input from the voice signal of dual track microphone 3 from input from the terminal a of the voice signal of built-in stereophony microphone 21.Thus, the voice signal from dual track microphone 3 is offered audio coding portion 22.Switch Sw1 is a switching part, and its switching is to use the dual track microphone on the ear 302 that is worn on cameraman 300 also to be to use dual track microphone microphone in addition to be used as sound is on every side collected the microphone of sound.
In addition, control part 47 is controlled, and makes flag information generating unit 42 generate and export the flag information (dual track marking signal) of expression dual track pattern.The dual track marking signal is inputed to multiplexed 13.
When so having carried out the setting of dual track pattern, control part 47 preferably makes display part 17 display lists be shown as mark into the dual track pattern.The example of expressive notation in Fig. 5.Mark 171 shown in Fig. 5 (A) is marks that the cameraman 300 of dual track microphone 3 is used in the expression of pattern ground, and the mark 172 shown in Fig. 5 (B) is marks of the loud speaker regeneration of pattern ground expression dual track sound.As the mark of expression dual track pattern, can use some marks of Fig. 5 (A), (B), can also use other mark certainly.These marks are as long as finish the back or externally connected under the situation of dual track microphone 3 on the microphone splicing ear 32 at described initial setting, and the image overlap ground of taking with image pickup part 11 shows on display part 17 just can.Show that by such mark cameraman 300 can confirm whether acoustic pattern to be set at the dual track pattern when using dual track microphone 3.When acoustic pattern was the dual track pattern, control part 47 became the display control unit that makes display part 17 show the dual track mark (mark 171,172) of expression dual track pattern.
Having worn 11 pairs of objects of cameraman's 300 usefulness image pickup parts of left and right sides microphone 31a, the 31b of dual track microphone 3 on left and right sides ear 302 makes a video recording.Be transfused to terminal g from the signal of video signal of image pickup part 11 output to (Video processing portion) 12 of video coding portion and switch Sw3.When photographing (record) by image and sound recording regenerating unit 101, switch Sw3 switches to terminal g, is provided for display part 17, the image of display object thing from the signal of video signal of image pickup part 11.Meanwhile, concern, obtain making carry out the voice signal that dual track collection sound obtain of object towards the mesion direction from microphone 31a, 31b according to cameraman 300 head 30 and the position of microphone 31a, 31b.This voice signal inputs to audio coding portion (audio treatment part) 22 via switch Sw1.
At this, when recording medium 44 was described for the situation of DV cassette tape, the signal of video signal of 12 pairs of inputs of video coding portion carried out the A/D conversion, and encoded with the DV compress mode and to generate the coding signal of video signal.The voice signal of 22 pairs of inputs of audio coding portion carries out the A/D conversion, and generates coded sound signal by the Data Position that moves the incompressible voice signal of (shuffling) rearrangement.
According to the signal format of civilian digital VCR standard code the coding signal of video signal of input and coded sound signal and dual track marking signal are carried out time division multiplexing for multiplexed 13 and generate multiplex signal.The multichannel of multiplexed 13 output is taken signal input to record regenerating portion 14.Record regenerating portion 14 with the input multiplex signal according in recording medium 44, carrying out record by the record format of civilian digital VCR standard code.To at length record and narrate the concrete recording method of dual track marking signal in the back.
At this, the variation of dual track microphone 3 is described.
(variation of dual track microphone 3)
Fig. 6 (A) has represented the first variation microphone 31c of microphone 31a, 31b, and Fig. 6 (B) has represented the second variation microphone 31d of microphone 31a, 31b.Microphone 31c shown in Fig. 6 (A) possesses the microphone maintaining part 312 in the ear 302 that inserts cameraman 300; And connect the microphone resettlement section 311 that is provided with, microphone assemblies such as oscillating plate are accommodated with the top of microphone maintaining part 312.By making dual track microphone 3 become the microphone 31c of microphone resettlement section 311 and microphone maintaining part 312 separated structures,, also can hear sound well from the outside even cameraman 300 has on dual track microphone 3.Microphone 31d shown in Fig. 6 (B) is provided with microphone resettlement section 311 with the bottom of microphone maintaining part 312 with connecting mutually.Microphone 31d also has the effect identical with microphone 31c.
Fig. 7 is the oblique view of the concrete configuration example of expression microphone maintaining part 312.Represented to be provided with the microphone 31c of Fig. 6 (A) of microphone resettlement section 311 here on the top of microphone maintaining part 312.Microphone maintaining part 312 shown in Fig. 7 (A) has formed the continuant orifice 313a of the taper that diminishes along the direction diameter to ear 302 inside on maintaining part self 312a.Microphone maintaining part 312 shown in Fig. 7 (B) has formed columned sound hole 313b on maintaining part self 312a.Maintaining part self 312a of Fig. 7 (A) is inserted in cameraman 300 the ear 302 easily, and maintaining part self 312b of Fig. 7 (B) has the little characteristic of external voice decay when wearing.
Fig. 8 has represented to make the different example of profile of microphone maintaining part 312 of the microphone 31c of Fig. 6 (A).In Fig. 8, (A) be microphone maintaining part 312 with big profile; (B) be microphone maintaining part 312 with medium profile; (C) be microphone maintaining part 312 with less profile.So, if prepare the microphone maintaining part 312 of a plurality of sizes, then cameraman 300 can select to be fit to the microphone maintaining part of own ear 302 sizes.In addition, in Fig. 8 (A)~(C), the shape of microphone resettlement section 311 (microphone assembly) and size are identical, and the sensitivity of microphone assembly and response frequency characteristic are also identical.
(record of dual track marking signal)
Cameraman 300 to wear dual track microphone 3 and collects dual track sound that sound obtains and collect the sound that sound obtains by built-in stereophony microphone 21 in order to discern, and when carrying out dual track sound set sound the dual track marking signal is recorded in the recording medium 44 with dual track sound.The dual track marking signal is to generate in flag information generating unit 42.
The concrete recording method of the dual track marking signal when at first, recording medium 44 being the DV cassette tape describes.
Fig. 9 is the data format during recording audio data in the DV cassette tape.When the voice data of record 0~89 byte, 0th, 1 byte records synchronizing code, the 2nd~4 byte records ID (identification) code, the 5th~9 byte records audio frequency auxiliary data (AUX), the 10th~81 byte records voice data, the 82nd~89 byte records are used for the ISN parity check data that misdata detects and revises.As an example, if be the dual track pattern, then flag information generating unit 42 is as dual track marking signal output 1, if not dual track pattern (if being general mode) then export 0.Multiplexed 13 signal that generates data format shown in Figure 9.
The concrete recording method of the dual track marking signal when then, recording medium 44 being indicator describes.As indicator, have the red laser of uses such as DVD-RAM, DVD-RW, DVD-R and carry out the dish of record regenerating and the dish that use blue lasers such as Blu-ray dish or HD-DVD carry out record regenerating.At this,, come the situation of overlapping dual track marking signal to describe to using the DVD video standard that in all these indicators, can be suitable for.
As first method of using the DVD video standard to come overlapping dual track marking signal, consider the method for overlapping dual track marking signal in the DVD-Video of DVD video standard zone.
As shown in figure 10, the volume space of dvd standard (Volume space) is made of Volume and Filestructure and DVD-Video zone and DVD others zone.DVD-Video zone contains VMG (Video Manager) and VTS (Video Title Set) #1~DVTS#n.N is the integer of the regulation more than 2.Each VTS comprises control data (Control Data) and VOBS (VideoObject Set) respectively.VOBS comprises a plurality of VOB (Video Object).VOBS comprises a plurality of CELL.CELL comprises a plurality of VOBU (Video Object Unit).VOBU comprises navigation bag (NV_PACK), audio pack (A_PACK), video packets (V_PACK).In the present embodiment, in VOBU, be provided with the packet (D_PACK) that comprises the dual track marking signal.
Packet (D_PACK) comprises packet header (pack header), packets headers (packet header), son stream ID, audio frame information, voice data information and dual track marking signal.The dual track marking signal is made of a plurality of audio frame layers.
So, with the form based on the DVD video standard, the information that will comprise the dual track marking signal forms bag as packet (D_PACK), and it is multiplexed to carry out MPEG.Thus, can keep the interchangeability with the DVD video standard, can learn which audio frame of audio signal partly is the dual track voice signal simultaneously, which audio frame partly is common stereo sound.Therefore, can differentiate simply and which audio frame partly to carry out crosstalk at and eliminate and to be treated to.
As second method of using the DVD video standard to come overlapping dual track marking signal, consider method at the overlapping dual track marking signal of the DVD of DVD video standard others zone.DVD others zone is the zone that the auxiliary data that the data with image, sound self are associated is write down, and also is the user data posting field.
By relatively Figure 10 and Figure 11 as can be known, in the present embodiment, make data structure as the DVD others zone of user data posting field adopt the similar structure of data structure with DVD-Video zone.As shown in figure 11, DVD others zone also comprises various information such as VMG, VTS, VOBS, VOB, CELL, VOBU.For these each information among the DVD others zone shown in Figure 11 are distinguished mutually with the information among Figure 10, adopt the title of having added D at head.
As shown in figure 11, DVD others zone contains DVMG and DVTS#1~DVTS#n.Each DVTS comprises DVTSI (Video Title Set information) and DVOBS respectively.DVOBS comprises a plurality of DVOB.DVOBS comprises a plurality of DCELL.DCELL comprises a plurality of DVOBU.DVOBU comprises a plurality of audio frame layers.The audio frame layer is according to the zone of writing down, with the part zone of this audio frame layer posting field as the dual track marking signal to the audio frame number such as coding parameter of audio signal.
So in DVD others zone, record and narrate the dual track marking signal, make the audio signal (dual track voice signal or common stereo sound) and the dual track marking signal that comprise among the DVD-Video zone interrelated thus with form based on the DVD video standard.Can keep the interchangeability with the DVD video standard, can learn which audio frame of audio signal partly is the dual track voice signal simultaneously, which audio frame partly is common stereo sound.Therefore, can differentiate simply and which audio frame partly to carry out crosstalk at and eliminate and to be treated to.
Under the situation of Figure 10, Figure 11, for example the operation start button is begun to photograph, operate then and collect audio signal that sound obtains during stop button finishes to photograph as one or more audio frame layer, and to the additional sound pattern information of each audio frame layer.Then, in acoustic pattern information, record and narrate the dual track marking signal, manage as the dual track packets of information.By the dual track marking signal is managed as the dual track packets of information, can easily obtain acoustic pattern information to each audio frame.Thus, when regenerative recording medium 44, even at hybrid recording under the situation of dual track voice signal and common stereo sound signal, also can carry out the switching of crosstalk arrester 27 described later rightly by corresponding acoustic pattern.At least when the operation that at every turn begins to photograph, carry out the acoustic pattern recording of information, be more preferably at each official hour and carry out record.
In addition, even for example using as recording medium 44 under the situation of semiconductor memory, define the posting field of dual track marking signal also same as described abovely, and the acoustic pattern of the voice signal of designated recorder, can discern the voice signal of dual track record thus, and can switch on one side the action of crosstalk arrester 27/be failure to actuate, rightly Yi Bian the voice signal of regenerating.
And, also considered in multiplexed layer user data of coded systems such as MPEG, to record and narrate the dual track marking signal.As an example, suppose following situation: use the mobile phone of communication function, the signal of video signal that photographs is transferred to the mobile phone of receiver side with the voice signal that obtains by dual track microphone 3 collection sounds from the mobile phone of transmitter side with image and sound.Under these circumstances, also can be from the mobile phone of transmitter side mobile phone transmission dual track marking signal to receiver side.Transmit by voice signal being given the dual track marking signal, can be at the sound that has presence for regeneration under the situation of dual track sound.At this moment, recording and narrating the dual track marking signal, be used as grouped data and transmit with the image of mobile phone transmission and the assigned position of voice packets (packet) data.For example, during the transmission means in using MPEG-4, stipulated, in the user data posting field of basic stream, can transmit dual track marking signal shown in Figure 9.During the transport stream in using the MPEG-4 standard, stipulated, can (privat data_type) record and narrate the dual track marking signal in the exclusive data zone.
In addition, when image data and voice data are transmitted as file data with the form of annex, also can record and narrate the dual track marking signal in the part of this document head.
Use flow chart shown in Figure 12 that the operation of recording of image and sound recording regenerating unit 101 is further detailed.
At first, control part 47 judges in step S151 whether initial setting illustrated in fig. 4 is will be set at dual track microphone 3 with the external microphone that external microphone splicing ear 32 is connected.If judge that in step S151 initial setting has been set at dual track (YES), then moves to step S152.Be not set at dual track (NO) if judge, then control part 47 switches switch Sw1 to terminal a, and in step S154, image and sound recording regenerating unit 101 is obtained the voice signal from built-in stereophony microphone 21.In step S152, control part 47 judges whether joint detection portion 41 detects the plug that has inserted external microphone in the microphone splicing ear 32 externally.If in step S152, be judged as externally and connected external microphone (YES) on the microphone splicing ear 32, then control part 47 switches switch Sw1 to terminal b, and image and sound recording regenerating unit 101 is obtained the voice signal from dual track microphone 3 in step S153.If in step S152, do not judge externally and connected external microphone (NO) on the microphone splicing ear 32, then control part 47 switches switch Sw1 to terminal a, and image and sound recording regenerating unit 101 is obtained the voice signal from built-in stereophony microphone 21 in step S154.
Then, in step S155, not having in video coding portion 12 temporarily stored the signal of video signal from image pickup part 11 in the illustrated memory, and the voice signal from dual track microphone 3 or built-in stereophony microphone 21 temporarily is stored in not having in the illustrated memory in the audio coding portion 22.In step S156,12 pairs of signal of video signal of video coding portion are encoded, and 22 pairs of voice signals of audio coding portion are encoded.In step S157, the coding signal of video signal temporarily is stored in not having in the illustrated buffer storage in the video coding portion 12, coded sound signal temporarily is stored in not having in the illustrated buffer storage in the audio coding portion 22.And in step S158, flag information generating unit 42 indication according to control part 47 when being the dual track pattern generates the dual track marking signal.
Then, multiplexed 13 is carried out multiplexedly in step S159 to coding signal of video signal and coded sound signal and dual track marking signal, generate the stream of packets signal in step S160.In step S161, record regenerating portion 14 is the recorded stream signal in recording medium 44.In step S162, video coding portion 12 and audio coding portion 22 judge whether to exist signal of video signal and the voice signal that encode, if be judged as signal of video signal and voice signal (YES) that still existence should be encoded, then return step S152 and repeat above action.If do not judge signal of video signal and voice signal (NO), then end process that still existence should be encoded.
(regeneration action)
Return Fig. 3, the regeneration of image and sound recording regenerating unit 101 action is described.In Fig. 3, when to operating portion 48 do not have illustrated regeneration button to operate the time, record regenerating portion 14 is according to the control of control part 47, regeneration is as the multiplex signal of the tracer signal of record in the recording medium 44.Multiplex signal by 14 regeneration of record regenerating portion is provided for separated part 15, and separated part 15 is separated into coding signal of video signal, coded sound signal and dual track marking signal with the multiplex signal of input.
The signal of video signal of will encoding offers video decode portion (Video processing portion) 16, and coded sound signal is offered audio decoder portion (audio treatment part) 26, and the dual track marking signal is offered flag information obtaining section 36.The coding signal of video signal of 16 pairs of inputs of video decode portion is decoded and is obtained signal of video signal.Follow the operation of regeneration button, switch Sw3 is switched to terminal h according to the control of control part 47.On display part 17, show signal of video signal, and offer display 52 by image output terminal 37a and show by 16 outputs of video decode portion.The coded sound signal of 26 pairs of inputs of audio decoder portion is decoded and is obtained voice signal.Voice signal is offered the terminal c of crosstalk arrester 27 and switch Sw2.
Crosstalk arrester 27 generates the signal that is used to offset the first crosstalk composition and the second crosstalk composition and adds to voice signal, generate the crosstalk processing signals thus, the described first crosstalk composition and the second crosstalk composition be when the voice signal that is obtained by 53,54 pairs of dual track collection sounds of loud speaker is regenerated, produce, listener 59 listens to the crosstalk composition of the signal that sends from the loud speaker 54 in left side and listener 59 listen to the signal that sends from the loud speaker 53 on right side by the ear in left side crosstalk composition by the ear on right side.Flag information obtaining section 36 is preserved the dual track marking signal that is obtained by separated part 15.Whether control part 47 has preserved the dual track marking signal according to flag information obtaining section 36 is come switch Sw2 is carried out switching controls.Promptly, if flag information obtaining section 36 has been preserved the dual track marking signal, then control switch Sw2 is connected with terminal d, crosstalk processing signals from crosstalk arrester 27 is exported voice output terminal 37b, if do not preserve the dual track marking signal, then control switch Sw2 is connected with terminal c, the voice signal that crosstalk handles that do not carry out from audio decoder portion 26 is exported voice output terminal 37b.
The voice signal that amplifies from voice output terminal 37b output and by amplifier 52 from about loud speaker 53,54 send.When the voice signal from voice output terminal 37b output is crosstalk processing signals from 27 outputs of crosstalk arrester, look hearer 59 and can watch on display 52 image that shows on one side, listen on one side as the sound with presence, the cameraman when cameraman 300 photographs 300 sound on every side.At this moment, crosstalk arrester 27 uses head transfer functions described later to offset the crosstalk composition, even so in cameraman 300 situation different with looking hearer 59, cameraman 300 photographs, looks arbitrarily hearer 59 and carries out under the situation of audiovisual arbitrarily, also can listen to the sound with presence that does not almost have inharmonic sensation.
Use flow chart shown in Figure 13 that the regeneration action of image and sound recording regenerating unit 101 is further detailed.
In Figure 13, in step S181, record regenerating portion 14 regenerative recording medium 44 obtain the stream signal based on multiplex signal.In step S182, record regenerating portion 14 convection current signals are decoded and are generated packet signal.In step S183, separated part 15 is separated into vision signal, audio signal and dual track marking signal with packet signal.In step S184,16 pairs of decoding video signals of video decode portion, 26 pairs of audio signals of audio decoder portion are decoded.In step S185, video decode portion 16 and audio decoder portion 26 temporarily are stored in decoded vision signal and audio signal not to be had in the illustrated buffer storage.In step S186, flag information obtaining section 36 obtains the dual track marking signal.
Then, in step S187, the dual track marking signal that control part 47 is obtained according to flag information obtaining section 36 judges that the audio signal of regeneration is common stereo sound signal or dual track voice signal.In step S187, judge (YES) when being the dual track voice signal, move to step S188.When in step S187, judging (NO) when not being the dual track voice signal, move to step S191, each one of control part 47 control circuits switches to terminal c with switch Sw2, synchronizing regeneration vision signal and audio signal.
If be the dual track voice signal, then in step S188, control part 47 switches to terminal d with switch Sw2, makes the crosstalk of crosstalk arrester 27 eliminate the effective state that is in thus.In step S189, control part 47 control circuits each ones is so that synchronizing regeneration vision signal and carried out crosstalk by crosstalk arrester 27 and eliminated the audio signal of handling.In step S190, if be judged as vision signal and the audio signal (YES) that still existence should be regenerated, then return step S182, repeat above action.If in step S190, do not judge vision signal and audio signal (NO), then end process that still existence should be regenerated.
(crosstalk elimination)
At this, use Figure 14 that the concrete structure and the action of crosstalk arrester 27 are described.As shown in figure 14, crosstalk arrester 27 possesses filter 272a~272d, adder 274a, 274b and filter 275a, 275b.
In Figure 14, in the dual track voice signal, the signal P of L channel L(t) be transfused to the signal P of R channel to filter 272a, 272b R(t) be transfused to filter 272c, 272d.Store in order based on head transfer functions h described later among filter 272a~272d Rs(t), h Lo(t), h Ro(t) and h Ls(t) filter characteristic of Sheng Chenging (filter coefficient). Filter 272a, 272d have and head transfer functions h Rs(t), h Ls(t) identical filter characteristic, filter 272b, 272c have and head transfer functions h Lo(t), h Ro(t) the identical filter characteristic of characteristic after the counter-rotating.For convenience, the filter characteristic with filter 272a~272d is labeled as h Rs(t) ,-h Lo(t) ,-h Ro(t) and h Ls(t).Filter 272a~272d is to the signal P of input L(t), P R(t) filter characteristic of giving is separately exported.
Adder 274a carries out add operation to the signal from filter 272a, 272c output, and the signal of filter 275a after to add operation given the filter characteristic that becomes d (t).Adder calculator 274b carries out add operation to the signal from filter 272b, 272d output, and the signal of filter 275b after to add operation given the filter characteristic that becomes d (t).
At this, the filter characteristic d that in filter 275a, 275b, stores (t) for below (1) formula.
d(t)={h ls(t)×h rs(t)-h lo(t)×h ro(t)} -1 …(1)
From the signal of filter 275a, 275b output is in order to have carried out crosstalk and to eliminate the crosstalk processing signals of handling eliminating crosstalk when loud speaker 53,54 is sounded.Amplify by the L channel amplifier 51a and the R channel amplifier 51b of amplifier 51 respectively from the crosstalk processing signals of filter 275a, 275b output, send by loud speaker 53,54.
The signal (sound) that sends from loud speaker 53 is looked left ear of the hearer 59 and is listened to, and the auris dextra that the second crosstalk signal represented as dotted line of the part of the signal that sends is looked hearer 59 is listened to.Crosstalk arrester 27 generates and is used to offset the first crosstalk offseting signal of looking the first crosstalk signal that hearer 59 listens to by auris dextra and sounds from loud speaker 54.Offset (decay) first crosstalk signal by the first crosstalk offseting signal.Similarly, the signal (sound) that sends from loud speaker 54 is looked hearer 59 auris dextra and is listened to, and the left ear that the second crosstalk signal represented as dotted line of the part of the signal that sends is looked hearer 59 is listened to.Crosstalk arrester 27 generates and is used to offset the second crosstalk offseting signal of looking the second crosstalk signal that hearer 59 listens to by left ear and sounds from loud speaker 53.Offset (decay) second crosstalk signal by the second crosstalk offseting signal.As a result, look hearer 59 and listen to the voice signal P that has eliminated crosstalk by left ear l(t), listen to the voice signal P that has eliminated crosstalk by auris dextra r(t).
(mensuration of head transfer functions)
Use Figure 15 that the head transfer functions determinator 6 of the head transfer functions characteristic that is used for obtaining filter 272a~272d, 275a, 275b and stores is described.As shown in figure 15, head transfer functions determinator 6 has personal computer 61, amplifier 62, loud speaker 63, microphone assembly 65a, 65b, cylindrical configuration body 65e and amplifier 66a, 66b.
Assay method to head transfer functions describes.
At first, generate the measured signal that for example forms by personal computer 61 by the pulse sound.Measured signal is amplified by amplifier 62.By about microphone assembly 65a, 65b receive the measured signal of sending from the loud speaker 63 in left side.Signal about receiving is amplified by amplifier 66a, 66b, inputs to personal computer 61 then.This input signal is by being installed in the head transfer functions h that left and right sides microphone assembly 65a, 65b on the cylindrical configuration body 65e receives the signal of the sound that sends from loud speaker 63 Ls(t), h Lo(t).Head transfer functions h Ls(t) be the characteristic that microphone assembly 65a by the left side listens to the signal that sends from the loud speaker 63 in left side, head transfer functions h Lo(t) be the characteristic that microphone assembly 65b by the right side listens to the crosstalk composition of the signal that sends from the loud speaker 63 in left side.
Similarly, by about microphone assembly 65a, 65b receive the measured signal of sending from the loud speaker 64 on right side.Signal about receiving is amplified by amplifier 66a, 66b, inputs to personal computer 61 then.Personal computer 61 compares measured signal that generates and the signal that receives, and obtains thus by being installed in the head transfer functions h that left and right sides microphone assembly 65a, 65b on the cylindrical configuration body 65e receives the signal of the sound that sends from loud speaker 64 Rs(t), h Ro(t).Head transfer functions h Rs(t) be the characteristic that microphone assembly 65b by the right side listens to the signal that sends from the loud speaker 64 on right side, head transfer functions h Ro(t) be the characteristic that microphone assembly 65a by the left side listens to the crosstalk composition of the signal that sends from the loud speaker 64 on right side.
Use Figure 16 that cylindrical configuration body 65e is described.In Figure 16, (A) be the vertical view of cylindrical configuration body 65e, (B) be the stereogram of cylindrical configuration body 65e, (C) be the so-called sectional view of manually wearing microphone of expression in order to compare.
Shown in Figure 16 (A), (B), microphone assembly 65a, 65b have been installed in 180 degree at the spaced surface of cylindrical configuration body 65e.As shown in the figure, microphone assembly 65a, 65b do not have auricle and external auditory meatus.The illustrated oscillating plate that do not have of microphone assembly 65a, 65b is configured in position surperficial unanimous on the whole with cylindrical configuration body 65e.On the other hand, the microphone 69 of manually wearing shown in Figure 16 (C) possesses auricle shape parts 692a, 692b and duct 693a, 693b in the both sides of artificial head 691, possesses microphone assembly 694a, 694b in the inside of duct 693a, 693b.Microphone assembly 694a, 694b are set on the position of the eardrum position that is equivalent to the people, and the approximate voice signal of being listened to the people of sound is collected sound.
The receiving feature that is installed in microphone assembly 65a, 65b on the cylindrical configuration body 65e shown in Figure 16 (A), (B) is can not be subjected to because the influence of the property difference that everyone size or variform auricle or external auditory meatus cause and can measure the characteristic of head transfer functions.Though the sound wave that sends from loud speaker 63,64 is subjected to blocking of cylindrical configuration body 65e, can arrive microphone assembly 65a, 65b along cylindrical configuration body 65e diffraction.Microphone assembly 65a, 65b measure the characteristic that is formed by sound wave that sends from loud speaker 63,64 and directly arrive and the sound wave that arrives along cylindrical configuration body 65e diffraction.By using cylindrical configuration body 65e can obtain having the head transfer functions that average head blocks characteristic.Therefore, even the size of head or shape difference, head block the different various hearers of looking of characteristic and listen to the dual track voice signal, can uppick almost there be the sound with presence of inharmonic sensation yet.
Figure 17 (A)~(D) has represented on the pulse sound that described acoustic signal transmission characteristic determinator 6 the generates convolution head transfer functions h of the cylindrical configuration body 65e that is measured to by acoustic signal transmission characteristic determinator 6 Ls(t), h Lo(t), h Rs(t), h Ro(t) impulse response waveform after.Figure 17 (D) has represented the filter characteristic d (t) shown in described (1) formula.In Figure 17 (A)~(E), the longitudinal axis is represented the amplitude of the normalized signal voltage of output voltage with regulation, the represented time of number of the sample when transverse axis is represented with 48kHz the signal that is measured to be sampled.
Figure 18 (A)~(E) expression is carried out the frequency characteristic that Fourier analysis obtains to the signal shown in Figure 17 (A)~(E).In Figure 18 (A)~(E), represent 100Hz by the ordinate of dotted line, 1kHz, the frequency location of 10kHz, the gain inequality between two dotted line horizontal lines in the response characteristic of the longitudinal axis is 10dB.
Filter 272a~272d to Figure 14 gives the head transfer functions h that obtains like that based on above-mentioned Rs(t), h Lo(t), h Ro(t), h Ls(t) filter characteristic.As mentioned above, filter 272a, 272d are given be equivalent to head transfer functions h Rs(t), h Ls(t) filter characteristic is given filter 272b, 272c and to be equivalent to head transfer functions h Lo(t), h Ro(t) carry out after the polarity inversion-h Lo(t) ,-h Ro(t) filter characteristic.
In order to compare, expression replaces the microphone assembly 65a, the 65b that install on cylindrical configuration body 65e in Figure 19, Figure 20, uses manually to wear the characteristic that microphone 69 is measured to shown in Figure 16 (C).Figure 19 measures the characteristic that obtains in the same manner with Figure 17.By Figure 17 and Figure 19 are compared as can be known, using the microphone assembly 65a that installs on cylindrical configuration body 65e, impulse response waveform that 65b is measured to is to wear impulse response waveform that microphone 69 is measured to end user foreman to compare more approaching mensuration be the waveform of pulse sound with input signal.
Figure 20 is to use and manually wears the frequency response characteristic that microphone 69 is measured to.By Figure 18 and Figure 20 are compared as can be known, use cylindrical configuration body 65e to go up the microphone assembly 65a that installs, the characteristic that 65b obtains, the fluctuation of its frequency characteristic is little, near smooth.Each characteristic shown in Figure 20 (A)~(E) has the enhancing of response characteristic and weakens in the frequency of 1.5~7kHz.Characteristic shown in Figure 18 (A)~(E), the enhancing of its response characteristic and the degree of decay are little.Its reason is, do not exist because the fluctuation of the characteristic that auricle or external auditory meatus cause when the microphone assembly 65a, the 65b that install on using cylindrical configuration body 65e.By manually wearing microphone 69, the part of the sound wave that is sent by loud speaker 63,64 is by auricle reflex, and the sound wave of reflection and the sound wave that directly arrives increase with being combined to the back, and perhaps anti-phase weakens after synthetic.In addition, because the influence of the resonance of external auditory meatus and antiresonance etc. increases or decay with specific frequency.Harmful effect when wearing microphone 69 by using cylindrical configuration body 65e to go up microphone assembly 65a, the 65b that installs, can suppressing end user foreman.
Filter 272a~the 272d of crosstalk arrester 27 and filter 275a, 275b are given based on the filter characteristic (being made as first condition) of the microphone assembly 65a that uses cylindrical configuration body 65e to go up to install, head transfer functions that 65b is measured to and the filter characteristic (being made as second condition) of wearing the head transfer functions that microphone 69 is measured to based on end user foreman, carried out a plurality of comparison the two audiovisual tests that the hearer participates in of looking.Insert tiny lapel mike in looking hearer's duct, the sound that this lapel mike is received is as having measured receiving feature by the sound of looking hearer's uppick.
Figure 21 has represented to look the characteristic that the hearer is measured to by certain under first condition.In Figure 21, (A) be input signal P with the left side L(t) as pulse signal, with the input signal P on right side R(t) send from loud speaker 53,54 as no signal, and the waveform of the impulse response signal that receives by the lapel mike of installing in the left ear of looking the hearer.(B) be sounding under the condition identical with (A), and the waveform of the crosstalk composition that receives by the lapel mike of installing in the auris dextra of looking the hearer.In Figure 21 (A), obtain the impulse response waveform of big level, in Figure 21 (B), obtained the waveform of less level.Figure 21 (C) is the frequency analysis result of response wave shape, and Ca is that the response wave shape to (A) has carried out the response characteristic after the frequency analysis, and Cb is that the response wave shape to (B) has carried out the response characteristic after the frequency analysis.In the scope of 100Hz~2kHz, obtained surpassing the crosstalk elimination effect of 20dB.
And, in Figure 21, (D) be input signal P with the left side L(t) as no signal, with the input signal P on right side R(t) send from loud speaker 53,54 as pulse signal, and the waveform of the crosstalk composition that receives by the lapel mike of installing in the left ear of looking the hearer.(E) be sounding under the condition identical with (D), and the waveform of the impulse response that receives by the lapel mike of installing in the auris dextra of looking the hearer.In Figure 21 (D), obtain the waveform of less level, in Figure 21 (E), obtained the impulse response waveform of big level.Figure 21 (F) is the frequency analysis result of response wave shape, and Fd is that the response wave shape to (D) has carried out the response characteristic after the frequency analysis, and Fe is that the response wave shape to (E) has carried out the response characteristic after the frequency analysis.The crosstalk that has obtained in the scope of 100Hz~2kHz about 16dB is eliminated effect.
Figure 22 is illustrated under the second condition, by look the characteristic that hearer be measured to identical with Figure 21.Condition determination is identical with Figure 21.The crosstalk of Figure 22 (C) is eliminated effect and is approximately 14dB, and the crosstalk of Figure 22 (F) is eliminated effect and is approximately 11dB.Under second condition, compare effect with first condition and worsen as can be known.
Figure 23 has represented to change the look hearer different with Figure 21, Figure 22 into, under first condition, and the characteristic that under the condition determination identical, is measured to Figure 21.The crosstalk of Figure 23 (C) is eliminated effect and is approximately 22dB, and the crosstalk of Figure 23 (F) is eliminated effect and is approximately 18dB.Even different look the hearer, also can obtain good effect.
Figure 24 represented by the look hearer identical with Figure 23, under second condition, and the characteristic that under the condition determination identical, is measured to Figure 22.The crosstalk of Figure 24 (C) is eliminated effect and is approximately 14dB, and the crosstalk of Figure 24 (F) is eliminated effect and is approximately 10dB.Even different as can be known look the hearer, under second condition, compare effect and worsen with first condition.And, also carry out same mensuration by the different hearers that looks, confirmed under first condition and second condition, to obtain respectively effect same as described above.
According to above measurement result, can confirm: at filter 272a~272d and filter 275a crosstalk arrester 27, in the filter characteristic that 275b gives, compare with the filter characteristic of wearing the head transfer functions that microphone 69 is measured to based on end user foreman, based on using cylindrical configuration body 65e to go up the microphone assembly 65a that installs, the filter characteristic of the head transfer functions that 65b is measured to, it offsets the loud speaker sounding from the left side, the crosstalk composition of listening to by the ear on right side and from the loud speaker sounding on right side, the effect of the crosstalk composition of being listened to by the ear in left side is bigger.
In the filter characteristic of the microphone assembly 65a that installs on based on use cylindrical configuration body 65e, the head transfer functions that 65b is measured to, the fluctuation of the frequency characteristic of high audio frequency range is few.That is,, frequency characteristic can be reduced, the deterioration of tonequality can be done one's utmost to suppress in the situation that specific frequency part descends or rises significantly by using cylindrical configuration body 65e.Thus, look the hearer and can hear almost there is not sound inharmonic sensation, that have presence.
When the filter characteristic that filter 272a~272d and filter 275a, the 275b of crosstalk arrester 27 given as based on the filter characteristic of the microphone assembly 65a that uses cylindrical configuration body 65e to go up to install, head transfer functions that 65b is measured to the time, the tectosome when listening to the dual track voice signal is looked and is carried out crosstalk near hearer 59 the external auditory meatus inlet and eliminate.Therefore, look hearer 59, carry out the counteracting of crosstalk composition effectively at different a plurality of of the acoustic characteristic of auricle or external auditory meatus.
In addition, cylindrical configuration body 65e is not limited to perfect cylindrical shape, also can be cylindrical shape has been carried out shape after some distortion.Desired shapes is not cause the shape of the jog of response characteristic variation as auricle or external auditory meatus, is preferably under the state that microphone assembly 65a, 65b have been installed on the cylindrical configuration body 65e, and the fluctuation of response characteristic is as far as possible little.
Crosstalk arrester 27 is not limited to structure shown in Figure 14, also can be the band segmentation type crosstalk arrester that can further reduce the anti-phase sense that produces in low-frequency band.Band segmentation type crosstalk arrester is divided into low frequency signal and medium-high frequency signal to the dual track voice signal of importing as the signal of whole frequency band, only medium-high frequency dual track voice signal is carried out crosstalk and eliminates processing.
In Figure 25, represented band segmentation type crosstalk arrester 27a, his structure and action described.Give identical symbol to having, and omit its explanation with the component part of crosstalk arrester 27 identical functions shown in Figure 14.
As shown in figure 25, crosstalk arrester 27a compares with the crosstalk arrester 27 of Figure 14, and difference is to have appended low pass filter (LPF) 271a, 271d; High pass filter (HPF) 271b, 271c; Delayer 273a, 273b; Gain-controlled amplifier (GC) 276a~276d; And adder 277a, 277b.
In dual track voice signal to crosstalk arrester 27a input, the signal P of L channel L(t) be transfused to the signal P of R channel to LPF271a and HPF271b R(t) be transfused to LPF271d and HPF271c, be divided into low frequency and medium-high frequency.The cut-off frequency of LPF271a, 271d and HPF271b, 271c is set to about 100~200Hz.
By HPF271b, the medium-high frequency signal of 271c output by with crosstalk arrester 27 have same structure, by filter 272a~272d, adder 274a, 274b and filter 275a, the circuit part that 275b forms carry out crosstalk to be eliminated and handles.The medium-high frequency signal that has carried out after the crosstalk elimination is handled is imported among gain-controlled amplifier 276b, the 276c, adjusts gain.
Low frequency signal by LPF271a, 271d output is imported among delayer 273a, the 273b, postpones to handle the time much at one needed time with eliminating at the crosstalk of the signal of medium-high frequency band.Low frequency signal by delayer 273a, 273b output is imported among gain-controlled amplifier 276a, the 276d, and the level difference with the medium-high frequency signal is eliminated in the adjustment gain.
Then, adder 277a, 277b will carry out addition from the low frequency signal and the medium-high frequency signal of gain-controlled amplifier 276a~276d output.By the signal of adder 277a, 277b output is that only the centering high-frequency signal has been implemented crosstalk and eliminated crosstalk processing signals after handling.Amplify by the L channel amplifier 51a and the R channel amplifier 51b of amplifier 51 respectively from the crosstalk processing signals of adder 277a, 277b output, and by loud speaker 53,54 sounding.
In the structure of this Figure 25, eliminate processing because low frequency signal is not implemented crosstalk, there is not the signal of anti-phase sense so in low-frequency band, regenerate.
In crosstalk arrester 27 shown in Figure 14, as by above-mentioned Figure 21~Figure 24 explanation, the crosstalk of failing to obtain fully based on crosstalk arrester 27 below 100Hz is eliminated effect.The following low-frequency band of 100Hz is the less frequency band of influence that the location of source of sound is brought.Do not eliminate the signal of the signal of crosstalk, can feel inharmonious when listening to as anti-phase.
In crosstalk arrester 27a shown in Figure 25, do not carry out crosstalk in the low-frequency band below 100~200Hz and eliminate, realized in low-frequency band, not producing the crosstalk arrester of inverse phase signal.
In Figure 26, represented to compare the different band segmentation type crosstalk arrester 27b of constructive method of filter, and its formation and action have been described with Figure 25.To giving identical symbol, and omit its explanation with the component part that crosstalk arrester 27a shown in Figure 25 has an identical function.
Crosstalk arrester 27b shown in Figure 26 compares with the crosstalk arrester 27a of Figure 25, and difference is, replaces filter 272a~272d and filter 275a, 275b to possess filter 278a, 278b and filter 279a.79b, and method of attachment is also different.(finite Impules response: finite impulse response (FIR)) type has constituted filter characteristic, and crosstalk arrester 27b has constituted filter characteristic with the FIR type of feedback-type with the FIR of feed-forward type at crosstalk arrester 27a.
In Figure 26, the medium-high frequency signal of being exported by HPF271b, 271c is performed crosstalk elimination processing at FIR mode filter 278a, 278b, 279a, 279b and adder 274a, 274b.The filter characteristic of being obtained by head transfer functions determinator 6 is stored in the illustrated storage area that do not have in filter 278a, 278b, 279a, the 279b, filter 278a, 278b, 279a, the 279b signal convolution filter characteristic separately to importing are exported then.According to crosstalk arrester 27b, do not produce inverse phase signal in low-frequency band and alleviate and play identical action effect on the inharmonious sense this point with crosstalk arrester 27a.Crosstalk arrester 27b shown in Figure 26 compares the number of the filter that can reduce use with crosstalk arrester 27a shown in Figure 25, can simplified structure.In addition, can also replace the FIR mode filter to use IIR (infinite impulse response: infinite impulse response).
In structure shown in Figure 3, crosstalk arrester 27 (perhaps 27a, 27b) and control part 47 are constituted individually, but at control part 47 is when possessing the microprocessor of DSP (Digital signal processor), can make control part 47 carry out the function of crosstalk arresters 27,27a, 27b.In addition, crosstalk arrester 27,27a, 27b are not limited to hardware, also can realize by software.
(headgear regeneration)
Yet, in image and sound recording regenerating unit 101 shown in Figure 3, also can listen to the voice signal of exporting by audio decoder portion 26 by headgear.When listening to the dual track voice signal, can not produce above-mentioned crosstalk composition by headgear.When listening to by headgear, can hear the anti-phase composition of dual track voice signal in left and right sides ear receipts by the crosstalk processing signals after 27 processing of crosstalk arrester.The anti-phase composition is the non-existent acoustic signal composition of occurring in nature, is unfavorable for listening to, and does not eliminate and handles so do not carry out crosstalk when also listening to the dual track voice signal by headgear.
Therefore, as shown in Figure 3, the voice signal of audio decoder portion 26 outputs is not exported to voice output terminal 37c via crosstalk arrester 27 ground.Be provided for headgear 55 from the sound of voice output terminal 37c output.Look hearer 59 and promptly can listen to the crosstalk processing signals of exporting by crosstalk arrester 27 by loud speaker 53,54 as described above, can also listen to the voice signal that does not carry out the processing of crosstalk arrester 27 by headgear 55.
Use Figure 27 that the regeneration of the dual track voice signal of use headgear 55 is described in proper order.Give identical symbol to the processing identical, and omit its explanation with flow process shown in Figure 13.
In Figure 27, step S181~S186 is identical with the explanation among Figure 13, judges whether to regenerate by headgear 55 in step S192.Just passable by not having illustrated joint detection portion whether to detect in voice output terminal 37c, to have inserted plug as the splicing ear of headgear 55.If in step S192, be judged to be headgear regeneration (YES), then in step S193, do not carry out crosstalk and eliminate the dual track voice signal of handling, and move to step S190 by headgear 55 and vision signal synchronously reproducing.In addition, when the voice signal of being regenerated was not dual track voice signal but common stereo sound, it was just passable similarly the voice signal of audio decoder portion 26 outputs to be offered headgear 55.
If in step S192, be not judged to be headgear regeneration (NO), then carry out the processing of step S187~S190 in the same manner with Figure 13.The processing of step S189 is different with the Regeneration Treatment based on headgear 55 among the step S193, is the processing by loud speaker 53,54 regeneration dual track voice signals.
Second execution mode
Cameraman 300 wears dual track microphone 3 on one side and collects sound and on one side object is made a video recording on left and right sides ear 302, and carries out record to recording medium 44, looks hearer 59 thus and can listen to the omnidirectional ambient sound of cameraman 300 when photographing.But the image of common image and sound recording regenerating unit (video camera) photography is the image in the about 60 degree scopes in video camera the place ahead.When carrying out the convergent-divergent photography, photograph with littler visual angle.Therefore, carrying out convergent-divergent when photography, hope can strengthen near the sound that sends the object that is exaggerated and collects sound.Near the sound that second execution mode also strengthens when object is amplified the object collects sound, record.
The image and sound recording regenerating unit 102 that possesses sound convergent-divergent handling part of expression second execution mode in Figure 28, and its structure and action described.Give identical symbol for having, and omit its explanation with the component part of image and sound recording regenerating unit 101 identical functions of first execution mode shown in Figure 3.Image and sound recording regenerating unit 102 is compared difference and is to have sound convergent-divergent handling part 33 with image and sound recording regenerating unit 101.In addition, in Figure 28, the splicing ear that has omitted headgear 55 and headgear 55 is the diagram of voice output terminal 37c.
In Figure 28, the voice signal of importing via external microphone terminal 32 from dual track microphone 3 is transfused to sound convergent-divergent handling part 33.Image pickup part 11 possesses a plurality of camera lenses (not shown), has to make one or more camera lenses move the distance that changes between the camera lens, thus the zoom function that object is amplified/dwindles.When having carried out amplifieroperation by operating portion 48, amplify control signal from control part 47 to image pickup part 11 inputs object is carried out amplifying camera, and control signal is carried out sound to the voice signal of input processing and amplifying is amplified in 33 inputs to sound convergent-divergent handling part.
Sound convergent-divergent handling part 33 is according to amplifying control signal, generates the voice emplifying signal to comprising more to amplify from voice signal near send the object voice signal, that obtain at cameraman 300 mesion collection sound in the dual track voice signal.The voice emplifying signal is input in the audio coding portion 22 via switch Sw1.The signal of video signal of the object of bust shot is encoded by video coding portion 12, and the voice emplifying signal is encoded by audio coding portion 22, and is recorded in the same manner in the recording medium 44 with first execution mode.
Figure 29 is the concrete configuration example of sound convergent-divergent handling part 33.As shown in figure 29, sound convergent-divergent handling part 33 possesses convergent-divergent multiplying power detector 331, coefficient calculator 332, adder 335, variable amplifier 337 and adder 338a, 338b.
In Figure 29, convergent-divergent multiplying power detector 331 detects the convergent-divergent multiplying power according to the amplification control signal that is provided by control part 47.Coefficient calculator 332 is according to detected convergent-divergent multiplying power, and calculating is represented near the factor alpha of the enhancing degree of the sound that sends the object.335 pairs of dual track voice signals by the left and right acoustic channels of external microphone splicing ear 32 inputs of adder carry out addition.Variable amplifier 337 amplifies the output signal of adder 335 according to the factor alpha from coefficient calculator 332.Adder 338a, 338b add the dual track voice signal of left and right acoustic channels respectively to the output signal of variable amplifier 337.Be worn at microphone 31a, 31b with dual track microphone 3 under the state on cameraman 300 the left and right sides ear 302, the oscillating plate in microphone 31a, the 31b is almost parallel.Comprise the composition of mutual anti-phase from the sound of cameraman's 300 left and right directions, so the result who carries out the add operation of left and right acoustic channels by adder 335 weakens from the sound partial offset of left and right directions.Therefore, the voice emplifying signal after the sound that the mesion collection sound cameraman 300 is obtained from 33 outputs of sound convergent-divergent handling part amplifies.
Use Figure 30 that the action that comprises sound convergent-divergent handling part 33 is further detailed in the action of interior image and sound recording regenerating unit 102.In Figure 30, in step S201,335 pairs of dual track voice signals from the left and right acoustic channels of dual track microphone 3 of adder carry out addition, obtain add operation signal S.In step S202, convergent-divergent multiplying power detector 331 detects the convergent-divergent multiplying power that the operation by control part 47 respective operations portions 48 obtains.Can also carry out the relation that applies voltage and driving time of drive electric motor according to camera lens and obtain the convergent-divergent multiplying power image pickup part 11.In step S203, coefficient calculator 332 corresponding convergent-divergent multiplying powers are calculated the factor alpha of expression enhancing degree.In step S204, variable amplifier 337 multiply by factor alpha on the output signal of adder 335 obtains α S.In step S205, adder 338a, 338b carry out addition to the output signal (α S) of the dual track voice signal and the variable amplifier 337 of left and right acoustic channels.Then, in step S206, with the voice emplifying signal record in recording medium 44.In step S207, control part 47 judges whether record finishes, if do not finish (NO), then begins repetitive operation from step S201.If record end in step S207 (YES) then finishes the processing of sound convergent-divergent handling part 33.
Figure 31 has represented the sound convergent-divergent handling part 33a of other configuration examples of sound convergent-divergent handling part 33.When to mesien voice signal convolution give when looking the hearer near the head transfer functions of the effect of source of sound, feel that with the amplification linkage ground of the object that is undertaken by image pickup part 11 looking the hearer seems can listen to the stronger voice signal of presence near source of sound.Sound convergent-divergent handling part 33a shown in Figure 31 gives as the head transfer functions near the effect of source of sound mesien voice signal convolution.
Sound convergent-divergent handling part 33a shown in Figure 31 compares with the sound convergent-divergent handling part 33 of Figure 29, and difference is to have appended function and obtains device 333, transfer function memory 334 and convolution algorithm device 336.
In Figure 31, in transfer function memory 334, stored and be used to form by virtually near locating the head transfer functions of the virtual source of sound of source of sound.This head transfer functions is the function of listening to characteristic that is used to obtain the sound that sends from virtual source of sound, listens to the distance decision of characteristic by virtual source of sound and listener.
Function is obtained device 333 is obtained the source of sound of supposing with the factor alpha that calculates according to coefficient arithmetic unit 332 from transfer function memory 334 the corresponding head transfer functions in position location.In addition, the factor alpha among Figure 29 may not be identical with the factor alpha in Figure 31 and other accompanying drawings, but give identical symbol for convenience.336 pairs of add operation signal convolution by the dual track sound of adder 335 generations of convolution algorithm device obtain the head transfer functions that device 333 is obtained by function.Variable amplifier 337 according to the factor alpha that obtains by coefficient arithmetic unit 332 to convolution the add operation signal of head transfer functions amplify.Adder 338a, 338b carry out addition to the dual track voice signal of left and right acoustic channels and the output signal of variable amplifier 337.Here, though adopted the structure that possesses variable amplifier 337, only, also can omit variable amplifier 337 by just obtaining enough effects near locating source of sound virtually.In addition, can be used in function obtains the factor alpha of device 333 selection head transfer functions and determines that the factor alpha of the amplification degree of variable amplifier 337 is a coefficient inequality.
Use Figure 32 that the assay method of the head transfer functions that is used to form virtual source of sound is described.
Head transfer functions determinator 6a shown in Figure 32 possesses personal computer 61, amplifier 62, loud speaker 63, amplifier 66a, 66b and manually wears microphone 68.Manually wear microphone 68 and on artificial head 681, microphone assembly 684a, 684b have been installed.Head transfer functions determinator 6a compares difference with head transfer functions determinator 6 shown in Figure 15 and is, upward microphone assembly 65a, the 65b of installation have used one (the using loud speaker 63 at this) of manually wearing in microphone 68 and the use left and right sides loud speaker 63,64 that it is configured in the mesion of manually wearing microphone 68 to be substituted in cylindrical configuration body 65e.
Figure 33 represents manually to wear the sectional view of microphone 68.Manually wear microphone 68 and near the inlet of duct 683a, the 683b of the artificial head 681 that possesses auricle shape parts 682a, 682b, disposed microphone assembly 684a, 684b.The microphone 69 of manually wearing shown in Figure 16 (C) has disposed microphone assembly 694a, 694b on the position that is equivalent to people's eardrum position of duct 693a, 693b inside, manually wear in the microphone 68, disposed microphone assembly 684a, 684b near the inlet of duct 683a, 683b, this point is with manually to wear microphone 69 different.In addition, usually said manually wears microphone shown in Figure 16 (C), configuration microphone assembly 694a, 694b on the position that is equivalent to people's eardrum position of duct 693a, 693b inside, but for convenience, near the microphone that has disposed Figure 33 of microphone assembly 684a, 684b the inlet of duct 683a, the 683b of the artificial head 681 that possesses auricle shape parts 682a, 682b is also referred to as manually wears microphone.
Wear microphone 68 by end user foreman, the sound from loud speaker 63 can be collected sound as the dual track sound of having eliminated the influence that is caused by duct 683a, 683b.
Return Figure 32, generate the measured signal that for example forms by the pulse sound by personal computer 61.Measured signal is amplified by amplifier 62.The measured signal of sending from loud speaker 63 is received by the left and right sides microphone assembly 684a, the 684b that manually wear microphone 68.The left and right sides signal that receives is amplified by amplifier 66a, 66b, inputs to personal computer 61.Personal computer 61 compares by measured signal that will generate and the signal that receives, and obtains the head transfer functions h that manually wears microphone 68 l(t), h r(t).Head transfer functions h l(t) be the head transfer functions of obtaining according to the received signal of the microphone assembly 684a in left side, head transfer functions h r(t) be the head transfer functions of obtaining according to the received signal of the microphone assembly 684b on right side.Loud speaker 63 and the distance D of manually wearing microphone 68 for example are set at 0.5m, 1m, 2m etc., obtain the head transfer functions of each distance D successively.
The characteristic of the head transfer functions that expression is measured to by head transfer functions determinator 6a shown in Figure 32 in Figure 34~Figure 39.
Impulse response waveform shown in Figure 34 (A) is the waveform that is received by the microphone assembly 684a in left side when loud speaker 63 is 50cm with the distance D of manually wearing microphone 68.The longitudinal axis is the amplitude (voltage) after the normalization.Transverse axis is the time, is that the signal sampling of 48kHz is counted and represented with sample frequency.Figure 34 (B) carries out the frequency response characteristic that Fourier analysis obtains by the impulse response waveform that 61 couples of Figure 34 of personal computer (A) represent.Transverse axis is frequency (Hz), and the longitudinal axis is a response characteristic.
Figure 35 (A) is the impulse response waveform that distance D is received by the microphone assembly 684b on right side when being 50cm.Figure 35 (B) carries out the frequency response characteristic that Fourier analysis obtains to the impulse response waveform that Figure 35 (A) represents.Condition determination is identical with Figure 34.
Below, similarly, Figure 36 (A) is the impulse response waveform that distance D is received by the microphone assembly 684a in left side when being 1m, Figure 36 (B) is its frequency response characteristic.
Figure 37 (A) is the impulse response waveform that distance D is received by the microphone assembly 684b on right side when being 1m, and Figure 37 (B) is its frequency response characteristic.
Figure 38 (A) is the impulse response waveform that distance D is received by the microphone assembly 684a in left side when being 2m, and Figure 38 (B) is its frequency response characteristic.
Figure 39 (A) is the impulse response waveform that distance D is received by the microphone assembly 684b on right side when being 2m, and Figure 39 (B) is its frequency response characteristic.
Compare these characteristics, when the value of distance D was increased to 0.5m, 1m, 2m, the amplitude of the impulse response waveform shown in Figure 34~Figure 39 (A) reduced.In the frequency part of 1kHz~4kHz that frequency response characteristic shown in Figure 34~Figure 39 (B) is surrounded by the ellipse of dotted line when distance D is 0.5m, produce the concavo-convex characteristic (peak valley) of about 400Hz rule at interval, become irregular a little concavo-convex characteristic during for 1m in distance D, become the characteristic that has made up the different a plurality of concavo-convex characteristics of frequency interval during for 2m in distance D.In addition, when distance D is identical about obtain substantially identical characteristic.
The measured signal of personal computer 61 by the pulse sound that will generate compares with waveform from the impulse response signal of amplifier 66a, 66b input, obtains the head transmission characteristic according to distance D.The head transmission characteristic of obtaining according to distance D is source of sound to be positioned on the position of distance D virtually, makes the listener listen to characteristic from the voice signal of this virtual source of sound.In addition, though in the present embodiment distance D is made as 0.5m, 1m, 2m, also can the interval of distance D can be set at less than 0.5m more apart from the characteristic of obtaining separately.
The head transmission characteristic that as above obtains is stored in the transfer function memory 334 of Figure 31.As judging at an example that is used which stored transfer function by convergent-divergent multiplying power detector 331 detected convergent-divergent multiplying powers, use will be by the factor alpha that obtains after divided by the convergent-divergent multiplying power to the distance of object that does not have illustrated auto-focus measurement function metering in the image pickup part 11.For example, be 10m in distance to object, the convergent-divergent multiplying power is 5 o'clock, factor alpha is 2.In the distance to object is 10m, and the convergent-divergent multiplying power is 10 o'clock, and factor alpha is 1, is 20 o'clock in the convergent-divergent multiplying power, and factor alpha is 0.5.
Use Figure 40 that the action that comprises sound convergent-divergent handling part 33a is further detailed in the action of interior image and sound recording regenerating unit 102.In Figure 40, in step S211,335 pairs of dual track voice signals from the left and right acoustic channels of dual track microphone 3 of adder carry out the phase Calais and obtain add operation signal S.In step S212, the convergent-divergent multiplying power that the operation that convergent-divergent multiplying power detector 331 detects respective operations portion 48 is obtained by control part 47.In step S213, which in a plurality of transfer functions of storage in the transfer function memories 334 the corresponding convergent-divergent multiplying powers decision of coefficient calculator 332 select, and perhaps calculates the factor alpha of the enhancing degree of expression variable amplifier 337.In addition, as factor alpha, the numerical value that can directly use the distance that will arrive object to obtain divided by the convergent-divergent multiplying power also can be the numerical value that generates according to the numerical value that the distance that will arrive object obtains divided by the convergent-divergent multiplying power.
In step S214, function is obtained device 333 coefficient of correspondence α and obtain transfer function from transfer function memory 334, the add operation signal convolution transfer function of 336 pairs of adders of convolution algorithm device, 335 outputs.In step S215, the output signal of 337 pairs of convolution algorithm devices 336 of variable amplifier multiply by factor alpha and comes enhancing signal.In step S216, adder 338a, 338b carry out addition to the dual track voice signal of left and right sound channels and the output signal of variable amplifier 337.Then, in step S216, with the voice emplifying signal record in recording medium 44.In step S218, control part 47 judges whether record finishes, if do not finish (NO), then begins repetitive operation from step S211.Finish (YES) if in step S218, write down, then finish the processing of sound convergent-divergent handling part 33a.
The 3rd execution mode
Second execution mode is the processing and amplifying of carrying out sound in record one side, but the 3rd execution mode is the processing and amplifying of carrying out sound in regeneration one side.In the image and sound recording regenerating unit 103 of the 3rd execution mode shown in Figure 41, give identical symbol to having, and omit its explanation with the component part of image and sound recording regenerating unit 101 identical functions of first execution mode shown in Figure 3.Image and sound recording regenerating unit 103 is compared difference with image and sound recording regenerating unit 101 and is, sound convergent-divergent handling part 33b is set in the rear end of separated part 15, is provided with convergent-divergent multiplying power detector 331 at multiplexed 13 front end.In addition, in Figure 41, omitted headgear 55 and as the diagram of the voice output terminal 37c of the splicing ear of headgear 55.
Action to image and sound recording regenerating unit 103 describes.Operating portion 48 is operated, and the lens driving of the image pickup part 11 that is generated by control part 47 is transfused to image pickup part 11 and convergent-divergent multiplying power detector 331 with signal.331 pairs of lens driving of convergent-divergent multiplying power detector were analyzed with zoom direction, scaling speed and the lens driving time of signal, detected the convergent-divergent multiplying power.The convergent-divergent multiplying power information of the detected convergent-divergent multiplying power of expression is offered multiplexed 13.Carry out convergent-divergent multiplying power information and coding signal of video signal, coded sound signal and dual track marking signal multiplexed together for multiplexed 13.Record regenerating portion 14 will comprise convergent-divergent multiplying power information and be recorded in the recording medium 44 at interior multiplex signal.
The multiplex signal of record regenerating portion 14 regenerative recordings in recording medium 44, separated part 15 is also separated convergent-divergent multiplying power information except separating coding signal of video signal, coded sound signal and dual track marking signal.Convergent-divergent multiplying power information is inputed to sound convergent-divergent handling part 33b.
Figure 42 is the concrete configuration example of sound convergent-divergent handling part 33b.As shown in figure 42, sound convergent-divergent handling part 33b compares with the sound convergent-divergent handling part 33 of Figure 29, and difference is, has omitted convergent-divergent multiplying power detector 331, and the signal that adder 338a, 338b are imported is the output signal of crosstalk arrester 27.
In Figure 42, coefficient calculator 332 is used the convergent-divergent multiplying power information of separating input by separated part 15, the factor alpha of using when calculating is amplified by 337 pairs of input signals of variable amplifier.335 pairs of dual track voice signals from 26 inputs of audio decoder portion of adder carry out addition.Variable amplifier 337 according to by the factor alpha of coefficient calculator 332 input to amplifying from the output signal of adder 335.Adder 338a, 338b are to carrying out addition from the output signal of crosstalk arrester 27 with from the amplifying signal of variable amplifier 337.
But, have following situation: for example use DSP to carry out the convergent-divergent processing that calculation process is carried out image pickup part 11 by software.Under situation that the sound convergent-divergent handles the time is carried out in regeneration, do not need to guarantee that convergent-divergent handles the signal processing time of needed DSP.Therefore, can handle by the optimization that DSP carries out captured signal of video signal fully in when record, signal processing such as the coding of signal of video signal or record controls.Handle by the time carrying out the sound convergent-divergent, the computing of DSP can be distributed to man-hour and be used for the calculation process that convergent-divergent is handled in regeneration, can avoid being used to write down operation time deficiency unfavorable condition.
With respect to the sound convergent-divergent handling part 33b of Figure 42, Figure 43 has represented in the same manner mesien voice signal convolution to be given with Figure 31 the sound convergent-divergent handling part 33c of the head transfer functions of proximity effect.Sound convergent-divergent handling part 33c compares with sound convergent-divergent handling part 33b, and difference is to have appended function and obtains device 333, transfer function memory 334 and convolution algorithm device 336.It is identical with Figure 31 that function is obtained the action of device 333, transfer function memory 334 and convolution algorithm device 336, omits its explanation.
The 4th execution mode
The image and sound recording regenerating unit 104 of the 4th execution mode shown in Figure 44 manually carries out the processing and amplifying of sound by peripheral operation when regenerative recording medium 44.That is, when in recording medium 44, not writing down convergent-divergent multiplying power information, Yi Bian look the signal of video signal that hearer 59 watches regeneration on display 52, Yi Bian carry out the processing and amplifying of sound.Call the artificial sound convergent-divergent and handle looking sound processing and amplifying that hearer 59 manually carries out.
Image and sound recording regenerating unit 104 shown in Figure 44 is compared with image and sound recording regenerating unit 103, and difference is to have omitted convergent-divergent multiplying power detector 331, replaces sound convergent-divergent handling part 33b and possesses sound convergent-divergent handling part 33d.
Operate indication when carrying out the artificial sound convergent-divergent when looking 59 pairs of operating portions 48 of hearer, control part 47 will amplify control signal and input to sound convergent-divergent handling part 33d.Sound convergent-divergent handling part 33d carries out processing and amplifying according to amplifying control signal to the dual track voice signal by 26 decodings of audio decoder portion.
Figure 45 is the concrete configuration example of sound convergent-divergent handling part 33d.As shown in figure 45, sound convergent-divergent handling part 33d compares with the sound convergent-divergent handling part 33b of Figure 42, difference is to be provided with the convergent-divergent multiplying power detector 331a of input from the amplification control signal of control part 47, to coefficient calculator 332 is not the convergent-divergent multiplying power information of input from separated part 15, but the convergent-divergent multiplying power information that input generates at convergent-divergent multiplying power detector 331a.Other parts are and the identical action of convergent-divergent handling part 33b to omit explanation.
Use Figure 46, the artificial sound convergent-divergent in the 4th execution mode is handled further specifying.In Figure 46, in step S221, the left and right sides signal of the binaural signal that 335 pairs of regeneration of adder obtain carries out the phase Calais and obtains add operation signal S.In step S222, control part 47 judges whether to have changed sound convergent-divergent multiplying power by operating portion 48.In step S222, when having changed sound convergent-divergent multiplying power (YES), move to step S223, when not changing (NO), move to step S226.
Under the situation that has changed sound convergent-divergent multiplying power, in step S223, convergent-divergent multiplying power detector 331a calculates the convergent-divergent multiplying power according to amplifying control signal.In step S224, coefficient calculator 332 is according to calculating factor alpha from the convergent-divergent multiplying power of convergent-divergent multiplying power detector 331a input.Factor alpha can comprise the characteristic that is used for the head transfer functions that source of sound positioned in the place ahead of looking the hearer.In step S225, the value of factor alpha is updated to the value that newly calculates.
Then, in step S226,337 pairs of add operation signals of variable amplifier S multiply by factor alpha and generates α S.In addition, under the situation of not passing through step S223~S225, factor alpha is the value before the change sound convergent-divergent multiplying power.In step S227, adder 338a, 338b add signal alpha S to the dual track voice signal that has carried out crosstalk elimination processing by crosstalk arrester 27.In step S228, export the voice signal that in step S227, obtains via switch Sw2 and voice output terminal 37b.In step S229, control part 47 judges whether regeneration finishes, if do not finish (NO), then returns step S221, if finish (YES), then end process.
The 5th execution mode
The image and sound recording regenerating unit 105 of the 5th execution mode shown in Figure 47 has been showed the structure that is suitable for listening to by headgear 55 the voice emplifying signal.Image and sound recording regenerating unit 105 shown in Figure 47 is compared with the image and sound recording regenerating unit 103 of Figure 41, difference is to replace sound convergent-divergent handling part 33b and possesses sound convergent-divergent handling part 33e, and the voice signal of sound convergent-divergent handling part 33e output is offered headgear 55 via voice output terminal 37c.Headgear 55 do not provided carry out crosstalk by crosstalk arrester 27 and eliminate and handle, but implemented the dual track voice signal of processing and amplifying by sound convergent-divergent handling part 33e.
Figure 48 is the concrete formation example of sound convergent-divergent handling part 33e.Sound convergent-divergent handling part 33e compares with the sound convergent-divergent handling part 33b of Figure 42, and difference is to have appended adder 338c, 338d.Adder 338c, 338d add from the voice emplifying signal of variable amplifier 337 outputs the dual track voice signal that obtains by 26 decodings of audio decoder portion.The headgear that adder 338c, 338d are calculated is listened to voice signal and is offered headgear 55 via voice output terminal 37c.
The regeneration effect of the voice emplifying signal of second~the 5th execution mode discussed above is as described below.
Be set at wide visual angle at visual angle with image pickup part 11, and under the less situation of convergent-divergent multiplying power, carry out the signal that add operation obtains in adder 335 and be not enhanced, so look the signal of video signal that shows on 59 pairs of displays 52 of hearer and carried out the voice signal that regeneration obtains by 53,54 pairs of voice signals of loud speaker and carried out audiovisual with presence around cameraman 300 360 degree at variable amplifier 337.The visual angle is being set under the situation at wide visual angle, the visual angle also is about 60 degree.Sometimes, look hearer 59 can identify because the difference of the visual angle of the image angular range when voice signal is collected sound, the sound deficiency that object sent that on display 52, shows, promptly so-called intercalary delection sound (dropped mediumrange sound).To this, above-mentioned such voice signal of having implemented the sound processing and amplifying is, strengthens behind the signal component of the mesion direction from towards cameraman 300 time and the addition of dual track voice signal, so become the voice signal that the centre disappearance has been carried out compensation.Thus, can prevent to look the sound that hearer 59 identifies intercalary delection, compare with first execution mode, look hearer 59 can hear inharmonic sensation still less, have the sound of presence.
The 6th execution mode
The image and sound recording regenerating unit 106 of Figure 49, the 6th execution mode shown in Figure 50 is provided with built-in stereophony microphone 21 and dual track microphone 3 respectively as first~the 5th execution mode, but has showed the public structure that is generally used for the microphone and the dual track microphone of stereo collection sound.Figure 49 is the plane graph that the outward appearance of the image and sound recording regenerating unit 106 of expression the 6th execution mode constitutes example, and Figure 50 is the block diagram of the concrete inside configuration example of expression image and sound recording regenerating unit 106.In Figure 49, Figure 50, give identical symbol to having, and omit its explanation with the component part of Fig. 1, Fig. 3 identical functions.
As shown in figure 49, image and sound recording regenerating unit 106 possesses: be used for disposing the 35a of microphone arrangement portion, the 35b of microphone 31e, 31f and the cable resettlement section 34 of accommodating the microphone cables 310e, the 310f that are connected with microphone 31e, 31f in the above.
In Figure 49, when using microphone 31e, 31f to carry out common stereo collection sound, cameraman 300 is configured in microphone 31e, 31f on the 35a of microphone arrangement portion, the 35b and collects sound.When carrying out the collection sound of dual track sound, cameraman 300 draws microphone cables 310e, 310f from cable resettlement section 34, microphone 31e, 31f is worn on the ear 302 collect sound.Image and sound recording regenerating unit 106 has not shown overshooting shape test section, and it detects microphone 31e, the 31f that disposes on the 35a of microphone arrangement portion, 35b.By the ON/OFF of the switch (the switch Sw4 that is equivalent to Figure 50) that moves in linkage with the overshooting shape test section, image and sound recording regenerating unit 106 detects whether disposed microphone 31e, 31f on the 35a of microphone arrangement portion, 35b.The detection that whether has been configured on the 35a of microphone arrangement portion, the 35b is not limited thereto, and for example can also use Hall element, magnetoresistive element that the magnetic field that alnico magnets built-in among microphone 31e, the 31f produce is detected.
Specifically, in Figure 50, if microphone 31e, 31f are not configured on the 35a of microphone arrangement portion, the 35b, then switch Sw4 and terminal e are connected to become off status, if disposed microphone 31e, 31f, then with the terminal f state of being connected to become out.The 41a of configuration detection portion is to be connected with terminal e or to be connected with terminal f according to switch Sw4, detects and whether has disposed microphone 31e, 31f on the 35a of microphone arrangement portion, 35b.The detection signal of the 41a of configuration detection portion is offered control part 47.
The 41a of configuration detection portion is detecting when having disposed microphone 31e, 31f, microphone 31e, 31f carry out common stereo collection sound, 47 pairs of each ones of circuit of control part control, make image and sound recording regenerating unit 106 carry out with general mode under the corresponding operation of recording of photography.At this moment, built-in stereophony microphone 21 equivalences among the effect of microphone 31e, 31f and Fig. 3, flag information generating unit 42 do not generate the dual track marking signal of expression dual track pattern.On the other hand, detect when not disposing microphone 31e, 31f at the 41a of configuration detection portion, regard cameraman 300 as and microphone 31e, 31f are worn on the dual track pattern of carrying out dual track collection sound on the ear 302, each one of control part 47 control circuits, make image and sound recording regenerating unit 106 carry out with the dual track pattern under the corresponding operation of recording of photography.At this moment, flag information generating unit 42 generates the dual track marking signal according to the control of control part 47.
In the 6th execution mode, microphone 31e, 31f, the 35a of microphone arrangement portion, 35b, the 41a of configuration detection portion and control part 47 integral body are moved as switching part, and described switching part switching is to use the dual track microphone that is worn on cameraman's ear also to be to use dual track microphone microphone in addition to be used as sound is on every side collected the microphone of sound.
At this, use Figure 51 that an example of the structure of cable resettlement section 34 is described.Figure 51 (A) is the internal structure of overlooking cable resettlement section 34 from above, and microphone cables 310e, 310f are wound onto on the spool 341 with rotating shaft 343.Figure 51 (B) is the internal structure of looking up cable resettlement section 34 from below, possesses spiral helicine spring 342 in the inside of spool 341.Replace the detection of whether having disposed microphone 31e, 31f on the 35a of microphone arrangement portion, 35b, whether perhaps in addition, can also detect by the anglec of rotation to spool 341 is the dual track pattern.
The 7th execution mode
The image and sound recording regenerating unit 107 of the 7th execution mode that Figure 52 represents has been showed to use and will have been collected the structure that voice signal that sound obtains radios to device self radio type dual track microphone.In Figure 52, give identical symbol, and omit its explanation having with the component part of Fig. 1 identical functions.
In Figure 52, image and sound recording regenerating unit 107 possesses the external microphone splicing ear 32 that radio receiving and transmitting part 39 replaces Fig. 1, replaces dual track microphone 3 to use radio-type dual track microphone 38 to record.The wireless radio-type dual track microphone 38 that combines of 300 of cameramans and device self is worn on the head, and will left and right sides microphone 38a, 38b inserts in the ear 302 and collect sound, not photographed by the obstruction ground of microphone cables.In addition, can carry out cameraman 300 and not shown two people's of collection sound person photography.
Use Figure 53 that the internal structure of image and sound recording regenerating unit 107 is described.In Figure 53, give identical symbol to having, and omit its explanation with the component part of Fig. 3 identical functions.Image and sound recording regenerating unit 107 shown in Figure 53 is compared with the image and sound recording regenerating unit 101 of Fig. 3, and difference is to possess radio receiving and transmitting part 39 and replaces external microphone splicing ear 32 and joint detection portion 41.
Be in the distance of device self regulation being judged to be radio-type dual track microphone 38, when radio receiving and transmitting part 39 receives the dual track voice signal from radio-type dual track microphone 38, control part 47 makes switch Sw1 be connected with terminal b, provides the dual track voice signal from radio-type dual track microphone 38 to audio coding portion 22.At this moment, control part 47 is controlled and is made flag information generating unit 42 generate the dual track flag information.In addition, be judged to be radio-type dual track microphone 38 apart from the distance of device self surpass regulation apart from the time, control part 47 makes switch Sw1 be connected with terminal a, and the stereo sound signal from built-in stereophony microphone 21 is offered audio coding portion 22.At this moment, flag information generating unit 42 does not generate the dual track marking signal.
Figure 54 has represented the inside configuration example of radio-type dual track microphone 38 and radio receiving and transmitting part 39, and their action is described.
Shown in Figure 54, the microphone 38a of radio-type dual track microphone 38 possesses microphone assembly 381, amplifier of microphone 382, transceiver unit 383, antenna 384 and warning signal sending part 385.Though omitted diagram, microphone 38b is not except possessing warning signal sending part 385 this point, and is identical with the structure of microphone 38a.But radio receiving and transmitting part 39 possesses transceiver unit 391, microphone detection unit 392, range determination portion 393 communication zone detection units 394, warning signal sending part 395 and antenna 396.
The microphone assembly 381 of microphone 38a (38b) generates the dual track voice signal.382 pairs of dual track voice signals from microphone assembly 381 of amplifier of microphone amplify.Transceiver unit 383 modulation system is according to the rules modulated the dual track voice signal that is exaggerated from amplifier of microphone 382, and sends via antenna 384.Warning signal sending part 385 is according to generating warning signal by warning signal sending part 395 generations of radio receiving and transmitting part 39 described later and the warning signal that transmits via transceiver unit 391 and transceiver unit 383.
The antenna 396 of radio receiving and transmitting part 39 receive from about the modulation signal that sends of microphone 38a, 38b.391 pairs of modulation signals of receiving of transceiver unit carry out demodulation and generate the dual track voice signal, and the intensity of the received power of modulation signal is measured.The distance from radio receiving and transmitting part 39 to radio-type dual track microphone 38 is inferred according to the intensity of the received power that is measured to by range determination portion 393.But whether 394 pairs of distances of inferring out of communication zone detection unit are judged in the predetermined distance that can communicate by letter.But the result of determination of communication zone detection unit 394 is offered control part 47, if the distance of inferring out is in the predetermined distance that can communicate by letter, then control part 47 is controlled switch Sw1 is connected with terminal b, makes flag information generating unit 42 generate the dual track marking signal.If the distance of inferring out has surpassed the predetermined distance that can communicate by letter, then control part 47 makes switch Sw1 be connected with terminal a.
But be judged to be the distance of inferring out when having surpassed the predetermined distance that can communicate by letter at communication zone detection unit 394, warning signal sending part 395 generates warning signals.Warning signal is offered control part 47.Control part 47 generates warning label and offers display part 17 then, shows the warning label of regulation on display part 17.When warning signal sending part 395 did not generate warning signal, microphone detection unit 392 was judged to be and normally obtains the dual track voice signal, will offer video coding portion 22 via switch Sw1 by the dual track voice signal that transceiver unit 391 demodulation obtain.
In Figure 55, represented the example that the warning that formed by radio-type dual track microphone 38 and image and sound recording regenerating unit 107 shows.On microphone 38a, be provided with the bar-like member that light-emitting diode (LED) 386 has been installed at front end.Provide the warning signal that is generated by warning signal sending part 385 to LED386, LED386 is according to warning signal glimmer (or lighting).LED386 perhaps replaces flicker except flicker, sound can also give a warning.At this moment, listen to less than warning tones (being difficult to uppick), preferably reduce the level of warning tones, perhaps the frequency of warning tones for example is set at the low frequency below tens of Hz in order to make microphone assembly 381.
But be judged to be radio-type dual track microphone 38 when having surpassed the communication zone of circle expression of with dashed lines, warning signal sending part 395 generates warning signals.Shown in Figure 55, if but radio-type dual track microphone 38 has surpassed communication zone, then on display part 17, show the warning label of regulation.
Figure 56 is illustrated in an example of the warning label that shows on the display part 17.The warning label 171a of Figure 56 (A) expression is overlapping on the dual track mark 171 shown in Fig. 5 (A) * shown mark.The warning label 172a of Figure 56 (A) expression is the mark of the mark 172 shown in the displayed map 5 (B) hazily.Can use the some marks among Figure 56 (A), (B) to be used as warning label, the mark beyond can certainly using.Show that even carried out warning in hypothesis the received power of radio receiving and transmitting part 39 still is receiver margin when following, control part 47 switches to built-in stereophony microphone 21 from radio-type dual track microphone 38.
Use Figure 57 that the action of image and sound recording regenerating unit 107 is further detailed.In Figure 57, in step S251, control part 47 takes a decision as to whether the dual track pattern.If be not judged to be dual track pattern (NO) in step S251, then record regenerating portion 14 collects sound by built-in stereophony microphone 21 in step S253, with common stereo sound signal record in recording medium 44.If be judged to be dual track pattern (YES) in step S251, then radio receiving and transmitting part 39 receives transmission signal from radio-type dual track microphone 38 in step S252.In step S254, the distance from radio receiving and transmitting part 39 to radio-type dual track microphone 38 detects according to the intensity (receiving intensity) of received power in range determination portion 393.In step S255, but communication zone detection unit 394 judges that detected distance is whether in the distance of regulation.
If in step S255, be not judged to be in the distance of regulation (NO), then in step S257, control part 47 judges whether warning demonstration time t is 0 (non-prompting), if warning demonstration time t is 0, then control warning signal sending part 395 makes it generate warning signal in step S300.After generating warning signal, return step S254 and repeat identical processing.In step S257, if warning demonstration time t is not 0, then in step S258, control part 47 judges whether warning demonstration time t has surpassed predetermined maximum time tmax, if do not surpass maximum time tmax (NO), then move to step S300, similarly return step S254.If surpassed maximum time tmax (YES), then control part 47 control switch Sw1 switch to built-in stereophony microphone 21 from radio-type dual track microphone 38 in step S259, and control warning signal sending part 395 makes it stop to generate warning signal.Then, move to step S253.
Be judged to be in step S255 in the distance of regulation when (YES), in step S256, if warning signal sending part 395 is generating warning signal, then control part 47 control warning signal sending parts 395 make it stop to generate warning signal.Then, in step S301, record regenerating portion 14 collects sound by radio-type dual track microphone 38, and the dual track voice signal is recorded in the recording medium 44.In step S302, control part 47 judges whether carried out making the operation of record end, if do not carry out record end operation (NO), then returns step S251, if carried out record end operation (YES), then end process.
Utilizability on the industry
The present invention is applicable to not only in civilian video camera, also needs to regenerate in video camera in industry and takes Image and the image and sound recording regenerating unit with sound of presence. In addition, also applicable to having The digital camera of animation shoot function, mobile phone. And the present invention preferably is applicable to signal of video signal Carry out the image and sound recording regenerating unit of record regenerating with the voice signal both sides, but namely be used in only to sound The SoundRec regenerating unit that signal carries out record regenerating also can obtain enough effects.

Claims (8)

1. a video-audio recording apparatus (101,102,103,104,105,107), the signal of video signal that it obtains the reference object thing and collect the voice signal that sound obtains at interior cameraman (300) sound on every side and carry out record to comprising sound that described object sends, it is characterized in that possessing:
Image pickup part (11), it is made a video recording to described object;
Switching part (Sw1), its switching are to use the dual track microphone (3) that is worn on described cameraman (300) ear also to be to use dual track microphone (3) microphone in addition to be used as described ambient sound is collected the microphone of sound;
Video processing portion (12), it is to handling from the signal of video signal of described image pickup part (11) output;
Audio treatment part (22), it is handled from the voice signal of the microphone output that the sound around described is collected sound;
Flag information generating unit (42), when switching to described dual track microphone (3) by described switching part (Sw1) and be used as that the sound around described collected the microphone of sound, acoustic pattern when the sound around described is collected sound is made as the dual track pattern, generates the dual track marking signal of expression dual track pattern; And
Recording portion (14), it writes down in recording medium by described Video processing portion (12) and has carried out the signal of video signal of handling, carried out voice signal and the described dual track marking signal handled by described audio treatment part (22).
2. video-audio recording apparatus according to claim 1 (101,102,103,104,105,107) is characterized in that possessing:
Be built in the built-in microphone (21) in the described video-audio recording apparatus (101,102,103,104,105,107);
External microphone splicing ear (32);
Set the configuration part (48,47) that described dual track microphone (3) or dual track microphone microphone in addition is used as being connected with described external microphone splicing ear (32) external microphone that uses;
The joint detection portion (41) whether external microphone is connected and detects with described external microphone splicing ear (32);
From the voice signal of described built-in microphone (21) output with from switching the switch (Sw1) that is used as voice signal that described audio treatment part (22) is provided between the voice signal of described external microphone output; And
By described configuration part (47,48) described external microphone is being set at dual track microphone (3), and detect when on described external microphone splicing ear (32), being connected with external microphone by described joint detection portion (41), acoustic pattern is made as the control part (47) of described dual track pattern
When acoustic pattern is described dual track pattern, described switch (Sw1) is carried out switching controls to described control part (47) so that the voice signal that described switch (Sw1) is exported described external microphone offers described audio treatment part (22), and described control part (47) control described flag information generating unit (42) is so that described flag information generating unit (42) generates described dual track marking signal.
3. video-audio recording apparatus according to claim 1 (101,102,103,104,105,107) is characterized in that,
Possesses the display part (17) that the signal of video signal from described image pickup part (11) output is shown, and possesses display control unit (47), when it is described dual track pattern at acoustic pattern, make described display part (17) show the dual track mark of the described dual track pattern of expression.
4. video-audio recording apparatus according to claim 1 (102) is characterized in that,
Described image pickup part (11) possesses the zoom function that described object is carried out amplifying camera, and possesses sound convergent-divergent handling part (33), the magnification ratio of its corresponding described image pickup part (11) amplifies the voice signal that uses described dual track microphone (3) collection sound to obtain.
5. video-audio recording apparatus according to claim 1 (102,103) is characterized in that,
Described image pickup part (101) possesses the zoom function that described object is carried out amplifying camera,
And possesses sound convergent-divergent handling part (33a, 33c), described sound convergent-divergent handling part (33a, 33c) possesses: transfer function memory (334), a plurality of distances of its corresponding virtual source of sound and listener are stored head transfer functions, and described head transfer functions is used to make the source of sound of the voice signal that described dual track microphone (3) collection sound obtains to form virtual source of sound near described listener virtually; Function is obtained device (333), and the magnification ratio of its corresponding described image pickup part (11) is optionally obtained some in a plurality of head transfer functions of storage in this transfer function memory (334); And convolution algorithm device (336), it obtains the head transfer functions that device (333) is obtained to the voice signal convolution that is obtained by described dual track microphone (3) collection sound by described function.
6. image and sound recording method, the signal of video signal that it obtains the reference object thing and collect the voice signal that sound obtains at interior cameraman (300) sound on every side and carry out record comprising sound that described object sends is characterized in that possessing:
Shooting step (S155), it is made a video recording to described object;
Switch step (S151), switching are to use the dual track microphone (3) that is worn on described cameraman (300) ear also to be to use dual track microphone microphone in addition to be used as described ambient sound is collected the microphone of sound;
Video processing step (S156), it is handled the signal of video signal of taking described object and obtaining;
Audio Processing step (S156), it is handled from the voice signal of the microphone output that the sound around described is collected sound;
Flag information generates step (S158), when in described switch step (S151), switching to described dual track microphone (3) and being used as that the sound around described collected the microphone of sound, acoustic pattern when the sound around described is collected sound is made as the dual track pattern, generates the dual track marking signal of expression dual track pattern; And
Recording step (S161), it is recorded in the signal of video signal of handling in the described Video processing step (S156), voice signal and the described dual track marking signal handled in described Audio Processing step (S156) in recording medium (44).
7. a video-audio reproducing apparatus (101,102,103,104,105,106,107), to having write down signal of video signal that the reference object thing obtains and having regenerated in the recording medium (44) that interior cameraman (300) sound on every side collects the voice signal that sound obtains to comprising sound that described object sends, it is characterized in that possessing:
Reproducing unit (14), the tracer signal that writes down in its described recording medium of regenerating;
Separated part (15), its described tracer signal from described reproducing unit (14) regeneration is separated described signal of video signal and described voice signal;
Video processing portion (16), it is to being handled by the isolated described signal of video signal of described separated part (15);
Audio treatment part (26), it is to being handled by the isolated described voice signal of described separated part (15);
Flag information obtaining section (36), it records expression when having used the dual track marking signal of the dual track microphone on the ear that is worn on described cameraman (300) as the microphone that the sound around described is collected sound in described recording medium (44), obtain described dual track marking signal; And
Crosstalk arrester (27), when it has obtained described dual track marking signal in described flag information obtaining section (36), described voice signal handled offset the crosstalk signal that the described voice signal of described audio treatment part (26) being handled by loud speaker (53,54) produces when pronouncing
Described crosstalk arrester (27) has filter (272a-272d), the filter characteristic that this filter (272a-272d) is obtained in advance to described voice signal convolution, described filter characteristic is based on the filter characteristic of head transfer functions, and this head transfer functions is to use by a pair of microphone of installing in the cylindrical configuration body measured signal is collected the voice signal that sound obtains and the head transfer functions that measures.
8. video-audio renovation process, it is regenerated in the recording medium (44) that interior cameraman (300) sound on every side collects the voice signal that sound obtains to the signal of video signal that write down the reference object thing and obtain and to comprising sound that described object sends, it is characterized in that possessing:
Regeneration step (S181), the tracer signal of record in its described recording medium (44) of regenerating;
Separating step (S183), its described tracer signal of regenerating from described regeneration step is separated described signal of video signal and described voice signal;
Video processing step (S184), it is handled isolated described signal of video signal in described separating step (S183);
Audio Processing step (S184), it is handled isolated described voice signal in described separating step (S183);
Flag information is obtained step (S186), it records expression when having used the dual track marking signal of the dual track microphone (3) on the ear that is worn on described cameraman (300) as the microphone that the sound around described is collected sound in described recording medium (44), obtain described dual track marking signal; And
Crosstalk removal process (S188), it is obtained when having obtained described dual track marking signal in the step (S186) at described flag information, described voice signal is handled the crosstalk signal that produces when offsetting the described voice signal of handling to be pronounced in described Audio Processing step (S184) by loud speaker (53,54)
Described crosstalk removal process (S188) is the step of filter characteristic that described voice signal convolution is obtained in advance, described filter characteristic is based on the filter characteristic of head transfer functions, and this head transfer functions is to use by a pair of microphone of installing in the cylindrical configuration body measured signal is collected the voice signal that sound obtains and the head transfer functions that measures.
CNB2005800394962A 2004-11-19 2005-11-18 Video-audio recording apparatus and method and video-audio reproducing apparatus and method Expired - Fee Related CN100553373C (en)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
JP2004335667 2004-11-19
JP335667/2004 2004-11-19
JP337165/2004 2004-11-22
JP369406/2004 2004-12-21
JP149276/2005 2005-05-23
JP178919/2005 2005-06-20

Publications (2)

Publication Number Publication Date
CN101065991A true CN101065991A (en) 2007-10-31
CN100553373C CN100553373C (en) 2009-10-21

Family

ID=38965764

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2005800394962A Expired - Fee Related CN100553373C (en) 2004-11-19 2005-11-18 Video-audio recording apparatus and method and video-audio reproducing apparatus and method

Country Status (1)

Country Link
CN (1) CN100553373C (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102447995A (en) * 2010-09-30 2012-05-09 蒂雅克股份有限公司 Portable sound recorder with video camera
CN105075294A (en) * 2013-04-30 2015-11-18 华为技术有限公司 Audio signal processing apparatus
CN107094192A (en) * 2017-06-19 2017-08-25 深圳怡化电脑股份有限公司 Wireless calling apparatus and VTS equipment and system for VTS equipment
CN108111952A (en) * 2017-12-26 2018-06-01 上海展扬通信技术有限公司 A kind of way of recording, device, terminal and computer readable storage medium
CN110418226A (en) * 2018-04-28 2019-11-05 北京唱吧科技股份有限公司 A kind of microphone and microphone system
CN113366863A (en) * 2019-01-30 2021-09-07 脸谱科技有限责任公司 Compensating for the effects of a head-mounted device on a head-related transfer function
CN114422933A (en) * 2021-12-31 2022-04-29 歌尔光学科技有限公司 Electronic equipment, sound field processing method thereof and sound field processing equipment

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5556543B2 (en) * 2010-09-30 2014-07-23 ティアック株式会社 Camera module mounting device

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102447995A (en) * 2010-09-30 2012-05-09 蒂雅克股份有限公司 Portable sound recorder with video camera
CN102447995B (en) * 2010-09-30 2015-01-14 蒂雅克股份有限公司 Portable sound recorder with video camera
CN105075294A (en) * 2013-04-30 2015-11-18 华为技术有限公司 Audio signal processing apparatus
CN105075294B (en) * 2013-04-30 2018-03-09 华为技术有限公司 Audio signal processor
CN107094192A (en) * 2017-06-19 2017-08-25 深圳怡化电脑股份有限公司 Wireless calling apparatus and VTS equipment and system for VTS equipment
CN107094192B (en) * 2017-06-19 2023-04-25 深圳怡化电脑股份有限公司 Wireless communication device for VTS equipment, VTS equipment and system
CN108111952A (en) * 2017-12-26 2018-06-01 上海展扬通信技术有限公司 A kind of way of recording, device, terminal and computer readable storage medium
CN108111952B (en) * 2017-12-26 2020-07-31 上海展扬通信技术有限公司 Recording method, device, terminal and computer readable storage medium
CN110418226A (en) * 2018-04-28 2019-11-05 北京唱吧科技股份有限公司 A kind of microphone and microphone system
CN113366863A (en) * 2019-01-30 2021-09-07 脸谱科技有限责任公司 Compensating for the effects of a head-mounted device on a head-related transfer function
CN114422933A (en) * 2021-12-31 2022-04-29 歌尔光学科技有限公司 Electronic equipment, sound field processing method thereof and sound field processing equipment
CN114422933B (en) * 2021-12-31 2024-04-26 歌尔科技有限公司 Electronic equipment, sound field processing method thereof and sound field processing equipment

Also Published As

Publication number Publication date
CN100553373C (en) 2009-10-21

Similar Documents

Publication Publication Date Title
CN101065991A (en) Video-audio recording apparatus and method, and video-audio reproducing apparatus and method
JP4775264B2 (en) Video / audio recording apparatus and method, and video / audio reproduction apparatus and method
CN1248223C (en) Information signal representing apparatus
US20050237395A1 (en) Information processing apparatus, imaging apparatus, information processing method, and program
CN1287633C (en) Sound image localizer
CN1148731C (en) Digital audio recording medium and its replaying device
CN1116737C (en) User adjustable volume control that accommodates hearing
CN1650528A (en) Multi-channel downmixing device
CN101053152A (en) Audio tuning system
CN1728892A (en) Sound-field correcting apparatus and method therefor
JP4934580B2 (en) Video / audio recording apparatus and video / audio reproduction apparatus
CN1694580A (en) Measuring apparatus and method, and recording medium
CN1845582A (en) Imaging device, sound record device, and sound record method
JP2005318061A (en) Signal processor and binaural reproducing method
CN100346631C (en) Information output recording apapratus and method, recording system and method and recording medium
CN1347263A (en) Signal processing apparatus and record medium
CN1454446A (en) Creating virtual surround using dipole and monopole pressure field
JP4241544B2 (en) Electronic device and program
JP2008136215A (en) Video/audio recording apparatus and method
CN1898989A (en) Acoustic signal reproducing method
JP5241865B2 (en) Video camera
JP4415775B2 (en) Audio signal processing apparatus and method, audio signal recording / reproducing apparatus, and program
CN1625228A (en) Data recording system, data recording apparatus, data transmission apparatus
JP2011101284A (en) Sound signal processing apparatus and method
JP5072730B2 (en) Signal processing device, imaging device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: JVC KENWOOD CORPORATION

Free format text: FORMER OWNER: VICTORY CO. LTD.

Effective date: 20140310

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20140310

Address after: Kanagawa

Patentee after: JVC KENWOOD Corp.

Address before: Kanagawa

Patentee before: VICTOR COMPANY OF JAPAN, Ltd.

CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20091021