CN101513056B - Audio conference apparatus and audio conference system - Google Patents

Audio conference apparatus and audio conference system

Info

Publication number
CN101513056B
Authority
CN
China
Prior art keywords
acoustic beam
mentioned
receives
audio conference
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN2007800321284A
Other languages
Chinese (zh)
Other versions
CN101513056A (en
Inventor
石桥利晃
田中良
鹈饲训史
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yamaha Corp
Original Assignee
Yamaha Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yamaha Corp
Publication of CN101513056A
Application granted
Publication of CN101513056B
Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R27/00 Public address systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/56 Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/56 Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H04M3/568 Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants
    • H04M3/569 Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants using the instant speaker's algorithm
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M2203/00 Aspects of automatic or semi-automatic exchanges
    • H04M2203/50 Aspects of automatic or semi-automatic exchanges related to audio conference
    • H04M2203/5072 Multiple active speakers

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Telephonic Communication Services (AREA)
  • Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)

Abstract

Provided is a teleconference system that collects the voices of conference participants over a wide range while imaging the main speaker. The voice conference device (1) collects sound both over a wide range and over divided narrow ranges by using a microphone array formed by arranging a plurality of microphones MIC. The voice signals (MB1, MB2) collected over the wide range are used as the voice signal (MB0) for voice collection. In addition, the voice collection direction (DS) is detected from the highest-level signal among the voice signals (MB11 to MB14, MB21 to MB24) collected over the divided narrow ranges, and the imaging direction of a camera (7) is controlled according to the voice collection direction (DS).

Description

Audio conference device and audio conference system
Technical field
The present invention relates to an audio conference device and an audio conference system that use the audio received by a microphone array made up of a plurality of microphones to detect the direction of a speaker, and that control the shooting direction of a camera so that it points toward the speaker.
Background technology
Currently, meetings between remote sites are mostly held as follows: a conference system with a shooting function is installed at each site, and these conference systems are connected via a network or the like to transmit and receive image data and audio data. Various audio conference systems for use in such meetings have been proposed.
Patent Document 1 discloses a camera device for conference use with the following technique: the position of the speaker is detected from the audio signals received by directional microphones placed for each participant, and the camera captures an image in the direction of that speaker's position.
Patent Document 1: Japanese Unexamined Patent Application Publication No. S61-198891
Summary of the invention
However, the invention of Patent Document 1 requires a directional microphone for each participant, so as many directional microphones as there are conference participants must be prepared.
In addition, because the same microphone beam is used both for sound reception and for detecting the speaker position, there is the following problem: if sound is received over a wide range, the speaker cannot be identified; if sound is received over a narrow range, the speaker can be identified, but when two or more people speak at the same time, only one person's speech can be received.
The present invention has been made in view of the above circumstances, and the audio conference device has:
a microphone array that has a plurality of microphones arranged in a prescribed pattern;
a zone reception acoustic beam formation portion that forms a 1st reception acoustic beam based on the plurality of received audio signals picked up by the microphones of the microphone array, the 1st reception acoustic beam being set with a 1st reception range around this device;
a point reception acoustic beam formation portion that forms 2nd reception acoustic beams based on the plurality of received audio signals picked up by the microphones of the microphone array, each 2nd reception acoustic beam being set with a 2nd reception range smaller than the 1st reception range; and
a shooting direction detection part that detects the direction of the speaker from the plurality of 2nd reception acoustic beams formed by the point reception acoustic beam formation portion, and takes this speaker direction as the shooting direction.
According to this structure, the audio conference device receives sound using a microphone array made up of a plurality of microphones. From the received audio signals, the audio conference device forms a zone reception acoustic beam corresponding to a wide-range region and point reception acoustic beams corresponding to a plurality of narrow-range points. The audio conference device generates and outputs audio data based on the zone reception acoustic beam, and controls the shooting direction of the camera based on the point reception acoustic beams.
Thus, the audio conference device can output audio data received over a wide range and can point the camera toward the main speaker. Moreover, because the audio conference device of the present invention automatically changes the shooting direction of the camera when the main speaker changes, the main speaker can always be designated as the shooting direction.
In addition, the point reception acoustic beam formation portion forms the reception acoustic beams using only the high-frequency components of the received audio signals.
In addition, the audio conference device also has: a communication part that is connected to another audio conference device via a network and communicates with that other audio conference device; and a control part that generates audio data based on the 1st reception acoustic beam formed by the zone reception acoustic beam formation portion and sends this audio data to the other audio conference device via the communication part.
According to this structure, only the high-frequency components are used for the audio signal that controls the shooting direction of the camera, which forms reception acoustic beams with stronger directivity.
Thus, because the audio conference device strengthens the directivity of only the reception acoustic beams used to control the shooting direction of the camera, it can detect the speaker's position more accurately.
In addition, the audio conference system has:
a microphone array that has a plurality of microphones arranged in a prescribed pattern;
a zone reception acoustic beam formation portion that forms a 1st reception acoustic beam based on the plurality of received audio signals picked up by the microphones of the microphone array, the 1st reception acoustic beam being set with a 1st reception range around this device;
a point reception acoustic beam formation portion that forms 2nd reception acoustic beams based on the plurality of received audio signals picked up by the microphones of the microphone array, each 2nd reception acoustic beam being set with a 2nd reception range smaller than the 1st reception range;
a shooting direction detection part that detects the direction of the speaker from the plurality of 2nd reception acoustic beams formed by the point reception acoustic beam formation portion, and takes this speaker direction as the shooting direction; and
a shooting part that shoots in the shooting direction detected by the shooting direction detection part of the audio conference device and generates image data.
According to this structure, the audio conference system has an audio conference device and a camera. The audio conference device generates audio data received over a wide range while controlling the camera so that the main speaker is the shooting direction. The camera shoots in the shooting direction indicated by the audio conference device and generates photographed data.
Thus, the audio conference system can receive sound over a wide range while using the main speaker as the shooting direction of the camera. Moreover, because the audio conference system of the present invention automatically changes the shooting direction of the camera when the main speaker changes, the camera can always shoot the main speaker.
Effect of the invention
As described above, according to the present invention, the speech of conference participants can be received over a wide range while the main speaker is photographed.
Description of drawings
Fig. 1 is an explanatory diagram of an audio conference system that holds an audio conference with a remote site.
Fig. 2 is a three-view drawing of the audio conference device 1 according to the present embodiment.
Fig. 3 is a three-view drawing showing the audio conference device 1 according to the present embodiment.
Fig. 4 is a block diagram showing the functional configuration of the audio conference system according to the present embodiment.
Fig. 5 is an explanatory diagram of the receiving areas.
Fig. 6 is an explanatory diagram of another way of using the audio conference device according to the present embodiment.
Fig. 7 is a block diagram showing the functional configuration of an audio conference system according to another embodiment.
Fig. 8 is a block diagram of an audio conference system according to another embodiment.
Explanation of reference numerals
1: audio conference device
2: housing
3: foot
4: operating portion
5: light-emitting part
6: lower surface grille
7: camera
8: display terminal
9: video communication device
10: control part
11: input and output connector panel
12: input/output interface
13: playback directivity control part
14: D/A converter
15: playback amplifier
16: reception amplifier
17: A/D converter
19: reception acoustic beam selection portion
20: echo elimination portion
21: adaptive echo canceller
22: camera control part
71: image pickup part
72, 82: connection terminal portion
81: display part
91: input/output interface
92: image codec
100: network
110: input and output connector
181, 182: reception acoustic beam generating unit
191: speaker position detection part
211: adaptive filter
212: post processor
MIC101~MIC116, MIC201~MIC216: microphone
SP1~SP16: loudspeaker
Embodiment
An audio conference system according to an embodiment of the present invention is described with reference to Fig. 1. Fig. 1 is an explanatory diagram of an audio conference system that holds a video conference with a remote site.
As shown in Fig. 1, the audio conference system of the present invention is made up of an audio conference device 1, a camera 7, a display terminal 8, and a video communication device 9. The audio conference device 1 is connected to the camera 7. The camera 7 is connected to the video communication device 9. The video communication device 9 is connected to the display terminal 8. When an audio conference is held between remote sites, the audio conference device 1 and the video communication device 9 are connected via the network 100 to the audio conference system located at the remote site.
Below, the structures of the camera 7, display terminal 8, video communication device 9, and audio conference device 1 that make up the audio conference system are described.
The camera 7 is used to shoot the conference participants and consists of an image pickup part 71 and a connection terminal portion 72. It receives an input signal (the receive direction DS described later) from the audio conference device 1 via the connection terminal portion 72 and, by rotating the image pickup part 71 up, down, left, and right (for example, about 120 degrees up and down and about 200 degrees left and right), shoots in the direction indicated by the audio conference device 1. The camera 7 outputs photographed data to the video communication device 9 via the connection terminal portion 72. The connection terminal portion 72 includes a video output terminal, a multi-connector, a power supply terminal, and the like.
The display terminal 8 is used to show image data received via the network 100 from the video conference system at the remote site. It consists of a display part 81 and a connection terminal portion 82, receives an input signal from the video communication device 9 via the connection terminal portion 82, and shows it on the display part 81. The display terminal 8 is a projector, an LCD, or the like.
The video communication device 9 is a device that compresses and expands image data and performs protocol control, and it transmits and receives image data via the network 100. Specifically, the video communication device 9 compresses the photographed data input from the camera 7, packetizes it, and outputs it to the network 100. When image data is input from the network 100, the video communication device 9 arranges the packetized image data in time series and outputs it sequentially to form a bitstream, then expands it and outputs it to the display terminal 8.
Below, the structure of the audio conference device 1 is described with reference to Figs. 2 and 3. The audio conference device 1 according to the present embodiment uses microphone arrays made up of a plurality of microphones arranged in a line. Reception directivity is formed by applying a separate delay to the sound received by each microphone and then combining the results. The reception directivity formed in this way is called a reception acoustic beam. There are two kinds of settings for a reception acoustic beam: a narrow-range setting, in which the beam is pointed at a specific reception point; and a wide-range setting, in which sound produced in a somewhat larger region (for example, each side direction of the audio conference device 1, that is, a speech zone) is received with high gain while sound (noise) produced in other regions is suppressed.
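The delay-and-combine step described above can be sketched as a simple delay-and-sum beamformer. This is only an illustrative sketch, not the patent's implementation: the function names `steering_delays` and `delay_and_sum`, the 2-D geometry with microphones on the line y = 0, the whole-sample delays, and the speed-of-sound constant are all assumptions made for the example.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value


def steering_delays(mic_x, focus, fs):
    """Per-microphone delays (whole samples) that align sound from `focus`.

    mic_x : x-coordinates (m) of microphones on the line y = 0 (assumed layout)
    focus : (x, y) position (m) of the reception point
    fs    : sampling rate (Hz)
    """
    dists = [math.hypot(focus[0] - x, focus[1]) for x in mic_x]
    farthest = max(dists)
    # Microphones closer to the focus hear the sound earlier,
    # so they are given the larger delay.
    return [round((farthest - d) / SPEED_OF_SOUND * fs) for d in dists]


def delay_and_sum(signals, delays):
    """Delay each microphone signal by its sample count, then average."""
    n = len(signals[0])
    out = [0.0] * n
    for sig, d in zip(signals, delays):
        for i in range(d, n):
            out[i] += sig[i - d]
    return [v / len(signals) for v in out]
```

Sound arriving from the focus point adds coherently after the delays, while sound from other positions adds with mismatched timing and is attenuated. A wide-range beam could be approximated in this sketch by choosing delays that are less tightly focused, but how the patent sets its wide-range beam is not specified here.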
Fig. 2 is a three-view drawing showing the audio conference device. Fig. 2(A) is a top view, Fig. 2(B) is a front view, and Fig. 2(C) is a right side view. Fig. 3 shows the loudspeaker arrangement and microphone arrangement of the audio conference device shown in Fig. 2: Fig. 3(A) shows the microphone arrangement on the front, Fig. 3(B) shows the loudspeaker arrangement on the bottom surface, and Fig. 3(C) shows the microphone arrangement on the back.
In the following description, the face shown in Fig. 2(B) is called the front, and up, down, left, and right of the device are specified with reference to this figure.
The exterior of the audio conference device 1 consists of a housing 2 and feet 3, and the housing 2 has an operating portion 4, a light-emitting part 5, and an input and output connector panel 11. The housing 2 has a roughly rectangular parallelepiped shape that is long in the left-right direction, and the feet 3, provided at its left and right ends, lift the bottom surface of the housing 2 a prescribed distance above the installation surface.
The operating portion 4, which has operation buttons such as numeric keys and a display screen, is provided at the right end of the upper surface of the housing 2. The operating portion 4 is connected to the control part 10 inside the housing 2. It accepts operation input from participants and outputs it to the control part 10, and, under the control of the control part 10, shows the operation contents, execution mode, and the like on the display screen.
The light-emitting part 5 is provided at the center of the upper surface of the housing 2 and consists of light-emitting elements such as LEDs arranged radially around the approximate center of the housing 2. The light-emitting part 5 lights up according to light emission control from the control part 10. The control part 10 inputs to the light-emitting part 5 an LED control signal that lights the LED in the receive direction.
The input and output connector panel 11 is provided on the right side face of the housing 2. It has a LAN interface, an analog audio input terminal, an analog audio output terminal, a digital audio input/output terminal, a serial terminal, and so on. Each connector of the input and output connector panel 11 (hereinafter called the input and output connector 110) is connected to the input/output interface 12 inside the housing 2. A DC jack for power supply is also provided on the input and output connector panel 11.
Sixteen loudspeakers SP1~SP16 of the same specification are provided on the lower surface of the housing 2. These loudspeakers SP1~SP16 are arranged in a line at fixed intervals along the length direction of the housing 2 and together form a loudspeaker array. On the front and back of the housing 2, microphones MIC101~MIC116 and microphones MIC201~MIC216 of the same specification are provided. These microphones MIC101~MIC116 and MIC201~MIC216 are arranged in a line along the length direction and form the microphone arrays.
A lower surface grille 6 is attached over the lower surface, front, and back of the housing 2. It covers the loudspeaker array and the microphone arrays and has a U-shaped cross section, forming a channel shape along the length direction. The lower surface grille 6 is made of a metal plate with punched mesh holes; it protects the loudspeakers SP1~SP16 and the microphones MIC101~MIC116 and MIC201~MIC216 while letting the emitted and received sound pass through.
The microphones MIC101~MIC116 and the reception acoustic beam generating unit 181 form the reception acoustic beams on the front side, and the microphones MIC201~MIC216 and the reception acoustic beam generating unit 182 form the reception acoustic beams on the rear side.
In the present embodiment, the loudspeaker array has 16 loudspeakers and each microphone array has 16 microphones, but the numbers are not limited to these; the numbers of loudspeakers and microphones may be set as appropriate for the specification. The spacing of the loudspeaker array and microphone arrays is also arbitrary: the elements may be placed at fixed intervals, or densely at the center and progressively more sparsely toward both ends. In addition, although the microphone array in the present embodiment is a linear array, it is not limited to a linear array and may be an array arranged in a matrix.
Below, the functions of the audio conference system are described with reference to Figs. 4 and 5. Fig. 4 is a block diagram showing the functional configuration of the audio conference system. Fig. 5 is an explanatory diagram of the receiving areas: Fig. 5(A) shows the receiving areas used for sound reception, and Fig. 5(B) shows the receiving areas used for position detection.
Functionally, the audio conference system has: the control part 10, input and output connector 110, input/output interface 12, playback directivity control part 13, D/A converters 14, playback amplifiers 15, loudspeaker array (loudspeakers SP1~SP16), microphone arrays (microphones MIC101~MIC116, MIC201~MIC216), reception amplifiers 16, A/D converters 17, reception acoustic beam generating units 181 and 182, reception acoustic beam selection portion 19, echo elimination portion 20, camera control part 22, and operating portion 4 of the audio conference device 1; the camera 7; the display terminal 8; and the input/output interface 91 and image codec 92 of the video communication device 9.
The control part 10 receives input from the operating portion 4 and controls the playback directivity control part 13, and it receives input from the speaker position detection part 191 and controls the camera control part 22. The details of this control are described later.
The input/output interface 12 packetizes the audio signal input from the echo elimination portion 20 and outputs it to the network 100. It also converts the audio signal input via the input and output connector 110 into a bitstream digital audio signal S1 and outputs it. The digital audio signal S1 is supplied to the playback directivity control part 13 via the echo elimination portion 20.
More specifically, when an audio signal is input via the network 100 and the LAN connector, the input/output interface 12 arranges the packetized audio signal in time series and outputs it sequentially, thereby turning it into a bitstream, and outputs it to the playback directivity control part 13. When an analog signal is input via the analog audio input terminal, the input/output interface 12 digitizes the signal and then outputs it to the playback directivity control part 13.
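The time-series rearrangement of packetized audio described above might look like the following minimal sketch. The packet layout, a `(sequence_number, payload_bytes)` tuple, and the function name `to_bitstream` are assumptions for illustration; the patent does not describe the packet format or protocol.

```python
def to_bitstream(packets):
    """Reassemble packet payloads in time order into one byte stream.

    packets : iterable of (sequence_number, payload_bytes) tuples that may
              arrive out of order. This tuple layout is hypothetical.
    """
    ordered = sorted(packets, key=lambda p: p[0])
    return b"".join(payload for _, payload in ordered)
```

A real receiver would also need to handle lost or duplicated packets and bounded buffering, which this sketch leaves out.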
The playback directivity control part 13 is a function portion that, according to instructions from the control part 10, generates the individual playback signals supplied to each of the loudspeakers SP1~SP16 of the loudspeaker array from the audio signal supplied by the input/output interface 12. The playback directivity control part 13 generates the individual playback signals for the loudspeakers SP1~SP16 so that the loudspeaker array emits sound formed into a beam, that is, a playback acoustic beam. To do so, the playback directivity control part 13 applies prescribed delay processing, prescribed amplitude processing, and the like to the input audio signal for each loudspeaker to generate the individual playback signals. The playback acoustic beam can be either a beam played over a narrow range or a beam played over a wide range, and participants can switch between them by setting the mode through the operating portion 4.
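As a rough illustration of the per-loudspeaker delay and amplitude processing, the sketch below generates one delayed and scaled copy of the input for each loudspeaker so that the wavefronts arrive together at a chosen focus point. The function name `playback_signals`, the 2-D geometry, the whole-sample delays, and the speed-of-sound constant are assumptions for the example, not details taken from the patent.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed


def playback_signals(audio, speaker_x, focus, fs, gains=None):
    """Return one delayed, scaled copy of `audio` per loudspeaker.

    speaker_x : x-coordinates (m) of loudspeakers on the line y = 0 (assumed)
    focus     : (x, y) point (m) where the playback beam should converge
    gains     : optional per-loudspeaker amplitude weights
    """
    dists = [math.hypot(focus[0] - x, focus[1]) for x in speaker_x]
    farthest = max(dists)
    # Loudspeakers nearer the focus wait longer so all wavefronts coincide.
    delays = [round((farthest - d) / SPEED_OF_SOUND * fs) for d in dists]
    gains = gains or [1.0] * len(speaker_x)
    total = len(audio) + max(delays)
    outputs = []
    for delay, g in zip(delays, gains):
        sig = [0.0] * delay + [g * s for s in audio]
        outputs.append(sig + [0.0] * (total - len(sig)))  # pad to equal length
    return outputs
```

In this sketch, moving the focus point far away makes the delays nearly equal, which spreads the playback over a wider range.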
The playback directivity control part 13 then outputs the generated individual playback signals to the D/A converters 14 provided for each of the loudspeakers SP1~SP16. Each D/A converter 14 converts its individual playback signal to analog form and outputs it to the corresponding playback amplifier 15, and each playback amplifier 15 amplifies the individual playback signal and supplies it to the corresponding loudspeaker SP1~SP16.
Each loudspeaker SP1~SP16 of the loudspeaker array converts the supplied individual playback signal into sound and emits it. Because the loudspeakers SP1~SP16 are mounted facing downward on the lower surface of the housing 2, the emitted sound is reflected by the installation surface of the desk on which the audio conference device 1 sits and propagates from the sides of the device obliquely upward toward the participants.
The microphones MIC101~MIC116 and MIC201~MIC216 of the microphone arrays receive sound on the front side and the rear side of the audio conference device 1, respectively, convert it into an electrical signal (an audio signal), and output this audio signal to the corresponding reception amplifier 16. Each reception amplifier 16 amplifies the audio signal and supplies it to the corresponding A/D converter 17, and the A/D converter 17 converts the analog audio signal into a digital signal and outputs it to the reception acoustic beam generating units 181 and 182. Here, the front-side audio signals received by the microphones MIC101~MIC116 on the front are input to the reception acoustic beam generating unit 181, and the rear-side audio signals received by the microphones MIC201~MIC216 on the back are input to the reception acoustic beam generating unit 182.
To form the wide-range reception acoustic beams used for sound reception and the narrow-range reception acoustic beams used for controlling the camera 7, the reception acoustic beam generating units 181 and 182 apply delay processing to the audio signals received by the microphones MIC101~MIC116 and MIC201~MIC216.
Specifically, to receive sound over a wide range, one region is set on each of the front side and the rear side as shown in Fig. 5(A), and reception acoustic beams MB1 and MB2 that receive sound from these regions are formed and output to the reception acoustic beam selection portion 19.
In addition, to point the camera 7 toward the main speaker, reception acoustic beams MB11~MB14 and MB21~MB24 are formed simultaneously for a plurality of points (4 points each on the front side and the rear side in Fig. 5(B)), as shown in Fig. 5(B), and output to the reception acoustic beam selection portion 19.
Unlike sound reception, the narrow-range reception acoustic beams used for controlling the camera 7 do not need to take sound quality into account. The received audio signals may therefore be filtered with a high-pass filter so that the reception acoustic beams MB11~MB14 and MB21~MB24 are generated from only the high-band signals of roughly 1 kHz to 3 kHz, where directivity is strong.
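The patent does not say how the high-pass filter is designed. As one possible stand-in, the sketch below uses a first-order (single-pole) high-pass filter; the function name `high_pass` and the filter order are assumptions. It only sets the lower edge of the band, so limiting the band to about 3 kHz at the top would need an additional low-pass stage.

```python
import math


def high_pass(samples, cutoff_hz, fs):
    """First-order high-pass filter (discrete RC form).

    y[n] = alpha * (y[n-1] + x[n] - x[n-1]),  alpha = RC / (RC + 1/fs)
    """
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    alpha = rc / (rc + 1.0 / fs)
    out = []
    prev_x = 0.0
    prev_y = 0.0
    for x in samples:
        y = alpha * (prev_y + x - prev_x)
        out.append(y)
        prev_x, prev_y = x, y
    return out
```

Each microphone signal would be passed through such a filter before the point beams are formed, while the unfiltered signals remain available for the wide-range beams MB1 and MB2.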
In the present embodiment, 4 points are formed on each of the front side and the rear side, but the number is not limited to this; any plurality of points will do.
Using the speaker position detection part 191, the reception acoustic beam selection portion 19 treats the highest-level signal among the 8 audio signals received at the 8 points by the reception acoustic beams MB11~MB14 and MB21~MB24 as the target audio signal (that is, a participant's speech rather than noise), detects the receive direction DS of this highest-level audio signal, and outputs the receive direction DS to the control part 10.
In addition, the reception acoustic beam selection portion 19 selects, from the 2 reception acoustic beams MB1 and MB2, the one that contains the receive direction DS and outputs it as the audio signal MB0 to the echo elimination portion 20 in the following stage.
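The selection logic in the two paragraphs above can be sketched as follows. Measuring level as RMS, representing the receive direction DS by the name of the winning point beam, and the function names are all assumptions made for this example; the patent only states that the highest-level signal determines DS and that the wide-range beam containing DS becomes MB0.

```python
import math


def rms(samples):
    """Root-mean-square level of one block of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def detect_direction(point_beams):
    """Return the name of the point beam with the highest level.

    point_beams : dict mapping beam names such as "MB11" or "MB23"
                  to a block of samples from that beam.
    """
    return max(point_beams, key=lambda name: rms(point_beams[name]))


def select_area_beam(direction, area_beams):
    """Pick MB1 for front-side points (MB11~MB14), MB2 for rear (MB21~MB24)."""
    side = "MB1" if direction.startswith("MB1") else "MB2"
    return area_beams[side]
```

For example, if the block from MB23 has the highest level, `detect_direction` returns "MB23" and `select_area_beam` returns the rear-side beam MB2 as MB0.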
The echo elimination portion 20 is a function portion for preventing echo, which here means that the audio signal input from the input/output interface 12 is played by the loudspeakers SP1~SP16, and the played sound returns to the microphones MIC101~MIC116 and MIC201~MIC216 and is output again from the input/output interface 12. The echo elimination portion 20 uses an adaptive filter 211 to estimate the sound that returns along this path, and suppresses the echo by subtracting the estimated return sound from the audio signal received by the microphones.
Specifically, the echo elimination portion 20 has an adaptive echo canceller 21. The adaptive echo canceller 21 has an adaptive filter 211 and a post processor 212. The adaptive filter 211 estimates the audio signal component that returns to the microphones MIC from the audio signal supplied to the loudspeakers SP, and generates a pseudo return-sound signal. The post processor 212 removes the echo component by subtracting the pseudo return-sound signal corresponding to the input audio signal S1 from the audio signal MB0 output by the reception acoustic beam selection portion 19. The audio signal obtained by removing the echo component from the audio signal MB0 is input to the input/output interface 12.
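The patent does not name the adaptation algorithm used by the adaptive filter 211. The sketch below uses normalized least mean squares (NLMS), a common choice for echo cancellation, purely as an assumed stand-in; the function names, tap count, and step size are illustrative. The filter output plays the role of the pseudo return-sound signal, and the subtraction plays the role of the post processor 212.

```python
import random


def simulate_echo(far_end, path):
    """Hypothetical loudspeaker-to-microphone path: FIR filter `path`."""
    return [sum(h * far_end[n - k] for k, h in enumerate(path) if n - k >= 0)
            for n in range(len(far_end))]


def nlms_echo_cancel(far_end, mic, taps=8, mu=0.5, eps=1e-8):
    """Subtract an adaptively estimated echo of `far_end` from `mic`."""
    w = [0.0] * taps   # adaptive filter coefficients
    x = [0.0] * taps   # most recent far-end samples, newest first
    out = []
    for s, d in zip(far_end, mic):
        x = [s] + x[:-1]
        echo_est = sum(wi * xi for wi, xi in zip(w, x))  # pseudo return sound
        e = d - echo_est                                  # echo-reduced output
        norm = sum(xi * xi for xi in x) + eps
        w = [wi + mu * e * xi / norm for wi, xi in zip(w, x)]
        out.append(e)
    return out


if __name__ == "__main__":
    random.seed(0)
    far = [random.uniform(-1.0, 1.0) for _ in range(3000)]
    mic = simulate_echo(far, [0.5, 0.3, 0.1])  # echo only, nobody talking
    residual = nlms_echo_cancel(far, mic)
    print(sum(v * v for v in mic[-500:]), sum(v * v for v in residual[-500:]))
```

Here `far_end` corresponds to the signal S1 sent to the loudspeakers and `mic` to the selected beam signal MB0. When a local participant is talking at the same time, a practical canceller would also need to slow or pause adaptation, which this sketch does not do.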
By performing the above echo cancellation process, the audio signal that returns from the loudspeakers SP to the microphones MIC can be predicted and removed accurately, so that only the audio signal actually picked up by the microphones MIC is output from the input/output interface 12.
When the receive direction DS is input from the control part 10, the camera control part 22 controls the direction of the image pickup part 71 of the camera 7 so that the receive direction DS is at the center of the shooting direction. The camera 7 thus determines its shooting direction according to the receive direction DS input from the audio conference device 1, and the speaker can be shot automatically. The photographed data of the camera 7 is output to the image codec 92.
The image codec 92 compresses the photographed data input from the camera 7 and outputs it to the input/output interface 91. The image codec 92 also expands the image signal P1 input from the input/output interface 91 and outputs it to the display terminal 8.
The input/output interface 91 packetizes the photographed data input from the image codec 92 and outputs it to the network 100. The input/output interface 91 also converts the image signal input from the network 100 into a bitstream digital image signal P1 and outputs it. The digital image signal P1 is supplied to the display terminal 8 via the image codec 92.
More specifically, when image information is input via the network 100, the input/output interface 91 arranges the packetized image signal in time series and outputs it sequentially, thereby turning it into a bitstream, and outputs it toward the display terminal 8.
As described above, the audio conference system of the present embodiment generates two different kinds of reception acoustic beam: one for sound reception and one for detecting the spokesman's position. By using the reception acoustic beam for sound reception, sound on the side of the audio conference device opposite the main spokesman is not received and only the sound on the main spokesman's side is received effectively, so that the main spokesman's speech is made clear. By using the reception acoustic beams for spokesman position detection, the position of the main spokesman is determined so that the camera 7 shoots toward the main spokesman. Furthermore, if the main spokesman changes, the direction of the camera 7 is switched automatically.
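The position detection summarized above can be sketched as forming several narrow beams and choosing the one with the highest level. The integer-sample delay-and-sum beamformer and the mean-square level measure below are assumptions for illustration; the function names and the direction labels are hypothetical.

```python
def delay_and_sum(channels, delays):
    """Form one beam by delaying each microphone channel and averaging.

    channels: list of equal-length sample lists, one per microphone.
    delays:   non-negative integer delay (in samples) applied per channel.
    """
    n = len(channels[0])
    beam = [0.0] * n
    for ch, d in zip(channels, delays):
        for i in range(d, n):
            beam[i] += ch[i - d]
    return [v / len(channels) for v in beam]


def beam_level(samples):
    """Mean-square level of a beam (an assumed level measure)."""
    return sum(s * s for s in samples) / len(samples)


def detect_shooting_direction(spot_beams):
    """Return the direction label whose spot beam has the highest level.

    spot_beams: dict mapping a direction label to that beam's samples.
    """
    return max(spot_beams, key=lambda d: beam_level(spot_beams[d]))
```

A beam whose delays match the arrival-time differences of the talker's sound adds the channels coherently and therefore shows a higher level than beams steered elsewhere, which is why the maximum-level beam indicates the talker's direction.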
In addition, as shown in Figure 6, the audio conference system of the present invention can be used in a meeting as a public address set without the video communication device 9. In this case, the audio conference device 1 is connected to the camera 7, and the camera 7 is connected to the display terminal 8. The audio conference device 1 amplifies and plays the sound it receives. The camera 7 determines its shooting direction according to the receive direction DS input from the audio conference device 1, shoots, and generates photographed data. The camera 7 outputs the generated photographed data to the display terminal 8, where it is displayed.
Thus, the spokesman's speech can be amplified and played while the camera 7 shoots the main spokesman and the display terminal 8 shows the result. Therefore, even in a meeting held in a large conference room or the like, participants can easily hear the spokesman's speech. In addition, because the meeting proceeds with the main spokesman shown on the display terminal 8, participants can easily tell who the main spokesman is.
In addition, the invention is not limited to the present embodiment. As shown in Figure 7, regardless of the receive direction of the audio signal, the reception acoustic beam selection portion 19 may synthesize the two reception acoustic beams MB1 and MB2 to generate the audio signal MB0 and output this audio signal MB0 to the echo elimination portion 20 in the subsequent stage.
Because the audio signal MB0 is generated by synthesizing the two reception acoustic beams MB1 and MB2, sound is received over a wide range rather than only on the main spokesman's side, so the speech of all participants is received effectively while the camera 7 still reliably shoots the main spokesman.
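The synthesis of the two reception acoustic beams can be sketched as a sample-wise sum. The text does not say how the beams are synthesized, so plain addition with an optional gain is an assumption, and the function and parameter names are hypothetical.

```python
def synthesize_beams(mb1, mb2, gain=1.0):
    """Mix two reception beams sample by sample into one signal (MB0).

    gain is a hypothetical scaling factor, e.g. 0.5 to keep the mixed
    signal within the range of either input.
    """
    if len(mb1) != len(mb2):
        raise ValueError("beams must have the same number of samples")
    return [gain * (a + b) for a, b in zip(mb1, mb2)]
```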
In addition, the invention is not limited to the present embodiment. As shown in Figure 8, a communication portion for audio and images may be provided in the audio conference device 1, through which a conference can be held with the audio conference device on the other party's side. In this case, the photographed data shot by the camera 7 and the voice data received by the microphones are output to the network 100 via the audio conference device 1. Picture signals input via the network 100 from another audio conference device at a remote site are displayed on the display terminal 8 via the audio conference device 1. The photographed data sent to the other audio conference device is the data shot by the camera 7 after the camera 7 has been controlled so that its shooting direction is the receive direction corresponding to the highest-level audio signal detected among the plurality of narrower-range reception acoustic beams. The voice data sent is generated from the wider-range reception acoustic beam that contains the receive direction detected with the narrower-range reception acoustic beams. In this case, it suffices to integrate the input/output interface 91 for picture signals with the input/output interface 12 for audio signals and to connect them to the network 100 via a common input/output connector 110.
In Figure 8, an image communication portion is added to the audio conference device 1 of Figure 4, but the invention is not limited to this; an image communication portion may likewise be added to the audio conference device 1 of Figure 7.

Claims (6)

1. An audio conference device, comprising:
a microphone array having a plurality of microphones arranged in a prescribed pattern;
an area reception acoustic beam formation portion that forms a 1st reception acoustic beam based on a plurality of received audio signals received by the respective microphones of the microphone array, wherein a 1st reception range around the device is set for the 1st reception acoustic beam;
a point reception acoustic beam formation portion that, based on the plurality of received audio signals received by the respective microphones of the microphone array, forms 2nd reception acoustic beams at the same time as the 1st reception acoustic beam is formed, wherein a plurality of 2nd reception ranges narrower than the 1st reception range are set for the 2nd reception acoustic beams; and
a shooting direction detection portion that compares the plurality of 2nd reception acoustic beams formed simultaneously by the point reception acoustic beam formation portion, detects the spokesman's direction based on the 2nd reception acoustic beam of maximum level, and detects this spokesman's direction as the shooting direction,
wherein the 1st reception range of the 1st reception acoustic beam is set independently of the shooting direction detected by the shooting direction detection portion.
2. The audio conference device according to claim 1, wherein
the point reception acoustic beam formation portion forms the reception acoustic beams using only the high-frequency component of the received audio signals.
3. The audio conference device according to claim 1, further comprising:
a communication portion that is connected to another audio conference device via a network and communicates with that other audio conference device; and
a control part that generates voice data based on the 1st reception acoustic beam formed by the area reception acoustic beam formation portion and sends this voice data to the other audio conference device via the communication portion.
4. An audio conference system, comprising:
a microphone array having a plurality of microphones arranged in a prescribed pattern;
an area reception acoustic beam formation portion that forms a 1st reception acoustic beam based on a plurality of received audio signals received by the respective microphones of the microphone array, wherein a 1st reception range around the device is set for the 1st reception acoustic beam;
a point reception acoustic beam formation portion that, based on the plurality of received audio signals received by the respective microphones of the microphone array, forms 2nd reception acoustic beams at the same time as the 1st reception acoustic beam is formed, wherein a plurality of 2nd reception ranges narrower than the 1st reception range are set for the 2nd reception acoustic beams;
a shooting direction detection portion that compares the plurality of 2nd reception acoustic beams formed simultaneously by the point reception acoustic beam formation portion, detects the spokesman's direction based on the 2nd reception acoustic beam of maximum level, and detects this spokesman's direction as the shooting direction; and
a shooting portion that shoots in the shooting direction detected by the shooting direction detection portion and generates image data,
wherein the 1st reception range of the 1st reception acoustic beam is set independently of the shooting direction detected by the shooting direction detection portion.
5. The audio conference system according to claim 4, wherein
the point reception acoustic beam formation portion forms the reception acoustic beams using only the high-frequency component of the received audio signals.
6. The audio conference system according to claim 4, further comprising:
a communication portion that is connected to another audio conference device via a network and communicates with that other audio conference device; and
a control part that generates voice data based on the 1st reception acoustic beam formed by the area reception acoustic beam formation portion and sends this voice data to the other audio conference device via the communication portion.
CN2007800321284A 2006-10-17 2007-10-16 Audio conference apparatus and audio conference system Active CN101513056B (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2006282565A JP5028944B2 (en) 2006-10-17 2006-10-17 Audio conference device and audio conference system
JP282565/2006 2006-10-17
PCT/JP2007/070195 WO2008047804A1 (en) 2006-10-17 2007-10-16 Voice conference device and voice conference system

Publications (2)

Publication Number Publication Date
CN101513056A CN101513056A (en) 2009-08-19
CN101513056B true CN101513056B (en) 2011-12-14

Family

ID=39314031

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2007800321284A Active CN101513056B (en) 2006-10-17 2007-10-16 Audio conference apparatus and audio conference system

Country Status (3)

Country Link
JP (1) JP5028944B2 (en)
CN (1) CN101513056B (en)
WO (1) WO2008047804A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106911484A (en) * 2015-12-23 2017-06-30 卡讯电子股份有限公司 Microphone speech system control method

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100970609B1 (en) * 2008-12-01 2010-07-16 박철우 camera place control unit with sensing the sound
WO2011147070A1 (en) * 2010-05-24 2011-12-01 Mediatek Singapore Pte. Ltd. Method for generating multimedia data to be displayed on display apparatus and associated multimedia player
CN102404663A (en) * 2010-09-10 2012-04-04 中兴通讯股份有限公司 Microphone array device, conference system and intelligent terminal
US9565493B2 (en) 2015-04-30 2017-02-07 Shure Acquisition Holdings, Inc. Array microphone system and method of assembling the same
US9554207B2 (en) 2015-04-30 2017-01-24 Shure Acquisition Holdings, Inc. Offset cartridge microphones
JP2017034313A (en) 2015-07-28 2017-02-09 株式会社リコー Imaging apparatus, program, and imaging method
JP6631166B2 (en) * 2015-08-03 2020-01-15 株式会社リコー Imaging device, program, and imaging method
JP6547496B2 (en) * 2015-08-03 2019-07-24 株式会社リコー Communication apparatus, communication method, program and communication system
JP6551155B2 (en) 2015-10-28 2019-07-31 株式会社リコー Communication system, communication apparatus, communication method and program
CN106101885A (en) * 2016-08-05 2016-11-09 上海柏莱特视听设备服务有限公司 Meeting mike
US10367948B2 (en) 2017-01-13 2019-07-30 Shure Acquisition Holdings, Inc. Post-mixing acoustic echo cancellation systems and methods
EP3804356A1 (en) 2018-06-01 2021-04-14 Shure Acquisition Holdings, Inc. Pattern-forming microphone array
US11297423B2 (en) 2018-06-15 2022-04-05 Shure Acquisition Holdings, Inc. Endfire linear array microphone
WO2020061353A1 (en) 2018-09-20 2020-03-26 Shure Acquisition Holdings, Inc. Adjustable lobe shape for array microphones
US11438691B2 (en) 2019-03-21 2022-09-06 Shure Acquisition Holdings, Inc. Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition functionality
CN113841419A (en) 2019-03-21 2021-12-24 舒尔获得控股公司 Housing and associated design features for ceiling array microphone
US11558693B2 (en) 2019-03-21 2023-01-17 Shure Acquisition Holdings, Inc. Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition and voice activity detection functionality
TWI699120B (en) * 2019-04-30 2020-07-11 陳筱涵 Conference recording system and conference recording method
EP3973716A1 (en) 2019-05-23 2022-03-30 Shure Acquisition Holdings, Inc. Steerable speaker array, system, and method for the same
US11302347B2 (en) 2019-05-31 2022-04-12 Shure Acquisition Holdings, Inc. Low latency automixer integrated with voice and noise activity detection
EP4018680A1 (en) 2019-08-23 2022-06-29 Shure Acquisition Holdings, Inc. Two-dimensional microphone array with improved directivity
US12028678B2 (en) 2019-11-01 2024-07-02 Shure Acquisition Holdings, Inc. Proximity microphone
US11552611B2 (en) 2020-02-07 2023-01-10 Shure Acquisition Holdings, Inc. System and method for automatic adjustment of reference gain
US11706562B2 (en) 2020-05-29 2023-07-18 Shure Acquisition Holdings, Inc. Transducer steering and configuration systems and methods using a local positioning system
JP2024505068A (en) 2021-01-28 2024-02-02 シュアー アクイジッション ホールディングス インコーポレイテッド Hybrid audio beamforming system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1411278A (en) * 2002-11-25 2003-04-16 北京邮电通信设备厂 IP network TV conference system
CN2701199Y (en) * 2004-06-18 2005-05-18 陈荣 Desktop automatic controlled video-audio conference control device

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09163334A (en) * 1995-12-14 1997-06-20 Fujitsu Ltd Speaker detection circuit and video conference system
JPH10145763A (en) * 1996-11-15 1998-05-29 Mitsubishi Electric Corp Conference system
JPH10191290A (en) * 1996-12-27 1998-07-21 Kyocera Corp Video camera with built-in microphone
JP2002186084A (en) * 2000-12-14 2002-06-28 Matsushita Electric Ind Co Ltd Directive sound pickup device, sound source direction estimating device and system
JP3739673B2 (en) * 2001-06-22 2006-01-25 日本電信電話株式会社 Zoom estimation method, apparatus, zoom estimation program, and recording medium recording the program
JP4138680B2 (en) * 2004-02-27 2008-08-27 株式会社東芝 Acoustic signal processing apparatus, acoustic signal processing method, and adjustment method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JP特开平10-191290A 1998.07.21

Also Published As

Publication number Publication date
CN101513056A (en) 2009-08-19
JP2008103824A (en) 2008-05-01
WO2008047804A1 (en) 2008-04-24
JP5028944B2 (en) 2012-09-19

Similar Documents

Publication Publication Date Title
CN101513056B (en) Audio conference apparatus and audio conference system
CN101297587B (en) Sound pickup device and voice conference apparatus
JP3972921B2 (en) Voice collecting device and echo cancellation processing method
JP5637661B2 (en) Method for recording and playing back sound sources with time-varying directional characteristics
CN1956601B (en) Audio reproducing apparatus and audio reproducing method
JP2005086365A (en) Talking unit, conference apparatus, and photographing condition adjustment method
JP2004343262A (en) Microphone-loudspeaker integral type two-way speech apparatus
JP2008154056A (en) Audio conference device and audio conference system
CN110349582B (en) Display device and far-field voice processing circuit
CN113203988B (en) Sound source positioning method and device
JP4411959B2 (en) Audio collection / video imaging equipment
CN102724604A (en) Sound processing method for video meeting
WO2011153907A1 (en) Method, apparatus and remote video conference system for playing audio of remote participator
CN110035372A (en) Output control method and device of sound amplification system, sound amplification system and computer equipment
CN111145773B (en) Sound field restoration method and device
CN108510997A (en) Electronic equipment and echo cancel method applied to electronic equipment
CN207676616U (en) A kind of intelligent advertisement board based on interactive voice
CN102209225A (en) Method and device for realizing video communication
CN107750020A (en) A kind of microphone and conference system with electronic table tablet stand
JP4479227B2 (en) Audio pickup / video imaging apparatus and imaging condition determination method
CN214851543U (en) Recording and broadcasting equipment
JP4225129B2 (en) Microphone / speaker integrated type interactive communication device
CN115988163A (en) Plug-and-play wireless intelligent audio and video receiving and transmitting system
CN112788489B (en) Control method and device and electronic equipment
JP4269854B2 (en) Telephone device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant