WO2020252886A1 - Directional sound pickup method, recording device and storage medium - Google Patents

Directional sound pickup method, recording device and storage medium Download PDF

Info

Publication number
WO2020252886A1
WO2020252886A1 PCT/CN2019/101711 CN2019101711W WO2020252886A1 WO 2020252886 A1 WO2020252886 A1 WO 2020252886A1 CN 2019101711 W CN2019101711 W CN 2019101711W WO 2020252886 A1 WO2020252886 A1 WO 2020252886A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound
pointing
directional
noise
sounds
Prior art date
Application number
PCT/CN2019/101711
Other languages
French (fr)
Chinese (zh)
Inventor
陈永金
Original Assignee
深圳壹秘科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳壹秘科技有限公司 filed Critical 深圳壹秘科技有限公司
Publication of WO2020252886A1 publication Critical patent/WO2020252886A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • G10L21/055Time compression or expansion for synchronising with other signals, e.g. video signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/06Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]

Definitions

  • This application relates to the field of audio processing technology, and in particular to a directional sound pickup method, recording device and storage medium.
  • the main purpose of this application is to provide a directional sound pickup method, recording device and storage medium, aiming to solve the technical problem that the directional sound pickup function of the recording device cannot achieve the expected effect.
  • this application provides a directional sound pickup method, which includes the following steps:
  • the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds is used as a pointing sound, and the noise of the pointing sound is reduced.
  • the step of compensating the time delay of the multiple target sounds so that the timings of the multiple target sounds are consistent includes:
  • the timing of the standard sound is determined, and the timing of the non-standard sound is adjusted according to the timing of the standard sound, so that the timing of the non-standard sound is consistent with the timing of the standard sound.
  • the step of acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user includes:
  • the method further includes:
  • the step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a pointing sound includes:
  • the corresponding pointing azimuth is determined according to the horizontal angle value and the elevation angle value, and a target sound corresponding to the pointing azimuth among the multiple target sounds is used as the pointing sound.
  • the method further includes:
  • the normalized pointing sound is input into a preset filter to enhance the pointing sound.
  • the step of reducing the noise of the pointing sound includes:
  • the method further includes:
  • the model of the microphone array is ES7210, and the directivity of the microphone array is at least one of omnidirectional, cardioid, super cardioid, and figure-8;
  • the preset filter is at least one of a spatial filter, a time domain filter, an auditory filter, and a Wiener filter;
  • the preset noise estimation algorithm is the logarithmic spectrum amplitude optimal estimation method and/or MCRA (Minima-Controlled-Recursive-Averaging Algorithms, recursive average) algorithm.
  • the present application also provides a recording device, the recording device comprising: a memory, a processor, and computer readable instructions stored in the memory and running on the processor, the When the computer-readable instructions are executed by the processor, the steps of the directional sound pickup method described above are realized.
  • the present application also provides a computer-readable storage medium having computer-readable instructions stored on the computer-readable storage medium, and when the computer-readable instructions are executed by a processor, the above-mentioned orientation is achieved.
  • the steps of the pickup method are not limited to, but not limited to, but not limited to, but not limited to, but not limited to, but not limited to, but not limited to, but not limited to, but not limited to, but not limited to, but not limited to the pickup method.
  • This application provides a directional sound pickup method, a recording device, and a storage medium.
  • the directional sound pickup method acquires multiple collected target sounds and directional sound pickup instructions input by a user, and responds to the multiple target sounds. Delay compensation to make the timings of the multiple target sounds consistent; use the target sound corresponding to the directional sound pickup instruction among the multiple target sounds as the pointing sound, and perform noise reduction on the pointing sound.
  • the recording device when the recording device picks up the sound, it compensates for the time delay of the recorded sound in all directions, and then accurately picks up the sound in the direction specified by the user, and performs noise reduction processing on the noise in the picked up sound, thereby reducing non-direction
  • the sound of the direction interferes with the recording result, so that the directional sound pickup of the recording equipment achieves the best effect.
  • Figure 1 is a schematic structural diagram of a hardware operating environment involved in a solution of an embodiment of the present application
  • FIG. 2 is a schematic flowchart of an embodiment of a directional sound pickup method according to the present application
  • FIG. 3 is a detailed schematic diagram of the steps of compensating the time delay of the multiple target sounds to make the sequence of the multiple target sounds consistent according to the application;
  • FIG. 4 is a schematic flowchart of another embodiment of the directional sound pickup method according to the present application.
  • FIG. 1 is a schematic diagram of a terminal structure of a hardware operating environment involved in a solution of an embodiment of the present application.
  • the terminal of this application is a device.
  • the device can be a recording device, or a terminal device with a storage function, such as a server, a computer, a smart phone, a tablet computer, and a portable computer.
  • the terminal may include a processor 1001, such as a CPU, a communication bus 1002, a user interface 1003, a network interface 1004, and a memory 1005.
  • the communication bus 1002 is used to implement connection and communication between these components.
  • the user interface 1003 may include a display screen (Display) and an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface.
  • the network interface 1004 may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface).
  • the memory 1005 can be a high-speed RAM memory or a stable memory (non-volatile memory), such as disk storage.
  • the memory 1005 may also be a storage device independent of the foregoing processor 1001.
  • the terminal may also include a camera, a Wi-Fi module, etc., which will not be repeated here.
  • terminal structure shown in FIG. 1 does not constitute a limitation on the terminal, and may include more or fewer components than shown in the figure, or combine some components, or arrange different components.
  • the network interface 1004 is mainly used to connect to a back-end server and perform data communication with the back-end server;
  • the user interface 1003 mainly includes an input unit such as a keyboard.
  • the keyboard includes a wireless keyboard and a wired keyboard to connect to the client.
  • Perform data communication with the client; and the processor 1001 can be used to call computer readable instructions stored in the memory 1005 and perform the following operations:
  • the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds is used as a pointing sound, and the noise of the pointing sound is reduced.
  • processor 1001 may call computer-readable instructions stored in the memory 1005, and also perform the following operations:
  • the timing of the standard sound is determined, and the timing of the non-standard sound is adjusted according to the timing of the standard sound, so that the timing of the non-standard sound is consistent with the timing of the standard sound.
  • processor 1001 may call computer-readable instructions stored in the memory 1005, and also perform the following operations:
  • processor 1001 may call computer-readable instructions stored in the memory 1005, and also perform the following operations:
  • the step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a pointing sound includes:
  • the corresponding pointing azimuth is determined according to the horizontal angle value and the elevation angle value, and a target sound corresponding to the pointing azimuth among the multiple target sounds is used as the pointing sound.
  • processor 1001 may call computer-readable instructions stored in the memory 1005, and also perform the following operations:
  • the normalized pointing sound is input into a preset filter to enhance the pointing sound.
  • processor 1001 may call computer-readable instructions stored in the memory 1005, and also perform the following operations:
  • processor 1001 may call computer-readable instructions stored in the memory 1005, and also perform the following operations:
  • the specific embodiment of the terminal is basically the same as the following embodiments of the directional sound pickup method, and will not be repeated here.
  • FIG. 2 is a schematic flowchart of an embodiment of a directional sound pickup method according to the present application.
  • the directional sound pickup method provided in this embodiment includes the following steps:
  • Step S10 Acquire the collected multiple target sounds and the directional sound pickup instruction input by the user, and compensate for the time delay of the multiple target sounds, so that the timings of the multiple target sounds are consistent;
  • the recording device in this embodiment has multiple built-in microphones to acquire multiple collected target sounds and a directional sound pickup instruction input by the user.
  • the directional sound pickup instruction is an instruction input by the user to determine the pickup position. It is easy to understand that the positions of the multiple microphones built into the recording device and the sound source are different, so the timing of the multiple target sounds collected is also different. Generally speaking, the microphone closest to the sound source receives the sound first, and the microphone furthest from the sound source receives the sound last, and the sound is longer. In this embodiment, time delay compensation is performed on multiple target sounds, so that the timings of the multiple target sounds are consistent.
  • Step S20 Use the target sound corresponding to the directional sound pickup instruction among the multiple target sounds as the pointing sound, and perform noise reduction on the pointing sound.
  • the directional pickup instruction contains azimuth information
  • extract the target sound corresponding to the azimuth information from the multiple target sounds and use the target sound as the pointing sound
  • the pointing sound is the sound recorded by the recording device and corresponding to the direction specified by the user. Noise reduction is performed on pointing sound to enhance the directional sound pickup effect of recording equipment.
  • This embodiment provides a directional sound pickup method, which acquires a plurality of collected target sounds and a directional sound pickup instruction input by a user, and compensates for the delay of the plurality of target sounds, so that the plurality of target sounds The time sequence of is consistent; the target sound corresponding to the directional sound pickup instruction among the multiple target sounds is used as the pointing sound, and the noise of the pointing sound is reduced.
  • the recording device when the recording device picks up the sound, it compensates for the time delay of the recorded sound in all directions, and then accurately picks up the sound in the direction specified by the user, and performs noise reduction processing on the noise in the picked up sound, thereby reducing non-direction
  • the sound of the direction interferes with the recording result, so that the directional sound pickup of the recording equipment achieves the best effect.
  • FIG. 3 is a detailed schematic diagram of the steps of compensating the time delay of the multiple target sounds to make the sequence of the multiple target sounds consistent according to the application.
  • the step of compensating the time delay of a target sound so that the sequence of the multiple target sounds is consistent includes:
  • Step S11 Determine the sound collected by the preset reference microphone from the collected multiple target sounds, and use the sound collected by the preset reference microphone as the standard sound.
  • the multiple collected target sounds excluding the standard sound Target sound as non-standard sound;
  • Step S12 Determine the timing of the standard sound, and adjust the timing of the non-standard sound according to the timing of the standard sound, so that the timing of the non-standard sound is consistent with the timing of the standard sound.
  • the recording device includes multiple microphones, and the sound recorded by each microphone is different in timing.
  • a microphone is preset as the reference microphone, and the sound collected by the preset reference microphone is used as the standard sound, then in the process of delay compensation for the target sound, the timing of the standard sound recorded by the recording device As a standard; the target sounds other than the standard sound among the collected multiple target sounds are regarded as non-standard sounds, that is, in the process of delay compensation for the target sound, only the non-standard sounds are needed. Make timing adjustments.
  • the timing of the standard sounds is determined, and the timing of the standard sounds is used as a reference to adjust the timing of the non-standard sounds so that the timing of the non-standard sounds is consistent with the standard
  • the sound sequence is consistent, so that the sequence of the acquired multiple target sounds is consistent, and the time delay compensation for the recorded sound is realized.
  • the target sound is compensated for the time delay in the above-mentioned manner, thereby reducing the interference of non-directional sound on the recording result.
  • the step of acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user includes:
  • Step S13 Generate a corresponding directional sound pickup instruction according to the user's operation on the display interface, and collect sound through each microphone in the microphone array according to the directional sound pickup instruction;
  • Step S14 Use the sound collected by each microphone as the target sound.
  • the corresponding directional pickup instruction is generated according to the user's operation on the display interface. It is easy to understand that the user can also generate the corresponding directional pickup instruction in other ways. For example, the user can generate the corresponding directional pickup instruction by adjusting the position of the terminal. Corresponding directional sound pickup instructions; the user can also directly input location information to generate corresponding directional sound pickup instructions, which is not limited in this embodiment. After acquiring the directional sound pickup instruction input by the user, the sound is collected through each microphone in the microphone array according to the directional sound pickup instruction.
  • the microphone array in the microphone array is composed of a 6+1 mode, that is, 6 microphones surround the middle microphone to form a closed loop; the model of the microphone array is ES7210, and the direction of the microphone array Sex is at least one of omnidirectional, cardioid, super cardioid, and figure-eight.
  • the sound of the environment where the recording device is located is collected in all directions, and the sound collected by each microphone is used as the target sound.
  • the sound around the recording device is collected in an all-round manner and used as the target sound, thereby ensuring that the user can pick up the sound in a specified direction of 360 degrees.
  • step S10 After acquiring the multiple target sounds collected and the directional sound pickup instruction input by the user in the above step S10, it further includes:
  • Step S30 Determine the corresponding horizontal angle value and the elevation angle value according to the directional sound pickup instruction input by the user, and display the horizontal angle value and the elevation angle value on the display interface;
  • the step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a pointing sound includes:
  • Step S21 Determine the corresponding pointing azimuth according to the horizontal angle value and the elevation angle value, and use the target sound corresponding to the pointing azimuth among the multiple target sounds as the pointing sound.
  • the directional sound pickup instruction input by the user includes the position information for the user to determine the pickup direction, and the directional sound pickup instruction includes the horizontal angle value and the elevation angle value.
  • the directional sound pickup instruction includes the horizontal angle value and the elevation angle value.
  • the direction in which the user intends to pick up the sound can be determined according to the horizontal angle value and the elevation angle value, and the above direction is regarded as the pointing direction. Since a target sound is collected by a microphone, and the position of each microphone on the recording device is fixed, the corresponding microphone on the recording device can be determined according to the horizontal angle value and the elevation angle value, and the microphone is recorded The target sound serves as the pointing sound. In this embodiment, in the above manner, the corresponding microphone on the recording device is determined according to the directional sound pickup instruction, and the sound recorded by the microphone is used as the directional sound to accurately realize the directional sound pickup.
  • step S20 uses the target sound corresponding to the directional sound pickup instruction among the multiple target sounds as the pointing sound
  • the method further includes:
  • Step S40 performing signal sampling and normalization processing on the pointing sound
  • Step S50 Input the normalized pointing sound into a preset filter to enhance the pointing sound.
  • signal sampling is performed on the pointing sound.
  • the above-mentioned signal sampling is to collect the instantaneous value of the sound data point by point according to a certain time interval, thereby realizing the discretization of the sound.
  • the discrete sound can also be Fourier transformed, and different sound data can be weighted according to the different frequency bands used, and multiple discrete sound data can be classified One treatment.
  • After normalizing the pointing sound further input the normalized pointing sound into a preset filter to enhance the pointing sound and improve the performance of the pointing sound in a reverberant environment.
  • the preset filter is at least one of a spatial filter, a time domain filter, an auditory filter, and a Wiener filter.
  • the sound directed in the direction is enhanced, thereby improving the effect of directional sound pickup.
  • the step of reducing the noise of the pointing sound includes:
  • Step S22 input the pointing sound into a preset noise estimation algorithm to identify the noise in the pointing sound;
  • Step S23 Determine the frequency, amplitude and phase of the noise, and output the anti-noise corresponding to the noise to reduce the noise of the pointing sound, wherein the anti-noise is the same frequency and amplitude as the noise And sound waves of opposite phase.
  • a noise estimation algorithm is also preset.
  • the preset noise estimation algorithm is the logarithmic spectrum amplitude optimal estimation method and/or the MCRA algorithm.
  • the preset noise estimation algorithm is used to perform noise estimation on the pointing sound to identify Point out the noise in the sound, and determine the frequency, amplitude and phase of the noise. Then input the anti-noise sound wave with the same frequency, the same amplitude and the opposite phase to the pointing sound to achieve the phase cancellation with the noise, thereby reducing the noise of the pointing sound and reducing the interference of the noise source in the pointing sound to the recording result. Make the directional pickup of recording equipment achieve the best effect.
  • FIG. 4 is a schematic flowchart of another embodiment of the directional sound pickup method according to the present application.
  • step S20 performs noise reduction on the pointing sound, it further includes:
  • step S60 the noise-reduced pointing sound is uploaded to the cloud server, and after the reading instruction input by the user is obtained, the corresponding pointing sound is downloaded from the cloud server.
  • the pointing sound at this time is the most ideal.
  • there is also an automatic backup function of pointing sound and the noise-reduced pointing sound is uploaded to the cloud server, so that the user can extract the pointing sound at any time.
  • the server downloads and plays the corresponding pointing sound.
  • the model of the microphone array is ES7210, and the directivity of the microphone array is at least one of omnidirectional, cardioid, super cardioid, and figure-8;
  • the preset filter is at least one of a spatial filter, a time domain filter, an auditory filter, and a Wiener filter;
  • the preset noise estimation algorithm is the logarithmic spectrum amplitude optimal estimation method and/or MCRA (Minima-Controlled-Recursive-Averaging Algorithms, recursive average) algorithm.
  • an embodiment of the present application also proposes a computer-readable storage medium having computer-readable instructions stored thereon, and when the computer-readable instructions are executed by a processor, the following operations are implemented:
  • the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds is used as a pointing sound, and the noise of the pointing sound is reduced.
  • the timing of the standard sound is determined, and the timing of the non-standard sound is adjusted according to the timing of the standard sound, so that the timing of the non-standard sound is consistent with the timing of the standard sound.
  • the step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a pointing sound includes:
  • the corresponding pointing azimuth is determined according to the horizontal angle value and the elevation angle value, and a target sound corresponding to the pointing azimuth among the multiple target sounds is used as the pointing sound.
  • the normalized pointing sound is input into a preset filter to enhance the pointing sound.
  • the method of the embodiment can be realized by means of software plus the necessary general hardware platform, of course, it can also be realized by Over hardware, but in many cases the former is a better implementation.
  • the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product.
  • the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, optical The disk) includes several instructions to make a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods described in the various embodiments of the present application.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

A directional sound pickup method, a recording device and a storage medium, the method comprising: acquiring a plurality of collected target sounds and a directional sound pickup instruction inputted by a user, and performing delay compensation on the plurality of target sounds so that the timing of the plurality of target sounds is the same (S10); and using a target sound among the plurality of target sounds that corresponds to the directional sound pickup instruction as a specified sound, and performing noise reduction on the specified sound (S20). When a recording device picks up sounds, delay compensation is performed on recorded sounds from all directions, then a sound in a user-specified direction is accurately picked up, and noise reduction processing is performed on noise in the picked up sound, thereby reducing the interference of sounds in the non-specified directions on a recording result so that the directional sound pickup of the recording device achieves an optimal effect.

Description

定向拾音方法、录音设备和存储介质 Directional sound pickup method, recording device and storage medium To
本申请要求于2019年6月17日提交中国专利局、申请号为201910524995.4、发明名称为“定向拾音方法、录音设备和存储介质”的中国专利申请的优先权,其全部内容通过引用结合在申请中This application claims the priority of a Chinese patent application filed with the Chinese Patent Office, the application number is 201910524995.4, and the invention title is "directional sound pickup method, recording equipment and storage medium" on June 17, 2019, the entire content of which is incorporated by reference Applying
技术领域Technical field
本申请涉及音频处理技术领域,尤其涉及一种定向拾音方法、录音设备和存储介质。This application relates to the field of audio processing technology, and in particular to a directional sound pickup method, recording device and storage medium.
背景技术Background technique
市场上大部分录音设备,例如录音笔、录音机以及桌面会议记录仪等录音设备已经走进了大众视野。随着生活水平的提高,人们对录音设备的要求越来越高,越来越多的录音设备也具备定向录音这一功能。Most recording equipment on the market, such as voice recorders, tape recorders, and desktop conference recorders have entered the public eye. With the improvement of living standards, people have higher and higher requirements for recording equipment, and more and more recording equipment also have the function of directional recording.
然而,大多数录音设备都是通过用户对录音设备的调整来确定录音方向,再选取与录音方向对应的麦克风所录制的声音数据,并对选取的声音数据进行增强,从而实现指定方向的定向拾音。但是,在实际应用中,录音设备中麦克风的指定方向并不明确,进而或多或少的收录非指定方向的声音,使得录音设备的定向拾音难以达到最佳效果。However, most recording devices determine the recording direction by adjusting the recording device by the user, and then select the sound data recorded by the microphone corresponding to the recording direction, and enhance the selected sound data, so as to achieve directional pickup in the specified direction. sound. However, in practical applications, the designated direction of the microphone in the recording device is not clear, and more or less sounds in non-designated directions are recorded, making it difficult for the directional sound pickup of the recording device to achieve the best results.
发明内容Summary of the invention
本申请的主要目的在于提供了一种定向拾音方法、录音设备和存储介质,旨在解决录音设备的定向拾音功能难以达到预期效果的技术问题。The main purpose of this application is to provide a directional sound pickup method, recording device and storage medium, aiming to solve the technical problem that the directional sound pickup function of the recording device cannot achieve the expected effect.
为实现上述目的,本申请提供了一种定向拾音方法,包括以下步骤:In order to achieve the above objective, this application provides a directional sound pickup method, which includes the following steps:
获取采集到的多个目标声音以及用户输入的定向拾音指令,并对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致;Acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user, and compensating for the time delay of the multiple target sounds, so that the timings of the multiple target sounds are consistent;
将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音,并对所述指向声音进行降噪。The target sound corresponding to the directional sound pickup instruction among the plurality of target sounds is used as a pointing sound, and the noise of the pointing sound is reduced.
可选地,所述对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致的步骤包括:Optionally, the step of compensating the time delay of the multiple target sounds so that the timings of the multiple target sounds are consistent includes:
从采集到的多个目标声音中确定预设基准麦克风采集的声音,并将预设基准麦克风采集的声音作为标准声音,采集到的多个目标声音中除所述标准声音之外的目标声音作为非标准声音;Determine the sound collected by the preset reference microphone from the collected multiple target sounds, and use the sound collected by the preset reference microphone as the standard sound. Among the multiple target sounds collected, the target sound other than the standard sound is used as Non-standard voice
确定所述标准声音的时序,并根据所述标准声音的时序调整所述非标准声音的时序,以使得所述非标准声音的时序与所述标准声音的时序一致。The timing of the standard sound is determined, and the timing of the non-standard sound is adjusted according to the timing of the standard sound, so that the timing of the non-standard sound is consistent with the timing of the standard sound.
可选地,所述获取采集到的多个目标声音以及用户输入的定向拾音指令的步骤包括:Optionally, the step of acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user includes:
根据用户在显示界面上的操作生成对应的定向拾音指令,并根据所述定向拾音指令通过麦克风阵列中的各个麦克风采集声音;Generate a corresponding directional sound pickup instruction according to the user's operation on the display interface, and collect sound through each microphone in the microphone array according to the directional sound pickup instruction;
将所述各个麦克风采集到的声音作为目标声音。Use the sound collected by each microphone as the target sound.
可选地,所述获取采集到的多个目标声音以及用户输入的定向拾音指令的步骤之后,还包括:Optionally, after the step of acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user, the method further includes:
根据用户输入的定向拾音指令确定对应的水平角值以及仰角值,并将所述水平角值以及所述仰角值显示在所述显示界面上;Determine the corresponding horizontal angle value and the elevation angle value according to the directional sound pickup instruction input by the user, and display the horizontal angle value and the elevation angle value on the display interface;
所述将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音的步骤包括:The step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a pointing sound includes:
根据所述水平角值以及所述仰角值确定对应的指向方位,并将所述多个目标声音中与所述指向方位对应的目标声音作为所述指向声音。The corresponding pointing azimuth is determined according to the horizontal angle value and the elevation angle value, and a target sound corresponding to the pointing azimuth among the multiple target sounds is used as the pointing sound.
可选地,所述将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音的步骤之后,还包括:Optionally, after the step of using the target sound corresponding to the directional sound pickup instruction among the multiple target sounds as a pointing sound, the method further includes:
对所述指向声音进行信号采样以及归一化处理;Signal sampling and normalization processing on the pointing sound;
将归一化处理后的指向声音输入至预设滤波器中,以增强所述指向声音。The normalized pointing sound is input into a preset filter to enhance the pointing sound.
可选地,所述对所述指向声音进行降噪的步骤包括:Optionally, the step of reducing the noise of the pointing sound includes:
将所述指向声音输入至预设噪声估计算法中,以识别所述指向声音中的噪声;Input the pointing sound into a preset noise estimation algorithm to identify the noise in the pointing sound;
确定所述噪声的频率、振幅以及相位,并输出与所述噪声对应的反噪声,以对所述指向声音进行降噪,其中所述反噪声为与所述噪声频率相同、振幅相同以及相位相反的声波。Determine the frequency, amplitude and phase of the noise, and output the anti-noise corresponding to the noise to reduce the noise of the pointing sound, wherein the anti-noise is the same frequency, the same amplitude and the opposite phase as the noise Sound waves.
可选地,所述对所述指向声音进行降噪的步骤之后,还包括:Optionally, after the step of reducing the noise of the pointing sound, the method further includes:
将降噪后的指向声音上传至云端服务器,并在获取到用户输入的读取指令后,从云端服务器下载对应的指向声音。Upload the noise-reduced pointing sound to the cloud server, and download the corresponding pointing sound from the cloud server after obtaining the read instruction input by the user.
可选地,所述麦克风阵列的型号为ES7210,所述麦克风阵列的指向性为全向、心形指向、超级心形指向以及8字型指向中的至少一种;Optionally, the model of the microphone array is ES7210, and the directivity of the microphone array is at least one of omnidirectional, cardioid, super cardioid, and figure-8;
预设滤波器为空间滤波器、时域滤波器、听觉滤波器以及维纳滤波器中的至少一种;The preset filter is at least one of a spatial filter, a time domain filter, an auditory filter, and a Wiener filter;
预设噪声估计算法为对数谱幅度最优估计法和/或MCRA(Minima-Controlled-Recursive-Averaging Algorithms,递归平均)算法。The preset noise estimation algorithm is the logarithmic spectrum amplitude optimal estimation method and/or MCRA (Minima-Controlled-Recursive-Averaging Algorithms, recursive average) algorithm.
此外,为实现上述目的,本申请还提供一种录音设备,所述录音设备包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机可读指令,所述计算机可读指令被所述处理器执行时实现如上所述定向拾音方法的步骤。In addition, in order to achieve the above object, the present application also provides a recording device, the recording device comprising: a memory, a processor, and computer readable instructions stored in the memory and running on the processor, the When the computer-readable instructions are executed by the processor, the steps of the directional sound pickup method described above are realized.
此外,为实现上述目的,本申请还提供一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机可读指令,所述计算机可读指令被处理器执行时实现如上所述定向拾音方法的步骤。In addition, in order to achieve the above objective, the present application also provides a computer-readable storage medium having computer-readable instructions stored on the computer-readable storage medium, and when the computer-readable instructions are executed by a processor, the above-mentioned orientation is achieved. The steps of the pickup method.
本申请提供了一种定向拾音方法、录音设备和存储介质,所述定向拾音方法通过获取采集到的多个目标声音以及用户输入的定向拾音指令,并对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致;将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音,并对所述指向声音进行降噪。从而在录音设备拾取声音的过程中,对录制的各个方向的声音进行时延补偿,再精准的拾取用户指定方向的声音,并对拾取到的声音中的噪声进行降噪处理,进而减少非指向方向的声音对录音结果的干扰,使录音设备的定向拾音达到最佳效果。This application provides a directional sound pickup method, a recording device, and a storage medium. The directional sound pickup method acquires multiple collected target sounds and directional sound pickup instructions input by a user, and responds to the multiple target sounds. Delay compensation to make the timings of the multiple target sounds consistent; use the target sound corresponding to the directional sound pickup instruction among the multiple target sounds as the pointing sound, and perform noise reduction on the pointing sound. Therefore, when the recording device picks up the sound, it compensates for the time delay of the recorded sound in all directions, and then accurately picks up the sound in the direction specified by the user, and performs noise reduction processing on the noise in the picked up sound, thereby reducing non-direction The sound of the direction interferes with the recording result, so that the directional sound pickup of the recording equipment achieves the best effect.
附图说明Description of the drawings
图1是本申请实施例方案涉及的硬件运行环境的结构示意图;Figure 1 is a schematic structural diagram of a hardware operating environment involved in a solution of an embodiment of the present application;
图2为本申请定向拾音方法一实施例的流程示意图;2 is a schematic flowchart of an embodiment of a directional sound pickup method according to the present application;
图3为本申请所述对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致的步骤流程细化示意图;FIG. 3 is a detailed schematic diagram of the steps of compensating the time delay of the multiple target sounds to make the sequence of the multiple target sounds consistent according to the application;
图4为本申请定向拾音方法另一实施例的流程示意图。4 is a schematic flowchart of another embodiment of the directional sound pickup method according to the present application.
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics, and advantages of the purpose of this application will be further described in conjunction with the embodiments and with reference to the accompanying drawings.
具体实施方式Detailed ways
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。In order to make the purpose, technical solutions, and advantages of this application clearer, the following further describes this application in detail with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the application, and are not used to limit the application.
如图1所示,图1是本申请实施例方案涉及的硬件运行环境的终端结构示意图。As shown in FIG. 1, FIG. 1 is a schematic diagram of a terminal structure of a hardware operating environment involved in a solution of an embodiment of the present application.
本申请终端是一种装置,该装置可以是录音设备,还可以是服务器、电脑、智能手机、平板电脑、便携计算机等具有存储功能的终端设备。The terminal of this application is a device. The device can be a recording device, or a terminal device with a storage function, such as a server, a computer, a smart phone, a tablet computer, and a portable computer.
如图1所示,该终端可以包括:处理器1001,例如CPU,通信总线1002,用户接口1003,网络接口1004,存储器1005。其中,通信总线1002用于实现这些组件之间的连接通信。用户接口1003可以包括显示屏(Display)、输入单元比如键盘(Keyboard),可选的用户接口1003还可以包括标准的有线接口、无线接口。网络接口1004可选的可以包括标准的有线接口、无线接口(如WI-FI接口)。存储器1005可以是高速RAM存储器,也可以是稳定的存储器(non-volatile memory),例如磁盘存储器。存储器1005可选的还可以是独立于前述处理器1001的存储装置。As shown in FIG. 1, the terminal may include a processor 1001, such as a CPU, a communication bus 1002, a user interface 1003, a network interface 1004, and a memory 1005. Among them, the communication bus 1002 is used to implement connection and communication between these components. The user interface 1003 may include a display screen (Display) and an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface. The network interface 1004 may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface). The memory 1005 can be a high-speed RAM memory or a stable memory (non-volatile memory), such as disk storage. Optionally, the memory 1005 may also be a storage device independent of the foregoing processor 1001.
可选地,终端还可以包括摄像头、Wi-Fi模块等等,在此不再赘述。Optionally, the terminal may also include a camera, a Wi-Fi module, etc., which will not be repeated here.
本领域技术人员可以理解,图1中示出的终端结构并不构成对终端的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Those skilled in the art can understand that the terminal structure shown in FIG. 1 does not constitute a limitation on the terminal, and may include more or fewer components than shown in the figure, or combine some components, or arrange different components.
在图1所示的终端中,网络接口1004主要用于连接后台服务器,与后台服务器进行数据通信;用户接口1003主要包括输入单元比如键盘,键盘包括无线键盘和有线键盘,用于连接客户端,与客户端进行数据通信;而处理器1001可以用于调用存储器1005中存储的计算机可读指令,并执行以下操作:In the terminal shown in FIG. 1, the network interface 1004 is mainly used to connect to a back-end server and perform data communication with the back-end server; the user interface 1003 mainly includes an input unit such as a keyboard. The keyboard includes a wireless keyboard and a wired keyboard to connect to the client. Perform data communication with the client; and the processor 1001 can be used to call computer readable instructions stored in the memory 1005 and perform the following operations:
获取采集到的多个目标声音以及用户输入的定向拾音指令,并对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致;Acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user, and compensating for the time delay of the multiple target sounds, so that the timings of the multiple target sounds are consistent;
将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音,并对所述指向声音进行降噪。The target sound corresponding to the directional sound pickup instruction among the plurality of target sounds is used as a pointing sound, and the noise of the pointing sound is reduced.
进一步地,处理器1001可以调用存储器1005中存储的计算机可读指令,还执行以下操作:Further, the processor 1001 may call computer-readable instructions stored in the memory 1005, and also perform the following operations:
从采集到的多个目标声音中确定预设基准麦克风采集的声音,并将预设基准麦克风采集的声音作为标准声音,采集到的多个目标声音中除所述标准声音之外的目标声音作为非标准声音;Determine the sound collected by the preset reference microphone from the collected multiple target sounds, and use the sound collected by the preset reference microphone as the standard sound. Among the multiple target sounds collected, the target sound other than the standard sound is used as Non-standard voice
确定所述标准声音的时序,并根据所述标准声音的时序调整所述非标准声音的时序,以使得所述非标准声音的时序与所述标准声音的时序一致。The timing of the standard sound is determined, and the timing of the non-standard sound is adjusted according to the timing of the standard sound, so that the timing of the non-standard sound is consistent with the timing of the standard sound.
进一步地,处理器1001可以调用存储器1005中存储的计算机可读指令,还执行以下操作:Further, the processor 1001 may call computer-readable instructions stored in the memory 1005, and also perform the following operations:
根据用户在显示界面上的操作生成对应的定向拾音指令,并根据所述定向拾音指令通过麦克风阵列中的各个麦克风采集声音;Generate a corresponding directional sound pickup instruction according to the user's operation on the display interface, and collect sound through each microphone in the microphone array according to the directional sound pickup instruction;
将所述各个麦克风采集到的声音作为目标声音。Use the sound collected by each microphone as the target sound.
进一步地,处理器1001可以调用存储器1005中存储的计算机可读指令,还执行以下操作:Further, the processor 1001 may call computer-readable instructions stored in the memory 1005, and also perform the following operations:
根据用户输入的定向拾音指令确定对应的水平角值以及仰角值,并将所述水平角值以及所述仰角值显示在所述显示界面上;Determine the corresponding horizontal angle value and the elevation angle value according to the directional sound pickup instruction input by the user, and display the horizontal angle value and the elevation angle value on the display interface;
所述将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音的步骤包括:The step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a pointing sound includes:
根据所述水平角值以及所述仰角值确定对应的指向方位,并将所述多个目标声音中与所述指向方位对应的目标声音作为所述指向声音。The corresponding pointing azimuth is determined according to the horizontal angle value and the elevation angle value, and a target sound corresponding to the pointing azimuth among the multiple target sounds is used as the pointing sound.
进一步地,处理器1001可以调用存储器1005中存储的计算机可读指令,还执行以下操作:Further, the processor 1001 may call computer-readable instructions stored in the memory 1005, and also perform the following operations:
对所述指向声音进行信号采样以及归一化处理;Signal sampling and normalization processing on the pointing sound;
将归一化处理后的指向声音输入至预设滤波器中,以增强所述指向声音。The normalized pointing sound is input into a preset filter to enhance the pointing sound.
进一步地,处理器1001可以调用存储器1005中存储的计算机可读指令,还执行以下操作:Further, the processor 1001 may call computer-readable instructions stored in the memory 1005, and also perform the following operations:
将所述指向声音输入至预设噪声估计算法中,以识别所述指向声音中的噪声;Input the pointing sound into a preset noise estimation algorithm to identify the noise in the pointing sound;
确定所述噪声的频率、振幅以及相位,并输出与所述噪声对应的反噪声,以对所述指向声音进行降噪,其中所述反噪声为与所述噪声频率相同、振幅相同以及相位相反的声波。Determine the frequency, amplitude and phase of the noise, and output the anti-noise corresponding to the noise to reduce the noise of the pointing sound, wherein the anti-noise is the same frequency, the same amplitude and the opposite phase as the noise Sound waves.
进一步地,处理器1001可以调用存储器1005中存储的计算机可读指令,还执行以下操作:Further, the processor 1001 may call computer-readable instructions stored in the memory 1005, and also perform the following operations:
将降噪后的指向声音上传至云端服务器,并在获取到用户输入的读取指令后,从云端服务器下载对应的指向声音。Upload the noise-reduced pointing sound to the cloud server, and download the corresponding pointing sound from the cloud server after obtaining the read instruction input by the user.
本终端的具体实施例与下述定向拾音方法各实施例基本相同,在此不作赘述。The specific embodiment of the terminal is basically the same as the following embodiments of the directional sound pickup method, and will not be repeated here.
请参阅图2,图2为本申请定向拾音方法一实施例的流程示意图,本实施例提供的定向拾音方法包括如下步骤:Please refer to FIG. 2. FIG. 2 is a schematic flowchart of an embodiment of a directional sound pickup method according to the present application. The directional sound pickup method provided in this embodiment includes the following steps:
步骤S10,获取采集到的多个目标声音以及用户输入的定向拾音指令,并对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致;Step S10: Acquire the collected multiple target sounds and the directional sound pickup instruction input by the user, and compensate for the time delay of the multiple target sounds, so that the timings of the multiple target sounds are consistent;
本实施例中的录音设备内置有多个麦克风,获取采集到的多个目标声音以及用户输入的定向拾音指令,所述定向拾音指令为用户输入的确定拾音方位的指令。容易理解的,录音设备中内置的多个麦克风与发声源的位置是不同的,因此采集到的多个目标声音的时序也各不相同。一般来说,距离发声源最近的麦克风,最先接收到声音,距离发声源最远的麦克风,最晚接收到声音,声音延长较长。本实施例中,对多个目标声音进行时延补偿,以使得多个目标声音的时序一致。The recording device in this embodiment has multiple built-in microphones to acquire multiple collected target sounds and a directional sound pickup instruction input by the user. The directional sound pickup instruction is an instruction input by the user to determine the pickup position. It is easy to understand that the positions of the multiple microphones built into the recording device and the sound source are different, so the timing of the multiple target sounds collected is also different. Generally speaking, the microphone closest to the sound source receives the sound first, and the microphone furthest from the sound source receives the sound last, and the sound is longer. In this embodiment, time delay compensation is performed on multiple target sounds, so that the timings of the multiple target sounds are consistent.
步骤S20,将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音,并对所述指向声音进行降噪。Step S20: Use the target sound corresponding to the directional sound pickup instruction among the multiple target sounds as the pointing sound, and perform noise reduction on the pointing sound.
得到时序一致的多个目标声音后,由于所述定向拾音指令包含有方位信息,则从多个目标声音中提取与所述方位信息对应的目标声音,并将所述目标声音作为指向声音,所述指向声音即录音设备收录的与用户指定方向对应的声音。对指向声音进行降噪,以增强录音设备的定向拾音效果。After obtaining multiple target sounds with the same time sequence, since the directional pickup instruction contains azimuth information, extract the target sound corresponding to the azimuth information from the multiple target sounds, and use the target sound as the pointing sound, The pointing sound is the sound recorded by the recording device and corresponding to the direction specified by the user. Noise reduction is performed on pointing sound to enhance the directional sound pickup effect of recording equipment.
本实施例提供了一种定向拾音方法,通过获取采集到的多个目标声音以及用户输入的定向拾音指令,并对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致;将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音,并对所述指向声音进行降噪。从而在录音设备拾取声音的过程中,对录制的各个方向的声音进行时延补偿,再精准的拾取用户指定方向的声音,并对拾取到的声音中的噪声进行降噪处理,进而减少非指向方向的声音对录音结果的干扰,使录音设备的定向拾音达到最佳效果。This embodiment provides a directional sound pickup method, which acquires a plurality of collected target sounds and a directional sound pickup instruction input by a user, and compensates for the delay of the plurality of target sounds, so that the plurality of target sounds The time sequence of is consistent; the target sound corresponding to the directional sound pickup instruction among the multiple target sounds is used as the pointing sound, and the noise of the pointing sound is reduced. Therefore, when the recording device picks up the sound, it compensates for the time delay of the recorded sound in all directions, and then accurately picks up the sound in the direction specified by the user, and performs noise reduction processing on the noise in the picked up sound, thereby reducing non-direction The sound of the direction interferes with the recording result, so that the directional sound pickup of the recording equipment achieves the best effect.
进一步地,请参阅图3,图3为本申请所述对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致的步骤流程细化示意图,所述对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致的步骤包括:Further, please refer to FIG. 3, which is a detailed schematic diagram of the steps of compensating the time delay of the multiple target sounds to make the sequence of the multiple target sounds consistent according to the application. The step of compensating the time delay of a target sound so that the sequence of the multiple target sounds is consistent includes:
步骤S11,从采集到的多个目标声音中确定预设基准麦克风采集的声音,并将预设基准麦克风采集的声音作为标准声音,采集到的多个目标声音中除所述标准声音之外的目标声音作为非标准声音;Step S11: Determine the sound collected by the preset reference microphone from the collected multiple target sounds, and use the sound collected by the preset reference microphone as the standard sound. Among the multiple collected target sounds, excluding the standard sound Target sound as non-standard sound;
步骤S12,确定所述标准声音的时序,并根据所述标准声音的时序调整所述非标准声音的时序,以使得所述非标准声音的时序与所述标准声音的时序一致。Step S12: Determine the timing of the standard sound, and adjust the timing of the non-standard sound according to the timing of the standard sound, so that the timing of the non-standard sound is consistent with the timing of the standard sound.
如上所述,录音设备中包括有多个麦克风,各个麦克风录制的声音在时序上存在不同。在本实施例中,预先设置一个麦克风为基准麦克风,并将预设基准麦克风采集的声音作为标准声音,那么在对目标声音进行时延补偿这一过程中,以录音设备收录的标准声音的时序作为标准;将采集到的多个目标声音中除所述标准声音之外的目标声音作为非标准声音,也就是说,在对目标声音进行时延补偿这一过程中,只需要对非标准声音进行时序上的调整。As described above, the recording device includes multiple microphones, and the sound recorded by each microphone is different in timing. In this embodiment, a microphone is preset as the reference microphone, and the sound collected by the preset reference microphone is used as the standard sound, then in the process of delay compensation for the target sound, the timing of the standard sound recorded by the recording device As a standard; the target sounds other than the standard sound among the collected multiple target sounds are regarded as non-standard sounds, that is, in the process of delay compensation for the target sound, only the non-standard sounds are needed. Make timing adjustments.
将多个目标声音划分为标准声音以及非标准声音后,确定所述标准声音的时序,并以所述标准声音的时序作为基准,调整非标准声音的时序,以使得非标准声音的时序与标准声音的时序一致,从而使得获取的多个目标声音的时序一致,实现对录制声音的时延补偿。本实施例通过上述方式,对目标声音进行时延补偿,从而减少非指向方向的声音对录音结果的干扰。After the multiple target sounds are divided into standard sounds and non-standard sounds, the timing of the standard sounds is determined, and the timing of the standard sounds is used as a reference to adjust the timing of the non-standard sounds so that the timing of the non-standard sounds is consistent with the standard The sound sequence is consistent, so that the sequence of the acquired multiple target sounds is consistent, and the time delay compensation for the recorded sound is realized. In this embodiment, the target sound is compensated for the time delay in the above-mentioned manner, thereby reducing the interference of non-directional sound on the recording result.
进一步地,所述获取采集到的多个目标声音以及用户输入的定向拾音指令的步骤包括:Further, the step of acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user includes:
步骤S13,根据用户在显示界面上的操作生成对应的定向拾音指令,并根据所述定向拾音指令通过麦克风阵列中的各个麦克风采集声音;Step S13: Generate a corresponding directional sound pickup instruction according to the user's operation on the display interface, and collect sound through each microphone in the microphone array according to the directional sound pickup instruction;
步骤S14,将所述各个麦克风采集到的声音作为目标声音。Step S14: Use the sound collected by each microphone as the target sound.
特别的,根据用户在显示界面上的操作生成对应的定向拾音指令,容易理解的是,用户也可以通过其他方式生成对应的定向拾音指令,例如,用户可以通过调整终端的方位,来产生对应的定向拾音指令;用户也可以通过直接输入方位信息,来产生对应的定向拾音指令,本实施例在此不做限制。在获取到用户输入的定向拾音指令后,根据所述定向拾音指令通过麦克风阵列中的各个麦克风采集声音。可选地,为了达到全方位的拾音效果,麦克风阵列中麦克风的组成形式为6+1模式,即6个麦克风围绕中间一个麦克风形成闭环;上述麦克风阵列的型号为ES7210,上述麦克风阵列的指向性为全向、心形指向、超级心形指向以及8字型指向中的至少一种。这样一来,全方位的收集录音设备所处环境的声音,并将各个麦克风采集到的声音作为目标声音。In particular, the corresponding directional pickup instruction is generated according to the user's operation on the display interface. It is easy to understand that the user can also generate the corresponding directional pickup instruction in other ways. For example, the user can generate the corresponding directional pickup instruction by adjusting the position of the terminal. Corresponding directional sound pickup instructions; the user can also directly input location information to generate corresponding directional sound pickup instructions, which is not limited in this embodiment. After acquiring the directional sound pickup instruction input by the user, the sound is collected through each microphone in the microphone array according to the directional sound pickup instruction. Optionally, in order to achieve a omnidirectional sound pickup effect, the microphone array in the microphone array is composed of a 6+1 mode, that is, 6 microphones surround the middle microphone to form a closed loop; the model of the microphone array is ES7210, and the direction of the microphone array Sex is at least one of omnidirectional, cardioid, super cardioid, and figure-eight. In this way, the sound of the environment where the recording device is located is collected in all directions, and the sound collected by each microphone is used as the target sound.
本实施例通过上述方式,全方位的收集录音设备周围的声音,并将其作为目标声音,进而保障用户可以做到360度指定方向进行拾音。In this embodiment, through the above method, the sound around the recording device is collected in an all-round manner and used as the target sound, thereby ensuring that the user can pick up the sound in a specified direction of 360 degrees.
进一步地,上述步骤S10获取采集到的多个目标声音以及用户输入的定向拾音指令之后,还包括:Further, after acquiring the multiple target sounds collected and the directional sound pickup instruction input by the user in the above step S10, it further includes:
步骤S30,根据用户输入的定向拾音指令确定对应的水平角值以及仰角值,并将所述水平角值以及所述仰角值显示在所述显示界面上;Step S30: Determine the corresponding horizontal angle value and the elevation angle value according to the directional sound pickup instruction input by the user, and display the horizontal angle value and the elevation angle value on the display interface;
所述将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音的步骤包括:The step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a pointing sound includes:
步骤S21,根据所述水平角值以及所述仰角值确定对应的指向方位,并将所述多个目标声音中与所述指向方位对应的目标声音作为所述指向声音。Step S21: Determine the corresponding pointing azimuth according to the horizontal angle value and the elevation angle value, and use the target sound corresponding to the pointing azimuth among the multiple target sounds as the pointing sound.
用户输入的定向拾音指令中包括有用户确定拾音方向的方位信息,所述定向拾音指令中包含有水平角值以及仰角值,为了让用户确定方位信息的正确性,方便用户判断录音设备定向拾音的方向是否符合预期,将水平角值以及仰角值显示在显示界面上。通过这种方式,如若用户发现此时录音设备拾取声音的方向不符合预期,或用户想更改拾音方向,都可通过重新输入定向拾音指令实现。The directional sound pickup instruction input by the user includes the position information for the user to determine the pickup direction, and the directional sound pickup instruction includes the horizontal angle value and the elevation angle value. In order to allow the user to determine the correctness of the position information, it is convenient for the user to judge the recording equipment Whether the direction of directional pickup meets expectations, display the horizontal angle and elevation angle on the display interface. In this way, if the user finds that the direction of the sound picked up by the recording device at this time does not meet expectations, or the user wants to change the direction of the sound pickup, it can be realized by re-inputting the directional sound pickup instruction.
容易理解的是,可以根据水平角值以及仰角值确定用户意欲进行拾音的方向,并将上述方向作为指向方向。由于一个目标声音是由一个麦克风所采集到的,而各个麦克风在录音设备上的位置为固定的,因此可以根据水平角值以及仰角值确定录音设备上对应的麦克风,并将所述麦克风录制的目标声音作为指向声音。本实施例通过上述方式,根据定向拾音指令确定录音设备上对应的麦克风,并将所述麦克风录制的声音作为指向声音,精准的实现定向拾音。It is easy to understand that the direction in which the user intends to pick up the sound can be determined according to the horizontal angle value and the elevation angle value, and the above direction is regarded as the pointing direction. Since a target sound is collected by a microphone, and the position of each microphone on the recording device is fixed, the corresponding microphone on the recording device can be determined according to the horizontal angle value and the elevation angle value, and the microphone is recorded The target sound serves as the pointing sound. In this embodiment, in the above manner, the corresponding microphone on the recording device is determined according to the directional sound pickup instruction, and the sound recorded by the microphone is used as the directional sound to accurately realize the directional sound pickup.
进一步地,上述步骤S20将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音之后,还包括:Further, after the above step S20 uses the target sound corresponding to the directional sound pickup instruction among the multiple target sounds as the pointing sound, the method further includes:
步骤S40,对所述指向声音进行信号采样以及归一化处理;Step S40, performing signal sampling and normalization processing on the pointing sound;
步骤S50,将归一化处理后的指向声音输入至预设滤波器中,以增强所述指向声音。Step S50: Input the normalized pointing sound into a preset filter to enhance the pointing sound.
确定指向声音后,对指向声音进行信号采样,上述信号采样,即按照一定的时间间隔在声音数据上逐点采集其瞬时值,从而实现声音的离散化。特别的,对指向声音进行信号采样后,还可以对离散后的声音进行傅里叶变换,并根据使用频段的不同,对不同的声音数据进行加权计算,并将离散的多个声音数据进行归一化处理。对指向声音进行归一化处理后,进一步地,将归一化处理后的指向声音输入至预设滤波器中,以增强所述指向声音,提高指向声音在混响环境下的表现,特别的,预设滤波器为空间滤波器、时域滤波器、听觉滤波器以及维纳滤波器中的至少一种。After determining the pointing sound, signal sampling is performed on the pointing sound. The above-mentioned signal sampling is to collect the instantaneous value of the sound data point by point according to a certain time interval, thereby realizing the discretization of the sound. In particular, after sampling the pointing sound, the discrete sound can also be Fourier transformed, and different sound data can be weighted according to the different frequency bands used, and multiple discrete sound data can be classified One treatment. After normalizing the pointing sound, further input the normalized pointing sound into a preset filter to enhance the pointing sound and improve the performance of the pointing sound in a reverberant environment. , The preset filter is at least one of a spatial filter, a time domain filter, an auditory filter, and a Wiener filter.
本实施例通过上述方式,增强指向方向的声音,进而提高定向拾音的效果。In this embodiment, through the above method, the sound directed in the direction is enhanced, thereby improving the effect of directional sound pickup.
进一步地,所述对所述指向声音进行降噪的步骤包括:Further, the step of reducing the noise of the pointing sound includes:
步骤S22,将所述指向声音输入至预设噪声估计算法中,以识别所述指向声音中的噪声;Step S22, input the pointing sound into a preset noise estimation algorithm to identify the noise in the pointing sound;
步骤S23,确定所述噪声的频率、振幅以及相位,并输出与所述噪声对应的反噪声,以对所述指向声音进行降噪,其中所述反噪声为与所述噪声频率相同、振幅相同以及相位相反的声波。Step S23: Determine the frequency, amplitude and phase of the noise, and output the anti-noise corresponding to the noise to reduce the noise of the pointing sound, wherein the anti-noise is the same frequency and amplitude as the noise And sound waves of opposite phase.
本实施例中,还预设有噪声估计算法,预设噪声估计算法为对数谱幅度最优估计法和/或MCRA算法,使用预设噪声估计算法对所述指向声音进行噪声估计,以识别出指向声音中的噪声,并确定所述噪声的频率、振幅以及相位。再向所述指向声音输入与噪声声波频率相同、振幅相同以及相位相反的反噪声声波实现与噪声的相位抵消,从而对指向声音进行降噪,减少指向声音中的噪声源对录音结果的干扰,使录音设备的定向拾音达到最佳效果。In this embodiment, a noise estimation algorithm is also preset. The preset noise estimation algorithm is the logarithmic spectrum amplitude optimal estimation method and/or the MCRA algorithm. The preset noise estimation algorithm is used to perform noise estimation on the pointing sound to identify Point out the noise in the sound, and determine the frequency, amplitude and phase of the noise. Then input the anti-noise sound wave with the same frequency, the same amplitude and the opposite phase to the pointing sound to achieve the phase cancellation with the noise, thereby reducing the noise of the pointing sound and reducing the interference of the noise source in the pointing sound to the recording result. Make the directional pickup of recording equipment achieve the best effect.
进一步地,请参阅图4,图4为本申请定向拾音方法另一实施例的流程示意图,上述步骤S20对所述指向声音进行降噪之后,还包括:Further, please refer to FIG. 4, which is a schematic flowchart of another embodiment of the directional sound pickup method according to the present application. After the above step S20 performs noise reduction on the pointing sound, it further includes:
步骤S60,将降噪后的指向声音上传至云端服务器,并在获取到用户输入的读取指令后,从云端服务器下载对应的指向声音。In step S60, the noise-reduced pointing sound is uploaded to the cloud server, and after the reading instruction input by the user is obtained, the corresponding pointing sound is downloaded from the cloud server.
实现对指向声音的增强和降噪后,容易理解的是,此时的指向声音最为理想。本实施例中,还具备指向声音的自动备份功能,将降噪后的指向声音上传至云端服务器,以便用户能随时提取指向声音,相应的,在获取到用户输入的读取指令后,从云端服务器下载并播放对应的指向声音。通过上述方式,实现对指向声音的备份,从而满足用户更多的需求。After realizing the enhancement and noise reduction of the pointing sound, it is easy to understand that the pointing sound at this time is the most ideal. In this embodiment, there is also an automatic backup function of pointing sound, and the noise-reduced pointing sound is uploaded to the cloud server, so that the user can extract the pointing sound at any time. The server downloads and plays the corresponding pointing sound. Through the above method, the backup of the pointing sound is realized, thereby satisfying more needs of users.
进一步地,上述麦克风阵列的型号为ES7210,上述麦克风阵列的指向性为全向、心形指向、超级心形指向以及8字型指向中的至少一种;Further, the model of the microphone array is ES7210, and the directivity of the microphone array is at least one of omnidirectional, cardioid, super cardioid, and figure-8;
预设滤波器为空间滤波器、时域滤波器、听觉滤波器以及维纳滤波器中的至少一种;The preset filter is at least one of a spatial filter, a time domain filter, an auditory filter, and a Wiener filter;
预设噪声估计算法为对数谱幅度最优估计法和/或MCRA(Minima-Controlled-Recursive-Averaging Algorithms,递归平均)算法。The preset noise estimation algorithm is the logarithmic spectrum amplitude optimal estimation method and/or MCRA (Minima-Controlled-Recursive-Averaging Algorithms, recursive average) algorithm.
此外,本申请实施例还提出一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机可读指令,所述计算机可读指令被处理器执行时实现如下操作:In addition, an embodiment of the present application also proposes a computer-readable storage medium having computer-readable instructions stored thereon, and when the computer-readable instructions are executed by a processor, the following operations are implemented:
获取采集到的多个目标声音以及用户输入的定向拾音指令,并对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致;Acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user, and compensating for the time delay of the multiple target sounds, so that the timings of the multiple target sounds are consistent;
将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音,并对所述指向声音进行降噪。The target sound corresponding to the directional sound pickup instruction among the plurality of target sounds is used as a pointing sound, and the noise of the pointing sound is reduced.
进一步地,所述计算机可读指令被处理器执行时还实现如下操作:Further, when the computer-readable instructions are executed by the processor, the following operations are also implemented:
从采集到的多个目标声音中确定预设基准麦克风采集的声音,并将预设基准麦克风采集的声音作为标准声音,采集到的多个目标声音中除所述标准声音之外的目标声音作为非标准声音;Determine the sound collected by the preset reference microphone from the collected multiple target sounds, and use the sound collected by the preset reference microphone as the standard sound. Among the multiple target sounds collected, the target sound other than the standard sound is used as Non-standard voice
确定所述标准声音的时序,并根据所述标准声音的时序调整所述非标准声音的时序,以使得所述非标准声音的时序与所述标准声音的时序一致。The timing of the standard sound is determined, and the timing of the non-standard sound is adjusted according to the timing of the standard sound, so that the timing of the non-standard sound is consistent with the timing of the standard sound.
进一步地,所述计算机可读指令被处理器执行时还实现如下操作:Further, when the computer-readable instructions are executed by the processor, the following operations are also implemented:
根据用户在显示界面上的操作生成对应的定向拾音指令,并根据所述定向拾音指令通过麦克风阵列中的各个麦克风采集声音;Generate a corresponding directional sound pickup instruction according to the user's operation on the display interface, and collect sound through each microphone in the microphone array according to the directional sound pickup instruction;
将所述各个麦克风采集到的声音作为目标声音。Use the sound collected by each microphone as the target sound.
进一步地,所述计算机可读指令被处理器执行时还实现如下操作:Further, when the computer-readable instructions are executed by the processor, the following operations are also implemented:
根据用户输入的定向拾音指令确定对应的水平角值以及仰角值,并将所述水平角值以及所述仰角值显示在所述显示界面上;Determine the corresponding horizontal angle value and the elevation angle value according to the directional sound pickup instruction input by the user, and display the horizontal angle value and the elevation angle value on the display interface;
所述将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音的步骤包括:The step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a pointing sound includes:
根据所述水平角值以及所述仰角值确定对应的指向方位,并将所述多个目标声音中与所述指向方位对应的目标声音作为所述指向声音。The corresponding pointing azimuth is determined according to the horizontal angle value and the elevation angle value, and a target sound corresponding to the pointing azimuth among the multiple target sounds is used as the pointing sound.
进一步地,所述计算机可读指令被处理器执行时还实现如下操作:Further, when the computer-readable instructions are executed by the processor, the following operations are also implemented:
对所述指向声音进行信号采样以及归一化处理;Signal sampling and normalization processing on the pointing sound;
将归一化处理后的指向声音输入至预设滤波器中,以增强所述指向声音。The normalized pointing sound is input into a preset filter to enhance the pointing sound.
进一步地,所述计算机可读指令被处理器执行时还实现如下操作:Further, when the computer-readable instructions are executed by the processor, the following operations are also implemented:
将所述指向声音输入至预设噪声估计算法中,以识别所述指向声音中的噪声;Input the pointing sound into a preset noise estimation algorithm to identify the noise in the pointing sound;
确定所述噪声的频率、振幅以及相位,并输出与所述噪声对应的反噪声,以对所述指向声音进行降噪,其中所述反噪声为与所述噪声频率相同、振幅相同以及相位相反的声波。Determine the frequency, amplitude and phase of the noise, and output the anti-noise corresponding to the noise to reduce the noise of the pointing sound, wherein the anti-noise is the same frequency, the same amplitude and the opposite phase as the noise Sound waves.
进一步地,所述计算机可读指令被处理器执行时还实现如下操作:Further, when the computer-readable instructions are executed by the processor, the following operations are also implemented:
将降噪后的指向声音上传至云端服务器,并在获取到用户输入的读取指令后,从云端服务器下载对应的指向声音。Upload the noise-reduced pointing sound to the cloud server, and download the corresponding pointing sound from the cloud server after obtaining the read instruction input by the user.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者***不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者***所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者***中还存在另外的相同要素。It should be noted that in this article, the terms "include", "include" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article or system including a series of elements not only includes those elements, It also includes other elements that are not explicitly listed, or elements inherent to the process, method, article, or system. If there are no more restrictions, the element defined by the sentence "including a..." does not exclude the existence of other identical elements in the process, method, article or system that includes the element.
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the foregoing embodiments of the present application are only for description, and do not represent the advantages and disadvantages of the embodiments.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述 实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通 过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体 现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光 盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand the above The method of the embodiment can be realized by means of software plus the necessary general hardware platform, of course, it can also be realized by Over hardware, but in many cases the former is a better implementation. Based on this understanding, the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product. Now, the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, optical The disk) includes several instructions to make a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods described in the various embodiments of the present application.
以上仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above are only preferred embodiments of this application, and do not limit the scope of this application. Any equivalent structure or equivalent process transformation made using the content of the description and drawings of this application, or directly or indirectly used in other related technical fields , The same reason is included in the scope of patent protection of this application.

Claims (20)

  1. 一种定向拾音方法,其中,所述方法包括以下步骤: A directional sound pickup method, wherein the method includes the following steps:
    获取采集到的多个目标声音以及用户输入的定向拾音指令,并对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致;Acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user, and compensating for the time delay of the multiple target sounds, so that the timings of the multiple target sounds are consistent;
    将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音,并对所述指向声音进行降噪。The target sound corresponding to the directional sound pickup instruction among the plurality of target sounds is used as a pointing sound, and the noise of the pointing sound is reduced.
  2. 如权利要求1所述的定向拾音方法,其中,所述对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致的步骤包括:The directional sound pickup method according to claim 1, wherein the step of compensating the time delay of the multiple target sounds so that the timings of the multiple target sounds are consistent comprises:
    从采集到的多个目标声音中确定预设基准麦克风采集的声音,并将预设基准麦克风采集的声音作为标准声音,采集到的多个目标声音中除所述标准声音之外的目标声音作为非标准声音;Determine the sound collected by the preset reference microphone from the collected multiple target sounds, and use the sound collected by the preset reference microphone as the standard sound. Among the multiple target sounds collected, the target sound other than the standard sound is used as Non-standard voice
    确定所述标准声音的时序,并根据所述标准声音的时序调整所述非标准声音的时序,以使得所述非标准声音的时序与所述标准声音的时序一致。The timing of the standard sound is determined, and the timing of the non-standard sound is adjusted according to the timing of the standard sound, so that the timing of the non-standard sound is consistent with the timing of the standard sound.
  3. 如权利要求1所述的定向拾音方法,其中,所述获取采集到的多个目标声音以及用户输入的定向拾音指令的步骤包括:5. The directional sound pickup method according to claim 1, wherein the step of acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user comprises:
    根据用户在显示界面上的操作生成对应的定向拾音指令,并根据所述定向拾音指令通过麦克风阵列中的各个麦克风采集声音;Generate a corresponding directional sound pickup instruction according to the user's operation on the display interface, and collect sound through each microphone in the microphone array according to the directional sound pickup instruction;
    将所述各个麦克风采集到的声音作为目标声音。Use the sound collected by each microphone as the target sound.
  4. 如权利要求1所述的定向拾音方法,其中,所述获取采集到的多个目标声音以及用户输入的定向拾音指令的步骤之后,还包括:5. The directional sound pickup method according to claim 1, wherein after the step of acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user, the method further comprises:
    根据用户输入的定向拾音指令确定对应的水平角值以及仰角值,并将所述水平角值以及所述仰角值显示在所述显示界面上;Determine the corresponding horizontal angle value and the elevation angle value according to the directional sound pickup instruction input by the user, and display the horizontal angle value and the elevation angle value on the display interface;
    所述将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音的步骤包括:The step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a pointing sound includes:
    根据所述水平角值以及所述仰角值确定对应的指向方位,并将所述多个目标声音中与所述指向方位对应的目标声音作为所述指向声音。The corresponding pointing azimuth is determined according to the horizontal angle value and the elevation angle value, and a target sound corresponding to the pointing azimuth among the multiple target sounds is used as the pointing sound.
  5. 如权利要求1所述的定向拾音方法,其中,所述将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音的步骤之后,还包括:5. The directional sound pickup method according to claim 1, wherein after the step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a directional sound, the method further comprises:
    对所述指向声音进行信号采样以及归一化处理;Signal sampling and normalization processing on the pointing sound;
    将归一化处理后的指向声音输入至预设滤波器中,以增强所述指向声音。The normalized pointing sound is input into a preset filter to enhance the pointing sound.
  6. 如权利要求1所述的定向拾音方法,其中,所述对所述指向声音进行降噪的步骤包括:The directional sound pickup method according to claim 1, wherein the step of reducing the noise of the directional sound comprises:
    将所述指向声音输入至预设噪声估计算法中,以识别所述指向声音中的噪声;Input the pointing sound into a preset noise estimation algorithm to identify the noise in the pointing sound;
    确定所述噪声的频率、振幅以及相位,并输出与所述噪声对应的反噪声,以对所述指向声音进行降噪,其中所述反噪声为与所述噪声频率相同、振幅相同以及相位相反的声波。Determine the frequency, amplitude and phase of the noise, and output the anti-noise corresponding to the noise to reduce the noise of the pointing sound, wherein the anti-noise is the same frequency, the same amplitude and the opposite phase as the noise Sound waves.
  7. 如权利要求1所述的定向拾音方法,其中,所述对所述指向声音进行降噪的步骤之后,还包括:5. The directional sound pickup method according to claim 1, wherein after the step of reducing the noise of the pointing sound, the method further comprises:
    将降噪后的指向声音上传至云端服务器,并在获取到用户输入的读取指令后,从云端服务器下载对应的指向声音。Upload the noise-reduced pointing sound to the cloud server, and download the corresponding pointing sound from the cloud server after obtaining the read instruction input by the user.
  8. 如权利要求1所述的定向拾音方法,其中,所述麦克风阵列的型号为ES7210,所述麦克风阵列的指向性为全向、心形指向、超级心形指向以及8字型指向中的至少一种;The directional sound pickup method of claim 1, wherein the model of the microphone array is ES7210, and the directivity of the microphone array is at least one of omnidirectional, cardioid direction, super cardioid direction, and figure-8 One kind
    预设滤波器为空间滤波器、时域滤波器、听觉滤波器以及维纳滤波器中的至少一种;The preset filter is at least one of a spatial filter, a time domain filter, an auditory filter, and a Wiener filter;
    预设噪声估计算法为对数谱幅度最优估计法和/或MCRA(Minima-Controlled-Recursive-Averaging Algorithms,递归平均)算法。The preset noise estimation algorithm is the logarithmic spectrum amplitude optimal estimation method and/or MCRA (Minima-Controlled-Recursive-Averaging Algorithms, recursive average) algorithm.
  9. 一种录音设备,其中,所述录音设备包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机可读指令,其中所述计算机可读指令被所述处理器执行时,实现如下步骤:A recording device, wherein the recording device includes: a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor, wherein the computer-readable instructions are processed by the processor When the device is executed, the following steps are implemented:
    获取采集到的多个目标声音以及用户输入的定向拾音指令,并对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致;Acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user, and compensating for the time delay of the multiple target sounds, so that the timings of the multiple target sounds are consistent;
    将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音,并对所述指向声音进行降噪。The target sound corresponding to the directional sound pickup instruction among the plurality of target sounds is used as a pointing sound, and the noise of the pointing sound is reduced.
  10. 如权利要求9所述的录音设备,其中,所述对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致的步骤包括:9. The recording device according to claim 9, wherein the step of compensating for the time delay of the multiple target sounds so that the timings of the multiple target sounds are consistent comprises:
    从采集到的多个目标声音中确定预设基准麦克风采集的声音,并将预设基准麦克风采集的声音作为标准声音,采集到的多个目标声音中除所述标准声音之外的目标声音作为非标准声音;Determine the sound collected by the preset reference microphone from the collected multiple target sounds, and use the sound collected by the preset reference microphone as the standard sound. Among the multiple target sounds collected, the target sound other than the standard sound is used as Non-standard voice
    确定所述标准声音的时序,并根据所述标准声音的时序调整所述非标准声音的时序,以使得所述非标准声音的时序与所述标准声音的时序一致。The timing of the standard sound is determined, and the timing of the non-standard sound is adjusted according to the timing of the standard sound, so that the timing of the non-standard sound is consistent with the timing of the standard sound.
  11. 如权利要求10所述的录音设备,其中,所述获取采集到的多个目标声音以及用户输入的定向拾音指令的步骤包括:9. The recording device of claim 10, wherein the step of acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user comprises:
    根据用户在显示界面上的操作生成对应的定向拾音指令,并根据所述定向拾音指令通过麦克风阵列中的各个麦克风采集声音;Generate a corresponding directional sound pickup instruction according to the user's operation on the display interface, and collect sound through each microphone in the microphone array according to the directional sound pickup instruction;
    将所述各个麦克风采集到的声音作为目标声音。Use the sound collected by each microphone as the target sound.
  12. 如权利要求11所述的录音设备,其中,所述获取采集到的多个目标声音以及用户输入的定向拾音指令的步骤之后,还包括:11. The recording device according to claim 11, wherein after the step of obtaining the collected multiple target sounds and the directional sound pickup instruction input by the user, the method further comprises:
    根据用户输入的定向拾音指令确定对应的水平角值以及仰角值,并将所述水平角值以及所述仰角值显示在所述显示界面上;Determine the corresponding horizontal angle value and the elevation angle value according to the directional sound pickup instruction input by the user, and display the horizontal angle value and the elevation angle value on the display interface;
    所述将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音的步骤包括:The step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a pointing sound includes:
    根据所述水平角值以及所述仰角值确定对应的指向方位,并将所述多个目标声音中与所述指向方位对应的目标声音作为所述指向声音。The corresponding pointing azimuth is determined according to the horizontal angle value and the elevation angle value, and a target sound corresponding to the pointing azimuth among the multiple target sounds is used as the pointing sound.
  13. 如权利要求12所述的录音设备,其中,所述将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音的步骤之后,还包括:The recording device according to claim 12, wherein after the step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as the directed sound, the method further comprises:
    对所述指向声音进行信号采样以及归一化处理;Signal sampling and normalization processing on the pointing sound;
    将归一化处理后的指向声音输入至预设滤波器中,以增强所述指向声音。The normalized pointing sound is input into a preset filter to enhance the pointing sound.
  14. 如权利要求13所述的录音设备,其中,所述对所述指向声音进行降噪的步骤包括:The recording device according to claim 13, wherein the step of reducing the noise of the directed sound comprises:
    将所述指向声音输入至预设噪声估计算法中,以识别所述指向声音中的噪声;Input the pointing sound into a preset noise estimation algorithm to identify the noise in the pointing sound;
    确定所述噪声的频率、振幅以及相位,并输出与所述噪声对应的反噪声,以对所述指向声音进行降噪,其中所述反噪声为与所述噪声频率相同、振幅相同以及相位相反的声波。Determine the frequency, amplitude and phase of the noise, and output the anti-noise corresponding to the noise to reduce the noise of the pointing sound, wherein the anti-noise is the same frequency, the same amplitude and the opposite phase as the noise Sound waves.
  15. 如权利要求14所述的录音设备,其中,所述对所述指向声音进行降噪的步骤之后,还包括:The recording device according to claim 14, wherein after the step of reducing the noise of the pointing sound, it further comprises:
    将降噪后的指向声音上传至云端服务器,并在获取到用户输入的读取指令后,从云端服务器下载对应的指向声音。Upload the noise-reduced pointing sound to the cloud server, and download the corresponding pointing sound from the cloud server after obtaining the read instruction input by the user.
  16. 一种存储介质,其中,所述存储介质上存储有计算机可读指令,所述计算机可读指令被处理器执行时实现以下步骤:A storage medium, wherein computer-readable instructions are stored on the storage medium, and the following steps are implemented when the computer-readable instructions are executed by a processor:
    获取采集到的多个目标声音以及用户输入的定向拾音指令,并对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致;Acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user, and compensating for the time delay of the multiple target sounds, so that the timings of the multiple target sounds are consistent;
    将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音,并对所述指向声音进行降噪。The target sound corresponding to the directional sound pickup instruction among the plurality of target sounds is used as a pointing sound, and the noise of the pointing sound is reduced.
  17. 如权利要求16所述的存储介质,其中,所述对所述多个目标声音时延补偿,以使得所述多个目标声音的时序一致的步骤包括:The storage medium according to claim 16, wherein the step of compensating the time delay of the multiple target sounds so that the timings of the multiple target sounds are consistent comprises:
    从采集到的多个目标声音中确定预设基准麦克风采集的声音,并将预设基准麦克风采集的声音作为标准声音,采集到的多个目标声音中除所述标准声音之外的目标声音作为非标准声音;Determine the sound collected by the preset reference microphone from the collected multiple target sounds, and use the sound collected by the preset reference microphone as the standard sound. Among the multiple target sounds collected, the target sound other than the standard sound is used as Non-standard voice
    确定所述标准声音的时序,并根据所述标准声音的时序调整所述非标准声音的时序,以使得所述非标准声音的时序与所述标准声音的时序一致。The timing of the standard sound is determined, and the timing of the non-standard sound is adjusted according to the timing of the standard sound, so that the timing of the non-standard sound is consistent with the timing of the standard sound.
  18. 如权利要求17所述的存储介质,其中,所述获取采集到的多个目标声音以及用户输入的定向拾音指令的步骤包括:17. The storage medium of claim 17, wherein the step of acquiring the collected multiple target sounds and the directional sound pickup instruction input by the user comprises:
    根据用户在显示界面上的操作生成对应的定向拾音指令,并根据所述定向拾音指令通过麦克风阵列中的各个麦克风采集声音;Generate a corresponding directional sound pickup instruction according to the user's operation on the display interface, and collect sound through each microphone in the microphone array according to the directional sound pickup instruction;
    将所述各个麦克风采集到的声音作为目标声音。Use the sound collected by each microphone as the target sound.
  19. 如权利要求18所述的存储介质,其中,所述获取采集到的多个目标声音以及用户输入的定向拾音指令的步骤之后,还包括:18. The storage medium of claim 18, wherein after the step of acquiring the multiple collected target sounds and the directional sound pickup instruction input by the user, the method further comprises:
    根据用户输入的定向拾音指令确定对应的水平角值以及仰角值,并将所述水平角值以及所述仰角值显示在所述显示界面上;Determine the corresponding horizontal angle value and the elevation angle value according to the directional sound pickup instruction input by the user, and display the horizontal angle value and the elevation angle value on the display interface;
    所述将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音的步骤包括:The step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a pointing sound includes:
    根据所述水平角值以及所述仰角值确定对应的指向方位,并将所述多个目标声音中与所述指向方位对应的目标声音作为所述指向声音。The corresponding pointing azimuth is determined according to the horizontal angle value and the elevation angle value, and a target sound corresponding to the pointing azimuth among the multiple target sounds is used as the pointing sound.
  20. 如权利要求19所述的存储介质,其中,所述将所述多个目标声音中与所述定向拾音指令对应的目标声音作为指向声音的步骤之后,还包括:22. The storage medium of claim 19, wherein after the step of using the target sound corresponding to the directional sound pickup instruction among the plurality of target sounds as a pointing sound, the method further comprises:
    对所述指向声音进行信号采样以及归一化处理;Signal sampling and normalization processing on the pointing sound;
    将归一化处理后的指向声音输入至预设滤波器中,以增强所述指向声音。 The normalized pointing sound is input into a preset filter to enhance the pointing sound.
PCT/CN2019/101711 2019-06-17 2019-08-21 Directional sound pickup method, recording device and storage medium WO2020252886A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910524995.4 2019-06-17
CN201910524995.4A CN110197671A (en) 2019-06-17 2019-06-17 Orient sound pick-up method, sound pick-up outfit and storage medium

Publications (1)

Publication Number Publication Date
WO2020252886A1 true WO2020252886A1 (en) 2020-12-24

Family

ID=67754676

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/101711 WO2020252886A1 (en) 2019-06-17 2019-08-21 Directional sound pickup method, recording device and storage medium

Country Status (2)

Country Link
CN (1) CN110197671A (en)
WO (1) WO2020252886A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110769318A (en) * 2019-09-30 2020-02-07 视联动力信息技术股份有限公司 Control method and device for video network recording equipment
CN112992169B (en) * 2019-12-12 2024-06-11 华为技术有限公司 Voice signal acquisition method and device, electronic equipment and storage medium
CN113068101B (en) * 2021-03-16 2023-06-09 Oppo广东移动通信有限公司 Ring array pickup control method and device, storage medium and ring array
CN113038329A (en) * 2021-03-22 2021-06-25 深圳市科奈信科技有限公司 Earphone radio control method, device and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102509552A (en) * 2011-10-21 2012-06-20 浙江大学 Method for enhancing microphone array voice based on combined inhibition
US20150281853A1 (en) * 2011-07-11 2015-10-01 SoundFest, Inc. Systems and methods for enhancing targeted audibility
CN107123421A (en) * 2017-04-11 2017-09-01 广东美的制冷设备有限公司 Sound control method, device and home appliance
CN108962272A (en) * 2018-06-21 2018-12-07 湖南优浪语音科技有限公司 Sound pick-up method and system
CN109788382A (en) * 2019-01-25 2019-05-21 深圳大学 A kind of distribution microphone array pickup system and method

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101447190A (en) * 2008-06-25 2009-06-03 北京大学深圳研究生院 Voice enhancement method employing combination of nesting-subarray-based post filtering and spectrum-subtraction
CN102324237B (en) * 2011-05-30 2013-01-02 深圳市华新微声学技术有限公司 Microphone-array speech-beam forming method as well as speech-signal processing device and system
CN102831898B (en) * 2012-08-31 2013-11-13 厦门大学 Microphone array voice enhancement device with sound source direction tracking function and method thereof
CN102969002B (en) * 2012-11-28 2014-09-03 厦门大学 Microphone array speech enhancement device capable of suppressing mobile noise
CN103916723B (en) * 2013-01-08 2018-08-10 联想(北京)有限公司 A kind of sound collection method and a kind of electronic equipment
CN104378570A (en) * 2014-09-28 2015-02-25 小米科技有限责任公司 Sound recording method and device
CN105679329B (en) * 2016-02-04 2019-08-06 厦门大学 It is suitable for the microphone array speech enhancement device of strong background noise
CN106603878B (en) * 2016-12-09 2019-09-06 奇酷互联网络科技(深圳)有限公司 Voice localization method, device and system
CN107481729A (en) * 2017-09-13 2017-12-15 百度在线网络技术(北京)有限公司 A kind of method and system that intelligent terminal is upgraded to far field speech-sound intelligent equipment
US10089998B1 (en) * 2018-01-15 2018-10-02 Advanced Micro Devices, Inc. Method and apparatus for processing audio signals in a multi-microphone system
CN108269582B (en) * 2018-01-24 2021-06-01 厦门美图之家科技有限公司 Directional pickup method based on double-microphone array and computing equipment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150281853A1 (en) * 2011-07-11 2015-10-01 SoundFest, Inc. Systems and methods for enhancing targeted audibility
CN102509552A (en) * 2011-10-21 2012-06-20 浙江大学 Method for enhancing microphone array voice based on combined inhibition
CN107123421A (en) * 2017-04-11 2017-09-01 广东美的制冷设备有限公司 Sound control method, device and home appliance
CN108962272A (en) * 2018-06-21 2018-12-07 湖南优浪语音科技有限公司 Sound pick-up method and system
CN109788382A (en) * 2019-01-25 2019-05-21 深圳大学 A kind of distribution microphone array pickup system and method

Also Published As

Publication number Publication date
CN110197671A (en) 2019-09-03

Similar Documents

Publication Publication Date Title
WO2020252886A1 (en) Directional sound pickup method, recording device and storage medium
WO2020141824A2 (en) Processing method of audio signal and electronic device supporting the same
WO2017078361A1 (en) Electronic device and method for recognizing speech
WO2018008885A1 (en) Image processing device, operation method of image processing device, and computer-readable recording medium
WO2017111319A1 (en) Electronic device and method for controlling operation of electronic device
WO2020057227A1 (en) Television sound adjustment method, television set, and storage medium
WO2018205413A1 (en) Audio volume adjustment method and terminal, and computer-readable storage medium
WO2021025517A1 (en) Electronic device with audio zoom and operating method thereof
WO2019045474A1 (en) Method and device for processing audio signal using audio filter having non-linear characteristics
WO2016169413A1 (en) Echo cancellation data synchronization control method, terminal, and storage medium
WO2019156338A1 (en) Method for acquiring noise-refined voice signal, and electronic device for performing same
EP3162085A1 (en) Operating method for microphones and electronic device supporting the same
WO2018038381A1 (en) Portable device for controlling external device, and audio signal processing method therefor
WO2019156339A1 (en) Apparatus and method for generating audio signal with noise attenuated on basis of phase change rate according to change in frequency of audio signal
WO2020124845A1 (en) Video parameter adjustment method and apparatus based on voiceprint recognition, and readable storage medium
WO2021060680A1 (en) Methods and systems for recording mixed audio signal and reproducing directional audio
WO2013162329A1 (en) Apparatus and method for outputting audio
WO2014021670A1 (en) Mobile apparatus and control method thereof
EP3906704A1 (en) Sound outputting device including plurality of microphones and method for processing sound signal using plurality of microphones
WO2018217059A1 (en) Method and electronic device for managing loudness of audio signal
WO2016117793A1 (en) Speech enhancement method and system
WO2017090793A1 (en) Wearable device, wearable device system and method for controlling wearable device
WO2018186656A1 (en) Audio signal processing method and device
EP3583783A1 (en) Method and apparatus for in-room low-frequency sound power optimization
WO2020076013A1 (en) Mobile platform based active noise cancellation (anc)

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19933919

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE