CN111010649A - Sound pickup and microphone array - Google Patents

Sound pickup and microphone array Download PDF

Info

Publication number
CN111010649A
CN111010649A CN201811168320.2A CN201811168320A CN111010649A CN 111010649 A CN111010649 A CN 111010649A CN 201811168320 A CN201811168320 A CN 201811168320A CN 111010649 A CN111010649 A CN 111010649A
Authority
CN
China
Prior art keywords
distance
microphone
microphones
array
range
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811168320.2A
Other languages
Chinese (zh)
Inventor
余涛
王一行
冯津伟
银鞍
李新国
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201811168320.2A priority Critical patent/CN111010649A/en
Publication of CN111010649A publication Critical patent/CN111010649A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/40Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
    • H04R1/406Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2201/00Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
    • H04R2201/40Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
    • H04R2201/403Linear arrays of transducers

Landscapes

  • Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The invention discloses a sound pickup and a microphone array. Wherein, this adapter includes: the microphone array is approximately linearly distributed, the microphone array comprises an omnidirectional microphone and a directional microphone, and the sound pick-up is in a strip shape. The invention solves the technical problems that the microphone in the related art has poor pickup effect and is easy to be interfered by noise.

Description

Sound pickup and microphone array
Technical Field
The invention relates to the field of audio signal processing, in particular to a sound pickup and a microphone array.
Background
In the related art, a gooseneck directional microphone is generally adopted in a conference room, and when a speaker speaks at a position right in front of the microphone and close to the microphone, the microphone can effectively pick up sound; if the speaker deviates from the microphone, the speaker leans back and moves laterally to cause the microphone to be incapable of picking up sound effectively, so that the situation that the voice transcription system transcribes and loses words is caused, and the user experience is influenced.
Meanwhile, the gooseneck directional microphone adopts a longer pull rod, so that the microphone can be conveniently closer to a speaker to speak. However, in the way of physically enhancing sound pickup, there are many problems in practical use, 1, the pull rod is easy to bend, so that the microphone cannot normally pick up sound, and even noise and interference sound are picked up; 2, the long pull rod is very abrupt in vision, which affects the field conference communication and brings visual interference to the display screen in the paperless conference system.
Therefore, the microphone in the related art has a problem that the sound pickup effect is not good, causing interference.
In view of the above problems, no effective solution has been proposed.
Disclosure of Invention
The embodiment of the invention provides a sound pickup and a microphone array, which at least solve the technical problems that a microphone in the related art has poor sound pickup effect and is easily interfered by noise.
According to an aspect of an embodiment of the present invention, there is provided a sound pickup including: the sound pickup includes: the microphone array is approximately linearly distributed, the microphone array comprises an omnidirectional microphone and a directional microphone, and the sound pickup is in a long strip shape.
Optionally, omni-directional microphones are arranged at the edges and in the middle of the microphone array, and directional microphones are arranged at other positions of the microphone array.
Optionally, the omni-directional microphone disposed at the middle of the microphone array is located at a predetermined distance range of a vertical direction perpendicular to the array direction.
Optionally, in the microphone array, the omnidirectional microphones are symmetrically distributed, and the directional microphones are symmetrically distributed.
Optionally, the symmetric distribution comprises: centrosymmetry, mirror symmetry, and rotational symmetry.
Optionally, the position of the directional microphone is within a predetermined range of the center of the microphone matrix.
Optionally, the longest length of the microphone array is determined depending on the application scenario.
Optionally, the microphone array is obtained by stretching a standard microphone array, where the standard microphone array includes four microphones, two microphones at two ends of the edge and two microphones in the middle are omnidirectional microphones, and the rest microphones include directional microphones.
Optionally, the distance in the array direction between the two microphones at the two ends of the outermost edge is a first distance, the distance in the vertical direction of the array direction is zero, the distance in the array direction between the two microphones at the two secondary edges is a second distance, the distance in the vertical direction of the array direction is zero, the distance in the array direction between the two microphones at the third edges of the two sides is a third distance, the distance in the vertical direction of the array direction is zero, the distance in the array direction between the two microphones in the middle is a fourth distance, and the distance in the vertical direction of the array direction is a fifth distance.
Optionally, the first distance supports fluctuation within a first range, the second distance supports fluctuation within a second range, the third distance supports fluctuation within a third range, the fourth distance supports fluctuation within a fourth range, and the fifth distance supports fluctuation within a fifth range.
Optionally, the first range includes: 31 to 40 centimeters, the second range comprising: 21 to 30 centimeters, the third range comprising: 11 to 20 centimeters, the fourth range comprising: 1 to 10 centimeters, the fifth range comprising: 1 to 5 cm.
According to another aspect of the present invention, a microphone array is provided, which includes omnidirectional microphones and directional microphones, and the microphone array is substantially linearly distributed.
In the embodiment of the invention, the microphone array which is approximately linearly distributed is adopted to form the strip-shaped sound pick-up, and the purpose of improving the sound pick-up effect is achieved through the microphone array which comprises the omnidirectional microphone and the directional microphone, so that the technical effects of good sound pick-up effect and interference avoidance are realized, and the technical problems that the sound pick-up effect of the microphone in the related technology is poor and the microphone is easily interfered by noise are solved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the invention without limiting the invention. In the drawings:
fig. 1 illustrates a block diagram of a hardware configuration of a computer terminal for implementing a sound source localization method of a microphone array;
fig. 2 is a schematic structural diagram of a sound pickup according to a first embodiment of the present invention;
fig. 3 is a schematic diagram of a microphone array arrangement according to an embodiment of the invention;
FIG. 4 is a schematic diagram of near field sound propagation according to an embodiment of the present invention;
fig. 5 is a block diagram of a computer terminal according to an embodiment of the present invention.
Detailed Description
In order to make the technical solutions of the present invention better understood, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the invention described herein are capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
First, some terms or terms appearing in the description of the embodiments of the present application are applicable to the following explanations:
a directional microphone for picking up only the speaker's voice; the pickup radius is small, the pickup can be used by a single person, the pickup is placed beside the mouth, the tone quality is good, and the sound which is far away can not be picked up;
the omnidirectional microphone has a large pickup radius, can pick up the sound of a person, has high sensitivity, and can pick up more environmental noises due to the large pickup radius.
Example 1
According to the sound pickup provided by the embodiment of the present invention, a sound source can be localized using a sound source localization method of a microphone array, which can be performed in a mobile terminal, a computer terminal, or a similar arithmetic device. Fig. 1 shows a hardware configuration block diagram of a computer terminal (or mobile device) for implementing a sound source localization method of a microphone array. As shown in fig. 1, the computer terminal 10 (or mobile device 10) may include one or more (shown as 102a, 102b, … …, 102 n) processors 102 (the processors 102 may include, but are not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA), and memory 104 for storing data. Besides, the method can also comprise the following steps: a transmission module, a display, an input/output interface (I/O interface), a Universal Serial Bus (USB) port (which may be included as one of the ports of the I/O interface), a network interface, a power source, and/or a camera. It will be understood by those skilled in the art that the structure shown in fig. 1 is only an illustration and is not intended to limit the structure of the electronic device. For example, the computer terminal 10 may also include more or fewer components than shown in FIG. 1, or have a different configuration than shown in FIG. 1.
It should be noted that the one or more processors 102 and/or other data processing circuitry described above may be referred to generally herein as "data processing circuitry". The data processing circuitry may be embodied in whole or in part in software, hardware, firmware, or any combination thereof. Further, the data processing circuit may be a single stand-alone processing module, or incorporated in whole or in part into any of the other elements in the computer terminal 10 (or mobile device). As referred to in the embodiments of the application, the data processing circuit acts as a processor control (e.g. selection of a variable resistance termination path connected to the interface).
The memory 104 may be used to store software programs and modules of application software, such as program instructions/data storage devices corresponding to the method in the embodiment of the present invention, and the processor 102 executes various functional applications and data processing by running the software programs and modules stored in the memory 104, that is, implementing the vulnerability detection method of the application program. The memory 104 may include high speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 104 may further include memory located remotely from the processor 102, which may be connected to the computer terminal 10 via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The transmission module is used for receiving or sending data through a network. Specific examples of the network described above may include a wireless network provided by a communication provider of the computer terminal 10. In one example, the transmission module includes a Network adapter (NIC) that can be connected to other Network devices through a base station to communicate with the internet. In one example, the transmission module may be a Radio Frequency (RF) module, which is used for communicating with the internet in a wireless manner.
The display may be, for example, a touch screen type Liquid Crystal Display (LCD) that may enable a user to interact with a user interface of the computer terminal 10 (or mobile device).
It should be noted here that in some alternative embodiments, the computer device (or mobile device) shown in fig. 1 described above may include hardware elements (including circuitry), software elements (including computer code stored on a computer-readable medium), or a combination of both hardware and software elements. It should be noted that fig. 1 is only one example of a particular specific example and is intended to illustrate the types of components that may be present in the computer device (or mobile device) described above.
In the above-described operating environment, the present application provides a sound pickup as shown in fig. 2. Fig. 2 is a schematic structural diagram of a sound pickup according to a first embodiment of the present invention. As shown in fig. 2, the sound pickup includes: the microphone array is approximately linearly distributed, the microphone array comprises an omnidirectional microphone and a directional microphone, and the sound pick-up is in a strip shape.
It should be noted that the above-mentioned approximately linear distribution is not a linear distribution in a planar view, and the microphone array is a linear distribution (because the array is two-dimensional in a planar view, and the microphone array is two-dimensional in nature); the microphone array is arranged on the sound pick-up, the shape of the sound pick-up is a strip, and the microphone array is also a strip in the view of the whole shape of the sound pick-up, so that the microphone array is approximately linearly distributed.
The substantially linear distribution means that the microphone array is not an absolute linear distribution, but a linear distribution within a certain error range of the absolute linear distribution, and the microphones may not be completely aligned. For example, it may be a regular curve, or a symmetrical curved distribution, etc.
In addition, the shape of the sound pickup described herein is a long strip, which means a long strip in the general meaning of the present invention, and may be a rectangle in the meaning of planar vision, or a cuboid or a cylinder in the sense of stereoscopic vision. For example, in the case of a sound pickup, a cross section along the length direction thereof is rectangular, that is, rectangular in a planar visual sense. While the other side of the microphone may be substantially circular in cross-section (e.g. perpendicular to the length direction), but the microphone may be seen as elongated in shape as it gives a direct visual perception of the length direction as a whole.
The omnidirectional microphone can ensure a better and comprehensive sound pickup function, and the directional microphone has a better directional function, so that the microphone array has two functions of directional pickup and sound pickup at the same time, the conference playback function requires low time delay, the directional microphone has an anti-noise function, and compared with a mode of processing by the omnidirectional microphone and an algorithm, the directional microphone is adopted while the directional microphone is adopted, so that the time delay caused by the algorithm processing can be reduced.
The mode that the microphone array that adopts roughly to be linear distribution forms rectangular shape adapter through the microphone array including omnidirectional microphone and directional microphone, has reached the purpose that improves the pickup effect to not only the pickup effect is good has been realized, has avoided the technological effect of interference moreover, and then has solved the microphone among the correlation technique and has had the pickup effect not good, easily receives noise jamming's technical problem. Not only satisfied traditional adapter and can carry out voice conversation, the function of on-the-spot playback, can effectively improve intelligent speech recognition's the degree of accuracy moreover.
As an alternative embodiment, since the omni-directional microphone can pick up sound in a wide range and has high sound pickup sensitivity, the omni-directional microphone may be disposed at the edge and the middle of the microphone array and the directional microphone may be disposed at other positions of the microphone array.
As an alternative embodiment, the omni-directional microphone disposed at the center of the microphone array is located at a predetermined distance range in the vertical direction perpendicular to the array direction. For example, for a certain size of array (e.g. an array of 8 microphones), the distance between the centrally located omnidirectional microphone and the vertical direction perpendicular to the array direction may be in the range of 2 to 4 cm, preferably 3 cm.
As an alternative embodiment, in the microphone array, the omnidirectional microphones are symmetrically distributed, and the directional microphones are symmetrically distributed. By adopting the processing, the sound collecting effect of the sound collector is more uniform and soft, and sharp sound and the like do not occur.
It should be noted that, when the omnidirectional microphones are symmetrically distributed, and the directional microphones are symmetrically distributed, the symmetric distribution may be of various types, for example, the symmetric distribution includes: centrosymmetry, mirror symmetry, and rotational symmetry. For example, the central symmetry may be a central symmetry about a center of the microphone array, the mirror symmetry may be a plane symmetry about a center line of the microphone array, and the rotational symmetry may be a rotational symmetry about a rotation axis of the microphone array.
In order to make the position of the directional microphones lying directly in front of the array better covered, the position of the directional microphones may be located within a predetermined range in the centre of the microphone matrix.
As an alternative embodiment, the predetermined range referred to above may refer to a predetermined planar range in a two-dimensional concept, or may refer to a predetermined stereoscopic range in a three-dimensional sense. For example, the pointing microphone may be formed approximately along a predetermined rectangular range formed by a first length in the array direction and a second length in the vertical array direction, a predetermined circular range, and other regular pattern ranges. For example, the pointing microphone may be a rectangular area approximately 0.5m or so in the array direction and 1.2m perpendicular to the array direction. For another example, the pointing microphone may be formed within a predetermined cube approximately along a third length in the array direction, a fourth length perpendicular to the array direction, and a fifth length in a direction perpendicular to both the array direction and the direction perpendicular to the array direction. Since the microphone is generally provided on the surface of the sound pickup, the predetermined range mentioned above generally refers to a two-dimensional predetermined planar range.
It should be noted that the longest length of the microphone array is determined according to the application scenario. The longest lengths corresponding to different application scenarios may be different, for example, when the application scenario is larger, the longest length of the microphone array may be designed to be proportionally longer, thus making the whole microphone array larger; in the application scenario, the longest length of the microphone array may be designed to be shorter in proportion to the ground, so that the whole microphone array is designed to be smaller. Namely, the longest length of the microphone array can be flexibly changed according to application scenes so as to adapt to the requirements of different scenes.
As an alternative embodiment, the microphone array is obtained by scaling according to a standard microphone array, the standard microphone array comprises four microphones respectively at the left and the right, two microphones at the two ends of the extreme edge and two microphones in the middle are omnidirectional microphones, and the rest microphones comprise directional microphones.
As an alternative embodiment, the standard microphone array may be designed, for example, such that the distance in the array direction between two microphones at two ends of the outermost edge is a first distance, the distance in the vertical direction of the array direction is zero, the distance in the array direction between two microphones at two edges of the second edge is a second distance, the distance in the vertical direction of the array direction is zero, the distance in the array direction between two microphones at two edges of the third edge is a third distance, the distance in the vertical direction of the array direction is zero, the distance in the array direction between two microphones in the middle is a fourth distance, and the distance in the vertical direction of the array direction is a fifth distance. It should be noted that the above-described microphone array is described only from the lateral direction and the vertical direction which are relatively simple standards, and of course, the microphone array may also be described from the lateral direction and the vertical direction which are not standards, for example, the arrangement of the microphones may be described from two random directions which are symmetrical to the center of the microphone matrix.
In addition, the first distance, the second distance, the third distance, the fourth distance, and the fifth distance are not fixed, but may fluctuate within a certain range, for example, the first distance may support fluctuation within a first range, the second distance may support fluctuation within a second range, the third distance may support fluctuation within a third range, the fourth distance may support fluctuation within a fourth range, and the fifth distance may support fluctuation within a fifth range. The first range, the second range, the third range, the fourth range and the fifth range of the fluctuation may be the same or different, depending on the specific scene requirements.
As an alternative embodiment, the first range may include: 31 to 40 cm, and the second range may include: 21 to 30 cm, and the third range may include: 11 to 20 cm, and the fourth range may include: 1 to 10 cm, and the fifth range may include: 1 to 5 cm. It should be noted that the above ranges are only examples, and other allowable ranges also belong to the implementation of the embodiments of the present invention.
The examples given above are exemplified in conjunction with the examples described above. Fig. 3 is a schematic view of an arrangement of a microphone array according to an embodiment of the present invention, as shown in fig. 3, 8 microphones are arranged from left to right, four microphones on the left and four microphones on the right are symmetrical about a center, the horizontal distances of the four microphones on the left and four microphones on the right are 35cm, 29cm, 11cm and 5cm in sequence, and the vertical distance of the two microphones in the middle is 3 cm.
Two microphones at two ends and the middle part are set to be omnidirectional microphones, and the other microphones are omnidirectional and directional microphones which can be replaced.
It should be noted that the longest distance between the two end microphones can be changed with the application scene, other microphones are scaled up and down, the second microphone on the left side can fluctuate by 2cm from side to side in the current configuration, the third and fourth microphones on the left side can fluctuate by 0.5cm from side to side, and the distance between the front and the back of the fourth microphone is more than 3 cm.
The strip-shaped microphone array sound pick-up can support effective sound pick-up in a middle far field range. The size and the arrangement design of the microphone array of the pickup can achieve the following good effects: (1) the design of the array can effectively support the distance estimation (main array side) in the range of 1.2m and the horizontal direction estimation of 360 degrees. (2) The array can better support direct or indirect dereverberation and noise reduction algorithm design. (3) The array microphones are distributed in a roughly linear mode, and the strip array appearance is designed conveniently.
It should be noted that the microphone arrays of the sound pickup provided in the above embodiments and preferred embodiments can perform sound source localization by using a sound source localization method generally used in the related art when localizing a sound source. For example, the following localization method may be adopted to perform sound source localization in cooperation with the above-described microphone array:
fig. 4 is a schematic diagram of near-field sound propagation according to an embodiment of the present invention, as shown in fig. 4, sound source propagation in the near field differs from that in the far field, and a time difference τ of a sound source signal reaching each array microphone varies not only with an angle but also with a distance compared with the far field. Setting the distance from the target speaker to each microphone as R1,R2,...,RN-1,RNThe propagation speed of sound wave in air is C, the time difference of sound wave reaching the ith microphone relative to the 1 st microphone is
Figure BDA0001821780100000081
The method adopts a classic broadband MUSIC algorithm to carry out near-field sound source localization:
1. first, a covariance matrix of the data is obtained: r (f) ═ E [ x (f)H]And x (f) is data in the frequency band f after short-time fourier transform of the signal received by the microphone, where E is a mathematical expectation and may be replaced by an average value. Then decomposing the characteristic value of R (f), wherein R (f) is Us(f)∑sUs(f)H+UN(f)∑NUN(f)HWherein U iss(f) The eigenvectors corresponding to the large eigenvalues form a signal subspace, UN(f) The feature vectors corresponding to the small features form a noise subspace.
2. Decomposing to obtain Us(f) Then, the response calculation formula of the target signal in the two-dimensional space is as follows:
Figure BDA0001821780100000082
where a (R, θ, f) can be obtained from the relative time difference τ.
3. The two-dimensional coordinate of the target speaker is (Rtarget, theta target) ═ argmax(R,θ)SR,θ
The placement position of the omnidirectional microphone is also determined according to a specific scene, for example, when the microphone array is designed under the condition that the distance between two ends is fixed (35cm), and under the reverberation condition of a conference room, through a simulation experiment, a small positioning error exists in a range of 1.2 m.
According to simulation experiments, a sound source is placed at a position about 0.8m in front of a microphone array, and according to the microphone array arrangement and a positioning spatial response diagram obtained by randomly selecting two groups of arrangements for comparison, the microphone array has obvious and more prominent response at a position with a distance of 0.8, the random arrangement peak value is not obvious, and the random arrangement peak value deviates from 0.8 m.
It should be noted that, for simplicity of description, the above-mentioned method embodiments are described as a series of acts or combination of acts, but those skilled in the art will recognize that the present invention is not limited by the order of acts, as some steps may occur in other orders or concurrently in accordance with the invention. Further, those skilled in the art should also appreciate that the embodiments described in the specification are preferred embodiments and that the acts and modules referred to are not necessarily required by the invention.
Through the above description of the embodiments, those skilled in the art can clearly understand that the method according to the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but the former is a better implementation mode in many cases. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal device (such as a mobile phone, a computer, a server, or a network device) to execute the method according to the embodiments of the present invention.
Example 2
According to the embodiment of the invention, a microphone array is also provided, wherein the microphone array comprises an omnidirectional microphone and a directional microphone, and the microphone array is approximately linearly distributed.
As an alternative embodiment, omnidirectional microphones are arranged at the edges and in the middle of the microphone array, and directional microphones are arranged at other positions of the microphone array.
As an alternative embodiment, the omni-directional microphone disposed in the middle of the microphone array is located at a predetermined distance range in the vertical direction perpendicular to the array direction.
As an alternative embodiment, in the microphone array, the omnidirectional microphones are symmetrically distributed, and the directional microphones are symmetrically distributed.
As an alternative embodiment, the symmetric distribution includes: centrosymmetry, mirror symmetry, and rotational symmetry.
As an alternative embodiment, the position of the directional microphone is located within a predetermined range of the center of the microphone matrix.
As an alternative embodiment, the longest length of the microphone array is determined depending on the application scenario.
As an alternative embodiment, the microphone array is obtained by scaling according to a standard microphone array, the standard microphone array includes four microphones respectively at the left and right, two microphones at the two ends of the edge and two microphones in the middle are omnidirectional microphones, and the rest of the microphones include directional microphones.
As an alternative embodiment, the array direction distance between the two microphones at the two ends of the outermost edge is a first distance, the distance in the vertical direction of the array direction is zero, the array direction distance between the two microphones at the two secondary edges is a second distance, the distance in the vertical direction of the array direction is zero, the array direction distance between the two microphones at the third edges is a third distance, the distance in the vertical direction of the array direction is zero, the array direction distance between the two microphones at the middle is a fourth distance, and the distance in the vertical direction of the array direction is a fifth distance.
As an alternative embodiment, the first distance supports fluctuations within a first range, the second distance supports fluctuations within a second range, the third distance supports fluctuations within a third range, the fourth distance supports fluctuations within a fourth range, and the fifth distance supports fluctuations within a fifth range.
As an alternative embodiment, the first range includes: 31 to 40 cm, said second range comprising: 21 to 30 cm, said third range comprising: 11 to 20 cm, said fourth range comprising: 1 to 10 cm, said fifth range comprising: 1 to 5 cm.
Example 3
The embodiment of the invention can provide a computer terminal which can be any computer terminal device in a computer terminal group. Optionally, in this embodiment, the computer terminal may also be replaced with a terminal device such as a mobile terminal.
Optionally, in this embodiment, the computer terminal may be located in at least one network device of a plurality of network devices of a computer network.
In this embodiment, the computer terminal may perform the steps of the sound source localization method of an application program in cooperation with a microphone array.
Alternatively, fig. 5 is a block diagram of a computer terminal according to an embodiment of the present invention. As shown in fig. 5, the computer terminal 500 may include: one or more processors 502 (only one of which is shown), memory 504, and peripheral interfaces, among others.
The memory may be configured to store software programs and modules, such as program instructions/modules corresponding to the sound source localization method and apparatus in conjunction with a microphone array in the embodiment of the present invention, and the processor executes various functional applications and data processing by running the software programs and modules stored in the memory, that is, the sound source localization method in conjunction with a microphone array described above is implemented. The memory may include high speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memories may further include a memory located remotely from the processor, which may be connected to the terminal 500 via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The processor may invoke the information stored in the memory and the application program via the transmission means to perform the steps in the sound source localization method in conjunction with the microphone array. The technical problems that a microphone in the related art is poor in pickup effect and easy to be interfered by noise are solved.
It can be understood by those skilled in the art that the structure shown in fig. 5 is only an illustration, and the computer terminal may also be a terminal device such as a smart phone (e.g., an Android phone, an iOS phone, etc.), a tablet computer, a palmtop computer, a Mobile Internet Device (MID), a PAD, and the like. Fig. 5 is a diagram illustrating a structure of the electronic device. For example, computer terminal 500 may also include more or fewer components (e.g., network interfaces, display devices, etc.) than shown in FIG. 5, or have a different configuration than shown in FIG. 5.
Those skilled in the art will appreciate that all or part of the steps in the methods of the above embodiments may be implemented by a program instructing hardware associated with the terminal device, where the program may be stored in a computer-readable storage medium, and the storage medium may include: flash disks, Read-Only memories (ROMs), Random Access Memories (RAMs), magnetic or optical disks, and the like.
Example 4
The embodiment of the invention also provides a storage medium. Alternatively, in this embodiment, the storage medium may be used to store the program code executed by the sound source localization method according to the microphone array provided in the first embodiment.
Optionally, in this embodiment, the storage medium may be located in any one of computer terminals in a computer terminal group in a computer network, or in any one of mobile terminals in a mobile terminal group.
Alternatively, in the present embodiment, the storage medium is configured to store program codes for performing steps in a sound source localization method incorporating a microphone array.
The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments.
In the above embodiments of the present invention, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
In the embodiments provided in the present application, it should be understood that the disclosed technology can be implemented in other ways. The above-described embodiments of the apparatus are merely illustrative, and for example, a division of a unit is merely a division of a logic function, and an actual implementation may have another division, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, units or modules, and may be in an electrical or other form.
Units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic or optical disk, and other various media capable of storing program codes.
The foregoing is only a preferred embodiment of the present invention, and it should be noted that, for those skilled in the art, various modifications and decorations can be made without departing from the principle of the present invention, and these modifications and decorations should also be regarded as the protection scope of the present invention.

Claims (12)

1. A sound pickup, comprising: the sound pickup includes: the microphone array is approximately linearly distributed, the microphone array comprises an omnidirectional microphone and a directional microphone, and the sound pickup is in a long strip shape.
2. The pickup of claim 1 wherein omni-directional microphones are located at the edges and in the middle of the microphone array and directional microphones are located elsewhere in the microphone array.
3. The pickup of claim 2, wherein the omni-directional microphone disposed in the middle of the microphone array is located at a predetermined distance range in a vertical direction perpendicular to an array direction.
4. The pickup of claim 1 wherein the omnidirectional microphones are symmetrically distributed and the directional microphones are symmetrically distributed in the microphone array.
5. The pickup of claim 4, wherein the symmetrical distribution comprises: centrosymmetry, mirror symmetry, and rotational symmetry.
6. The pickup of claim 1 wherein the location of the directional microphone is within a predetermined range of the center of the microphone matrix.
7. The pickup of claim 1, wherein the longest length of the microphone array is determined by the application scenario.
8. The pickup of claim 1 wherein the microphone array is scaled from a standard microphone array, the standard microphone array including four microphones at each of the left and right sides, the two microphones at the two ends of the edge and the two microphones in the middle being omni-directional microphones, the remaining microphones including directional microphones.
9. The pickup of claim 8, wherein the array direction distance between the two microphones at the two ends of the edge is a first distance, the distance in the vertical direction of the array direction is zero, the array direction distance between the two microphones at the two secondary edges is a second distance, the distance in the vertical direction of the array direction is zero, the array direction distance between the two microphones at the third edges is a third distance, the distance in the vertical direction of the array direction is zero, the array direction distance between the two microphones at the middle is a fourth distance, and the distance in the vertical direction of the array direction is a fifth distance.
10. The pickup of claim 9 wherein the first distance supports fluctuation within a first range, the second distance supports fluctuation within a second range, the third distance supports fluctuation within a third range, the fourth distance supports fluctuation within a fourth range, and the fifth distance supports fluctuation within a fifth range.
11. The pickup of claim 10, wherein the first range comprises: 31 to 40 centimeters, the second range comprising: 21 to 30 centimeters, the third range comprising: 11 to 20 centimeters, the fourth range comprising: 1 to 10 centimeters, the fifth range comprising: 1 to 5 cm.
12. A microphone array comprises omnidirectional microphones and directional microphones, and the microphone array is approximately linearly distributed.
CN201811168320.2A 2018-10-08 2018-10-08 Sound pickup and microphone array Pending CN111010649A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811168320.2A CN111010649A (en) 2018-10-08 2018-10-08 Sound pickup and microphone array

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811168320.2A CN111010649A (en) 2018-10-08 2018-10-08 Sound pickup and microphone array

Publications (1)

Publication Number Publication Date
CN111010649A true CN111010649A (en) 2020-04-14

Family

ID=70111086

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811168320.2A Pending CN111010649A (en) 2018-10-08 2018-10-08 Sound pickup and microphone array

Country Status (1)

Country Link
CN (1) CN111010649A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115052232A (en) * 2022-08-12 2022-09-13 杭州海康威视数字技术股份有限公司 Pickup
CN115249476A (en) * 2022-07-15 2022-10-28 北京市燃气集团有限责任公司 Intelligent linkage gas cooker based on voice recognition and intelligent linkage method

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS59149494A (en) * 1983-02-16 1984-08-27 Matsushita Electric Ind Co Ltd Microphone device
US4703506A (en) * 1985-07-23 1987-10-27 Victor Company Of Japan, Ltd. Directional microphone apparatus
JPH03106299A (en) * 1989-09-20 1991-05-02 Sanyo Electric Co Ltd Microphone device
JPH048097A (en) * 1990-04-26 1992-01-13 Matsushita Electric Ind Co Ltd Magnetic recording and reproducing device
JPH0481199A (en) * 1990-07-23 1992-03-13 Matsushita Electric Ind Co Ltd Stereophonic microphone
US7174022B1 (en) * 2002-11-15 2007-02-06 Fortemedia, Inc. Small array microphone for beam-forming and noise suppression
CN101447190A (en) * 2008-06-25 2009-06-03 北京大学深圳研究生院 Voice enhancement method employing combination of nesting-subarray-based post filtering and spectrum-subtraction
CN205249484U (en) * 2015-12-30 2016-05-18 临境声学科技江苏有限公司 Microphone linear array reinforcing directive property adapter
CN106019232A (en) * 2016-05-11 2016-10-12 北京地平线信息技术有限公司 Sound source positioning system and method
US20170034616A1 (en) * 2015-07-27 2017-02-02 Kabushiki Kaisha Audio-Technica Microphone and microphone apparatus
CN106679651A (en) * 2017-02-08 2017-05-17 北京地平线信息技术有限公司 Sound localization method and device and electronic equipment
CN107064878A (en) * 2017-06-28 2017-08-18 山东大学 A kind of sound localization method based on high-precision GPS and its realize system

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS59149494A (en) * 1983-02-16 1984-08-27 Matsushita Electric Ind Co Ltd Microphone device
US4703506A (en) * 1985-07-23 1987-10-27 Victor Company Of Japan, Ltd. Directional microphone apparatus
JPH03106299A (en) * 1989-09-20 1991-05-02 Sanyo Electric Co Ltd Microphone device
JPH048097A (en) * 1990-04-26 1992-01-13 Matsushita Electric Ind Co Ltd Magnetic recording and reproducing device
JPH0481199A (en) * 1990-07-23 1992-03-13 Matsushita Electric Ind Co Ltd Stereophonic microphone
US7174022B1 (en) * 2002-11-15 2007-02-06 Fortemedia, Inc. Small array microphone for beam-forming and noise suppression
CN101447190A (en) * 2008-06-25 2009-06-03 北京大学深圳研究生院 Voice enhancement method employing combination of nesting-subarray-based post filtering and spectrum-subtraction
US20170034616A1 (en) * 2015-07-27 2017-02-02 Kabushiki Kaisha Audio-Technica Microphone and microphone apparatus
CN205249484U (en) * 2015-12-30 2016-05-18 临境声学科技江苏有限公司 Microphone linear array reinforcing directive property adapter
CN106019232A (en) * 2016-05-11 2016-10-12 北京地平线信息技术有限公司 Sound source positioning system and method
CN106679651A (en) * 2017-02-08 2017-05-17 北京地平线信息技术有限公司 Sound localization method and device and electronic equipment
CN107064878A (en) * 2017-06-28 2017-08-18 山东大学 A kind of sound localization method based on high-precision GPS and its realize system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115249476A (en) * 2022-07-15 2022-10-28 北京市燃气集团有限责任公司 Intelligent linkage gas cooker based on voice recognition and intelligent linkage method
CN115052232A (en) * 2022-08-12 2022-09-13 杭州海康威视数字技术股份有限公司 Pickup

Similar Documents

Publication Publication Date Title
CN103026734B (en) Electronic apparatus for generating beamformed audio signals with steerable nulls
US9641929B2 (en) Audio signal processing method and apparatus and differential beamforming method and apparatus
US10080088B1 (en) Sound zone reproduction system
CN110379439B (en) Audio processing method and related device
US11950063B2 (en) Apparatus, method and computer program for audio signal processing
US20230040846A1 (en) Audio device auto-location
US12022276B2 (en) Apparatus, method or computer program for processing a sound field representation in a spatial transform domain
US10783896B2 (en) Apparatus, methods and computer programs for encoding and decoding audio signals
WO2018234625A1 (en) Determination of targeted spatial audio parameters and associated spatial audio playback
CN111010649A (en) Sound pickup and microphone array
US11044555B2 (en) Apparatus, method and computer program for obtaining audio signals
CN110491409B (en) Method and device for separating mixed voice signal, storage medium and electronic device
CN110225445A (en) A kind of processing voice signal realizes the method and device of three-dimensional sound field auditory effect
CN113645542B (en) Voice signal processing method and system and audio and video communication equipment
EP3340648B1 (en) Processing audio signals
US10264383B1 (en) Multi-listener stereo image array
CN115460529A (en) Microphone array consistency detection method, device, equipment and storage medium
US11128953B2 (en) Loudspeaker beamforming for improved spatial coverage
CN115884038A (en) Audio acquisition method, electronic device and storage medium
US20240073571A1 (en) Generating microphone arrays from user devices
US10200807B2 (en) Audio rendering in real time
US20230413002A1 (en) Electronic device for applying directionality to audio signal, and method therefor
RU2793625C1 (en) Device, method or computer program for processing sound field representation in spatial transformation area
US20230145713A1 (en) System and method for omnidirectional adaptive loudspeaker
Schrammen et al. Robust Self-Localization of Microphone Arrays Using a Minimum Number of Acoustic Sources

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40027878

Country of ref document: HK

RJ01 Rejection of invention patent application after publication

Application publication date: 20200414

RJ01 Rejection of invention patent application after publication