CN113453124A

CN113453124A - Audio processing method, device and system

Info

Publication number: CN113453124A
Application number: CN202110738126.9A
Authority: CN
Inventors: 修平平; 刘焕; 朱赛男; 严涛; 浦宏杰; 鄢仁祥
Original assignee: Suzhou Keda Technology Co Ltd
Current assignee: Suzhou Keda Technology Co Ltd
Priority date: 2021-06-30
Filing date: 2021-06-30
Publication date: 2021-09-28
Anticipated expiration: 2041-06-30
Also published as: CN113453124B

Abstract

The application provides an audio processing method, an audio processing device and an audio processing system, wherein the audio processing method comprises the following steps: acquiring an audio signal acquired by acquisition equipment; performing double filtering processing on the audio signal acquired by the acquisition equipment, wherein the double filtering processing comprises static filtering processing and dynamic filtering processing, and the filtering coefficient of the static filtering processing is obtained through sound field transfer function estimation processing; and playing by a public address device based on the audio signal processed by the double filtering, wherein the public address device and the acquisition device are positioned in the same space. The method and the device can prevent the generation of howling and improve the sound transmission gain of the system.

Description

Audio processing method, device and system

Technical Field

The present application relates to the field of audio processing, and in particular, to an audio processing method, apparatus and system.

Background

Under various audio scenes, when the acquisition equipment and the public address equipment are used simultaneously, sound played by the public address equipment can be transmitted to the acquisition equipment through the space, the sound energy emitted by the public address equipment is large enough, and the pickup sensitivity of the acquisition equipment is high enough, the voice played by the public address equipment can generate howling.

Currently, howling can be improved by:

1) the distance between the acquisition equipment and the sound source is adjusted, so that the acquisition equipment is close to the sound source as much as possible, the acquisition equipment is far away from the public address equipment as far as possible, and meanwhile, the closer the public address equipment is to the audience, the better. The acquisition equipment is placed on the back of the radiation direction of the sound amplification equipment.

2) A frequency equalization method (broadband notch method) is used. The frequency response fluctuates greatly due to the fact that the frequency curve of the pickup and amplification device is not an ideal flat straight line (especially some amplification devices with poor quality) and the acoustic resonance effect of the sound field. Therefore, the frequency equalizer can be used to compensate the sound amplification curve, and the frequency response of the system is adjusted to be approximate straight line, so that the gains of all frequency bands are basically consistent.

However, in the above method, manual intervention is required to adjust the distance between the acquisition device and the sound source, and the adjustment effect can be determined only by continuous experiments, whereas the frequency equalization method is a compensation method after howling occurs, and it is difficult to perform howling prevention before the howling occurs.

Therefore, how to conveniently prevent the howling is a technical problem to be solved urgently by those skilled in the art.

Disclosure of Invention

In order to overcome the defects in the prior art, the present application provides an audio processing method, device and system, which can conveniently prevent the generation of howling.

According to an aspect of the present application, there is provided an audio processing method including:

acquiring an audio signal acquired by acquisition equipment;

performing double filtering processing on the audio signal acquired by the acquisition equipment, wherein the double filtering processing comprises static filtering processing and dynamic filtering processing, and the filtering coefficient of the static filtering processing is obtained through sound field transfer function estimation processing;

and playing by a public address device based on the audio signal processed by the double filtering, wherein the public address device and the acquisition device are positioned in the same space.

In some embodiments of the present application, the sound field transfer function estimation process comprises:

enabling the public address equipment to play a preset voice signal;

acquiring a pre-processing audio signal acquired by the acquisition equipment, wherein the pre-processing audio signal comprises a preset voice signal played by the public address equipment;

performing dynamic filtering processing on the preprocessed audio signal until a filter coefficient of the dynamic filtering processing is converged, wherein the preset voice signal is a reference signal of the dynamic filtering processing;

and taking the converged filter coefficients of the dynamic filtering process as the filter coefficients of the static filtering process.

In some embodiments of the present application, the static filtering process and the dynamic filtering process respectively process the audio signals collected by the collecting device, wherein the audio signals output by the dual filtering process are obtained by weighting the audio signals output by the static filtering process and the audio signals output by the dynamic filtering process.

In some embodiments of the present application, the weight of the static filtering process and the weight of the dynamic filtering process are obtained by one or more of calculating a residual echo energy size after filtering by the static filtering process and the dynamic filtering process, a product of differences between residual echo energies after filtering by the static filtering process and the dynamic filtering process, and a product of the residual echo energy of the static filtering process and an estimated echo difference energy, where the estimated echo energy is an estimated echo difference energy calculated from estimated echo differences of the static filtering process and the dynamic filtering process.

In some embodiments of the present application, the acquiring the audio signal acquired by the acquisition device further includes:

performing howling detection on the audio signal acquired by the acquisition equipment;

judging whether the audio signal collected by the collecting equipment generates howling or not;

and if so, carrying out equalization notch filtering processing on the audio signal subjected to the double filtering processing so as to play the audio signal subjected to the equalization notch filtering processing by the public address equipment.

In some embodiments of the present application, after the performing the double filtering process on the audio signal acquired by the acquisition device and before the playing by the public address device based on the audio signal subjected to the double filtering process, the method further includes:

and performing decorrelation processing on the audio signal to be played by the public address equipment.

In some embodiments of the present application, the static filtering process and the dynamic filtering process employ a normalized least mean square algorithm for filtering.

According to still another aspect of the present application, there is also provided an audio processing apparatus including:

the acquisition module is configured to acquire the audio signal acquired by the acquisition equipment;

the double-filtering module is configured to perform double-filtering processing on the audio signal acquired by the acquisition equipment, wherein the double-filtering processing comprises static filtering processing and dynamic filtering processing, and a filtering coefficient of the static filtering processing is obtained through sound field transfer function estimation processing;

and the control module is configured to control a public address device to play based on the audio signal processed by the double filtering, and the public address device and the acquisition device are positioned in the same space.

In some embodiments of the present application, further comprising:

the squeaking detection module is configured to perform squeaking detection on the audio signal acquired by the acquisition equipment;

and the equalizing notch filtering module is configured to perform equalizing notch filtering processing on the audio signal subjected to the double filtering processing when the howling detection module detects that the howling is generated by the audio signal acquired by the acquisition equipment, so that the audio signal subjected to the equalizing notch filtering processing is played by the public address equipment.

In some embodiments of the present application, further comprising:

and the decorrelation module is configured to perform decorrelation processing on the audio signal to be played by the public address equipment.

According to yet another aspect of the present application, there is also provided an audio processing system, comprising:

collecting equipment;

the sound amplifying equipment and the collecting equipment are positioned in the same space;

an audio processing apparatus as described above.

According to yet another aspect of the present application, there is also provided an electronic apparatus, including: a processor; a storage medium having stored thereon a computer program which, when executed by the processor, performs the steps as described above.

According to yet another aspect of the present application, there is also provided a storage medium having stored thereon a computer program which, when executed by a processor, performs the steps as described above.

Therefore, compared with the prior art, the scheme provided by the application has the following advantages:

the application adopts static filtering processing and dynamic filtering processing, is right the audio signal that collection equipment gathered carries out dual-filtering processing to audio information after handling plays, wherein, the filter coefficient that static filtering processed is estimated to handle via the sound field transfer function and is obtained, and the filter coefficient that static filtering processed keeps unchangeable in dual-filtering processing, and the filter coefficient that dynamic filtering processed changes according to actual environment, compromise the environment of dynamic change when the broadcast is gathered to actual audio when considering the sound field transfer function of initial environment is estimated, from this, through dual-filtering processing, can effectively carry out the prevention of whistling automatically, improve system audio gain simultaneously.

Drawings

The above and other features and advantages of the present application will become more apparent by describing in detail exemplary embodiments thereof with reference to the attached drawings.

Fig. 1 shows a flow chart of an audio processing method according to an embodiment of the application.

Fig. 2 shows a flow chart of a sound field transfer function estimation process according to a specific embodiment of the present application.

Fig. 3 shows a block diagram of an audio processing system and an audio processing apparatus according to an embodiment of the present application.

Fig. 4 shows a schematic diagram of an audio processing system at the time of sound field transfer estimation processing according to an embodiment of the present application.

Fig. 5 is a block diagram of an audio processing system and an audio processing apparatus according to an embodiment of the present application.

Fig. 6 schematically illustrates a computer-readable storage medium in an exemplary embodiment of the disclosure.

Fig. 7 schematically illustrates an electronic device in an exemplary embodiment of the disclosure.

Detailed Description

Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many different forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art. The described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.

Furthermore, the drawings are merely schematic illustrations of the present application and are not necessarily drawn to scale. The same reference numerals in the drawings denote the same or similar parts, and thus their repetitive description will be omitted. Some of the block diagrams shown in the figures are functional entities and do not necessarily correspond to physically or logically separate entities. These functional entities may be implemented in the form of software, or in one or more hardware modules or integrated circuits, or in different networks and/or processor devices and/or microcontroller devices.

The flow charts shown in the drawings are merely illustrative and do not necessarily include all of the steps. For example, some steps may be decomposed, and some steps may be combined or partially combined, so that the actual execution sequence may be changed according to the actual situation.

In order to overcome the defects in the prior art, the present application provides an audio processing method and an audio processing apparatus, so as to conveniently prevent howling.

In particular, the invention can be applied to a space with a collecting device and a public address device, and the audio played by the public address device is collected by the collecting device. Referring first to fig. 1, fig. 1 shows a flow chart of an audio processing method according to an embodiment of the present application. Fig. 1 shows the following steps in total:

step S110: and acquiring the audio signal acquired by the acquisition equipment.

Specifically, the audio signals collected by the collecting device comprise the voice produced by the speaker in the space where the collecting device is located and the audio signals played by the public address device.

Step S120: and carrying out double filtering processing on the audio signals collected by the collection equipment, wherein the double filtering processing comprises static filtering processing and dynamic filtering processing, and the filtering coefficient of the static filtering processing is obtained through sound field transfer function estimation processing.

Specifically, the static filtering process and the dynamic filtering process may be performed by using a Normalized Least Mean Square (NLMS) algorithm. The invention is not limited thereto, and other filtering algorithms of static filtering process and dynamic filtering process are also within the protection scope of the invention. The static filter process has static filter coefficients (filter coefficients). Further, when the sound field transfer function estimation process is performed again, the filter coefficient of the updated static filter process can be obtained. The filter coefficients of the dynamic filtering process are dynamically changed.

The normalized least mean square algorithm employs a steepest gradient descent algorithm, with the goal of the dual-filtered filter being to compute the difference between the desired signal and the dual-filtered filter output. The error signal is fed back to the filter for the double filtering process and its coefficients are changed by an algorithm to minimize a function of the difference, i.e. a cost function. The algorithm has good convergence rate and relatively simple computing power.

Specifically, the normalized least mean square algorithm processes an input audio signal by using a reference signal, thereby realizing filtering operations such as echo cancellation. In this embodiment, the filter coefficient of the static filter processing is obtained through the sound field transfer function estimation processing, so that the sound field transfer function where the acquisition device and the public address device are currently located can be estimated, and thus, the audio output by the static filter processing and the dynamic filter processing can be combined, so that the dynamic filter processing can perform optimized echo cancellation and filtering operation according to the estimation of the sound field transfer function where the acquisition device and the public address device are currently located.

Specifically, the static filtering process and the dynamic filtering process respectively process the audio signals collected by the collecting device, wherein the audio signals output by the dual-filtering process are obtained by weighting the audio signals output by the static filtering process and the audio signals output by the dynamic filtering process, so that the advantages of the static filtering and the advantages of the dynamic filtering can be simultaneously obtained. Further, the weight of the static filtering process and the weight of the dynamic filtering process are obtained by one or more of the size of the residual echo energy after filtering by the static filtering process and the dynamic filtering process, the product of the difference value of the residual echo energy after filtering by the static filtering process and the dynamic filtering process, and the product of the residual echo energy of the static filtering process and the estimated echo difference value energy. The estimated echo energy is estimated echo difference energy obtained by calculating according to the estimated echo difference of the static filtering processing and the dynamic filtering processing. Thus, the weights of the static filtering process and the dynamic filtering process are determined based on one or more of the aforementioned items of data that can represent the filtering effect, so that the audio signal output by the double filtering process is an optimal audio signal.

Step S130: and playing by the public address equipment based on the audio signal subjected to the double filtering processing.

Specifically, the step S130 plays the processed audio signal, thereby performing howling prevention and reducing the probability of howling occurring in the played audio signal. Further, the audio signal played by the public address device is collected by the collecting device and further processed by double filtering.

In the audio processing method provided by the application, static filtering processing and dynamic filtering processing are adopted, the audio signals collected by the collecting device are subjected to double filtering processing, and processed audio information is played, wherein the filtering coefficient of the static filtering processing is obtained through sound field transfer function estimation processing, the filtering coefficient of the static filtering processing is kept unchanged in the double filtering processing, the filtering coefficient of the dynamic filtering processing changes according to the actual environment, and the sound field transfer function estimation of the initial environment is considered while the environment of dynamic change during actual audio collection and playing is considered, so that howling prevention can be automatically and effectively carried out through the double filtering processing, and meanwhile, the audio gain of the system is improved.

In some specific embodiments of the present application, after the audio signal acquired by the acquisition device is acquired in step S110 in fig. 1, the following steps may be further included: and carrying out squeaking detection on the audio signal acquired by the acquisition equipment. And then, judging whether the audio signal collected by the collecting equipment generates howling. And if the audio signal acquired by the acquisition equipment is judged to generate howling, carrying out equalization notch filtering processing on the audio signal subjected to double filtering processing so as to play the audio signal subjected to the equalization notch filtering processing by the public address equipment. Specifically, in some embodiments, the audio signal subjected to howling detection may be an audio signal acquired by an acquisition device or an audio signal subjected to the double filtering process. The squeaking detection and the double filtering processing can be executed in parallel by using the audio signals collected by the collecting equipment to perform the squeaking detection, so that the overall efficiency of the system is improved; howling detection using the audio signal subjected to the double filtering process can improve the accuracy of howling detection. Further, howling detection can detect a howling frequency point, so that an equalization filter coefficient of the howling frequency point can be calculated, and the equalization filter coefficient is used for equalization notch filtering processing to perform equalization filtering, so that the howling frequency point is weakened or removed, and conditions generated by howling are damaged.

Further, after the audio signal collected by the collecting device is subjected to the double filtering processing in step S120 in fig. 1, and before the audio signal subjected to the double filtering processing is played by the public address device in step S130, the following steps may be further included: and performing decorrelation processing on the audio signal to be played by the public address equipment. Specifically, the decorrelation processing may be implemented by, for example, frequency shift and/or phase modulation, so that the correlation between the audio signal to be played by the public address device and the audio signal collected by the collecting device is reduced, which is beneficial to the double filtering processing and can avoid the generation of howling. Specifically, in the above embodiments, the decorrelation process may be performed after performing a double filtering process on the audio signal acquired by the acquisition apparatus or after performing an equalization notch filtering process on the double-filtered audio signal.

Therefore, in the above preferred embodiment, the dual filtering process and the decorrelation process are combined to prevent the howling, but if the system gain is too large, for example, the dual filtering process and the decorrelation process can improve the gain of preventing the howling by 15 db. When the system gain is adjusted to 20 decibels (more than 15 decibels), the system may still have howls after the double filtering processing, so the generated howls are removed through howling detection and equalization notch filtering processing, and therefore the howls are prevented in advance through the double filtering processing, and are removed through the howling detection and the equalization notch filtering processing, so that the occurrence of the howls is further reduced, and the playing quality of the sound amplifying equipment is improved.

Referring now to fig. 2, fig. 2 shows a flow diagram of a sound field transfer function estimation process according to a specific embodiment of the present application. Fig. 2 shows the following steps together:

step S101: and enabling the public address equipment to play a preset voice signal.

Specifically, the preset voice signal is, for example, a pre-stored voice of a human voice.

Step S102: and acquiring the pre-processing audio signal acquired by the acquisition equipment, wherein the pre-processing audio signal comprises a preset voice signal played by the public address equipment.

Step S103: and performing dynamic filtering processing on the preprocessed audio signal until a filter coefficient of the dynamic filtering processing is converged, wherein the preset voice signal is a reference signal of the dynamic filtering processing.

Step S104: and taking the converged filter coefficients of the dynamic filtering process as the filter coefficients of the static filtering process.

Therefore, the estimation of the environmental transfer function of the space where the sound amplifying device and the collecting device are located can be realized through the steps S101 to S104, so as to obtain the filter coefficient of the static filtering processing. Therefore, the method is particularly suitable for scenes in which the relative positions of the loudspeaker device and the acquisition device are fixed and the space in which the loudspeaker device and the acquisition device are located is relatively fixed. When the loudspeaker device and the acquisition device are shifted in position or located in another space, the step of the sound field transfer function estimation process shown in fig. 2 may be performed again to realize the sound field transfer function estimation again, so that the filter coefficients of the static filtering process are adjusted based on the new sound field transfer function estimation process to fit in the new scene.

The above exemplary embodiments of the present application are shown, the present application is not limited thereto, and in each embodiment, the addition, the omission, and the sequence change of the steps are all within the protection scope of the present application; the embodiments may be implemented individually or in combination.

Fig. 3 shows a block diagram of an audio processing system and an audio processing apparatus according to an embodiment of the present application. The audio processing system comprises an acquisition device 210, a loudspeaker device 230 and an audio processing apparatus 220. The amplification device 230 is located in the same space as the acquisition device 210. The audio signal played by the audio amplifier 230 is captured by the capture device 210 via the ambient transfer function between the audio amplifier 230 and the space where the capture device 210 is located.

The audio processing device 220 includes an acquisition module 221, a dual filtering module 222, and a control module 223.

The acquisition module 221 is configured to acquire an audio signal acquired by an acquisition device.

The dual filtering module 222 is configured to perform dual filtering processing on the audio signal acquired by the acquisition device, where the dual filtering processing includes static filtering processing and dynamic filtering processing, and a filter coefficient of the static filtering processing is obtained through sound field transfer function estimation processing.

The control module 223 is configured to control the sound emitting device to play based on the dual filtered audio signal.

Referring now to fig. 4, fig. 4 shows a schematic diagram of an audio processing system at the time of a sound field transfer estimation process according to an embodiment of the present application. Fig. 4 is only schematic, and some modules are omitted for clarity, as shown in fig. 4, the audio processing apparatus 220 may further include a coefficient extraction module 227 in addition to the obtaining module 221, and the dynamic filtering processing module 222A belongs to the dual filtering module 222. Specifically, the dual filtering module 222 may include a dynamic filtering processing module 222A and a static filtering processing module. The dynamic filter processing module 222A and the static filter processing module may be a dynamic filter having dynamic filter coefficients and a static filter having static filter coefficients, respectively.

In the embodiment shown in fig. 4, the preset voice signal is input to the public address device 230 for playing through the public address device, and at the same time, the preset voice signal is used as the reference signal of the dynamic filtering module 222A. The preset voice signal played by the audio amplifier 230 is collected by the collection device 210 through the environment transfer function, the acquisition module 221 of the audio processing apparatus 220 acquires the collected audio signal from the collection device 210, and inputs the acquired audio signal to the dynamic filtering module 222A for processing, and the filtering parameter of the dynamic filtering module 222A changes in real time, and is extracted by the coefficient extraction module 227 when the filtering parameter converges. Further, the extracted filter parameters are saved as a file in a set format (for example, log format or txt format, but not limited thereto) and initialized and used as the filter coefficients of the static filter module.

Referring now to fig. 5, fig. 5 is a block diagram illustrating an audio processing system and an audio processing device according to an embodiment of the present application. The audio processing system comprises an acquisition device 210, a loudspeaker device 230 and an audio processing apparatus 220. The amplification device 230 is located in the same space as the acquisition device 210. The audio signal played by the audio amplifier 230 is captured by the capture device 210 via the ambient transfer function between the audio amplifier 230 and the space where the capture device 210 is located.

The audio processing apparatus 220 comprises an acquisition module 221, a double filtering module 222, a control module 223, a howling detection module 224, an equalization notch filtering module 225 and a decorrelation module 226.

The obtaining module 221, the dual filtering module 222, and the control module 223 are already described with reference to fig. 3, and are not repeated here.

The howling detection module 224 is configured to perform howling detection on the audio signal collected by the collection device.

The equalizing notch filtering module 225 is configured to perform equalizing notch filtering processing on the dual-filtered audio signal when the howling detection module detects that the howling is generated by the audio signal acquired by the acquisition device, so that the sound amplifying device can play the audio signal based on the equalizing notch filtering processing.

The decorrelation module 226 is configured to decorrelate audio signals to be played by the loudspeaker device.

In the audio processing device and system provided by the application, static filtering processing and dynamic filtering processing are adopted, the audio signals collected by the collecting device are subjected to double filtering processing, and processed audio information is played, wherein the filtering coefficient of the static filtering processing is obtained through sound field transfer function estimation processing, the filtering coefficient of the static filtering processing is kept unchanged in the double filtering processing, the filtering coefficient of the dynamic filtering processing changes according to the actual environment, the sound field transfer function estimation of the initial environment is considered, and meanwhile, the dynamic changing environment during the actual audio collection playing is considered, so that howling prevention can be automatically and effectively carried out through the double filtering processing, and meanwhile, the audio gain of the system is improved.

The audio processing device and system can be realized by software, hardware, firmware and any combination thereof. Fig. 3-5 are merely schematic illustrations of the audio processing apparatus and system provided in the present application, and the splitting, combining, and adding of modules are within the scope of the present application without departing from the spirit of the present application.

In an exemplary embodiment of the present disclosure, there is also provided a computer-readable storage medium, on which a computer program is stored, which when executed by, for example, a processor, may implement the steps of the audio processing method described in any one of the above embodiments. In some possible embodiments, the various aspects of the present application may also be implemented in the form of a program product comprising program code for causing a terminal device to perform the steps according to the various exemplary embodiments of the present application described in the audio processing method section above of this specification, if the program product is run on the terminal device.

Referring to fig. 6, a program product 800 for implementing the above method according to an embodiment of the present application is described, which may employ a portable compact disc read only memory (CD-ROM) and include program code, and may be run on a terminal device, such as a personal computer. However, the program product of the present application is not limited thereto, and in this document, a readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.

The program product may employ any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.

The computer readable storage medium may include a propagated data signal with readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A readable storage medium may also be any readable medium that is not a readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a readable storage medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.

Program code for carrying out operations of the present application may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the tenant computing device, partly on the tenant device, as a stand-alone software package, partly on the tenant computing device and partly on a remote computing device, or entirely on the remote computing device or server. In the case of remote computing devices, the remote computing devices may be connected to the tenant computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).

In an exemplary embodiment of the present disclosure, there is also provided an electronic device, which may include a processor, and a memory for storing executable instructions of the processor. Wherein the processor is configured to perform the steps of the audio processing method in any of the above embodiments via execution of the executable instructions.

As will be appreciated by one skilled in the art, aspects of the present application may be embodied as a system, method or program product. Accordingly, various aspects of the present application may be embodied in the form of: an entirely hardware embodiment, an entirely software embodiment (including firmware, microcode, etc.) or an embodiment combining hardware and software aspects that may all generally be referred to herein as a "circuit," module "or" system.

An electronic device 600 according to this embodiment of the present application is described below with reference to fig. 7. The electronic device 600 shown in fig. 7 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present application.

As shown in fig. 7, the electronic device 600 is embodied in the form of a general purpose computing device. The components of the electronic device 600 may include, but are not limited to: at least one processing unit 610, at least one storage unit 620, a bus 630 that connects the various system components (including the storage unit 620 and the processing unit 610), a display unit 640, and the like.

Wherein the storage unit stores program code executable by the processing unit 610 to cause the processing unit 610 to perform steps according to various exemplary embodiments of the present application described in the above-mentioned audio processing method section of the present specification. For example, the processing unit 610 may perform the steps as shown in fig. 1 or 2.

The storage unit 620 may include readable media in the form of volatile memory units, such as a random access memory unit (RAM)6201 and/or a cache memory unit 6202, and may further include a read-only memory unit (ROM) 6203.

The memory unit 620 may also include a program/utility 6204 having a set (at least one) of program modules 6205, such program modules 6205 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each of which, or some combination thereof, may comprise an implementation of a network environment.

Bus 630 may be one or more of several types of bus structures, including a memory unit bus or memory unit controller, a peripheral bus, an accelerated graphics port, a processing unit, or a local bus using any of a variety of bus architectures.

The electronic device 600 may also communicate with one or more external devices 700 (e.g., keyboard, pointing device, bluetooth device, etc.), with one or more devices that enable a tenant to interact with the electronic device 600, and/or with any devices (e.g., router, modem, etc.) that enable the electronic device 600 to communicate with one or more other computing devices. Such communication may occur via an input/output (I/O) interface 650. Also, the electronic device 600 may communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network such as the Internet) via the network adapter 660. The network adapter 660 may communicate with other modules of the electronic device 600 via the bus 630. It should be appreciated that although not shown, other hardware and/or software modules may be used in conjunction with the electronic device 600, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage systems, among others.

Through the above description of the embodiments, those skilled in the art will readily understand that the exemplary embodiments described herein may be implemented by software, or by software in combination with necessary hardware. Therefore, the technical solution according to the embodiments of the present disclosure may be embodied in the form of a software product, which may be stored in a non-volatile storage medium (which may be a CD-ROM, a usb disk, a removable hard disk, etc.) or on a network, and includes several instructions to enable a computing device (which may be a personal computer, a server, or a network device, etc.) to execute the above-mentioned audio processing method according to the embodiments of the present disclosure.

Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.

Claims

1. An audio processing method, comprising:

acquiring an audio signal acquired by acquisition equipment;

2. The audio processing method of claim 1, wherein the sound field transfer function estimation process comprises:

enabling the public address equipment to play a preset voice signal;

3. The audio processing method according to claim 1, wherein the static filtering process and the dynamic filtering process the audio signal collected by the collection device separately, wherein the audio signal output by the double filtering process is obtained by weighting the audio signal output by the static filtering process and the audio signal output by the dynamic filtering process.

4. The audio processing method according to claim 1, wherein the weight of the static filtering process and the weight of the dynamic filtering process are obtained by one or more of a residual echo energy size after filtering by the static filtering process and the dynamic filtering process, a product of a difference between residual echo energies after filtering by the static filtering process and the dynamic filtering process, and a product of a residual echo energy of the static filtering process and an estimated echo difference energy calculated from an estimated echo difference of the static filtering process and the dynamic filtering process.

5. The audio processing method of claim 1, wherein obtaining the audio signal captured by the capture device further comprises:

6. The audio processing method as claimed in any one of claims 1 to 5, wherein after the double filtering processing of the audio signal acquired by the acquisition device and before the playing by the public address device based on the audio signal subjected to the double filtering processing, further comprises:

7. An audio processing apparatus, comprising:

8. The audio processing apparatus of claim 7, further comprising:

9. The audio processing apparatus of claim 7, further comprising:

10. An audio processing system, comprising:

collecting equipment;

the audio processing device of any of claims 7 to 9.