CN116980804A

CN116980804A - Volume adjustment method, device, equipment and readable storage medium

Info

Publication number: CN116980804A
Application number: CN202311239752.9A
Authority: CN
Inventors: 梁俊斌
Original assignee: Tencent Technology Shenzhen Co Ltd
Current assignee: Tencent Technology Shenzhen Co Ltd
Priority date: 2023-09-25
Filing date: 2023-09-25
Publication date: 2023-10-31
Anticipated expiration: 2043-09-25
Also published as: CN116980804B

Abstract

The application provides a volume adjustment method, a volume adjustment device, volume adjustment equipment and a readable storage medium, wherein the volume adjustment method comprises the following steps: when the set volume value reaches the first upper limit volume value, if the volume increasing instruction is detected, determining a target volume value according to the volume increasing instruction; determining audio class information according to the original audio signal, and determining adjustment reference data according to the audio class information; determining a frequency domain signal from the original audio signal; gain calculation is carried out according to the auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data; gain processing is carried out on the frequency domain signal by utilizing the gain data to obtain a gain frequency domain signal; and determining a gain audio signal according to the gain frequency domain signal, wherein the perceived volume corresponding to the gain audio signal is matched with the target volume value. The method provided by the application can enable the perceived volume corresponding to the audio signal after volume adjustment to be matched with the target volume value, and has a good volume adjustment effect.

Description

Volume adjustment method, device, equipment and readable storage medium

Technical Field

The present application relates to the field of computer technologies, and in particular, to a method, an apparatus, a device, and a readable storage medium for adjusting volume.

Background

The volume of audio, also called loudness, refers to the subjective perception of the intensity of the sound heard by the human ear. The volume of the audio is related to the amplitude of the sound. For some audio playback devices (e.g., cell phones, personal computers, speakers, etc.), the volume of the audio they play may be adjusted by setting a volume value.

Currently, a digital adjustment or an analog adjustment method is generally used for volume amplification processing of audio. However, the digital adjusting method may make the adjusted audio have a sense of broken sound in hearing, and the adjusting effect is poor; the analog adjusting method is limited by the amplifying capability of hardware equipment, the volume adjustable range is smaller, and the adjusting effect is also poorer.

Disclosure of Invention

The embodiment of the application provides a volume adjusting method, a volume adjusting device, volume adjusting equipment and a readable storage medium, which can enable the perceived volume corresponding to a volume-adjusted audio signal to be matched with a target volume value, and have a good volume adjusting effect.

In one aspect, an embodiment of the present application provides a method for adjusting volume, including:

when the set volume value reaches a first upper limit volume value, if a volume increasing instruction is detected, determining a target volume value according to the volume increasing instruction, wherein the target volume value is larger than the first upper limit volume value and smaller than or equal to a second upper limit volume value, and the second upper limit volume value is larger than the first upper limit volume value;

Performing category analysis processing on the original audio signal to obtain audio category information, and determining adjustment reference data according to the audio category information;

converting the original audio signal from a time domain to a frequency domain to obtain a frequency domain signal corresponding to the original audio signal;

gain calculation is carried out according to the auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data;

and performing gain processing on the frequency domain signal by using the gain data to obtain a gain frequency domain signal, and converting the gain frequency domain signal from a frequency domain to a time domain to obtain a gain audio signal, wherein the perceived volume corresponding to the gain audio signal is matched with the target volume value.

In one aspect, an embodiment of the present application provides a volume adjustment device, including:

the determining unit is used for determining a target volume value according to the volume increasing instruction when the volume increasing instruction is detected when the set volume value reaches a first upper limit volume value, wherein the target volume value is larger than the first upper limit volume value and smaller than or equal to a second upper limit volume value, and the second upper limit volume value is larger than the first upper limit volume value;

The processing unit is used for carrying out category analysis processing on the original audio signal to obtain audio category information, and determining adjustment reference data according to the audio category information;

the processing unit is further used for converting the original audio signal from a time domain to a frequency domain to obtain a frequency domain signal corresponding to the original audio signal;

the processing unit is further used for performing gain calculation according to the auditory perception weighted data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data;

the conversion unit is used for performing gain processing on the frequency domain signal by utilizing the gain data to obtain a gain frequency domain signal, converting the gain frequency domain signal from a frequency domain to a time domain to obtain a gain audio signal, and the perceived volume corresponding to the gain audio signal is matched with the target volume value.

In one aspect, an embodiment of the present application provides a computer device, including: the system comprises a processor, a communication interface and a memory, wherein the processor, the communication interface and the memory are mutually connected, executable program codes are stored in the memory, and the processor is used for calling the executable program codes to realize the volume adjustment method provided by the embodiment of the application.

Correspondingly, the embodiment of the application also provides a computer readable storage medium, wherein the computer readable storage medium stores instructions which, when run on a computer, cause the computer to realize the volume adjustment method provided by the embodiment of the application.

Accordingly, embodiments of the present application also provide a computer program product comprising a computer program or computer instructions stored in a computer readable storage medium. The processor of the computer device reads the computer program or the computer instructions from the computer readable storage medium, and the processor executes the computer program or the computer instructions, so that the computer device realizes the volume adjustment method provided by the embodiment of the application.

When the set volume value reaches the first upper limit volume value, if the volume increasing instruction is detected, the target volume value can be determined according to the volume increasing instruction, wherein the target volume value is larger than the first upper limit volume value and smaller than or equal to the second upper limit volume value; performing category analysis processing on the original audio signal to obtain audio category information, and determining adjustment reference data according to the audio category information; converting an original audio signal from a time domain to a frequency domain to obtain a frequency domain signal; gain calculation is carried out according to the auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data; gain processing is carried out on the frequency domain signal by utilizing the gain data to obtain a gain frequency domain signal; and converting the gain frequency domain signal from the frequency domain to the time domain to obtain a gain audio signal, wherein the perceived volume corresponding to the gain audio signal is matched with the target volume value. According to the volume adjusting method provided by the embodiment of the application, when the set volume value reaches the first upper limit volume value and the volume increasing instruction is detected, the volume of the original audio signal is adjusted, so that the volume of the audio signal is further increased, and the volume requirement of a user is met; the original audio signals can be analyzed and processed to obtain adjustment reference data, so that the audio signals of different categories can be subjected to targeted volume adjustment; the original audio signal can be converted to obtain a frequency domain signal, so that gain processing is conveniently carried out according to the frequency domain signal, the processed audio signal has better hearing experience, and the adjusted audio signal cannot be represented as sound truncated; gain data can be determined according to auditory perception weighting data, target volume values, adjustment reference data and frequency domain signals, and the volume adjustable range of the original audio signal is larger because the value range of the target volume values is larger; the gain frequency domain signal can be determined according to the gain data, a gain audio signal is obtained, and the perceived volume corresponding to the gain audio signal is matched with the target volume value; the method provided by the embodiment of the application can further amplify the volume of the audio signal when the volume of the audio signal reaches the first upper limit volume value, so that the definition of the audio signal is ensured, and the method provided by the application has a better volume adjusting effect, so that the adjusted audio signal has better hearing experience sense and does not have sound breaking sense in hearing, and meanwhile, the method provided by the embodiment of the application can be used for adjusting various different types of audio signals, and the range of the target volume value is larger, so that the method provided by the embodiment of the application can be applied to various use scenes (such as noise use scenes) and has better universality.

Drawings

In order to more clearly illustrate the embodiments of the application or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the application, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.

Fig. 1 is a schematic diagram of a system architecture of a volume adjustment system according to an embodiment of the present application;

fig. 2 is a flow chart of a volume adjustment method according to an embodiment of the present application;

fig. 3 is a flow chart of another volume adjustment method according to an embodiment of the present application;

fig. 4 is a schematic diagram of a volume adjustment method according to an embodiment of the present application;

FIG. 5 is a schematic diagram of an acoustic equal loudness graph according to an embodiment of the present application;

FIG. 6 is a schematic diagram of auditory perception weighting data provided by an embodiment of the present application;

fig. 7 is a block diagram of a volume adjusting device according to an embodiment of the present application;

fig. 8 is a block diagram of a computer device according to an embodiment of the present application.

Detailed Description

The following description of the embodiments of the present application will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present application, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.

It should be noted that the descriptions of "first," "second," and the like in the embodiments of the present application are for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a technical feature defining "first", "second" may include at least one such feature, either explicitly or implicitly.

For some audio playback devices, the volume of audio that it plays may be adjusted by setting a volume value. Currently, the method for adjusting the volume of an audio signal is mainly a digital adjusting method and an analog adjusting method. However, both methods have poor volume adjustment effect, and the volume adjustable range of the audio is small.

Based on this, the embodiment of the application provides a volume adjustment method, which can determine a target volume value according to a volume increasing instruction when the volume value reaches a first upper limit volume value, wherein the target volume value is larger than the first upper limit volume value and smaller than or equal to a second upper limit volume value, and the second upper limit volume value is larger than the first upper limit volume value if the volume increasing instruction is detected; performing category analysis processing on the original audio signal to obtain audio category information, and determining adjustment reference data according to the audio category information; converting the original audio signal from a time domain to a frequency domain to obtain a frequency domain signal corresponding to the original audio signal; gain calculation is carried out according to the auditory perception weighting data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data; and performing gain processing on the frequency domain signal by using the gain data to obtain a gain frequency domain signal, converting the gain frequency domain signal from a frequency domain to a time domain to obtain a gain audio signal, and matching the perceived volume corresponding to the gain audio signal with a target volume value. The method provided by the embodiment of the application can enable the adjusted audio signal to have better volume adjusting effect and better hearing experience sense, and can not have sound breaking sense in hearing, and meanwhile, the value range of the target volume value in the embodiment of the application is larger, so that the volume of the original audio signal can be further amplified, and the method provided by the embodiment of the application has better universality.

The volume adjustment method provided by the embodiment of the application can be applied to the field of intelligent transportation. The intelligent transportation system (Intelligent Traffic System, ITS), also called intelligent transportation system (Intelligent Transportation System), is a comprehensive transportation system which uses advanced scientific technology (information technology, computer technology, data communication technology, sensor technology, electronic control technology, automatic control theory, operation study, artificial intelligence, etc.) effectively and comprehensively for transportation, service control and vehicle manufacturing, and enhances the connection among vehicles, roads and users, thereby forming a comprehensive transportation system for guaranteeing safety, improving efficiency, improving environment and saving energy. In the running process, external noise may cause that the intelligent traffic system cannot better complete the connection between the vehicle and the user, and the volume adjustment method provided by the embodiment of the application can be adopted to adjust the volume of the output audio of the intelligent traffic system, so that the perceived volume of the output audio can be clearer, and the good man-machine interaction of the intelligent traffic system is facilitated.

The volume adjustment method provided by the embodiment of the application can also be applied to the field of artificial intelligence. Artificial intelligence (Artificial Intelligence, AI) is the theory, method, technique and application system that uses a digital computer or a machine controlled by a digital computer to simulate, extend and extend human intelligence, sense the environment, acquire knowledge and use the knowledge to obtain optimal results. The field of artificial intelligence includes speech technology (Speech Technology). Key technologies to speech technology are Automatic Speech Recognition (ASR) and speech synthesis (TTS) technologies and voiceprint recognition technologies. In the speech synthesis technology, the method provided by the embodiment of the application can be adopted to carry out volume adjustment processing on the synthesized audio signal. According to the volume adjustment method provided by the embodiment of the application, the original audio signal can be subjected to gain processing according to the auditory perception weighting data and the target volume value to obtain the gain audio signal, so that the perception volume corresponding to the gain audio signal is matched with the target volume value, the loudness of the audio generated by the voice synthesis technology is higher, and the quality of the audio is higher.

The architecture of the volume adjustment system provided by the embodiment of the application will be described with reference to the accompanying drawings.

Referring to fig. 1, the system architecture of a volume adjustment system provided by the embodiment of the application includes an audio acquisition device 101, a volume adjustment device 102 and a database 103, wherein the volume adjustment device 102 can perform data interaction with the audio acquisition device 101 and the database 103, and the volume adjustment device 102 includes a gain data module 1021 and a gain frequency domain signal module 1022. Wherein:

the audio acquisition device 101 may generate an audio signal, or may receive an audio signal transmitted by another device, and the audio acquisition device 101 may transmit the original audio signal to the volume adjustment device 102. The audio acquisition device 101 may be a device independent of the volume adjustment device 102 or may be a module disposed in the volume adjustment device 102. The audio acquisition device 101 may be, but is not limited to, a handheld device (e.g., a smart phone, a tablet computer), a computing device (e.g., a personal computer (Personal Computer, PC), an in-vehicle terminal, a smart voice interaction device, a wearable device, or other smart appliance, etc. having audio generation and communication functions.

The volume adjustment device 102 may receive the original audio signal transmitted from the audio acquisition device 101 and perform volume adjustment processing on the original audio signal. The volume adjustment device 102 includes a gain data module 1021 and a gain frequency domain signal module 1022, where the gain data module 1021 is configured to generate gain data, and the gain frequency domain signal module 1022 is configured to generate a gain audio signal according to the gain data and the original audio signal. The volume adjustment device 102 may be a terminal device or a server. When the volume adjustment device 102 is a terminal device, the volume adjustment device 102 may be a mobile phone, a computer, an intelligent voice interaction device, an intelligent home appliance, a vehicle-mounted terminal, an aircraft, or the like, but is not limited thereto. When the volume adjusting device 102 is a server, the volume adjusting device 102 may be an independent physical server, a server cluster or a distributed system formed by a plurality of physical servers, or a cloud server providing cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communication, middleware services, domain name services, security services, a content distribution network (Content Delivery Network, CDN), and basic cloud computing services such as big data and an artificial intelligence platform.

The database 103 is used to store relevant data of the volume adjustment device 102, such as: auditory perception weighting data, and the like. The database 103 may be a local database in the volume adjustment device 102, or may be a cloud database (i.e. a database deployed in the cloud) associated with the volume adjustment device 102, specifically may be deployed based on any one of a private cloud, a public cloud, a hybrid cloud, an edge cloud, and the like, so that the functions of the cloud databases that are focused are different. For example, the database deployed in the private cloud is a personal device of the user, and is more focused on serving a small part of the user, while the database deployed in the public cloud is deployed based on a cloud platform provided by a third party, so that data stored in the database can be shared, data of any user can be stored in the database, and data in the database can be used by any user.

The principle of operation of the volume adjustment system shown in fig. 1 will be described in detail as follows:

the audio acquisition device 101 transmits the original audio signal to the volume adjustment device 102; when the set volume value reaches the first upper limit volume value, if the volume adjustment device 102 detects a volume increase instruction, the volume adjustment device 102 determines a target volume value according to the volume increase instruction, wherein the target volume value is greater than the first upper limit volume value and less than or equal to the second upper limit volume value, and the second upper limit volume value is greater than the first upper limit volume value; the volume adjustment device 102 may perform a category analysis process on the received original audio signal to obtain audio category information, and determine adjustment reference data according to the audio category information; the volume adjustment device 102 may convert the original audio signal from a time domain to a frequency domain, to obtain a frequency domain signal corresponding to the original audio signal; the volume adjustment device 102 may obtain auditory sense weighted data from the database 103, and the gain data module 1021 in the volume adjustment device 102 may perform gain calculation according to the obtained auditory sense weighted data, the target volume value, the adjustment reference data, and the frequency domain signal, to obtain gain data; the gain data module 1021 sends the gain data to the gain frequency domain signal module 1022, and the gain frequency domain signal module 1022 can perform gain processing on the frequency domain signal by using the gain data to obtain a gain frequency domain signal; and converting the gain frequency domain signal from the frequency domain to the time domain to obtain a gain audio signal, wherein the perceived volume corresponding to the gain audio signal is matched with the target volume value. The volume adjustment device 102 may send the gain audio signal to the audio acquisition device 101. The volume adjusting method provided by the embodiment of the application can enable the adjusted audio signal to have a good volume adjusting effect, has a good auditory experience sense, does not have a sound breaking sense in auditory sense, and simultaneously has a large value range of the target volume value so that the volume of the original audio signal can be further amplified.

It should be understood that the architecture diagram of the volume adjustment system described in the embodiments of the present application is for more clearly describing the volume adjustment method of the embodiments of the present application, and does not constitute a limitation of the volume adjustment method provided in the embodiments of the present application. For example, the volume adjustment method provided by the embodiment of the present application may be performed by other devices that are different from the volume adjustment device 102 and that are capable of communicating with the audio acquisition device 101 and the database 103, in addition to the volume adjustment device 102. Those of ordinary skill in the art will appreciate that the number of audio acquisition devices 101, volume adjustment devices 102, and databases 103 in fig. 1 are merely illustrative. Any number of devices may be configured as desired for a service implementation. Moreover, with the evolution of the system architecture and the appearance of new service scenes, the volume adjustment method provided by the embodiment of the application is also applicable to similar technical problems.

It should be noted that, in the present disclosure, the collection and processing of related data (e.g., the original audio signal, etc.) should be strictly based on the requirements of related laws and regulations during the actual application, so as to obtain the informed consent or independent consent of the personal information body, and develop the subsequent data use and processing behavior within the authorized range of the laws and regulations and the personal information body.

Referring to fig. 2, fig. 2 is a flow chart of a volume adjustment method according to an embodiment of the application. The volume adjustment method may be implemented by the volume adjustment device 102 described above, or may be implemented by another device. The flow of the volume adjustment method provided in the embodiment of the application includes but is not limited to:

and S201, when the set volume value reaches a first upper limit volume value, if a volume increasing instruction is detected, determining a target volume value according to the volume increasing instruction, wherein the target volume value is larger than the first upper limit volume value and smaller than or equal to a second upper limit volume value, and the second upper limit volume value is larger than the first upper limit volume value.

In the embodiment of the application, the set volume value can be the volume value input by a user, and the first upper limit volume value can be the upper limit value of the audio volume when the volume is adjusted by adopting the existing volume adjustment method (such as a digital adjustment method or an analog adjustment method); the second upper limit volume value may be an upper limit value of audio volume when the volume adjustment method provided by the embodiment of the application is used for volume adjustment. When the set volume value reaches the first upper limit volume value, the volume increase of the audio cannot be further realized by adopting the existing volume adjustment method, if the volume increase instruction is still detected, the target volume value can be determined according to the volume increase instruction, and the volume of the audio is further increased by adopting the volume adjustment method provided by the embodiment of the application. For example: the method comprises the steps of adjusting audio volume by adopting an existing method, wherein a first upper limit volume value is 100; when the set volume value reaches the first upper limit volume value, if the volume increasing instruction is detected, the volume of the audio signal can be further increased by adopting the volume adjusting method provided by the embodiment of the application, the second upper limit volume value can be 120, and the target volume value is determined according to the volume increasing instruction, and at this time, the value range of the target volume value can be [100,120]. The volume adjusting method provided by the embodiment of the application can further expand the volume of the audio on the basis of the limit volume of the existing audio, so that the audio signal can be clear in a noise environment and can be recognized by human ears.

S202, carrying out category analysis processing on the original audio signal to obtain audio category information, and determining adjustment reference data according to the audio category information.

In the embodiment of the application, the category analysis processing can be performed on the original audio information to obtain the audio category information of the original audio information, for example: the audio category information may be a voice category, a music category, a noise category, etc., and the audio category information may also be a vocal category, a tubular category, a string category, a percussion category, etc. Adjustment reference data may be determined from the audio category information, which may be used to indicate a volume adjustment to the original audio signal. The adjustment reference data corresponding to different categories of original audio signals may be different. The method provided by the embodiment of the application can determine the adjustment reference data according to the category information of the original audio signals, thereby realizing the targeted volume adjustment of the audio signals of different categories.

S203, converting the original audio signal from a time domain to a frequency domain to obtain a frequency domain signal corresponding to the original audio signal.

In an embodiment of the present application, the original audio signal may be an analog signal that is continuous in both time and amplitude, for example: the audio signal corresponding to the audio of the video may be an audio signal of a piece of music or an audio signal when a call is made. The time and frequency domains are the fundamental properties of a signal, and are the angles of two different analysis signals. Time domain refers to a relationship that describes a physical signal (e.g., an audio signal) versus time. For example: the time domain waveform of an audio signal may be expressed as the audio signal changing over time. The frequency domain is a coordinate system used in describing the frequency-wise characteristics of a signal. For an audio signal, the law of change of signal strength with time is its time domain characteristic, and the audio signal is synthesized by signals of which single frequencies is its frequency domain characteristic. The method for adjusting the volume of the audio mainly adopts a data adjustment or analog adjustment method to linearly adjust the audio signal in the time domain so as to realize the volume change of the audio. This approach may result in the adjusted audio (the volume of the audio is amplified to some extent) having a truncated sound and an acoustically broken sound, and the volume extension of the audio is limited by the power of the device. In the embodiment of the application, the original audio signal can be converted from the time domain to the frequency domain to obtain the frequency domain signal corresponding to the original audio signal, which is equivalent to converting a time domain audio signal with time-varying signal strength into a single frequency (i.e. the frequency domain signal corresponding to the audio signal) forming the audio signal. By the method provided by the embodiment of the application, the original audio signal can be converted from the time domain to the frequency domain to obtain the frequency domain signal corresponding to the original audio signal, so that the energy adjustment of the audio signal from the frequency domain is realized, and the situation of sound truncated top is avoided while the increase of the audio volume is realized.

And S204, performing gain calculation according to the auditory perception weighted data, the target volume value, the adjustment reference data and the frequency domain signal to obtain gain data.

In the embodiment of the application, the auditory perception weighting data can be used for describing the sensitivity degree of human ears to sounds with different frequencies. The frequency domain signals corresponding to the original audio signals can comprise a plurality of different frequencies, and some of the frequencies in the human ears are very sensitive and are not very sensitive to other frequencies, so that the frequencies sensitive to the human ears can be pertinently enhanced, and the frequencies not sensitive to the human ears are properly weakened, so that the volume of the whole finally obtained audio signals is improved for the human ears. The target volume value may be an integer greater than or equal to 0 that is randomly set, and is used to control the degree of volume adjustment. Gain calculation can be performed according to auditory sense weighting data, target volume values, adjustment reference data and frequency domain signals to obtain gain data, and the gain data can be used for adjusting frequencies in the frequency domain signals. The method provided by the embodiment of the application can accurately determine the gain data, is beneficial to carrying out gain processing on the frequency domain signal according to the gain data, and realizes the volume adjustment of the audio signal.

S205, performing gain processing on the frequency domain signal by using the gain data to obtain a gain frequency domain signal, and converting the gain frequency domain signal from a frequency domain to a time domain to obtain a gain audio signal, wherein the perceived volume corresponding to the gain audio signal is matched with the target volume value.

In the embodiment of the application, the gain data may include a plurality of gain coefficients, the frequency domain signal may include a plurality of different frequencies, and the different frequencies may correspond to different gain coefficients in the gain data. Some gain coefficients may increase the corresponding frequencies and some frequencies may decrease the corresponding frequencies. The gain data can be used for carrying out gain processing on the frequency domain signal, so that the frequency sensitive to human ears in the frequency domain signal is gain, and the frequency insensitive to human ears is attenuated, and the gain frequency domain signal is obtained. After determining the gain frequency domain signal, the gain frequency domain signal may be converted from the frequency domain to the time domain to obtain a gain audio signal. The perceived volume corresponding to the gain audio signal is matched with the target volume value, namely, the volume of the gain audio signal can be matched with the target volume value by adopting the volume adjusting method provided by the application. For example: when the existing volume adjustment method is adopted to adjust the volume of the audio signal, the upper limit volume value is set to be 100, and when the volume adjustment method provided by the embodiment of the application is adopted to adjust the volume of the audio signal, the gain audio signal with larger perceived volume can be obtained, and the upper limit value of the audio volume can be 120. The method provided by the embodiment of the application can pointedly enhance the frequency sensitive to the human ear and weaken the frequency insensitive to the human ear, thereby increasing the audio volume perceived by the human ear under the condition of excessively amplifying the whole energy of the audio signal, meeting the requirement of a user for expanding the volume of corresponding equipment, further increasing the volume of the audio signal in a noise environment and ensuring the identifiability of the audio signal.

Based on the embodiment, the application has the following beneficial effects: according to the volume adjustment method provided by the embodiment of the application, when the set volume value reaches the first upper limit volume value and the volume increasing command is provided, the volume of the original audio signal can be adjusted, so that the volume is further increased, and the volume requirement of a user is met; the reference data can be determined and adjusted according to the category information of the original audio signals, so that volume adjustment of the original audio signals of different categories is realized, and universality and pertinence are better; the original audio signal can be converted into a bit frequency domain signal, so that the energy adjustment of the signal from the frequency domain is realized, and the occurrence of sound truncated is avoided while the volume is increased; the frequency sensitive to the human ear can be pertinently enhanced, and the frequency insensitive to the human ear is weakened, so that the perceived volume of the audio signal is improved under the condition that the whole energy of the signal is not required to be excessively amplified; the volume of the audio signal can be increased in a noise environment, the intelligibility of the audio signal is ensured, and the resources consumed for increasing the volume are effectively reduced.

Referring to fig. 3, fig. 3 is a flow chart of another volume adjustment method according to an embodiment of the application. The volume adjustment method may be implemented by the volume adjustment device 102 described above, or may be implemented by another device. The flow of the volume adjustment method provided in the embodiment of the application includes but is not limited to:

And S301, when the set volume value reaches a first upper limit volume value, if a volume increasing instruction is detected, determining a target volume value according to the volume increasing instruction, wherein the target volume value is larger than the first upper limit volume value and smaller than or equal to a second upper limit volume value, and the second upper limit volume value is larger than the first upper limit volume value.

In the embodiment of the application, the set volume value can be a volume value input by a user, the first upper limit volume value can be an upper limit value of audio volume when the volume is adjusted by adopting the existing volume adjustment method, and the second upper limit volume value can be an upper limit value of audio volume when the volume is adjusted by adopting the volume adjustment method provided by the embodiment of the application. When the set volume value reaches the first upper limit volume value, the volume increase of the audio cannot be further realized by adopting the existing volume adjustment method, if the volume increase instruction is still detected, the target volume value can be determined according to the volume increase instruction, and the volume of the audio is further increased by adopting the volume adjustment method provided by the embodiment of the application. For example: the method comprises the steps of adjusting audio volume by adopting an existing method, wherein a first upper limit volume value is 100; when the set volume value reaches the first upper limit volume value, if the volume increasing instruction is detected, the volume adjusting method provided by the embodiment of the application can be adopted to further increase the volume of the audio signal, the second upper limit volume value can be 120, the target volume value is determined according to the volume increasing instruction, and the range of the target volume value can be [100,120]. Fig. 4 is a schematic diagram of a volume adjustment method according to an embodiment of the application. The "volume" in fig. 4 indicates that the interface is currently a volume adjustment interface, and when the volume value set by the user is less than 100, the volume of the audio can be adjusted by adopting the existing digital adjustment or analog adjustment method, so that the volume of the audio is matched with the set volume value; when the sound volume value set by the user is greater than or equal to 100 (the sound volume value set in fig. 4 is 110) in a noise environment, the sound volume adjustment method provided by the embodiment of the application can be used for adjusting the sound volume of the audio signal, so that the perceived sound volume of the adjusted audio signal is matched with the set sound volume value, and further amplification of the sound volume of the audio is realized. The volume adjusting method provided by the embodiment of the application can further expand the volume of the audio on the basis of the limit volume of the existing audio, so that the audio signal can be clear in a noise environment and can be recognized by human ears.

S302, carrying out category analysis processing on the original audio signal to obtain audio category information, and determining adjustment reference data according to the audio category information.

In the embodiment of the application, the category analysis processing can be performed on the original audio information to obtain the audio category information of the original audio information, for example: the audio category information may be a voice category, a music category, a noise category, etc., and the audio category information may also be a vocal category, a tubular category, a string category, a percussion category, etc. The adjustment reference data may be determined according to the audio category information, and the adjustment reference data may be used to indicate volume adjustment of the original audio signal, so that a frequency domain signal sensitive to human ears in the frequency domain signal corresponding to the original audio signal is enhanced, and a frequency domain signal insensitive to human ears is weakened. The adjustment reference data corresponding to different categories of original audio signals may be different. For example: if the audio class information of the original audio signal is voice, the original audio signal is indicated to contain human voice, and as the frequency corresponding to the human voice is generally within 100 to 1000 hertz, the adjustment reference data with smaller value can be determined according to the audio class information; if the category information of the original audio signal is a percussion music, it is indicated that the original audio signal contains a percussion music, and since the frequency corresponding to the percussion music (for example, a drum) is generally about 2500 hz, the adjustment reference data with a larger value can be determined according to the audio category information. The method provided by the embodiment of the application can determine the adjustment reference data according to the category information of the original audio signals, thereby realizing the targeted volume adjustment of the audio signals of different categories.

In an embodiment, the implementation manner of performing the category analysis processing on the original audio signal to obtain the audio category information may be: acquiring a reference audio signal corresponding to an original audio signal, wherein the time corresponding to the reference audio signal is earlier than the time corresponding to the original audio signal; and performing feature analysis processing according to the reference audio signal and the original audio signal to obtain audio class information corresponding to the original audio signal, wherein the audio class information is used for indicating a sounding object corresponding to the original audio signal. A reference audio signal corresponding to the original audio signal may be acquired, the reference audio signal corresponding to a time earlier than the original audio signal corresponding to a time, for example: the reference audio signal and the original audio signal belong to the same piece of music, the reference audio signal can be an audio signal corresponding to audio in the music from 0 th second to 10 th second, and the original audio signal can be an audio signal corresponding to audio in the music from 10 th second to 11 th second, namely, the time corresponding to the reference audio signal (from 0 second to 10 seconds) is earlier than the time corresponding to the original audio signal (from 10 seconds to 11 seconds). The characteristic analysis processing can be performed according to the reference audio signal and the original audio signal by using a method such as a neural network model, so as to obtain audio class information corresponding to the original audio signal, where the audio class information can be used to indicate a sound object corresponding to the original audio signal, for example: the audio category information of the original audio signal may be a percussion instrument, i.e. the sound object corresponding to the original audio signal is a percussion instrument. The method provided by the embodiment of the application can accurately determine the audio category information of the original audio signal, and is convenient for determining and adjusting the reference data according to the audio category information, thereby realizing the volume adjustment processing of the audio signals of different categories.

S303, converting the original audio signal from a time domain to a frequency domain to obtain a frequency domain signal corresponding to the original audio signal.

In an embodiment of the present application, the original audio signal may be an analog signal that is continuous in both time and amplitude, for example: the audio signal corresponding to the audio of the video may be an audio signal of a piece of music or an audio signal when a call is made. The time and frequency domains are the fundamental properties of a signal, and are the angles of two different analysis signals. Time domain refers to a relationship that describes a physical signal (e.g., an audio signal) versus time. For example: the time domain waveform of an audio signal may be expressed as the audio signal changing over time. The frequency domain is a coordinate system used in describing the frequency-wise characteristics of a signal. For an audio signal, the law of change of signal strength with time is its time domain characteristic, and the audio signal is synthesized by signals of which single frequencies is its frequency domain characteristic. The method for adjusting the volume of the audio mainly adopts a data adjustment or analog adjustment method to linearly adjust the audio signal on the time domain so as to realize the volume increase of the audio. This approach may result in sound truncated and audibly broken sound in the adjusted audio and the volume expansion of the audio is limited by the power of the device. In the embodiment of the application, the original audio signal can be converted from the time domain to the frequency domain to obtain the frequency domain signal corresponding to the original audio signal, which is equivalent to converting a time domain audio signal with time-varying signal strength into a single frequency (namely, the frequency domain signal corresponding to the audio signal) forming the audio signal. The method provided by the application can convert the original audio signal from the time domain to the frequency domain to obtain the frequency domain signal corresponding to the original audio signal, thereby realizing the energy adjustment of the audio signal from the frequency domain, and avoiding the situation of sound truncated while realizing the increase of the audio volume.

In an embodiment, the method of converting the original audio signal from the time domain to the frequency domain may be implemented by performing fourier transform on the original audio signal. Fourier transform is a method of analyzing signals, which can analyze the components of the signals, and can use these components to synthesize the signals. When analyzing signals, the method is mainly applied to processing stationary signals, and components of which frequencies are generally contained in a section of signals can be obtained through Fourier transformation, but the occurrence time of each component cannot be known. The method provided by the embodiment of the application can rapidly convert the original audio signal from the time domain to the frequency domain, and effectively improves the volume adjustment efficiency of the audio signal.

S304, determining relative auditory perception weighting data according to the auditory perception weighting data and the adjustment reference data, wherein the auditory perception weighting data comprises a corresponding relation between signal frequencies and auditory perception weighting coefficients, the relative auditory perception weighting coefficient corresponding to a first signal frequency in the relative auditory perception weighting data is smaller than a set value, the relative auditory perception weighting coefficient corresponding to a second signal frequency is larger than or equal to the set value, and the second signal frequency is larger than the first signal frequency.

In the embodiment of the application, the auditory perception weighting data can be used for describing the sensitivity degree of human ears to sounds with different frequencies. The frequency domain signals corresponding to the original audio signals can comprise a plurality of different frequencies, and some of the frequencies in the human ears are very sensitive and are not very sensitive to other frequencies, so that the frequencies sensitive to the human ears can be pertinently enhanced, and the frequencies not sensitive to the human ears are properly weakened, so that the whole perception volume of the finally obtained audio signals is improved for the human ears. The auditory perception weighting data includes correspondence of signal frequencies and auditory perception weighting coefficients, for example: when the signal frequency is 4000 hertz, the corresponding auditory perception weighting coefficient can be 2.6; when the signal frequency is 2000 hz, the corresponding auditory perception weighting coefficient may be 1.2. The adjustment reference data is used to indicate the degree of volume adjustment for the original audio signal. Determining relative auditory sense weighting data based on the auditory sense weighting data and the adjustment reference data; the relative auditory perception weighting data comprises a plurality of relative auditory perception weighting coefficients, the relative auditory perception weighting coefficient corresponding to a first signal frequency in the relative auditory perception weighting data is smaller than a set value, the relative auditory perception weighting coefficient corresponding to a second signal frequency is larger than or equal to the set value, and the second signal frequency is larger than the first signal frequency. The relative auditory perception weighting data is used to achieve gain or attenuation of signals of different frequencies. For example: the setting data may be 1, the relative auditory perception weighting coefficient corresponding to the first signal frequency in the relative auditory perception weighting data may be 0.5, the relative auditory perception weighting coefficient corresponding to the second signal frequency in the relative auditory perception weighting data may be 1.1, the relative auditory perception weighting coefficient corresponding to the first signal frequency may attenuate the first signal frequency, and the relative auditory perception weighting coefficient corresponding to the second signal frequency may gain the second signal frequency. The method provided by the embodiment of the application can accurately determine the relative auditory perception weighted data, is beneficial to determining the gain data according to the relative auditory perception weighted data, and realizes the volume increasing processing of the original audio signal.

In one embodiment, auditory perception weighting data may be determined from the acoustic isotone graphs, thereby enabling quantization of auditory perception. The primary basis for auditory perception is the loudness of the audio, which varies with the intensity and frequency of the sound, and the same intensity but different frequencies of the audio have different auditory perceptions. Referring to fig. 5, an acoustic equal-loudness curve according to an embodiment of the present application is shown. In fig. 5, the abscissa indicates the signal frequency (in hertz (Hz)), and the signal frequency in fig. 5 has a value ranging from 20 Hz to 2 khz; the ordinate is sound pressure level (in decibels (dB SPL)), and the sound pressure level in fig. 5 ranges from-10 dB to 130 dB. The curve in fig. 5 is an equal-loudness curve, which is a curve describing the relationship between sound pressure level and signal frequency under equal-loudness conditions, i.e. which sound pressure level needs to be reached to obtain a consistent auditory loudness (or auditory perception) for a listener for audio signals of different frequencies. In fig. 5, 6 equal-loudness curves are included, from top to bottom: an equal loudness curve of 100 auditory loudness (labeled 100phon for the equal loudness curve in fig. 5), an equal loudness curve of 80 auditory loudness (labeled 80 for the equal loudness curve in fig. 5), an equal loudness curve of 60 auditory loudness (labeled 60 for the equal loudness curve in fig. 5), an equal loudness curve of 40 auditory loudness (labeled 40 for the equal loudness curve in fig. 5), an equal loudness curve of 20 auditory loudness (labeled 20 for the equal loudness curve in fig. 5), and a threshold curve, i.e., an equal loudness curve of the lowest auditory loudness (labeled threshold for the equal loudness curve in fig. 5). For any one of the curves in fig. 5, it can be found that when the signal frequency is middle-low frequency (below 1 kHz), the lower the signal frequency, the higher the sound pressure level (i.e., energy) required for achieving equal sound, i.e., the greater the sound energy is required to make the human ear have the same auditory sensation; when the signal frequency is medium-high frequency (above 1 kHz), the signal frequencies of different frequency bands have different acoustic perception characteristics. In some cases, the signal frequency of the human voice audio signal is generally concentrated at a medium-low frequency (e.g., the human voice signal frequency is 1500Hz or less), and as can be seen from fig. 5 described above, the human ear perception is less sensitive to signals having a frequency below a low frequency (500 Hz). Compared with the frequency band sensitive to the human ear of medium and high frequencies (for example, 3-4 khz), the signal of the low frequency part needs to be many times higher in physical absolute energy than the signal of the medium and high frequencies to achieve a near perception effect in the sense of hearing. Therefore, the input frequency domain signal can be regulated, the frequency domain signal insensitive to auditory perception is subjected to attenuation treatment, and the frequency domain signal sensitive to auditory perception is subjected to enhancement treatment, so that the human ear has stronger auditory perception on the audio corresponding to the regulated whole frequency domain signal, and the increase of the audio volume is realized. In some cases, psycho-acoustic equal loudness curve data based on the BS3383 standard (i.e., the BS3383 normal equal loudness level contour specification for pure tones under automatic sound field listening conditions, BS3383 Specification for normal equal-loudness level contours for pure tones under free-field listening conditions) may be used to calculate the auditory perception weighting data. Specific calculation methods can be represented by the following formulas (1), (2), (3) and (4):

（1）

（2）

（3）

（4）

Wherein k is an input frequency value, ff, af, bf, cf is related data in an equal loudness curve data table disclosed in the BS3383 standard, and the loudness value loud corresponding to the target frequency point k can be obtained by interpolating the existing equal loudness curve data by using a linear interpolation method by adopting the methods shown in the above formulas (1), (2), (3) and (4). After the loudness value is calculated by the above equation (1), equation (2), equation (3) and equation (4), the auditory perception weighting data may be determined according to the following equation (5):

（5）

in equation (5), cof (k) represents an auditory perception weighting coefficient corresponding to frequency k, and loud represents a loudness value corresponding to frequency k. Auditory perception weighting coefficients corresponding to the respective frequencies can be determined by the method described in equation (5). Referring to fig. 6, a schematic diagram of auditory perception weighting data according to an embodiment of the present application is shown. The auditory sense weighting data includes a correspondence of signal frequencies to auditory sense weighting coefficients. In fig. 6, the abscissa represents the signal frequency (in hertz (Hz)), the range of the signal frequency is 0Hz to 8000 Hz, the ordinate represents the auditory perception weighting coefficient, and the range of the auditory perception weighting coefficient is 0 to 3. The curves in fig. 6 show the values of the auditory perception weighting coefficients corresponding to different signal frequencies. As can be seen from fig. 6, different frequencies correspond to different auditory perception weighting coefficients, and high frequency (greater than 2500Hz and less than 4500 Hz) signals correspond to auditory perception weighting coefficients greater than 2. The method provided by the embodiment of the application can accurately determine the auditory perception weighting coefficient, and is convenient for carrying out targeted gain adjustment on the frequency domain signal corresponding to the original audio signal according to the auditory perception weighting coefficient, thereby realizing accurate adjustment of volume.

In an embodiment, the implementation manner of determining the relative auditory perception weighting data according to the auditory perception weighting data and the adjustment reference data may be: acquiring a plurality of auditory sense weighting coefficients in auditory sense weighting data, and dividing and calculating any auditory sense weighting coefficient with adjustment reference data according to any auditory sense weighting coefficient in the plurality of auditory sense weighting coefficients to acquire a relative auditory sense weighting coefficient corresponding to any auditory sense weighting coefficient; after determining the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient in the plurality of auditory sense weighting coefficients, determining relative auditory sense weighting data according to the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient. The adjustment reference data determined according to the original audio signal is used for enabling the relative auditory perception weighting coefficient corresponding to the low-frequency signal insensitive to human ears to be smaller than 1, so that the attenuation of the low-frequency signal is achieved, and the relative auditory perception weighting coefficient corresponding to the high-frequency signal sensitive to human ears is enabled to be larger than 1, so that the gain of the high-frequency signal is achieved. The method can acquire a plurality of auditory sense weighting coefficients in auditory sense weighting data, and can divide and calculate the auditory sense weighting coefficient and adjustment reference data for any auditory sense weighting coefficient in the plurality of auditory sense weighting coefficients to obtain a relative auditory sense weighting coefficient corresponding to the auditory sense weighting coefficient. The method for determining the relative auditory perception weighting coefficient may be as shown in the following equation (6):

（6）

In the formula (6), b0 represents adjustment reference data, cof (freq) represents an auditory perception weighting coefficient corresponding to the frequency freq,representing the relative auditory perception weighting coefficient corresponding to the frequency freq. After determining the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient in the plurality of auditory sense weighting coefficients, the relative auditory sense weighting data may be determined according to the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient. The method provided by the embodiment of the application can accurately determine the auditory perception weighting data and the relative auditory perception weighting data, is convenient for carrying out gain processing on the frequency domain signals according to the relative auditory perception weighting data, thereby realizing the adjustment of the perceived volume of the audio signals, simultaneously ensuring the quality of the audio signals and avoiding the situation of sound truncated of the adjusted audio signals.

S305, obtaining frequency information corresponding to the frequency domain signal, wherein the frequency information comprises a plurality of frequencies, obtaining a relative auditory perception weighting coefficient corresponding to any frequency from the relative auditory perception weighting data according to any frequency, and determining a gain coefficient corresponding to any frequency according to the relative auditory perception weighting coefficient corresponding to any frequency and a target volume value.

In the embodiment of the application, the frequency information corresponding to the frequency domain signal can be acquired, and the frequency information can comprise a plurality of frequencies. For any one of a plurality of frequencies, a relative auditory perception weighting coefficient corresponding to the frequency may be obtained from the relative auditory perception weighting data, for example: as can be seen from fig. 6, the frequency information includes a frequency of 3000Hz, and the auditory sense weighting coefficient corresponding to the signal frequency is 2.2, and if the adjustment reference data is 2.0, the relative auditory sense weighting data can be calculated, and the relative auditory sense weighting coefficient corresponding to the frequency (3000 Hz) is determined to be 1.1 from the relative auditory sense weighting data. The gain factor for the frequency may be determined based on the relative auditory perception weighting data and the target volume value for the frequency. The method provided by the embodiment of the application can determine the relative auditory perception weight coefficient corresponding to each frequency in the frequency domain signal, and is convenient for determining the gain coefficient of the frequency according to the relative auditory perception weight coefficient, thereby realizing the gain processing of the frequency domain signal.

In an embodiment, according to the relative auditory perception weighting coefficient and the target volume value corresponding to any frequency, the implementation manner of determining the gain coefficient corresponding to any frequency may be: converting the target volume value to obtain volume control data; and performing power operation by taking the relative auditory perception weighting coefficient corresponding to any frequency as a base and the volume control data as an exponent to obtain a gain coefficient corresponding to any frequency. The target volume value can be an expected volume value input by a user, the target volume value can be converted to obtain volume control data, any frequency corresponding to the relative auditory perception weighting coefficient in the frequency information is used as a base, the volume control data is used as an exponent to carry out power operation, and the gain coefficient corresponding to the frequency is obtained. The implementation manner of determining the gain coefficient corresponding to a certain frequency can be as shown in the following formula (7):

（7）

In the formula (7), q represents a target volume value, func (q) represents volume control data corresponding to the target volume value,representing the relative auditory sense weighting coefficient corresponding to the frequency k, power (a, b) representing exponentiation with a as the base and b as the exponent, gain (q, k)The gain factor corresponding to frequency k is indicated. By the method provided by the embodiment of the application, the gain coefficient related to the target volume value and the frequency can be determined, so that the gain processing of the corresponding frequency domain signal can be realized according to the gain coefficient, further the enhancement of the frequency domain signal sensitive to human ears is realized, and the reduction of the frequency domain signal insensitive to human ears is realized.

It should be noted that, the volume adjustment method provided in the embodiment of the present application may be used in combination with other volume adjustment methods, or may be used alone. For example: the volume adjustment method provided by the embodiment of the application can be combined with the existing volume adjustment method, and at the moment, the value range of the set volume value can be [0,120], when the set volume value input by a user is smaller than 100, the volume of the audio signal can be adjusted by using the existing volume adjustment method, and when the set volume value input by the user is larger than 100, the volume of the audio signal can be adjusted by using the volume adjustment method provided by the embodiment of the application, so that the perceived volume of the audio signal can be larger, and the situation of sound breaking feeling in hearing can not occur. Also for example: the volume adjustment method provided by the embodiment of the application can be independently used, and the value range of the set volume value can be [0,100], when the set volume value is input by a user, the volume of the audio can be adjusted according to the set volume value by adopting the volume adjustment method provided by the embodiment of the application, so that the adjusted audio has better and clearer hearing.

S306, after determining the gain coefficient corresponding to each frequency in the plurality of frequencies, determining gain data according to the gain coefficient corresponding to each frequency.

In the embodiment of the application, the frequency information corresponding to the frequency domain signal comprises a plurality of frequencies, and after the gain coefficient corresponding to each frequency in the plurality of frequencies is determined, the gain data can be determined according to the gain coefficient corresponding to each frequency. By the method provided by the embodiment of the application, the gain data corresponding to the frequency domain signal can be determined, and the subsequent gain processing of the frequency domain signal can be conveniently realized according to the gain data.

S307, performing gain processing on the frequency domain signal by using the gain data to obtain a gain frequency domain signal.

In the embodiment of the application, the gain data may include a plurality of gain coefficients, the frequency domain signal may include a plurality of different frequencies, and the different frequencies may correspond to different gain coefficients in the gain data. Some gain coefficients may increase the corresponding frequencies and some frequencies may decrease the corresponding frequencies. The gain data can be used for carrying out gain processing on the frequency domain signal, so that the frequency sensitive to human ears in the frequency domain signal is gain, and the frequency insensitive to human ears is attenuated, and the gain frequency domain signal is obtained. By the method provided by the embodiment of the application, the frequency sensitive to the human ear can be pertinently enhanced, and the frequency insensitive to the human ear can be reduced, so that the audio volume perceived by the human ear can be increased under the condition that the whole energy of the audio signal is excessively amplified, and the requirement of a user for expanding the volume of corresponding equipment can be met.

In an embodiment, for any one of a plurality of frequencies, a signal value corresponding to the any one frequency is obtained from the frequency domain signal; weighting the signal value corresponding to any frequency by using a gain coefficient corresponding to any frequency in the gain data to obtain a signal value corresponding to any frequency after gain processing; after determining the gain-processed signal values corresponding to each of the plurality of frequencies, determining a gain frequency domain signal from the gain-processed signal values corresponding to each of the frequencies. The frequency information corresponding to the frequency domain signal comprises a plurality of frequencies, and for any frequency in the plurality of frequencies, a signal value corresponding to the frequency can be obtained from the frequency domain signal, and the signal value corresponding to the frequency is weighted (for example, multiplied) according to a gain coefficient corresponding to the frequency in the gain data, so as to obtain a signal value after the gain processing corresponding to the frequency; after determining the gain-processed signal values corresponding to each of the plurality of frequencies, determining a gain frequency domain signal from the gain-processed signal values corresponding to each of the frequencies. In the embodiment of the application, gain coefficients corresponding to different frequencies are different, and signal values after gain processing corresponding to the frequencies are also different: for the frequency sensitive to the human ear, the gain coefficient corresponding to the frequency is larger than 1, the signal value corresponding to the frequency is weighted according to the gain coefficient corresponding to the frequency, the signal value after gain processing corresponding to the frequency is obtained, and the signal value after gain processing is larger than the original signal value; for the frequency insensitive to human ears, the gain coefficient corresponding to the frequency is smaller than 1, and the signal value corresponding to the frequency is obtained after the weighting processing is carried out on the signal value corresponding to the frequency according to the gain coefficient corresponding to the frequency, wherein the signal value after the gain processing is smaller than the original signal value. The method provided by the embodiment of the application can accurately determine the gain frequency domain signal, realize the gain processing of the frequency domain signal sensitive to human ears, and cut down the processing of the frequency domain signal insensitive to human ears, thereby realizing the overall gain processing of the gain frequency domain signal.

S308, converting the gain frequency domain signal from a frequency domain to a time domain to obtain a gain audio signal, wherein the perceived volume corresponding to the gain audio signal is matched with the target volume value.

In the embodiment of the application, the gain frequency domain signal can be converted from the frequency domain to the time domain to obtain the gain audio signal. The perceived volume corresponding to the gain audio signal is matched with the target volume value, namely, the volume of the gain audio signal can be matched with the target volume value by adopting the volume adjusting method provided by the application. For example: when the existing volume adjustment method is adopted to adjust the volume of the audio signal, the upper limit volume value is set to be 100, and when the volume adjustment method provided by the embodiment of the application is adopted to adjust the volume of the audio signal, the gain audio signal with larger perceived volume can be obtained, and the upper limit value of the audio volume can be 120. The method provided by the embodiment of the application can further improve the perceived volume of the audio signal, so that the volume of the audio signal in a noise environment can be further increased, and the identification degree of the audio signal is ensured.

In an embodiment, the implementation method for converting the gain frequency domain signal from the frequency domain to the time domain may be an inverse fourier transform. The inverse fourier transform is also called inverse fourier transform, and the calculation principle is to take the conjugate complex number from the frequency domain data and then perform fourier transform, so that the frequency domain signal is converted into the time domain.

It should be noted that, the embodiment of the present application mainly illustrates a volume increasing scene in the volume adjustment of the original audio signal, and the method provided by the present application is applicable to a volume decreasing scene in the volume adjustment of the original audio signal.

Based on the embodiment, the application has the following beneficial effects: according to the volume adjustment method provided by the embodiment of the application, when the set volume value reaches the first upper limit volume value and the volume increasing command is provided, the volume of the original audio signal can be adjusted, so that the volume is further increased, and the volume requirement of a user is met; the reference data can be determined and adjusted according to the category information of the original audio signals, so that volume adjustment of the original audio signals of different categories is realized, and universality and pertinence are better; the original audio signal can be converted into a bit frequency domain signal, so that the energy adjustment of the signal from the frequency domain is realized, and the occurrence of the condition of sound truncated is avoided while the volume adjustment is realized; the relative auditory perception weighting coefficients corresponding to different frequencies can be determined, so that the intensity of auditory perception is quantized, frequency domain signals sensitive to human ears can be enhanced in a targeted manner, frequency domain signals insensitive to human ears are weakened, the perceived volume of an audio signal is improved under the condition that the whole energy of the signals is not required to be amplified too much, and the processing efficiency of volume adjustment is also improved; the method can realize the volume increase of the audio signal in a noise environment, ensure the intelligibility of the audio signal, and effectively reduce the resources consumed by the volume increase.

Referring to fig. 7, fig. 7 is a block diagram illustrating a volume adjusting device according to an embodiment of the application. The device comprises:

a determining unit 701, configured to determine, when the set volume value reaches a first upper limit volume value, a target volume value according to the volume increase instruction if the volume increase instruction is detected, where the target volume value is greater than the first upper limit volume value and less than or equal to a second upper limit volume value, and the second upper limit volume value is greater than the first upper limit volume value;

the processing unit 702 is configured to perform category analysis processing on an original audio signal, obtain audio category information, and determine adjustment reference data according to the audio category information;

the processing unit 702 is further configured to convert the original audio signal from a time domain to a frequency domain, so as to obtain a frequency domain signal corresponding to the original audio signal;

the processing unit 702 is further configured to perform gain calculation according to auditory perception weighted data, the target volume value, the adjustment reference data, and the frequency domain signal, so as to obtain gain data;

and a conversion unit 703, configured to perform gain processing on the frequency domain signal by using the gain data to obtain a gain frequency domain signal, and convert the gain frequency domain signal from a frequency domain to a time domain to obtain a gain audio signal, where a perceived volume corresponding to the gain audio signal is matched with the target volume value.

In one embodiment, the processing unit 702 is specifically configured to, when performing gain calculation according to the auditory sense weighted data, the target volume value, the adjustment reference data, and the frequency domain signal to obtain gain data: determining relative auditory perception weighting data according to the auditory perception weighting data and the adjustment reference data, wherein the auditory perception weighting data comprises a corresponding relation between signal frequency and auditory perception weighting coefficients, the relative auditory perception weighting coefficient corresponding to a first signal frequency in the relative auditory perception weighting data is smaller than a set value, the relative auditory perception weighting coefficient corresponding to a second signal frequency is larger than or equal to the set value, and the second signal frequency is larger than the first signal frequency; and performing gain calculation according to the relative auditory perception weighted data, the target volume value and the frequency domain signal to obtain gain data.

In one embodiment, the processing unit 702 is specifically configured to, when performing gain calculation according to the relative auditory sense weighted data, the target volume value, and the frequency domain signal to obtain gain data: acquiring frequency information corresponding to the frequency domain signal, wherein the frequency information comprises a plurality of frequencies; for any frequency of the plurality of frequencies, acquiring a relative auditory perception weighting coefficient corresponding to the any frequency from the relative auditory perception weighting data, and determining a gain coefficient corresponding to the any frequency according to the relative auditory perception weighting coefficient corresponding to the any frequency and the target volume value; after determining the gain coefficient corresponding to each frequency in the plurality of frequencies, determining gain data according to the gain coefficient corresponding to each frequency.

In an embodiment, the processing unit 702 is specifically configured to, when performing gain processing on the frequency domain signal using the gain data to obtain a gain frequency domain signal: for any frequency of the plurality of frequencies, acquiring a signal value corresponding to the any frequency from the frequency domain signal; weighting the signal value corresponding to any frequency by using the gain coefficient corresponding to any frequency in the gain data to obtain a signal value after gain processing corresponding to any frequency; after determining the gain-processed signal values corresponding to each of the plurality of frequencies, determining a gain frequency domain signal according to the gain-processed signal values corresponding to each of the frequencies.

In an embodiment, the processing unit 702 is specifically configured to, when determining the gain coefficient corresponding to the arbitrary frequency according to the relative auditory perception weighting coefficient corresponding to the arbitrary frequency and the target volume value: converting the target volume value to obtain volume control data; and performing power operation by taking the relative auditory perception weighting coefficient corresponding to any frequency as a base number and taking the volume control data as an exponent to obtain a gain coefficient corresponding to any frequency.

In one embodiment, the processing unit 702 is specifically configured to, when determining the relative auditory sense weighting data according to the auditory sense weighting data and the adjustment reference data: acquiring a plurality of auditory perception weighting coefficients in the auditory perception weighting data, and dividing and calculating any auditory perception weighting coefficient with the adjustment reference data according to any auditory perception weighting coefficient in the auditory perception weighting coefficients to obtain a relative auditory perception weighting coefficient corresponding to the any auditory perception weighting coefficient; after determining the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient in the plurality of auditory sense weighting coefficients, determining relative auditory sense weighting data according to the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient.

In an embodiment, the processing unit 702 is specifically configured to, when performing a class analysis process on an original audio signal to obtain audio class information: acquiring a reference audio signal corresponding to the original audio signal, wherein the time corresponding to the reference audio signal is earlier than the time corresponding to the original audio signal; and performing feature analysis processing according to the reference audio signal and the original audio signal to obtain audio class information corresponding to the original audio signal, wherein the audio class information is used for indicating a sounding object corresponding to the original audio signal.

It may be understood that the functions of each functional unit of the volume adjusting device according to the embodiment of the present application may be specifically implemented according to the volume adjusting method in the embodiment of the method, and the specific implementation process may refer to the related description in the embodiment of the volume adjusting method, which is not repeated herein.

Referring to fig. 8, fig. 8 is a block diagram of a computer device according to an embodiment of the present application. The computer device described in the embodiment of the application comprises: a processor 801, a communication interface 802, and a memory 803. The processor 801, the communication interface 802, and the memory 803 may be connected by a bus or other means, for example, in the embodiment of the present application.

Among them, the processor 801 (or CPU (Central Processing Unit, central processing unit)) is a computing core and a control core of a computer device, which can parse various instructions in the computer device and process various data of the computer device, for example: the CPU can be used for analyzing a startup and shutdown instruction sent by a user to the computer equipment and controlling the computer equipment to perform startup and shutdown operation; and the following steps: the CPU may transmit various types of interaction data between internal structures of the computer device, and so on. The communication interface 802 may optionally include a standard wired interface, a wireless interface (e.g., wi-Fi, mobile communication interface, etc.), controlled by the processor 801 for transceiving data. The Memory 803 (Memory) is a Memory device in the computer device for storing programs and data. It will be appreciated that the memory 803 herein may include both built-in memory of the computer device and extended memory supported by the computer device. Memory 803 provides storage space that stores the operating system of the computer device, which may include, but is not limited to: android systems, iOS systems, windows Phone systems, etc., the application is not limited in this regard.

In an embodiment of the present application, the processor 801 performs the following operations by executing executable program code in the memory 803:

In one embodiment, the processor 801 is specifically configured to, when performing gain calculation according to the auditory sense weighted data, the target volume value, the adjustment reference data, and the frequency domain signal, obtain gain data: determining relative auditory perception weighting data according to the auditory perception weighting data and the adjustment reference data, wherein the auditory perception weighting data comprises a corresponding relation between signal frequency and auditory perception weighting coefficients, the relative auditory perception weighting coefficient corresponding to a first signal frequency in the relative auditory perception weighting data is smaller than a set value, the relative auditory perception weighting coefficient corresponding to a second signal frequency is larger than or equal to the set value, and the second signal frequency is larger than the first signal frequency; and performing gain calculation according to the relative auditory perception weighted data, the target volume value and the frequency domain signal to obtain gain data.

In one embodiment, the processor 801 is specifically configured to, when performing gain calculation according to the relative auditory sense weighted data, the target volume value, and the frequency domain signal to obtain gain data: acquiring frequency information corresponding to the frequency domain signal, wherein the frequency information comprises a plurality of frequencies; for any frequency of the plurality of frequencies, acquiring a relative auditory perception weighting coefficient corresponding to the any frequency from the relative auditory perception weighting data, and determining a gain coefficient corresponding to the any frequency according to the relative auditory perception weighting coefficient corresponding to the any frequency and the target volume value; after determining the gain coefficient corresponding to each frequency in the plurality of frequencies, determining gain data according to the gain coefficient corresponding to each frequency.

In an embodiment, the processor 801 is specifically configured to, when performing gain processing on the frequency domain signal using the gain data, obtain a gain frequency domain signal: for any frequency of the plurality of frequencies, acquiring a signal value corresponding to the any frequency from the frequency domain signal; weighting the signal value corresponding to any frequency by using the gain coefficient corresponding to any frequency in the gain data to obtain a signal value after gain processing corresponding to any frequency; after determining the gain-processed signal values corresponding to each of the plurality of frequencies, determining a gain frequency domain signal according to the gain-processed signal values corresponding to each of the frequencies.

In an embodiment, the processor 801 is specifically configured to, when determining the gain coefficient corresponding to the arbitrary frequency according to the relative auditory perception weighting coefficient corresponding to the arbitrary frequency and the target volume value: converting the target volume value to obtain volume control data; and performing power operation by taking the relative auditory perception weighting coefficient corresponding to any frequency as a base number and taking the volume control data as an exponent to obtain a gain coefficient corresponding to any frequency.

In one embodiment, the processor 801 is specifically configured to, when determining the relative auditory sense weighting data based on the auditory sense weighting data and the adjustment reference data: acquiring a plurality of auditory perception weighting coefficients in the auditory perception weighting data, and dividing and calculating any auditory perception weighting coefficient with the adjustment reference data according to any auditory perception weighting coefficient in the auditory perception weighting coefficients to obtain a relative auditory perception weighting coefficient corresponding to the any auditory perception weighting coefficient; after determining the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient in the plurality of auditory sense weighting coefficients, determining relative auditory sense weighting data according to the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient.

In one embodiment, the processor 801 is specifically configured to, when performing a class analysis process on an original audio signal to obtain audio class information: acquiring a reference audio signal corresponding to the original audio signal, wherein the time corresponding to the reference audio signal is earlier than the time corresponding to the original audio signal; and performing feature analysis processing according to the reference audio signal and the original audio signal to obtain audio class information corresponding to the original audio signal, wherein the audio class information is used for indicating a sounding object corresponding to the original audio signal.

In a specific implementation, the processor 801, the communication interface 802, and the memory 803 described in the embodiments of the present application may execute an implementation manner of the volume adjustment device described in the volume adjustment method provided in the embodiments of the present application, or may execute an implementation manner described in the volume adjustment device provided in the embodiments of the present application, which is not described herein again.

The embodiment of the application also provides a computer readable storage medium, wherein a computer program is stored in the computer readable storage medium, and when the computer program runs on a computer, the computer is caused to execute the volume adjustment method according to the embodiment of the application. The specific implementation manner may refer to the foregoing description, and will not be repeated here.

Embodiments of the present application also provide a computer program product comprising a computer program or computer instructions stored in a computer readable storage medium. A processor of a computer device reads the computer program or computer instructions from the computer-readable storage medium, and the processor executes the computer program or computer instructions to cause the computer device to perform a volume adjustment method according to an embodiment of the present application. The specific implementation manner may refer to the foregoing description, and will not be repeated here.

It should be noted that, for simplicity of description, the foregoing method embodiments are all expressed as a series of action combinations, but it should be understood by those skilled in the art that the present application is not limited by the order of action described, as some steps may be performed in other order or simultaneously according to the present application. Further, those skilled in the art will also appreciate that the embodiments described in the specification are all preferred embodiments, and that the acts and modules referred to are not necessarily required for the present application.

Those of ordinary skill in the art will appreciate that all or part of the steps in the various methods of the above embodiments may be implemented by a program to instruct related hardware, the program may be stored in a computer readable storage medium, and the storage medium may include: flash disk, read-Only Memory (ROM), random-access Memory (Random Access Memory, RAM), magnetic or optical disk, and the like.

The above disclosure is illustrative only of some embodiments of the application and is not intended to limit the scope of the application, which is defined by the claims and their equivalents.

Claims

1. A method of volume adjustment, the method comprising:

2. The method of claim 1, wherein performing gain calculations based on auditory perception weighting data, the target volume value, the adjustment reference data, and the frequency domain signal to obtain gain data comprises:

determining relative auditory perception weighting data according to the auditory perception weighting data and the adjustment reference data, wherein the auditory perception weighting data comprises a corresponding relation between signal frequency and auditory perception weighting coefficients, the relative auditory perception weighting coefficient corresponding to a first signal frequency in the relative auditory perception weighting data is smaller than a set value, the relative auditory perception weighting coefficient corresponding to a second signal frequency is larger than or equal to the set value, and the second signal frequency is larger than the first signal frequency;

And performing gain calculation according to the relative auditory perception weighted data, the target volume value and the frequency domain signal to obtain gain data.

3. The method according to claim 2, wherein said performing gain calculations based on said relative auditory sense weighting data, said target volume value, and said frequency domain signal to obtain gain data comprises:

acquiring frequency information corresponding to the frequency domain signal, wherein the frequency information comprises a plurality of frequencies;

for any frequency of the plurality of frequencies, acquiring a relative auditory perception weighting coefficient corresponding to the any frequency from the relative auditory perception weighting data, and determining a gain coefficient corresponding to the any frequency according to the relative auditory perception weighting coefficient corresponding to the any frequency and the target volume value;

after determining the gain coefficient corresponding to each frequency in the plurality of frequencies, determining gain data according to the gain coefficient corresponding to each frequency.

4. A method according to claim 3, wherein said gain processing said frequency domain signal using said gain data to obtain a gain frequency domain signal comprises:

For any frequency of the plurality of frequencies, acquiring a signal value corresponding to the any frequency from the frequency domain signal;

weighting the signal value corresponding to any frequency by using the gain coefficient corresponding to any frequency in the gain data to obtain a signal value after gain processing corresponding to any frequency;

after determining the gain-processed signal values corresponding to each of the plurality of frequencies, determining a gain frequency domain signal according to the gain-processed signal values corresponding to each of the frequencies.

5. The method according to claim 3 or 4, wherein said determining a gain factor corresponding to said arbitrary frequency from a relative auditory perception weighting factor corresponding to said arbitrary frequency and said target volume value comprises:

converting the target volume value to obtain volume control data;

and performing power operation by taking the relative auditory perception weighting coefficient corresponding to any frequency as a base number and taking the volume control data as an exponent to obtain a gain coefficient corresponding to any frequency.

6. The method according to claim 3 or 4, wherein determining relative auditory sense weighting data from the auditory sense weighting data and the adjustment reference data comprises:

Acquiring a plurality of auditory perception weighting coefficients in the auditory perception weighting data, and dividing and calculating any auditory perception weighting coefficient with the adjustment reference data according to any auditory perception weighting coefficient in the auditory perception weighting coefficients to obtain a relative auditory perception weighting coefficient corresponding to the any auditory perception weighting coefficient;

after determining the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient in the plurality of auditory sense weighting coefficients, determining relative auditory sense weighting data according to the relative auditory sense weighting coefficient corresponding to each auditory sense weighting coefficient.

7. The method according to any one of claims 1-4, wherein the performing a class analysis process on the original audio signal to obtain audio class information includes:

acquiring a reference audio signal corresponding to the original audio signal, wherein the time corresponding to the reference audio signal is earlier than the time corresponding to the original audio signal;

and performing feature analysis processing according to the reference audio signal and the original audio signal to obtain audio class information corresponding to the original audio signal, wherein the audio class information is used for indicating a sounding object corresponding to the original audio signal.

8. A volume adjustment device, the device comprising:

9. A computer device, comprising: the device comprises a processor, a communication interface and a memory, wherein the processor, the communication interface and the memory are mutually connected, the memory stores executable program codes, and the processor is used for calling the executable program codes to realize the volume adjustment method according to any one of claims 1-7.

10. A computer readable storage medium having stored therein computer instructions which, when run on a computer, cause the computer to implement the volume adjustment method of any one of claims 1-7.