CN107172255A

CN107172255A - Adaptive voice signal adjustment method, device, mobile terminal and storage medium

Info

Publication number: CN107172255A
Application number: CN201710599150.2A
Authority: CN
Inventors: 杨宗业
Original assignee: Guangdong Oppo Mobile Telecommunications Corp Ltd
Current assignee: Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date: 2017-07-21
Filing date: 2017-07-21
Publication date: 2017-09-15

Abstract

The invention discloses an adaptive voice signal adjustment method, a device, a mobile terminal and a storage medium. The method includes: while the mobile terminal is in hands-free call mode, collecting the voice signal in the environment in real time and obtaining the distance between the mobile terminal and the caller in real time; parsing the voice signal to obtain the voiceprint feature of each sound from a different source in the voice signal; identifying the target voiceprint feature belonging to the caller and determining the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs; and adjusting the loudness value and frequency value of that sound according to the voice amplitude and the distance. By identifying the caller's target voiceprint feature in the voice signal, the caller's voice can be adjusted according to the voice amplitude of the sound to which the target voiceprint feature belongs and the distance between the caller and the mobile terminal. This effectively avoids amplifying ambient noise, improves call quality and improves the user experience.

Description

Adaptive voice signal adjustment method, device, mobile terminal and storage medium

Technical field

The present invention relates to the technical field of mobile terminals, and more particularly to an adaptive voice signal adjustment method, a device, a mobile terminal and a storage medium.

Background art

At present, mobile phones are in very widespread use. The common call modes of a mobile phone include handheld call mode and hands-free call mode. In hands-free call mode, because each user holds the phone with a different posture and habit, the distance between the phone and the local user also varies widely. When the phone picks up sound, these differences cause the volume of the collected voice signal to vary, and the overall loudness tends to be low. So that the party at the other end of the call can hear the conversation clearly, the collected voice signal needs to be amplified before it is sent to that party.

In the prior art, hands-free call mode uses automatic gain control (Automatic Gain Control, AGC), an adaptive gain adjustment, to increase the volume of the voice signal that the phone sends to the party at the other end and thereby improve the quality of the hands-free call. However, because AGC adaptive gain adjustment amplifies the collected voice signal as a whole, the ambient noise in the voice signal is inevitably amplified as well, which lowers call quality and gives the user a poor call experience.
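The drawback described above can be illustrated with a minimal sketch, which is not part of the patent. It assumes a heavily simplified AGC stage that applies one gain to the whole frame; the function names and the sample values are hypothetical. Because speech and noise are scaled by the same factor, their ratio stays the same while the noise becomes louder in absolute terms.

```python
def apply_uniform_gain(samples, gain):
    """Scale every sample by the same gain, as a simplified AGC stage would."""
    return [s * gain for s in samples]


def peak(samples):
    """Largest absolute sample value in the frame."""
    return max(abs(s) for s in samples)


# Hypothetical speech and noise components of one captured frame.
speech = [0.10, -0.20, 0.15, -0.05]
noise = [0.02, -0.01, 0.03, -0.02]

gain = 4.0
louder_speech = apply_uniform_gain(speech, gain)
louder_noise = apply_uniform_gain(noise, gain)

# The speech-to-noise peak ratio is the same before and after the gain.
ratio_before = peak(speech) / peak(noise)
ratio_after = peak(louder_speech) / peak(louder_noise)
```

A real AGC varies its gain over time, but at any instant it still acts on the mixed signal, which is the limitation the invention addresses.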

Summary of the invention

The main object of the present invention is to provide an adaptive voice signal adjustment method, a device, a mobile terminal and a storage medium that solve the problem in the prior art that AGC adaptive gain adjustment amplifies ambient noise, lowering call quality and giving the user a poor call experience.

To achieve the above object, a first aspect of the present invention provides an adaptive voice signal adjustment method, the method including:

while the mobile terminal is in hands-free call mode, collecting the voice signal in the environment in real time, and obtaining the distance between the mobile terminal and the caller in real time;

parsing the voice signal to obtain the voiceprint feature of each sound from a different source in the voice signal;

identifying, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller, and determining the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;

adjusting, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs.

To achieve the above object, a second aspect of the present invention provides an adaptive voice signal adjusting apparatus, the apparatus including:

a collection and acquisition module, configured to, while the mobile terminal is in hands-free call mode, collect the voice signal in the environment in real time and obtain the distance between the mobile terminal and the caller in real time;

a parsing and acquisition module, configured to parse the voice signal and obtain the voiceprint feature of each sound from a different source in the voice signal;

an identification and determination module, configured to identify, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller, and to determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;

an adjustment module, configured to adjust, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs.

To achieve the above object, a third aspect of the present invention provides a mobile terminal including a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the computer program, each step of the adaptive voice signal adjustment method provided in the first aspect is implemented.

To achieve the above object, a fourth aspect of the present invention provides a storage medium. The storage medium is a computer-readable storage medium on which a computer program is stored; when the computer program is executed by a processor, each step of the adaptive voice signal adjustment method provided in the first aspect is implemented.

The present invention provides an adaptive voice signal adjustment method, a device, a mobile terminal and a storage medium. The method includes: while the mobile terminal is in hands-free call mode, collecting the voice signal in the environment in real time and obtaining the distance between the mobile terminal and the caller in real time; parsing the voice signal to obtain the voiceprint feature of each sound from a different source in the voice signal; identifying, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller, and determining the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs; and adjusting, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs. Compared with the prior art, in hands-free call mode the caller's target voiceprint feature is identified in the collected voice signal, so that the loudness value and frequency value of the sound to which the target voiceprint feature belongs can be adjusted according to the voice amplitude of that sound and the distance between the caller and the mobile terminal. The adjustment is thus applied to the caller's voice; compared with AGC adaptive gain adjustment, this effectively avoids amplifying ambient noise, improves call quality and improves the user experience.

Brief description of the drawings

To describe the technical solutions in the embodiments of the present invention or in the prior art more clearly, the accompanying drawings needed for describing the embodiments or the prior art are briefly introduced below. Evidently, the drawings described below show only some embodiments of the present invention, and those skilled in the art can derive other drawings from them without creative effort.

Fig. 1 is a structural block diagram of a mobile terminal;

Fig. 2 is a flowchart of the adaptive voice signal adjustment method in the first embodiment of the present invention;

Fig. 3 is a flowchart of the adaptive voice signal adjustment method in the second embodiment of the present invention;

Fig. 4 is a flowchart of the adaptive voice signal adjustment method in the third embodiment of the present invention;

Fig. 5 is a schematic diagram of the program modules of the adaptive voice signal adjusting apparatus in the fourth embodiment of the present invention;

Fig. 6 is a schematic diagram of the program modules of the adaptive voice signal adjusting apparatus in the fifth embodiment of the present invention;

Fig. 7 is a schematic diagram of the program modules of the adaptive voice signal adjusting apparatus in the sixth embodiment of the present invention.

Detailed description of the embodiments

To make the objects, features and advantages of the present invention clearer and easier to understand, the technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings in the embodiments. Evidently, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by those skilled in the art on the basis of the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.

Fig. 1 shows a structural block diagram of a mobile terminal. The adaptive voice signal adjustment method provided in the embodiments of the present invention can be applied to the mobile terminal 10 shown in Fig. 1. The mobile terminal 10 may include, but is not limited to, smartphones, notebook computers, tablet computers, wearable smart devices and similar devices that rely on a battery for normal operation and support network and download functions.

As shown in Fig. 1, the mobile terminal 10 includes a memory 101, a storage controller 102, one or more processors 103 (only one is shown in the figure), a peripheral interface 104, a radio-frequency module 105, a key module 106, an audio module 107 and a touch screen 108. These components communicate with one another through one or more communication buses/signal lines 109.

It can be understood that the structure shown in Fig. 1 is only illustrative and does not limit the structure of the mobile terminal. The mobile terminal 10 may include more or fewer components than shown in Fig. 1, or have a configuration different from that shown in Fig. 1. Each component shown in Fig. 1 may be implemented in hardware, software or a combination thereof.

The memory 101 can be used to store software programs and modules, such as the program instructions/modules corresponding to the adaptive voice signal adjustment method and apparatus in the embodiments of the present invention. By running the software programs and modules stored in the memory 101, the processor 103 performs various functional applications and data processing, that is, implements the above adaptive voice signal adjustment method and apparatus.

The memory 101 may include high-speed random access memory and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory or other non-volatile solid-state memory. In some examples, the memory 101 may further include memory located remotely from the processor 103, and such remote memory may be connected to the mobile terminal 10 through a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks and combinations thereof. Access to the memory 101 by the processor 103 and other possible components may be performed under the control of the storage controller 102.

The peripheral interface 104 couples various input/output devices to the CPU and the memory 101. The processor 103 runs the software and instructions in the memory 101 to perform the functions of the mobile terminal 10 and to process data.

In some embodiments, the peripheral interface 104, the processor 103 and the storage controller 102 may be implemented in a single chip. In other examples, they may each be implemented by a separate chip.

The radio-frequency module 105 receives and sends electromagnetic waves and converts between electromagnetic waves and electrical signals, so as to communicate with a communication network or other devices. The radio-frequency module 105 may include various existing circuit elements for performing these functions, for example an antenna, a radio-frequency transceiver, a digital signal processor, an encryption/decryption chip, a subscriber identity module (SIM) card, memory and so on. The radio-frequency module 105 can communicate with various networks, such as the Internet, an intranet or a wireless network of a preset type, or communicate with other devices through a wireless network of a preset type. The wireless network of the preset type may include a cellular telephone network, a wireless local area network or a metropolitan area network. The wireless network of the preset type may use various communication standards, protocols and technologies, including but not limited to Global System for Mobile Communication (GSM), Enhanced Data GSM Environment (EDGE), Wideband Code Division Multiple Access (W-CDMA), Code Division Multiple Access (CDMA), Time Division Multiple Access (TDMA), Bluetooth, Wireless Fidelity (WiFi) (such as the Institute of Electrical and Electronics Engineers standards IEEE 802.11a, IEEE 802.11b, IEEE 802.11g and/or IEEE 802.11n), Voice over Internet Protocol (VoIP), Worldwide Interoperability for Microwave Access (WiMAX), other protocols for mail, instant messaging and short messages, and any other suitable communication protocol.

The key module 106 provides an interface through which the user gives input to the mobile terminal; the user can press different keys to make the mobile terminal 10 perform different functions.

The audio module 107 provides an audio interface to the user and may include one or more microphones, one or more loudspeakers and an audio circuit. The audio circuit receives audio data from the peripheral interface 104, converts the audio data into an electrical signal and transmits the electrical signal to the loudspeaker. The loudspeaker converts the electrical signal into sound waves audible to the human ear. The audio circuit also receives an electrical signal from the microphone, converts it into audio data and transmits the audio data to the peripheral interface 104 for further processing. Audio data may be obtained from the memory 101 or through the radio-frequency module 105. In addition, audio data may also be stored in the memory 101 or sent through the radio-frequency module 105. In some examples, the audio module 107 may also include a headphone jack for providing an audio interface to headphones or other devices.

The touch screen 108 provides both an output interface and an input interface between the mobile terminal and the user. Specifically, the touch screen 108 displays video output to the user, and the content of this video output may include text, graphics, video and any combination thereof. Some output results correspond to certain user interface objects. The touch screen 108 also receives user input, such as gesture operations like tapping and sliding, so that the user interface objects respond to this input. The technology for detecting user input may be resistive, capacitive or any other feasible touch detection technology. Specific examples of the display unit of the touch screen 108 include, but are not limited to, liquid crystal displays and light-emitting polymer displays.

The adaptive voice signal adjustment method in the embodiments of the present invention is described below on the basis of the above mobile terminal.

In the prior art, AGC adaptive gain adjustment also amplifies the ambient noise in the voice signal, which gives rise to the technical problem that call quality declines and the user's call experience is poor.

To solve the above problem, the present invention proposes an adaptive voice signal adjustment method. In hands-free call mode, the caller's target voiceprint feature is identified in the collected voice signal, so that the loudness value and frequency value of the sound to which the target voiceprint feature belongs can be adjusted according to the voice amplitude of that sound and the distance between the caller and the mobile terminal. The adjustment is thus applied to the caller's voice; compared with AGC adaptive gain adjustment, this effectively avoids amplifying ambient noise, improves call quality and improves the user experience.

Referring to Fig. 2, which is a flowchart of the adaptive voice signal adjustment method in the first embodiment of the present invention, the method includes:

Step 201: while the mobile terminal is in hands-free call mode, collect the voice signal in the environment in real time, and obtain the distance between the mobile terminal and the caller in real time;

In the embodiments of the present invention, the above adaptive voice signal adjustment method is implemented by an adaptive voice signal adjusting apparatus (hereinafter: the adjusting apparatus). The adjusting apparatus is a program module stored in the computer-readable storage medium of the mobile terminal, and the processor implements the above method by executing it.

During a call, if the mobile terminal is in hands-free call mode, there is a distance between the current caller and the mobile terminal, where the caller is the local user of the mobile terminal. The microphone on the mobile terminal then collects the voice signal in the environment, and the adjusting apparatus obtains in real time the voice signal collected by the microphone. It can be understood that, when the caller is speaking, the voice signal contains at least the caller's voice, and if other sounds are present in the environment, the microphone also collects them.

The adjusting apparatus also obtains in real time the distance between the mobile terminal and the caller. The distance can be detected by a distance sensor provided in the mobile terminal; the distance sensor may be an optical displacement sensor, a linear proximity sensor or an ultrasonic displacement sensor. The distance sensor may be arranged on both sides of the earpiece of the mobile terminal, in the groove of the earpiece, on a side face of the mobile terminal, and so on. In practice, the position and the specific type of the distance sensor can be chosen as needed and are not limited here.

Step 202: parse the voice signal to obtain the voiceprint feature of each sound from a different source in the voice signal;

A voiceprint is the sound-wave spectrum, carrying speech information, that can be displayed with electro-acoustic instruments. Producing human speech involves a complex biophysical process between the language centre of the body and the vocal organs. The vocal organs used in speaking include the tongue, larynx, lungs, nasal cavity and so on. Because these organs differ in size and shape from person to person, the voiceprint maps of different people also differ. A voiceprint feature is a characteristic parameter of a voiceprint, the parameter that makes the voiceprint reliable as an identifier; different voiceprint features distinguish different sounds.

In the embodiments of the present invention, the collected voice signal is parsed to obtain the voiceprint feature of each sound from a different source in the voice signal, where a source may be the caller, a television, an animal, a machine, or any other person, object or device that can produce sound.
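The patent does not specify which parameters make up a voiceprint feature. As a purely illustrative stand-in, the sketch below computes the share of spectral energy in a few equal-width frequency bands using a naive discrete Fourier transform. The function name `band_energy_feature`, the number of bands and the test tones are assumptions; practical speaker recognition systems typically use richer features such as mel-frequency cepstral coefficients.

```python
import cmath
import math


def band_energy_feature(samples, num_bands=4):
    """Toy voiceprint feature: share of spectral energy in equal-width bands.

    Illustrative only; the patent does not define the feature.
    """
    n = len(samples)
    energies = []
    # Naive DFT, bins 1 .. n/2 - 1 (the DC bin is skipped).
    for k in range(1, n // 2):
        acc = sum(
            samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)
        )
        energies.append(abs(acc) ** 2)
    bands = [0.0] * num_bands
    for i, energy in enumerate(energies):
        bands[i * num_bands // len(energies)] += energy
    total = sum(bands) or 1.0  # avoid dividing by zero for silence
    return [b / total for b in bands]


# Two hypothetical sources: a low-pitched tone and a high-pitched tone.
n = 64
low_tone = [math.sin(2 * math.pi * 3 * t / n) for t in range(n)]
high_tone = [math.sin(2 * math.pi * 28 * t / n) for t in range(n)]

low_feature = band_energy_feature(low_tone)
high_feature = band_energy_feature(high_tone)
```

The two tones yield clearly different feature vectors (energy concentrated in the first band versus the last), which is the property the method relies on to tell sources apart.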

Step 203: identify, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller, and determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;

Step 204: adjust, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs.

In the embodiments of the present invention, the adjusting apparatus identifies which of the voiceprint features of the sounds from different sources is the voiceprint feature of the current caller, and takes the identified voiceprint feature as the target voiceprint feature. It can be understood that there may be one or more callers, each with one set of target voiceprint features. Further, the adjusting apparatus also determines the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs. The sound to which the target voiceprint feature belongs is the caller's voice, and the voice amplitude is the average value, or the minimum value, of the wave amplitudes in the sound wave formed by the caller's voice.
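The voice amplitude defined above (the average, or the minimum, of the wave amplitudes) can be sketched as follows. This is one possible reading, assuming that "wave amplitude" means the height of each local crest or trough of the sampled waveform; the function names and the sample frame are hypothetical.

```python
def wave_peak_amplitudes(samples):
    """Absolute height of each local crest or trough of the waveform."""
    mags = [abs(s) for s in samples]
    return [
        mags[i]
        for i in range(1, len(mags) - 1)
        if mags[i] > mags[i - 1] and mags[i] >= mags[i + 1]
    ]


def voice_amplitude(samples, mode="mean"):
    """Average ("mean") or minimum ("min") of the wave amplitudes."""
    peaks = wave_peak_amplitudes(samples)
    if not peaks:
        return 0.0
    if mode == "mean":
        return sum(peaks) / len(peaks)
    if mode == "min":
        return min(peaks)
    raise ValueError("mode must be 'mean' or 'min'")


# Hypothetical frame of the caller's separated voice.
caller_frame = [0.0, 0.5, 0.0, -0.3, 0.0, 0.4, 0.0]
mean_amplitude = voice_amplitude(caller_frame)        # average of 0.5, 0.3, 0.4
min_amplitude = voice_amplitude(caller_frame, "min")  # smallest crest: 0.3
```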

The adjusting apparatus then adjusts, according to the voice amplitude and the distance obtained through the distance sensor, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs, that is, the loudness value and frequency value of the caller's voice in the voice signal.

Here, the loudness value measures the volume of the sound, and the frequency value measures the clarity of the sound.
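The patent does not state how the loudness value or the frequency value is measured. As an assumption for illustration only, the sketch below uses the RMS level in decibels relative to full scale as a loudness measure and a zero-crossing count as a rough frequency estimate; both function names are hypothetical.

```python
import math


def rms_level_db(samples):
    """RMS level in dB relative to full scale (1.0); -inf for silence."""
    if not samples:
        return float("-inf")
    mean_square = sum(s * s for s in samples) / len(samples)
    if mean_square == 0:
        return float("-inf")
    return 10 * math.log10(mean_square)


def zero_crossing_frequency(samples, sample_rate):
    """Rough frequency estimate in Hz from the number of sign changes."""
    crossings = sum(
        1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0)
    )
    duration = len(samples) / sample_rate
    return crossings / (2 * duration)
```

For a full-scale 100 Hz sine sampled at 8000 Hz, `rms_level_db` gives about -3 dB and `zero_crossing_frequency` gives about 100 Hz. Zero-crossing estimates are only meaningful for simple, nearly periodic signals, so this is a teaching aid rather than a speech analysis method.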

It should be noted that, after the adjustment of the voice signal is completed, the voice signal can be sent to the mobile terminal used by the party at the other end of the call, so that this party hears speech that is clear and at a suitable volume.

In the embodiments of the present invention, while the mobile terminal is in hands-free call mode, the voice signal in the environment is collected in real time and the distance between the mobile terminal and the caller is obtained in real time; the voice signal is parsed to obtain the voiceprint feature of each sound from a different source in the voice signal; the target voiceprint feature belonging to the caller is identified among the voiceprint features of the sounds, and the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs is determined; and the loudness value and frequency value of that sound are adjusted according to the voice amplitude and the distance. Compared with the prior art, in hands-free call mode the caller's target voiceprint feature is identified in the collected voice signal, so that the loudness value and frequency value of the sound to which the target voiceprint feature belongs can be adjusted according to the voice amplitude of that sound and the distance between the caller and the mobile terminal. The adjustment is thus applied to the caller's voice; compared with AGC adaptive gain adjustment, this effectively avoids amplifying ambient noise, improves call quality and improves the user experience.

Referring to Fig. 3, which is a flowchart of the adaptive voice signal adjustment method in the second embodiment of the present invention, the method includes:

Step 301: while the mobile terminal is in hands-free call mode, collect the voice signal in the environment in real time, and obtain the distance between the mobile terminal and the caller in real time;

Step 302: parse the voice signal to obtain the voiceprint feature of each sound from a different source in the voice signal;

It can be understood that step 301 and step 302 are similar to step 201 and step 202 of the first embodiment respectively; refer to the related content of the first embodiment, which is not repeated here.

Step 303: look up a preset voiceprint feature library and determine whether, among the voiceprint features of the sounds, there is a voiceprint feature that matches a voiceprint feature in the voiceprint feature library;

Step 304: if a matching voiceprint feature exists, determine the matching voiceprint feature to be the caller's target voiceprint feature, and determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;

In the embodiments of the present invention, a voiceprint feature library containing one or more voiceprint features is preset in the mobile terminal. It can be set up as follows: the user enters the settings interface of the mobile terminal by tapping and selects the voiceprint setting function, so that the display interface of the mobile terminal shows a start button for voiceprint setting. After tapping the button, the user says anything, or reads aloud the content shown on the display interface. The microphone on the mobile terminal collects what the user says, and the voiceprint feature is analysed. It is then judged whether the voiceprint feature obtained by the analysis meets the requirements. If it does, the voiceprint feature is saved to the voiceprint feature library, which completes the setting of the voiceprint feature; if it does not, a prompt message is displayed asking the user to perform the setting again. In this way, the voiceprint features of one or more users can be set on one mobile terminal.

After obtaining the voiceprint feature of each sound in the voice signal, the adjusting apparatus looks up the preset voiceprint feature library and determines whether, among the voiceprint features of the sounds, there is a voiceprint feature that matches a voiceprint feature in the library. Specifically, the voiceprint feature of each obtained sound is matched in turn against each voiceprint feature in the preset library. If the library contains a voiceprint feature that matches the voiceprint feature of some sound, the matching voiceprint feature is determined to be the caller's target voiceprint feature, and the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs is determined.
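The matching loop of steps 303 and 304 can be sketched as below. The patent does not say how two voiceprint features are compared, so the sketch assumes that features are numeric vectors and that a match means a cosine similarity above a threshold. The function names, the threshold of 0.9 and the sample vectors are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two feature vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def find_target_voiceprints(sound_features, library, threshold=0.9):
    """Match each sound in turn against each enrolled feature.

    Returns {sound_id: enrolled_user} for every sound that matches the library.
    """
    matches = {}
    for sound_id, feature in sound_features.items():
        for user, enrolled in library.items():
            if cosine_similarity(feature, enrolled) >= threshold:
                matches[sound_id] = user
                break
    return matches


# Hypothetical preset library and features of the sounds in one frame.
library = {"alice": [0.7, 0.2, 0.1]}
sound_features = {"sound_1": [0.68, 0.22, 0.10], "sound_2": [0.1, 0.2, 0.7]}

targets = find_target_voiceprints(sound_features, library)
```

Here `sound_1` is close to the enrolled feature and becomes the target, while `sound_2` (for example a television) is left unmatched and can later be treated as interference.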

Step 305: determine the amplitude difference between the voice amplitude and a preset threshold;

Step 306: look up the preset mapping between differences and adjusting parameter tables to determine the adjusting parameter table corresponding to the amplitude difference;

Step 307: look up the adjusting parameter table corresponding to the amplitude difference to determine the target loudness value and target frequency value corresponding to the distance;

Step 308: adjust, according to the target loudness value and target frequency value, the loudness value and frequency value of the sound to which the target voiceprint feature belongs.

In the embodiments of the present invention, after obtaining the voice amplitude for the target voiceprint feature and the distance between the caller and the mobile terminal, the adjusting apparatus determines the amplitude difference between the voice amplitude and a preset threshold, where the preset threshold is used to control the degree to which the sound is adjusted.

The amplitude difference is a parameter that determines the adjustment. Specifically, a mapping between differences and adjusting parameter tables is preset in the mobile terminal, so that different differences use different adjusting parameter tables. Each adjusting parameter table contains the mapping between distance, loudness value and frequency value.

After finding the adjusting parameter table corresponding to the amplitude difference, the adjusting apparatus looks up that table to determine the target loudness value and target frequency value corresponding to the distance.
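The two-level lookup of steps 305 to 307 can be sketched as follows. Every number below (the threshold, the difference ranges, the distance ranges in centimetres and the target values) is a hypothetical placeholder, since the patent gives no concrete table contents; the sign convention "amplitude minus threshold" is also an assumption.

```python
PRESET_THRESHOLD = 0.5  # hypothetical reference amplitude


def lookup(rows, key):
    """Return the value of the first row whose upper bound is >= key.

    Falls back to the last row when the key exceeds every bound.
    """
    for upper_bound, value in rows:
        if key <= upper_bound:
            return value
    return rows[-1][1]


# Adjusting parameter tables: distance upper bound (cm) ->
# (target loudness value, target frequency value). Values are made up.
QUIET_TABLE = [(30, (-14.0, 1000.0)), (60, (-12.0, 1200.0)), (100, (-10.0, 1400.0))]
LOUD_TABLE = [(30, (-18.0, 1000.0)), (60, (-16.0, 1100.0)), (100, (-14.0, 1200.0))]

# Mapping from amplitude difference (upper bound) to adjusting parameter table.
DIFFERENCE_TO_TABLE = [(0.0, QUIET_TABLE), (1.0, LOUD_TABLE)]


def select_targets(voice_amplitude, distance_cm, threshold=PRESET_THRESHOLD):
    """Steps 305-307: difference -> table -> (target loudness, target frequency)."""
    amplitude_difference = voice_amplitude - threshold  # step 305
    table = lookup(DIFFERENCE_TO_TABLE, amplitude_difference)  # step 306
    return lookup(table, distance_cm)  # step 307


quiet_far = select_targets(0.3, 45)   # quiet caller at 45 cm
loud_near = select_targets(0.8, 20)   # loud caller at 20 cm
```

Keeping the tables as plain data means the adjustment behaviour can be tuned per device without changing the lookup code, which appears to be the motivation for using preset tables.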

Further, the adjusting apparatus adjusts the loudness value and frequency value of the sound to which the target voiceprint feature belongs according to the target loudness value and target frequency value. Specifically: the sound to which the target voiceprint feature belongs is extracted from the collected voice signal as the target voice signal; the loudness value of the target voice signal is adjusted to the target loudness value, and the frequency value of the target voice signal is adjusted to the target frequency value.
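The loudness half of step 308 can be sketched as a simple gain applied only to the extracted target voice signal. The sketch assumes that the loudness value is an RMS level in dB relative to full scale, which the patent does not state; the function names are hypothetical. The frequency adjustment is not sketched because the patent does not define the operation involved.

```python
import math


def rms(samples):
    """Root-mean-square value of a frame (0.0 for an empty frame)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def adjust_to_target_loudness(target_signal, target_db):
    """Scale the caller's extracted signal so its RMS level equals target_db."""
    current = rms(target_signal)
    if current == 0:
        return list(target_signal)  # silence cannot be scaled to a level
    gain = 10 ** (target_db / 20) / current
    return [s * gain for s in target_signal]


# Hypothetical extracted caller signal raised from -20 dB to -6 dB.
caller_signal = [0.1, -0.1, 0.1, -0.1]
adjusted = adjust_to_target_loudness(caller_signal, -6.0)
```

Because the gain acts on the target voice signal alone, other sounds in the captured voice signal are not amplified along with it.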

In the embodiments of the present invention, a voiceprint feature library is preset, so that once the voiceprint feature of each sound in the voice signal has been obtained, it can be matched against the library to obtain the target voiceprint feature. In addition, the mapping between differences and adjusting parameter tables, and the adjusting parameter tables themselves, are preset, so that the difference between the voice amplitude for the target voiceprint feature and the preset threshold can be used to look up the mapping and determine the adjusting parameter table, and the distance can then be used to look up that table and obtain the target loudness value and target frequency value, allowing a fine adjustment of the sound to which the target voiceprint feature belongs. Because the adjustment is applied to the sound in the voice signal to which the target voiceprint feature belongs, compared with AGC adaptive gain adjustment it effectively avoids amplifying ambient noise, improves call quality and improves the user experience.

Referring to Fig. 4, which is a flowchart of the adaptive voice signal adjustment method in the third embodiment of the present invention, the method includes:

Step 401: while the mobile terminal is in hands-free call mode, collect the voice signal in the environment in real time, and obtain the distance between the mobile terminal and the caller in real time;

Step 402: parse the voice signal to obtain the voiceprint feature of each sound from a different source in the voice signal;

Step 403: identify, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller, and determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;

Step 404: adjust, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs;

Step 405: extract from the voice signal the sounds to which the voiceprint features other than the target voiceprint feature belong, to obtain an interference voice signal;

Step 406: apply noise reduction to the interference voice signal.

It can be understood that steps 401 to 404 are similar to steps 201 to 204 of the first embodiment respectively; refer to the first embodiment for details, which are not repeated here.

It is understood that 3rd embodiment is described on the basis of first embodiment, in another feasible reality In existing mode, 3rd embodiment can also be described on the basis of second embodiment, not repeated herein.

In this embodiment of the present invention, after the sound to which the target voiceprint feature belongs has been adjusted, the other sounds can also be processed to further improve call quality. Specifically, the adjustment device extracts from the voice signal the sounds belonging to voiceprint features other than the target voiceprint feature, to obtain an interference voice signal. For example, if the voice signal contains the caller's voice and the sound of a television playing an advertisement, the caller's voice is the sound to which the target voiceprint feature belongs, and the adjustment device extracts the sound of the television advertisement from the voice signal as the interference voice signal. The adjustment device then performs noise reduction on the interference voice signal, so that after the adjusted voice signal is sent to the call partner at the other end, the useful signal (the caller's voice) in the voice signal heard by the call partner is clearer and at a suitable volume, while the unwanted signal (the interference voice signal) is weaker.

The noise reduction can be performed in various ways, for example noise gate noise reduction, sampling noise reduction, or filtering noise reduction.
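As an illustration of the first of these options, the following is a minimal sketch of a frame-based noise gate applied to an interference signal. The patent does not specify how the gate works; the function name `noise_gate`, the frame size, the threshold, and the use of a per-frame peak are all assumptions made for this example.

```python
def noise_gate(samples, threshold, frame_size=4, attenuation=0.0):
    """Attenuate every frame whose peak magnitude is below `threshold`.

    All parameters are hypothetical; the source only names "noise gate"
    as one possible noise reduction method.
    """
    gated = []
    for start in range(0, len(samples), frame_size):
        frame = samples[start:start + frame_size]
        peak = max(abs(s) for s in frame)
        # Frames at or above the threshold pass unchanged; quieter frames are attenuated.
        gain = 1.0 if peak >= threshold else attenuation
        gated.extend(s * gain for s in frame)
    return gated
```

With `attenuation=0.0` the gate mutes quiet frames entirely; a value between 0 and 1 would only weaken them, which is closer to the "weaker interference" outcome the description aims for.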

In this embodiment of the present invention, after the sound in the voice signal to which the target voiceprint feature belongs has been adjusted, noise reduction is additionally performed on the interference voice signal in the voice signal, which further improves call quality and the call experience.

Referring to Fig. 5, which is a schematic diagram of the program modules of the voice signal adaptive adjustment device in the fourth embodiment of the present invention, the device includes:

A collection and acquisition module 501, configured to, when the mobile terminal is in hands-free call mode, collect the voice signal in the environment in real time and acquire the distance between the mobile terminal and the caller in real time;

In this embodiment of the present invention, the voice signal adaptive adjustment device described above consists of program modules, which are stored in a computer-readable storage medium of the mobile terminal and can be executed by a processor.

During a call, if the mobile terminal is in hands-free call mode, there is a distance between the current caller and the mobile terminal, where the caller is the local user of the mobile terminal. In this case, the microphone on the mobile terminal collects the voice signal in the environment, and the collection and acquisition module 501 obtains the voice signal collected by the microphone in real time. It can be understood that when the caller is speaking, the voice signal contains at least the caller's voice, and if there are other sounds in the environment, the microphone also collects those sounds.

The collection and acquisition module 501 also acquires the distance between the mobile terminal and the caller in real time. The distance can be detected by a distance sensor provided in the mobile terminal, which may be an optical displacement sensor, a linear proximity sensor, or an ultrasonic displacement sensor. The distance sensor may be arranged on either side of the earpiece of the mobile terminal, in the groove of the earpiece, or on a side face of the mobile terminal. In practical applications, the position of the distance sensor and the specific type of distance sensor used can be chosen as needed, and are not limited here.

A parsing and acquisition module 502, configured to parse the voice signal and obtain the voiceprint feature of each sound from a different source in the voice signal;

In this embodiment of the present invention, the parsing and acquisition module 502 parses the collected voice signal and obtains the voiceprint feature of each sound from a different source in the voice signal, where a source may be any person, thing, or device capable of producing sound, such as the caller, a television, an animal, or a machine.

An identification and determination module 503, configured to identify, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller, and to determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;

An adjustment module 504, configured to adjust, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs.

In this embodiment of the present invention, the identification and determination module 503 identifies which of the voiceprint features of the sounds from different sources is the voiceprint feature of the current caller, and takes the identified voiceprint feature as the target voiceprint feature. It can be understood that there may be one or more callers, and each caller has its own set of target voiceprint features. The identification and determination module 503 further determines the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs, where that sound is the caller's voice, and the voice amplitude is the average of the wave amplitudes in the sound wave formed by the caller's voice, or the minimum of those wave amplitudes.
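The two definitions of voice amplitude given above (average or minimum of the wave amplitudes) can be sketched as follows. This is only an illustration: the patent does not say how individual wave amplitudes are measured, so the example assumes they are approximated by the peak magnitude of fixed-size frames. The function name `voice_amplitude` and the `frame_size` and `mode` parameters are hypothetical.

```python
def voice_amplitude(samples, frame_size=4, mode="mean"):
    """Summarise the caller's voice as a single amplitude value.

    Assumption: each frame's peak magnitude stands in for one "wave amplitude".
    mode="mean" gives the average amplitude, mode="min" the minimum amplitude.
    """
    peaks = [
        max(abs(s) for s in samples[i:i + frame_size])
        for i in range(0, len(samples), frame_size)
    ]
    if not peaks:
        raise ValueError("samples must not be empty")
    if mode == "mean":
        return sum(peaks) / len(peaks)
    if mode == "min":
        return min(peaks)
    raise ValueError("mode must be 'mean' or 'min'")
```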

The adjustment module 504 adjusts, according to the voice amplitude and the distance obtained by the distance sensor, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs, that is, the loudness value and frequency value of the caller's voice in the voice signal.

Referring to Fig. 6, which is a schematic diagram of the program modules of the voice signal adaptive adjustment device in the fifth embodiment of the present invention, the device includes the collection and acquisition module 501, the parsing and acquisition module 502, the identification and determination module 503, and the adjustment module 504 of the fourth embodiment, which are similar to those described in the fourth embodiment and are not described again here.

In this embodiment of the present invention, the identification and determination module 503 includes:

A search and judgment module 601, configured to search the preset voiceprint feature library and judge whether, among the voiceprint features of the sounds, there is a voiceprint feature that matches a voiceprint feature in the voiceprint feature library;

A target determination module 602, configured to, if a matching voiceprint feature exists, determine the matching voiceprint feature as the target voiceprint feature of the caller;

An amplitude determination module 603, configured to determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs.

After the voiceprint feature of each sound in the voice signal has been obtained, the search and judgment module 601 searches the preset voiceprint feature library and judges whether, among the voiceprint features of the sounds, there is one that matches a voiceprint feature in the library. Specifically, the voiceprint feature of each obtained sound is matched in turn against each voiceprint feature in the preset library. If the library contains a voiceprint feature that matches the voiceprint feature of a given sound, the target determination module 602 determines the matching voiceprint feature as the target voiceprint feature of the caller, and the amplitude determination module 603 determines the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs.
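The matching loop described above can be sketched as follows. The patent does not specify how voiceprint features are represented or compared, so this example assumes each feature is a numeric vector and that two features "match" when their cosine similarity reaches a threshold. The names `cosine_similarity` and `find_target_voiceprints` and the threshold value are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def find_target_voiceprints(sound_features, library, threshold=0.9):
    """Return the ids of sounds whose feature matches any entry in the preset library.

    sound_features: dict mapping a sound id to its feature vector (assumed format).
    library: list of enrolled feature vectors (assumed format).
    """
    targets = []
    for sound_id, feature in sound_features.items():
        # Compare this sound against each library entry in turn.
        if any(cosine_similarity(feature, ref) >= threshold for ref in library):
            targets.append(sound_id)
    return targets
```

Returning a list rather than a single id reflects the statement that there may be more than one caller, each with its own target voiceprint feature.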

In this embodiment of the present invention, the adjustment module 504 includes:

A difference determination module 604, configured to determine the amplitude difference between the voice amplitude and the preset threshold;

A first lookup module 605, configured to look up the preset mapping between difference values and adjustment parameter tables, and determine the adjustment parameter table corresponding to the amplitude difference;

A second lookup module 606, configured to look up the adjustment parameter table corresponding to the amplitude difference, and determine the target loudness value and target frequency value corresponding to the distance, where the adjustment parameter table contains the mapping between distance, loudness value, and frequency value;

A target adjustment module 607, configured to adjust, according to the target loudness value and target frequency value, the loudness value and frequency value of the sound to which the target voiceprint feature belongs.

The target adjustment module 607 includes:

A first extraction module 608, configured to extract from the voice signal the sound to which the target voiceprint feature belongs, as the target voice signal;

A data adjustment module 609, configured to adjust the loudness value of the target voice signal to the target loudness value, and adjust the frequency value of the target voice signal to the target frequency value.

In this embodiment of the present invention, after the voice amplitude of the target voiceprint feature and the distance between the caller and the mobile terminal have been obtained, the difference determination module 604 determines the amplitude difference between the voice amplitude and the preset threshold, where the preset threshold is used to control the degree to which the sound is adjusted.

After the first lookup module 605 has found the adjustment parameter table corresponding to the amplitude difference, the second lookup module 606 looks up that adjustment parameter table and determines the target loudness value and target frequency value corresponding to the distance.
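The two-stage lookup performed by modules 604 to 606 can be sketched as follows. All table contents, units (centimetres, decibels, hertz), and the choice to key both lookups by upper bounds of ranges are assumptions made for illustration; the patent only states that a difference maps to an adjustment parameter table and that the table maps a distance to a loudness value and a frequency value. The sign convention of the amplitude difference is also not given, so the absolute difference is used here.

```python
import bisect

# Hypothetical adjustment parameter tables.
# Each row: (max_distance_cm, target_loudness_db, target_frequency_hz)
TABLE_SMALL_DIFF = [(20, 60, 300), (50, 65, 320), (100, 70, 340)]
TABLE_LARGE_DIFF = [(20, 66, 300), (50, 72, 330), (100, 78, 360)]

# Hypothetical mapping from amplitude difference (upper bound) to a table.
DIFF_TO_TABLE = [(0.2, TABLE_SMALL_DIFF), (1.0, TABLE_LARGE_DIFF)]


def pick_row(rows, key):
    """Return the first row whose upper bound is >= key, clamped to the last row."""
    bounds = [row[0] for row in rows]
    index = min(bisect.bisect_left(bounds, key), len(rows) - 1)
    return rows[index]


def lookup_targets(voice_amplitude, preset_threshold, distance_cm):
    """Stage 1: amplitude difference -> table. Stage 2: distance -> targets."""
    amplitude_diff = abs(preset_threshold - voice_amplitude)  # assumed sign convention
    _, table = pick_row(DIFF_TO_TABLE, amplitude_diff)
    _, loudness_db, frequency_hz = pick_row(table, distance_cm)
    return loudness_db, frequency_hz
```

Keeping one table per difference range means the preset threshold controls how aggressive the adjustment is, while the distance selects the concrete targets within that table.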

Further, the target adjustment module 607 adjusts, according to the target loudness value and target frequency value, the loudness value and frequency value of the sound to which the target voiceprint feature belongs. Specifically, the first extraction module 608 extracts from the collected voice signal the sound to which the target voiceprint feature belongs, as the target voice signal; the data adjustment module 609 then adjusts the loudness value of the target voice signal to the target loudness value, and adjusts the frequency value of the target voice signal to the target frequency value.
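The loudness half of this adjustment can be sketched as a simple gain change. The patent does not define how loudness is measured, so this example assumes it is the RMS level of the target voice signal expressed in decibels relative to full scale; `rms` and `adjust_loudness` are hypothetical names. The frequency adjustment is left out because the source gives no detail on what it involves.

```python
import math


def rms(samples):
    """Root-mean-square level of a non-empty list of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def adjust_loudness(samples, target_db):
    """Scale the target voice signal so its RMS level equals `target_db`.

    Assumption: loudness is RMS level in dB relative to full scale (1.0).
    A silent signal is returned unchanged because no gain can raise it.
    """
    current = rms(samples)
    if current == 0:
        return list(samples)
    target_rms = 10 ** (target_db / 20)
    gain = target_rms / current
    return [s * gain for s in samples]
```

Because only the extracted target voice signal is scaled, the gain is not applied to the interference sounds, which is the stated difference from AGC-style adjustment of the whole signal.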

Referring to Fig. 7, which is a schematic diagram of the program modules of the voice signal adaptive adjustment device in the sixth embodiment of the present invention, the device includes the collection and acquisition module 501, the parsing and acquisition module 502, the identification and determination module 503, and the adjustment module 504 of the fourth embodiment, which are similar to those described in the fourth embodiment and are not described again here.

It can be understood that the sixth embodiment is described on the basis of the fourth embodiment. The sixth embodiment may also be described on the basis of the fifth embodiment.

In this embodiment of the present invention, the device further includes:

A second extraction module 701, configured to extract from the voice signal the sounds belonging to voiceprint features other than the target voiceprint feature, to obtain an interference voice signal;

A noise reduction module 702, configured to perform noise reduction on the interference voice signal.

In this embodiment of the present invention, after the sound to which the target voiceprint feature belongs has been adjusted, the other sounds can also be processed to further improve call quality. Specifically, the second extraction module 701 extracts from the voice signal the sounds belonging to voiceprint features other than the target voiceprint feature, to obtain an interference voice signal. For example, if the voice signal contains the caller's voice and the sound of a television playing an advertisement, the caller's voice is the sound to which the target voiceprint feature belongs, and the adjustment device extracts the sound of the television advertisement from the voice signal as the interference voice signal. The noise reduction module 702 then performs noise reduction on the interference voice signal, so that after the adjusted voice signal is sent to the call partner at the other end, the useful signal (the caller's voice) in the voice signal heard by the call partner is clearer and at a suitable volume, while the unwanted signal (the interference voice signal) is weaker.

An embodiment of the present invention further provides a mobile terminal, including a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the computer program, it implements each step of the voice signal adaptive adjustment method in any one of the first to third embodiments.

An embodiment of the present invention further provides a storage medium, which may specifically be a computer-readable storage medium on which a computer program is stored. When the computer program is executed by a processor, it implements each step of the voice signal adaptive adjustment method in any one of the first to third embodiments.

In the several embodiments provided in this application, it should be understood that the disclosed device and method may be implemented in other ways. For example, the device embodiments described above are merely illustrative: the division into modules is only a division by logical function, and other divisions are possible in an actual implementation. For instance, multiple modules or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection between devices or modules through some interfaces, and may be electrical, mechanical, or in another form.

The modules described as separate components may or may not be physically separate, and the components shown as modules may or may not be physical modules; that is, they may be located in one place or distributed over multiple network modules. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment.

In addition, the functional modules in the embodiments of the present invention may be integrated into one processing module, each module may exist physically on its own, or two or more modules may be integrated into one module. The integrated module may be implemented in the form of hardware or in the form of a software functional module.

If the integrated module is implemented in the form of a software functional module and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or part of the steps of the methods described in the embodiments of the present invention. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.

It should be noted that, for brevity, each of the foregoing method embodiments is described as a series of combined actions. However, those skilled in the art should know that the present invention is not limited by the described order of actions, because according to the present invention some steps may be performed in another order or simultaneously. Those skilled in the art should also know that the embodiments described in this specification are all preferred embodiments, and the actions and modules involved are not necessarily required by the present invention.

In the above embodiments, the description of each embodiment has its own emphasis. For parts not described in detail in one embodiment, refer to the related descriptions of the other embodiments.

The above is a description of the voice signal adaptive adjustment method, device, mobile terminal, and storage medium provided by the present invention. For those skilled in the art, changes may be made to the specific implementations and the scope of application in accordance with the ideas of the embodiments of the present invention. In summary, the content of this specification should not be construed as limiting the present invention.

Claims

1. A voice signal adaptive adjustment method, characterized in that the method comprises:

when a mobile terminal is in hands-free call mode, collecting the voice signal in the environment in real time, and acquiring the distance between the mobile terminal and a caller in real time;

identifying, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller, and determining the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;

adjusting, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs.

2. The method according to claim 1, characterized in that the identifying, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller comprises:

searching a preset voiceprint feature library, and judging whether, among the voiceprint features of the sounds, there is a voiceprint feature that matches a voiceprint feature in the voiceprint feature library;

if a matching voiceprint feature exists, determining the matching voiceprint feature as the target voiceprint feature of the caller.

3. The method according to claim 1, characterized in that the adjusting, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs comprises:

determining the amplitude difference between the voice amplitude and a preset threshold;

looking up a preset mapping between difference values and adjustment parameter tables, and determining the adjustment parameter table corresponding to the amplitude difference;

looking up the adjustment parameter table corresponding to the amplitude difference, and determining the target loudness value and target frequency value corresponding to the distance, wherein the adjustment parameter table contains the mapping between distance, loudness value, and frequency value;

adjusting, according to the target loudness value and target frequency value, the loudness value and frequency value of the sound to which the target voiceprint feature belongs.

4. The method according to claim 3, characterized in that the adjusting, according to the target loudness value and target frequency value, the loudness value and frequency value of the sound to which the target voiceprint feature belongs comprises:

extracting from the voice signal the sound to which the target voiceprint feature belongs, as a target voice signal;

adjusting the loudness value of the target voice signal to the target loudness value, and adjusting the frequency value of the target voice signal to the target frequency value.

5. The method according to any one of claims 1 to 4, characterized in that the method further comprises:

extracting from the voice signal the sounds belonging to voiceprint features other than the target voiceprint feature, to obtain an interference voice signal;

performing noise reduction on the interference voice signal.

6. A voice signal adaptive adjustment device, characterized in that the device comprises:

a collection and acquisition module, configured to, when a mobile terminal is in hands-free call mode, collect the voice signal in the environment in real time and acquire the distance between the mobile terminal and a caller in real time;

a parsing and acquisition module, configured to parse the voice signal and obtain the voiceprint feature of each sound from a different source in the voice signal;

an identification and determination module, configured to identify, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller, and to determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;

an adjustment module, configured to adjust, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs.

7. The device according to claim 6, characterized in that the identification and determination module comprises:

a search and judgment module, configured to search a preset voiceprint feature library and judge whether, among the voiceprint features of the sounds, there is a voiceprint feature that matches a voiceprint feature in the voiceprint feature library;

a target determination module, configured to, if a matching voiceprint feature exists, determine the matching voiceprint feature as the target voiceprint feature of the caller;

an amplitude determination module, configured to determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs.

8. The device according to claim 6, characterized in that the adjustment module comprises:

a difference determination module, configured to determine the amplitude difference between the voice amplitude and a preset threshold;

a first lookup module, configured to look up a preset mapping between difference values and adjustment parameter tables, and determine the adjustment parameter table corresponding to the amplitude difference;

a second lookup module, configured to look up the adjustment parameter table corresponding to the amplitude difference, and determine the target loudness value and target frequency value corresponding to the distance, wherein the adjustment parameter table contains the mapping between distance, loudness value, and frequency value;

a target adjustment module, configured to adjust, according to the target loudness value and target frequency value, the loudness value and frequency value of the sound to which the target voiceprint feature belongs.

9. The device according to claim 8, characterized in that the target adjustment module comprises:

a first extraction module, configured to extract from the voice signal the sound to which the target voiceprint feature belongs, as a target voice signal;

a data adjustment module, configured to adjust the loudness value of the target voice signal to the target loudness value, and adjust the frequency value of the target voice signal to the target frequency value.

10. The device according to any one of claims 6 to 9, characterized in that the device further comprises:

a second extraction module, configured to extract from the voice signal the sounds belonging to voiceprint features other than the target voiceprint feature, to obtain an interference voice signal;

a noise reduction module, configured to perform noise reduction on the interference voice signal.

11. A mobile terminal, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that when the processor executes the computer program, each step of the voice signal adaptive adjustment method according to any one of claims 1 to 5 is implemented.

12. A storage medium, the storage medium being a computer-readable storage medium on which a computer program is stored, characterized in that when the computer program is executed by a processor, each step of the voice signal adaptive adjustment method according to any one of claims 1 to 5 is implemented.