CN107172255A - Voice signal self-adapting regulation method, device, mobile terminal and storage medium - Google Patents
- Publication number
- CN107172255A, CN201710599150.2A
- Authority
- CN
- China
- Prior art keywords
- vocal print
- voice signal
- print feature
- target
- sound
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/02—Constructional features of telephone sets
- H04M1/19—Arrangements of transmitters, receivers, or complete sets to prevent eavesdropping, to attenuate local noise or to prevent undesired transmission; Mouthpieces or receivers specially adapted therefor
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G10L21/028—Voice signal separating using properties of sound source
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0324—Details of processing therefor
- G10L21/0332—Details of processing therefor involving modification of waveforms
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0324—Details of processing therefor
- G10L21/034—Automatic adjustment
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/60—Substation equipment, e.g. for use by subscribers including speech amplifiers
- H04M1/6033—Substation equipment, e.g. for use by subscribers including speech amplifiers for providing handsfree use or a loudspeaker mode in telephone sets
- H04M1/6041—Portable telephones adapted for handsfree use
- H04M1/605—Portable telephones adapted for handsfree use involving control of the receiver volume to provide a dual operational mode at close or far distance from the user
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephone Function (AREA)
Abstract
The invention discloses an adaptive voice signal adjustment method, an apparatus, a mobile terminal and a storage medium. The method includes: while the mobile terminal is in hands-free call mode, collecting the voice signal in the environment in real time and obtaining the distance between the mobile terminal and the caller in real time; parsing the voice signal to obtain the voiceprint features of the sounds from different sources in the voice signal; identifying the target voiceprint feature that belongs to the caller, and determining the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs; and adjusting the loudness value and frequency value of that sound according to the voice amplitude and the distance. Because the caller's target voiceprint feature is identified in the voice signal, the caller's voice can be adjusted according to the voice amplitude of the sound to which the target voiceprint feature belongs and the distance between the caller and the mobile terminal. This effectively avoids amplifying ambient noise, improves call quality and improves the user experience.
Description
Technical field
The present invention relates to the technical field of mobile terminals, and in particular to an adaptive voice signal adjustment method, an apparatus, a mobile terminal and a storage medium.
Background technology
Mobile phones are now in very widespread use. Their common call modes include handheld call mode and hands-free call mode. In hands-free call mode, users hold the phone in different postures and with different habits, so the distance between the phone and the local user also varies greatly. When the phone picks up sound, these differences cause the volume of the collected voice signal to vary, and its overall loudness tends to be low. So that the party at the other end of the call can hear the conversation clearly, the collected voice signal needs to be amplified before it is sent to that party.
In the prior art, in hands-free call mode, adaptive gain adjustment by automatic gain control (AGC) is used to increase the volume of the voice signal that the phone sends to the other party, in order to improve the quality of the hands-free call. However, AGC adaptive gain adjustment amplifies the collected voice signal as a whole, so the ambient noise in the voice signal is inevitably amplified as well. This lowers call quality and gives the user a poor call experience.
Summary of the invention
The main object of the present invention is to provide an adaptive voice signal adjustment method, an apparatus, a mobile terminal and a storage medium that solve the problem that AGC adaptive gain adjustment in the prior art amplifies ambient noise, lowering call quality and giving the user a poor call experience.
To achieve the above object, a first aspect of the present invention provides an adaptive voice signal adjustment method, which includes:
while the mobile terminal is in hands-free call mode, collecting the voice signal in the environment in real time, and obtaining the distance between the mobile terminal and the caller in real time;
parsing the voice signal to obtain the voiceprint features of the sounds from different sources in the voice signal;
identifying, among the voiceprint features of the sounds, the target voiceprint feature that belongs to the caller, and determining the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;
adjusting, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs.
To achieve the above object, a second aspect of the present invention provides an adaptive voice signal adjustment apparatus, which includes:
a collection and acquisition module, configured to collect the voice signal in the environment in real time while the mobile terminal is in hands-free call mode, and to obtain the distance between the mobile terminal and the caller in real time;
a parsing and acquisition module, configured to parse the voice signal and obtain the voiceprint features of the sounds from different sources in the voice signal;
an identification and determination module, configured to identify, among the voiceprint features of the sounds, the target voiceprint feature that belongs to the caller, and to determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;
an adjustment module, configured to adjust, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs.
To achieve the above object, a third aspect of the present invention provides a mobile terminal, including a memory, a processor, and a computer program that is stored in the memory and can run on the processor. When the processor executes the computer program, it carries out the steps of the adaptive voice signal adjustment method provided by the first aspect.
To achieve the above object, a fourth aspect of the present invention provides a storage medium. The storage medium is a computer-readable storage medium on which a computer program is stored; when the computer program is executed by a processor, it carries out the steps of the adaptive voice signal adjustment method provided by the first aspect.
The present invention provides an adaptive voice signal adjustment method, an apparatus, a mobile terminal and a storage medium. The method includes: while the mobile terminal is in hands-free call mode, collecting the voice signal in the environment in real time and obtaining the distance between the mobile terminal and the caller in real time; parsing the voice signal to obtain the voiceprint features of the sounds from different sources in the voice signal; identifying, among the voiceprint features of the sounds, the target voiceprint feature that belongs to the caller, and determining the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs; and adjusting, according to the voice amplitude and the distance, the loudness value and frequency value of that sound. Compared with the prior art, in hands-free call mode the caller's target voiceprint feature is identified in the collected voice signal, so the loudness value and frequency value of the sound to which the target voiceprint feature belongs can be adjusted according to the voice amplitude of that sound and the distance between the caller and the mobile terminal. The adjustment is thus applied to the caller's voice. Compared with AGC adaptive gain adjustment, this effectively avoids amplifying ambient noise, improves call quality and improves the user experience.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present invention; those skilled in the art can derive other drawings from them without creative effort.
Fig. 1 is a structural block diagram of a mobile terminal;
Fig. 2 is a schematic flowchart of the adaptive voice signal adjustment method in the first embodiment of the present invention;
Fig. 3 is a schematic flowchart of the adaptive voice signal adjustment method in the second embodiment of the present invention;
Fig. 4 is a schematic flowchart of the adaptive voice signal adjustment method in the third embodiment of the present invention;
Fig. 5 is a schematic diagram of the program modules of the adaptive voice signal adjustment apparatus in the fourth embodiment of the present invention;
Fig. 6 is a schematic diagram of the program modules of the adaptive voice signal adjustment apparatus in the fifth embodiment of the present invention;
Fig. 7 is a schematic diagram of the program modules of the adaptive voice signal adjustment apparatus in the sixth embodiment of the present invention.
Detailed description of the embodiments
To make the objects, features and advantages of the present invention clearer and easier to understand, the technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings of those embodiments. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those skilled in the art from the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
Fig. 1 shows a structural block diagram of a mobile terminal. The adaptive voice signal adjustment method provided by the embodiments of the present invention can be applied to the mobile terminal 10 shown in Fig. 1. The mobile terminal 10 may be, but is not limited to, a smartphone, notebook computer, tablet computer, wearable smart device or similar device that relies on a battery for normal operation and supports network and download functions.
As shown in Fig. 1, the mobile terminal 10 includes a memory 101, a memory controller 102, one or more processors 103 (only one is shown in the figure), a peripheral interface 104, a radio-frequency module 105, a key module 106, an audio module 107 and a touch screen 108. These components communicate with one another over one or more communication buses/signal lines 109.
It should be understood that the structure shown in Fig. 1 is only schematic and does not limit the structure of the mobile terminal. The mobile terminal 10 may include more or fewer components than shown in Fig. 1, or have a configuration different from that shown in Fig. 1. Each component shown in Fig. 1 can be implemented in hardware, software or a combination of the two.
The memory 101 can be used to store software programs and modules, such as the program instructions/modules corresponding to the adaptive voice signal adjustment method and apparatus in the embodiments of the present invention. By running the software programs and modules stored in the memory 101, the processor 103 performs various functional applications and data processing, that is, it implements the above adaptive voice signal adjustment method and apparatus.
The memory 101 may include high-speed random access memory and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory or other non-volatile solid-state memory. In some examples, the memory 101 may further include memory located remotely from the processor 103; such remote memory can be connected to the mobile terminal 10 over a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks and combinations thereof. Access to the memory 101 by the processor 103 and other possible components can take place under the control of the memory controller 102.
The peripheral interface 104 couples various input/output devices to the CPU and the memory 101. The processor 103 runs the software and instructions in the memory 101 to perform the functions of the mobile terminal 10 and to process data.
In some embodiments, the peripheral interface 104, the processor 103 and the memory controller 102 can be implemented in a single chip. In other examples, each can be implemented in a separate chip.
The radio-frequency module 105 receives and sends electromagnetic waves and converts between electromagnetic waves and electrical signals, so as to communicate with a communication network or other devices. The radio-frequency module 105 may include various existing circuit elements for performing these functions, for example an antenna, a radio-frequency transceiver, a digital signal processor, an encryption/decryption chip, a subscriber identity module (SIM) card and memory. The radio-frequency module 105 can communicate with various networks, such as the internet, an intranet or a wireless network of a preset type, or communicate with other devices over a wireless network of a preset type. The wireless network of a preset type may include a cellular telephone network, a wireless local area network or a metropolitan area network. It can use various communication standards, protocols and technologies, including but not limited to Global System for Mobile Communication (GSM), Enhanced Data GSM Environment (EDGE), Wideband Code Division Multiple Access (W-CDMA), Code Division Multiple Access (CDMA), Time Division Multiple Access (TDMA), Bluetooth, Wireless Fidelity (WiFi) (such as the Institute of Electrical and Electronics Engineers standards IEEE 802.11a, IEEE 802.11b, IEEE 802.11g and/or IEEE 802.11n), Voice over Internet Protocol (VoIP), Worldwide Interoperability for Microwave Access (Wi-Max), other protocols for mail, instant messaging and short messages, and any other suitable communication protocol.
The key module 106 provides an interface through which the user gives input to the mobile terminal. By pressing different keys, the user causes the mobile terminal 10 to perform different functions.
The audio module 107 provides an audio interface to the user and may include one or more microphones, one or more loudspeakers and an audio circuit. The audio circuit receives audio data from the peripheral interface 104, converts the audio data into an electrical signal and transmits the electrical signal to the loudspeaker. The loudspeaker converts the electrical signal into sound waves that the human ear can hear. The audio circuit also receives an electrical signal from the microphone, converts it into audio data and transmits the audio data to the peripheral interface 104 for further processing. Audio data can be obtained from the memory 101 or through the radio-frequency module 105. In addition, audio data can be stored in the memory 101 or sent through the radio-frequency module 105. In some examples, the audio module 107 may also include a headphone jack for providing an audio interface to headphones or other devices.
The touch screen 108 provides both an output interface and an input interface between the mobile terminal and the user. Specifically, the touch screen 108 displays video output to the user; this output may include text, graphics, video and any combination thereof. Some output corresponds to user interface objects. The touch screen 108 also receives user input, such as taps, swipes and other gestures, so that the user interface objects can respond to that input. The technology for detecting user input can be resistive, capacitive or any other possible touch detection technology. Specific examples of the display unit of the touch screen 108 include, but are not limited to, liquid crystal displays and light-emitting polymer displays.
The adaptive voice signal adjustment method in the embodiments of the present invention is described below on the basis of the above mobile terminal.
In the prior art, AGC adaptive gain adjustment amplifies the ambient noise in the voice signal, which causes the technical problem of reduced call quality and a poor call experience for the user.
To solve the above problem, the present invention proposes an adaptive voice signal adjustment method. In hands-free call mode, the caller's target voiceprint feature is identified in the collected voice signal, so the loudness value and frequency value of the sound to which the target voiceprint feature belongs can be adjusted according to the voice amplitude of that sound and the distance between the caller and the mobile terminal. The adjustment is thus applied to the caller's voice. Compared with AGC adaptive gain adjustment, this effectively avoids amplifying ambient noise, improves call quality and improves the user experience.
Referring to Fig. 2, which is a schematic flowchart of the adaptive voice signal adjustment method in the first embodiment of the present invention, the method includes:
Step 201: while the mobile terminal is in hands-free call mode, collect the voice signal in the environment in real time, and obtain the distance between the mobile terminal and the caller in real time;
In the embodiments of the present invention, the above method is implemented by an adaptive voice signal adjustment apparatus (hereinafter: the adjusting apparatus). The adjusting apparatus is a program module stored in a computer-readable storage medium of the mobile terminal, and the processor can execute it to carry out the above method.
During a call, if the mobile terminal is in hands-free call mode, there is some distance between the current caller and the mobile terminal. Here the caller is the local user of the mobile terminal. The microphone on the mobile terminal collects the voice signal in the environment, and the adjusting apparatus obtains the collected voice signal in real time. When the caller is speaking, the voice signal contains at least the caller's voice; if there are other sounds in the environment, the microphone collects those as well.
The adjusting apparatus also obtains the distance between the mobile terminal and the caller in real time. The distance can be detected by a distance sensor built into the mobile terminal, which may be an optical displacement sensor, a linear proximity sensor or an ultrasonic displacement sensor. The distance sensor can be placed on either side of the earpiece of the mobile terminal, in the groove of the earpiece, or on a side face of the mobile terminal. In practice, the placement and the type of the distance sensor can be chosen according to specific needs and are not limited here.
Step 202: parse the voice signal and obtain the voiceprint features of the sounds from different sources in the voice signal;
A voiceprint is the sound-wave spectrum, carrying speech information, that can be observed when sound is displayed with an electroacoustic instrument. Human speech is produced by a complex biophysical process between the language center of the human body and the vocal organs. The vocal organs used in speaking include the tongue, larynx, lungs and nasal cavity. Because these organs differ from person to person in size and form, the voiceprint patterns of different people also differ. A voiceprint feature is a characteristic parameter of a voiceprint, one that makes the voiceprint reliable for identification; different voiceprint features distinguish different sounds.
In the embodiments of the present invention, the collected voice signal is parsed to obtain the voiceprint features of the sounds from different sources in the voice signal. A source can be any person, thing or device that can produce sound, such as the caller, a television, an animal or a machine.
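As a rough illustration of turning a stretch of audio into a feature vector, the sketch below computes log band energies for one frame with a naive discrete Fourier transform. It is only a stand-in: the source does not say which voiceprint features are used or how sounds from different sources are separated, so the function name `band_energy_features`, the band count and the choice of band energies are all assumptions, and the frame is assumed to contain a single, already separated sound.

```python
import cmath
import math


def band_energy_features(frame, num_bands=4):
    """Return the log energy of `num_bands` equal-width frequency bands.

    Hypothetical stand-in for a voiceprint feature vector; the source does
    not specify the actual features.
    """
    n = len(frame)
    half = n // 2
    # Naive DFT: power at each non-negative frequency bin below Nyquist.
    power = []
    for k in range(half):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n)
                  for t in range(n))
        power.append(abs(acc) ** 2)
    band_size = half // num_bands
    features = []
    for b in range(num_bands):
        energy = sum(power[b * band_size:(b + 1) * band_size])
        # Small offset avoids log(0) for silent bands.
        features.append(math.log(energy + 1e-12))
    return features
```

Two frames dominated by different frequency bands produce clearly different vectors, which is the property a later matching step would rely on.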
Step 203: identify, among the voiceprint features of the sounds, the target voiceprint feature that belongs to the caller, and determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;
Step 204: according to the voice amplitude and the distance, adjust the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs.
In the embodiments of the present invention, the adjusting apparatus identifies which of the voiceprint features of the sounds from different sources belongs to the current caller and takes the identified voiceprint feature as the target voiceprint feature. There may be one caller or several, and each caller has one set of target voiceprint features. The adjusting apparatus further determines the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs. That sound is the caller's voice, and the voice amplitude is the average of the wave amplitudes in the sound wave formed by the caller's voice, or the minimum of those wave amplitudes.
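The two definitions of voice amplitude given above (average or minimum of the wave amplitudes) can be sketched as follows. This is a minimal sketch that assumes the caller's sound is already available as a list of samples and that a "wave amplitude" is a local peak of the absolute sample value; the function name `voice_amplitude` and the peak-picking rule are assumptions, not taken from the source.

```python
from statistics import mean


def voice_amplitude(samples, mode="average"):
    """Average or minimum of the wave peak amplitudes in `samples`.

    A peak is taken to be a local maximum of the absolute sample value
    (an assumption; the source does not define how peaks are found).
    """
    mags = [abs(s) for s in samples]
    peaks = [mags[i] for i in range(1, len(mags) - 1)
             if mags[i] > mags[i - 1] and mags[i] >= mags[i + 1]]
    if not peaks:
        raise ValueError("no wave peaks found in samples")
    if mode == "average":
        return mean(peaks)
    if mode == "minimum":
        return min(peaks)
    raise ValueError("mode must be 'average' or 'minimum'")
```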
According to the voice amplitude and the distance obtained from the distance sensor, the adjusting apparatus adjusts the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs, that is, the loudness value and frequency value of the caller's voice in the voice signal.
The loudness value measures the volume of the sound, and the frequency value measures its clarity.
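One way to picture the loudness part of this adjustment is a gain that is applied only to the caller's sound and is derived from the measured voice amplitude and the distance. The sketch below is purely illustrative: the distance-to-target table, its values, the gain cap and the names `target_amplitude`, `caller_gain` and `adjust_loudness` are all hypothetical, the caller's sound is assumed to be already separated from the other sounds, and the frequency adjustment is left out because this passage gives no rule for it.

```python
# Hypothetical mapping from distance (upper bound, in metres) to the desired
# amplitude of the caller's voice. The values are placeholders, not from the
# source.
DISTANCE_TO_TARGET_AMPLITUDE = [
    (0.3, 0.4),
    (1.0, 0.6),
    (float("inf"), 0.8),
]


def target_amplitude(distance_m):
    """Look up the desired caller amplitude for the measured distance."""
    for max_distance, target in DISTANCE_TO_TARGET_AMPLITUDE:
        if distance_m <= max_distance:
            return target
    raise ValueError("distance must be a number")


def caller_gain(voice_amp, distance_m, max_gain=8.0):
    """Gain that moves the caller's amplitude towards the target, capped."""
    if voice_amp <= 0:
        return 1.0  # nothing measurable to scale; leave the sound unchanged
    return min(target_amplitude(distance_m) / voice_amp, max_gain)


def adjust_loudness(caller_samples, other_samples, voice_amp, distance_m):
    """Scale only the caller's samples, then mix the other sounds back in."""
    gain = caller_gain(voice_amp, distance_m)
    return [gain * c + o for c, o in zip(caller_samples, other_samples)]
```

Because the gain is applied to the caller's samples alone, the other sounds in the mix keep their original level, which is the contrast with whole-signal AGC that the text draws.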
It should be noted that, after the voice signal has been adjusted, it can be sent to the mobile terminal used by the party at the other end of the call, so that this party hears speech that is clear and of suitable volume.
In the embodiments of the present invention, while the mobile terminal is in hands-free call mode, the voice signal in the environment is collected in real time and the distance between the mobile terminal and the caller is obtained in real time. The voice signal is parsed to obtain the voiceprint features of the sounds from different sources in the voice signal. Among these, the target voiceprint feature that belongs to the caller is identified, and the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs is determined. According to the voice amplitude and the distance, the loudness value and frequency value of that sound are adjusted. Compared with the prior art, in hands-free call mode the caller's target voiceprint feature is identified in the collected voice signal, so the loudness value and frequency value of the sound to which the target voiceprint feature belongs can be adjusted according to the voice amplitude of that sound and the distance between the caller and the mobile terminal. The adjustment is thus applied to the caller's voice. Compared with AGC adaptive gain adjustment, this effectively avoids amplifying ambient noise, improves call quality and improves the user experience.
Referring to Fig. 3, which is a schematic flowchart of the adaptive voice signal adjustment method in the second embodiment of the present invention, the method includes:
Step 301: while the mobile terminal is in hands-free call mode, collect the voice signal in the environment in real time, and obtain the distance between the mobile terminal and the caller in real time;
Step 302: parse the voice signal and obtain the voiceprint features of the sounds from different sources in the voice signal;
Step 301 and step 302 are similar to step 201 and step 202 of the first embodiment respectively; refer to the related content of the first embodiment, which is not repeated here.
Step 303: look up a preset voiceprint feature library and judge whether any of the voiceprint features of the sounds matches a voiceprint feature in the voiceprint feature library;
Step 304: if a matching voiceprint feature exists, determine the matching voiceprint feature to be the caller's target voiceprint feature, and determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;
In this embodiment of the present invention, a voiceprint feature library containing one or more voiceprint features is preset in the mobile terminal. A specific way of setting it up may be as follows: the user enters the settings interface of the mobile terminal by a tap operation and selects the voiceprint setting function, so that the display interface of the mobile terminal shows a start button for voiceprint setting. After tapping the button, the user says arbitrary content or reads aloud the content shown on the display interface. The microphone on the mobile terminal collects what the user says, and the voiceprint feature is analyzed. It is then judged whether the voiceprint feature obtained by the analysis meets the requirements: if it does, the voiceprint feature is saved to the voiceprint feature library, completing the setting of the voiceprint feature; if it does not, a prompt message is displayed asking the user to perform the setting again. In this way, the voiceprint features of one or more users can be set on one mobile terminal.
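The save-or-prompt branch of this setting procedure can be sketched as follows. This is a minimal illustration only: the class `VoiceprintLibrary`, the function `meets_requirements`, the representation of a voiceprint feature as a list of numbers, and the acceptance rule itself are all assumptions, since the text does not state what "meets the requirements" means.

```python
import math


def meets_requirements(feature, min_length=4):
    """Assumed acceptance rule: long enough, finite, and not all zeros."""
    if len(feature) < min_length:
        return False
    if not all(math.isfinite(x) for x in feature):
        return False
    return any(x != 0.0 for x in feature)


class VoiceprintLibrary:
    """Hypothetical in-memory stand-in for the preset voiceprint feature library."""

    def __init__(self):
        self.features = {}  # user name -> voiceprint feature vector

    def enroll(self, user, feature):
        """Save the feature if it is acceptable, otherwise ask the user to retry."""
        if not meets_requirements(feature):
            return "Voiceprint not accepted, please record again."
        self.features[user] = list(feature)
        return "Voiceprint saved."
```

Calling `enroll` once per user fills the library with one entry per enrolled user, which corresponds to setting the voiceprint features of several users on one mobile terminal.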
After obtaining the voiceprint feature of each sound in the voice signal, the adjusting apparatus searches the preset voiceprint feature library and judges whether any of the voiceprint features of the sounds matches a voiceprint feature in the library. Specifically, the voiceprint feature of each obtained sound is matched in turn against each voiceprint feature in the preset voiceprint feature library. If the library contains a voiceprint feature that matches the voiceprint feature of a certain sound, the matching voiceprint feature is determined as the target voiceprint feature of the caller, and the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs is determined.
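The sound-by-sound, entry-by-entry matching described above can be sketched as a nested loop. The text does not say how two voiceprint features are compared, so the cosine-similarity test, the `threshold` value, and the names `cosine_similarity` and `find_target_voiceprint` are assumptions made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def find_target_voiceprint(sound_features, library, threshold=0.9):
    """Match each sound's feature against each library entry in turn.

    Returns (sound_id, library_id) for the first match, or None if nothing matches.
    """
    for sound_id, feature in sound_features.items():
        for library_id, enrolled in library.items():
            if cosine_similarity(feature, enrolled) >= threshold:
                return sound_id, library_id
    return None
```

Returning on the first match is a simplification; a variant that collects every match would be needed if several enrolled callers may speak at once.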
Step 305: determine the amplitude difference between the voice amplitude and a preset threshold;
Step 306: search the preset mapping relationship between differences and adjustment parameter tables, and determine the adjustment parameter table corresponding to the amplitude difference;
Step 307: search the adjustment parameter table corresponding to the amplitude difference, and determine the target loudness value and target frequency value corresponding to the distance;
Step 308: adjust the loudness value and frequency value of the sound to which the target voiceprint feature belongs according to the target loudness value and the target frequency value.
In this embodiment of the present invention, after obtaining the voice amplitude of the target voiceprint feature and the distance between the caller and the mobile terminal, the adjusting apparatus determines the amplitude difference between the voice amplitude and a preset threshold, where the preset threshold is used to control the degree to which the sound is adjusted.
The amplitude difference is the parameter that determines the adjustment. Specifically, a mapping relationship between differences and adjustment parameter tables is preset in the mobile terminal, so that different differences use different adjustment parameter tables. Each adjustment parameter table contains a mapping relationship between distance, loudness value, and frequency value.
After finding the adjustment parameter table corresponding to the amplitude difference, the adjusting apparatus searches that table to determine the target loudness value and target frequency value corresponding to the distance.
Further, the adjusting apparatus adjusts the loudness value and frequency value of the sound to which the target voiceprint feature belongs according to the target loudness value and target frequency value. Specifically, the sound to which the target voiceprint feature belongs is extracted from the collected voice signal as the target voice signal; the loudness value of the target voice signal is adjusted to the target loudness value, and the frequency value of the target voice signal is adjusted to the target frequency value.
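Steps 305 to 307 amount to a two-level lookup: the amplitude difference selects a table, and the distance selects a row in that table. A minimal sketch follows. The table contents, the units (metres, dB, Hz), the single difference boundary at 0, and the names `select_table` and `lookup_targets` are all hypothetical; the text only states that such mappings are preset.

```python
import bisect

# Hypothetical tables: each row is (max_distance_m, target_loudness_db, target_frequency_hz).
TABLE_QUIET = [(0.5, 60.0, 1000.0), (1.0, 66.0, 1200.0), (2.0, 72.0, 1500.0)]
TABLE_LOUD = [(0.5, 55.0, 1000.0), (1.0, 60.0, 1100.0), (2.0, 65.0, 1300.0)]

# Hypothetical mapping from amplitude difference to table:
# differences below 0 use TABLE_QUIET, differences of 0 or more use TABLE_LOUD.
DIFFERENCE_BOUNDS = [0.0]
TABLES = [TABLE_QUIET, TABLE_LOUD]


def select_table(voice_amplitude, preset_threshold):
    """Steps 305 and 306: compute the amplitude difference and pick its table."""
    difference = voice_amplitude - preset_threshold
    return TABLES[bisect.bisect_right(DIFFERENCE_BOUNDS, difference)]


def lookup_targets(table, distance_m):
    """Step 307: return (target loudness, target frequency) for the distance."""
    for max_distance, loudness, frequency in table:
        if distance_m <= max_distance:
            return loudness, frequency
    # Beyond the last row, fall back to the farthest entry (an assumption).
    return table[-1][1], table[-1][2]
```

For example, `lookup_targets(select_table(0.3, 0.5), 0.8)` selects `TABLE_QUIET` because the difference is negative, then returns the row for distances up to 1.0 m.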
In this embodiment of the present invention, the preset voiceprint feature library makes it possible, once the voiceprint feature of each sound in the voice signal has been obtained, to match those features against the library and obtain the target voiceprint feature. The preset mapping relationship between differences and adjustment parameter tables, together with the preset adjustment parameter tables, makes it possible to use the difference between the voice amplitude of the target voiceprint feature and the preset threshold to determine an adjustment parameter table, and then to use the distance to look up the target loudness value and target frequency value in that table, so that the sound to which the target voiceprint feature belongs is adjusted at a fine granularity. Because the adjustment is applied to the sound in the voice signal to which the target voiceprint feature belongs, this approach, compared with AGC adaptive gain adjustment, effectively avoids amplifying ambient noise, improves call quality, and improves the user experience.
Referring to Fig. 4, which is a schematic flowchart of the voice signal adaptive adjustment method in the third embodiment of the present invention, the method includes:
Step 401: when the mobile terminal is in hands-free call mode, collect the voice signal in the environment in real time, and obtain the distance between the mobile terminal and the caller in real time;
Step 402: parse the voice signal to obtain the voiceprint feature of each sound from a different source in the voice signal;
Step 403: identify, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller, and determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;
Step 404: according to the voice amplitude and the distance, adjust the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs;
Step 405: extract from the voice signal the sounds to which the voiceprint features other than the target voiceprint feature belong, to obtain an interference voice signal;
Step 406: perform noise reduction processing on the interference voice signal.
It can be understood that steps 401 to 404 are similar to steps 201 to 204 described in the first embodiment, respectively; refer to the first embodiment for details, which are not repeated here.
It can be understood that the third embodiment is described on the basis of the first embodiment. In another feasible implementation, the third embodiment may also be described on the basis of the second embodiment, which is not repeated here.
In this embodiment of the present invention, after the sound to which the target voiceprint feature belongs has been adjusted, the other sounds may also be adjusted in order to further improve call quality. Specifically, the adjusting apparatus extracts from the voice signal the sounds to which the voiceprint features other than the target voiceprint feature belong, to obtain an interference voice signal. For example, if the voice signal contains the caller's voice and the sound of a television playing an advertisement, the caller's voice is the sound to which the target voiceprint feature belongs, and the adjusting apparatus extracts the sound of the television playing the advertisement from the voice signal as the interference voice signal. Further, the adjusting apparatus performs noise reduction processing on the interference voice signal, so that after the adjusted voice signal is sent to the call partner at the other end, the useful signal (that is, the caller's voice) heard by the call partner is clearer and at a suitable volume, while the invalid signal (that is, the interference voice signal) is weaker.
The noise reduction processing may be performed in a variety of ways, such as noise-gate noise reduction, sampling noise reduction, and filtering noise reduction.
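Of the methods named above, the noise gate is the simplest to illustrate. The sketch below is a deliberately simplified, per-sample gate: samples whose magnitude falls below a threshold are attenuated, and the rest pass unchanged. The function name `noise_gate` and both parameter values are assumptions; a practical gate would usually follow a smoothed signal envelope rather than individual samples.

```python
def noise_gate(samples, threshold, attenuation=0.1):
    """Attenuate every sample whose magnitude is below the threshold.

    samples: sequence of floats (for example, the interference voice signal).
    threshold: magnitude below which a sample is treated as noise.
    attenuation: gain applied to gated samples (0.0 mutes them entirely).
    """
    return [s if abs(s) >= threshold else s * attenuation for s in samples]
```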
In this embodiment of the present invention, after the sound in the voice signal to which the target voiceprint feature belongs has been adjusted, noise reduction processing is additionally performed on the interference voice signal in the voice signal, which further improves call quality and improves the call experience.
Referring to Fig. 5, which is a schematic diagram of the program modules of the voice signal adaptive adjusting apparatus in the fourth embodiment of the present invention, the apparatus includes:
A collection and acquisition module 501, configured to, when the mobile terminal is in hands-free call mode, collect the voice signal in the environment in real time and obtain the distance between the mobile terminal and the caller in real time;
In this embodiment of the present invention, the above voice signal adaptive adjusting apparatus is a set of program modules stored in a computer-readable storage medium of the mobile terminal and executable by a processor.
During a call, if the mobile terminal is in hands-free call mode, this indicates that there is a distance between the current caller and the mobile terminal, where the caller refers to the local user of the mobile terminal. At this time, the microphone on the mobile terminal collects the voice signal in the environment, and the collection and acquisition module 501 obtains in real time the voice signal collected by the microphone. It can be understood that when the caller is speaking, the voice signal contains at least the caller's voice, and if there are other sounds in the environment, the microphone also collects them.
The collection and acquisition module 501 also obtains the distance between the mobile terminal and the caller in real time. The distance can be detected by a distance sensor provided in the mobile terminal; the distance sensor may be an optical displacement sensor, a linear proximity sensor, or an ultrasonic displacement sensor. The distance sensor may be arranged on both sides of the earpiece of the mobile terminal, in the groove of the earpiece of the mobile terminal, or on a side face of the mobile terminal. In practical applications, the position of the distance sensor and the specific type of distance sensor used can be set according to specific needs, and are not limited here.
A parsing and acquisition module 502, configured to parse the voice signal and obtain the voiceprint feature of each sound from a different source in the voice signal;
A voiceprint is the sound wave spectrum carrying speech information, as displayed by an electroacoustic instrument. The production of human speech is a complex biophysical process between the language center of the human body and the vocal organs. The vocal organs a person uses when speaking include the tongue, larynx, lungs, and nasal cavity. Because these organs differ from person to person in size and shape, the voiceprint spectra of different people also differ. Voiceprint features are the characteristic parameters of a voiceprint and are the parameters that make a voiceprint reliable; different voiceprint features can distinguish different sounds.
In this embodiment of the present invention, the parsing and acquisition module 502 parses the collected voice signal and obtains the voiceprint feature of each sound from a different source in the voice signal, where a source may be the caller, a television, an animal, a machine, or any other person, thing, or device that can produce sound.
An identification and determination module 503, configured to identify, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller, and to determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;
An adjusting module 504, configured to adjust, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs.
In this embodiment of the present invention, the identification and determination module 503 identifies which of the voiceprint features of the sounds from different sources is the voiceprint feature of the current caller, and takes the identified voiceprint feature as the target voiceprint feature. It can be understood that there may be one or more callers, and each caller has one set of target voiceprint features. Further, the identification and determination module 503 also determines the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs, where that sound is the caller's voice, and the voice amplitude refers to the average value of the wave amplitudes in the sound wave formed by the caller's voice, or to the minimum value of those wave amplitudes.
The adjusting module 504 adjusts, according to the voice amplitude and the distance obtained by the distance sensor, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs; that is, it adjusts the loudness value and frequency value of the caller's voice in the voice signal.
The loudness value measures the volume, and the frequency value measures the clarity of the sound.
It should be noted that after the adjustment of the voice signal is completed, the voice signal can be sent to the mobile terminal used by the call partner at the other end, so that the call partner hears speech that is clear and at a suitable volume.
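The definition of voice amplitude as the average or the minimum of the wave amplitudes can be illustrated with a short sketch. It assumes that a "wave amplitude" is the magnitude of one local crest or trough of the sampled waveform, which the text does not spell out; the function names `wave_amplitudes` and `voice_amplitude` and the `mode` parameter are likewise hypothetical.

```python
def wave_amplitudes(samples):
    """Magnitudes of the local peaks of |samples| (one per crest or trough)."""
    mags = [abs(s) for s in samples]
    return [
        mags[i]
        for i in range(1, len(mags) - 1)
        if mags[i] > mags[i - 1] and mags[i] >= mags[i + 1]
    ]


def voice_amplitude(samples, mode="mean"):
    """Voice amplitude as the mean or the minimum of the wave amplitudes."""
    peaks = wave_amplitudes(samples)
    if not peaks:
        return 0.0  # no crest or trough found, e.g. silence
    if mode == "mean":
        return sum(peaks) / len(peaks)
    if mode == "min":
        return min(peaks)
    raise ValueError(f"unknown mode: {mode}")
```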
In this embodiment of the present invention, when the mobile terminal is in hands-free call mode, the voice signal in the environment is collected in real time and the distance between the mobile terminal and the caller is obtained in real time; the voice signal is parsed to obtain the voiceprint feature of each sound from a different source in the voice signal; the target voiceprint feature belonging to the caller is identified among the voiceprint features of the sounds, and the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs is determined; and, according to that voice amplitude and the above distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs are adjusted. Compared with the prior art, in hands-free call mode the target voiceprint feature of the caller is identified in the collected voice signal, so that the loudness value and frequency value of the sound to which the target voiceprint feature belongs can be adjusted according to the voice amplitude of that sound and the distance between the caller and the mobile terminal. The adjustment is thus applied to the caller's voice specifically. Compared with AGC adaptive gain adjustment, this effectively avoids amplifying ambient noise, improves call quality, and improves the user experience.
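How the four modules of this embodiment hand data to one another can be sketched as a small pipeline. The class `AdaptiveAdjuster`, its field names, and the callable signatures are illustrative assumptions; the real modules 501 to 504 would wrap the microphone, the distance sensor, and the signal processing. Passing the signal through unchanged when no target voiceprint is identified is also an assumption, since the text does not cover that case.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

Signal = List[float]
Features = Dict[str, List[float]]


@dataclass
class AdaptiveAdjuster:
    # Module 501: returns (voice signal, distance to the caller).
    collect: Callable[[], Tuple[Signal, float]]
    # Module 502: returns one voiceprint feature per sound source.
    parse: Callable[[Signal], Features]
    # Module 503: returns (target source id, voice amplitude) or None.
    identify: Callable[[Signal, Features], Optional[Tuple[str, float]]]
    # Module 504: returns the adjusted voice signal.
    adjust: Callable[[Signal, str, float, float], Signal]

    def run_once(self) -> Signal:
        """Run one collect, parse, identify, adjust cycle."""
        signal, distance = self.collect()
        features = self.parse(signal)
        identified = self.identify(signal, features)
        if identified is None:
            return signal  # assumed behaviour when no caller is recognized
        target_id, amplitude = identified
        return self.adjust(signal, target_id, amplitude, distance)
```

Keeping each module behind a plain callable mirrors the statement later in the description that the module division is only a division by logical function.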
Referring to Fig. 6, which is a schematic diagram of the program modules of the voice signal adaptive adjusting apparatus in the fifth embodiment of the present invention, the apparatus includes the collection and acquisition module 501, the parsing and acquisition module 502, the identification and determination module 503, and the adjusting module 504 of the fourth embodiment, which are similar to the content described in the fourth embodiment and are not repeated here.
In this embodiment of the present invention, the identification and determination module 503 includes:
A search and judgment module 601, configured to search the preset voiceprint feature library and judge whether any of the voiceprint features of the sounds matches a voiceprint feature in the voiceprint feature library;
A target determination module 602, configured to, if a matching voiceprint feature exists, determine the matching voiceprint feature as the target voiceprint feature of the caller;
An amplitude determination module 603, configured to determine the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs.
In this embodiment of the present invention, a voiceprint feature library containing one or more voiceprint features is preset in the mobile terminal. A specific way of setting it up may be as follows: the user enters the settings interface of the mobile terminal by a tap operation and selects the voiceprint setting function, so that the display interface of the mobile terminal shows a start button for voiceprint setting. After tapping the button, the user says arbitrary content or reads aloud the content shown on the display interface. The microphone on the mobile terminal collects what the user says, and the voiceprint feature is analyzed. It is then judged whether the voiceprint feature obtained by the analysis meets the requirements: if it does, the voiceprint feature is saved to the voiceprint feature library, completing the setting of the voiceprint feature; if it does not, a prompt message is displayed asking the user to perform the setting again. In this way, the voiceprint features of one or more users can be set on one mobile terminal.
After the voiceprint feature of each sound in the voice signal has been obtained, the search and judgment module 601 searches the preset voiceprint feature library and judges whether any of the voiceprint features of the sounds matches a voiceprint feature in the library. Specifically, the voiceprint feature of each obtained sound is matched in turn against each voiceprint feature in the preset voiceprint feature library. If the library contains a voiceprint feature that matches the voiceprint feature of a certain sound, the target determination module 602 determines the matching voiceprint feature as the target voiceprint feature of the caller, and the amplitude determination module 603 determines the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs.
In this embodiment of the present invention, the adjusting module 504 includes:
A difference determination module 604, configured to determine the amplitude difference between the voice amplitude and a preset threshold;
A first search module 605, configured to search the preset mapping relationship between differences and adjustment parameter tables, and determine the adjustment parameter table corresponding to the amplitude difference;
A second search module 606, configured to search the adjustment parameter table corresponding to the amplitude difference and determine the target loudness value and target frequency value corresponding to the distance, where the adjustment parameter table contains a mapping relationship between distance, loudness value, and frequency value;
A target adjustment module 607, configured to adjust the loudness value and frequency value of the sound to which the target voiceprint feature belongs according to the target loudness value and the target frequency value.
The target adjustment module 607 includes:
A first extraction module 608, configured to extract from the voice signal the sound to which the target voiceprint feature belongs, as the target voice signal;
A data adjustment module 609, configured to adjust the loudness value of the target voice signal to the target loudness value, and to adjust the frequency value of the target voice signal to the target frequency value.
In this embodiment of the present invention, after the voice amplitude of the target voiceprint feature and the distance between the caller and the mobile terminal have been obtained, the difference determination module 604 determines the amplitude difference between the voice amplitude and a preset threshold, where the preset threshold is used to control the degree to which the sound is adjusted.
The amplitude difference is the parameter that determines the adjustment. Specifically, a mapping relationship between differences and adjustment parameter tables is preset in the mobile terminal, so that different differences use different adjustment parameter tables. Each adjustment parameter table contains a mapping relationship between distance, loudness value, and frequency value.
After the first search module 605 finds the adjustment parameter table corresponding to the amplitude difference, the second search module 606 searches that table to determine the target loudness value and target frequency value corresponding to the distance.
Further, the target adjustment module 607 adjusts the loudness value and frequency value of the sound to which the target voiceprint feature belongs according to the target loudness value and target frequency value. Specifically, the first extraction module 608 extracts from the collected voice signal the sound to which the target voiceprint feature belongs, as the target voice signal; the data adjustment module 609 adjusts the loudness value of the target voice signal to the target loudness value, and adjusts the frequency value of the target voice signal to the target frequency value.
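The loudness half of what the data adjustment module 609 does can be sketched as a gain change. The sketch assumes that the loudness value is the RMS level of the target voice signal in decibels relative to full scale, which is only one possible reading of "loudness value"; the names `rms_db` and `adjust_loudness` are hypothetical. The frequency adjustment is left out because the text does not say how the frequency value of a signal is changed.

```python
import math


def rms_db(samples):
    """RMS level in dB relative to full scale (-inf for silence or no samples)."""
    if not samples:
        return float("-inf")
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20.0 * math.log10(rms) if rms > 0.0 else float("-inf")


def adjust_loudness(samples, target_db):
    """Scale the target voice signal so that its RMS level equals target_db."""
    current_db = rms_db(samples)
    if current_db == float("-inf"):
        return list(samples)  # silence cannot be scaled to a target level
    gain = 10.0 ** ((target_db - current_db) / 20.0)
    return [s * gain for s in samples]
```

For example, a signal at -20 dB moved to a target of -14 dB is multiplied by a gain of about 2, since a 6 dB increase roughly doubles the amplitude.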
In this embodiment of the present invention, the preset voiceprint feature library makes it possible, once the voiceprint feature of each sound in the voice signal has been obtained, to match those features against the library and obtain the target voiceprint feature. The preset mapping relationship between differences and adjustment parameter tables, together with the preset adjustment parameter tables, makes it possible to use the difference between the voice amplitude of the target voiceprint feature and the preset threshold to determine an adjustment parameter table, and then to use the distance to look up the target loudness value and target frequency value in that table, so that the sound to which the target voiceprint feature belongs is adjusted at a fine granularity. Because the adjustment is applied to the sound in the voice signal to which the target voiceprint feature belongs, this approach, compared with AGC adaptive gain adjustment, effectively avoids amplifying ambient noise, improves call quality, and improves the user experience.
Referring to Fig. 7, which is a schematic diagram of the program modules of the voice signal adaptive adjusting apparatus in the sixth embodiment of the present invention, the apparatus includes the collection and acquisition module 501, the parsing and acquisition module 502, the identification and determination module 503, and the adjusting module 504 of the fourth embodiment, which are similar to the content described in the fourth embodiment and are not repeated here.
It can be understood that the sixth embodiment is described on the basis of the fourth embodiment; the sixth embodiment may also be described on the basis of the fifth embodiment.
In this embodiment of the present invention, the apparatus further includes:
A second extraction module 701, configured to extract from the voice signal the sounds to which the voiceprint features other than the target voiceprint feature belong, to obtain an interference voice signal;
A noise reduction module 702, configured to perform noise reduction processing on the interference voice signal.
In this embodiment of the present invention, after the sound to which the target voiceprint feature belongs has been adjusted, the other sounds may also be adjusted in order to further improve call quality. Specifically, the second extraction module 701 extracts from the voice signal the sounds to which the voiceprint features other than the target voiceprint feature belong, to obtain an interference voice signal. For example, if the voice signal contains the caller's voice and the sound of a television playing an advertisement, the caller's voice is the sound to which the target voiceprint feature belongs, and the sound of the television playing the advertisement is extracted from the voice signal as the interference voice signal. Further, the noise reduction module 702 performs noise reduction processing on the interference voice signal, so that after the adjusted voice signal is sent to the call partner at the other end, the useful signal (that is, the caller's voice) heard by the call partner is clearer and at a suitable volume, while the invalid signal (that is, the interference voice signal) is weaker.
The noise reduction processing may be performed in a variety of ways, such as noise-gate noise reduction, sampling noise reduction, and filtering noise reduction.
In this embodiment of the present invention, after the sound in the voice signal to which the target voiceprint feature belongs has been adjusted, noise reduction processing is additionally performed on the interference voice signal in the voice signal, which further improves call quality and improves the call experience.
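The combined effect of modules 701 and 702 can be sketched under a strong simplifying assumption: that the sounds have already been separated into one sample sequence per source, keyed by a source identifier. The function `suppress_interference`, the fixed `attenuation` gain standing in for the noise reduction step, and the final remixing into a single outgoing signal are all assumptions not stated in the text.

```python
def suppress_interference(sources, target_id, attenuation=0.2):
    """Remix separated sources, keeping the target and weakening the rest.

    sources: dict mapping a source id to a list of samples (equal lengths).
    target_id: id of the sound to which the target voiceprint feature belongs.
    attenuation: gain applied to every interfering source.
    """
    length = len(sources[target_id])
    mixed = [0.0] * length
    for source_id, samples in sources.items():
        gain = 1.0 if source_id == target_id else attenuation
        for i, sample in enumerate(samples):
            mixed[i] += gain * sample
    return mixed
```

With a caller and a television as the two sources, the television's contribution to the outgoing signal is scaled down while the caller's voice passes at full level.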
An embodiment of the present invention further provides a mobile terminal, including a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the computer program, it implements the steps of the voice signal adaptive adjustment method in any one of the first to third embodiments.
An embodiment of the present invention further provides a storage medium, which may specifically be a computer-readable storage medium, on which a computer program is stored. When the computer program is executed by a processor, it implements the steps of the voice signal adaptive adjustment method in any one of the first to third embodiments.
In the several embodiments provided in this application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative; for instance, the division into modules is only a division by logical function, and other divisions are possible in an actual implementation: multiple modules or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be indirect coupling or communication connection between apparatuses or modules through some interfaces, and may be electrical, mechanical, or in other forms.
Modules described as separate components may or may not be physically separate, and components shown as modules may or may not be physical modules; that is, they may be located in one place or distributed over multiple network modules. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional modules in the embodiments of the present invention may be integrated into one processing module, each module may exist physically on its own, or two or more modules may be integrated into one module. The integrated module may be implemented in the form of hardware or in the form of a software functional module.
If the integrated module is implemented in the form of a software functional module and is sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods of the embodiments of the present invention. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), a magnetic disk, or an optical disc.
It should be noted that, for simplicity of description, each of the foregoing method embodiments is expressed as a series of combined actions. However, those skilled in the art should know that the present invention is not limited by the described order of actions, because according to the present invention some steps may be performed in other orders or simultaneously. In addition, those skilled in the art should also know that the embodiments described in this specification are preferred embodiments, and the actions and modules involved are not necessarily required by the present invention.
In the above embodiments, the description of each embodiment has its own emphasis. For a part not described in detail in one embodiment, refer to the related descriptions of the other embodiments.
The above is a description of the voice signal adaptive adjustment method, apparatus, mobile terminal, and storage medium provided by the present invention. For those skilled in the art, the specific implementation and the scope of application may vary according to the ideas of the embodiments of the present invention. In summary, the content of this specification should not be construed as limiting the present invention.
Claims (12)
1. A voice signal adaptive adjustment method, characterized in that the method comprises:
when a mobile terminal is in hands-free call mode, collecting the voice signal in the environment in real time, and obtaining the distance between the mobile terminal and a caller in real time;
parsing the voice signal to obtain the voiceprint feature of each sound from a different source in the voice signal;
identifying, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller, and determining the voice amplitude, in the voice signal, of the sound to which the target voiceprint feature belongs;
adjusting, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs.
2. The method according to claim 1, characterized in that the identifying, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller comprises:
searching a preset voiceprint feature library, and judging whether any of the voiceprint features of the sounds matches a voiceprint feature in the voiceprint feature library;
if a matching voiceprint feature exists, determining the matching voiceprint feature as the target voiceprint feature of the caller.
3. The method according to claim 1, characterized in that the adjusting, according to the voice amplitude and the distance, the loudness value and frequency value of the sound in the voice signal to which the target voiceprint feature belongs comprises:
determining the amplitude difference between the voice amplitude and a preset threshold;
searching a preset mapping relationship between differences and adjustment parameter tables, and determining the adjustment parameter table corresponding to the amplitude difference;
searching the adjustment parameter table corresponding to the amplitude difference, and determining the target loudness value and target frequency value corresponding to the distance, the adjustment parameter table containing a mapping relationship between distance, loudness value, and frequency value;
adjusting the loudness value and frequency value of the sound to which the target voiceprint feature belongs according to the target loudness value and the target frequency value.
4. The method according to claim 3, characterized in that adjusting the loudness value and frequency value of the sound to which the target voiceprint feature belongs according to the target loudness value and the target frequency value comprises:
Extracting, from the voice signal, the sound to which the target voiceprint feature belongs, as a target voice signal;
Adjusting the loudness value of the target voice signal to the target loudness value, and adjusting the frequency value of the target voice signal to the target frequency value.
5. The method according to any one of claims 1 to 4, characterized in that the method further comprises:
Extracting, from the voice signal, the sounds to which the voiceprint features other than the target voiceprint feature belong, to obtain an interference voice signal;
Performing noise reduction processing on the interference voice signal.
6. A voice signal adaptive adjustment device, characterized in that the device comprises:
A collection and acquisition module, configured to, when the mobile terminal is in hands-free call mode, collect the voice signal in the environment in real time and obtain the distance between the mobile terminal and the caller in real time;
A parsing and acquisition module, configured to parse the voice signal to obtain the voiceprint feature of each sound from a different source in the voice signal;
An identification and determination module, configured to identify, among the voiceprint features of the sounds, the target voiceprint feature belonging to the caller, and to determine the voice amplitude, within the voice signal, of the sound to which the target voiceprint feature belongs;
An adjustment module, configured to adjust, according to the voice amplitude and the distance, the loudness value and frequency value of the sound to which the target voiceprint feature belongs in the voice signal.
7. The device according to claim 6, characterized in that the identification and determination module comprises:
A search and judgment module, configured to search a preset voiceprint feature library and judge whether any of the voiceprint features of the sounds matches a voiceprint feature in the voiceprint feature library;
A target determination module, configured to, if a matching voiceprint feature exists, determine the matching voiceprint feature as the target voiceprint feature of the caller;
An amplitude determination module, configured to determine the voice amplitude, within the voice signal, of the sound to which the target voiceprint feature belongs.
8. The device according to claim 6, characterized in that the adjustment module comprises:
A difference determination module, configured to determine the amplitude difference between the voice amplitude and a preset threshold;
A first search module, configured to search a preset mapping between differences and adjustment parameter tables to determine the adjustment parameter table corresponding to the amplitude difference;
A second search module, configured to search the adjustment parameter table corresponding to the amplitude difference to determine the target loudness value and target frequency value corresponding to the distance, wherein the adjustment parameter table contains the mapping between distance, loudness value and frequency value;
A target adjustment module, configured to adjust the loudness value and frequency value of the sound to which the target voiceprint feature belongs according to the target loudness value and the target frequency value.
9. The device according to claim 8, characterized in that the target adjustment module comprises:
A first extraction module, configured to extract, from the voice signal, the sound to which the target voiceprint feature belongs, as a target voice signal;
A data adjustment module, configured to adjust the loudness value of the target voice signal to the target loudness value and to adjust the frequency value of the target voice signal to the target frequency value.
10. The device according to any one of claims 6 to 9, characterized in that the device further comprises:
A second extraction module, configured to extract, from the voice signal, the sounds to which the voiceprint features other than the target voiceprint feature belong, to obtain an interference voice signal;
A noise reduction module, configured to perform noise reduction processing on the interference voice signal.
11. A mobile terminal, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that, when the processor executes the computer program, each step of the voice signal adaptive adjustment method according to any one of claims 1 to 5 is implemented.
12. A storage medium, the storage medium being a computer-readable storage medium on which a computer program is stored, characterized in that, when the computer program is executed by a processor, each step of the voice signal adaptive adjustment method according to any one of claims 1 to 5 is implemented.
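The lookup steps recited in claims 2 and 3 can be illustrated with a minimal Python sketch. It is not the applicant's implementation. The claims do not say how voiceprint features are compared or what the tables contain, so the cosine-similarity test, `MATCH_THRESHOLD`, `PRESET_THRESHOLD`, the feature vectors, and every table value below are hypothetical assumptions chosen only for illustration.

```python
import math

# Hypothetical preset voiceprint feature library (claim 2): name -> feature vector.
VOICEPRINT_LIBRARY = {"owner": [0.9, 0.1, 0.3]}
MATCH_THRESHOLD = 0.95   # assumed cosine-similarity cut-off for a "match"
PRESET_THRESHOLD = 60.0  # assumed preset amplitude threshold (claim 3)

# Hypothetical mapping from amplitude difference to an adjustment parameter table.
# Each entry: (upper bound of the difference, table), where a table row is
# (maximum distance in metres, target loudness value, target frequency value).
ADJUSTMENT_TABLES = [
    (0.0, [(0.5, 70.0, 1000.0), (2.0, 75.0, 1200.0)]),       # voice at or below threshold
    (math.inf, [(0.5, 60.0, 1000.0), (2.0, 65.0, 1100.0)]),  # voice above threshold
]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def find_target(features):
    """Return the index of the first sound whose feature matches the library, else None."""
    for index, feature in enumerate(features):
        if any(cosine(feature, ref) >= MATCH_THRESHOLD
               for ref in VOICEPRINT_LIBRARY.values()):
            return index
    return None


def lookup_targets(voice_amplitude, distance):
    """Two-level lookup: amplitude difference -> table, then distance -> targets."""
    difference = voice_amplitude - PRESET_THRESHOLD
    table = next(t for upper, t in ADJUSTMENT_TABLES if difference <= upper)
    for max_distance, loudness, frequency in table:
        if distance <= max_distance:
            return loudness, frequency
    # Assumption: beyond the last distance band, reuse the last row.
    return table[-1][1], table[-1][2]
```

In this sketch, `find_target` would be called on the features produced by parsing the voice signal, and `lookup_targets` on the caller's voice amplitude and the measured distance. Keeping the tables as ordered bands is only one possible design; the claims require a mapping, not any particular data structure.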
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710599150.2A CN107172255A (en) | 2017-07-21 | 2017-07-21 | Voice signal self-adapting regulation method, device, mobile terminal and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710599150.2A CN107172255A (en) | 2017-07-21 | 2017-07-21 | Voice signal self-adapting regulation method, device, mobile terminal and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107172255A (en) | 2017-09-15 |
Family ID: 59817249
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710599150.2A Pending CN107172255A (en) | 2017-07-21 | 2017-07-21 | Voice signal self-adapting regulation method, device, mobile terminal and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107172255A (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107682553A (en) * | 2017-10-10 | 2018-02-09 | 广东欧珀移动通信有限公司 | Speech signal sending method, device, mobile terminal and storage medium |
CN107819964A (en) * | 2017-11-10 | 2018-03-20 | 广东欧珀移动通信有限公司 | Improve method, apparatus, terminal and the computer-readable recording medium of speech quality |
CN108446091A (en) * | 2018-02-26 | 2018-08-24 | 浙江创课教育科技有限公司 | Language play back system based on noise measuring |
CN109272996A (en) * | 2018-11-09 | 2019-01-25 | 广州长嘉电子有限公司 | A kind of noise-reduction method and system |
WO2019033438A1 (en) * | 2017-08-18 | 2019-02-21 | 广东欧珀移动通信有限公司 | Audio signal adjustment method and device, storage medium, and terminal |
CN110186171A (en) * | 2019-05-30 | 2019-08-30 | 广东美的制冷设备有限公司 | Air conditioner and its control method and computer readable storage medium |
CN110225195A (en) * | 2019-05-30 | 2019-09-10 | 维沃移动通信有限公司 | A kind of audio communication method and terminal |
CN110706688A (en) * | 2019-11-11 | 2020-01-17 | 广州国音智能科技有限公司 | Method, system, terminal and readable storage medium for constructing voice recognition model |
CN112820307A (en) * | 2020-02-19 | 2021-05-18 | 腾讯科技(深圳)有限公司 | Voice message processing method, device, equipment and medium |
CN113132193A (en) * | 2021-04-13 | 2021-07-16 | Oppo广东移动通信有限公司 | Control method and device of intelligent device, electronic device and storage medium |
CN113986187A (en) * | 2018-12-28 | 2022-01-28 | 阿波罗智联(北京)科技有限公司 | Method and device for acquiring range amplitude, electronic equipment and storage medium |
CN115052070A (en) * | 2022-06-24 | 2022-09-13 | 歌尔股份有限公司 | Method and device for adjusting call volume, call equipment and medium |
CN116319071A (en) * | 2023-05-11 | 2023-06-23 | 深圳奥联信息安全技术有限公司 | Voiceprint password authentication method and system |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101022682A (en) * | 2006-02-13 | 2007-08-22 | 明基电通股份有限公司 | Method for adjusting gain value of sound signal in gain adjustment system and audio system |
CN202920470U (en) * | 2012-11-02 | 2013-05-08 | 姜鸿彦 | Sound compounding play device |
CN103211600A (en) * | 2013-04-27 | 2013-07-24 | 江苏贝泰福医疗科技有限公司 | Hearing diagnosis and treatment device |
JP5301037B2 (en) * | 2010-06-28 | 2013-09-25 | 三菱電機株式会社 | Voice recognition device |
CN103514884A (en) * | 2012-06-26 | 2014-01-15 | 华为终端有限公司 | Communication voice denoising method and terminal |
CN103841241A (en) * | 2012-11-21 | 2014-06-04 | 联想(北京)有限公司 | Volume adjusting method and apparatus |
CN104021632A (en) * | 2014-06-11 | 2014-09-03 | 张慧燕 | Voice interaction processing method based on situations, welcoming device and welcoming system |
CN106569773A (en) * | 2016-10-31 | 2017-04-19 | 努比亚技术有限公司 | Terminal and voice interaction processing method |
- 2017-07-21: CN application CN201710599150.2A, publication CN107172255A (en), status: Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101022682A (en) * | 2006-02-13 | 2007-08-22 | 明基电通股份有限公司 | Method for adjusting gain value of sound signal in gain adjustment system and audio system |
JP5301037B2 (en) * | 2010-06-28 | 2013-09-25 | 三菱電機株式会社 | Voice recognition device |
CN103514884A (en) * | 2012-06-26 | 2014-01-15 | 华为终端有限公司 | Communication voice denoising method and terminal |
CN202920470U (en) * | 2012-11-02 | 2013-05-08 | 姜鸿彦 | Sound compounding play device |
CN103841241A (en) * | 2012-11-21 | 2014-06-04 | 联想(北京)有限公司 | Volume adjusting method and apparatus |
CN103211600A (en) * | 2013-04-27 | 2013-07-24 | 江苏贝泰福医疗科技有限公司 | Hearing diagnosis and treatment device |
CN104021632A (en) * | 2014-06-11 | 2014-09-03 | 张慧燕 | Voice interaction processing method based on situations, welcoming device and welcoming system |
CN106569773A (en) * | 2016-10-31 | 2017-04-19 | 努比亚技术有限公司 | Terminal and voice interaction processing method |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019033438A1 (en) * | 2017-08-18 | 2019-02-21 | 广东欧珀移动通信有限公司 | Audio signal adjustment method and device, storage medium, and terminal |
US11251763B2 (en) | 2017-08-18 | 2022-02-15 | Guangdong Oppo Mobile Telecommunications Corp., Ltd. | Audio signal adjustment method, storage medium, and terminal |
CN107682553A (en) * | 2017-10-10 | 2018-02-09 | 广东欧珀移动通信有限公司 | Speech signal sending method, device, mobile terminal and storage medium |
CN107819964A (en) * | 2017-11-10 | 2018-03-20 | 广东欧珀移动通信有限公司 | Improve method, apparatus, terminal and the computer-readable recording medium of speech quality |
CN108446091B (en) * | 2018-02-26 | 2021-03-23 | 浙江创课教育科技有限公司 | Voice playing system based on noise detection |
CN108446091A (en) * | 2018-02-26 | 2018-08-24 | 浙江创课教育科技有限公司 | Language play back system based on noise measuring |
CN109272996A (en) * | 2018-11-09 | 2019-01-25 | 广州长嘉电子有限公司 | A kind of noise-reduction method and system |
CN109272996B (en) * | 2018-11-09 | 2021-11-30 | 广州长嘉电子有限公司 | Noise reduction method and system |
CN113986187A (en) * | 2018-12-28 | 2022-01-28 | 阿波罗智联(北京)科技有限公司 | Method and device for acquiring range amplitude, electronic equipment and storage medium |
CN113986187B (en) * | 2018-12-28 | 2024-05-17 | 阿波罗智联(北京)科技有限公司 | Audio region amplitude acquisition method and device, electronic equipment and storage medium |
CN110186171A (en) * | 2019-05-30 | 2019-08-30 | 广东美的制冷设备有限公司 | Air conditioner and its control method and computer readable storage medium |
CN110225195A (en) * | 2019-05-30 | 2019-09-10 | 维沃移动通信有限公司 | A kind of audio communication method and terminal |
CN110706688A (en) * | 2019-11-11 | 2020-01-17 | 广州国音智能科技有限公司 | Method, system, terminal and readable storage medium for constructing voice recognition model |
CN110706688B (en) * | 2019-11-11 | 2022-06-17 | 广州国音智能科技有限公司 | Method, system, terminal and readable storage medium for constructing voice recognition model |
CN112820307B (en) * | 2020-02-19 | 2023-12-15 | 腾讯科技(深圳)有限公司 | Voice message processing method, device, equipment and medium |
CN112820307A (en) * | 2020-02-19 | 2021-05-18 | 腾讯科技(深圳)有限公司 | Voice message processing method, device, equipment and medium |
CN113132193A (en) * | 2021-04-13 | 2021-07-16 | Oppo广东移动通信有限公司 | Control method and device of intelligent device, electronic device and storage medium |
CN115052070A (en) * | 2022-06-24 | 2022-09-13 | 歌尔股份有限公司 | Method and device for adjusting call volume, call equipment and medium |
CN116319071B (en) * | 2023-05-11 | 2023-08-25 | 深圳奥联信息安全技术有限公司 | Voiceprint password authentication method and system |
CN116319071A (en) * | 2023-05-11 | 2023-06-23 | 深圳奥联信息安全技术有限公司 | Voiceprint password authentication method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107172255A (en) | Voice signal self-adapting regulation method, device, mobile terminal and storage medium | |
CN107172256A (en) | Earphone call self-adapting regulation method, device, mobile terminal and storage medium | |
US9685161B2 (en) | Method for updating voiceprint feature model and terminal | |
CN101668085B (en) | Method for regulating voice output of mobile terminal and mobile terminal | |
CN108156291A (en) | Speech signal collection method, apparatus, electronic equipment and readable storage medium storing program for executing | |
CN107395883A (en) | Voice signal adjusting method, communication terminal and computer-readable recording medium | |
CN107172313A (en) | Improve method, device, mobile terminal and the storage medium of hand-free call quality | |
CN106657528A (en) | Incoming call management method and device | |
EP4191579A1 (en) | Electronic device and speech recognition method therefor, and medium | |
CN107170457A (en) | Age recognition methods, device and terminal | |
CN107682553A (en) | Speech signal sending method, device, mobile terminal and storage medium | |
CN112735388B (en) | Network model training method, voice recognition processing method and related equipment | |
CN105744609B (en) | Improve the method and device of mobile terminal power consumption | |
CN107948055A (en) | Shielding group members send out the method, apparatus and computer-readable recording medium of message | |
CN105930084A (en) | Screenshot picture editing method and apparatus | |
CN107222629A (en) | Call processing method and related product | |
CN101753657B (en) | Method and device for reducing call noise | |
CN104769966A (en) | Receiver device | |
US20230396913A1 (en) | Secure actuation with in-ear electronic device | |
CN108154886A (en) | Noise suppressing method and device, electronic device and computer readable storage medium | |
CN108364346A (en) | Build the method, apparatus and computer readable storage medium of three-dimensional face model | |
CN111787149A (en) | Noise reduction processing method, system and computer storage medium | |
CN108170347A (en) | A kind of image processing method, mobile terminal and computer readable storage medium | |
CN108900706B (en) | Call voice adjustment method and mobile terminal | |
CN105912114A (en) | Method and device for adjusting vibration grade of mobile terminal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20170915 |